A Large-Scale Evolvable Dataset for Model Context Protocol Ecosystem and Security AnalysisZhiwei Lin, Bonan Ruan, Jiahao Liu, Weibo Zhaohttps://arxiv.org/abs/2506.23474
A Large-Scale Evolvable Dataset for Model Context Protocol Ecosystem and Security AnalysisThe Model Context Protocol (MCP) has recently emerged as a standardized interface for connecting language models with external tools and data. As the ecosystem rapidly expands, the lack of a structured, comprehensive view of existing MCP artifacts presents challenges for research. To bridge this gap, we introduce MCPCorpus, a large-scale dataset containing around 14K MCP servers and 300 MCP clients. Each artifact is annotated with 20+ normalized attributes capturing its identity, interface conf…