Latest developments from the SGIWorld ecosystem
Self-evolving research agent framework released! MarkScientist features a three-agent workflow (Proposer → Solver → Reviewer) with a JudgeBuddy system for scenario-aware evaluation. It ships with 15 research scenarios and 12 reviewer personas with built-in taste learning.
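As a rough sketch of how a Proposer → Solver → Reviewer loop could be wired up (the class names, toy outputs, and accept/revise stopping rule below are illustrative assumptions, not MarkScientist's actual API):

```python
# Minimal sketch of a Proposer -> Solver -> Reviewer loop.
# All names and the accept/revise protocol are assumptions for
# illustration; see the MarkScientist repo for the real interfaces.
from dataclasses import dataclass

@dataclass
class Review:
    score: float   # reviewer's quality score in [0, 1]
    feedback: str  # natural-language critique fed back to the solver

class Proposer:
    def propose(self, scenario: str) -> str:
        return f"Hypothesis for scenario: {scenario}"

class Solver:
    def solve(self, hypothesis: str, feedback: str = "") -> str:
        return f"Solution to '{hypothesis}' (addressing: {feedback or 'n/a'})"

class Reviewer:
    def review(self, solution: str) -> Review:
        return Review(score=0.9, feedback="Tighten the experimental controls.")

def run_workflow(scenario: str, threshold: float = 0.8, max_rounds: int = 3) -> str:
    proposer, solver, reviewer = Proposer(), Solver(), Reviewer()
    hypothesis = proposer.propose(scenario)
    feedback = ""
    for _ in range(max_rounds):
        solution = solver.solve(hypothesis, feedback)
        review = reviewer.review(solution)
        if review.score >= threshold:  # reviewer accepts: stop iterating
            return solution
        feedback = review.feedback     # otherwise revise with the critique
    return solution

print(run_workflow("protein folding under thermal stress"))
```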
End-to-end auto-research evaluation benchmark launched! ResearchClawBench measures AI agents' ability to conduct complete research workflows, from literature review and hypothesis generation through experimental execution to paper writing. Now accepting task submissions from all research domains.
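For illustration, a submitted task covering those stages might be declared along these lines; every field name below is a hypothetical assumption, so consult the ResearchClawBench submission guidelines for the actual schema:

```python
# Hypothetical ResearchClawBench task definition; field names are
# illustrative assumptions, not the benchmark's real submission format.
task = {
    "task_id": "bio-lit-review-001",
    "domain": "biology",
    "stages": [
        "literature_review",
        "hypothesis_generation",
        "experimental_execution",
        "paper_writing",
    ],
    "inputs": {"topic": "CRISPR off-target effects"},
    "rubric": {"per_stage_scores": True, "max_score": 100},
}
```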
SGI-Bench leaderboard now features 30+ models! The Scientific General Intelligence Benchmark continues to evaluate frontier models across multi-disciplinary scientific tasks. New model submissions welcome.
SGI-Bench also serves as a unified evaluation toolkit and leaderboard for rigorously assessing the scientific intelligence of LLMs and VLMs, covering 7 core capability dimensions across 6 scientific disciplines. Now integrated with OpenCompass for standardized evaluation.
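To make the 7 × 6 scoring grid concrete, here is a minimal sketch of how a leaderboard entry might average per-dimension, per-discipline accuracies into one number; the dimension and discipline names and the unweighted-mean aggregation are illustrative assumptions, not SGI-Bench's actual scheme:

```python
# Hypothetical leaderboard aggregation over a capability-by-discipline grid.
# Dimension/discipline names and the plain-mean aggregation are assumptions;
# SGI-Bench's real scoring scheme may differ.
from statistics import mean

DIMENSIONS = ["knowledge", "reasoning", "calculation", "perception",
              "experiment_design", "data_analysis", "literature_use"]  # 7 assumed
DISCIPLINES = ["physics", "chemistry", "biology",
               "materials", "earth_science", "astronomy"]              # 6 assumed

def overall_score(cell_accuracy: dict[tuple[str, str], float]) -> float:
    """Average accuracy over every (dimension, discipline) cell."""
    return mean(cell_accuracy[(dim, disc)]
                for dim in DIMENSIONS for disc in DISCIPLINES)

# Toy example: a model that scores 0.5 on every cell.
scores = {(d, s): 0.5 for d in DIMENSIONS for s in DISCIPLINES}
print(f"overall: {overall_score(scores):.3f}")  # -> 0.500
```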
SFE dataset now available on Hugging Face! Comprehensive evaluation of frontier scientific knowledge across physics, chemistry, biology, materials science, and more. Designed to probe the cutting edge of AI scientific understanding.
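Once you have the official repository id, loading the data is a one-liner with the Hugging Face `datasets` library; the repo id and split name below are placeholder assumptions, so substitute the ids from the release announcement:

```python
# Minimal loading sketch; the repository id and split name are
# placeholder assumptions -- use the official SFE identifiers instead.
from datasets import load_dataset

sfe = load_dataset("InternScience/SFE")  # hypothetical repo id
print(sfe)             # available splits and sizes
print(sfe["test"][0])  # inspect one example (split name assumed)
```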
Follow InternScience on GitHub for the latest announcements, releases, and research updates.