Artificial Intelligence

Meet Google PaperBanana: The AI That Draws Your Research for You

California: Google AI and Peking University researchers unveiled PaperBanana Thursday, a multi-agent framework automating academic diagram generation for research papers.

The system addresses a critical bottleneck in scientific publishing. While AI scientists handle literature reviews and code, visualizing complex discoveries remains labor-intensive.

PaperBanana orchestrates five specialized agents across two phases. The Linear Planning Phase deploys Retriever, Planner, and Stylist agents. The Iterative Refinement Phase uses Visualizer and Critic agents across three improvement rounds.

The Retriever Agent identifies 10 relevant reference examples from databases. The Planner Agent translates technical methodology text into detailed figure descriptions. The Stylist Agent ensures outputs match conference aesthetics like the “NeurIPS Look”.

The Visualizer Agent generates visuals using Nano-Banana-Pro for diagrams. For statistical plots, it writes executable Python Matplotlib code. The Critic Agent inspects images against source text, identifying factual errors or visual glitches.

Researchers introduced PaperBananaBench, a dataset of 292 test cases from NeurIPS 2025 publications. PaperBanana outperformed baselines by 17% overall score, 37.2% conciseness improvement, 12.9% readability gains, and 6.6% aesthetics enhancement.

The system excels in Agent and Reasoning diagrams, achieving 69.9% overall scores. For statistical plots, code-based generation ensures 100% data fidelity versus image models prone to numerical hallucinations.

Just as Google’s AI-generated news alerts raised concerns about AI content reliability, PaperBanana ensures data fidelity and accuracy in scientific visualizations by using code-based generation for statistical plots, eliminating risks of numerical hallucinations.

Domain-specific aesthetic preferences vary significantly. Agent and Reasoning papers favor illustrative 2D vector robots and chat bubbles. Computer Vision research uses camera cones and point clouds. Generative Learning employs 3D cuboids for tensors. Theory papers maintain minimalist grayscale palettes.

Similar to how OpenAI’s ChatGPT Prism simplifies the writing and editing processes for research papers, PaperBanana automates the creation of research diagrams, transforming labor-intensive tasks into AI-powered workflows.

The framework is available on GitHub with full documentation.

Anurag Shukla

Anurag Shukla is a Senior Journalist with over two decades of experience across television, digital, and print media. He has worked with leading national news organisations and has also served as a Research Officer in the Prime Minister’s Office (PMO), contributing to media research and policy-level content. A former journalism academic, Anurag brings strong editorial depth and a keen understanding of how technology, governance, and society intersect at Tea4Tech.

Recent Posts

Amazon Pledges Fresh $13 Bn to Scale Up AI, Cloud Infrastructure in India

New Delhi: Amazon has announced a fresh $13 billion investment in India focused on expanding…

2 days ago

Sakana AI Launches Fugu to Orchestrate Frontier Models

TOKYO: Tokyo-based AI startup Sakana AI has introduced two new products, Fugu and Fugu Ultra,…

3 days ago

Meta Invests $900 Mn in CRED, Gets Kunal Shah as WhatsApp Global Head

New Delhi: In a major leadership shake-up, Meta has appointed Kunal Shah, the founder of…

4 days ago

Odyssey Raises $310 Million Series B to Scale Its AI World Models

PALO ALTO, Calif.: Odyssey, an AI lab focused on building general-purpose AI world models, has…

4 days ago

AI Inference Startup Baseten Targets $13B Valuation in $1.5B Round

SAN FRANCISCO: Baseten is closing in on a massive $1.5 billion funding round at a…

5 days ago

Prem AI Eyes $100M Series A for Self-Hosted Enterprise AI Stack

LUGANO, Switzerland: Prem AI, a Swiss startup building a self-hosted enterprise AI platform, is looking…

5 days ago