Screen 10,000 Rows

Screening a dataset of about 10,000 rows means evaluating every row individually against a natural-language criterion. This case study screens 9,949 FDA product recalls to find those relevant to a child born on 2021-08-01; each row requires reasoning about whether the recalled product would plausibly be used for a child of that age.

Metric	Value
Rows processed	9,949
Rows passing	2,271 (22.8%)
Total cost	$37.13
Time	11.8 minutes
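The headline figures can be cross-checked with a few lines of arithmetic. The sketch below uses only numbers reported on this page (the 707-second duration comes from the progress log in the Claude Code walkthrough); the variable names are illustrative.

```python
# Figures reported in this case study.
rows_processed = 9_949
rows_passing = 2_271
total_cost_usd = 37.13
elapsed_seconds = 707

pass_rate = rows_passing / rows_processed            # fraction of rows kept
cost_per_row = total_cost_usd / rows_processed       # average USD per row
minutes = elapsed_seconds / 60                       # wall-clock minutes
rows_per_second = rows_processed / elapsed_seconds   # average throughput

print(f"pass rate:    {pass_rate:.1%}")          # 22.8%
print(f"cost per row: ${cost_per_row:.4f}")      # $0.0037
print(f"time:         {minutes:.1f} min")        # 11.8 min
print(f"throughput:   {rows_per_second:.1f} rows/s")
```

These are averages over the whole run; rows escalated to the second pass presumably cost more than rows settled in the first.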

Add FutureSearch to Claude Code if you haven't already:

claude mcp add futuresearch --scope project --transport http https://mcp.futuresearch.ai/mcp

With the FDA recalls CSV in your working directory, tell Claude:

Screen this FDA product recalls dataset to find recalls of products that I might
have used for my child born on 2021-08-01.

Claude calls FutureSearch's classify MCP tool. At this scale the two-pass pipeline is critical: a fast first pass triages all 9,949 rows, then a careful second pass re-evaluates borderline cases with a stronger model. The tool calls look like this:

Tool: futuresearch_classify
├─ task: "Find recalls of products that I might have used for my child born on 2021-08-01."
├─ input_csv: "/Users/you/fda_product_recalls.csv"
└─ categories: ["yes", "no"]

→ Submitted: 9,949 rows for classification.
  Session: https://futuresearch.ai/sessions/310fc823-0adc-402c-bff1-7dc43fda2636
  Task ID: 310f...

Tool: futuresearch_progress
├─ task_id: "310f..."
→ Running: 0/9949 complete (30s elapsed)

...

Tool: futuresearch_progress
→ Completed: 9949/9949 (0 failed) in 707s.

Tool: futuresearch_results
├─ task_id: "310f..."
├─ output_path: "/Users/you/child_relevant_recalls.csv"
→ Saved 2271 rows to /Users/you/child_relevant_recalls.csv

2,271 of 9,949 recalls are relevant. View the session.

Add the FutureSearch connector if you haven't already. Then upload the FDA product recalls CSV and ask Claude:

Screen this FDA product recalls dataset to find recalls of products that I might have used for my child born on 2021-08-01.

Go to futuresearch.ai/app, upload the FDA product recalls CSV, and enter:

Screen this FDA product recalls dataset to find recalls of products that I might have used for my child born on 2021-08-01.

Screening is binary classification: set categories=["yes", "no"] and only passing rows are returned.

pip install futuresearch
export FUTURESEARCH_API_KEY=your_key_here  # Get one at futuresearch.ai/app/api-key

import asyncio
import pandas as pd
from futuresearch import create_session
from futuresearch.ops import classify

# Load the recalls and parse the classification date; unparseable values become NaT.
fda_recalls = pd.read_csv("fda_product_recalls.csv")
fda_recalls["center_classification_date"] = pd.to_datetime(
    fda_recalls["center_classification_date"], errors="coerce"
)
# Keep only recalls classified after the child's birth date (NaT rows are dropped too).
fda_recalls = fda_recalls[
    fda_recalls["center_classification_date"] > pd.Timestamp("2021-08-01")
]

async def main():
    # Open a named session for this screening run.
    async with create_session(name="FDA Recall Screening") as session:
        # Binary screen: each row is labeled "yes" or "no" against the task.
        result = await classify(
            task="Find recalls of products that I might have used for my child born on 2021-08-01.",
            categories=["yes", "no"],
            input=fda_recalls,
        )
        return result.data

results = asyncio.run(main())
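The date pre-filter in the script above is worth checking on a toy frame before spending money on a full run. This is a minimal sketch using only pandas; the four rows and their product names are made up, and only the column name center_classification_date is taken from the script.

```python
import pandas as pd

# Toy stand-in for the recalls CSV; the product names are made up.
toy = pd.DataFrame({
    "product": ["A", "B", "C", "D"],
    "center_classification_date": [
        "2021-07-15",   # before the birth date
        "2021-08-01",   # exactly the birth date
        "2022-03-10",   # after the birth date
        "not a date",   # malformed value
    ],
})

# errors="coerce" converts unparseable values to NaT instead of raising.
toy["center_classification_date"] = pd.to_datetime(
    toy["center_classification_date"], errors="coerce"
)

# Strict ">" excludes the birth date itself, and any comparison with NaT is
# False, so rows with malformed dates are dropped as well.
kept = toy[toy["center_classification_date"] > pd.Timestamp("2021-08-01")]
print(kept["product"].tolist())
```

Only product C survives. If recalls classified on the birth date itself, or rows with missing dates, should be screened too, the comparison would need ">=" or an explicit isna() branch.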

Results

Sample passing recalls (products a child could have been exposed to):

Product	Firm	Why Relevant
White Hot Dog Enriched Buns	Perfection Bakeries	Food item, child eating solids by recall date
ExactaMed Oral Dispenser	Baxter Healthcare	Medical device used for infant medication
Chickenless Crispy Tenders	Dr. Praeger's	Food item for toddler-age child

Sample non-passing recalls (correctly excluded):

Product	Why Excluded
Lase Discectomy Device Kit	Back surgery device, not for children
Heparin/Lidocaine irrigation	Clinical use only

The two-pass pipeline uses a fast model for initial triage and a stronger model for borderline cases, keeping accuracy high while controlling cost. This run cost $37.13 for 9,949 rows, or about $0.0037 per row, and the cost scales linearly with the number of rows.
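The general two-pass idea can be sketched in a few lines. This is an illustration of the pattern only, not FutureSearch's implementation: the function two_pass_screen, the confidence threshold, and the keyword-based toy_fast and toy_strong stand-ins are all hypothetical.

```python
from typing import Callable, List, Tuple

# Hypothetical interfaces: the fast pass returns (label, confidence in [0, 1]);
# the strong pass returns a label only.
FastModel = Callable[[str], Tuple[str, float]]
StrongModel = Callable[[str], str]


def two_pass_screen(
    rows: List[str],
    fast: FastModel,
    strong: StrongModel,
    threshold: float = 0.8,
) -> Tuple[List[str], int]:
    """Label every row; escalate low-confidence rows to the strong model.

    Returns the labels and the number of rows that were escalated.
    """
    labels: List[str] = []
    escalated = 0
    for row in rows:
        label, confidence = fast(row)
        if confidence < threshold:
            # Borderline case: re-evaluate with the stronger, costlier model.
            label = strong(row)
            escalated += 1
        labels.append(label)
    return labels, escalated


# Toy stand-ins for real models (keyword rules, for illustration only).
def toy_fast(row: str) -> Tuple[str, float]:
    text = row.lower()
    if "food" in text or "infant" in text:
        return "yes", 0.95
    if "surgical" in text:
        return "no", 0.95
    return "no", 0.5  # unsure, so this row is escalated


def toy_strong(row: str) -> str:
    return "yes" if "oral dispenser" in row.lower() else "no"


rows = [
    "Hot dog buns (food)",
    "Surgical discectomy kit",
    "Oral dispenser",
    "Heparin irrigation",
]
labels, escalated = two_pass_screen(rows, toy_fast, toy_strong)
passing = [row for row, label in zip(rows, labels) if label == "yes"]
print(passing, escalated)
```

Here two of the four rows are escalated and two pass. The threshold is the cost lever in this pattern: raising it sends more rows to the stronger model, which costs more and should catch more of the first pass's mistakes.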
