0
Skip to Content
Tatta Bio
Home
About
Projects
Blog
Careers
Tatta Bio
Home
About
Projects
Blog
Careers
Home
About
Projects
Blog
Careers
Today's sequence data infrastructure is set up for failure in the age of AI.
Yunha Hwang 6/5/25 Yunha Hwang 6/5/25

Today's sequence data infrastructure is set up for failure in the age of AI.

Read More
Gaia Agent: Context-Aware Functional Insights at Scale
Yunha Hwang 12/17/24 Yunha Hwang 12/17/24

Gaia Agent: Context-Aware Functional Insights at Scale

An AI biologist discovers previously uncharacterized systems in the Mtb genome.

Read More
Introducing Gaia: Context-Aware Protein Search Across Genomic Datasets
Yunha Hwang 11/18/24 Yunha Hwang 11/18/24

Introducing Gaia: Context-Aware Protein Search Across Genomic Datasets

Gaia is an embedding-based search engine for sequences.

Read More
gLM2: The First Mixed-Modality Genomic Language Model
Yunha Hwang 8/15/24 Yunha Hwang 8/15/24

gLM2: The First Mixed-Modality Genomic Language Model

Read More
The OMG Dataset: the CommonCrawl of Biological Sequences
Yunha Hwang 8/15/24 Yunha Hwang 8/15/24

The OMG Dataset: the CommonCrawl of Biological Sequences

Read More
Introducing DGEB: the Diverse Genomic Embedding Benchmark
Yunha Hwang 7/12/24 Yunha Hwang 7/12/24

Introducing DGEB: the Diverse Genomic Embedding Benchmark

Read More