Close Menu
  • Home
  • AI
  • Education
  • Entertainment
  • Food Health
  • Health
  • Sports
  • Tech
  • Well Being

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

What's Hot

The DeepMind trio who built a poker AI, are now making money for quant hedge funds

June 30, 2026

Google introduces a faster, cheaper image generator with Nano Banana 2 Lite

June 30, 2026

Amarillo’s AI Assistant Emma Saves $1.8M, Gains 90% Approval

June 30, 2026
Facebook X (Twitter) Instagram
  • Home
  • About Us
  • Advertise With Us
  • Contact us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
Facebook X (Twitter) Instagram
IQ Times Media – Smart News for a Smarter YouIQ Times Media – Smart News for a Smarter You
  • Home
  • AI
  • Education
  • Entertainment
  • Food Health
  • Health
  • Sports
  • Tech
  • Well Being
IQ Times Media – Smart News for a Smarter YouIQ Times Media – Smart News for a Smarter You
Home » Machine Learning Accelerates Ancestry’s Record Digitization
Tech

Machine Learning Accelerates Ancestry’s Record Digitization

IQ TIMES MEDIABy IQ TIMES MEDIAJune 30, 2026No Comments5 Mins Read
Facebook Twitter Pinterest LinkedIn Tumblr Email
Share
Facebook Twitter LinkedIn Pinterest Email


Over the past 42 years, Ancestry has collected more than 71 billion birth certificates, marriage licenses, and other family records from 88 countries to build 148 million family trees.

For most of the genealogy company’s history, collecting, labeling, and organizing that data was a painstakingly slow process. Ancestry’s employees and third-party vendors would spend months manually inputting data and transcribing international family records. International expansion began in 2001 when the company launched a website in the UK, but adding additional markets came at a high cost, said Sriram Thiagarajan, Ancestry’s chief technology officer since November 2020.

“The cost for the time taken to digitize this rich content from around the world was the limiting factor for us,” Thiagarajan told Business Insider.

Most of Ancestry’s AI efforts have been led by Thiagarajan, who joined the company as chief information officer in September 2017; His expanded role came a month before the investment firm Blackstone closed on its $4.7 billion acquisition of Ancestry.

Since then, Ancestry’s investments in machine learning and artificial intelligence, along with advancements in traditional and generative AI, have accelerated the digitization process, Thiagarajan said. AI has also made way for new user tools, including AI-powered facial and handwriting recognition technologies, he added.

Timeline of key events in the use of AI at Ancestry

Training AI models

Back in 2003, Jackson Reese, a data scientist and software research engineer, was recruited by a friend to join Ancestry as its head of digital imaging in preservation services. At the time, Ancestry had a one-person imaging department digitizing census data, birth and death records, immigration forms, and other historical records, Reese told Business Insider. He was initially hired to bring the company’s imaging operations in-house and he also expanded its digital imaging team to more than 70 employees within three years.

Reese told Business Insider that the expanded team worked with now-outdated technologies such as microfilm scanners, which convert government archives and newspaper clippings into digital files.

Beginning in 2014, Ancestry leaned into early AI projects focused on developing Ancestry’s own proprietary machine learning models and computer vision systems to build algorithms that could read paper documents, Reese said. This early work continued through 2016.

His team then worked with BERT, a family of natural language processing models that Google released in October 2018, to build more precise data extraction tools. Previously, when the Ancestry team received millions of new birth records, domain experts would review the documents then pass them to indexers, who transcribed and labeled them, said Reese.

Ancestry would then train its proprietary AI models on this data, with the hope that, after a few rounds of back-and-forth between domain experts, indexers, and data scientists, the AI model’s accuracy would be north of 90%.

“That was the best-case scenario; sometimes we’d be at eight, 10, or a dozen iterations to really get a model dialed in,” Reese said.

By 2019, Ancestry incorporated BERT-based models to more quickly process its obituary collection and other record-extraction initiatives. The company also kept employees in the loop to continue to validate the training data and ensure the models were effectively handling the records, said Reese.

The ChatGPT tipping point

Thiagarajan said that the advent of ChatGPT in November 2022 was another “tipping point in terms of figuring out the art of possibilities.” New large language models from OpenAI, Anthropic, and other AI hyperscalers opened up the possibility of speeding up the digitization of unstructured data, including user-generated images, scanned documents, and written stories, he said.

Now that AI is capable of performing record extraction more quickly and accurately, Ancestry can ingest birth records and other data, apply a blend of proprietary models from OpenAI, Google, and Anthropic as well as open-source AI models, and “fine-tune them a little bit for our use cases,” Reese said. He added that the company can process nearly 200 different languages with very little iterative model training.

By September 2023, Ancestry was using LLMs for user-facing features too, said Thiagarajan. Face Match, an AI-powered facial recognition tool that helps users identify people in their family photos, debuted in July 2024.

An example of Ancestry's handwritten note transcription tool, which is powered by AI.

An example of Ancestry’s handwritten note transcription tool, which is powered by AI. 

Courtesy of Ancestry



In April 2025, the company unveiled a document transcription feature that lets customers upload scans of JPG and PNG files and generate transcriptions of their families’ handwritten notes. Ancestry’s AI Stories, launched in December 2025, allows customers to click on an ancestor’s page within the company’s database and hear a narrated audio story of their life, read by AI.

The outcome

By the end of 2025, more than 50% of Ancestry’s historical records published on the website were generated using AI, said Thiagarajan. AI has helped triple the rate of content growth from 800 million records in 2021, to 5.2 billion new records in 2022, and 18.6 billion the following year, according to the company.

Ancestry has also continued to launch external AI use cases, including in June 2026, when it added language translation to its customer-facing document transcription tool, Thiagarajan said.



Source link

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
IQ TIMES MEDIA
  • Website

Related Posts

Amarillo’s AI Assistant Emma Saves $1.8M, Gains 90% Approval

June 30, 2026

Mark Zuckerberg Doesn’t Buy the Anxiety About AI Job Displacement

June 30, 2026

OpenAI Teases New Device for AI Coders That Looks Like a Mini Keyboard

June 30, 2026
Add A Comment
Leave A Reply Cancel Reply

Editors Picks

Carnegie Foundation list of 2026 ‘Great Immigrants, Great Americans’

June 30, 2026

Nursing degrees gain professional designation, for now, after court ruling

June 29, 2026

Texas school board to vote on required Bible readings

June 26, 2026

Judge blocks part of Trump’s student loan caps for graduate programs

June 25, 2026
Education

Carnegie Foundation list of 2026 ‘Great Immigrants, Great Americans’

By IQ TIMES MEDIAJune 30, 20260

The Andrew Carnegie Foundation announced its annual list of “Great Immigrants, Great Americans” on Tuesday.…

Nursing degrees gain professional designation, for now, after court ruling

June 29, 2026

Texas school board to vote on required Bible readings

June 26, 2026

Judge blocks part of Trump’s student loan caps for graduate programs

June 25, 2026
IQ Times Media – Smart News for a Smarter You
Facebook X (Twitter) Instagram Pinterest Vimeo YouTube
  • Home
  • About Us
  • Advertise With Us
  • Contact us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
© 2026 iqtimes. Designed by iqtimes.

Type above and press Enter to search. Press Esc to cancel.