Several million Americans may have some form of rare genetic disease. When Shayla Haddock was born in 1997, she had unusual facial features. She had club feet and shorter-than-normal limbs. She was smaller than most newborns. Hearing tests showed she was deaf.
As her parents, Cheryl and Levko Siloti, searched for answers about her condition, they worried: Had some preventable event during Cheryl's pregnancy caused Shayla's symptoms? Could identifying her diagnosis improve her treatment options? If Shayla's siblings wanted to become parents someday, would their children be at risk for the same illness?
"It was kind of an emotional roller coaster," Cheryl Siloti said. Over the years, doctors suggested many diagnoses for Shayla, but medical tests repeatedly disproved their theories. "We would get these possibilities and then hear 'Nope, that's not the answer.'"
As much as Shayla's parents longed for a diagnosis, they almost didn't get one. On August 10, 2012 - only two weeks after Shayla's doctors at Lucile Packard Children's Hospital Stanford concluded that they could not match her genetic patterns and symptoms to a disease - a scientific report about a newly discovered link between a genetic defect and a rare disease was published that would have allowed them to diagnose her. But at the time, genetic-testing results were not routinely re-analyzed to take into account new knowledge. The family and doctors remained unaware that the answer was out there.
In 2015, as part of a scientific study, Shayla's parents agreed to have her genome re-analyzed. This time, Stanford computer scientists used new computational tools they had developed to compare Shayla's gene sequences to the scientific literature. They found the 2012 scientific report and predicted that Shayla had a rare genetic disease called Wiedemann-Steiner syndrome, which her doctors confirmed.
"With each passing month, more of the world's genetic diversity is represented in scientific databases, and each time more information is there, it's easier to interpret the next thing you see," said Jon Bernstein, Shayla's clinical geneticist at Packard Children's and an author of the new report, which was published online in Genetics in Medicine.
10% of the patients in the study - four individuals, including Shayla, out of 40 who did not receive diagnoses after their first genetic analysis - were diagnosed with various rare diseases based on recent discoveries, even though the initial analyses had been conducted an average of only 20 months earlier.
These "near misses" highlight a big challenge in the realm of precision health: Although the speed, cost and effort involved in obtaining individuals' genetic sequences has dropped dramatically in recent years, it still requires about 20 to 40 hours of work by trained experts to match a patient's rare mutations to information in the scientific literature that might reveal a diagnosis. Among patients suspected of having a rare genetic disease, 75% aren't diagnosed the first time they have their DNA analyzed. And yet the knowledge base is growing fast. Each year, researchers discover the cause of about 250 genetic diseases and also find 9,200 links between specific gene variants and known diseases.
"Our study demonstrates that reanalysis of patients' gene-testing results is useful because there's a steady rate of discovery," said Bernstein, who is also an associate professor of pediatrics at the School of Medicine.
"But there is no way we'll have enough manpower to continue to do all the analysis manually, as clinicians and scientists have done in the past," said Gill Bejerano, senior author of the study and associate professor of developmental biology, of computer science and of pediatrics.
Bejerano led the computer scientists who devised the automated approach used in the new research. Bejerano said, "Rather than continuing to invest dozens of hours in each patient's analysis, our team thought it made more sense to spend that time building computer science tools that can do much of the work for us," he said.
In the new study, the scientists tested whether automated comparisons between undiagnosed patients' genomes and existing gene databases could accelerate diagnosis. The approach worked.
"The genome is ultimately a programming language," Bejerano said. "We really would like to use machine learning and other approaches to build computer systems that leave as little as possible work for the human expert. A computer is going to be weaker than a human at doing this, but we think we can take the process 80 to 90% of the way by computer and provide a huge time savings for the human in the loop."
Another key finding from the new research, according to Bernstein and Bejerano, is that comparing patients' gene sequences to those of their parents greatly speeds the diagnostic process. Such comparisons help turn up new disease-causing mutations that occurred in the patients but are not present in their parents. "These things stand out more easily if you have the parents' data in front you," Bernstein said.
In Shayla's case, her diagnosis brought her family the answers they'd long been seeking. She doesn't share her disease-causing mutation with her parents; instead, it occurred spontaneously in her. It wasn't preventable, nor is there any expectation it would affect her siblings' children. "It really relieves a lot of worry to know that," Siloti said.
The diagnosis also has helped the Silotis find other families whose children have the same diagnosis. They share stories on a Facebook group and feel they've found a new sense of support and community. "We've always believed that knowledge is power," Siloti said. "It is wonderful to have some answers, especially after such a long search."