More stories

  • in

    100 kilometers of quantum-encrypted transfer

    Researchers at DTU have successfully distributed a quantum-secure key using a method called Continuous Variable Quantum Key Distribution (CV QKD). The researchers have managed to make the method work over a record 100 km distance — the longest distance ever achieved using the CV QKD method. The advantage of the method is that it can be applied to the existing Internet infrastructure.
    Quantum computers threaten existing algorithm-based encryptions, which currently secure data transfers against eavesdropping and surveillance. They are not yet powerful enough to break them, but it’s a matter of time. If a quantum computer succeeds in figuring out the most secure algorithms, it leaves an open door to all data connected via the internet. This has accelerated the development of a new encryption method based on the principles of quantum physics.
    But to succeed, researchers must overcome one of the challenges of quantum mechanics — ensuring consistency over longer distances. Continuous Variable Quantum Key Distribution has so far worked best over short distances.
    “We have achieved a wide range of improvements, especially regarding the loss of photons along the way. In this experiment, published in Science Advances, we securely distributed a quantum-encrypted key 100 kilometres via fibre optic cable. This is a record distance with this method,” says Tobias Gehring, an associate professor at DTU, who, together with a group of researchers at DTU, aims to be able to distribute quantum-encrypted information around the world via the internet.
    Secret keys from quantum states of light
    “When data needs to be sent from A to B, it must be protected. Encryption combines data with a secure key distributed between sender and receiver so both can access the data. A third party must not be able to figure out the key while it is being transmitted; otherwise, the encryption will be compromised. Key exchange is, therefore, essential in encrypting data.
    Quantum Key Distribution (QKD) is an advanced technology that researchers are working on for crucial exchanges. The technology ensures the exchange of cryptographic keys by using light from quantum mechanical particles called photons.

    When a sender sends information encoded in photons, the quantum mechanical properties of the photons are exploited to create a unique key for the sender and receiver. Attempts by others to measure or observe photons in a quantum state will instantly change their state. Therefore, it is physically only possible to measure light by disturbing the signal.
    “It is impossible to make a copy of a quantum state, as when making a copy of an A4 sheet — if you try, it will be an inferior copy. That’s what ensures that it is not possible to copy the key. This can protect critical infrastructure such as health records and the financial sector from being hacked,” explains Tobias Gehring.
    Works via existing infrastructure
    The Continuous Variable Quantum Key Distribution (CV QKD) technology can be integrated into the existing internet infrastructure.
    “The advantage of using this technology is that we can build a system that resembles what optical communication already relies on.”
    The backbone of the internet is optical communication. It works by sending data via infrared light running through optical fibres. They function as light guides laid in cables, ensuring we can send data worldwide. Data can be sent faster and over longer distances via fibre optic cables, and light signals are less susceptible to interference, which is called noise in technical terms.

    “It is a standard technology that has been used for a long time. So, you don’t need to invent anything new to be able to use it to distribute quantum keys, and it can make implementation significantly cheaper. And we can operate at room temperature,” explains Tobias Gehring, adding:
    “But CV QKD technology works best over shorter distances. Our task is to increase the distance. And the 100 kilometres is a big step in the right direction.”
    Noise, Errors, and Assistance from Machine Learning
    The researchers succeeded in increasing the distance by addressing three factors that limit their system in exchanging the quantum-encrypted keys over longer distances:
    Machine learning provided earlier measurements of the disturbances affecting the system. Noise, as these disturbances are called, can arise, for example, from electromagnetic radiation, which can distort or destroy the quantum states being transmitted. The earlier detection of the noise made it possible to reduce its corresponding effect more effectively.
    Furthermore, the researchers have become better at correcting errors that can occur along the way, which can be caused by noise, interference, or imperfections in the hardware.
    “In our upcoming work, we will use the technology to establish a secure communication network between Danish ministries to secure their communication. We will also attempt to generate secret keys between, for example, Copenhagen and Odense to enable companies with branches in both cities to establish quantum-safe communication,” Tobias Gehring says.
    Facts:
    We don’t exactly know what happens — yet.
    Quantum Key Distribution was developed as a concept in 1984 by Bennett and Brassard, while the Canadian physicist and computer pioneer Artur Ekert and his colleagues carried out the first practical implementation of QKD in 1992. Their contribution has been crucial for developing modern QKD protocols, a set of rules, procedures, or conventions that determine how a device should perform a task.
    Quantum Key Distribution (QKD) is based on a fundamental uncertainty in copying photons in a quantum state. Photons are the quantum mechanical particles that light consists of.
    Photons in a quantum state carry a fundamental uncertainty, meaning it is not possible with certainty to know whether the photon is one or several photons collected in the given state, also called coherent photons. This prevents a hacker from measuring the number of photons, making it impossible to make an exact copy of a state.
    They also carry a fundamental randomness because photons are in multiple states simultaneously, also called superposition. The superposition of photons collapses into a random state when the measurement occurs. This makes it impossible to measure precisely which phase they are in while in superposition.
    Together, it becomes nearly impossible for a hacker to copy a key without introducing errors, and the system will know if a hacker is trying to break in and can shut down immediately. In other words, it becomes impossible for a hacker to first steal the key and then to avoid the door locking as he tries to put the key in the lock.
    Continuous Variable Quantum Key Distribution (CV QKD) focuses on measuring the smooth properties of quantum states in photons. It can be compared to conveying information in a stream of all the nuances of colours instead of conveying information step by step in each colour.
    Facts:
    The Innovation Fund Denmark, the Danish National Research Foundation, the European Union’s Horizon Europe research and innovation program, the Carlsberg Foundation, and the Czech Science Foundation support the project.
    The research group comprises Adnan A.E. Hajomer, Nitin Jain, Hou-Man Chin, Ivan Derkach, Ulrik L. Andersen, and Tobias Gehring.
    The Danish Quantum Communication Infrastructure (QCI.DK) targets the first deployment of Danish quantum communication technologies in a versatile network supporting real-life Quantum Key Distribution applications. More

  • in

    I spy with my speedy eye — scientists discover speed of visual perception ranges widely in humans

    Using a blink-and-you’ll-miss-it experiment, researchers from Trinity College Dublin have discovered that individuals differ widely in the rate at which they perceive visual signals. Some people perceive a rapidly changing visual cue at frequencies that others cannot, which means some access more visual information per timeframe than others.
    This discovery suggests some people have an innate advantage in certain settings where response time is crucial, such as in ball sports, or in competitive gaming.
    The rate with which we perceive the world is known as our “temporal resolution,” and in many ways it is similar to the refresh rate of a computer monitor.
    The researchers, from the Department of Zoology in the School of Natural Sciences and the Trinity College Institute of Neuroscience, found that there is considerable variation among people in their temporal resolution, meaning some people effectively see more “images per second” than others.
    To quantify this, the scientists used the “critical flicker fusion threshold,” a measure for the maximum frequency at which an individual can perceive a flickering light source.
    If the light source flickers above a person’s threshold, they will not be able to see that it is flickering, and instead see the light as steady. Some participants in the experiment indicated they saw the light as completely still when it was in fact flashing about 35 times per second, while others were still able to perceive the flashing at rates of over 60 times per second.
    Clinton Haarlem, PhD Candidate in the School of Natural Sciences, is the first author of the article that has just been published in leading journal PLOS ONE. He said: “We also measured temporal resolution on multiple occasions in the same participants and found that even though there is significant variation among individuals, the trait appears to be quite stable over time within individuals.”
    Though our visual temporal resolution is quite stable from day to day in general, a post-hoc analysis did suggest that there may be slightly more variation over time within females than within males.

    “We don’t yet know how this variation in visual temporal resolution might affect our day-to-day lives, but we believe that individual differences in perception speed might become apparent in high-speed situations where one might need to locate or track fast-moving objects, such as in ball sports, or in situations where visual scenes change rapidly, such as in competitive gaming,” added Clinton Haarlem.
    “This suggests that some people may have an advantage over others before they have even picked up a racquet and hit a tennis ball, or grabbed a controller and jumped into some fantasy world online.”
    Andrew Jackson, Professor in Zoology in Trinity’s School of Natural Sciences, said: “What I think is really interesting about this project is how a zoologist, a geneticist and a psychologist can all find different angles to this work. For me as a zoologist the consequences of variation in visual perception likely has profound implications for how predators and prey interact, with various arms-races existing for investment in brain processing power and clever strategies to exploit weaknesses in one’s enemy.”
    Kevin Mitchell, Associate Professor in Developmental Neurobiology in Trinity’s School of Genetics and Microbiology, and the Trinity College Institute of Neuroscience, said: “Because we only have access to our own subjective experience, we might naively expect that everyone else perceives the world in the same way we do. Examples like colour blindness show that isn’t always true, but there are many less well known ways that perception can vary too. This study characterises one such difference — in the ‘frame rate”‘ of our visual systems. Some people really do seem to see the world faster than others.” More

  • in

    Study uses artificial intelligence to show how personality influences the expression of our genes

    An international study led by the UGR using artificial intelligence has shown that our personalities alter the expression of our genes. The findings shed new light on the long-standing mystery of how the mind and body interact.
    The study, published in the journal Molecular Psychiatry (Nature), examines how an individual’s personality and underlying outlook on life regulate their gene expression, and thus affect their health and well-being. It is the first study to measure the transcription of the entire genome in relation to human personality.
    The multi- and interdisciplinary study was led by researchers from the Andalusian Interuniversity Research Institute in Data Science and Computational Intelligence (DaSCI), the UGR’s Department of Computer Science and Artificial Intelligence, and the Biohealth Research Institute in Granada (ibs.GRANADA). It was carried out in collaboration with Professor Robert Cloninger (Washington University in St. Louis), researchers from Baylor College of Medicine (Texas, USA) and the Young Finns Study (Finland).
    The international research team (made up of specialists in genetics, medicine, psychology and computer science) used data from the Young Finns Study, an extensive study conducted in the general population of Finland over four decades during which relevant information was collected on participants’ health, physical condition and lifestyle. In addition, participants were subjected to extensive personality assessments that addressed both temperament (habits and emotional reactivity) and character (conscious goals and values). The results showed that certain outlooks on life are conducive to a healthy, fulfilling and long life, while others lead to a stressful, unhealthy and short life.
    The study analysed the regulation of gene expression in these individuals, taking into account three levels of self-awareness that were measured through their combined temperament and character profiles. These levels were designated “unregulated” — individuals dominated by irrational emotions and habits associated with their traditions and obedience to authority, “organised” — self-sufficient individuals capable of intentionally regulating their habits and cooperating with others for mutual benefit, and lastly, “creative” — self-transcendent individuals who adapt their habits to live in harmony with others, with nature or with the universe, even if this requires occasional personal sacrifices.
    Two key findings
    As UGR researcher and co-lead author of the study Coral del Val explains: “In our research we made two key discoveries about the expression and organisation of genes according to the personality profiles of these individuals. First, we discovered a network of 4,000 genes that clustered into multiple modules that were expressed in specific regions of the brain. Some of these genes had already been linked in previous studies to the inheritance of human personality. Second, we discovered that the modules formed a functional interaction network capable of orchestrating changes in gene expression in order to adapt to varying internal and external conditions. The modules turned on and off in a flexible manner, facilitating adaptation to the everyday challenges we all face, and choreographing our development.”
    The researchers showed that the changes in the patterns of interaction between these modules were orchestrated by two sub-networks. One network regulated emotional reactivity (anxiety, fear, etc.), while the other regulated what a person perceives as meaningful (e.g. production of concepts and language). “What’s most remarkable is the fact that the networks for emotion and meaning are coordinated by a control centre made up of six genes,” notes Elisa Díaz de la Guardia-Bolívar, the other co-lead author of the study. “It is particularly interesting that we found that the six genes of the control hub are highly preserved throughout evolution, from single-celled organisms to modern humans. This finding confirms their beneficial role in regulating the functioning of all forms of life on Earth,” she adds.

    Identifying these gene networks and the control hub regulating gene expression in humans has practical value because it shows how people can improve the quality of their health, happiness and overall quality of daily life, despite the challenges and stresses we all face.
    The UGR’s Igor Zwir explains: “In previous research, we found significant differences in well-being between people in the three personality groups, depending on their level of self-awareness. Specifically, those with greater self-awareness (the creative group) reported greater well-being compared to the organised and unregulated groups. We have now shown that these levels of self-awareness are also strongly associated with the regulation of gene expression in the same order (creative > organised > unregulated). This suggests that a person can improve their health and well-being by cultivating a more self-transcendent and creative outlook on life.”
    However, he cautions that it remains to be confirmed whether the regulation of gene expression through interventions that enhance self-awareness is the mediating factor in the association between self-awareness and well-being. Nevertheless, treatments that promote greater self-transcendence and mindfulness have also been shown to contribute to improvements in all aspects of health, including physical, mental, social and spiritual well-being. It is therefore plausible that the regulation of gene expression is the real mediator in this association.
    As the researchers predicted, certain types of genes, such as transcription factors, microRNAs and long non-coding RNAs, showed extensive enrichment in the 4000-gene integrated molecular network. However, the most significant enrichment was observed in a group of RNAs that are thought to have played a crucial role in the origin of cellular life. These RNAs have the ability to form membraneless compartments and carry out chemical reactions, allowing them to adapt rapidly to stress. This process, known as liquid-liquid phase separation (LLPS), creates a comprehensive bioreactor in which the chemicals that are essential for life can be synthesised.
    “We are delighted to discover the important roles of different types of genes in health and personality. It is amazing to see that evolution has preserved genes that are thought to have been important in the origin of life, allowing for the increasing plasticity, complexity and consciousness that we observe in humans. The innovative computational methods used in this project enable us to study complex biological systems in humans in an ethical, non-intrusive and beneficial way, with the aim of understanding how to live healthily,” says Professor Cloninger. He adds: “These findings clearly demonstrate that a person’s mind and body are deeply interconnected. Each influences the other, so they are not separate. It is important to recognise that our future well-being is not entirely determined by our past or present conditions; rather, we can cultivate our own well-being in a creative process full of open-ended possibilities.” More

  • in

    Cellphone compass can measure tiny concentrations of compounds important for human health

    Nearly every modern cellphone has a built-in compass, or magnetometer, that detects the direction of Earth’s magnetic field, providing critical information for navigation. Now a team of researchers at the National Institute of Standards and Technology (NIST) has developed a technique that uses an ordinary cellphone magnetometer for an entirely different purpose — to measure the concentration of glucose, a marker for diabetes, to high accuracy.
    The same technique, which uses the magnetometer in conjunction with magnetic materials designed to change their shape in response to biological or environmental cues, could be used to rapidly and cheaply measure a host of other biomedical properties for monitoring or diagnosing human disease. The method also has the potential to detect environmental toxins, said NIST scientist Gary Zabow.
    In their proof-of-concept study, Zabow and fellow NIST researcher Mark Ferris clamped to a cellphone a tiny well containing the solution to be tested and a strip of hydrogel — a porous material that swells when immersed in water. The researchers embedded tiny magnetic particles within the hydrogel, which they had engineered to react either to the presence of glucose or to pH levels (a measure of acidity) by expanding or contracting. Changing pH levels can be associated with a variety of biological disorders.
    As the hydrogels enlarged or shrunk, they moved the magnetic particles closer to or farther from the cellphone’s magnetometer, which detected the corresponding changes in the strength of the magnetic field. Employing this strategy, the researchers measured glucose concentrations as small as a few millionths of a mole (the scientific unit for a certain number of atoms or molecules in a substance). Although such high sensitivity is not required for at-home monitoring of glucose levels using a drop of blood, it might in the future enable routine testing for glucose in saliva, which contains a much smaller concentration of the sugar.
    The researchers reported their findings in the March 30, 2024 edition of Nature Communications.
    Engineered, or “smart,” hydrogels like the ones the NIST team employed are inexpensive and relatively easy to fabricate, Ferris said, and can be tailored to react to a host of different compounds that medical researchers may want to measure. In their experiments, he and Zabow stacked single layers of two different hydrogels, each of which contracted and expanded at different rates in response to pH or glucose. These bilayers amplified the motion of the hydrogels, making it easier for the magnetometer to track changes in magnetic field strength.
    Because the technique does not require any electronics or power source beyond that of the cellphone nor call for any special processing of the sample, it offers an inexpensive way to conduct testing — even in locations with relatively few resources.

    Future efforts to improve the accuracy of such measurements using cellphone magnetometers might allow detection of DNA strands, specific proteins and histamines — compounds involved in the body’s immune response — at concentrations as low as a few tens of nanomoles (billionths of a mole).
    That improvement could have substantial benefit. For instance, measuring histamines, which are typically detected in urine at concentrations ranging from about 45 to 190 nanomoles, would ordinarily require a 24-hour urine collection and a sophisticated laboratory analysis.
    “An at-home test using a cellphone magnetometer sensitive to nanomolar concentrations would allow measurements to be done with much less hassle,” said Ferris. More generally, enhanced sensitivity would be essential when only a small amount of a substance is available for testing in extremely dilute quantities, Zabow added.
    Similarly, the team’s study suggests that a cellphone magnetometer can measure pH levels with the same sensitivity as a thousand-dollar benchtop meter but at a fraction of the cost. A home-brewer or a baker could use the magnetometer to quickly test the pH of various liquids to perfect their craft, and an environmental scientist could measure the pH of ground water samples on-site with higher accuracy than a litmus test strip could provide.
    In order to make the cellphone measurements a commercial success, engineers will need to develop a method to mass produce the hydrogel test strips and ensure that they have a long shelf life, Zabow said. Ideally, he added, the hydrogel strips should be designed to react more quickly to environmental cues in order to speed up measurements. More

  • in

    Physics-based predictive tool will speed up battery and superconductor research

    From lithium-ion batteries to next-generation superconductors, the functionality of many modern, advanced technologies depends on the physical property known as intercalation. Unfortunately, it’s difficult to identify in advance which of the many possible intercalated materials are stable, which necessitates a lot of trial-and-error lab work in product development.
    Now, in a study recently published in ACS Physical Chemistry Au, researchers from the Institute of Industrial Science, The University of Tokyo, and collaborating partners have devised a straightforward equation that correctly predicts the stability of intercalated materials. The systematic design guidelines enabled by this work will speed up the development of upcoming high-performance electronics and energy-storage devices.
    To appreciate the research team’s achievement, we need to understand the context of this research. Intercalation is the reversible insertion of guests (atoms or molecules) into hosts (for example, 2D-layered materials). The purpose of intercalation is commonly to modify the host’s properties or structure for improved device performance, as seen in, for example, commercial lithium-ion batteries. Although many synthetic methods are available for preparing intercalated materials, researchers have had no reliable means of predicting which host-guest combinations are stable. Therefore, much lab work has been needed to devise new intercalated materials for imparting next-generation device functionalities. Minimizing this lab work by proposing a straightforward predictive tool for host-guest stability was the goal of the research team’s study.
    “We are the first to develop accurate predictive tools for host-guest intercalation energies, and the stability of intercalated compounds,” explains Naoto Kawaguchi, lead author of the study. “Our analysis, based on a database of 9,000 compounds, uses straightforward principles from undergraduate first-year chemistry.”
    A particular highlight of the work is that only two guest properties and eight host-derived descriptors were necessary for the researchers’ energy and stability calculations. In other words, initial ‘best guesses’ weren’t necessary; only the underlying physics of the host-guest systems. Furthermore, the researchers validated their model against nearly 200 sets of regression coefficients.
    “We’re excited because our regression model formulation is straightforward and physically reasonable,” says Teruyasu Mizoguchi, senior author. “Other computational models in the literature lack a physical basis or validation against unknown intercalated compounds.”
    This work is an important step forward in minimizing the laborious lab work that’s typically required to prepare intercalated materials. Given that many current and upcoming energy storage and electronic devices depend on such materials, the time and expense required for corresponding research and development will be minimized. Consequently, products with advanced functionalities will reach the market faster than what has been previously possible. More

  • in

    App may pave way to treatments for no. 1 dementia in under-60s

    UCSF-led research shows smartphone cognitive testing is comparable to gold-standard methods; may detect FTD in gene carriers before symptoms start.
    A smartphone app could enable greater participation in clinical trials for people with frontotemporal dementia (FTD), a devastating neurological disorder that often manifests in midlife.
    Research into the condition has been hampered by problems with early diagnosis and difficulty tracking how people are responding to treatments that are only likely to be effective at the early stages of disease.
    To address this, a research team led by UC San Francisco deployed cognitive tests through a mobile app and found it could detect early signs of FTD in people who were genetically predisposed to get the disease but had not yet developed symptoms. These tests were at least as sensitive as neuropsychological evaluations done in the clinic.
    The study appears in JAMA Network Open on April 1, 2024.
    More than 30 FTD clinical trials are underway or in the planning stages, including one that may become the first drug approved to slow progression in some gene carriers. Researchers hope the new mobile technology will hasten the work.
    “Eventually, the app may be used to monitor treatment effects, replacing many or most in-person visits to clinical trials’ sites,” said first author Adam Staffaroni, PhD, clinical neuropsychologist and associate professor in the UCSF Department of Neurology and the Weill Institute for Neurosciences.

    FTD is the No. 1 cause of dementia in patients under 60, with up to 30% of cases attributed to genetics. It has three main variants with symptoms that may overlap. The most common causes dramatic personality shifts, which may manifest as lack of empathy, apathy, impulsivity, compulsive eating, and socially and sexually inappropriate behavior. Another affects movement, and a third impacts speech, language and comprehension, which is the variant that Bruce Willis is reported to have. In rare cases, FTD triggers bursts of visual creativity.
    FTD is not easy to diagnose
    As with Alzheimer’s disease, patients with FTD are believed to be most responsive to treatment early on, ideally before their symptoms even emerge. “Most FTD patients are diagnosed relatively late in the disease, because they are young, and their symptoms are mistaken for psychiatric disorders,” said senior author Adam Boxer, MD, PhD, endowed professor in memory and aging at the UCSF Department of Neurology.
    “We’ve heard from families that they often suspect their loved one has FTD long before a physician agrees that is the diagnosis,” said Boxer, who is also director of the UCSF Alzheimer’s Disease and Frontotemporal Dementia Clinical Trials Program.
    The researchers tracked 360 participants with an average age of 54 enrolled in ongoing studies at ALLFTDcenters and UCSF. About 90% had data on disease stage. These included 60% who did not have FTD or were gene carriers who had not yet developed symptoms, 20% with early signs of the disease and 21% with symptoms.
    Software that can detect a waning ability to plan
    Staffaroni and Boxer collaborated with software company Datacubed Health, which developed the platform, to include tests of executive function, such as planning and prioritizing, filtering distractions and controlling impulses. In FTD, the part of the brain responsible for executive functioning shrinks as the disease progresses.

    The rich data collected by the app, including voice recordings and body movements, enabled the researchers to develop new tests that eventually could help with early diagnosis and monitoring of symptoms.
    “We developed the capability to record speech while participants engaged with several different tests,” said Staffaroni. “We also created tests of walking, balance and slowed movements, as well as different aspects of language.”
    FTD researchers say they are closer to finding treatments that may eventually slow the progression of the disease, which is fatal. These include gene and other therapies, such as antisense oligonucleotides (ASOs), to increase or decrease the production of proteins that are abnormal in gene carriers.
    Although there are currently no plans to make the app available to the public, it could be a boon to research.
    “A major barrier has been a lack of outcome measures that can be easily collected and are sensitive to treatment effects at early stages of the disease,” said Staffaroni. “We hope that smartphone assessments will facilitate new trials of promising therapies.” More

  • in

    Chatbot outperformed physicians in clinical reasoning in head-to-head study

    ChatGPT-4, an artificial intelligence program designed to understand and generate human-like text, outperformed internal medicine residents and attending physicians at two academic medical centers at processing medical data and demonstrating clinical reasoning. In a research letter published in JAMA Internal Medicine, physician-scientists at Beth Israel Deaconess Medical Center (BIDMC) compared a large language model’s (LLM) reasoning abilities directly against human performance using standards developed to assess physicians.
    “It became clear very early on that LLMs can make diagnoses, but anybody who practices medicine knows there’s a lot more to medicine than that,” said Adam Rodman MD, an internal medicine physician and investigator in the department of medicine at BIDMC. “There are multiple steps behind a diagnosis, so we wanted to evaluate whether LLMs are as good as physicians at doing that kind of clinical reasoning. It’s a surprising finding that these things are capable of showing the equivalent or better reasoning than people throughout the evolution of clinical case.”
    Rodman and colleagues used a previously validated tool developed to assess physicians’ clinical reasoning called the revised-IDEA (r-IDEA) score. The investigators recruited 21 attending physicians and 18 residents who each worked through one of 20 selected clinical cases comprised of four sequential stages of diagnostic reasoning. The authors instructed physicians to write out and justify their differential diagnoses at each stage. The chatbot GPT-4 was given a prompt with identical instructions and ran all 20 clinical cases. Their answers were then scored for clinical reasoning (r-IDEA score) and several other measures of reasoning.
    “The first stage is the triage data, when the patient tells you what’s bothering them and you obtain vital signs,” said lead author Stephanie Cabral, MD, a third-year internal medicine resident at BIDMC. “The second stage is the system review, when you obtain additional information from the patient. The third stage is the physical exam, and the fourth is diagnostic testing and imaging.”
    Rodman, Cabral and their colleagues found that the chatbot earned the highest r-IDEA scores, with a median score of 10 out of 10 for the LLM, 9 for attending physicians and 8 for residents. It was more of a draw between the humans and the bot when it came to diagnostic accuracy — how high up the correct diagnosis was on the list of diagnosis they provided — and correct clinical reasoning. But the bots were also “just plain wrong” — had more instances of incorrect reasoning in their answers — significantly more often than residents, the researchers found. The finding underscores the notion that AI will likely be most useful as a tool to augment, not replace, the human reasoning process.
    “Further studies are needed to determine how LLMs can best be integrated into clinical practice, but even now, they could be useful as a checkpoint, helping us make sure we don’t miss something,” Cabral said. “My ultimate hope is that AI will improve the patient-physician interaction by reducing some of the inefficiencies we currently have and allow us to focus more on the conversation we’re having with our patients.
    “Early studies suggested AI could makes diagnoses, if all the information was handed to it,” Rodman said. “What our study shows is that AI demonstrates real reasoning — maybe better reasoning than people through multiple steps of the process. We have a unique chance to improve the quality and experience of healthcare for patients.”
    Co-authors included Zahir Kanjee, MD, Philip Wilson, MD, and Byron Crowe, MD, of BIDMC; Daniel Restrepo, MD, of Massachusetts General Hospital; and Raja-Elie Abdulnour, MD, of Brigham and Women’s Hospital.
    This work was conducted with support from Harvard Catalyst | The Harvard Clinical and Translational Science Center (National Center for Advancing Translational Sciences, National Institutes of Health) (award UM1TR004408) and financial contributions from Harvard University and its affiliated academic healthcare centers.
    Potential Conflicts of Interest: Rodman reports grant funding from the Gordon and Betty Moore Foundation. Crowe reports employment and equity in Solera Health. Kanjee reports receipt of royalties for books edited and membership on a paid advisory board for medical education products not related to AI from Wolters Kluwer, as well as honoraria for continuing medical education delivered from Oakstone Publishing. Abdulnour reports employment by the Massachusetts Medical Society (MMS), a not-for-profit organization that owns NEJM Healer. Abdulnour does not receive royalty from sales of NEJM Healer and does not have equity in NEJM Healer. No funding was provided by the MMS for this study. Abdulnour reports grant funding from the Gordan and Betty Moore Foundation via the National Academy of Medicine Scholars in Diagnostic Excellence. More