More stories

  • in

    Scientists just cracked the cryptographic code behind quantum supremacy

    Experts say quantum computing is the future of computers. Unlike conventional computers, quantum computers leverage the properties of quantum physics such as superposition and interference, theoretically outperforming current equipment to an exponential degree.
    When a quantum computer is able to solve a problem unfeasible for current technologies, this is called the quantum advantage. However, this edge is not guaranteed for all calculations, raising fundamental questions regarding the conditions under which such an advantage exists. While previous studies have proposed various sufficient conditions for quantum advantage, the necessity of these conditions has remained unclear.
    Motivated by this uncertainty, a team of researchers at Kyoto University has endeavored to understand the necessary and sufficient conditions for quantum advantage, using an approach combining techniques from quantum computing and cryptography, the science of coding information securely.
    Specifically, the team focused on interactive protocols called inefficient-verifier proofs of quantumness, which allow a verifier without a quantum computer to interact with a quantum prover and verify that it indeed possesses quantum computational power. In their study, the team demonstrated that the existence of these proofs depends on the existence of a certain cryptographic primitive called a one-way puzzle.
    By integrating these methods, the team introduced a novel framework uniting the seemingly unrelated concepts of quantum advantage and cryptographic security. As a result, the team was able to completely characterize quantum advantage for the first time.
    “We were able to identify the necessary and sufficient conditions for quantum advantage by proving an equivalence between the existence of quantum advantage and the security of certain quantum cryptographic primitives,” says corresponding author Yuki Shirakawa.
    The results imply that when quantum advantage does not exist, then the security of almost all cryptographic primitives — previously believed to be secure — is broken. Importantly, these primitives are not limited to quantum cryptography but also include widely-used conventional cryptographic primitives as well as post-quantum ones that are rapidly evolving.
    The established equivalence between quantum computing and cryptography also provides a stronger cryptographic foundation for future experimental demonstrations of quantum advantage, as well as for ongoing theoretical investigations in the field.
    “Quantum advantage is a highly expected and actively studied concept, but it is still not fully understood. Our study represents a significant step toward a deeper understanding of this property,” says Shirakawa.
    The team expects that future research will extend characterization to other types of quantum advantage and lead to a more general theoretical framework. More

  • in

    The real-life Kryptonite found in Serbia—and why it could power the future

    Jadarite has been likened to Superman’s ‘kryptonite’ based on their similar chemical compositions. It was discovered in the Jadar Valley of Serbia and officially recognized as a new mineral in 2006. Whilst lacking any actual superpowers, jadarite has great potential as an important resource of lithium and boron.Kryptonite’s twin on Earth Described as ‘Earth’s kryptonite twin’, jadarite is a rare and fascinating mineral that quickly caught the attention of scientists and Superman fans alike.
    The mineral was discovered by exploration geologists from Rio Tinto in 2004 in the Jadar Valley of Serbia. Its chemical composition is exactly like the fictional kryptonite right out of the comic books — with a few differences. Where kryptonite glows green and weakens superheroes, jadarite offers immense potential for Earth’s energy transition away from fossil fuels.
    A new mineral on the scene
    Jadarite was identified by Rio Tinto geologists during exploration drilling and didn’t match any known mineral at the time. After analysis by the Natural History Museum in London and the National Research Council of Canada, it was officially recognised as a new mineral in 2006.
    Jadarite is a “sodium lithium boron silicate hydroxide” mineral, coincidentally the same scientific name written on a case containing kryptonite stolen by Lex Luther from a museum in the film Superman Returns.

    While the film version of kryptonite contains fluorine and glows an eery green, the chemical formula of the real version is LiNaSiB₃O₇(OH) and is a much less supernatural dull white — though it does fluoresce a pinkish-orange under UV light.
    Super in its own right
    Michael Page, a scientist with Australia’s Nuclear Science and Technology Organisation (ANSTO), said that the mineral is ‘super’ in its own right.
    “While lacking any supernatural powers the real jadarite has great potential as an important source of lithium and boron,” Michael said.
    “In fact, the Jadar deposit where it was first discovered is considered one of the largest lithium deposits in the world, making it a potential game-changer for the global green energy transition.”
    ANSTO is one of the three supporting agencies of the Australian Critical Minerals R&D Hub, alongside Geoscience Australia and CSIRO, hosted by CSIRO. One of the Hub’s key missions is to better connect Australia’s R&D ecosystem, including Australian industry, to enable access and utilization of critical minerals to strengthen Australia’s value chain domestically and across the globe.
    The work that ANSTO does has a significant focus on how these critical minerals, such as jadarite, can be utilised to support Australian industry in a commercial capacity.
    “At ANSTO, we work with industry to develop process solutions for many critical elements including lithium, and the challenges posed by a new type of mineral resource are very exciting,” Michael said.
    ANSTO has produced battery grade lithium chemicals from many different mineral deposits, such as spodumene, lepidolite and even jadarite, ensuring that Australian miners receive the support they need to meet the challenges of the energy transition. More

  • in

    Trapped by moon dust: The physics error that fooled NASA for years

    When a multimillion-dollar extraterrestrial vehicle gets stuck in soft sand or gravel — as did the Mars rover Spirit in 2009 — Earth-based engineers take over like a virtual tow truck, issuing a series of commands that move its wheels or reverse its course in a delicate, time-consuming effort to free it and continue its exploratory mission.
    While Spirit remained permanently stuck, in the future, better terrain testing right here on terra firma could help avert these celestial crises.
    Using computer simulations, University of Wisconsin-Madison mechanical engineers have uncovered a flaw in how rovers are tested on Earth. That error leads to overly optimistic conclusions about how rovers will behave once they’re deployed on extraterrestrial missions.
    An important element in preparing for these missions is an accurate understanding of how a rover will traverse extraterrestrial surfaces in low gravity to prevent it from getting stuck in soft terrain or rocky areas.
    On the moon, the gravitational pull is six times weaker than on Earth. For decades, researchers testing rovers have accounted for that difference in gravity by creating a prototype that is a sixth of the mass of the actual rover. They test these lightweight rovers in deserts, observing how it moves across sand to gain insights into how it would perform on the moon.
    It turns out, however, that this standard testing approach overlooked a seemingly inconsequential detail: the pull of Earth’s gravity on the desert sand.
    Through simulation, Dan Negrut, a professor of mechanical engineering at UW-Madison, and his collaborators determined that Earth’s gravity pulls down on sand much more strongly than the gravity on Mars or the moon does. On Earth, sand is more rigid and supportive — reducing the likelihood it will shift under a vehicle’s wheels. But the moon’s surface is “fluffier” and therefore shifts more easily — meaning rovers have less traction, which can hinder their mobility.

    “In retrospect, the idea is simple: We need to consider not only the gravitational pull on the rover but also the effect of gravity on the sand to get a better picture of how the rover will perform on the moon,” Negrut says. “Our findings underscore the value of using physics-based simulation to analyze rover mobility on granular soil.”
    The team recently detailed its findings in the Journal of Field Robotics.
    The researchers’ discovery resulted from their work on a NASA-funded project to simulate the VIPER rover, which had been planned for a lunar mission. The team leveraged Project Chrono, an open-source physics simulation engine developed at UW-Madison in collaboration with scientists from Italy. This software allows researchers to quickly and accurately model complex mechanical systems — like full-size rovers operating on “squishy” sand or soil surfaces.
    While simulating the VIPER rover, they noticed discrepancies between the Earth-based test results and their simulations of the rover’s mobility on the moon. Digging deeper with Chrono simulations revealed the testing flaw.
    The benefits of this research also extend well beyond NASA and space travel. For applications on Earth, Chrono has been used by hundreds of organizations to better understand complex mechanical systems — from precision mechanical watches to U.S. Army trucks and tanks operating in off-road conditions.
    “It’s rewarding that our research is highly relevant in helping to solve many real-world engineering challenges,” Negrut says. “I’m proud of what we’ve accomplished. It’s very difficult as a university lab to put out industrial-strength software that is used by NASA.”
    Chrono is free and publicly available for unfettered use worldwide, but the UW-Madison team puts in significant ongoing work to develop and maintain the software and provide user support.

    “It’s very unusual in academia to produce a software product at this level,” Negrut says. “There are certain types of applications relevant to NASA and planetary exploration where our simulator can solve problems that no other tool can solve, including simulators from huge tech companies, and that’s exciting.”
    Since Chrono is open source, Negrut and his team are focused on continually innovating and enhancing the software to stay relevant.
    “All our ideas are in the public domain and the competition can adopt them quickly, which is drives us to keep moving forward,” he says. “We have been fortunate over the last decade to receive support from the National Science Foundation, U.S. Army Research Office and NASA. This funding has really made a difference, since we do not charge anyone for the use of our software.”
    Co-authors on the paper include Wei Hu of Shanghai Jiao Tong University, Pei Li of UW-Madison, Arno Rogg and Alexander Schepelmann of NASA, Samuel Chandler of ProtoInnovations, LLC, and Ken Kamrin of MIT.
    This work was supported by NASA STTR (80NSSC20C0252), the National Science Foundation (OAC2209791) and the U.S. Army Research Office, (W911NF1910431 and W911NF1810476). More

  • in

    Harvard’s ultra-thin chip could revolutionize quantum computing

    New research shows that metasurfaces could be used as strong linear quantum optical networks This approach could eliminate the need for waveguides and other conventional optical components Graph theory is helpful for designing the functionalities of quantum optical networks into a single metasurfaceIn the race toward practical quantum computers and networks, photons — fundamental particles of light — hold intriguing possibilities as fast carriers of information at room temperature. Photons are typically controlled and coaxed into quantum states via waveguides on extended microchips, or through bulky devices built from lenses, mirrors, and beam splitters. The photons become entangled – enabling them to encode and process quantum information in parallel – through complex networks of these optical components. But such systems are notoriously difficult to scale up due to the large numbers and imperfections of parts required to do any meaningful computation or networking.Could all those optical components could be collapsed into a single, flat, ultra-thin array of subwavelength elements that control light in the exact same way, but with far fewer fabricated parts?
    Optics researchers in the Harvard John A. Paulson School of Engineering and Applied Sciences (SEAS) did just that. The research team led by Federico Capasso, the Robert L. Wallace Professor of Applied Physics and Vinton Hayes Senior Research Fellow in Electrical Engineering, created specially designed metasurfaces — flat devices etched with nanoscale light-manipulating patterns — to act as ultra-thin upgrades for quantum-optical chips and setups.
    The research was published in Science and funded by the Air Force Office of Scientific Research (AFOSR).
    Capasso and his team showed that a metasurface can create complex, entangled states of photons to carry out quantum operations – like those done with larger optical devices with many different components.
    “We’re introducing a major technological advantage when it comes to solving the scalability problem,” said graduate student and first author Kerolos M.A. Yousef. “Now we can miniaturize an entire optical setup into a single metasurface that is very stable and robust.”
    Metasurfaces: Robust and scalable quantum photonics processors

    Their results hint at the possibility of paradigm-shifting optical quantum devices based not on conventional, difficult-to-scale components like waveguides and beam splitters, or even extended optical microchips, but instead on error-resistant metasurfaces that offer a host of advantages: designs that don’t require intricate alignments, robustness to perturbations, cost-effectiveness, simplicity of fabrication, and low optical loss. Broadly speaking, the work embodies metasurface-based quantum optics which, beyond carving a path toward room-temperature quantum computers and networks, could also benefit quantum sensing or offer “lab-on-a-chip” capabilities for fundamental science
    Designing a single metasurface that can finely control properties like brightness, phase, and polarization presented unique challenges because of the mathematical complexity that arises once the number of photons and therefore the number of qubits begins to increase. Every additional photon introduces many new interference pathways, which in a conventional setup would require a rapidly growing number of beam splitters and output ports.
    Graph theory for metasurface design
    To bring order to the complexity, the researchers leaned on a branch of mathematics called graph theory, which uses points and lines to represent connections and relationships. By representing entangled photon states as many connected lines and points, they were able to visually determine how photons interfere with each other, and to predict their effects in experiments. Graph theory is also used in certain types of quantum computing and quantum error correction but is not typically considered in the context of metasurfaces, including their design and operation.
    The resulting paper was a collaboration with the lab of Marko Loncar, whose team specializes in quantum optics and integrated photonics and provided needed expertise and equipment.
    “I’m excited about this approach, because it could efficiently scale optical quantum computers and networks — which has long been their biggest challenge compared to other platforms like superconductors or atoms,” said research scientist Neal Sinclair. “It also offers fresh insight into the understanding, design, and application of metasurfaces, especially for generating and controlling quantum light. With the graph approach, in a way, metasurface design and the optical quantum state become two sides of the same coin.”
    The research received support from federal sources including the AFOSR under award No. FA9550-21-1-0312. The work was performed at the Harvard University Center for Nanoscale Systems More

  • in

    AI turns immune cells into precision cancer killers—in just weeks

    Precision cancer treatment on a larger scale is moving closer after researchers have developed an AI platform that can tailor protein components and arm the patient’s immune cells to fight cancer. The new method, published in the scientific journal Science, demonstrates for the first time, that it is possible to design proteins in the computer for redirecting immune cells to target cancer cells through pMHC molecules.
    This dramatically shortens the process of finding effective molecules for cancer treatment from years to a few weeks.
    “We are essentially creating a new set of eyes for the immune system. Current methods for individual cancer treatment are based on finding so-called T-cell receptors in the immune system of a patient or donor that can be used for treatment. This is a very time-consuming and challenging process. Our platform designs molecular keys to target cancer cells using the AI platform, and it does so at incredible speed, so that a new lead molecule can be ready within 4-6 weeks,” says Associate Professor at the Technical University of Denmark (DTU) and last author of the study Timothy P. Jenkins.
    Targeted missiles against cancer
    The AI platform, developed by a team from DTU and the American Scripps Research Institute, aims to solve a major challenge in cancer immunotherapy by demonstrating how scientists can generate target treatments for tumor cells and avoid damaging healthy tissue.
    Normally, T cells naturally identify cancer cells by recognizing specific protein fragments, known as peptides, presented on the cell surface by molecules called pMHCs.It is a slow and challenging process to utilize this knowledge for therapy, often because the variation in the body’s own T-cell receptors makes it challenging to create a personalized treatment.
    Boosting the body’s immune system
    In the study, the researchers tested the strength of the AI platform on a well-known cancer target, NY-ESO-1, which is found in a wide range of cancers. The team succeeded in designing a minibinder that bound tightly to the NY-ESO-1 pMHC molecules. When the designed protein was inserted into T cells, it created a unique new cell product named ‘IMPAC-T’ cells by the researchers, which effectively guided the T cells to kill cancer cells in laboratory experiments.

    “It was incredibly exciting to take these minibinders, which were created entirely on a computer, and see them work so effectively in the laboratory,” says postdoc Kristoffer Haurum Johansen, co-author of the study and researcher at DTU.
    The researchers also applied the pipeline to design binders for a cancer target identified in a metastatic melanoma patient, successfully generating binders for this target as well. This documented that the method also can be used for tailored immunotherapy against novel cancer targets.
    Screening of treatments
    A crucial step in the researchers’ innovation was the development of a ‘virtual safety check’. The team used AI to screen their designed minibinders and assess them in relation to pMHC molecules found on healthy cells. This method enabled them to filter out minibinders that could cause dangerous side effects before any experiments were carried out.
    “Precision in cancer treatment is crucial. By predicting and ruling out cross-reactions already in the design phase, we were able to reduce the risk associated with the designed proteins and increase the likelihood of designing a safe and effective therapy,” says DTU professor and co-author of the study Sine Reker Hadrup.
    Five years to treatment
    Timothy Patrick Jenkins expects that it will take up to five years before the new method is ready for initial clinical trials in humans. Once the method is ready, the treatment process will resemble current cancer treatments using genetically modified T cells, known as CAR-T cells, which are currently used to treat lymphoma and leukemia.Patients will first have blood drawn at the hospital, similar to a routine blood test. Their immune cells will then be extracted from this blood sample and modified in the laboratory to carry the AI-designed minibinders. These enhanced immune cells are returned to the patient, where they act like targeted missiles, precisely finding and eliminating cancer cells in the body. More

  • in

    Google’s deepfake hunter sees what you can’t—even in videos without faces

    In an era where manipulated videos can spread disinformation, bully people, and incite harm, UC Riverside researchers have created a powerful new system to expose these fakes.
    Amit Roy-Chowdhury, a professor of electrical and computer engineering, and doctoral candidate Rohit Kundu, both from UCR’s Marlan and Rosemary Bourns College of Engineering, teamed up with Google scientists to develop an artificial intelligence model that detects video tampering — even when manipulations go far beyond face swaps and altered speech. (Roy-Chowdhury is also the co-director of the UC Riverside Artificial Intelligence Research and Education (RAISE) Institute, a new interdisciplinary research center at UCR.)
    Their new system, called the Universal Network for Identifying Tampered and synthEtic videos (UNITE), detects forgeries by examining not just faces but full video frames, including backgrounds and motion patterns. This analysis makes it one of the first tools capable of identifying synthetic or doctored videos that do not rely on facial content.
    “Deepfakes have evolved,” Kundu said. “They’re not just about face swaps anymore. People are now creating entirely fake videos — from faces to backgrounds — using powerful generative models. Our system is built to catch all of that.”
    UNITE’s development comes as text-to-video and image-to-video generation have become widely available online. These AI platforms enable virtually anyone to fabricate highly convincing videos, posing serious risks to individuals, institutions, and democracy itself.
    “It’s scary how accessible these tools have become,” Kundu said. “Anyone with moderate skills can bypass safety filters and generate realistic videos of public figures saying things they never said.”
    Kundu explained that earlier deepfake detectors focused almost entirely on face cues.

    “If there’s no face in the frame, many detectors simply don’t work,” he said. “But disinformation can come in many forms. Altering a scene’s background can distort the truth just as easily.”
    To address this, UNITE uses a transformer-based deep learning model to analyze video clips. It detects subtle spatial and temporal inconsistencies — cues often missed by previous systems. The model draws on a foundational AI framework known as SigLIP, which extracts features not bound to a specific person or object. A novel training method, dubbed “attention-diversity loss,” prompts the system to monitor multiple visual regions in each frame, preventing it from focusing solely on faces.
    The result is a universal detector capable of flagging a range of forgeries — from simple facial swaps to complex, fully synthetic videos generated without any real footage.
    “It’s one model that handles all these scenarios,” Kundu said. “That’s what makes it universal.”
    The researchers presented their findings at the high ranking 2025 Conference on Computer Vision and Pattern Recognition (CVPR) in Nashville, Tenn. Titled “Towards a Universal Synthetic Video Detector: From Face or Background Manipulations to Fully AI-Generated Content,” their paper, led by Kundu, outlines UNITE’s architecture and training methodology. Co-authors include Google researchers Hao Xiong, Vishal Mohanty, and Athula Balachandra. Co-sponsored by the IEEE Computer Society and the Computer Vision Foundation, CVPR is among the highest-impact scientific publication venues in the world.
    The collaboration with Google, where Kundu interned, provided access to expansive datasets and computing resources needed to train the model on a broad range of synthetic content, including videos generated from text or still images — formats that often stump existing detectors.
    Though still in development, UNITE could soon play a vital role in defending against video disinformation. Potential users include social media platforms, fact-checkers, and newsrooms working to prevent manipulated videos from going viral.
    “People deserve to know whether what they’re seeing is real,” Kundu said. “And as AI gets better at faking reality, we have to get better at revealing the truth.” More

  • in

    One small qubit, one giant leap for quantum computing

    On July 8, 2025, physicists from Aalto University in Finland published a transmon qubit coherence dramatically surpassing previous scientifically published records. The millisecond coherence measurement marks a quantum leap in computational technology, with the previous maximum echo coherence measurements approaching 0.6 milliseconds.
    Longer qubit coherence allows for an extended window of time in which quantum computers can execute error-free operations, enabling more complex quantum computations and more quantum logic operations before errors occur. Not only does this allow for more calculations with noisy quantum computers, but it also decreases the resources needed for quantum error correction, which is a path to noiseless quantum computing.
    “We have just measured an echo coherence time for a transmon qubit that landed at a millisecond at maximum with a median of half a millisecond,” says Mikko Tuokkola, the PhD student who conducted and analyzed the measurements. The median reading is particularly significant, as it also surpasses current recorded readings.
    The findings have been just published in the prestigious peer-reviewed journal Nature Communications.
    The researchers report their approach as thoroughly as possible, with the aim of making it reproducible for research groups around the world.
    Finland cements position at forefront of quantum
    Tuokkala was supervised at Aalto University by postdoctoral researcher Dr. Yoshiki Sunada, who fabricated the chip and built the measurement setup.

    “We have been able to reproducibly fabricate high-quality transmon qubits. The fact that this can be achieved in a cleanroom which is accessible for academic research is a testament to Finland’s leading position in quantum science and technology,” adds Sunada who is currently working in Stanford University, USA.
    The work is a result of the Quantum Computing and Devices (QCD) research group which is a part of Aalto University’s Department of Applied Physics, Academy of Finland Centre of Excellence in Quantum Technology (QTF), and the Finnish Quantum Flagship (FQF).
    The qubit was fabricated by the QCD group at Aalto using high-quality superconducting film supplied by the Technical Research Centre of Finland (VTT). The success reflects the high quality of Micronova cleanrooms at OtaNano, Finland’s national research infrastructure for micro-, nano-, and quantum technologies.
    “This landmark achievement has strengthened Finland’s standing as a global leader in the field, moving the needle forward on what can be made possible with the quantum computers of the future,” says Professor of Quantum Technology Mikko Möttönen, who heads the QCD group.
    Scaling up the quantum computers of the future requires advancements across several domains. Among them are noise reduction, qubit-count increases, and the qubit coherence time improvements at the center of the new observations from the QCD. The group just opened a senior staff member and two postdocs positions for achieving future breakthroughs faster. More

  • in

    A simple twist fooled AI—and revealed a dangerous flaw in medical ethics

    A study by investigators at the Icahn School of Medicine at Mount Sinai, in collaboration with colleagues from Rabin Medical Center in Israel and other collaborators, suggests that even the most advanced artificial intelligence (AI) models can make surprisingly simple mistakes when faced with complex medical ethics scenarios.
    The findings, which raise important questions about how and when to rely on large language models (LLMs), such as ChatGPT, in health care settings, were reported in the July 22 online issue of NPJ Digital Medicine[10.1038/s41746-025-01792-y].
    The research team was inspired by Daniel Kahneman’s book “Thinking, Fast and Slow,” which contrasts fast, intuitive reactions with slower, analytical reasoning. It has been observed that large language models (LLMs) falter when classic lateral-thinking puzzles receive subtle tweaks. Building on this insight, the study tested how well AI systems shift between these two modes when confronted with well-known ethical dilemmas that had been deliberately tweaked.
    “AI can be very powerful and efficient, but our study showed that it may default to the most familiar or intuitive answer, even when that response overlooks critical details,” says co-senior author Eyal Klang, MD, Chief of Generative AI in the Windreich Department of Artificial Intelligence and Human Health at the Icahn School of Medicine at Mount Sinai. “In everyday situations, that kind of thinking might go unnoticed. But in health care, where decisions often carry serious ethical and clinical implications, missing those nuances can have real consequences for patients.”
    To explore this tendency, the research team tested several commercially available LLMs using a combination of creative lateral thinking puzzles and slightly modified well-known medical ethics cases. In one example, they adapted the classic “Surgeon’s Dilemma,” a widely cited 1970s puzzle that highlights implicit gender bias. In the original version, a boy is injured in a car accident with his father and rushed to the hospital, where the surgeon exclaims, “I can’t operate on this boy — he’s my son!” The twist is that the surgeon is his mother, though many people don’t consider that possibility due to gender bias. In the researchers’ modified version, they explicitly stated that the boy’s father was the surgeon, removing the ambiguity. Even so, some AI models still responded that the surgeon must be the boy’s mother. The error reveals how LLMs can cling to familiar patterns, even when contradicted by new information.
    In another example to test whether LLMs rely on familiar patterns, the researchers drew from a classic ethical dilemma in which religious parents refuse a life-saving blood transfusion for their child. Even when the researchers altered the scenario to state that the parents had already consented, many models still recommended overriding a refusal that no longer existed.
    “Our findings don’t suggest that AI has no place in medical practice, but they do highlight the need for thoughtful human oversight, especially in situations that require ethical sensitivity, nuanced judgment, or emotional intelligence,” says co-senior corresponding author Girish N. Nadkarni, MD, MPH, Chair of the Windreich Department of Artificial Intelligence and Human Health, Director of the Hasso Plattner Institute for Digital Health, Irene and Dr. Arthur M. Fishberg Professor of Medicine at the Icahn School of Medicine at Mount Sinai, and Chief AI Officer of the Mount Sinai Health System. “Naturally, these tools can be incredibly helpful, but they’re not infallible. Physicians and patients alike should understand that AI is best used as a complement to enhance clinical expertise, not a substitute for it, particularly when navigating complex or high-stakes decisions. Ultimately, the goal is to build more reliable and ethically sound ways to integrate AI into patient care.”
    “Simple tweaks to familiar cases exposed blind spots that clinicians can’t afford,” says lead author Shelly Soffer, MD, a Fellow at the Institute of Hematology, Davidoff Cancer Center, Rabin Medical Center. “It underscores why human oversight must stay central when we deploy AI in patient care.”

    Next, the research team plans to expand their work by testing a wider range of clinical examples. They’re also developing an “AI assurance lab” to systematically evaluate how well different models handle real-world medical complexity.
    The paper is titled “Pitfalls of Large Language Models in Medical Ethics Reasoning.”
    The study’s authors, as listed in the journal, are Shelly Soffer, MD; Vera Sorin, MD; Girish N. Nadkarni, MD, MPH; and Eyal Klang, MD.
    About Mount Sinai’s Windreich Department of AI and Human Health
    Led by Girish N. Nadkarni, MD, MPH — an international authority on the safe, effective, and ethical use of AI in health care — Mount Sinai’s Windreich Department of AI and Human Health is the first of its kind at a U.S. medical school, pioneering transformative advancements at the intersection of artificial intelligence and human health.
    The Department is committed to leveraging AI in a responsible, effective, ethical, and safe manner to transform research, clinical care, education, and operations. By bringing together world-class AI expertise, cutting-edge infrastructure, and unparalleled computational power, the department is advancing breakthroughs in multi-scale, multimodal data integration while streamlining pathways for rapid testing and translation into practice.

    The Department benefits from dynamic collaborations across Mount Sinai, including with the Hasso Plattner Institute for Digital Health at Mount Sinai — a partnership between the Hasso Plattner Institute for Digital Engineering in Potsdam, Germany, and the Mount Sinai Health System — which complements its mission by advancing data-driven approaches to improve patient care and health outcomes.
    At the heart of this innovation is the renowned Icahn School of Medicine at Mount Sinai, which serves as a central hub for learning and collaboration. This unique integration enables dynamic partnerships across institutes, academic departments, hospitals, and outpatient centers, driving progress in disease prevention, improving treatments for complex illnesses, and elevating quality of life on a global scale.
    In 2024, the Department’s innovative NutriScan AI application, developed by the Mount Sinai Health System Clinical Data Science team in partnership with Department faculty, earned Mount Sinai Health System the prestigious Hearst Health Prize. NutriScan is designed to facilitate faster identification and treatment of malnutrition in hospitalized patients. This machine learning tool improves malnutrition diagnosis rates and resource utilization, demonstrating the impactful application of AI in health care.
    * Mount Sinai Health System member hospitals: The Mount Sinai Hospital; Mount Sinai Brooklyn; Mount Sinai Morningside; Mount Sinai Queens; Mount Sinai South Nassau; Mount Sinai West; and New York Eye and Ear Infirmary of Mount Sinai More