TechNrs Hub

TechNrs Hub Your health and Knowledge of technological advancement are important
How to inculcate health and technology for better functionality

Why everyone is freaking out about DeepSeek?It took about a month for the finance world to start freaking out about Deep...
29/01/2025

Why everyone is freaking out about DeepSeek?

It took about a month for the finance world to start freaking out about DeepSeek, but when it did, it took more than half a trillion dollars — or one entire Stargate — off Nvidia’s market cap. It wasn’t just Nvidia, either: Tesla, Google, Amazon, and Microsoft tanked.

DeepSeek’s two AI models, released in quick succession, put it on par with the best available from American labs, according to Alexandr Wang, Scale AI CEO. And DeepSeek seems to be working within constraints that mean it trained much more cheaply than its American peers. One of its recent models is said to cost just $5.6 million in the final training run, which is about the salary an American AI expert can command. Last year, Anthropic CEO Dario Amodei said the cost of training models ranged from $100 million to $1 billion. OpenAI’s GPT-4 cost more than $100 million, according to CEO Sam Altman. DeepSeek seems to have just upended our idea of how much AI costs, with potentially enormous implications across the industry.

This has all happened over just a few weeks. On Christmas Day, DeepSeek released a reasoning model (v3) that caused a lot of buzz. Its second model, R1, released last week, has been called “one of the most amazing and impressive breakthroughs I’ve ever seen” by Marc Andreessen, VC and adviser to President Donald Trump. The advances from DeepSeek’s models show that “the AI race will be very competitive,” says Trump’s AI and crypto czar David Sacks. Both models are partially open source, minus the training data.

DeepSeek’s successes call into question whether billions of dollars in compute are actually required to win the AI race. The conventional wisdom has been that big tech will dominate AI simply because it has the spare cash to chase advances. Now, it looks like big tech has simply been lighting money on fire. Figuring out how much the models actually cost is a little tricky because, as Scale AI’s Wang points out, DeepSeek may not be able to speak honestly about what kind and how many GPUs it has — as the result of sanctions.

Even if critics are correct and DeepSeek isn’t being truthful about what GPUs it has on hand (napkin math suggests the optimization techniques used means they are being truthful), it won’t take long for the open-source community to find out, according to Hugging Face’s head of research, Leandro von Werra. His team started working over the weekend to replicate and open-source the R1 recipe, and once researchers can create their own version of the model, “we’re going to find out pretty quickly if numbers add up.”

What is DeepSeek?

Led by CEO Liang Wenfeng, the two-year-old DeepSeek is China’s premier AI startup. It spun out from a hedge fund founded by engineers from Zhejiang University and is focused on “potentially game-changing architectural and algorithmic innovations” to build artificial general intelligence (AGI) — or at least, that’s what Liang says. Unlike OpenAI, it also claims to be profitable.

In 2021, Liang started buying thousands of Nvidia GPUs (just before the US put sanctions on chips) and launched DeepSeek in 2023 with the goal to “explore the essence of AGI,” or AI that’s as intelligent as humans. Liang follows a lot of the same lofty talking points as OpenAI CEO Altman and other industry leaders. “Our destination is AGI,” Liang said in an interview, “which means we need to study new model structures to realize stronger model capability with limited resources.”

So, that’s exactly what DeepSeek did. With a few innovative technical approaches that allowed its model to run more efficiently, the team claims its final training run for R1 cost $5.6 million. That’s a 95 percent cost reduction from OpenAI’s o1. Instead of starting from scratch, DeepSeek built its AI by using existing open-source models as a starting point — specifically, researchers used Meta’s Llama model as a foundation. While the company’s training data mix isn’t disclosed, DeepSeek did mention it used synthetic data, or artificially generated information (which might become more important as AI labs seem to hit a data wall).

Without the training data, it isn’t exactly clear how much of a “copy” this is of o1

Without the training data, it isn’t exactly clear how much of a “copy” this is of o1 — did DeepSeek use o1 to train R1? Around the time that the first paper was released in December, Altman posted that “it is (relatively) easy to copy something that you know works” and “it is extremely hard to do something new, risky, and difficult when you don’t know if it will work.” So the claim is that DeepSeek isn’t going to create new frontier models; it’s simply going to replicate old models. OpenAI investor Joshua Kushner also seemed to say that DeepSeek “was trained off of leading US frontier models.”

R1 used two key optimization tricks, former OpenAI policy researcher Miles Brundage told The Verge: more efficient pre-training and reinforcement learning on chain-of-thought reasoning. DeepSeek found smarter ways to use cheaper GPUs to train its AI, and part of what helped was using a new-ish technique for requiring the AI to “think” step by step through problems using trial and error (reinforcement learning) instead of copying humans. This combination allowed the model to achieve o1-level performance while using way less computing power and money.

“DeepSeek v3 and also DeepSeek v2 before that are basically the same sort of models as GPT-4, but just with more clever engineering tricks to get more bang for their buck in terms of GPUs,” Brundage said.

To be clear, other labs employ these techniques (DeepSeek used “mixture of experts,” which only activates parts of the model for certain queries. GPT-4 did that, too). The DeepSeek version innovated on this concept by creating more finely tuned expert categories and developing a more efficient way for them to communicate, which made the training process itself more efficient. The DeepSeek team also developed something called DeepSeekMLA (Multi-Head Latent Attention), which dramatically reduced the memory required to run AI models by compressing how the model stores and retrieves information.

What is shocking the world isn’t just the architecture that led to these models but the fact that it was able to so rapidly replicate OpenAI’s achievements within months, rather than the year-plus gap typically seen between major AI advances, Brundage added.

OpenAI positioned itself as uniquely capable of building advanced AI, and this public image just won the support of investors to build the world’s biggest AI data center infrastructure. But DeepSeek’s quick replication shows that technical advantages don’t last long — even when companies try to keep their methods secret.

“These close sourced companies, to some degree, they obviously live off people thinking they’re doing the greatest things and that’s how they can maintain their valuation. And maybe they overhyped a little bit to raise more money or build more projects,” von Werra says. “Whether they overclaimed what they have internally, nobody knows, obviously it’s to their advantage.”

Wearable devices like Fitbit can predict IBD flares 7 weeks in advanceWrist-worn devices could give advance notice of wh...
28/01/2025

Wearable devices like Fitbit can predict IBD flares 7 weeks in advance

Wrist-worn devices could give advance notice of when the wearer might experience an IBD flare.

For people with the unpredictable conditions falling under the inflammatory bowel disease (IBD) umbrella, an early warning system for flares may one day be available in the wearable device on the wrist.
A new study from researchers at Mount Sinai reports that such devices can predict imminent inflammatory and symptomatic IBD flares as far in advance as 7 weeks.
With such advance notice, it will be possible for people with IBD and their physicians to adjust medications to blunt the upcoming flares before it arrives.
The wearables already track physiological indicators such as heart rate, heart rate variability, steps, and pulse oximetry that exhibit significant changes far in advance of IBD flare-ups.

Wearable devices could provide an unprecedented 7 weeks’ advance warning of an inflammatory bowel disease (IBD) flare, according to a new study published by researchers at the Icahn School of Medicine at Mount Sinai in New York City.

The study finds that significant changes in physiological metrics tracked by three popular wearable devices, Apple Watch, Fitbit, and the Oura Ring, occurred in the weeks preceding an IBD flare.

What are the signs of an IBD flare?

As many as 3.1 million Americans have IBD, an umbrella-term form conditions — primarily Crohn’s disease and ulcerative colitis — in which inflammation of the bowel produces various unwelcome and often painful digestive symptoms.

IBD tends not to be continually active, and people who have it may go for extended periods without experiencing a flare-up of the symptoms that typically arrives without warning.

In the early stages of a flare, a physician may seek confirmation with a blood test and stool analysis, but at that point, the event is already underway. There is a need to better predict flares before they happen.

The Mount Sinai researchers identified signals in the body that turned out to be associated with an imminent flare-up of IBD symptoms:

longitudinal heart rate — heart rate changes over time
resting heart rate — a person’s heart rate when resting
heart rate variability — the amount of variation in time between heartbeats, or their regularity
steps — an indicator or physical activity
oxygenation — or pulse ox, the amount of oxygen blood hemoglobin is carrying.

Significantly, all of these indicators exhibited changes from baseline values up to 7 weeks before any indication of inflammation or IBD symptoms.

For the study, the authors recruited 309 adults from across the United States. All participants had a diagnosis of either Crohn’s disease or ulcerative colitis, and were taking medication for IBD.

Participants were expected to wear their devices 8 hours a day and respond to questionnaires a minimum of four times a week. The study began in December 2021 and ran until June 2023, with individuals remaining involved for as long as they wished.

High blood pressure poses several health risks, including a potentially higher risk for cognitive impairment.Researchers...
14/06/2024

High blood pressure poses several health risks, including a potentially higher risk for cognitive impairment.

Researchers are interested in finding what protective factors can help reduce the risk of cognitive problems among individuals with high blood pressure.

A recent study suggests that vigorous exercise habits may help decrease the risk of future cognitive impairment.
Impairment of cognitive function can affect all aspects of a person’s life, including quality of life and day-to-day activities.

Multiple factors can contribute to someone’s risk of developing problems in cognitive function, including high blood pressure, or hypertension. Researchers are interested in finding potential protective actions that people with high blood pressure can take.

A new study published in Alzheimer’s & Dementia: The Journal of the Alzheimer’s Association examined the relationship between vigorous physical activity and risk for mild cognitive impairment among people with high blood pressure.

Researchers found that participants who engaged in one or more sessions of vigorous physical activity each week were at a lower risk for mild cognitive impairment and probable dementia.

The results suggest that vigorous exercise may help preserve cognitive function among certain individuals.

High blood pressure’s impact on cognitive function

High blood pressure occurs when the force of blood pressing against blood vessel walls gets outside of a certain range. It can lead to damaged blood vessels and increase people’s risk for heart problems and stroke.

A normal blood pressure reading is less than 120/80 millimeter of mercury (mmHg), and doctors may diagnose someone with high blood pressure when a systolic reading 130 mmHg or more or when a diastolic reading is 80 mmHg or more.

Previous research has also linked high blood pressure in midlife with a higher risk for cognitive disorders. The authors of the current study note that people with high blood pressure are at a higher risk for Alzheimer’s disease, vascular dementia, and mild cognitive impairment.

José Morales, MD, a vascular neurologist and neurointerventional surgeon at the Pacific Neuroscience Institute in Santa Monica, CA, not involved in the current research, explained to Medical News Today that:

“Hypertension damages the small blood vessels in our brain and also causes them to malfunction. This results in progressive damage to the brain, which in turn leads to cognitive impairment.”

Vigorous exercise as protective factor for cognition

The researchers who conducted the current study wanted to evaluate if vigorous exercise helped with the risk for mild cognitive impairment and probable dementia.

This study was a post hoc analysis using data from the SPRINT MIND STUDY, which formed part of the SPRINT trial. This trial involved over 9,000 adults in the United States who had high blood pressure.

At enrolment, participants were asked about the frequency of participating in vigorous physical activity. Vigorous physical activity was defined as activities that induced sweat, increased heart rate, or increased breathing.

Participants could pick their level of vigorous physical activity from the following categories:

- Rarely or never
- One to three vigorous activity sessions a month
- One vigorous activity session a week
- Two to four vigorous activity sessions a week
- Five or more vigorous activity sessions a week.

In the analysis, researchers divided participants into a low-vigorous physical activity group and a high-vigorous physical activity group.

The low-vigorous physical activity group had less than one vigorous activity session a week, and the high-vigorous physical activity group had one or more vigorous activity sessions a week.

All participants also underwent cognitive assessment tests, and covariates included components like age, education, smoking, use of antihypertensive medication, body mass index (BMI), and alcohol use.

The researchers excluded participants who had limited physical function or missing cognitive assessments, allowing them to include 7,670 participants in their final analysis.

The average follow-up time with participants was 4.5 years, and over this time, there were identified cases of mild cognitive impairment and probable dementia.

Overall, participants in the high vigorous physical activity group were at a lower risk for mild cognitive impairment and probable dementia.

The association was stronger among participants less than 75 years old at baseline, and Black participants. The association also appeared stronger in participants with prior cardiovascular disease.

Address

Abuja

Alerts

Be the first to know and let us send you an email when TechNrs Hub posts news and promotions. Your email address will not be used for any other purpose, and you can unsubscribe at any time.

Contact The Practice

Send a message to TechNrs Hub:

Share