Smaller, Faster, Greener: Cutting-Edge AI Models Do More with Less

The demand for electricity and water to power and cool AI servers is ever increasing. Researchers are developing innovative solutions to mitigate the environmental impact. Four promising techniques include model reuse, ReLora, MoE, and quantization. As AI becomes more prevalent, we need to proactively reduce energy and water usage to benefit clients and contribute to a sustainable future.

Overview of AI’s Environmental Impact

As AI continues to advance and expand, the industry is proactively addressing the growing demand for electricity and water to power and cool the servers that make this technology possible. A standard DGX computer, the gold standard for AI work, consumes over 10KW of power. Big Tech will buy millions of these systems this year, using more power than all of New York City, and with this comes a responsibility to find sustainable ways to manage the energy consumption. To mitigate the environmental impact, researchers and engineers are already developing innovative solutions.

The Growing Environmental Challenges in AI Technology


But it is not just the electricity needed to run these computers.  They get hot, really hot, and so they need cooling.  You have to get rid of that heat.  That typically takes up two times more power than the actual computer.  So now that 10KW machine is really using 30KW when running.  These new servers will consume three times more than all of the electricity used in California in 2022! To get around this, server farms are exploring alternative cooling methods, such as using water, to reduce electricity usage. While this shifts the resource burden, it also presents an opportunity to develop more efficient and eco-friendly cooling technologies.

Sustainable Solutions for AI Energy and Water Usage

This saves electricity, but is using our precious fresh water to help cut costs.

Case Studies: Effective AI Sustainability Techniques

AI is hungry for power, and things are going to get worse.  How can we solve this problem?  Fortunately, researchers are already starting to pursue more efficient methods of making and using AI.  Four promising techniques are model reuse, ReLora, MoE, and quantization.

Selecting Technologies for Sustainable AI

Model reuse involves retraining an already trained model for a new purpose, saving time and energy compared to training from scratch. This approach not only conserves resources but also often results in better-performing models.  Both Meta (Facebooks parent) and Mixtral have been good about releasing models that can be reused.

ReLora and Lora reduce the number of calculations needed when retraining models for new uses, further saving energy and enabling the use of smaller, less power-hungry computers. This means that instead of relying on large, energy-intensive systems like NVidia’s DGX, a modest graphics card can often suffice for retraining.

MoE models, such as those recently released by Mistral, have fewer parameters than conventional models, resulting in fewer calculations and reduced energy consumption.

Moreover, MoE models only activate the necessary blocks when in use, much like turning off lights in unused rooms, leading to a 65% reduction in energy usage.

Advantages of Energy-Efficient AI Models

Quantization is an innovative technique that reduces the size of AI models with minimal impact on performance. By quantizing a model, the number of bits required to represent each parameter is reduced. This shrinks the model size, enabling the use of less powerful and more energy-efficient hardware. For instance, a massive 40 billion parameter model would typically require an energy-hungry GPU system like the DGX to run efficiently. But with quantization, this same model can be optimized to run on a low-power consumer GPU, like those found in most laptops. While quantization can slightly reduce model accuracy in some cases, for many practical applications this tradeoff is negligible or unnoticeable. The performance remains excellent while requiring a fraction of the computing resources.

The Impact of Sustainable Practices in AI on Industry

Overall, quantization provides a way to make AI models much more efficient, compact and eco-friendly, minimizing the hardware requirements and energy consumption. This allows state-of-the-art AI to run on ubiquitous consumer devices while maintaining accuracy where it matters most. Quantization represents an important step towards scalable and sustainable AI.

Current Status of Sustainable AI Developments

By combining these four techniques, we have successfully reused a 47 billion parameter MoE model and retrained it for a client using a server that consumes less than 1KW of power, completing the process in just 10 hours. Furthermore, the client can run the model on standard Apple Mac computers with energy-efficient M2 silicon chips. At smartR AI, when developing and training new models, such as our generative AI loyal companion SCOTi® AI, we have been privileged to be able to utilize the super computer at EPCC, Edinburgh University, reducing the time span required for training of models substantially – we trained a model from scratch in nearly one hour.

Timeline of Advances in AI Sustainability

As AI becomes more prevalent, we all need to start thinking more proactively about the energy and water usage.  Research into more efficient training and utilization methods is yielding promising results. But we also need to start using these methods actively; by integrating these new techniques into our tool flows, we not only benefit our clients but also contribute to a more sustainable future for our planet.


Recent Content

The telecom industry is in the midst of a major shift from “telco” to “techco”, with operators investing in AI, 5G, cloud computing, and digital services to compete with tech giants like Amazon and Google. At MWC 2025, leaders from e&, KDDI, MTN, and SK Telecom discussed their AI-driven strategies, including self-healing networks, smart city infrastructure, fintech expansion, and enterprise 5G solutions. As telcos embrace AI-powered automation and cloud-based innovations, they are redefining their role in the digital economy.
Ericsson, Volvo Group, and Airtel have joined forces to explore how 5G Advanced, Digital Twin technology, and Extended Reality (XR) can transform manufacturing in India. The research, conducted at Volvo’s R&D Centre in Bangalore, will focus on smart factories, immersive training, and real-time process optimization. With Airtel’s low-latency 5G network, the collaboration aims to enhance industrial automation, workforce training, and AI-driven efficiencies, setting a benchmark for Industry 4.0 and Industry 5.0 innovations.
The Department of Telecommunications (DoT) has announced the 5G Innovation Hackathon 2025, a six-month competition to drive 5G-powered solutions across industries. Open to students, startups, and professionals, the hackathon will focus on AI, IoT, smart cities, and next-gen connectivity innovations. Participants will receive funding, mentorship, and access to 100+ 5G Use Case Labs. Winners will showcase their solutions at India Mobile Congress 2025.
Telecom networks are facing unprecedented complexity with 5G, IoT, and cloud services. Traditional service assurance methods are becoming obsolete, making AI-driven, real-time analytics essential for competitive advantage. This independent industry whitepaper explores how DPUs, GPUs, and Generative AI (GenAI) are enabling predictive automation, reducing operational costs, and improving service quality. Discover key insights, real-world case studies, and strategic actions for telecom leaders. Download the Full Report Now to stay ahead in AI-powered service assurance.
Dive into our in-depth coverage of MWC 2025, highlighting the latest innovations in 5G, AI, IoT, and more. Discover how industry leaders are shaping the future of technology with groundbreaking announcements and developments unveiled during the event.
At MWC 2025 Keynote 12: Future of Work and Economic Growth, industry leaders explored how AI, talent shortages, and startup growth are reshaping global markets. From Europe’s role in applied AI to the importance of scaling startups internationally, the discussions offered crucial insights for entrepreneurs, investors, and tech professionals. Discover key takeaways on AI-driven industries, workforce transformation, and economic innovation. Featuring Euan Blair (Multiverse), Saadia Zahidi (WEF), Yoram Wijngaarde (Dealroom.co), Renate Nikolay (European Commission), and Jordi Romero (Factorial), this session explores workforce transformation, AI’s role in labor markets, and strategies to boost Europe’s innovation and competitiveness.

Download Magazine

With Subscription
Whitepaper
Telecom networks are facing unprecedented complexity with 5G, IoT, and cloud services. Traditional service assurance methods are becoming obsolete, making AI-driven, real-time analytics essential for competitive advantage. This independent industry whitepaper explores how DPUs, GPUs, and Generative AI (GenAI) are enabling predictive automation, reducing operational costs, and improving service quality....
Whitepaper
Explore the collaboration between Purdue Research Foundation, Purdue University, Ericsson, and Saab at the Aviation Innovation Hub. Discover how private 5G networks, real-time analytics, and sustainable innovations are shaping the "Airport of the Future" for a smarter, safer, and greener aviation industry....
Article & Insights
This article explores the deployment of 5G NR Transparent Non-Terrestrial Networks (NTNs), detailing the architecture's advantages and challenges. It highlights how this "bent-pipe" NTN approach integrates ground-based gNodeB components with NGSO satellite constellations to expand global connectivity. Key challenges like moving beam management, interference mitigation, and latency are discussed, underscoring...

Subscribe To Our Newsletter

Scroll to Top

Sponsored by RADCOM

AI-Powered Service Assurance: Are You Ready?

5G, IoT, and cloud networks demand real-time, AI-driven service assurance.
  • How AI, DPUs & GenAI are transforming network operations.
  • Why predictive automation is critical for telecom success.
  • How leading CSPs are reducing costs & optimizing performance with AI.

Don’t get left behind—embrace AI-powered service assurance today!