Anyscale

Anyscale

Software Development

San Francisco, California 26,329 followers

Scalable compute for AI and Python

About us

Scalable compute for AI and Python Anyscale enables developers of all skill levels to easily build applications that run at any scale, from a laptop to a data center.

Website
https://anyscale.com
Industry
Software Development
Company size
51-200 employees
Headquarters
San Francisco, California
Type
Privately Held
Founded
2019

Products

Locations

Employees at Anyscale

Updates

  • Anyscale reposted this

    View profile for Robert Nishihara, graphic

    Co-founder at Anyscale (We are hiring!)

    Many of the companies you'll hear from at #RaySummit have gone through a 5-10 year AI infrastructure journey (even longer in some cases). They've managed the migration from classical ML to deep learning, then from deep learning to generative AI, and they are gearing up for the next generation of advances. The amount of collective wisdom and hard-earned lessons is astounding. I'm always blown away by how technical and how information dense the content is. https://lnkd.in/gMVdCgiK

    • No alternative text description for this image
  • View organization page for Anyscale, graphic

    26,329 followers

    🚨 It's not too late! 🚨 Join our webinar 8/1 (tomorrow) to learn how to scale and productionize GenAI and LLM workloads cost-effectively with Anyscale & AWS! Topics we'll cover: ⭐ Utilizing CPU & GPU for optimal performance ⭐ Leveraging AWS compute for large-scale GPU workloads ⭐ Anyscale cluster management optimizations Sign up now: https:// https://lnkd.in/gqU-deHp

  • View organization page for Anyscale, graphic

    26,329 followers

    ✨ Exciting Announcement ✨ Sergey Edunov, Director of Engineering, GenAI, at Meta, is joining us as a speaker at #RaySummit! 🎉 At Meta, Sergey has spearheaded breakthrough projects in AI and machine learning, with work that includes multiple patents and significant contributions to leading tech journals. Join us to hear Sergey discuss scalable AI, cutting-edge engineering practices, and the future of GenAI. Sign up here: https://lnkd.in/gWTKRzwc

    • No alternative text description for this image
  • Anyscale reposted this

    View profile for Robert Nishihara, graphic

    Co-founder at Anyscale (We are hiring!)

    This migration began 4 years ago. 😲 Not our typical Ray use case, but so impressive and it illustrates Ray's versatility. Also, it was worth it because they're saving over *$100 million annually*. Some fascinating excerpts. 2016: Amazon aims to remove all dependencies on Oracle. 2018: Shutdown last Oracle Data Warehouse cluster, 50PB of table data migrated from Oracle to S3. The tables store "deltas," that is, records to insert, update, or delete, which need to be merged at read time when used. The reads grow too expensive, so Apache Spark is used to merge these deltas offline to produce a read-optimized versions of the tables. 2019: The data has grown from petabyte scale to exabyte scale. The current system needs constant tuning and optimization to handle the scale. 2020: The team completes a PoC using Ray for this workload, demonstrating the ability to handle "12X larger datasets than Apache Spark, improve cost efficiency by 91%, and process 13X more data per hour." 2021: The team settled on an overall architecture and shared early results at the Ray Summit. 2022: More testing of Ray to expose any issues when handling exabyte-scale production data. The main problems were around the management of Amazon EC2 instances at scale (poor resource utilization and slow cluster start times) and out-of-memory errors. Late 2022: The migration begins in earnest beginning with the largest ~1% of tables (which accounted for ~40% of the cost and the vast majority of job failures). 2023: Most issues fixed. Began moving to fully automated shadow compaction on Ray. Whenever new inserts / updates / deletes arrived in a table to be compacted, both Spark and Ray would kick off the same compaction job to verify the benefits and correctness (temporarily increasing the overall cost of compaction before lowering it). 2024 Q1: Ray compacted 1.5 exabytes of Apache Parquet data from S3 using 10,000 years of CPU compute time. Today: Reading over 20 petabytes of data / day across 1600 Ray jobs / day. Ray has maintained a 100% on-time delivery rate of newly compacted data to table subscribers. This is being done with 82% better cost efficiency! This means annual savings of over $120 million / year. https://lnkd.in/g-pJhFei

    Amazon’s Exabyte-Scale Migration from Apache Spark to Ray on Amazon EC2 | Amazon Web Services

    Amazon’s Exabyte-Scale Migration from Apache Spark to Ray on Amazon EC2 | Amazon Web Services

    aws.amazon.com

  • View organization page for Anyscale, graphic

    26,329 followers

    🚀 Have you seen the Anyscale newsletter? Get the scoop on new features, products, & webinars. What's new in July: 🟢Ray Summit 2024: Join us September 30 - October 2, with speakers from OpenAI and Meta. 🟢Community Spotlight: See how Pinterest, Dreamfold, and SewerAI are leveraging Ray and Anyscale. 🟢Product Updates: Explore new features like cost-saving replica compaction, new user interface, and enhanced elastic training. And much more! See how companies are tackling AI challenges with Anyscale & Ray, plus stay up-to-date with all things AI 🙌 Read more here: https://lnkd.in/g5gYc7K9

    • No alternative text description for this image
  • View organization page for Anyscale, graphic

    26,329 followers

    💥 Another feature update💥 Excited to announce Anyscale Job Queues Job Queues deliver improved utilization and simplified cluster management by running multiple concurrent jobs on a single cluster. • Users submit jobs to a specified queue, Anyscale automatically prioritizes and schedules them. • Jobs are executed based on their queue position, with limits on concurrent jobs per cluster. • Jobs run to completion, including retries, repeating until all jobs in the queue are completed or errored. Read full details here: https://lnkd.in/g3VfuApt And watch how it works below:

Similar pages

Browse jobs

Funding