From a "simple" finetuning to your own Mixture of Expert model using opensource models. Nowadays training from scratch an LLM is a so huge effort also for very big company. Starting from pre-trained models to create your own model is no more a way for resourceless companies, but a often a must starting point.
- Lora - Quantization and QLora - Injecting embeddings model into Lora to manage multiple Lora adapters. - Mixing models - Creating your MoE (Mixture of experts) model using several finetuned (Your own) models
Winner of three AI awards, I’ve been working in AI and machine learning for 25 years, designing and developing AI and computer graphic algorithms.I’m very passionate about AI, focusing on Audio, Image and Natural Language Processing, and predictive analysis as well.I received... Read More →
Anil Inamdar, Instaclustr by NetApp, Head of Data Solutions
For dev teams building and training their own AI models, ending up with AI solutions plagued by hallucinations and reliability issues is a (rightfully) huge concern. The good news: vector databases make generative AI considerably more reliable and less prone to hallucinations. The even better news: a number of *100% free and open source* vector databases are especially great options for supporting AI workloads.
The good news keeps coming for teams considering an open source vector database path: it isn’t necessary to invest in implementing new or exotic or proprietary or specialized data-layer solutions to harness vector databases. Many enterprises will find that their existing infrastructure can already support AI workloads (while continuing to provide the familiar data availability, scalability, and performance they already know they can trust). In particular, PostgreSQL (with the pgvector extension), OpenSearch, and Apache Cassandra 5.0 (with its new native vector indexing) are three completely open source technologies—no proprietary or open core solutions needed—that tick all the boxes for meeting enterprises’ AI workloads requirements.
Attendees of this Dev Innovation Summit session will learn how open source vector databases utilize vector embeddings to enable more accurate LLMs, and how to strategically approach and implement retrieval augmented generation (RAG) processes. Attendees will also come away with a clear understanding of the advantages offered by PostgreSQL with pgvector, OpenSearch and Cassandra 5.0 as particularly mature open source vector database strategies ready to jumpstart your organization’s competitive AI capabilities.
Anil Inamdar is the VP & Head of Data Solutions at Instaclustr by NetApp. Anil has 20+ years of experience in data and analytics roles. Joining Instaclustr in 2019, he works with organizations to drive successful data-centric digital transformations via the right cultural, operational... Read More →
Ankit Jain, Aviator Technologies, Co-founder & CEO
In this talk we will delve into the nuanced world of software engineering, where the emphasis on metrics can often overshadow the crucial aspect of developer experience. We will explore how an over-reliance on metrics can hinder innovation, cause unexpected behaviors, and ultimately erode team morale.
Discover alternative approaches to scaling developer experience that prioritize human-centric methodologies over mere data points. We will also dive into practical strategies to use automation and ownership to scale developer experience.
Ankit is a cofounder and CEO of dev-productivity startup Aviator Technologies and also leads the ex-Google alumni network (Xoogler.co). Previously he led engineering teams at Sunshine, Homejoy and Shippo. Prior to that, Ankit was also an EIR at Unshackled Ventures and an engineer... Read More →
Gaganjot Kaur Kang, Sony PlayStation, Senior Software Engineer
This session is designed to provide a comprehensive framework for developing real-time data analytics and data processing pipelines and to explore the cloud-based technologies that facilitate this process like AWS Timestream, and AWS S3. The agenda includes:
- An overview of the general architecture of real-time event processing systems - The role and importance of technologies such as Apache Kafka, Apache Flink, AWS Timestream, AWS S3, and others in this architecture - Addressing challenges in data processing, including state management, event-time handling, and job monitoring - Real-world applications, such as metric aggregation over a time window and real-time computation of product prices and discounts
Venkatesh Kunchenapalli, Senior Devops and Infrastructure Engineer
In the rapidly evolving landscape of software development, the pressure on development teams to deliver high-quality applications swiftly is ever-increasing. This scenario often burdens developers with undifferentiated tasks, hindering their productivity and focus on core application development. Platform engineering, facilitated by Internal Developer Portals (IDP), presents a strategic approach to alleviating these challenges by automating infrastructure management and providing pre-configured tools and environments. Venkatesh Kunchenapalli's experience at Wipro, USA, has demonstrated significant enhancements in developer efficiency through the adoption of IDPs. This paper discusses the implementation of a self-service platform that automatically provisions GitHub repositories, CI/CD pipelines, and cloud infrastructure using templates tailored for various project types such as Java, Node.js, and Python. The automated setup includes comprehensive workflows for Terraform, Docker, and Kubernetes, ensuring environments are ready for immediate use. Key findings from our internal evaluations reveal a substantial improvement in productivity, with a 40-50% reduction in manual hours required for project setups, directly translating to an increase in the speed of application delivery. For instance, web application development saw man-hours reduced from 200 to 120, achieving a 40% productivity gain. Similarly, API service projects noted a 50% reduction, from 160 to 80 man-hours. Further, the introduction of IDPs has led to a notable increase in developer satisfaction, with survey results showing an improvement from 60 to 85 out of 100, marking a 42% increase in developer contentment post-implementation. These enhancements are supported by a robust support system that addresses issues through data-driven insights and proactive template updates, ensuring the IDP remains effective and relevant. This paper underscores the critical role of platform engineering in modern software development by demonstrating how it can significantly enhance developer experience and operational efficiency, thereby fostering faster and more effective application development cycles.
Venkatesh Kunchenapalli is a seasoned DevOps Engineer with over 15 years of extensive experience in various technical roles, including Site Reliability Engineer and Build and Release Engineer. He holds a Master of Science in Computer Engineering from California State University, Long... Read More →
In this talk, we share the results of an in-depth analysis of data gathered from over 1 billion open source package downloads across more than 2000 diverse projects on Scarf. Our findings offer valuable insights into user behaviors and interactions with open source software, making it essential for maintainers, founders, and executives in open source companies. During the presentation, we delve deep into our data, uncovering the best practices employed by successful open source projects. We explore a wide array of topics, including various download formats, packaging systems, regional download trends, and user-favored documentation types. Additionally, we discuss the impact of community engagement and how maintainers can harness their user base to boost project adoption and drive business growth. Attendees can expect to leave this talk equipped with actionable insights and best practices to optimize their open source projects and thrive in the competitive landscape of open source software.
Avi Press is a developer tool author, and functional programming language enthusiast, serving as a founder & CEO of Scarf. Avi loves thinking about and discussing how people can solve problems by more effectively sharing data, and how that applies to building a sustainable open-source... Read More →
From a "simple" finetuning to your own Mixture of Expert model using opensource models. Nowadays training from scratch an LLM is a so huge effort also for very big company. Starting from pre-trained models to create your own model is no more a way for resourceless companies, but a often a must starting point.
- Lora - Quantization and QLora - Injecting embeddings model into Lora to manage multiple Lora adapters. - Mixing models - Creating your MoE (Mixture of experts) model using several finetuned (Your own) models
Winner of three AI awards, I’ve been working in AI and machine learning for 25 years, designing and developing AI and computer graphic algorithms.I’m very passionate about AI, focusing on Audio, Image and Natural Language Processing, and predictive analysis as well.I received... Read More →
Anil Inamdar, Instaclustr by NetApp, Head of Data Solutions
For dev teams building and training their own AI models, ending up with AI solutions plagued by hallucinations and reliability issues is a (rightfully) huge concern. The good news: vector databases make generative AI considerably more reliable and less prone to hallucinations. The even better news: a number of *100% free and open source* vector databases are especially great options for supporting AI workloads.
The good news keeps coming for teams considering an open source vector database path: it isn’t necessary to invest in implementing new or exotic or proprietary or specialized data-layer solutions to harness vector databases. Many enterprises will find that their existing infrastructure can already support AI workloads (while continuing to provide the familiar data availability, scalability, and performance they already know they can trust). In particular, PostgreSQL (with the pgvector extension), OpenSearch, and Apache Cassandra 5.0 (with its new native vector indexing) are three completely open source technologies—no proprietary or open core solutions needed—that tick all the boxes for meeting enterprises’ AI workloads requirements.
Attendees of this Dev Innovation Summit session will learn how open source vector databases utilize vector embeddings to enable more accurate LLMs, and how to strategically approach and implement retrieval augmented generation (RAG) processes. Attendees will also come away with a clear understanding of the advantages offered by PostgreSQL with pgvector, OpenSearch and Cassandra 5.0 as particularly mature open source vector database strategies ready to jumpstart your organization’s competitive AI capabilities.
Anil Inamdar is the VP & Head of Data Solutions at Instaclustr by NetApp. Anil has 20+ years of experience in data and analytics roles. Joining Instaclustr in 2019, he works with organizations to drive successful data-centric digital transformations via the right cultural, operational... Read More →
Ankit Jain, Aviator Technologies, Co-founder & CEO
In this talk we will delve into the nuanced world of software engineering, where the emphasis on metrics can often overshadow the crucial aspect of developer experience. We will explore how an over-reliance on metrics can hinder innovation, cause unexpected behaviors, and ultimately erode team morale.
Discover alternative approaches to scaling developer experience that prioritize human-centric methodologies over mere data points. We will also dive into practical strategies to use automation and ownership to scale developer experience.
Ankit is a cofounder and CEO of dev-productivity startup Aviator Technologies and also leads the ex-Google alumni network (Xoogler.co). Previously he led engineering teams at Sunshine, Homejoy and Shippo. Prior to that, Ankit was also an EIR at Unshackled Ventures and an engineer... Read More →
Gaganjot Kaur Kang, Sony PlayStation, Senior Software Engineer
This session is designed to provide a comprehensive framework for developing real-time data analytics and data processing pipelines and to explore the cloud-based technologies that facilitate this process like AWS Timestream, and AWS S3. The agenda includes:
- An overview of the general architecture of real-time event processing systems - The role and importance of technologies such as Apache Kafka, Apache Flink, AWS Timestream, AWS S3, and others in this architecture - Addressing challenges in data processing, including state management, event-time handling, and job monitoring - Real-world applications, such as metric aggregation over a time window and real-time computation of product prices and discounts
Venkatesh Kunchenapalli, Senior Devops and Infrastructure Engineer
In the rapidly evolving landscape of software development, the pressure on development teams to deliver high-quality applications swiftly is ever-increasing. This scenario often burdens developers with undifferentiated tasks, hindering their productivity and focus on core application development. Platform engineering, facilitated by Internal Developer Portals (IDP), presents a strategic approach to alleviating these challenges by automating infrastructure management and providing pre-configured tools and environments. Venkatesh Kunchenapalli's experience at Wipro, USA, has demonstrated significant enhancements in developer efficiency through the adoption of IDPs. This paper discusses the implementation of a self-service platform that automatically provisions GitHub repositories, CI/CD pipelines, and cloud infrastructure using templates tailored for various project types such as Java, Node.js, and Python. The automated setup includes comprehensive workflows for Terraform, Docker, and Kubernetes, ensuring environments are ready for immediate use. Key findings from our internal evaluations reveal a substantial improvement in productivity, with a 40-50% reduction in manual hours required for project setups, directly translating to an increase in the speed of application delivery. For instance, web application development saw man-hours reduced from 200 to 120, achieving a 40% productivity gain. Similarly, API service projects noted a 50% reduction, from 160 to 80 man-hours. Further, the introduction of IDPs has led to a notable increase in developer satisfaction, with survey results showing an improvement from 60 to 85 out of 100, marking a 42% increase in developer contentment post-implementation. These enhancements are supported by a robust support system that addresses issues through data-driven insights and proactive template updates, ensuring the IDP remains effective and relevant. This paper underscores the critical role of platform engineering in modern software development by demonstrating how it can significantly enhance developer experience and operational efficiency, thereby fostering faster and more effective application development cycles.
Venkatesh Kunchenapalli is a seasoned DevOps Engineer with over 15 years of extensive experience in various technical roles, including Site Reliability Engineer and Build and Release Engineer. He holds a Master of Science in Computer Engineering from California State University, Long... Read More →
Alain Chautard, Angular Training, Expert Web Consultant
We will test a web application from scratch using Cypress. Step by step, we will learn about Cypress features, implement those in our tests. We will be testing various aspects of the application (navigation, clicks, user input), and see how to debug and improve our tests, make then more readable, and even how to mock the server-side in order to test more scenarios on the front-end.
Alain is a Google Developer Expert in Angular and Maps platform. He started working with Angular JS in 2011.Since then, he has worked with all Angular versions daily, both as a developer and as a technical trainer. He runs the Angular certification program for Angular Training.He... Read More →