The G-Research Open-Source Software team contributes to a variety of projects in the open-source ecosystem, with a focus on three fundamental and overlapping areas of research:
- Data science and machine learning tools
- The infrastructure to support those tools
- Security across those tools and infrastructure
Armada is an application to achieve high throughput of run-to-completion jobs on multiple Kubernetes clusters.
It stores queues of pod specifications for users and projects, and creates those pods once resources become available in one of the connected Kubernetes clusters.
ILGPU is a JIT (just-in-time) compiler for high-performance GPU programs written in .NET-based languages.
ILGPU is written entirely in C# without any native dependencies. It offers the flexibility and convenience of C++ AMP on the one hand and the high performance of CUDA programs on the other.
FastTrackML is an ML experiment tracker focused on speed and scalability, with a platform built to handle large amounts of data and experiments.
It combines the benefits of the open-source ecosystem around MLflow with the flexible analysis tools of Aim’s user interface. With FastTrackML, data scientists can track and monitor their ML experiments, including metrics, parameters, artefacts and versioning.
G-Research Open-Source Program Office
The G-Research OSPO plays an active and public role in the larger open-source community, taking responsibility for maintaining public projects, supporting open-source foundations, and helping guide continued development of security and infrastructure projects.
In the realm of data science, we work with the Spark, Arrow, and related communities, and on open-source alternatives that improve GPU support for .NET Core. We also pursue general improvements to the F#, C# and .NET Core ecosystems, which help bolster G-Research’s engineering teams.
For infrastructure, OpenStack and Kubernetes are the clear areas of attention. These ecosystems are deep and wide, and our support touches many of the tools and systems that surround these two core technologies, such as Ansible, Prometheus, and Thanos, as well as other high-performance computing tools.
Our activities in the security space are far-reaching as well. They include taking over maintenance of Consul.NET, creating an aeron-vault plugin, improving the Ozone vault plugin, contributing to planbot-tfe, developing a security fix for the Jenkins pipelines plugin, and more.
We are proud to support the open-source ecosystem through our technical efforts, by sponsoring many individual developers, and by partnering with several open-source foundations and organisations: the CNCF & Linux Foundation, OpenInfra, Ursa Labs/Voltron Data, Real Logic (Aeron), and Julia Computing, to name a few.
Specific enquiries for the G-Research Open-Source Software team can be sent to: firstname.lastname@example.org.
Projects we contribute to
A unified analytics engine for large-scale data processing
Consul.NET is a .NET client library for the Consul HTTP API
G-Research F# code formatting guidelines
Repository for all G-Research-hosted Helm charts
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
A service for managing and provisioning Bare Metal servers.
A connector for Apache Spark and PySpark to Dgraph databases.
A library that provides useful extensions to Apache Spark and PySpark.
Jupyter magics and kernels for working with remote Spark clusters
Terraform Enterprise/Cloud Plan Bot
Type equalities for F#