Data Software Engineering Daily

Sinopse

Databases and data engineering episodes of Software Engineering Daily

Episódios

  • COVID Modeling with Josh Wills and Sam Shah

    COVID Modeling with Josh Wills and Sam Shah

    20/10/2020 Duração: 52min

    Predicting the spread of COVID-19 is not easy. The best methods we have available require us to extrapolate trends from a large volume of data, and this requires the construction of large-scale models. Because of the expertise needed for developing these models, Silicon Valley engineers were brought in to help develop a maintainable model. Two The post COVID Modeling with Josh Wills and Sam Shah appeared first on Software Engineering Daily.

  • Basedash: Low Code Database Editor with Max Musing

    Basedash: Low Code Database Editor with Max Musing

    12/10/2020 Duração: 35min

    Databases are the source of truth for every company. Editing the data in the database normally requires writing a query in SQL or a domain specific querying language–languages that are only accessible to engineers and highly technical people.  BaseDash is a tool for interfacing with a database without requiring the usage of a query language. The post Basedash: Low Code Database Editor with Max Musing appeared first on Software Engineering Daily.

  • Pachyderm Engineering with Joe Doliner

    Pachyderm Engineering with Joe Doliner

    29/09/2020 Duração: 52min

    Pachyderm is a system for data version control. Code has been version controlled for many years, but not data. In previous episodes with Joe Doliner, we explored the evolution of Pachyderm. In today’s show, we talk about the state of the company in 2020, as well as Pachyderm Hub, and end-to-end machine learning and data The post Pachyderm Engineering with Joe Doliner appeared first on Software Engineering Daily.

  • Salesforce Ecosystem with Kevin Poorman

    Salesforce Ecosystem with Kevin Poorman

    24/09/2020 Duração: 53min

    Salesforce is a platform with a large number of developers, ISVs, and companies built on top of it. There is a thriving ecosystem of applications built and managed around Salesforce, leading to an important set of relationships and integration points between Salesforce and the other entities involved with the company. Kevin Poorman works at Salesforce The post Salesforce Ecosystem with Kevin Poorman appeared first on Software Engineering Daily.

  • Twitter Search with Nico Tonozzi

    Twitter Search with Nico Tonozzi

    23/09/2020 Duração: 44min

    Twitter is a social media platform with billions of objects: people, tweets, words, events, and other entities. The high volume of information that gets created on Twitter everyday leads to a complex engineering problem for the developers building the Twitter search index. Nico Tonozzi is an engineer at Twitter. He joins the show to talk The post Twitter Search with Nico Tonozzi appeared first on Software Engineering Daily.

  • Robinhood Engineering with Jaren Glover

    Robinhood Engineering with Jaren Glover

    22/09/2020 Duração: 50min

    Robinhood is a platform for buying and selling stocks and cryptocurrencies. Robinhood is complex, fast-moving, and financial, and together these things require high quality engineering in distributed systems, observability, and data infrastructure. Jaren Glover is an engineer at Robinhood, and he joins the show to talk about the problem space within Robinhood, as well as The post Robinhood Engineering with Jaren Glover appeared first on Software Engineering Daily.

  • Superhuman with Rahul Vohra

    Superhuman with Rahul Vohra

    16/09/2020 Duração: 59min

    The most popular email client is Gmail, the web-based email client from Google. Gmail is dominant, but that dominance has come at a price, namely speed. Gmail caters to the lowest common denominator, serving a large ecosystem of use cases and plugins. This makes for a slow overall performance. Superhuman is an email client built The post Superhuman with Rahul Vohra appeared first on Software Engineering Daily.

  • Internet Archive Book Scanning with Davide Semenzin

    Internet Archive Book Scanning with Davide Semenzin

    15/09/2020 Duração: 51min

    The Internet Archive collects historical records of the Internet. The Wayback Machine is one tool from the Internet Archive which you may be familiar with. One project you may be unfamiliar with is book scanning. Internet Archive scans high volumes of books in order to digitize them. In today’s episode, Davide Semenzin joins the show The post Internet Archive Book Scanning with Davide Semenzin appeared first on Software Engineering Daily.

  • UnifyID: Biometric Authentication with John Whaley

    UnifyID: Biometric Authentication with John Whaley

    14/09/2020 Duração: 53min

    Biometric authentication uses signals from a human’s unique biology to verify identity. Forms of biometric authentication include fingerprints, eye patterns, and the way a person walks, otherwise known as gait. UnifyID is a company that builds systems for biometric authentication. John Whaley is the CEO of UnifyID, and he joins the show to talk through The post UnifyID: Biometric Authentication with John Whaley appeared first on Software Engineering Daily.

  • API Change Management with Aidan Cunniffe

    API Change Management with Aidan Cunniffe

    02/09/2020 Duração: 42min

    APIs within a company change all the time. Every service owner has an API to manage, and those APIs have upstream and downstream connections. APIs need to be tested for integration points as well as for their “contract”, the agreement between an API owner and the consumers of that API. Aidan Cuniffe is the founder The post API Change Management with Aidan Cunniffe appeared first on Software Engineering Daily.

  • Data Version Control with Dmitry Petrov

    Data Version Control with Dmitry Petrov

    24/08/2020 Duração: 54min

    Code is version controlled through git, the version control system originally built to manage the Linux codebase. For decades, software has been developed using git for version control. More recently, data engineering has become an unavoidable facet of software development. It is reasonable to ask–why are we not version controlling our data? Dmitry Petrov is The post Data Version Control with Dmitry Petrov appeared first on Software Engineering Daily.

  • Ray Applications with Richard Liaw

    Ray Applications with Richard Liaw

    24/07/2020 Duração: 54min

    Ray is a general purpose distributed computing framework. At a low level, Ray provides fault-tolerant primitives that support applications running across multiple processors. At a higher level, Ray supports scalable reinforcement learning, including the common problem of hyperparameter tuning. In a previous episode, we explored the primitives of Ray as well as Anyscale, the business The post Ray Applications with Richard Liaw appeared first on Software Engineering Daily.

  • Modin: Pandas Scalability with Devin Petersohn

    Modin: Pandas Scalability with Devin Petersohn

    23/07/2020 Duração: 58min

    Pandas is a Python data analysis library, and an essential tool in data science. Pandas allows users to load large quantities of data into a data structure called a dataframe, over which the user can call mathematical operations. When the data fits entirely into memory this works well, but sometimes there is too much data The post Modin: Pandas Scalability with Devin Petersohn appeared first on Software Engineering Daily.

  • Sourcegraph: Code Search and Intelligence with Beyang Liu

    Sourcegraph: Code Search and Intelligence with Beyang Liu

    22/07/2020 Duração: 59min

    A large codebase cannot be searched with naive indexing algorithms. In order to search through a codebase the size of Uber’s it is necessary to build a much more sophisticated indexing system than simple pure text search. Sourcegraph is a system for universal code search. It allows developers to more easily onboard to a new The post Sourcegraph: Code Search and Intelligence with Beyang Liu appeared first on Software Engineering Daily.

  • ADP Engineering with Tim Halbur

    ADP Engineering with Tim Halbur

    17/07/2020 Duração: 55min

    ADP has been around for more than 70 years, fulfilling payroll and other human resources services. Payroll processing is a complex business, involving the movement of money in accordance with regulatory and legal strictures.  From an engineering point of view, ADP has decades of software behind it, and a bright future of a platform company The post ADP Engineering with Tim Halbur appeared first on Software Engineering Daily.

  • Chronosphere: Scalable Metrics Database with Rob Skillington

    Chronosphere: Scalable Metrics Database with Rob Skillington

    09/07/2020 Duração: 41min

    M3 is a scalable metrics database originally built to host Uber’s rapidly growing data storage from Prometheus. When Rob Skillington was at Uber, he helped design, implement, and deploy M3. Since leaving Uber, he has co-founded a company around a hosted version of M3 called Chronosphere. If you have access to a scalable metrics database, The post Chronosphere: Scalable Metrics Database with Rob Skillington appeared first on Software Engineering Daily.

  • DynamoDB with Alex DeBrie

    DynamoDB with Alex DeBrie

    02/07/2020 Duração: 01h01min

    DynamoDB is a managed NoSQL database service from AWS. It is widely used as a transactional database to fulfill key-value and wide-column data models. In a previous show with Rick Houlihan, we explored how to build a data model and optimize the query patterns for a NoSQL database.  Today’s show is about DynamoDB specifically: partitioning, The post DynamoDB with Alex DeBrie appeared first on Software Engineering Daily.

  • Snowplow Analytics: Data Collection Platform with Alex Dean

    Snowplow Analytics: Data Collection Platform with Alex Dean

    01/07/2020 Duração: 57min

    As a user browses a webpage, that browser session generates events that need to be recorded, validated, enriched, and stored. This data is sometimes called customer data infrastructure, or CDI. This data requires a full stack of different tools: a system on the frontend to collect the data, middleware to transport the data, and backend The post Snowplow Analytics: Data Collection Platform with Alex Dean appeared first on Software Engineering Daily.

  • Postman: API Development with Abhinav Asthana

    Postman: API Development with Abhinav Asthana

    30/06/2020 Duração: 55min

    A software company manages and interacts with hundreds of APIs. These APIs require testing, performance analysis, authorization management, and release management. In a word, APIs require collaboration. Postman is a system for API collaboration. It allows users to test APIs with collections of requests, monitor the API responses, and visualize the query results. Users of The post Postman: API Development with Abhinav Asthana appeared first on Software Engineering Daily.

  • Data Intensive Applications with Martin Kleppman (Summer Break Repeat)

    Data Intensive Applications with Martin Kleppman (Summer Break Repeat)

    23/06/2020 Duração: 01h04min

    Originally published May 2, 2017. We are taking a few weeks off. We’ll be back soon with new episodes. A new programmer learns to build applications using data structures like a queue, a cache, or a database. Modern cloud applications are built using more sophisticated tools like Redis, Kafka, or Amazon S3. These tools do The post Data Intensive Applications with Martin Kleppman (Summer Break Repeat) appeared first on Software Engineering Daily.

página 1 de 5

Informações: