PyTexas 2023

PyTexas is an annually held gathering for the Python community in Texas and is organized and run by community volunteers. The two-day event consists of talks about software development, data science, and community, centered on Python. Following is a summary of attending PyTexas 2023.

This was a single-track conference held at the Austin Public Library from April 1-2. There were about ~120 attendees from all over the world, from professionals in varying fields in industry and education. Most talks were thirty minutes long, with limited Q&A if time allowed. For more details on the presentations, see the Notes section below. You can also view the sessions here. Due to the state of the economy, there were no booths set up with sponsors, and networking was limited to speaking with attendees during breaks between speakers.

Shreya had the opportunity to give a lightning talk in which she discussed Azure Functions use cases and introduced the v2 programming model for Python. Speaking to the audience, she found that of the attendees, approximately thirty individuals had experience using serverless technologies, less than ten had experience using Azure Functions, and fewer had used Azure Functions in Python.

Overall, PyTexas shined a light on the character of the Python community. Though there were presumably over 100 people in the main conference room, it felt more intimate as attendees voiced their questions casually. Known qualities of the Python developer persona were evident, through the shared frustration of the language within the room and amazement at the solutions that were presented which significantly simplified development. For many attendees, it was their 5+ year attending the event, but there were also a fair share of attendees attending for the first time.

PyTexas would be a good conference for Microsoft to have a presence at in the future as the reach for a presentation would be to 100+ attendees, and there was space for education on Serverless technologies. Limited data showed that not a lot of Python developers at this event were Azure Functions users, but that they would be open to education. I hypothesize that smaller communities of language focused developers would be interested to learn more about Azure and due to presence from smaller organizations, it is possible that potential customers have a straightforward path to including Azure in their infrastructures.

Some takeaways for the team include talking on decorators which helped us determine what principles of decorators make sense to Python developers when considering the v2 programming model. Several discussions also centered on development principles like writing clean code, security principles of open-source repositories, and test driven development. These principles can likely be applied to improve the Python worker code base.

Notes

Walking the Line by Brandon Rhodes

This keynote was centered on how errors in code make Python developers feel. Brandon demonstrated how too much going right can lead to insecurity and doubt and consider if the code a developer is writing is really being tested. In contrast, too many errors were made to impact developers to feel frustration. It was also pointed out that many times the errors that came up are not conceptual but regarding smaller things that don't relate to the logic or even code being written.

Brandon also emphasized how the goal is to ensure that you don't make changes that lead to a plethora of more changes - rather that developers should find the root cause and go from there. Furthermore, he pointed out a classic mistake that people think simplifying code doesn't always optimize it. Brandon, as well as many speakers through the conference also discussed Test Driven Development as a key principle to shipping better, more reliable code.

Tale of Two Typings by Thomas Stephens

This was a talk about new processes that helped the team improve their practices. Thomas shared that "Legacy code is just code without tests” and went in depth regarding Type & Test driven development.

Hidden Gems in ML Flow that improve Model Credibility by Krishi Sharma

Krishi described the three gems that have changed her Machine Learning team's way of developing. She shared anecdotes which led her team to believe that these three gems have improved their AI model's credibility:

back up your data
commit frequently
version your data

Exploring Sociotechnical Security Concerns in Critical Open-Source Python Repositories by Jessy Ayala

During this talk, Jessy wanted to emphasize how many open-source repositories are lacking when it comes to security, and the importance of leveraging free GitHub extensions such as CodeQL to protect these repositories. He demonstrated some key issues by walking the audience through security risks on the vast number of open-source repositories for 3D printers, and the attacks that can be easily sent on these. Jessy pointed out that 99% of phishing attacks have human involvement and there is reason to be hesitant about security vulnerabilities for open-source repositories.

A Build Engineer in a Buildless Lang by Joshua Cannon

Joshua gave this talk from the perspective of caring about the developer team's experience when it comes to following engineering and auto refactoring when required. Sophisticated tools are generally required when having a mono repository due to the amount of code that is in one place. He also shared his preference for tools including Poetry and Black.

The idea of the talk was that Python development is a social thing. If it wasn’t, we would always write assembly. Part of the reason we use Python is so it can be easy for humans to read.

Build Your Own Chat GPT by Lizzie Siegle

Lizzie demonstrated how to use Flask to make a Web App using the Chat GPT API. This is something Gavin and I were brainstorming to do with Functions as well.

Unlocking the Power of Health Data: An Introduction to FHIR and Python by Aly Sivji

Aly shared some key issues with health care and technology that exists currently. He explained that getting health data across facilities is not very straightforward.

HL7 was formed in 1987. It was the first attempt to create a common data interface standards for rapidly growing EHR and health information technologies. EHR adoption increased, but does it mean better outcomes? A system to create more documentation for patients and easily share with members of the medical team is something that can lead to better outcomes.

Duck Typing, Metaclasses, & Recursion: Building a Generalized Deep Collection Type by Joseph Nix

In this talk, Joseph demonstrated how to build a deep collection that could handle a myriad of scenarios.

How to Build and Ship More Secure Python Apps with Sigstore by Lisa Tagliaferri

Lisa started the talk explaining how at the Gum wall, lots of people stick their gum there but not many people take gum off to chew it, and this same thinking should apply to the packages developers choose to use. Many packages have dependencies and issues that should be considered and though through - public packages should not be blindly trusted.

Note that PyPi and npm are both working on adding more security features, such as artifact signing through Sigstore.

HoloViz: Visualization and Interactive Dashboards in Python by Sophia Yang

Sophia discussed Panel which is 'an open-source Python library that lets you create custom interactive web apps and dashboards by connecting user-defined widgets to plots, images, tables, or text'.

Sophia also hosts a book club for AI/ML books that is held virtually on Discord. Join the book club by signing up [here](book club http://dsbookclub.github.io/).

Using Python for Digital Investigations by Tristan Lee

Tristan works at Bellingcat, a non-profit independent research collective that was founded in 2014. The organization participates in open-source research.

What is a digital investigation?

open-source research
research that uses information open to the public
investigator and consumer have equal access toinformation sources
increases transparency, accountability, reproductibility
many sources were previously only available to governments
there’s a lot of information on the internet

Most common open source materials

geospatial i.e. Google maps
media i.e. CNN
user-generated i.e. YouTube
databases i.e. Google eBooks
archived material

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

PyTexas 2023

Notes

Walking the Line by Brandon Rhodes

Tale of Two Typings by Thomas Stephens

Hidden Gems in ML Flow that improve Model Credibility by Krishi Sharma

Exploring Sociotechnical Security Concerns in Critical Open-Source Python Repositories by Jessy Ayala

A Build Engineer in a Buildless Lang by Joshua Cannon

Build Your Own Chat GPT by Lizzie Siegle

Unlocking the Power of Health Data: An Introduction to FHIR and Python by Aly Sivji

Duck Typing, Metaclasses, & Recursion: Building a Generalized Deep Collection Type by Joseph Nix

How to Build and Ship More Secure Python Apps with Sigstore by Lisa Tagliaferri

HoloViz: Visualization and Interactive Dashboards in Python by Sophia Yang

Using Python for Digital Investigations by Tristan Lee

Files

README.md

Latest commit

History

README.md

File metadata and controls

PyTexas 2023

Notes

Walking the Line by Brandon Rhodes

Tale of Two Typings by Thomas Stephens

Hidden Gems in ML Flow that improve Model Credibility by Krishi Sharma

Exploring Sociotechnical Security Concerns in Critical Open-Source Python Repositories by Jessy Ayala

A Build Engineer in a Buildless Lang by Joshua Cannon

Build Your Own Chat GPT by Lizzie Siegle

Unlocking the Power of Health Data: An Introduction to FHIR and Python by Aly Sivji

Duck Typing, Metaclasses, & Recursion: Building a Generalized Deep Collection Type by Joseph Nix

How to Build and Ship More Secure Python Apps with Sigstore by Lisa Tagliaferri

HoloViz: Visualization and Interactive Dashboards in Python by Sophia Yang

Using Python for Digital Investigations by Tristan Lee