Merge branch 'main' into spider-full
arjunattam committed May 21, 2024
2 parents 1b26a43 + 5aaf4cc commit 9dba00b
Showing 209 changed files with 9,147 additions and 1,913 deletions.
6 changes: 6 additions & 0 deletions .eslintrc.js
@@ -3,6 +3,12 @@
module.exports = {
ignorePatterns: ["apps/**", "packages/**"],
extends: ["@empiricalrun/eslint-config/library.js"],
ignorePatterns: [
// Ignore dotfiles
".*.js?(x)",
"node_modules/",
"examples/"
],
parser: "@typescript-eslint/parser",
parserOptions: {
project: true,
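
Note on the hunk above: the change summary reports 6 additions and 0 deletions, which suggests the merged `.eslintrc.js` may carry two `ignorePatterns` keys. In a JavaScript object literal, a later duplicate key silently replaces the earlier one, so the `apps/**` and `packages/**` patterns would no longer apply. The sketch below illustrates that language behavior only; the `config` object is a stand-in for illustration, not the repository's actual file.

```javascript
// Stand-in for the merged config (assumption: both keys survive the merge).
const config = {
  ignorePatterns: ["apps/**", "packages/**"],
  extends: ["@empiricalrun/eslint-config/library.js"],
  // A later duplicate key overrides the earlier one.
  ignorePatterns: [".*.js?(x)", "node_modules/", "examples/"],
};

// Only the second array remains on the object.
console.log(config.ignorePatterns);
```

If both sets of patterns are intended, merging them into a single array avoids the override. ESLint's `no-dupe-keys` rule flags this pattern.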
10 changes: 9 additions & 1 deletion .github/workflows/build.yml
@@ -36,9 +36,17 @@ jobs:

- name: Build
run: pnpm build

env:
POSTHOG_API_KEY: ${{ secrets.POSTHOG_API_KEY }}

- name: Test
run: pnpm test
env:
OPENAI_API_KEY: ${{ secrets.OPENAI_API_KEY }}
MISTRAL_API_KEY: ${{ secrets.MISTRAL_API_KEY }}
GOOGLE_API_KEY: ${{ secrets.GOOGLE_API_KEY }}
ANTHROPIC_API_KEY: ${{ secrets.ANTHROPIC_API_KEY }}
AZURE_OPENAI_API_KEY: ${{ secrets.AZURE_OPENAI_API_KEY }}
AZURE_OPENAI_RESOURCE_NAME: ${{ secrets.AZURE_OPENAI_RESOURCE_NAME }}
FIREWORKS_API_KEY: ${{ secrets.FIREWORKS_API_KEY }}
AZURE_OPENAI_BASE_URL: ${{ secrets.AZURE_OPENAI_BASE_URL }}
7 changes: 6 additions & 1 deletion .github/workflows/example.yml
@@ -24,11 +24,16 @@ jobs:
node-version: 20

- name: Run Empirical
run: npx @empiricalrun/cli run
run: npx empiricalrun
working-directory: ./examples/basic
id: empirical-run
env:
OPENAI_API_KEY: ${{ secrets.OPENAI_API_KEY }}
GOOGLE_API_KEY: ${{ secrets.GOOGLE_API_KEY }}
MISTRAL_API_KEY: ${{ secrets.MISTRAL_API_KEY }}
AZURE_OPENAI_API_KEY: ${{ secrets.AZURE_OPENAI_API_KEY }}
AZURE_OPENAI_RESOURCE_NAME: ${{ secrets.AZURE_OPENAI_RESOURCE_NAME }}
AZURE_OPENAI_BASE_URL: ${{ secrets.AZURE_OPENAI_BASE_URL }}

- name: Find Comment
uses: peter-evans/find-comment@v3
7 changes: 6 additions & 1 deletion .github/workflows/json-schema-upload.yml
@@ -1,12 +1,17 @@
name: Upload JSON Schema

on: workflow_dispatch
on:
workflow_run:
workflows: [Publish packages]
types:
- completed

jobs:
upload:
name: Upload JSON Schema
timeout-minutes: 8
runs-on: ubuntu-latest
if: ${{ github.event.workflow_run.conclusion == 'success' }}
steps:
- name: Checkout code
uses: actions/checkout@v4
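
Note on the hunk above: a `workflow_run` trigger with `types: [completed]` fires whenever the named workflow finishes, whether it succeeded or failed, which is why the job carries an `if` guard on `conclusion`. A minimal sketch of that guard's logic, assuming a simplified event payload; `shouldUpload` is a hypothetical helper for illustration, not part of the repository:

```javascript
// Hypothetical helper mirroring the job-level guard:
//   if: ${{ github.event.workflow_run.conclusion == 'success' }}
function shouldUpload(event) {
  // Run the upload only when the triggering workflow concluded successfully.
  return event?.workflow_run?.conclusion === "success";
}

// Simplified payloads (assumption: only the `conclusion` field matters here).
console.log(shouldUpload({ workflow_run: { conclusion: "success" } }));
console.log(shouldUpload({ workflow_run: { conclusion: "failure" } }));
```

Without the guard, a failed publish run would still trigger the schema upload.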
11 changes: 10 additions & 1 deletion .github/workflows/publish.yml
@@ -6,6 +6,7 @@ on:
jobs:
publish:
name: Publish packages
# Update json-schema-upload.yml if you are changing the name above
timeout-minutes: 8
runs-on: ubuntu-latest
steps:
@@ -32,12 +33,20 @@ jobs:

- name: Build
run: pnpm build

env:
POSTHOG_API_KEY: ${{ secrets.POSTHOG_API_KEY }}

- name: Test
run: pnpm test
env:
OPENAI_API_KEY: ${{ secrets.OPENAI_API_KEY }}
MISTRAL_API_KEY: ${{ secrets.MISTRAL_API_KEY }}
GOOGLE_API_KEY: ${{ secrets.GOOGLE_API_KEY }}
ANTHROPIC_API_KEY: ${{ secrets.ANTHROPIC_API_KEY }}
AZURE_OPENAI_API_KEY: ${{ secrets.AZURE_OPENAI_API_KEY }}
AZURE_OPENAI_RESOURCE_NAME: ${{ secrets.AZURE_OPENAI_RESOURCE_NAME }}
FIREWORKS_API_KEY: ${{ secrets.FIREWORKS_API_KEY }}
AZURE_OPENAI_BASE_URL: ${{ secrets.AZURE_OPENAI_BASE_URL }}

- name: Creating .npmrc
run: |
14 changes: 6 additions & 8 deletions README.md
@@ -1,10 +1,9 @@
# Empirical

[![npm](https://img.shields.io/npm/v/@empiricalrun/cli)](https://npmjs.com/package/@empiricalrun/cli)
[![npm](https://img.shields.io/npm/v/empiricalrun)](https://npmjs.com/package/empiricalrun)
[![Discord](https://img.shields.io/badge/discord-empirical.run-blue?logo=discord&logoColor=white&color=5d68e8)](https://discord.gg/NeR6jj8dw9)

Empirical is the fastest way to test different LLMs, prompts and other model configurations, across all the scenarios
that matter for your application.
Empirical is the fastest way to test your LLM app and iterate over prompts and other model configuration.

With Empirical, you can:

@@ -14,8 +13,7 @@ With Empirical, you can:
- Score your outputs with [scoring functions](https://docs.empirical.run/scoring/basics)
- Run [tests on CI/CD](https://docs.empirical.run/running-in-ci)

[Watch demo video](https://www.loom.com/share/5992fdf0edc443e282f44936e6c32672) | [See all docs](https://docs.empirical.run)

https://github.com/empirical-run/empirical/assets/284612/3309283c-ddad-4c4e-8175-08a32460686c

## Usage

@@ -39,7 +37,7 @@ Our test will succeed if the model outputs valid JSON.
1. Use the CLI to create a sample configuration file called `empiricalrc.json`.

```sh
npx @empiricalrun/cli init
npx empiricalrun init
cat empiricalrc.json
```

@@ -48,13 +46,13 @@ Our test will succeed if the model outputs valid JSON.
execution will cost $0.0026, based on the selected models.

```sh
npx @empiricalrun/cli run
npx empiricalrun
```

3. Use the `ui` command to open the reporter web app and see side-by-side results.

```sh
npx @empiricalrun/cli ui
npx empiricalrun ui
```

### Make it yours
58 changes: 58 additions & 0 deletions apps/web/CHANGELOG.md
@@ -1,5 +1,63 @@
# web

## 0.12.1

### Patch Changes

- d45cc70: feat: add support for js and ts config

## 0.12.0

### Minor Changes

- 740a844: feat: add support for merging inputs and add multi-turn chat example

## 0.11.0

### Minor Changes

- 973c9b2: feat: add support for tool calls

### Patch Changes

- 73e35c5: feat: support editing of dataset sample in UI

## 0.10.0

### Minor Changes

- 2b03d24: feat: dedicated edit UI for assistant instructions
- bde6bf0: feat: add support for configuring assistant tools
- 93e12e0: feat: add support for assistant tool calls

### Patch Changes

- d8842ef: fix: output and dataset sample alignment issues

## 0.9.0

### Minor Changes

- 2517c74: feat: add support for openai assistants

## 0.8.0

### Minor Changes

- ba98ebb: feat: add support for chat format prompt

## 0.7.3

### Patch Changes

- 65eec6a: feat: get aggregate latency and token numbers for the run

## 0.7.2

### Patch Changes

- 837528d: feat: add support for sharing results using empirical link

## 0.7.1

### Patch Changes