Gaia-ECS is a fast and easy-to-use ECS framework. Some of its current features and highlights are:
- very simple and safe API
- based on C++17 with no external dependencies (no STL strings or containers)
- compiles warning-free on all major compilers
- archetype / chunk-based storage for maximum iteration speed and easy code parallelization
- supports run-time defined tags
- supports entity relationships
- supports applications with large number of components and archetypes
- automatic component registration
- integrated compile-time serialization
- comes with multithreading support with job-dependencies
- ability to organize data as AoS or SoA on the component level with very few changes to your code
- compiles almost instantly
- stability and correctness secured by running thousands of unit tests and debug-mode asserts in the code
- thoroughly documented both public and internal code
- exists also as a single-header library so you can simply drop it into your project and start using it
- Introduction
- Usage
- Requirements
- Installation
- Repository structure
- Future
- Contributions
- License
Entity-Component-System (ECS) is a software architectural pattern based on organizing your code around data which follows the principle of composition over inheritance.
Instead of looking at "items" in your program as objects you normally know from the real world (car, house, human) you look at them as pieces of data necessary for you to achieve some result.
This way of thinking is more natural for machines than people but when used correctly it allows you to write faster code (on most architectures). What is most important, however, is it allows you to write code that is easier to maintain, expand and reason about.
For instance, when moving an object from point A to point B you do not care if it is a house or a car. You only care about its position. If you want to move it at some specific speed you will consider also the object's velocity. Nothing else is necessary.
Three building blocks of ECS are:
- Entity (id) - an index that uniquely identifies a group of components
- Component (data) - a piece of data (position, velocity, age)
- System (processor) - a place where your program's logic is implemented
Following the example given above, a vehicle could be any entity with Position and Velocity components. If it is a car we could attach the Driving component to it. If it is an airplane we would attach the Flying component.
The actual movement is handled by systems. Those that match the Flying component will implement the logic for flying. Systems matching the Driving component handle the land movement.
On the outside ECS is not much different from database engines. The main difference is it does not need to follow the ACID principle which allows it to be optimized beyond what an ordinary database engine could ever be both in terms of latency and absolute performance. At the cost of data safety.
The main strengths of an ECS done right could be summarized as:
- modularity and reusability - promotes modular and reusable code with self-contained components
- decoupling of logic - separates data from logic
- facilitation of system design - encourages a data-driven design approach for cleaner and more organized code
- flexibility - allows dynamic object behavior through composition of entities with specific components
- scalability - scales well with a predictable performance impact as the number of entities increases
- ease of maintenance - promotes less spaghetti code with a modular structure that is easier to debug
- adaptability - easily adapts to changing project requirements through component and system modifications
- performance - optimized for data locality, supports data- and thread-level parallelism almost out-of-the-box
Gaia-ECS is an archetype-based entity component system. This means that unique combinations of components are grouped into archetypes. Each archetype consists of chunks - blocks of memory holding your entities and components. You can think of them as database tables where components are columns and entities are rows.
Each chunk is either 8 or 16 KiB big depending on how much data can be effectively used by it. This size is chosen so that the entire chunk at its fullest can fit into the L1 cache on most CPUs. Chunk memory is preallocated in blocks organized into pages via the internal chunk allocator.
Components of the same type are grouped together and laid out linearly in memory. Thanks to all that data is organized in a cache-friendly way which most computer architectures like and actual heap allocations which are slow are reduced to a minimum.
The main benefits of archetype-based architecture are fast iteration and good memory layout by default. They are also easy to parallelize.
On the other hand, adding and removing components can be somewhat slower because it involves moving data around. In our case, this weakness is mitigated by building an archetype graph and having the ability to add and remove components in batches.
In this project, components are entities with the Component component attached to them. Treating components as entities allows for great design simplification and big features.
The entire project is implemented inside gaia namespace. It is further split into multiple sub-projects each with a separate namespaces.
- core - core functionality, use by all other parts of the code
- mem - memory-related operations, memory allocators
- cnt - data containers
- meta - reflection framework
- ser - serialization framework
- mt - multithreading framework
- ecs - the ECS part of the project
A special part of the project is external. It contains 3rd-party code such as C++17 implementation of std::span and a modified robin-hood hash-map.
#include <gaia.h>
The entire framework is placed in a namespace called gaia.
The ECS part of the library is found under gaia::ecs namespace.
In the code examples below we will assume we are inside gaia namespace.
Entity a unique "thing" in World. Creating an entity at runtime is as easy as calling World::add. Deleting is done via World::del. Once deleted, entity is no longer valid and if used with some APIs it is going to trigger a debug-mode assert. Verifying that an entity is valid can be done by calling World::valid.
ecs::World w;
// Create a new entity
ecs::Entity e = w.add();
// Check if "e" is valid. Returns true.
bool isValid = w.valid(e); // true
// Delete the entity
w.del(e);
// Check if "e" is still valid. Return false.
isValid = w.valid(e); // false
It is also possible to attach entities to entities. This effectively means you are able to create your own components/tags at runtime.
ecs::Entity player0 = w.add();
ecs::Entity player1 = w.add();
ecs::Entity player2 = w.add();
ecs::Entity teamA = w.add();
ecs::Entity teamB = w.add();
// Add player0 and player1 to teamA
w.add(player0, teamA);
w.add(player1, teamA);
// Add player2 to teamB
w.add(player2, teamB);
Each entity can be assigned a unique name. This is useful for debugging or entity lookup when entity id is not present for any reason.
ecs::World w;
ecs::Entity e = w.add();
// Entity "e" named "my_unique_name".
// The string is copied and stored internally.
w.name(e, "my_unique_name");
// If you know the length of the string, you can provide it as well
w.name(e, "my_unique_name", 14);
// Pointer to the string used as entity name for entity "e"
const char* name = w.name(e);
// Entity identified by the string returned.
// In this case, "e_by_name" and "e" are equal.
ecs::Entity e_by_name = w.get("my_unique_name");
// The name can be unset by setting it to nullptr
w.name(e, nullptr);
If you already have a dedicated string storage it would be a waste to duplicate the memory. In this case you can use World::name_raw to name entities. It does NOT copy and does NOT store the string internally which means you are responsible for its lifetime. The pointer should be stable. Otherwise, any time your storage tries to move the string to a different place you have to unset the name before it happens and set it anew after the move is done.
const char* pUserManagedString = ...;
w.name_raw(e, pUserManagedString);
// If you now the length, you can provide it
w.name_raw(e, pUserManagedString, userManagedStringLength);
// If the user-managed string pointer is not stable, you need to unset the name before the pointer changes location
w.name_raw(e, nullptr);
...
// ... the change of pointer happens
...
// After the user-managed string changed location and obtained a new pointer, you set the name again
w.name_raw(e, pUserManagedString);
Components can be created using World::add. This function returns a descriptor of the object which is created and stored in the component cache. Each component is assigned one entity to uniquely identify it. You do not have to do this yourself, the framework performs this operation automatically behind the scenes any time you call some compile-time API where you interact with your structure. However, you can use this API to quickly fetch the component's entity if necessary.
struct Position {
float x, y, z;
};
const ecs::ComponentCacheItem& cci = w.add<Position>();
ecs::Entity position_entity = cci.entity;
Because components are entities as well, adding them is very similar to what we have seen previously.
struct Position {
float x, y, z;
};
struct Velocity {
float x, y, z;
};
ecs::World w;
// Create an entity with Position and Velocity.
ecs::Entity e = w.add();
w.add<Position>(e, {0, 100, 0});
w.add<Velocity>(e, {0, 0, 1});
// Remove Velocity from the entity.
w.del<Velocity>(e);
This also means the code above could be rewritten as following:
// Create Position and Velocity entities
ecs::Entity position = w.add<Position>().entity;
ecs::Entity velocity = w.add<Velocity>().entity;
// Create an entity with Position and Velocity.
ecs::Entity e = w.add();
w.add(e, position, Position{0, 100, 0});
w.add(e, velocity, Velocity{0, 0, 1});
// Remove Velocity from the entity.
w.del(e, velocity);
Whether or not a certain component is associated with an entity can be checked in two different ways. Either via an instance of a World object or by the means of Iter which can be acquired when running queries.
// Check if entity e has Velocity (via world).
const bool hasVelocity = w.has<Velocity>(e);
// Check if entity wheel is attached to the car
const bool hasWheel = w.has(car, wheel);
...
// Check if entities hidden behind the iterator have Velocity (via iterator).
ecs::Query q = w.query().any<Position, Velocity>();
q.each([&](ecs::Iter& it) {
const bool hasPosition = it.has<Position>();
const bool hasVelocity = it.has<Velocity>();
...
});
Providing entities is supported as well.
auto p = w.add<Position>().entity;
auto v = w.add<Velocity>().entity;
// Check if entities hidden behind the iterator have Velocity (via iterator).
ecs::Query q = w.query().any(p).any(v);
q.each([&](ecs::Iter& it) {
const bool hasPosition = it.has(p);
const bool hasVelocity = it.has(v);
...
});
Adding an entity to entity means it becomes a part of a new archetype. Like mentioned previously, becoming a part of a new archetype means that all data associated with the entity needs to be moved to a new place. The more ids in the archetype the slower the move (empty components/tags are an exception because they do not carry any data). For this reason it is not advised to perform large number of separate additions / removals per frame.
Instead, when adding or removing multiple entities/components at once it is more efficient doing it via bulk operations. This way only one archetype movement is performed in total rather than one per added/removed entity.
ecs::World w;
// Create an entity with Position. This is one archetype movement.
ecs::Entity e = w.add();
w.add<Position>();
// Add and remove multiple components.
// This does one archetype movement rather than 6 compared to doing these operations separate.
w.build(e)
// add Velocity to entity e
.add<Velocity>()
// remove Position from entity e
.del<Position>()
// add Rotation to entity e
.add<Rotation>()
// add a bunch of other components to entity e
.add<Something1, Something2, Something3>();
It is also possible to manually commit all changes by calling EntityBuilder::commit. This is useful in scenarios where you have some branching and do not want to duplicate your code for both branches or simply need to add/remove components based on some complex logic.
ecs::EntityBuilder builder = w.build(e);
builder
.add<Velocity>()
.del<Position>()
.add<Rotation>();
if (some_condition) {
builder.add<Something1, Something2, Something3>();
}
builder.commit();
NOTE:
Once EntityBuilder::commit is called (either manually or internally when the builder's destructor is invoked) the contents of builder are returned to its default state.
// Change Velocity's value.
w.set<Velocity>(e) = {0, 0, 2};
// Same as above but the world version is not updated so nobody gets notified of this change.
w.sset<Velocity>(e) = {4, 2, 0};
When setting multiple component values at once it is more efficient doing it via chaining:
w.acc_mut(e)
// Change Velocity's value on entity "e"
.set<Velocity>({0, 0, 2})
// Change Position's value on entity "e"
.set<Position>({0, 100, 0})
// Change...
.set...;
Similar to EntityBuilder::build you can also use the setter object in scenarios with complex logic.
ecs::ComponentSetter setter = w.acc_mut(e);
setter.set<Velocity>({0, 0, 2});
if (some_condition)
setter.set<Position>({0, 100, 0});
setter.set<Something>({ ... }).set<Else>({ ... });
// You can also retrieve a reference to data (for AoS) or the data accessor (for SoA)
auto& vel = setter.mut<Velocity>();
auto& pos = setter.mut<Position>();
Components up to 8 bytes (including) are returned by value. Bigger components are returned by const reference.
// Read Velocity's value. As shown above Velocity is 12 bytes in size.
// Therefore, it is returned by const reference.
const auto& velRef = w.get<Velocity>(e);
// However, it is easy to store a copy.
auto velCopy = w.get<Velocity>(e);
Both read and write operations are also accessible via views. Check the iteration sections to see how.
A copy of another entity can be easily created.
// Create an entity with Position and Velocity.
ecs::Entity e = w.add();
w.add(e, position, Position{0, 100, 0});
w.add(e, velocity, Velocity{0, 0, 1});
// Make a copy of "e". Component values on the copied entity will match the source.
// Value of Position on "e2" will be {0, 100, 0}.
// Value of Velocity on "e2" will be {0, 0, 1}.
ecs::Entity e2 = w.copy(e);
Another way to create entities is by creating many of them at once. This is more performant than creating entities one by one.
// Create 1000 empty entities
w.add(1000);
w.add(1000, [](Entity newEntity) {
// Do something with the new entity
// ...
})
// Create an entity with Position and Velocity.
ecs::Entity e = w.add();
w.add(e, position, Position{0, 100, 0});
w.add(e, velocity, Velocity{0, 0, 1});
// Create 1000 more entities like "e".
// Their component values are not initialized to any particular value.
w.add_n(e, 1000);
w.add_n(e, 1000, [](Entity newEntity) {
// Do something with the new entity
// ...
});
// Create 1000 more entities like "e".
// Their component values are going to be the same as "e".
w.copy_n(e, 1000);
w.copy_n(e, 1000, [](Entity newEntity) {
// Do something with the new entity
// ...
});
Once all entities of given archetype are deleted (and as a result all chunks in the archetypes are empty), the archetype stays alive for another 127 ticks of ecs::World::update(). However, there might be cases where this behavior is insufficient. Maybe you want the archetype deleted faster, or you want to keep it around forever.
For instance, you might often end up deleting all entities of a given archetype only to create new ones seconds later. In this case, keeping the archetype around can have several performance benefits:
- no need to recreate the archetype
- no need to rematch queries with the archetype
ecs::World w;
ecs::Entity player0 = w.add(); // player0 belongs to archetype A
ecs::Entity teamA = w.add(); // teamA belongs to archetype A
// Player0 becomes a part of archetype B.
w.add(player0, teamA);
// Archetype B is never going to be deleted.
w.set_max_lifespan(player0, 0);
// Archetype B is going to be deleted after 20 ticks of ecs::World::update.
w.set_max_lifespan(player0, 20);
// Reset maximum lifespan of the archetype B belongs to.
w.set_max_lifespan(player0);
Note, if the entity used to change an archetype's lifespan moves to a new archetype, the later's lifespan is not updated.
ecs::World w;
ecs::Entity player0 = w.add(); // player0 belongs to archetype A
ecs::Entity teamA = w.add(); // teamA belongs to archetype A
// Player0 becomes a part of archetype B.
w.add(player0, teamA);
// Maximum lifespan of archetype B changed to 20.
w.set_max_lifespan(player0, 20);
// Player0 becomes a part of archetype A again. Lifespan of B is still 20, lifespan of A is default.
w.del(player0, team1);
In case you want to affect an archetype directly without abstracting it away you can retrieve it via the entity's container returned by World::fetch() function:
EntityContainer& ec = w.fetch(player0);
// Maximum lifespan of archetype the player0 entity belongs to changed to 50.
ec.pArchetype->set_max_lifespan(50);
For querying data you can use a Query. It can help you find all entities, components or chunks matching a list of conditions and constraints and iterate them or return them as an array. You can also use them to quickly check if any entities satisfying your requirements exist or calculate how many of them there are.
Every Query is cached internally. You likely use the same query multiple times in your program, often without noticing. Because of that, caching becomes useful as it avoids wasting memory and performance when finding matches.
Note, the first Query invocation is always slower than the subsequent ones because internals of the Query need to be initialized.
ecs::Query q = w.query();
q.all<Position>(); // Consider only entities with Position
// Fill the entities array with entities with a Position component.
cnt::darray<ecs::Entity> entities;
q.arr(entities);
// Fill the positions array with position data.
cnt::darray<Position> positions;
q.arr(positions);
// Calculate the number of entities satisfying the query
const auto numberOfMatches = q.count();
// Check if any entities satisfy the query.
// Possibly faster than count() because it stops on the first match.
const bool hasMatches = !q.empty();
More complex queries can be created by combining All, Any and None in any way you can imagine:
ecs::Query q = w.query();
// Take into account everything with Position and Velocity (mutable access for both)...
q.all<Position&, Velocity&>();
// ... at least Something or SomethingElse (immutable access for both)...
q.any<Something, SomethingElse>();
// ... and no Player component... (no access done for no())
q.not<Player>();
All Query operations can be chained and it is also possible to invoke various filters multiple times with unique components:
ecs::Query q = w.query();
// Take into account everything with Position (mutable access)...
q.all<Position&>()
// ... and at the same time everything with Velocity (mutable access)...
.all<Velocity&>()
// ... at least Something or SomethingElse (immutable access)...
.any<Something, SomethingElse>()
// ... and no Player component (no access)...
.no<Player>();
Queries can be defined using a low-level API (used internally).
ecs::Entity p = w.add<Position>().entity;
ecs::Entity v = w.add<Velocity>().entity;
ecs::Entity s = w.add<Something>().entity;
ecs::Entity se = w.add<SomethingElse>().entity;
ecs::Entity pl = w.add<Player>().entity;
ecs::Query q = w.query();
// Take into account everything with Position (mutable access)...
q.add({p, QueryOpKind::All, QueryAccess::Write})
// ... and at the same time everything with Velocity (mutable access)...
.add({v, QueryOpKind::All, QueryAccess::Write})
// ... at least Something or SomethingElse (immutable access)..
.add({s, QueryOpKind::Any, QueryAccess::Read})
.add({se, QueryOpKind::Any, QueryAccess::Read})
// ... and no Player component (no access)...
.add({pl, QueryOpKind::None, QueryAccess::None});
Building cache requires memory. Because of that, sometimes it comes handy having the ability to release this data. Calling myQuery.reset()
will remove any data allocated by the query. The next time the query is used to fetch results the cache is rebuilt.
q.reset();
If this is a cached query, even after resetting it it still remains in the query cache. To remove it from there all queries with the matching signature will need to be destroyed first:
ecs::Query q1 = w.query();
ecs::Query q2 = w.query();
q1.add<Position>();
q2.add<Position>();
(void)q1.count(); // do some operation that compiles the query and inserts it into the query cache
(void)q2.count(); // do some operation that compiles the query and inserts it into the query cache
q1 = w.query(); // First reference to cached query is destroyed.
q2 = w.query(); // Last reference to cache query is destroyed. The cache is cleared of queries with the given signature
Technically, any query could be reset by default initializing it, e.g. myQuery = {}
. This, however, puts the query into an invalid state. Only queries created via World::query have a valid state.
Another way to define queries is using the string notation. This allows you to define the entire query or its parts using a string composed of simple expressions. Any spaces in between modifiers and expressions are trimmed.
Supported modifiers:
- ; - separates expressions
- ? - query::any
- ! - query::none
- & - read-write access
- %e - entity value
- (rel,tgt) - relationship pair, a wildcard character in either rel or tgt is translated into All
// Some context for the example
struct Position {...};
struct Velocity {...};
struct RigidBody {...};
struct Fuel {...};
ecs::Entity player = w.add();
// Create the query from a string expression.
ecs::Query q = w.query()
.add("&Position; !Velocity; ?RigidBody; (Fuel,*); %e", player.value());
// It does not matter how we split the expressions. This query is the same as the above.
ecs::Query q1 = w.query()
.add("&Position; !Velocity;")
.add("?RigidBody; (Fuel,*)")
.add("%e", player.value());
// The queries above can be rewritten as following:
ecs::Query q2 = w.query()
.all<Position&>()
.no<Velocity>()
.any<RigidBody>()
.all(ecs::Pair(w.add<Fuel>().entity, All)>()
.all(player);
From the implementation standpoint, uncached queries are the same as ordinary queries in all but one aspect - they do not use the query cache internally. This means that two uncached queries using the same setup are going to evaluate matches separately. As a result, if there are duplicates, more memory and performance will be wasted.
On the other hand, if you design your queries carefully and they are all different, uncached queries are actually a bit faster to create and match. Creation is faster because there is no hash to compute for the query and matching is faster because no query cache lookups are involved.
Uncached queries are created via World::query< false >.
// This is a cached query
ecs::Query q1 = w.query<true>(). ...;
// This is a cached query, shorter version of the above
ecs::Query q2 = w.query(). ...;
// This is an uncached query
ecs::QueryUncached q3 = w.query<false>(). ...;
To process data from queries one uses the Query::each function. It accepts either a list of components or an iterator as its argument.
ecs::Query q = w.query();
// Take into account all entities with Position and Velocity...
q.all<Position&, Velocity>();
// ... but no Player component.
q.no<Player>();
q.each([&](Position& p, const Velocity& v) {
// Run the scope for each entity with Position, Velocity and no Player component
p.x += v.x * dt;
p.y += v.y * dt;
p.z += v.z * dt;
});
NOTE:
Iterating over components not present in the query is not supported and results in asserts and undefined behavior. This is done to prevent various logic errors which might sneak in otherwise.
Processing via an iterator gives you even more expressive power and also opens doors for new kinds of optimizations. Iter is an abstraction over underlying data structures and gives you access to their public API.
There are three types of iterators:
- Iter - iterates over enabled entities
- IterDisabled - iterates over disabled entities
- IterAll - iterates over all entities
ecs::Query q = w.query();
q.all<Position&, Velocity>();
q.each([](ecs::IterAll& it) {
auto p = it.view_mut<Position>(); // Read-write access to Position
auto v = it.view<Velocity>(); // Read-only access to Velocity
// Iterate over all enabled entities and update their x-axis position.
// GAIA_EACH(it) translates to: for (uint32_t i=0; i<it.size(); ++i)
GAIA_EACH(it) {
if (!it.enabled(i))
return;
p[i].x += 1.f;
}
// Iterate over all entities and update their position based on their velocity.
GAIA_EACH(it) {
p[i].x += v[i].x * dt;
p[i].y += v[i].y * dt;
p[i].z += v[i].z * dt;
}
});
Performance of views can be improved slightly by explicitly providing the index of the component in the query.
ecs::Query q = w.query();
q.any<Something>().all<Position&, Velocity>();
q.each([](ecs::IterAll& it) {
auto s = it.view<Something>(0); // Something is fhe first defined component in the query
auto p = it.view_mut<Position>(1); // Position is the second defined component in the query
auto v = it.view<Velocity>(2); // Velocity is the third defined component in the query
....
}
Query behavior can also be modified by setting constraints. By default, only enabled entities are taken into account. However, by changing constraints, we can filter disabled entities exclusively or make the query consider both enabled and disabled entities at the same time.
Disabling/enabling an entity is a special operation that marks it invisible to queries by default. Archetype of the entity is not changed afterwards so it can be considered fast.
ecs::Entity e1, e2;
// Create 2 entities with Position component
w.add(e1);
w.add(e2);
w.add<Position>(e1);
w.add<Position>(e2);
// Disable the first entity
w.enable(e1, false);
// Check if e1 is enabled
const bool is_e1_enabled = w.enabled(e1);
if (is_e1_enabled) { ... }
// Prepare out query
ecs::Query q = w.query().all<Position&>();
// Fills the array with only e2 because e1 is disabled.
cnt::darray<ecs::Entity> entities;
q.arr(entities);
// Fills the array with both e1 and e2.
q.arr(entities, ecs::Query::Constraint::AcceptAll);
// Fills the array with only e1 because e1 is disabled.
q.arr(entities, ecs::Query::Constraint::DisabledOnly);
q.each([](ecs::Iter& it) {
auto p = it.view_mut<Position>(); // Read-Write access to Position
// Iterates over enabled entities
GAIA_EACH(it) p[i] = {}; // reset the position of each enabled entity
});
q.each([](ecs::IterDisabled& it) {
auto p = it.view_mut<Position>(); // Read-Write access to Position
// Iterates over disabled entities
GAIA_EACH(it) p[i] = {}; // reset the position of each disabled entity
});
q.each([](ecs::IterAll& it) {
auto p = it.view_mut<Position>(); // Read-Write access to Position
// Iterates over all entities
GAIA_EACH(it) {
if (it.enabled(i)) {
p[i] = {}; // reset the position of each enabled entity
}
}
});
If you do not wish to fragment entities inside the chunk you can simply create a tag component and assign it to your entity. This will move the entity to a new archetype so it is a lot slower. However, because disabled entities are now clearly separated calling some query operations might be slightly faster (no need to check if the entity is disabled or not internally).
struct Disabled {};
...
e.add<Disabled>(); // disable entity
ecs::Query q = w.query().all<Position, Disabled>;
q.each([&](ecs::Iter& it){
// Processes all disabled entities
});
e.del<Disabled>(); // enable entity
Using changed we can make the iteration run only if particular components change. You can save quite a bit of performance using this technique.
ecs::Query q = w.query();
// Take into account all entities with Position and Velocity...
q.all<Position&, Velocity>();
// ... no Player component...
q.no<Player>();
// ... but only iterate when Velocity changes
q.changed<Velocity>();
q.each([&](Position& p, const Velocity& v) {
// This scope runs for each entity with Position, Velocity and no Player component
// but only when Velocity has changed.
p.x += v.x * dt;
p.y += v.y * dt;
p.z += v.z * dt;
});
NOTE:
If there are 100 Position components in the chunk and only one of them changes, the other 99 are considered changed as well. This chunk-wide behavior might seem counter-intuitive but it is in fact a performance optimization. The reason why this works is because it is easier to reason about a group of entities than checking each of them separately.
Grouping is a feature that allows you to assign an id to each archetype and group them together or filter them based on this id. Archetypes are sorted by their groupId in ascending order. If descending order is needed, you can change your groupIds (e.g. instead of 100 you use ecs::GroupIdMax - 100).
Grouping is best used with relationships. It can be triggered by calling group_by before the first call to each or other functions that build the query (count, empty, arr).
ecs::Entity eats = wld.add();
ecs::Entity carrot = wld.add();
ecs::Entity salad = wld.add();
ecs::Entity apple = wld.add();
ecs::Entity ents[6];
GAIA_FOR(6) ents[i] = wld.add();
{
// Add Position and ecs::Pair(eats, salad) to our entity
wld.build(ents[0]).add<Position>().add({eats, salad});
wld.build(ents[1]).add<Position>().add({eats, carrot});
wld.build(ents[2]).add<Position>().add({eats, apple});
wld.build(ents[3]).add<Position>().add({eats, apple}).add<Healthy>();
wld.build(ents[4]).add<Position>().add({eats, salad}).add<Healthy>();
wld.build(ents[5]).add<Position>().add({eats, carrot}).add<Healthy>();
}
// This query is going to group entities by what they eat.
auto qq = wld.query().all<Position>().group_by(eats);
// The query cache is going to contain following 6 archetypes in 3 groups as follows:
// - Eats:carrot:
// - Position, (Eats, carrot)
// - Position, (Eats, carrot), Healthy
// - Eats:salad:
// - Position, (Eats, salad)
// - Position, (Eats, salad), Healthy
// - Eats::apple:
// - Position, (Eats, apple)
// - Position, (Eats, apple), Healthy
q.each([&](ecs::Iter& it) {
auto ents = it.view<ecs::Entity>();
GAIA_EACH(it) {
GAIA_LOG_N("GrpId:%u, Entity:%u.%u", it.group_id(), ents[i].id(), ents[i].gen());
}
});
You can choose what group to iterate specifically by calling group_id prior to iteration.
// This query is going to iterate the following group of 2 archetypes:
// - Eats:salad:
// - Position, (Eats, salad)
// - Position, (Eats, salad), Healthy
q.group_id(salad).each([&](ecs::Iter& it) {
...
});
// This query is going to iterate the following group of 2 archetypes:
// - Eats:carrot:
// - Position, (Eats, carrot)
// - Position, (Eats, carrot), Healthy
q.group_id(carrot).each([&](ecs::Iter& it) {
...
});
Custom sorting function can be provided if needed.
ecs::GroupId my_group_sort_func(const ecs::World& world, const ecs::Archetype& archetype, ecs::Entity groupBy) {
if (archetype.pairs() > 0) {
auto ids = archetype.ids_view();
for (auto id: ids) {
if (!id.pair() || id.id() != groupBy.id())
continue;
// Consider the pair's target the groupId
return id.gen();
}
}
// No group
return 0;
}
q.group_by(eats, my_group_sort_func).each(...) { ... };
Queries can make use of (#multithreading). By default, all queries are handles by the thread that calls iterates the query. However, it is possible to execute them by multiple threads at once simply by providing the right ecs::QueryExecType parameter.
// Ordinary single-thread query (default)
q.each([](ecs::Iter& iter) { ... });
// Ordinary single-thread query (explicit)
q.each([](ecs::Iter& iter) { ... }, ecs::QueryExecType::Default);
// Multi-thread query, use any cores available
q.each([](ecs::Iter& iter) { ... }, ecs::QueryExecType::Parallel);
// Multi-thread query, use performance cores only
q.each([](ecs::Iter& iter) { ... }, ecs::QueryExecType::ParallelPerf);
// Multi-thread query, use efficiency cores only
q.each([](ecs::Iter& iter) { ... }, ecs::QueryExecType::ParallelEff);
Not only is multi-threaded execution possible, but you can also influence what kind of cores actually run your logic. Maybe you want to limit your systems's power consumption in which case you target only the efficiency cores. Or, if you want maximum performance, you can easily have all your system's cores participate.
NOTE:
Dependencies are currently not possible. However, this is just a temporary limitation of the query API. The low-level threadpool API supports this as demonstrated here.
Entity relationship is a feature that allows users to model simple relations, hierarchies or graphs in an ergonomic, easy and safe way. Each relationship is expressed as following: "source, (relation, target)". All three elements of a relationship are entities. We call the "(relation, target)" part a relationship pair.
Relationship pair is a special kind of entity where the id of the "relation" entity becomes the pair's id and the "target" entity's id becomes the pairs generation. The pair is created by calling ecs::Pair(relation, target) with two valid entities as its arguments.
Adding a relationship to any entity is as simple as adding any other entity.
ecs::World w;
ecs::Entity rabbit = w.add();
ecs::Entity hare = w.add();
ecs::Entity carrot = w.add();
ecs::Entity eats = w.add();
w.add(rabbit, ecs::Pair(eats, carrot));
w.add(hare, ecs::Pair(eats, carrot));
// You can brace-initialize the pair as well which is shorter.
// w.add(hare, {eats, carrot});
ecs::Query q = w.query().all(ecs::Pair(eats, carrot));
q.each([](ecs::Entity entity)) {
// Called for each entity implementing (eats, carrot) relationship.
// Triggers for rabbit and hare.
}
This by itself would not be much different from adding entities/component to entities. A similar result can be achieved by creating a "eats_carrot" tag and assigning it to "hare" and "rabbit". What sets relationships apart is the ability to use wildcards in queries.
There are three kinds of wildcard queries possible:
- ( X, * ) - X that does anything
- ( * , X ) - anything that does X
- ( * , * ) - anything that does anything (aka any relationship)
The "*" wildcard is expressed via All entity.
w.add(rabbit, ecs::Pair(eats, carrot));
w.add(hare, ecs::Pair(eats, carrot));
w.add(wolf, ecs::Pair(eats, rabbit));
ecs::Query q1 = w.query().all(ecs::Pair(eats, All));
q1.each([]()) {
// Called for each entity implementing (eats, *) relationship.
// This can be read as "entity that eats anything".
// Triggers for rabbit, hare and wolf.
}
ecs::Query q2 = w.query().all(ecs::Pair(All, eats));
q2.each([]()) {
// Called for each entity implementing (*, carrot) relationship.
// This can be read as "anything that has something with carrot".
// Triggers for rabbit and hare.
}
ecs::Query q3 = w.query().all(ecs::Pair(All, All));
q3.each([]()) {
// Called for each entity implementing (*, *) relationship.
// This can be read as "anything that does/has anything".
// Triggers for rabbit, hare and wolf.
}
Relationships can be ended by calling World::del (just like it is done for regular entities/components)..
// Rabbit no longer eats carrot
w.del(rabbit, ecs::Pair(eats, carrot));
Whether a relationship exists can be check via World::has (just like it is done for regular entities/components).
// Checks if rabbit eats carrot
w.has(rabbit, ecs::Pair(eats, carrot));
// Checks if rabbit eats anything
w.has(rabbit, ecs::Pair(eats, All));
A nice side-effect of relationships is they allow for multiple components/entities of the same kind be added to one entity.
// "eats" is added twice to the entity "rabbit"
w.add(rabbit, ecs::Pair(eats, carrot));
w.add(rabbit, ecs::Pair(eats, salad));
Pairs do not need to be formed from tag entities only. You can use components to build a pair which means they can store data, too! To determine the storage type of Pair(relation, target) the following logic is applied:
- if "relation" is non-empty, the storage type is rel.
- if "relation" is empty and "target" is non-empty, the storage type is "target".
struct Start{};
struct Position{ int x, y; };
...
ecs::Entity e = w.add();
// Add (Start, Position) from component entities.
ecs::Entity start_entity = w.add<Start>.entity;
ecs::Entity pos_entity = w.add<Position>.entity;
w.add(e, ecs::Pair(start_entity, pos_entity));
// Add (Start, Position) pair to entity e using a compile-time component pair.
w.add<ecs::pair<Start, Position>(e);
// Add (Start, Position) pair to entity e using a compile-time component pair
// and set its value. According the rules defined above, the storage type used
// for the pair is Position.
w.add<ecs::pair<Start, Position>(e, {10, 15});
// Create a query matching all (Start, Position) pairs using component entities
ecs::Query q0 = w.query().all( ecs::Pair(start_entity, pos_entity) );
// Create a query matching all (Start, Position) pairs using compile-time
ecs::Query q1 = w.query().all< ecs::pair<Start, Position> >();
Targets of a relationship can be retrieved via World::target and World::targets.
w.add(rabbit, ecs::Pair(eats, carrot));
w.add(rabbit, ecs::Pair(eats, salad));
// Returns whatever the first found target of the rabbit(eats, *) relationship is.
// In our case it is the carrot entity because it was created before salad.
ecs::Entity first_target = w.target(rabbit, eats);
// Appends carrot and salad entities to the array
cnt::sarr_ext<ecs::Entity, 32> what_rabbit_eats;
w.target(rabbit, eats, what_rabbit_eats);
Relations of a relationship can be retrieved via World::relation and World::relations.
w.add(rabbit, ecs::Pair(eats, carrot));
w.add(rabbit, ecs::Pair(eats, salad));
// Returns whatever the first found relation of the rabbit(*, salad) relationship is.
// In our case it is eats.
ecs::Entity first_relation = w.relation(rabbit, salad);
// Appends eats to the array
cnt::sarr_ext<ecs::Entity, 32> related_to_salad;
w.relations(rabbit, salad, related_to_salad);
Defining dependencies among entities is made possible via the (Requires, target) relationship.
When adding an entity with a dependency to some source it is guaranteed the dependency will always be present on the source as well. It will also be impossible to delete it.
ecs::World w;
ecs::Entity rabbit = w.add();
ecs::Entity animal = w.add();
ecs::Entity herbivore = w.add();
ecs::Entity carrot = w.add();
w.add(carrot, ecs::Pair(ecs::Requires, herbivore));
w.add(herbivore, ecs::Pair(ecs::Requires, animal));
// Carrot depends on herbivore so the later is added as well.
// At the same time, herbivore depends on animal so animal is added, too.
w.add(rabbit, carrot);
const bool isHerbivore = w.has(rabbit, herbivore)); // true
const bool isAnimal = w.has(rabbit, animal); // true
// Animal will not be removed from rabbit because of the dependency chain.
// Carrot depends on herbivore which depends on animal.
w.del(rabbit, animal); // does nothing
// Herbivore will not be removed from rabbit because of the dependency chain.
// Carrot depends on herbivore.
w.del(rabbit, herbivore); // does nothing
// Carrot can be deleted. It requires that herbivore is present which is true.
w.del(rabbit, carrot); // removes carrot from rabbit
Entity constrains are used to define what entities can not be combined with others.
ecs::World w;
ecs::Entity weak = w.add();
ecs::Entity strong = w.add();
w.add(weak, ecs::Pair(ecs::CantCombine, strong));
ecs::Entity e = w.add();
w.add(e, strong);
// Following line is an invalid operation.
w.add(e, weak);
Entities can be defined as exclusive. This means that only one relationship with this entity as a relation can exist. Any attempts to create a relationship with a different target replaces the previous relationship.
ecs::World w;
// Helper entities defining the state of a wall switch
ecs::Entity on = w.add();
ecs::Entity off = w.add();
// Create the "toggled" entity and define it as exclusive
ecs::Entity toggled = w.add();
w.add(toggled, ecs::Exclusive);
// Create a wall switch entity. There can be only one relationship with {toggled, *} now.
// Therefore, adding {toggled, off} overrides the previous {toggled, on}.
ecs::Entity wallSwitch = w.add();
w.add(wallSwitch, ecs::Pair(toggled, on));
bool isSwitched = w.has(wallSwitch, ecs::Pair{toggled, on}); // true
w.add(wallSwitch, ecs::Pair(toggled, off));
isSwitched = w.has(wallSwitch, ecs::Pair{toggled, on}); // false
Entities can inherit from other entities by using the (Is, target) relationship. This is a powerful feature that helps you identify an entire group of entities using a single entity.
ecs::World w;
ecs::Entity animal = w.add();
ecs::Entity rabbit = w.add();
ecs::Entity wall = w.add();
// Make rabbit an animal.
// This is an equivalent of:
// 1) w.add(animal, animal) <-- makes sure animal has a separate archetype which can be matched
// 2) w.add(rabbit, ecs::Pair(ecs::Is, animal)) <-- forms the relationship
w.as(rabbit, animal);
// Check if an entity is inheriting from something
bool animal_is_animal = w.is(animal, animal); // true
bool rabbit_is_animal = w.is(herbivore, animal); // true
bool wall_is_animal = w.is(wall, animal); // false
The Is relation ship can be very helpful when used in queries. However, before this feature can be properly utilized, one needs to make sure the entity in the Is relationship is treated as a type (has a separate archetype).
// Iterate everything that is animal
ecs::Query q = w.query().all(Pair(ecs::Is, animal));
q.each([](ecs::Entity entity) {
// entity = animal, rabbit
});
// Iterate everything that is animal but skip wolfs
ecs::Query q2 = w.query().all(Pair(ecs::Is, animal)).no(animal);
q2.each([](ecs::Entity entity) {
// entity = rabbit
});
This also means the entity inherits all ids which are present on the entity we inherit from.
struct Age { int value; };
...
w.add<Age>(animal, {10});
// We did not add the Age component to hare but we added it to the entity it inherits from.
// Therefore, we can ask its age.
Age age = w.get<Age>(hare);
// We can decide to override the value with a custom one.
// This will only affect the hare entity (and any entity inheriting from it).
w.set<Age>(hare, {20});
When deleting an entity we might want to define how the deletion is going to happen. Do we simply want to remove the entity or does everything connected to it need to get deleted as well? This behavior can be customized via relationships called cleanup rules.
Cleanup rules are defined as ecs::Pair(Condition, Reaction).
Condition is one of the following:
- OnDelete - deleting an entity/pair
- OnDeleteTarget - deleting a pair's target
Reaction is one of the following:
- Remove - removes the entity/pair from anything referencing it
- Delete - delete everything referencing the entity
- Error - error out when deleted
The default behavior of deleting an entity is to simply remove it from the parent entity. This is an equivalent of Pair(OnDelete, Remove) relationship pair attached to the entity getting deleted.
Additionally, a behavior which can not be changed, all relationship pairs formed by this entity need to be deleted as well. This is needed because entity ids are recycled internally and we could not guarantee that the relationship entity would be be used for something unrelated later.
All core entities are defined with (OnDelete,Error). This means that instead of deleting the entity an error is thrown when an attempt to delete the entity is made.
ecs::Entity rabbit = w.add();
ecs::Entity hare = w.add();
ecs::Entity eats = w.add();
ecs::Entity carrot = w.add();
w.add(rabbit, ecs::Pair(eats, carrot));
w.add(hare, ecs::Pair(eats, carrot));
// Delete the rabbit. Everything else is unaffected.
w.del(rabbit);
// Delete eats. Deletes eats and all associated relationships.
w.del(eats);
Creating custom rules is just a matter of adding a relationship to an entity.
ecs::Entity bomb_exploding_on_del = w.add();
w.add(bomb_exploding_on_del, ecs::Pair(OnDelete, Delete));
// Attach a bomb to our rabbit
w.add(rabbit, bomb_exploding_on_del);
// Deleting the bomb will take out all entities associated with it. Rabbit included.
w.del(bomb_exploding_on_del);
A core entity ChildOf can be used to express a physical hierarchy. It uses the (OnDeleteTarget, Delete) relationship so if the parent is deleted, all its children are deleted as well.
ecs::Entity parent = w.add();
ecs::Entity child1 = w.add();
ecs::Entity child2 = w.add();
w.add(child1, ecs::Pair(ecs::ChildOf, parent));
w.add(child2, ecs::Pair(ecs::ChildOf, parent));
// Deleting parent deletes child1 and child2 as well.
w.del(parent);
Unique component is a special kind of data that exists at most once per chunk. In other words, you attach data to one chunk specifically. It survives entity removals and unlike generic components, they do not transfer to a new chunk along with their entity.
If you organize your data with care (which you should) this can save you some very precious memory or performance depending on your use case.
For instance, imagine you have a grid with fields of 100 meters squared. If you create your entities carefully they get organized in grid fields implicitly on the data level already without you having to use any sort of spatial map container.
w.add<Position>(e1, {10,1});
w.add<Position>(e2, {19,1});
// Make both e1 and e2 share a common grid position of {1,0}
w.add<ecs::uni<GridPosition>>(e1, {1, 0});
Sometimes you need to delay executing a part of the code for later. This can be achieved via CommandBuffers.
CommandBuffer is a container used to record commands in the order in which they were requested at a later point in time.
Typically you use them when there is a need to perform structural changes (adding or removing an entity or component) while iterating chunks.
Performing an unprotected structural change is undefined behavior and most likely crashes the program. However, using a CommandBuffer you can collect all requests first and commit them when it is safe.
ecs::CommandBuffer cb;
q.each([&](Entity e, const Position& p) {
if (p.y < 0.0f) {
// Queue entity e for deletion if its Y position falls below zero
cb.del(e);
}
});
// Make the queued command happen
cb.commit(&w);
If you try to make an unprotected structural change with GAIA_DEBUG enabled (set by default when Debug configuration is used) the framework will assert letting you know you are using it the wrong way.
Systems are were your programs logic is executed. This usually means logic that is performed every frame / all the time. You can either spin your own mechanism for executing this logic or use the build-in one.
Creating a system is very similar to creating a query. In fact, the built-in systems are queries internally. Ones which are performed at a later point in time. For each system an entity is created.
SystemBuilder mySystem = w.system()
// System considers all entities with Position and Velocity components.
// Position is mutable.
.all<Position&, Velocity>()
// Logic to execute every time the system is invoked.
.on_each([&sys1_cnt](Position& p, const Velocity& v) {
p.x += v.x * dt;
p.y += v.y * dt;
p.z += v.z * dt;
});
// Retrieve the entity representing the system.
Entity mySystemEntity = mySystem.entity();
// Disable the entity. This effectively disables the system.
w.disable(mySystemEntity);
// Enable the entity. This effectively makes the system runnable again.
w.enable(mySystemEntity);
The system can be run manually or automatically.
// Run the system manually.
mySystem.exec();
// Call each system when the time is right.
w.update();
Letting systems run via World::update automatically is the preferred way and what you would normally do. Gaia-ECS can resolve any dependencies and execute the systems in the right order.
By default, the order in which the systems are run depends on their entity id. The lower the id the earlier the system is executed. If a different order is needed, there are multiple ways to influence it.
One of them is adding the DependsOn relationship to a system's entity.
SystemBuilder system1 = w.system().all ...
SystemBuilder system2 = w.system().all ...
// Make system1 depend on system2. This way, system1 is always executed after system1.
w.add(system1.entity(), ecs::Pair{DependsOn, system2});
If you need a specific group of systems depend on another group it can be achieved via the ChildOf relationship.
// Create 2 entities for system groups
Entity group1 = w.add();
Entity group2 = w.add();
// Create 3 systems
SystemBuilder system1 = w.system().all ...
SystemBuilder system2 = w.system().all ...
SystemBuilder system3 = w.system().all ...
// System1 and System2 belong in group2.
// System3 belongs in group1.
// Therefore, system3 is executed first, followed by system1 and system2.
w.add(system1.entity(), ecs::Pair{ChildOf, group2});
w.add(system2.entity(), ecs::Pair{ChildOf, group2});
w.add(system3.entity(), ecs::Pair{ChildOf, group1});
By default, all data inside components are treated as an array of structures (AoS). This is the natural behavior of the language and what you would normally expect.
Consider the following component:
struct Position {
float x, y, z;
};
If we imagine an ordinary array of 4 such Position components they are organized like this in memory: xyz xyz xyz xyz.
However, in specific cases, you might want to consider organizing your component's internal data as an structure or arrays (SoA): xxxx yyyy zzzz.
To achieve this you can tag the component with a GAIA_LAYOUT of your choosing. By default, GAIA_LAYOUT(AoS) is assumed.
struct Position {
GAIA_LAYOUT(SoA); // Treat this component as SoA
float x, y, z;
};
If used correctly this can have vast performance implications. Not only do you organize your data in the most cache-friendly way this usually also means you can simplify your loops which in turn allows the compiler to optimize your code better.
struct PositionSoA {
GAIA_LAYOUT(SoA);
float x, y, z;
};
struct VelocitySoA {
GAIA_LAYOUT(SoA);
float x, y, z;
};
...
ecs::Query q = w.query().all<PositionSoA&, VelocitySoA>;
q.each([](ecs::Iter& it) {
// Position
auto vp = it.view_mut<PositionSoA>(); // read-write access to PositionSoA
auto px = vp.set<0>(); // continuous block of "x" from PositionSoA
auto py = vp.set<1>(); // continuous block of "y" from PositionSoA
auto pz = vp.set<2>(); // continuous block of "z" from PositionSoA
// Velocity
auto vv = it.view<VelocitySoA>(); // read-only access to VelocitySoA
auto vx = vv.get<0>(); // continuous block of "x" from VelocitySoA
auto vy = vv.get<1>(); // continuous block of "y" from VelocitySoA
auto vz = vv.get<2>(); // continuous block of "z" from VelocitySoA
// Handle x coordinates
GAIA_EACH(it) px[i] += vx[i] * dt;
// Handle y coordinates
GAIA_EACH(it) py[i] += vy[i] * dt;
// Handle z coordinates
GAIA_EACH(it) pz[i] += vz[i] * dt;
});
You can even use SIMD intrinsics now without a worry. Note, this is just an example not an optimal way to rewrite the loop. Also, most compilers will auto-vectorize this code in release builds anyway. The code bellow uses x86 SIMD intrinsics:
...
auto process_data = [](float* p, const float* v, const uint32_t cnt) {
uint32_t i = 0;
// Process SSE-sized blocks first
for (; i < cnt; i+=4) {
const auto pVec = _mm_load_ps(p + i);
const auto vVec = _mm_load_ps(v + i);
const auto respVec = _mm_fmadd_ps(vVec, dtVec, pVec);
_mm_store_ps(p + i, respVec);
}
// Process the rest of the elements
for (; i < cnt; ++i) p[i] += v[i] * dt;
}
// Handle x coordinates
process_data(px.data(), vx.data(), it.size());
// Handle y coordinates
process_data(py.data(), vy.data(), it.size());
// Handle z coordinates
process_data(pz.data(), vz.data(), it.size());
...
Different layouts use different memory alignments. GAIA_LAYOUT(SoA) and GAIA_LAYOUT(AoS) align data to 8-byte boundaries, while GAIA_LAYOUT(SoA8) and GAIA_LAYOUT(SoA16) align to 16 and 32 bytes respectively. This makes them a good candidate for AVX and AVX512 instruction sets (or their equivalent on different platforms, such as NEON on ARM).
Serialization of arbitrary data is available via following functions:
- ser::bytes - calculates how many bytes the data needs to serialize
- ser::save - writes data to serialization buffer
- ser::load - loads data from serialization buffer
Any data structure can be serialized at compile-time into the provided serialization buffer. Native types, compound types, arrays and anything with data() + size() functions are supported out-of-the-box. If resize() is available it will be utilized.
Example:
struct Position {
float x, y, z;
};
struct Quaternion {
float x, y, z, w;
};
struct Transform {
Position p;
Quaternion r;
};
struct Transform {
cnt::darray<Transform> transforms;
int some_int_data;
};
...
Transform in, out;
GAIA_FOR(10) t.transforms.push_back({});
t.some_int_data = 42069;
ecs::SerializationBuffer s;
// Calculate how many bytes is it necessary to serialize "in"
s.reserve(ser::bytes(in));
// Save "in" to our buffer
ser::save(s, in);
// Load the contents of buffer to "out"
s.seek(0);
ser::load(s, out);
Customization is possible for data types which require special attention. We can guide the serializer by either external or internal means.
External specialization comes handy in cases where we can not or do not want to modify the source type:
struct CustomStruct {
char* ptr;
uint32_t size;
};
namespace gaia::ser {
template <>
uint32_t bytes(const CustomStruct& data) {
return data.size + sizeof(data.size);
}
template <typename Serializer>
void save(Serializer& s, const CustomStruct& data) {
// Save the size integer
s.save(data.size);
// Save data.size raw bytes staring at data.ptr
s.save(data.ptr, data.size);
}
template <typename Serializer>
void load(Serializer& s, CustomStruct& data) {
// Load the size integer
s.load(data.size);
// Load data.size raw bytes to location pointed at by data.ptr
data.ptr = new char[data.size];
s.load(data.ptr, data.size);
}
}
...
CustomStruct in, out;
in.ptr = new char[5];
in.ptr[0] = 'g';
in.ptr[1] = 'a';
in.ptr[2] = 'i';
in.ptr[3] = 'a';
in.ptr[4] = '\0';
in.size = 5;
ecs::SerializationBuffer s;
// Reserve enough bytes in the buffer so it can fit the entire in
s.reserve(ser::bytes(in));
ser::save(s, in);
// Move to the start of the buffer and load its contents to out
s.seek(0);
ser::load(s, out);
// Let's release the memory we allocated
delete in.ptr;
delete out.ptr;
You will usually use internal specialization when you have the access to your data container and at the same time do not want to expose its internal structure. Or if you simply like intrusive coding style better. In order to use it the following 3 member functions need to be provided:
struct CustomStruct {
char* ptr;
uint32_t size;
constexpr uint32_t bytes() const noexcept {
return size + sizeof(size);
}
template <typename Serializer>
void save(Serializer& s) const {
s.save(size);
s.save(ptr, size);
}
template <typename Serializer>
void load(Serializer& s) {
s.load(size);
ptr = new char[size];
s.load(ptr, size);
}
};
It doesn't matter which kind of specialization you use. However, note that if both are used the external one has priority.
To fully utilize your system's potential Gaia-ECS allows you to spread your tasks into multiple threads. This can be achieved in multiple ways.
Tasks that can not be split into multiple parts or it does not make sense for them to be split can use ThreadPool::sched. It registers a job in the job system and immediately submits it so worker threads can pick it up:
mt::Job job0 {[]() {
InitializeScriptEngine();
}};
mt::Job job1 {[]() {
InitializeAudioEngine();
}};
ThreadPool &tp = ThreadPool::get();
// Schedule jobs for parallel execution
mt::JobHandle jobHandle0 = tp.sched(job0);
mt::JobHandle jobHandle1 = tp.sched(job1);
// Wait for jobs to complete
tp.wait(jobHandle0);
tp.wait(jobHandle1);
When crunching larger data sets it is often beneficial to split the load among threads automatically. This is what ThreadPool::sched_par is for.
static uint32_t SumNumbers(std::span<const uint32_t> arr) {
uint32_t sum = 0;
for (uint32_t val: arr)
sum += val;
return sum;
}
...
constexpr uint32_t N = 1'000'000;
cnt::darray<uint32_t> arr;
...
std::atomic_uint32_t sum = 0;
mt::JobParallel job {[&arr, &sum](const mt::JobArgs& args) {
sum += SumNumbers({arr.data() + args.idxStart, args.idxEnd - args.idxStart});
}};
// Schedule multiple jobs to run in parallel. Make each job process up to 1234 items.
mt::JobHandle jobHandle = tp.sched_par(job, N, 1234);
// Alternatively, we can tell the job system to figure out the group size on its own
// by simply omitting the group size or using 0:
// mt::JobHandle jobHandle = tp.sched_par(job, N);
// mt::JobHandle jobHandle = tp.sched_par(job, N, 0);
// Wait for jobs to complete
tp.wait(jobHandle);
// Use the result
GAIA_LOG("Sum: %u\n", sum);
Sometimes we need to wait for the result of another operation before we can proceed. To achieve this we need to use low-level API and handle job registration and submitting jobs on our own.
NOTE:
This is because once submitted we can not modify the job anymore. If we could, dependencies would not necessary be adhered to.
Let us say there is a job A depending on job B. If job A is submitted before creating the dependency, a worker thread could execute the job before the dependency is created. As a result, the dependency would not be respected and job A would be free to finish before job B.
mt::Job job0;
job0.func = [&arr, i]() {
arr[i] = i;
};
mt::Job job1;
job1.func = [&arr, i]() {
arr[i] *= i;
};
mt::Job job2;
job2.func = [&arr, i]() {
arr[i] += i;
};
// Register our jobs in the job system
auto job0Handle = tp.add(job0);
auto job1Handle = tp.add(job1);
auto job2Handle = tp.add(job2);
// Create dependencies
tp.dep(job0Handle, job1Handle);
tp.dep(job1Handle, job2Handle);
// Submit jobs so worker threads can pick them up.
// The order in which jobs are submitted does not matter.
tp.submit(job2Handle);
tp.submit(job1Handle);
tp.submit(job0Handle);
// Wait for the last job to complete.
tp.wait(job2Handle);
Nowadays, CPUs have multiple cores. Each of them is capable of running at different frequencies depending on the system's power-saving requirements and workload. Some CPUs contain cores designed to be used specifically in high-performance or efficiency scenarios. Or, some systems even have multiple CPUs.
Therefore, it is important to have the ability to utilize these CPU features with the right workload for our needs. Gaia-ECS allows jobs to be assigned a priority tag. You can either create a high-priority jobs (default) or low-priority ones.
The operating system should try to schedule the high-priority jobs to cores with highest level of performance (either performance cores, or cores with highest frequency etc.). Low-priority jobs are to target slowest cores (either efficiency cores, or cores with lowest frequency).
Where possible, the given system's QoS is utilized (Windows, MacOS). In case of operating systems based on Linux/FreeBSD that do not support QoS out-of-the-box, thread priorities are utilized.
Note, thread affinity is not managed and left on default because this plays better with QoS and gives the operating system more control over scheduling.
// Create a job designated for performance cores
mt::Job job0;
job0.priority = JobPriority::High;
job0.func = ...;
tp.sched(job0);
// Create a job designated for efficiency cores
mt::Job job0;
job0.priority = JobPriority::Low;
job0.func = ...;
tp.sched(job0);
Job behavior can be partial customized. For example, if we want to manage its lifetime manually, on its creation we can tell the threadpool.
mt::Job job;
// This job's lifetime won't be managed by the threadpool
job.flags = mt::JobCreationFlags::ManualDelete;
tp.func = ...;
mt::JobHandle handle = tp.sched(job);
tp.wait(handle);
// We release the job handle ourselves
tp.del(handle);
The total number of threads created for the pool is set via ThreadPool::set_max_workers. By default, the number of threads created is equal to the number of available CPU threads minus 1 (the main thread). However, no matter how many threads are requested, the final number if always capped to 31 (ThreadPool::MaxWorkers). The number of available threads on your hardware can be retrieved via ThreadPool::hw_thread_cnt.
auto& tp = mt::ThreadPool::get();
// Get the number of hardware threads.
const uint32_t hwThreads = tp.hw_thread_cnt();
// Create "hwThreads" worker threads. Make all of them high priority workers.
tp.set_max_workers(hwThreads, hwThreads);
// Create "hwThreads" worker threads. Make 1 of them a high priority worker.
// If more then 1 worker threads were created, the rest of them is dedicated
// for low-priority jobs.
tp.set_max_workers(hwThreads, 1);
// No workers threads are used. If there were any before, they are destroyed.
// All processing is happening on the main thread.
tp.set_max_workers(0, 0);
The number of worker threads for a given performance level can be adjusted via ThreadPool::set_workers_high_prio and ThreadPool::set_workers_low_prio. By default, all workers created are high-priority ones.
auto& tp = mt::ThreadPool::get();
// Get the number of worker threads available for this system.
const uint32_t workers = tp.workers();
// Set the number of worker threads dedicated for performance cores.
// E.g. if workers==5, this dedicates 4 worker threads for high-performance workloads
// and turns the remaining 1 into an efficiency worker.
tp.set_workers_high_prio(workers - 1);
// Make all workers high-performance ones.
tp.set_workers_high_prio(workers);
// Set the number of worker threads dedicated for efficiency cores.
// E.g. if workers==5, this dedicates 4 worker threads for efficiency workloads loads
// and turns the remaining 1 into a high-performance worker.
tp.set_workers_low_prio(workers - 1);
// Make all workers low-performance ones.
tp.set_workers_low_prio(workers);
The main thread normally does not participate as a worker thread. However, if needed, it can join workers by calling ThreadPool::update from the main thread.
auto& tp = mt::ThreadPool::get();
ecs::World w1, w2;
while (!stopProgram) {
// Create many jobs here
// ...
// Update both worlds
w1.update();
w2.update();
// Help execute jobs on the main thread, too
tp.update();
}
If you need to designate a certain thread as the main thread, you can do it by calling ThreadPool::make_main_thread from that thread.
Note, the operating system has the last word here. It might decide to schedule low-priority threads to high-performance cores or high-priority threads to efficiency cores depending on how the scheduler decides it should be.
Compiler with a good support of C++17 is required.
The project is continuously tested and guaranteed to build warning-free on the following compilers:
CMake 3.12 or later is required to prepare the build. Other tools are officially not supported at the moment. However, nothing stops you from placing gaia.h into your project.
Unit testing is handled via Catch2. It can be controlled via -DGAIA_BUILD_UNITTEST=ON/OFF when configuring the project (OFF by default).
The following shows the steps needed to build the library:
# Check out the library
git clone https://github.com/richardbiely/gaia-ecs.git
# Go to the library root
cd gaia-ecs
# Make a build directory
cmake -E make_directory "build"
# Generate cmake build files (Release for Release configuration)
cmake -DCMAKE_BUILD_TYPE=Release -S . -B "build"
# ... or if you use CMake older than 3.13 you can do:
# cmake -E chdir "build" cmake -DCMAKE_BUILD_TYPE=Release ../
# Build the library
cmake --build "build" --config Release
To target a specific build system you can use the -G parameter:
# Microsoft Visual Studio 2022, 64-bit, x86 architecture
cmake -G "Visual Studio 17 2022" -A x64 ...
# Microsoft Visual Studio 2022, 64-bit, ARM architecture
cmake -G "Visual Studio 17 2022" -A ARM64 ...
# XCode
cmake -G Xcode ...
# Ninja
cmake -G Ninja
NOTE
When using MacOS you might run into a few issues caused by the specifics of the platform unrelated to Gaia-ECS. Quick way to fix them is listed bellow.CMake issue:
After you update to a new version of Xcode you might start getting "Ignoring CMAKE_OSX_SYSROOT value: ..." warnings when building the project. Residual cmake cache is to blame here. A solution is to delete files generated by cmake.Linker issue:
When not building the project from Xcode and using ld as your linker, if XCode 15 or later is installed on your system you will most likely run into various issues: https://developer.apple.com/documentation/xcode-release-notes/xcode-15-release-notes#Linking. In the CMake project a workaround is implemented which adds "-Wl,-ld_classic" to linker settings but if you use a different build system or settings you might want to do same.
Following is a list of parameters you can use to customize your build
Parameter | Description |
---|---|
GAIA_BUILD_UNITTEST | Builds the unit test project |
GAIA_BUILD_BENCHMARK | Builds the benchmark project |
GAIA_BUILD_EXAMPLES | Builds example projects |
GAIA_GENERATE_CC | Generates compile_commands.json |
GAIA_MAKE_SINGLE_HEADER | Generates a single-header version of the framework |
GAIA_PROFILER_CPU | Enables CPU profiling features |
GAIA_PROFILER_MEM | Enabled memory profiling features |
GAIA_PROFILER_BUILD | Builds the profiler (Tracy by default) |
GAIA_USE_SANITIZER | Applies the specified set of sanitizers |
Possible options are listed in cmake/sanitizers.cmake.
Note, that some options don't work together or might not be supported by all compilers.
cmake -DCMAKE_BUILD_TYPE=Release -DGAIA_USE_SANITIZER=address -S . -B "build"
Gaia-ECS is shipped also as a single header file which you can simply drop into your project and start using. To generate the header we use a wonderful Python tool Quom.
To generate the header use the following command inside your root directory.
quom ./include/gaia.h ./single_include/gaia.h -I ./include
You can also use the attached make_single_header.sh or create your script for your platform.
Creation of the single header can be automated via -GAIA_MAKE_SINGLE_HEADER.
The repository contains some code examples for guidance.
Examples are built if GAIA_BUILD_EXAMPLES is enabled when configuring the project (OFF by default).
Project name | Description |
---|---|
External | A dummy example showing how to use the framework in an external project. |
Standalone | A dummy example showing how to use the framework in a standalone project. |
Basic | Simple example using some basic features of the framework. |
Roguelike | Roguelike game putting all parts of the framework to use and represents a complex example of how everything would be used in practice. It is work-in-progress and changes and evolves with the project. |
To be able to reason about the project's performance and prevent regressions benchmarks were created.
Benchmarking relies on picobench. It can be controlled via -DGAIA_BUILD_BENCHMARK=ON/OFF when configuring the project (OFF by default).
Project name | Description |
---|---|
Duel | Compares different coding approaches such as the basic model with uncontrolled OOP with data all-over-the heap, OOP where allocators are used to controlling memory fragmentation and different ways of data-oriented design and it puts them to test against our ECS framework itself. DOD performance is the target level we want to reach or at least be as close as possible to with this project because it does not get any faster than that. |
App | Somewhat similar to Duel but measures in a more complex scenario. Inspired by ECS benchmark. |
Iteration | Covers iteration performance with different numbers of entities and archetypes. |
Entity | Focuses on performance of creating and removing entities and components of various sizes. |
Multithreading | Measures performance of the job system. |
It is possible to measure the performance and memory usage of the framework via any 3rd party tool. However, support for Tracy is added by default.
CPU part can be controlled via -DGAIA_PROF_CPU=ON/OFF (OFF by default).
Memory part can be controlled via -DGAIA_PROF_MEM=ON/OFF (OFF by default).
Building the profiler server can be controlled via -DGAIA_PROF_CPU=ON (OFF by default).
NOTE:
This is a low-level feature mostly targeted for maintainers. However, if paired with your own profiler code it can become a very helpful tool.
Custom profiler support can be added by overriding GAIA_PROF_* preprocessor definitions:
#define GAIA_PROF_FRAME my_profilers_frame_function
#define GAIA_PROF_SCOPE my_profilers_zone_function
...
#include <gaia.h>
The project is thoroughly unit-tested and includes thousands of unit tests covering essentially every feature of the framework. Benchmarking relies on a modified picobench.
It can be controlled via -DGAIA_BUILD_UNITTEST=ON/OFF (OFF by default).
To see what the future holds for this project navigate here
Requests for features, PRs, suggestions, and feedback are highly appreciated.
Make sure to visit the project's discord or the discussions section here on GitHub. If necessary, you can contact me directly either via the e-mail (you can find it on my profile page) or you can visit my X.
If you find the project helpful, do not forget to leave a star or sponsor its development.
Thank you for using the project. You rock! :)
Code and documentation Copyright (c) 2021-2024 Richard Biely.
Code released under the MIT license.