DEV Community

Discussion on: Exploring the Use of C++ for Game Development

Collapse
 
taqmuraz profile image
Taqmuraz

C++ is not cross-platform very much. Different compilers frequently generate binaries that work differently. And, you have to recompile your code for different platforms, while Java has cross-platform binaries by default.
Also, manual memory control is always a negative trait of C++, not a positive one. Manual memory control fails everywhere, comparing with garbage collector.

Collapse
 
tymey profile image
Tyler Meyer

Interesting! This space has been very opinionated online. I kept reading how memory management in C++ helps with speed and performance. I have a background in JavaScript and am beginning my learning in C++. I also plan to learn C# and Java. From what I've read about Java, the cross-platform capabilities sound so convenient. Are you familiar with Java? Do you use it regularly? If so, what do you like about it?

Collapse
 
taqmuraz profile image
Taqmuraz

Java memory allocation in fact is faster than C or C++ ones. I made benchmarks recently, and I am going to write an article about that. I used Java for two years in my personal projects, but last 6 months I use Clojure (Lisp on JVM), making my game.
If you know how to benchmark properly, you can find yourself, that modern Java is faster than C and C++, or equal. JIT compiler has much more freedom and information, than traditional static compilers.

Collapse
 
tandrieu profile image
Thibaut Andrieu

The language is cross plateform. With the same source code you can build for windows, linux ARM, x86, etc... That's the definition of cross-plateform.

Garbage collector do the job until you face out of memory or performance issues.
When dealing with large data or performance critical application, like video games, you quickly end up fighting against the GC because it trigger at the wrong time, or too late, it fragment memory, etc...

In C++, deallocation is deterministic (at least virtually, in the end the system always decide when memory is actually freed). And using smart pointer, you no more care about manual deallocation.

Collapse
 
taqmuraz profile image
Taqmuraz

It seems to me you are making up problems with GC.
Modern Java, Go and many other languages GC's don't fragment memory.
If you have such terrible problems with performance, maybe it is not GC?
I am tired of appeals to majority or popularity.
Show me real benchmarks, where C++ or C show at least 20% higher performance, so it would be reasonable to pay with time and safety developing games with those languages.

Thread Thread
 
tandrieu profile image
Thibaut Andrieu

Typical GC issue: I have a C# wrapping over a C++ library. This C++ handle large dataset (like dozen of GB). But because of the way wrapping is done, the C# object is just an handle keeping the C++ object alive. From garbage collector point of view, this object is just a few byte in memory, so in very low priority to be garbaged collected.
So, when I'm sure no one use this object anymore, I have to manually call gc.collect(Forced); gc.WaitForPendingFinalizers(); and all the magical gc method to be sure memory is actually free. And even by doing so, there are still magical heuristic that will prevent GC to dispose this handle.

As Joel said:

All non-trivial abstractions, to some degree, are leaky.

joelonsoftware.com/2002/11/11/the-...

GC is an abstraction. It works well 99% of time, but as any abstraction, it will leak in corner cases.

Thread Thread
 
taqmuraz profile image
Taqmuraz

C# has poor GC, and now I understand why you have such opinion about GC in general. C# provides ability to interoperate C++ with CLR, also C# objects may have destructors, what makes garbage collection much more complex, even when there are no objects with a destructor in the heap.

Collapse
 
taqmuraz profile image
Taqmuraz

Though, good luck using C++ for making games. Having competitors struggling with C++ is a very good thing for everyone else

Collapse
 
miketalbot profile image
Mike Talbot ⭐

I used to use C++ decades ago and came back to it 2 years back with Unreal, tbf Unreal has a big ecosystem, but it does make C++ work better for me. Given how successful Unreal games are, I don't think you can dismiss C++ like it's a joke, though; a lot of significant games have a combination of blueprints and C++ developed for them.

Bare metal allocations in C++ can improve performance; they just take a lot more thinking about. I'm good with game developers not caring and using something else that deals with it for you - I do most games in Unity GameObjects so I definitely fall in that camp for many projects. Unity has introduced DOTS to combat the performance of lower-level languages and the kind of designs that work well in them.

Thread Thread
 
taqmuraz profile image
Taqmuraz

Bare metal allocations don't improve performance. Check it for yourself, compare with garbage collected languages, such as Java. Allocations there work faster for a reason. I am going to write an article about that (with benchmarks, of course), so you are welcome to read it in future.
In reality, C++ and C are not only slower (or same) with Java, Go, Lisp, but significantly less safe, less readable and less effective (we are talking here not about system programming, but about games)

Thread Thread
 
miketalbot profile image
Mike Talbot ⭐

Yes I've checked it for myself, garbage collection sweeps and allocation frequencies have big impacts in hard to fathom ways. There's a reason why most game engines are written in C++ - Unity and Unreal etc.

Thread Thread
 
taqmuraz profile image
Taqmuraz • Edited

Mister, do you have exact benchmarks? Because I do.
github.com/Taqmuraz/alloc-table
And they show, that C and C++ are slower in memory allocation, than Java 17.
Yes, there are reasons, why most game engines written on C++, yes. But speed is not a reason.

Thread Thread
 
miketalbot profile image
Mike Talbot ⭐

Sure, list your benchmarks and their code and I'll figure out where you got it wrong :)

Thread Thread
 
taqmuraz profile image
Taqmuraz

You have a link. I am open for any critics. More than that, I WANT you to show me if I am wrong. Please, e-mail me, if you find something. My e-mail you may find in my profile.

Thread Thread
 
plotarmouredtitan profile image
PlotArmouredTitan • Edited

Checked your c++ code and I'm confused on why you would create a pointer on one line only to delete it two lines after.

You would never see anyone do this in real code because that's no what pointers are meant for. Pointers are for resources that need to be held for long periods through different scopes and gives you the control of when to delete it.

A garbage collector which Java uses on the other hand do not give you this freedom. Instead they choose specific times to collect and delete unused resources all at once which can create visible performance degradation for demanding applications and would be a nightmare for a game.

Rather than allocating and deleting a pointer on every loop iteration, you should either allocate it all at once as a pointer to an array or just use a plain old variable, keep in mind the default c++ allocator is slow compared to other allocation methods

Edit: Additionally java requires a virtual machine which basically makes it an interpreted language. For very demanding tasks, an interpreted language will never match the speed of a compiled language like c++.

Thread Thread
 
taqmuraz profile image
Taqmuraz • Edited

I added that delete call only because without that C++ test cannot complete (out of memory). Table has results without that delete call.
Also, benchmark is a benchmark. It is not a program that does something useful. So, I would expect critics of measuring methods and results, not style.
UPDATE:
Actually, releasing memory exactly after you used it is a good thing in C and C++, it makes next allocation faster (benchmarked as well, though it is explained in specs).

Thread Thread
 
plotarmouredtitan profile image
PlotArmouredTitan • Edited

If you're going to use a resource for 3 lines of code then don't use memory on the heap, use a normal variable. It's not about coding style it's about what the feature was designed for. There's limited value in the testing scenario itself

Also you should use std::Chrono for timing because sys/time has been known to be inconsistent on different platforms

Thread Thread
 
taqmuraz profile image
Taqmuraz • Edited

First. I am testing heap memory allocation, so I allocate memory from the heap. Period.
Second. How much inconsistent sys/time is? Do you think that use of std::chrono would change results by at least 5%?
UPDATE
I really don't get it. Test clearly show, that C malloc is slower than java new[].
All point of the C test is to measure how much time does take malloc itself.
And you are saying something about malloc use cases. Did I say anything about use cases?

Thread Thread
 
_ce7114b1f3f00437dc3e0 profile image
張宏欣

lol, first of all, benchmarking memory allocation does not equal to benchmarking manual or automatic memory management, it only benchmark a part of it, so your test result does not mean manual memory management is slower than managed ones.

secondly, I wonder what would be the reason for game engines mostly written in C++ when you claim that its slower, unreadable, unsafe, and inefficient

Thread Thread
 
taqmuraz profile image
Taqmuraz

Starting conversation from "lol" does not make you look smart.
Memory management includes allocating, accessing and releasing memory. I proved that allocating and releasing memory is slower in C and C++, than in Java. So, you want to say, accessing memory in C or C++ is somehow faster or what?

Well, I also wonder, why people choose C++ in game industry. Though, appealing for its popularity is a dubious argument. Job market state, traditions, history, people's fallacies, many factors, and you must prove that speed is one of them.

Show me that speed you are talking about in actual benchmarks.

Thread Thread
 
tandrieu profile image
Thibaut Andrieu

DDR4 peak bandwidth is ~20GB/s (crucial.com/support/memory-speeds-...).

In your "Person" benchmark, you do 1 billion allocation of int value, which is 4GB. This would require 4 / 20 = 200ms assuming peak bandwidth. But your java benchmark do it in ~30ms....

I'm quite sure in you example java doesn't really allocate the memory but optimize the code.

How did you build the C++ code ? Did you use -O3 to enable optimizations ?

Thread Thread
 
taqmuraz profile image
Taqmuraz

Yes, I did use -O3. You can build yourself C, C++ and Java sources and run.
Also, Java specification is very strict about constructor calls. If it optimizes it somehow with no loss of side effects, that is an advantage as well.
Though I want to hear your ideas about how to detect and avoid JVM optimizations, if there are ones. Also, what will you say about Common Lisp arrays allocation? It is also faster than malloc in C, but Common Lisp runtime (SBCL) seem to be honest very much with memory allocation. Also, in the C test I allocate 4 bytes int arrays, while in the Common Lisp test are 8 bytes int arrays.

Thread Thread
 
tandrieu profile image
Thibaut Andrieu

malloc is a direct call to system. You just cannot be faster than that, otherwise you don't actually do an allocation. It's like saying C++ is faster than assembly because your code was evaluated as compile time (using constexpr and other optim), compare to run time evaluation in assembly.

But yes, we understood, you don't like C++.

Thread Thread
 
taqmuraz profile image
Taqmuraz

Seems you don't know how GC allocation works. Though it really does not call system to allocate memory, it has pre-allocated heap.

Thread Thread
 
tandrieu profile image
Thibaut Andrieu

Yes, so you don't measure the same thing. gc use memory pool system, but when does this allocation occured ? Did you take it in accout in your benchmark ? I don't think so. According to your code, you don't measured startup time, just the time of new int[i]; which probaly use free slot in gc memory pool, so no allocation.
I could do the same thing in C++, create a custom allocator using a memory pool, and then measure time to get a free slot in this pool. And I'm quite sure it will be faster than java.

To be honnest, you also have to take in acount JVM startup time.

Here is a list of benchmark of java VS Cpp, and cpp outperform java in any of them:
programming-language-benchmarks.ve...

Thread Thread
 
taqmuraz profile image
Taqmuraz • Edited

Sorry, I can't continue seriously talking with you.
Please, study theory about garbage collection.
And read again, what my point was.
Bye.
UPDATE
You seriously say "If I implement garbage collector in C++ it will be faster than traditional memory allocation in C++". Bravo, you proved my point.

Thread Thread
 
tandrieu profile image
Thibaut Andrieu

Dude, seriously you better have a step back and reevaluate the way you communicate and your position in general.
I read some of your comment and it's distressing.
I've worked with juniors like you, that are assertive on everything and think they know better than a whole industry. The truth is that no one want to work with people like this.

For your own good, try to listen and trust other's people opinion, especially when they have 15/20 years xp in their field. And I'm taling about real expert, not some self proclaimed senior that did the same job for the last 20 years in the same small company.

I just hope you are a troll and not like that in real life, else you might have bad time building your career.

Thread Thread
 
taqmuraz profile image
Taqmuraz

I can't stand how arrogant you are.
But here you have a point.
It is wrong to offend people while discussing programming things.
Come back when you have better arguments, and sorry for my rude behavior.

Thread Thread
 
plotarmouredtitan profile image
PlotArmouredTitan • Edited

You seriously say "If I implement garbage collector in C++ it will be faster than traditional memory allocation in C++". Bravo, you proved my point.

No it's actually a built in feature in c++, read about std::allocator. Like I have said, the default allocation method is not the fastest

Thread Thread
 
taqmuraz profile image
Taqmuraz

Can you demonstrate such code for C++? To replace C++ test code from my repo.
Requirement is – memory we get from new Person() call must be located in the heap, and new code must not exploit current test implementation (example of such exploit – always returning the same pointer).

Thread Thread
 
tandrieu profile image
Thibaut Andrieu

Arguing with software developer is like chasing a pig in the mud. It leads you to nowhere and at some point, you will realize the pig likes that.

Thread Thread
 
_ce7114b1f3f00437dc3e0 profile image
張宏欣 • Edited

I loled again. You are trying so hard on numbers that does not even translate to a thing. Here is a hint, you forgot memory consumption and memory constraint. Some games need to run on minimum memory, try your benchmark on a limited memory, the smaller it gets, you'll see the picture. And FYI, this is why Android phones needs more RAM compared to iOS phone, and why Android phones was laggy since they were only 1GB RAM at first. Your claim that Java is (I assume you mean always) faster than C++ is easily disproven by Android (java) vs iOS (objective C), Java can be faster only if it has significantly MORE memory to use.

Thread Thread
 
miketalbot profile image
Mike Talbot ⭐ • Edited

Also JVMs aren't carved out of stone and magic pixel dust. JVMs being written in C or C++ is common - the point being, you can implement anything Java does in C++ - you have to be able to, because that's how Java is made... The other way around is not true because the designers decided to allow a subset and enforce garbage collection - which seems really fast when there's lots of memory, but then really doesn't seem fast when the world pauses for mark and sweep.

Thread Thread
 
taqmuraz profile image
Taqmuraz

I am writing 3D games with Java (using own game engine) and usually they take 75-150 MB of RAM. So what do you say here.

Thread Thread
 
miketalbot profile image
Mike Talbot ⭐ • Edited

Years ago I worked on JMonkey Engine, Java is a great platform and it is fine for building games. I'm simply saying that C++ is a low level language capable of implementing any pattern, including garbage collection should you wish. It's also great at allowing a developer the flexibility to create what they need and to optimally ensure that there aren't glitches and slow downs. Your games may never suffer those, which is fine - but that says something about the type of games you are making and the performance in the cases you have discovered.

I definitely agree that it is easier not to mess stuff up in Java, but that comes with a flexibility cost that a seasoned developer can exploit in a lower level language to wring every ounce of performance out of their solution.

Language choice, language level, virtual machine versus specific compilation, these are tool choices for us as developers, we should choose the right tool for the right job. It's a bad idea to worship tools and a good idea to question assumptions. If your games are 150MB then unsurprising you can get good performance out of Java because it will use way more actual memory than that and use free space compaction and a variety of other techniques to make memory allocations optimal. If you made a game that used more memory, you'd find it didn't perform as well. If you knew C++ and system architecture well, you could do the same thing in that language, probably with a few optimisations - but why bother, you don't need to so its fine for the types of project you are working on.

It's easier to write bad C++ than it is bad Java. This is a weakness of C++ and the only antidote is to be very good at developing it.

Thread Thread
 
taqmuraz profile image
Taqmuraz

Mostly, you are correct. I just wonder why it is appropriate to say "your C++ code is slow because you don't know C++ well" while "my Java code is slow because I don't know Java well" is so rare to hear.

Thread Thread
 
miketalbot profile image
Mike Talbot ⭐

It's one of the great strengths of Java for sure and why I'd choose it over C++ for any business application and many other projects.

Thread Thread
 
_ce7114b1f3f00437dc3e0 profile image
張宏欣 • Edited

just put a demo for it, just claiming that without proof is moot, just like you insist on benchmark result, lol, also, you only mention your game's memory usage, but not constraint, try running it on a 256MB or 512MB RAM machine, years ago 512MB RAM could run 3D games smoothly, but still, it can't be compared with your game, since its not apple to apple, the thing is, you have more ways to reduce memory usage on C++ than on Java

Thread Thread
 
taqmuraz profile image
Taqmuraz

I already tried running it on 256 MB of RAM. JVM has options to limit heap size.
About demo – to much effort would be for this argument with you. You could run it yourself, my github repos are public.

 
plotarmouredtitan profile image
PlotArmouredTitan • Edited

This is a version that uses std::allocator to pre-allocate memory before later constructing it for usage

#include "stdio.h"
#include "stdlib.h"
#include <chrono>
#include <iostream>
#include <string>
#include <memory>

class Person {
public:
    int age;
    Person(int age)
    {
        this->age = age;
    }
};

auto time_ms()
{
    return std::chrono::high_resolution_clock::now();
}

int run(int r, int s) {
    auto start = time_ms();
    int e = 0;

    for (int i = 0; i < r; i++) {
        Person* p = new Person(s);
        e += p->age;
        delete p; // only to avoid `out of memory` error
        // in fact, `delete` improves performance here
        // almost twice
    }
    auto end = time_ms();
    auto took = std::chrono::duration_cast<std::chrono::milliseconds>(end - start);
    printf("Took : %lld ms  ", took.count());
    return took.count();
}

int run_alloc(int r, int s) {

    auto start = time_ms();

    std::allocator<Person> alloc;
    //allocate memory pool
    Person * memPool = alloc.allocate(r);

    int e = 0;

    for (int i = 0; i < r; i++) {
        //retrieve address from memory pool
        Person* p = memPool + i;

        //construct object
        alloc.construct(p, s);

        e += p->age;

        //destroy object
        alloc.destroy(p);
    }

    //deallocate memory pool
    alloc.deallocate(memPool, r);

    auto end = time_ms();
    auto took = std::chrono::duration_cast<std::chrono::milliseconds>(end - start);
    printf("Alloc method Took : %lld ms\n", took.count());
    return took.count();
}
int run_alloc_reused(int r, int s) {

    auto start = time_ms();

    std::allocator<Person> alloc;
    Person* memAddr = alloc.allocate(1);

    int e = 0;

    for (int i = 0; i < r; i++) {
        Person* p = memAddr;

        alloc.construct(p, s);

        e += p->age;

        alloc.destroy(p);
    }

    alloc.deallocate(memAddr, r);

    auto end = time_ms();
    auto took = std::chrono::duration_cast<std::chrono::milliseconds>(end - start);
    printf("Reused Alloc method Took : %lld ms\n", took.count());
    return took.count();
}


int main(int argc, char** argv) {

    /*
    int r = atoi(argv[1]);
    int s = atoi(argv[2]);

    printf("Inputs : %d %d\n", r, s);
    printf("Integer type size : %ld\n", sizeof(int));

    */

    int shouldRun = 1;

    while(shouldRun)
    {
        int r = 0;
        int s = 0;

        std::cout << "input r" << '\n';
        std::cin >> r;
        std::cout << "input s" << '\n';
        std::cin >> s;

        long long sum = 0;
        long long min = 1L << 32U;
        long long max = -1;

        long long sum2 = 0;
        long long min2 = 1L << 32U;
        long long max2 = -1;

        int ts = 10;
        for (int i = 0; i < ts; i++) {
            long long t = run(r, s);
            long long t2 = run_alloc(r, s);

            if (t < min) min = t;
            if (t > max) max = t;
            sum += t;

            if (t2 < min2) min2 = t2;
            if (t2 > max2) max2 = t2;
            sum2 += t2;
        }
        double avg = ((double)sum) / ts;
        double avg2 = ((double)sum2) / ts;
        printf("Min : %lld, max : %lld, average : %f\n", min, max, avg);
        printf("Min2 : %lld, max2 : %lld, average2 : %f\n", min2, max2, avg2);

        std::cout << "input 0 to exit, 1 to continue" << '\n';
        std::cin >> shouldRun;
    }
}
Enter fullscreen mode Exit fullscreen mode

Some other changes I made to the code

  • used std::chrono for measuring time because its cross-platform while the method you used was not

  • used std::iostream to handle inputing values at startup time instead of using main() arguments

Edit:made some corrections to the code

Edit2: also I think that allocation and deallocation shouldn't even be included in the time measurement since GC allocations and deallocations are likely done at startup time

Thread Thread
 
taqmuraz profile image
Taqmuraz

Sorry, I wasn't online. I appreciate your efforts, I am already reading your code

Thread Thread
 
taqmuraz profile image
Taqmuraz

Case with 'std::allocator' looks better than others, because it has abstraction over test case. I must test how fast it is and compare with other allocation tests. Probably I will update contents of my article. Thank you again for your code :)

Collapse
 
spekkiodancer profile image
spekkiodancer

I see more studios struggling to adopt Rust than to code with C++.

Thread Thread
 
taqmuraz profile image
Taqmuraz

Yes, Rust hysteria is a real thing.
When I say that C++ is bad, I don't say Rust is better.