Improve performance of molecular editor with many atoms (meshes)

Sangwon_Lee · January 27, 2021, 11:21am

Hi, I’m a postdoctoral researcher in material science. I want to make a molecular editor using Pand3D.
Sometimes a molecular system contains more than 10000 atoms (spheres), so a molecular editor must handle at least 10000 meshes.
I have read all your manuals and tested the given recommendations. However, it still hasn’t been solved because of some problems.

Instancing
It works well for thousands of atoms, but it slows down when there are more than 10,000 atoms.
flattenStrong()
It works well, but it is not suitable because the atom needs to be modified and moved (it’s an editor).
RigidBodyCombiner
It’s slower than when it’s not in use. In particular, it consumes a great deal of CPU and RAM resources.

My simple OpenGL software can draw 50,000 spheres without frame drop (by instancing). So I think Panda3D can do it also.
Is hardware installing the only way I can try?

Here is my test code:

import numpy as np

import direct.directbase.DirectStart
from panda3d.core import *
from direct.actor.Actor import Actor
from direct.filter.CommonFilters import CommonFilters

USE_RBC = False
N_SPHERES = 10000

# ========= MAKE SPHERE NODE ==============================================

# Original code from http://www.songho.ca/opengl/gl_sphere.html.
def make_sphere(stack_counts, sector_counts, radius=1):
    sector_step = 2 * np.pi / sector_counts
    stack_step = np.pi / stack_counts

    vertices = []
    normals = []
    tex_coords = []
    for i in range(stack_counts + 1):
        stack_angle = np.pi / 2 - i * stack_step
        xy = radius * np.cos(stack_angle)
        z = radius * np.sin(stack_angle)

        for j in range(sector_counts + 1):
            sector_angle = j * sector_step

            x = xy * np.cos(sector_angle)
            y = xy * np.sin(sector_angle)

            vertices += [x, y, z]
            normals += [x/radius, y/radius, z/radius]

            tex_coords += [j / sector_counts, i / stack_counts]

    # Make indices.
    indices = []
    for i in range(stack_counts):
        k1 = i * (sector_counts + 1)
        k2 = k1 + sector_counts + 1
        for j in range(sector_counts):
            if i != 0:
                indices += [k1, k2, k1 + 1]
            if i != (stack_counts - 1):
                indices += [k1 + 1, k2, k2 + 1]
            k1 += 1
            k2 += 1

    vertices_data = np.array(vertices, dtype=np.float32).reshape(-1, 3)
    normals_data = np.array(normals, dtype=np.float32).reshape(-1, 3)
    uvs_data = np.array(tex_coords, dtype=np.float32).reshape(-1, 2)
    indices = np.array(indices, dtype=np.uint32)

    data = {
        "vertices": vertices_data,
        "normals": normals_data,
        "texcoords": uvs_data,
        "indices": indices,
    }

    return data


# Generate custom sphere.
data = make_sphere(stack_counts=50, sector_counts=50)

array_format = GeomVertexArrayFormat()
# Same as GeomVertexArrayFormat.get_v3n3t2().
array_format.addColumn("vertex", 3, Geom.NTFloat32, Geom.CPoint)
array_format.addColumn("normal", 3, Geom.NTFloat32, Geom.CNormal)
array_format.addColumn("texcoord", 2, Geom.NTFloat32, Geom.CTexcoord)

vertex_format = GeomVertexFormat(array_format)
vertex_format = GeomVertexFormat.registerFormat(vertex_format)

vertex_data = GeomVertexData("sphere", vertex_format, Geom.UHStatic)

n_rows = len(data["vertices"])
vertex_data.setNumRows(n_rows)

vertex = GeomVertexWriter(vertex_data, "vertex")
normal = GeomVertexWriter(vertex_data, "normal")
texcoord = GeomVertexWriter(vertex_data, "texcoord")

for v in data["vertices"].tolist():
    vertex.addData3(*v)

for v in data["normals"].tolist():
    amount = 0.3
    v += np.random.uniform(-amount, amount, 3).astype(np.float32)
    v = v / np.linalg.norm(v)
    normal.addData3(*v)

for v in data["texcoords"].tolist():
    texcoord.addData2(*v)

primitive = GeomTriangles(Geom.UHStatic)

for v in data["indices"]:
    primitive.addVertex(int(v))
primitive.closePrimitive()

geom = Geom(vertex_data)
geom.addPrimitive(primitive)

node = GeomNode("gnode")
node.addGeom(geom)

# ========= END OF MAKE SPHERE NODE ==============================================

# Convert to NodePath.
# This is used for instancing.
node_path = NodePath(node)

myMaterial = Material()
# Make this material shiny.
myMaterial.setShininess(125.0)
myMaterial.setSpecular((0.8, 0.8, 0.8, 1))

node_path.setMaterial(myMaterial)
molecule_node = render.attachNewNode("molecule")

if USE_RBC:
    rbc = RigidBodyCombiner("rbc")
    rbcnp = NodePath(rbc)
    rbcnp.reparentTo(molecule_node)
else:
    rbcnp = molecule_node

for i in range(N_SPHERES):
    placeholder = rbcnp.attachNewNode("shpere-placeholder")
    placeholder.setPos(*np.random.uniform(low=-500, high=500, size=3))
    placeholder.setScale(np.random.uniform(low=5, high=70))
    placeholder.setColor(*np.random.uniform(size=3), 1)
    node_path.instanceTo(placeholder)

if USE_RBC:
    rbcnp.node().collect()

# Position the camera.
base.cam.setPos(0, -3000, 0)

# Now create some lights to apply to everything in the scene.
# Create Ambient Light.
ambientLight = AmbientLight('ambientLight')
ambientLight.setColor((0.7, 0.7, 0.7, 1))
ambientLightNP = render.attachNewNode(ambientLight)
render.setLight(ambientLightNP)

# Directional light.
directionalLight = DirectionalLight('directionalLight')
directionalLight.setColor((1, 1, 1, 1))

directionalLightNP = render.attachNewNode(directionalLight)
# This light is facing forwards, away from the camera.
directionalLightNP.setHpr(50, -30, 50)
render.setLight(directionalLightNP)

# Add shadow caster.
directionalLight.setShadowCaster(True, 1024, 1024)
directionalLight.getLens().setFilmSize(1000, 1000)
directionalLight.getLens().setNearFar(-500, 500)
#directionalLight.showFrustum()

render.setShaderAuto()

# Apply SSAO.
# Note that you need to do lots of tweaking to the parameters to get this
# filter to work for your particular situation.
base.camLens.setNearFar(100, -1000)
filters = CommonFilters(base.win, base.cam)
filters.setAmbientOcclusion(numsamples=16, strength=0.1, radius=0.015, amount=1)
filters.setBloom(intensity=0.3)

base.run()

Epihaius · January 27, 2021, 12:31pm

If all of the atom models can share the same RenderState, then you will be best off with combining all of them together into one big mesh (or a minimal number of meshes, for the sake of culling).

It will still be possible to transform parts of that mesh (the selected atoms) using calls to GeomVertexData.transform_vertices. Although this will be a bit slower than a simple NodePath.set_pos call for example, it should still be fast enough.

You’re welcome to consult my replies to topics dealing with similar performance issues (search for “Performance impact on thousands of Nodes” and “Can this code that generates ground tiles be optimized at all” in particular) and if you’re struggling to write up the needed code, let me know if I can help .

Sangwon_Lee · January 27, 2021, 12:35pm

There is no error, it runs well but slow. I think the problem is there are too many Geom objects (I’m just guessing because I don’t know the inside of the Panda3D).

I don’t think Python is the bottleneck. There are two pieces of evidence.

My OpenGL program (fast run with 50,000 meshes) is also written in Python with PyOpenGL.
flattenStrong() also gives a high frame rate.

Simulan · January 27, 2021, 3:20pm

It’s just a guess, but try turning off the shadow caster. Also, thanks for the interesting post!

# Add shadow caster.
directionalLight.setShadowCaster(True, 1024, 1024)
directionalLight.getLens().setFilmSize(1000, 1000)
directionalLight.getLens().setNearFar(-500, 500)
#directionalLight.showFrustum()

serega-kkz · January 27, 2021, 3:43pm

Perhaps you’d better have your own lightweight class for storing states. Then you can send this data to the shader. In panda, classes suffer from redundant logic.

rdb · January 27, 2021, 11:24pm

Hi, welcome to the community!

If you are comfortable with using a development build from the master branch of Panda3D, it includes support for automatic hardware instancing via the InstancedNode class. It does require a custom shader, but I’m happy to provide additional instructions.

An alternative, far more efficient method would be to just fake it. Instead of thousands of spheres, you would use a single Geom and GeomPoints with many points and configure them to render as point sprites. Using a shader, you could shade them as though they were spheres by calculating the normal vector for lighting based on the distance to the center of the card. This method may allow you to have hundreds of thousands of “spheres” with very low performance cost.

I am curious, though, about the specific method you used to implement it in OpenGL, and I may be able to advise you how to do the same thing with Panda3D.

Sangwon_Lee · January 28, 2021, 12:14am

Unfortunately, the shadow caster or filters did not affect performance. Thanks!

Sangwon_Lee · January 28, 2021, 12:23am

I think it works for me because I’m not use textures (or different RenderState) for the molecular structure.
I tested your code at

(but with tens of thousands of cubes) and I could see it working well.
I think based on this code, I need to develop a class that can handle shapes such as spheres and capsules easily (for the component of molecules).

Thank you for the advanced recommendation!

Sangwon_Lee · January 28, 2021, 12:51am

Hi rdb, nice to see you.

I think all your suggestions can be applied to my problem.
But I’m not an expert, so I don’t think it’s a good idea to use low-level API from the beginning (error-prone and hard to extend). For example, if I add a capsule shape (for bonds in a molecule), I will probably have to modify the shader.
I would like to use the excellent basic functions provided by Panda3D as possible.

My OpenGL program is very primitive. You can guess what I did with the GLSL code below. v_offset is the atomic positions.

#version 330 core

layout(location = 0) in vec3 v_position;
layout(location = 1) in vec3 v_uv;
layout(location = 2) in vec3 v_normal;
layout(location = 3) in vec3 v_offset;

out struct F {
    vec3 normal;
    vec3 pos_world;
} f;

uniform mat4 PVM;
uniform mat4 model;

void main()
{
    gl_Position = PVM * vec4(v_position+v_offset, 1.0);
    f.pos_world = vec3(model * vec4(v_position, 1.0)); 
    f.normal = v_normal;
}

glDrawElementsInstanced(
    GL_TRIANGLES,
    len(self.indices),
    GL_UNSIGNED_INT,
    None,
    len(self.offsets),
)

Thank you for your kind reply!

Sangwon_Lee · February 1, 2021, 9:08am

I wrote a new code that is based on your recommendation. Now I can draw more than 100,000 spheres without frame drop. Thank you!

Epihaius · February 1, 2021, 10:50am

Ah, that’s wonderful! Glad I was able to help .

panda3dmastercoder · February 1, 2021, 7:31pm

Can you send the code snippet?

Sangwon_Lee · February 2, 2021, 12:39am

Sure, here is the full script.

import numpy as np

from panda3d.core import *
from direct.filter.CommonFilters import CommonFilters
from direct.showbase.ShowBase import ShowBase

N_SPHERES = 15000

load_prc_file_data("", """
    win-size 1600 900
    window-title High performance spheres
    framebuffer-multisample 1
    multisamples 4
""")

base = ShowBase()
render = base.render
render.setAntialias(AntialiasAttrib.MMultisample)

# Original code from http://www.songho.ca/opengl/gl_sphere.html.
def make_sphere(stack_counts, sector_counts, radius=1):
    sector_step = 2 * np.pi / sector_counts
    stack_step = np.pi / stack_counts

    vertices = []
    normals = []
    tex_coords = []
    for i in range(stack_counts + 1):
        stack_angle = np.pi / 2 - i * stack_step
        xy = radius * np.cos(stack_angle)
        z = radius * np.sin(stack_angle)

        for j in range(sector_counts + 1):
            sector_angle = j * sector_step

            x = xy * np.cos(sector_angle)
            y = xy * np.sin(sector_angle)

            vertices += [x, y, z]
            normals += [x/radius, y/radius, z/radius]

            tex_coords += [j / sector_counts, i / stack_counts]

    # Make indices.
    indices = []
    for i in range(stack_counts):
        k1 = i * (sector_counts + 1)
        k2 = k1 + sector_counts + 1
        for j in range(sector_counts):
            if i != 0:
                indices += [k1, k2, k1 + 1]
            if i != (stack_counts - 1):
                indices += [k1 + 1, k2, k2 + 1]
            k1 += 1
            k2 += 1

    vertices_data = np.array(vertices, dtype=np.float32).reshape(-1, 3)
    normals_data = np.array(normals, dtype=np.float32).reshape(-1, 3)
    uvs_data = np.array(tex_coords, dtype=np.float32).reshape(-1, 2)
    indices = np.array(indices, dtype=np.uint32)

    data = {
        "vertices": vertices_data,
        "normals": normals_data,
        "texcoords": uvs_data,
        "indices": indices,
    }

    return data

# Generate custom sphere.
data = make_sphere(stack_counts=20, sector_counts=20, radius=1)

array_format = GeomVertexArrayFormat()

array_format.addColumn("vertex", 3, Geom.NTFloat32, Geom.CPoint)
array_format.addColumn("normal", 3, Geom.NTFloat32, Geom.CNormal)
array_format.addColumn("texcoord", 2, Geom.NTFloat32, Geom.CTexcoord)
array_format.addColumn("color", 4, Geom.NTUint8, Geom.CColor)

vertex_format = GeomVertexFormat(array_format)
vertex_format = GeomVertexFormat.registerFormat(vertex_format)

vertex_data = GeomVertexData("sphere", vertex_format, Geom.UHStatic)
n_verts = len(data["vertices"])
n_rows = N_SPHERES * n_verts

vertex_data.uncleanSetNumRows(n_rows)
data_array = vertex_data.modify_array(0)
memview = memoryview(data_array).cast("B").cast("f")

# Set indices for primitive.
n_inds = len(data["indices"])
primitive = GeomTriangles(Geom.UHStatic)
primitive.setIndexType(Geom.NTUint32)
tris_array = primitive.modify_vertices()
tris_array.uncleanSetNumRows(N_SPHERES * n_inds)
prim_memview = memoryview(tris_array).cast("B").cast("I")

# 3 + 3 + 2 + 1.
strides = 9

for i in range(N_SPHERES):
    color = [
        np.random.randint(0, 255),
        np.random.randint(0, 255),
        np.random.randint(0, 255),
        255
    ]

    # Convert 4 np.uint8 values to 1 np.float32 value.
    color_data = np.tile(
        np.array(color, dtype=np.uint8),
        reps=(n_verts, 1),
    ).view(np.float32)

    scale = np.random.uniform(0.1, 1)
    pos = np.random.uniform(-10, 10, size=(1, 3)).astype(np.float32)

    all_data = np.concatenate([
            scale*data["vertices"]+pos,
            data["normals"],
            data["texcoords"],
            color_data
        ],
        axis=1,
    ).copy()

    data_memview = memoryview(all_data).cast("B").cast("f")
    memview[i*n_verts*strides : (i+1)*n_verts*strides] = data_memview

    # Apply some transformation.
    #scale_mat = Mat4.scaleMat(1, 0.7, 2.0)
    #vertex_data.transformVertices(scale_mat)

    # Set indices for primitive.
    idx = (data["indices"] + i*n_verts).astype(np.uint32)
    data_memview = memoryview(idx).cast("B").cast("I")
    prim_memview[i*n_inds : (i+1)*n_inds] = data_memview

geom = Geom(vertex_data)
geom.addPrimitive(primitive)

node = GeomNode("gnode")
node.addGeom(geom)

node_path = render.attachNewNode(node)
node_path.setPos(0, -100, 100)
node_path.setScale(100)

myMaterial = Material()
myMaterial.setShininess(125.0)
myMaterial.setSpecular((0.8, 0.8, 0.8, 1))
node_path.setMaterial(myMaterial)

# Position the camera.
base.trackball.node().setPos(0, 4000, -100)
base.camLens.setFov(70)

# Create Ambient Light
ambientLight = AmbientLight('ambientLight')
ambientLight.setColor((0.7, 0.7, 0.7, 1))
ambientLightNP = render.attachNewNode(ambientLight)
render.setLight(ambientLightNP)

# Directional light 02
directionalLight = DirectionalLight('directionalLight')
directionalLight.setColor((1, 1, 1, 1))

directionalLightNP = render.attachNewNode(directionalLight)
# This light is facing forwards, away from the camera.
directionalLightNP.setHpr(50, -30, 50)
render.setLight(directionalLightNP)

# Add shadow caster.
directionalLight.setShadowCaster(True, 1024, 1024)
directionalLight.getLens().setFilmSize(1000, 1000)
directionalLight.getLens().setNearFar(-500, 500)

render.setShaderAuto()

# Apply SSAO.
base.camLens.setNearFar(100, -500)
filters = CommonFilters(base.win, base.cam)
filters.setAmbientOcclusion(numsamples=16, strength=0.01, radius=0.015, amount=1)

base.run()

tungsten · February 2, 2021, 5:46am

When i try to run the code I get the following error

Traceback (most recent call last):
  File "spheres.py", line 90, in <module>
    memview = memoryview(data_array).cast("B").cast("f")
AttributeError: 'memoryview' object has no attribute 'cast'

i am using panda3d 1.10.6
and python 3.8.5

Sangwon_Lee · February 2, 2021, 6:28am

I’m not sure what is the problem. The official python document shows that the memoryview object has member function cast().

https://docs.python.org/3.8/library/stdtypes.html?highlight=memoryview#memoryview

Thaumaturge · February 2, 2021, 7:30am

Are you sure that you’re running Python 3.8? It looks like older versions of Python may not have that method, and I wonder whether you’re not perhaps inadvertently using such a version.

To test this, what output is printed if you add the following to the script and run it?

import sys
print (sys.version)

tungsten · February 3, 2021, 5:38am

the problem is… i’m an idiot.
i do this all the time. i ran it with python on the command line and not python3

then when i check my version of python, i run python3. lol.

it works ! sorry for the noise.

i really have to find a way to python default to python3. i guess i should set up an alias.

panda3dmastercoder · February 3, 2021, 5:53am

Did you install python from its default page? If you did, you will get an application called IDLE. You can code in it and it runs the code according to the version you have installed

tungsten · February 3, 2021, 5:55am

i’m running on linux.
it’s not hard, you type “python” for python2 and “python3” for python3.

i constantly leave off the 3…

the other way to handle this is put

#!/usr/bin/python3

as the first line of the file and make the script executable. However at this point i need python to default to python3 because i very rarely use python2 any more.

panda3dmastercoder · February 3, 2021, 5:59am

You can make a program in python called executor.py.
The code will be:

import os

pyfile = "path/to/file.py"
os.system("python3 " + pyfile)

Then, you can run this with python because os was supported in older versions.

You can also do:

import sys
import os

os.system("python3 " + sys.argv[1])

And in the command line, type: python executor.py path/to/file.py