Performance improvements

As I reported some weeks ago, I’m working in the code optimization. To do it I implemented the following changes:

  1. Now all the data (including the output one) is sorted before compute the interactions
  2. Hence the permutations vectors are not needed anymore in the interactions stage
  3. Some useless output data have been removed
  4. The registers pressure have been optimized

Let’s say goodbye to more than 10% of the simulation time!

For the moment the optimized version can be found in the optimization branch of the git repository.

P.S. I performed this work with CodeXL by AMD, basically due to NVidia suddenly decided to remove the OpenCL support in the profiler in CUDA 5.
NVidia is expending a lot of resources trying to destroy OpenCL, while AMD is beating them in the hardware side (the real world).
I just hope NVidia people change it’s direction before they reach the final which are deserving.

Also CodeXL is free!

New input/output interface

A new input/output interface is coming… Following it’s features:

  1. An easy way to implement new loaders/savers
  2. Improved XML input
  3. Capability to save the XML case definition (which allows to continue the simulation from the last saved file)
  4. H5Part cleared
  5. Improved the post-processing with Paraview (including tools to fix the errors with the locale options)

Actually it is in development in the input_output branch, but hopefully it will be merged soon


Initial release

The initial release has been uploaded! You can get it from GitHub typing:

I was waiting like a dumb for the CUSL development stage start in order to upload the initial release, just because the development stage starts when the project is officially accepted (there are not a kick off).

Anyway, it will be awesome!