flatbield

flatbield@beehaw.org · edit-2 7 months ago

I think it depends on how good your Numpy build is. Lot of Numpys are not that well built so Numba seems to help there too in that case.

For a python library to be fast it needs to be compiled for your specific hardware, vectorized, with fast math, and auto parallel. Most are probably not unless you build them youself.

flatbield@beehaw.org · edit-2 8 months ago

People use Python a lot as a Matlab, Excel/VBA, or R alternative. That was my use for many years. Some of these are compute focused problems and if the dataset is large enough and the computations complex enough then speed can be an issue.

As far as loading packages and printing. Who cares. These are not computationally intensive and are typically IO bound.

flatbield@beehaw.org · 8 months ago

Yes, I hate indentation as structure but I hate tracking brackets even more.

flatbield@beehaw.org · edit-2 8 months ago

Same for me. I have used Python for most things since the late 1990s. Love Python. Have always hated the poor performance… but in my case mostly it was good enough. When it was not good enough, I wrote C code.

Python is good for problems where time to code is the limiting factor. It sucks for compute bound problems where time to execute is the limiting factor. Most problems in my world are time to code limited but some are not.

Python compute performance has always sucked.

flatbield@beehaw.org · 8 months ago

Just remember that an optimized C program will run about 100x faster then a similar Python program in a compute bound problem. So yes Python is slow but often good enough.

flatbield@beehaw.org · 9 months ago

Reason you do not need Typescript for Python is that it is a real language. JavaScript was a crap extension language that people have been trying to get around forever with preprocessors…

As far as needing types… One of the big advantages of Python is not needing types. I have used Python for 25 years and never used types or missed them.

What I do occasionally miss is speed. That is a combination of lack of typing and crap implementations and there are various ways around it.

flatbield@beehaw.org · 1 year ago

Python!

flatbield@beehaw.org · 1 year ago

ssh plus sshd is available already or can easily be installed on any Linux system. It can do many things: Remote terminal sessions and remote login (for admin for example), file transfer, directories can be mounted as shares too over ssh, remote execution, you can also even do tunneling, graphical application UI forwarding, and even implement VPNs via ssh. Every Linux admin knows about and uses ssh all the time.

It is interesting a lot of people forget you can use any Linux box as a file server via SSH, in addition do a lot of other things. I also have an ssh app on my cell phone, and can just mount the file system their on Linux too. There are clients for SSH for Windows also.

flatbield@beehaw.org · 1 year ago

I think he could have meant nano. :)

flatbield@beehaw.org · 1 year ago

Yes, you can sync between two on devices anywhere in the world as long as a connection path can be found.

The downside of this is that both devices have to be on. If not on the LAN it may go though some unknown gateways too which makes me nervous (though it should be all encrypted). It can take some time too for the devices to find each other and then do the transfer (even on the LAN).

Some people place syncthing on their NAS so it is the always on device. Also if you do not want your connection to go through other peoples bridges then you can disable that feature (and loose the global WAN transfer capability), or you can put up your own bridge in a VPS on the WAN.

I am no expert on this. For me I use syncthing only sometimes and only on my LAN. Mostly I use SSH, Nextcloud, or Bitwarden Send myself. I’d like to play more with some of the other options though. Seafile or placing Send on my VPS for example seems interesting to me.

flatbield@beehaw.org · 1 year ago

Commercial kitchens sometimes have blast chillers and blast freezers. Some of the cooking shows use them.

flatbield@beehaw.org · edit-2 1 year ago

Pick and choose. I actually like most of the Python Doc. Learned Python originally from their tutorial. Then learned key parts of the library. So I like those two documents. The other docs though can be deep. The language reference for example. Never read that except parts.

I also had a book about Tkinter and another about Win32 Python programming. So I learned from those too. My first app was a data acquisition too with a Tkinter GUI. So I think a few books are good but maybe people do not do that now.

For me, learned Python in a day mostly from their tutorial and the standard library reference, then it took me the next 9 months to actually get good at it. Then still learning stuff 25 years later. I did have an advantage. I had been programming for 20 years before I learned Python and had used half a dozen other languages.

flatbield@beehaw.org · 1 year ago

LibreOffice already has Python support along with some other choices.

flatbield@beehaw.org · edit-2 1 year ago

Has anyone been able to get significant acceleration with Nuitka? How much and how?

My experience has been that Nuitka is more of a deployment tool not a speed tool. Asking because article says otherwise. Wondering if they know something I do not.

flatbield@beehaw.org · 1 year ago

Yes, there are a lot of assumptions, incorrect information, or at least miss-leading stuff out there. So I am always interested in learning more about easy and hard ways to make things better. In fact for most things I do, Python is fast enough, but sometimes it is not.

The things I find miss-leading about what people often say about Python are that it is not that slow, and that you can always just use a library like numpy or something similar to solve speed issues. I found both to be more or less untrue in the sense of getting C like speeds. On my code, Python was indeed slow, like 1% of C speed. The surprising thing for me was using numpy helps a lot but not as much as you think. I only got to 5 to 10% of of C speed with numpy. This is because libraries are often generically compiled and to get good speed you really have to have C code that is compiled for your specific hardware with vectorization, autoparallel, and fast math at least. So generic libraries just are not going to be that fast. Another one people push is using GPUs. That also is not really very effective unless you have a very expensive card and most notably a dedicated GPU card design just for that or an array of them. The GPU performance of my workstation is significantly less then throughput of my CPU. There are hardware limitations too that are interesting. My AMD Rizen 7 based workstation would have twice the speed if I had 4 port memory rather then two port memory which is a lot more common since fully optimized code is memory IO bound at about 1/2 the CPU throughput. This must be why AMD Rizen Threadrippers seem to use 4 port memory.

There are ways around a lot of this. For example using numba can be incredible. Similarly writing your owe C code and carefully compiling it too. The careful compile is critical. Maybe one could do the same with some stock libraries, carefully compile them. Lot of the other stuff people talk about just does not work very well in terms of speed or effort such as pypy, cython, nuitka, etc.

flatbield@beehaw.org · edit-2 1 year ago

Thanks. I love Python and have used since about 1998. The two areas where I have always found a little lacking is a) creating and app that you can actually give some one, b) computational speed when needed. So I am always interested in those two areas. A year or so ago I looked at a lot of the tools that that the article described, but there are one or two that were mentioned that are new to me. I think I will have to try when I get some time.

flatbield@beehaw.org · edit-2 1 year ago

For what it is worth, my take on the article. A really over whelming list. Nice read through but for those that are interested, the most useful components that were discussed were probably:

CPython. Of course, that is what we all use.
PyPy. This is an interesting acceleration if you do not need things like numpy and a lot of other common libraries. The acceleration is maybe 9X in my experience. However C code or good use of numba can often get 100X.
MicroPython. Not tried but seems cool if you need a really small Python. Presumably not exactly compatible because of missing libraries.
Pyston. Have not tried but seemed interesting from their discussion of the “pyston_lite_autoload” thing. Have no idea if it is useful.
Cython. Lot of hoopla about this. Good software but my experience is that you do not get much for speedup until you statically declare stuff. When I did that I got about 24X, then playing with prange and openmp features I got 75X. Not a bad speed up. However, it does not look so good when compared with writing C code or using numba. Mainly because those speedups using other methods seem to be easier to get and I got as large as 121X when using them instead. Cython is just complex to use and then does not get your full entitlement with respect to speed, or at least that was my experience.
Numba. Numba and Numpy used in the correct situations can give 121X speed improvements and performance similar to parallized and vectorized C code. Actually for some reason it was faster then my C code. This combo is super. Everyone should know about Numba.
Nuika. Very handy deployment tool. My experience same speed as CPython basically. Well I got about a 9% improvement which is almost nothing. So do not be fooled into thinking that it will give you big speed improvements. A very nice tool as part of your packaging and deployment process.

Since I talked about C code. There are three ways to integrate C code into python: ctypes, CFFI, and using the standard C extension method. I found ctypes to be about 107X, CFFI 108X, and the standard method about 112X for my code on my hardware with code which was using autoparallel and autovectorize, fastmath, and maybe other settings. My point, the speeds of these are about the same though the standard method is just a little faster. So you can really pretty much do whichever is easier.

Anyway my thoughts. Hope they make some sense.

flatbield@beehaw.org · 1 year ago

They could probably have gotten similar results by using a combination of numpy and numba. They could also have just written a C extension which they basically did. The key is to get the final code to run both in parallel and vectorize on your exact hardware. So there are compiler flag choices too if your using C. Nice though.