Why use `python -m pip`

Hello again. In anticipation of the start of a new thread on the Machine Learning course, we want to share a translation of an article that is quite indirectly related to ML, but will certainly be useful to our blog subscribers.

Mariatta – developer from Canada, asked on Twitter about python -m pipasking to talk about this idiom and explain the principle of its work.

I recently learned that you need to write python -m pip instead of the usual pip install, but now I can’t remember who I heard from. Probably from @brettsky or @zooba. Do any of you have a blog post so I can share it with readers?
– Mariatta (@mariatta) October 29, 2019 (https://twitter.com/mariatta/status/1189243515739561985?ref_src=twsrc%5Etfw)

I'm not sure what exactly I told Mariatta about python -m pip, but there is every chance that it was me, because I asked that this instruction for installing packages using PyPI be written exactly this way from 2016. So this article should clarify what is python -m pip and why you should use it at startup pip.

What is python -m pip?

To start, python -m pip pip executes with the version of Python you specified for the python statement. In this way, /usr/bin/python3.7 -m pip means you follow pip for the interpreter located in /usr/bin/python3.7. You can read the documentation about the flag. -mif you don’t know how it works (by the way, it is extremely useful).

Why use python -m pip instead of pip / pip3?

You can say, “Okay, but why can't I just take advantage pipby running the command pip? ”The answer will be:“ Yes, but you will have less control over it. ” I will explain what “control less” means by example.

Suppose I have two versions of Python installed, for example, Python 3.7 and 3.8 (this is very common among people who work on Mac OS or Linux, not to mention the fact that you probably wanted to play with Python 3.8, and you already had Python 3.7). So if you enter pip in the terminal, for which Python interpreter do you install the package?

Without more detailed information you will not know the answer. First you will need to understand what lies in the PATH, i.e. / usr / bin comes first or / usr / local / bin (which are the most common places to install Python, by the way usually / usr / local / comes first). So, you remember where you installed Python 3.7 and 3.8 and that they were different directories, and you will know what came in PATH first. Suppose you installed both manually, perhaps Python 3.7.3 was already preinstalled on your system, and you installed Python 3.7.5. In this case, both versions of Python are installed in / usr / local / bin. Can you tell me now what you are attached to now pip?

You do not know the answer. If you do not know when each version was installed, and understand that the latest version pip was recorded in / usr / local / bin / pipbut you don’t know which interpreter will be used for the command pip. Now you can say: “I always install the latest versions, so that means that Python 3.8.0 will be installed last, because it is newer than, say, 3.7.5.” Well, but what happens when Python 3.7 comes out .6? Your pip It would be used not from Python 3.8, but from Python 3.7.

When you use python -m pip with the specific python interpreter you need, all the ambiguity disappears. If i write python3.8 -m pipI know exactly which pip will be used and that the package will be installed for Python 3.8 (the same would be if I specified python3.7).

If you use Windows, then you have an additional incentive to use python -m pipas it allows pip update yourself. Mostly because pip.exe considered running when you write pip install –upgrade pip. At this point, Windows will not let you reinstall pip.exe. However if you do python-m pip install –upgrade pipyou work around this problem as it starts python.exe, but not pip.exe.

And what happens when I am in an activated environment?

Usually, when I explain the essence of this article to people, there is always someone who will say: "I always use the virtual environment, and this does not apply to me." Well, for starters, it would ALWAYS be good to use a virtual environment! (I will explain why I think so in one of my next articles!) But to be honest, I would still insist on using python -m pip, even if, strictly speaking, this is not necessary.

Firstly, if you use Windows, you will still want to use python-m pipso that you can update in your environment pip.

Secondly, even if you use a different operating system, I would say that you still need to use python-m pipbecause it will work regardless of the situation. He will warn you about a mistake if you forget to activate the environment, and any person who watches you will adopt the best practices. And personally, I do not think that saving 10 keystrokes is a significant price for not using good practice. And this command will also help you prevent errors when writing automation scripts that will perform obviously incorrect operations if you forget to activate the environment.

Personally, when I use any tool whose work depends on which interpreter it starts up, I always use -m, regardless of whether the virtual environment is activated or not. It's always important for me to understand which Python interpreter I'm using.

ALWAYS use the environment! Do not put everything in a row in the global interpreter!

When we talk about how to avoid confusion when installing in Python, I want to emphasize that we should not install anything in the global Python interpreter when we work locally (containers are a completely different matter)! If this is the pre-installed Python of your system, then if you install some incompatible version of the library that your OS relies on, then you will actually break the system.

But even if you install a copy separately for yourself python, I still strongly recommend not putting it directly in local development. Ultimately, in your projects, you will use various packages that may conflict with each other, and you will not have a clear idea of ​​the dependencies within your projects. It is much better to use environments to isolate individual projects and tools for them from each other. The Python community uses two types of environments: virtual environments and conda environments. There is even a special way to install Python tools in isolation.

If you need to install a tool

For a stand-alone installation of the tool, I can recommend using pipx. Each tool will receive its own virtual environment, so as not to conflict with others. Thus, if you want to have only one installation, for example, Black, you can work without accidentally breaking your only installation of mypy.

If you need an environment for the project (and you do not use conda)

When you need to create an environment for a project, I personally always turn to venv and virtual environments. She is included in stdlib Python is therefore always available with python-m venv (unless, of course, you are using Debian or Ubuntu, in which case you may need to install the package python3-venv apt) A bit of history: I actually deleted the old team pyvenvPython installed to create virtual environments with venv, for the same reasons why you need to use python -m pip instead pip. That is, it is not clear for which interpreter you created the virtual environment using the old command pyvenv. And remember that you do not need to activate the environment in order to use the interpreter contained in it, because .venv / bin / python works just as well as activating the environment and entering the command python.

Today, some developers still prefer virtualenvsince it is available in Python 2 and it has some additional features. Personally, I’m not very interested in additional features, and the availability of integrated venv means i don't need to use pipx for installation virtualenv on every machine. But if venv does not meet your needs and you want a virtual environment, then see if it offers virtualenv what you need.

If you use conda

If you use conda, you can use conda environments to get the same effect that virtual environments provided by venv. I am not going to go into whether you need to use conda or venv in your specific situation, but if you use conda, you know that you can (and should) create environments conda for your work, instead of installing everything in your system installation. So you can get a clear understanding of what dependencies your project has (and this is a good reason to use miniconda instead of a full anaconda, since the former is less than a tenth of the volume of the latter).

There are always containers

Working in a container is a way not to deal with the environment at all, since your entire “machine” will become a separate environment. Until you have installed Python in the container system, you should be able to safely make a global installation so that your container remains simple and straightforward.

I repeat that you really understand the essence …
Do not install anything in your global Python interpreter! Always try to use the environment for local development!

I can no longer say how many times I had to help someone who thought that pip installed in one Python interpreter, but actually installed in another. And this immeasurable amount also applies to those moments when people broke the entire system or wondered why they could not install something that contradicted some other thing that they had set earlier for another project, etc. due to the fact that they did not bother to configure the environment on their local machine.

Therefore, so that you and I can sleep peacefully, use python-m pip and try to always use the medium.

Similar Posts

Leave a Reply

Your email address will not be published. Required fields are marked *