YTEP-0037: Code Styling #14

neutrinoceros · 2020-05-19T16:05:27Z

In this YTEP, I propose we start using black and isort to automate code formatting as part of the linting process with flake8.

matthewturk

I like this a lot. The only thing that I think would help with this is a brief description of both how (or if) black handles cython code, and how we could manage existing PRs that have not been merged yet. Can we just run black on the PR?

source/YTEPs/YTEP-0037.rst

matthewturk · 2020-05-19T18:04:03Z

source/YTEPs/YTEP-0037.rst

+A proof of concept for this is `#2598 <https://github.com/yt-project/yt/pull/2598>`_,
+where CI builds are run correctly across all tested python versions (3.6, 3.7, 3.8).
+
+A serious counter-argument to applying black is that it implies messing up with ``git


One thing we have done in the past in faked the authorship of those commits as something like "svn-convert" or something; I forget the specific one we did now. That would at least signal they weren't to be regarded.

That would work for me. I don't want my GitHub profile to show ~20k lines of code "contributed", hiding the real contributions :)

matthewturk · 2020-05-19T18:04:26Z

source/YTEPs/YTEP-0037.rst

+88, as its authors claim it reduces the total number of lines by some 10% (as compared
+to enforcing 80).
+
+In first drafting the PR linked above, I chose a line-lenght of 100, so as to minimize


I think we should go with something in line with either stdlib or pandas. I'd be fine with 79 or 80.

Personally I would align with black’s default, which is also compatible with Raymond Hettinger’ style of «90-ish». It makes for much shorter files (according to black).

That'd work for me.

I'm ok with 88.

btw I double checked and actually pandas uses 88, not 80 as I previously reported.

Co-authored-by: Matthew Turk <[email protected]>

neutrinoceros · 2020-05-19T22:21:35Z

how (or if) black handles cython code

it doesn't. I have no opinion on what should be done with the Cython part of the library : either leave it roam free or impose max line width on a self-discipline basis. I'm adding this to the YTEP.

neutrinoceros · 2020-05-19T22:36:58Z

how we could manage existing PRs that have not been merged yet. Can we just run black on the PR?

One way to do it would be to ask PRs authors to:

rebase to the target branch
run black
force update the PR

though I'm nowhere near confident enough in that it would not create merge conflict, and we definitely don't want to put contributors in that situation.

I think a safer approach would be to provide a script/command line to apply black only to the lines they've updated, though I need to make some research on how to write this script.
@matthewturk

…, fix the image insertion

neutrinoceros · 2020-05-20T10:13:44Z

@matthewturk it took a bit of learning in the subtleties of git merge but I think I came up with a satisfying startegy to handle the transition with existing PRs.

brittonsmith

I think this is a very good proposal and everything you have down here I think it worth doing. In my opinion, one additional major point of discussion is when this would all be done, e.g., just before/after a yt-4.0 release. The how is also important, specifically whether it would be done all at once or piecemeal and possibly in a group event like a PR triage.

brittonsmith · 2020-05-20T10:29:01Z

source/YTEPs/YTEP-0037.rst

+
+Moreover, ``isort`` is configurable so that it allows for the definition of custom
+"sections" within import statements. This can be use to isolate imports from ``unyt``,
+which falls somewhere in between the default sections "third party" (external


For what it's worth, I think we can treat unyt as a regular third party module.

This is up for discussion, of course ! I have no strong opinion myself.

source/YTEPs/YTEP-0037.rst

Co-authored-by: Britton Smith <[email protected]>

neutrinoceros · 2020-05-20T11:08:16Z

In my opinion, one additional major point of discussion is when this would all be done, e.g., just before/after a yt-4.0 release.

Absolutely. My main concern with this is to avoid making @matthewturk 's work harder with the yt-4.0 -> master merge.

The how is also important, specifically whether it would be done all at once or piecemeal and possibly in a group event like a PR triage.

The shorter the transition, the easier, so I think that most of the PRs could be merged in a very narrow time window (a day or two), provided the appropriate conditions. However, because we want to ensure that each step passes the tests, which typically takes a least an hour or two per step, I propose that prep steps be done separately, and the big one (blackening) happen on a meeting.

A possible roadmap:

pre meeting

settle on a maximal line length and the status of unyt ("second" or third party)
merge isort pass on the code base + CI check + doc
optional (needs approval) merge Internally import from unyt yt#2597
merge (needs tweaking) Add pyproject.toml (YTEP0037 3/6) yt#2598
rebase blackening PR on the target branch (yt-4.0 ?) and prepare it with agreed line length
provided all CI check pass and the PR is reviewed & approved, this goes to a PR triage meeting

on the meeting

merge blackening + manual fixups + CI checks + doc
signal to open PR authors that they should apply black (see transitioning strategy)

can be done later

neutrinoceros · 2020-05-20T12:07:25Z

@brittonsmith do you think it’s appropriate to include the roadmap in the YTEP directly ?

munkm · 2020-05-20T15:24:16Z

I haven't gotten to look too deeply at the YTEP yet, but what do you think about making this ytep even more broad to encompass all of your code styling PRs (like the imports) and other future styling proposals? This would be a nice place for all of them!

neutrinoceros · 2020-05-20T15:42:00Z

@munkm the import stuff is already in here :)
I would gladly add other stylistic proposals if you guys want to add any. Right now I do not have more to contribute.

munkm · 2020-05-20T15:55:20Z

@munkm the import stuff is already in here :)

Ha! sorry about that then. Don't mind me!

neutrinoceros · 2020-05-20T16:08:54Z

@munkm actually you got me thinking there's an important point I haven't covered at all: docstrings ! It seems to me that black will never update them (other than unirformizing their delimiters to """), so if we want to apply the line-length limit there, we need an additional validator. Pandas happens to have a script just for that here
https://github.com/pandas-dev/pandas/blob/master/scripts/validate_docstrings.py

Since they're also using the numpy convention for dosctrings, I'm hoping the script could be fairly easily adapted for yt.

brittonsmith · 2020-05-21T09:17:58Z

@neutrinoceros, I'm certainly open to other opinions, but I do think the roadmap has a place in the ytep itself. I think the ytep should more or less contain all the major decisions on which consensus is sought.

neutrinoceros · 2020-05-21T09:56:31Z

@brittonsmith here you go. I also added the docstring stuff, and I also propose we use a flake8 plugin (https://github.com/PyCQA/flake8-bugbear) to enforce additional rules.

…es less than previous estimation !)

brittonsmith · 2020-05-25T09:01:22Z

source/YTEPs/YTEP-0037.rst

+.. _additional rules:
+**additional rules & flake8 pluggings**
+
+Since the oldest python version supported (as of yt-4.0) is 3.6, it means we can start


setup.py indicates that we still support Python 3.5. I don't have much problem increasing that to 3.6, but I think this warrants a discussion about whether it's better to do this pre or post releasing yt-4.0.

You're absolutely right, but there is room for interpretation here. We don't run tests for any version older that the latest 3.6.x, hence my phrasing.
The fstring experiment I'm running in yt-project/yt#2605 has some flaws anyway, since the tool (flynt) is fairly new.
This part of the YTEP can absolutely be postponed.

brittonsmith

Ok, this all looks good to me. Thanks for adding the roadmap section.

Independent of this being merged, I'd like to hear @matthewturk's thoughts about whether this should be done before or after a stable release of yt-4.0.

source/YTEPs/YTEP-0037.rst

matthewturk · 2020-06-22T16:28:57Z

@brittonsmith somehow I missed your question. I think we should do this before a stable 4.0 release. And, now that we've got 4.0 into the default branch, I think it might be time to finish up approving this.

neutrinoceros · 2020-06-22T17:01:43Z

So in order to resolve this here's what needs to be agreed on (with my own suggestion in parenthesis)

what's the maximal line-length we want to enforce ? (88)
should unyt be treated as second party or third party by the import sorter ? (second)

the first point is crucial. The second one is not, but we're currently sitting on a 1 VS 1 vote so I'm soliciting additional votes here.

Xarthisius · 2020-06-22T17:09:07Z

So in order to resolve this here's what needs to be agreed on (with my own suggestion in parenthesis)

what's the maximal line-length we want to enforce ? (88)

should unyt be treated as second party or third party by the import sorter ? (second)

the first point is crucial. The second one is not, but we're currently sitting on a 1 VS 1 vote so I'm soliciting additional votes here.

I'd vote for 88 and unyt as 3rd party.

munkm

I really love the idea of enforcing our style and adding checks to our CI to support them. Thank you for pushing all of these things through!!! 🎉 🥇 🖤 <--- maybe we should use a black heart emoji for the black checks?? 😉

I've left a few questions here I'd like to get some clarification about!

source/YTEPs/YTEP-0037.rst

munkm · 2020-06-22T22:09:16Z

source/YTEPs/YTEP-0037.rst

+`#2592 <https://github.com/yt-project/yt/pull/2592>`_.
+
+To better highlight the way yt 4.0 depends on ``unyt``, I also propose that, within the
+code base, we import directly from ``unyt`` as often as possible, so as to limit


Ok, so with unyt here do we want to create some specific guidelines beyond "as often as possible"? Where would we recommend importing from yt.units over unyt and vice-versa? Maybe @jzuhone has some opinions on this too.

I don't see why we shouldn't always use yt.units instead? That way we're always consistent with our units.

So my guidelines would be very simple (following what I've done in yt-project/yt#2597):

YTArray and YTQuantity can be imported from yt.units

all the rest should be imported from unyt

Hmmm, I'm not convinced about this yet and I think we should have more discussion about it. Why would importing from unyt directly be better than using yt.units? unyt's default units (mks) are in a different system than yt's (cgs). Won't we increase the probability of getting weird conflicts if we do imports of both?

Won't we increase the probability of getting weird conflicts if we do imports of both?

tbh I didn't think that what even possible. My mental model for yt.units is purely a wrapper module that we merely keep for convenience (avoid breaking downstream code if we don't need to). If this view is incorrect in any way (which seems to be the case with cgs VS mks systems), then it's expected that my recommandations don't make much sense. :/

Why do we want to keep yt.units at all? I thought the whole point of splitting out unyt was for yt to fully depend on it. It's fine to keep some sort of thin wrapper to provide backward compatibility, but shouldn't we "just" switch? Even if that means embracing MKS everywhere?

Because we wanted to minimize back compat breaks for users who were already burned by the yt-2 to yt-3 transition.

I think in 4.0, which I believe is the target here, most/all of yt.units now is a thin-layer. And that means, among other things, that they have MKS base:

>>> yt.units.cm.base_value 0.01

There are a handful of additional things still included, like UnitContainer, display_ytarray, some create_code_unit_system stuff and a default registry that includes "h" but for the most part it now imports directly from unyt.

So base_value is the value in unyt's internal unit system, which was indeed changed to an MKS unit system. However, the internal unit system is more of an implementation detail, and is not really exposed in the API beyond the base_value attribute Matt pointed out, unit objects each refer to a unit system distinct from the internal unit system that's used to generate default values for the results of operations (e.g. for unit simplication). The units in the yt namespace have a different unit system from the ones in the unyt namespace:

In [3]: yt.units.cm.registry.unit_system Out[3]: cgs Unit System Base Units: length: cm mass: g time: s temperature: K angle: rad luminous_intensity: cd logarithmic: Np Other Units: energy: erg specific_energy: erg/g pressure: dyn/cm**2 force: dyn magnetic_field_cgs: G charge_cgs: statC current_cgs: statA power: erg/s In [5]: unyt.cm.registry.unit_system Out[5]: mks Unit System Base Units: length: m mass: kg time: s temperature: K angle: rad current_mks: A luminous_intensity: cd logarithmic: Np Other Units: energy: J specific_energy: J/kg pressure: Pa force: N magnetic_field: T charge: C frequency: Hz power: W electric_potential: V capacitance: F inductance: H resistance: Ω magnetic_flux: Wb luminous_flux: lm

We did it this way to make it easier to keep backward compatibility in yt.

If you're interested in more detail about this in a slightly more abstract context, both the yt namespace and yt Dataset instances implement the pattern described in the unyt docs here: https://unyt.readthedocs.io/en/stable/usage.html#custom-unit-systems

thanks @ngoldbaum for the clarifications !

This means that if internally in yt a unit ultimately came from the unyt namespace, a user might end up with a result that will ultimately be associated with an MKS unit system, perhaps causing confusion or even outright buggy behavior.

I think in 4.0, which I believe is the target here, most/all of yt.units now is a thin-layer. And that means, among other things, that they have MKS base
There are a handful of additional things still included, like UnitContainer, display_ytarray, some create_code_unit_system stuff and a default registry that includes "h" but for the most part it now imports directly from unyt.

In yt-project/yt#2597 I only tried to avoid the additional import layer, I don't think any behaviour (internal or external) should be affected ?

Why do we want to keep yt.units at all? I thought the whole point of splitting out unyt was for yt to fully depend on it. It's fine to keep some sort of thin wrapper to provide backward compatibility, but shouldn't we "just" switch? Even if that means embracing MKS everywhere?

Because we wanted to minimize back compat breaks for users who were already burned by the yt-2 to yt-3 transition.

But is yt.units planned for deprecation at some point ? I don't know if we're the 3->4 transition will be comparable to 2->3 in terms of backward compatibility breaks, so maybe it's a good time to at least add deprecation warnings (or even get rid of the wrapper completely and just keep the yt-specific bits) ?

In any case, I feel like everyone here agrees it'd be best to avoid mixing imports from both unyt and yt.units
Please correct me if I'm mistaken but it seemed to me that the current state of yt.unit was meant to be transitory, and that yt.units would ultimately disappear in favour of unyt.

munkm · 2020-06-22T22:10:49Z

source/YTEPs/YTEP-0037.rst

+straight-forward option configuration to validate docstrings are numpy-styled. However,
+there is currently a very large debt in errors caught by this tool, and no way to
+automatically solve them. However, it could still be added to our linting CI, if check
+for *new* errors only, such as


I think using this to check for new errors introduced in PRs is a good idea.

source/YTEPs/YTEP-0037.rst

munkm · 2020-06-26T15:06:27Z

Ok, it seems like everything in this YTEP is agreed on except the unyt imports. Is unyt importing even a style change? Maybe we could table that discussion and start an issue in the yt main repo about it and keep it out of this ytep?

I propose that we remove the unyt-related import changes from this YTEP and add them to a new issue in the yt repo, then merge this YTEP once we have a decision on the time and date of the dedicated maintainer meeting (as mentioned in the schedule of this YTEP). Once that time is decided, then we can update this PR with the time of the meeting and send out specific connection details to the mailing list and slack, and merge the YTEP.

Does this seem reasonable?

matthewturk · 2020-06-26T15:08:13Z

That sounds very good to me. I also did not weigh in earlier about unyt, but I think we *should* give it a bit of a prioritized role. I look forward to discussing that at a later date!

…

On Fri, Jun 26, 2020 at 10:06 AM Madicken Munk ***@***.***> wrote: Ok, it seems like everything in this YTEP is agreed on except the unyt imports. Is unyt importing even a style change? Maybe we could table that discussion and start an issue in the yt main repo about it and keep it out of this ytep? I propose that we remove the unyt-related import changes to a new issue in the yt repo, then merge this YTEP once we have a decision on the time and date of the dedicated maintainer meeting (as mentioned in the schedule of this YTEP). Once that time is decided, then we can update this PR with the time of the meeting and send out specific connection details to the mailing list and slack, and merge the YTEP. Does this seem reasonable? — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#14 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAAVXOZUVNF2HMFAZS7CHJTRYS2QFANCNFSM4NFEOFWA> .

neutrinoceros · 2020-06-26T15:14:29Z

@munkm I fully agree with this. Keeping the controversial stuff out of the YTEP is an excellent way forward. Should I update this PR accordingly right now ?

munkm · 2020-06-26T15:24:26Z

Yeah I think that's a good idea! I'm happy to approve it once they're removed. (I'd rename the unyt-related PRs so they don't have the YTEP in the name too.)

neutrinoceros added 2 commits May 19, 2020 18:02

add a ytep on code style

ae216ec

add missing image file

99e349d

matthewturk approved these changes May 19, 2020

View reviewed changes

typo

5f8659f

Co-authored-by: Matthew Turk <[email protected]>

add insight on Cython files, define merging strategy for existing PRs…

d3cb303

…, fix the image insertion

typo

46bfd25

brittonsmith reviewed May 20, 2020

View reviewed changes

typo

1014f81

Co-authored-by: Britton Smith <[email protected]>

Xarthisius approved these changes May 20, 2020

View reviewed changes

add roadmap, mention docstring issue, and add bugbear

fc03b1b

neutrinoceros added 8 commits May 21, 2020 14:20

revise estimations in number of lines requiring manual update (10 tim…

d60d22e

…es less than previous estimation !)

detail on bugbear

6948754

clarify bugbear example

4899614

add detail on flake8-docstrings

c8fd9e6

typos + rephrasing

de21c10

fix estimations

7335e10

add note about dask's configuration for black

7db39fd

add subsection for flynt

b1481f4

brittonsmith reviewed May 25, 2020

View reviewed changes

brittonsmith approved these changes May 25, 2020

View reviewed changes

jzuhone reviewed Jun 5, 2020

View reviewed changes

source/YTEPs/YTEP-0037.rst Outdated Show resolved Hide resolved

neutrinoceros added 2 commits June 5, 2020 21:37

correct typo

abb6797

correct statement about pandas + black

9d74e29

munkm reviewed Jun 22, 2020

View reviewed changes

add intro to roadmap (contributed by Madicken)

76abbad

change YTEP0037 status to accepted, remove unyt import proposal

1106102

munkm approved these changes Jul 16, 2020

View reviewed changes

chummels approved these changes Jul 16, 2020

View reviewed changes

cphyc merged commit cbe0299 into yt-project:master Jul 16, 2020

neutrinoceros deleted the ytep_37 branch July 21, 2020 12:22

munkm mentioned this pull request Aug 11, 2020

Internally import from unyt yt-project/yt#2597

Closed

YTEP-0037: Code Styling #14

YTEP-0037: Code Styling #14

Conversation

neutrinoceros commented May 19, 2020 • edited Loading

matthewturk left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

neutrinoceros commented May 19, 2020 • edited Loading

neutrinoceros commented May 19, 2020 • edited Loading

neutrinoceros commented May 20, 2020

brittonsmith left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

neutrinoceros commented May 20, 2020 • edited Loading

A possible roadmap:

neutrinoceros commented May 20, 2020

munkm commented May 20, 2020

neutrinoceros commented May 20, 2020

munkm commented May 20, 2020

neutrinoceros commented May 20, 2020

brittonsmith commented May 21, 2020

neutrinoceros commented May 21, 2020

Choose a reason for hiding this comment

neutrinoceros May 25, 2020 • edited Loading

Choose a reason for hiding this comment

brittonsmith left a comment

Choose a reason for hiding this comment

matthewturk commented Jun 22, 2020

neutrinoceros commented Jun 22, 2020

Xarthisius commented Jun 22, 2020

munkm left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

ngoldbaum Jun 23, 2020 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

munkm commented Jun 26, 2020 • edited Loading

matthewturk commented Jun 26, 2020 via email

neutrinoceros commented Jun 26, 2020

munkm commented Jun 26, 2020

neutrinoceros commented May 19, 2020 •

edited

Loading

neutrinoceros commented May 19, 2020 •

edited

Loading

neutrinoceros commented May 19, 2020 •

edited

Loading

neutrinoceros commented May 20, 2020 •

edited

Loading

neutrinoceros May 25, 2020 •

edited

Loading

ngoldbaum Jun 23, 2020 •

edited

Loading

munkm commented Jun 26, 2020 •

edited

Loading