[DOC] expand coding standards #78

emdupre · 2024-05-13T21:58:46Z

Addresses #31, #62

To be discussed: do we want to add recommendations on using LLMs in coding?

emdupre · 2024-05-13T22:28:19Z

cc @sjshim, please let me know if you have any feedback on this! 👩‍💻

sjshim · 2024-05-13T22:47:03Z

This is definitely much better! Hopefully the packaging materials will be closer to final after next week's meeting?

emdupre · 2024-05-13T23:08:29Z

Definitely will be good to get more ideas down following that discussion 😸

@poldrack, let us know if this looks OK (enough, for now) to merge on your end!

effigies · 2024-06-12T03:36:45Z

labguide/research/coding_standards.md

+def confirm_data_frame_index_alignment(df1, df2):
+    assert all(df1.index == df2.index)


This is more of a unit test helper. Something like assert_matching_indices(dataframe1, dataframe2). A unit test would then be something like:

    def test_dataframe_transformation():
        df = make_default_df()
        transformed_df = my_transformation(df)
        # Whatever else we do, do not break the index
        assert_matching_indices(df, transformed_df)
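For concreteness, here is a minimal sketch of what such a helper and test could look like. `make_default_df` and `my_transformation` are hypothetical stand-ins invented for illustration, and using `pd.testing.assert_index_equal` in place of the bare `assert all(...)` is my assumption, not something the PR specifies; it tends to give a more informative failure message.

```python
import pandas as pd


def assert_matching_indices(dataframe1, dataframe2):
    """Test helper: fail with an AssertionError if the two indices differ."""
    pd.testing.assert_index_equal(dataframe1.index, dataframe2.index)


def make_default_df():
    # Hypothetical fixture: a small frame with a labeled index.
    return pd.DataFrame({"score": [1.0, 2.0, 3.0]}, index=["s1", "s2", "s3"])


def my_transformation(df):
    # Hypothetical transformation that should leave the index untouched.
    return df * 2


def test_dataframe_transformation():
    df = make_default_df()
    transformed_df = my_transformation(df)
    # Whatever else we do, do not break the index
    assert_matching_indices(df, transformed_df)
```

A test runner such as pytest would collect `test_dataframe_transformation` automatically; the helper itself stays out of the analysis code.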

effigies · 2024-06-12T03:37:25Z

labguide/research/coding_standards.md

+## Python packaging
+
+Projects that aim to develop pip-installable packages should follow current best practices in Python packaging.
+As of May 2024, this is outlined in [this blog post](https://effigies.gitlab.io/posts/python-packaging-2023/) by lab member Chris Markiewicz.


I might suggest https://www.pyopensci.org/python-package-guide/package-structure-code/python-package-build-tools.html as a more thorough guide.

effigies · 2024-06-12T03:40:49Z

labguide/research/coding_standards.md

+h=read_csv('https://raw.githubusercontent.com/poldrack/clean_coding/master/data/health.csv',index_col=0)[hc].dropna().mean(1)
+```
+
+Compare this with a modular, portable refactoring:
+
+```python
+# load health data
+def load_health_data(datadir, filename='health.csv'):
+    return pd.read_csv(os.path.join(datadir, filename), index_col=0)


These don't quite do the same things. If you're going to say, don't do A, do B, it would be good if A and B produced the same result.

Do you want to add something like:

    data = load_health_data(datadir)
    demeaned = data[columns].dropna().mean(1)

Do you want to go into getting datadir from os.environ or sys.argv? Given the bullet points above, that might help make clear what the alternatives look like.
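As a rough sketch of what those alternatives might look like: the function below takes the data directory from the first command-line argument if one is given, and otherwise falls back to an environment variable. The function name `get_datadir`, the variable name `HEALTH_DATADIR`, and the `"data"` default are all made up for illustration and do not appear in the PR.

```python
import os
import sys


def get_datadir(argv=None, environ=None, default="data"):
    """Resolve the data directory without hard-coding a path.

    Order of precedence (an assumption for this sketch):
    1. first command-line argument,
    2. the HEALTH_DATADIR environment variable (hypothetical name),
    3. a relative default directory.
    """
    argv = sys.argv if argv is None else argv
    environ = os.environ if environ is None else environ
    if len(argv) > 1:
        return argv[1]
    return environ.get("HEALTH_DATADIR", default)


if __name__ == "__main__":
    datadir = get_datadir()
    print(f"Reading data from: {datadir}")
```

The resolved value could then be passed to `load_health_data(datadir)`, which keeps the loading function free of machine-specific paths.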

doc: expand coding standards

87fb4b8

emdupre force-pushed the doc/coding-standards branch from 99db30b to 87fb4b8 on May 13, 2024 22:16

emdupre and others added 5 commits May 13, 2024 16:13

doc: small typos

8badfec

Minor style and grammar cleanups

3353021

Python style cleanups

04af9b0

Merge remote-tracking branch 'upstream/main' into doc/coding-standards

74b617c

Typo

e7cdb20

effigies reviewed Jun 12, 2024
