Merge pull request #248 from jeromekelleher/last-docs-updates-and-release

jeromekelleher · web-flow · commit 31a593531370 · 2024-06-10T00:11:21.000+01:00
Last docs updates and release
diff --git a/docs/vcf2zarr/tutorial.md b/docs/vcf2zarr/tutorial.md
@@ -18,6 +18,17 @@ convert your data, basically providing different levels of
 convenience and flexibility corresponding to what you might
 need for small, intermediate and large datasets.
 
+:::{warning}
+The documentation of vcf2zarr is under development, and 
+some bits are more polished than others. This "tutorial"
+is experimental, and will likely evolve into a slightly
+different format in the near future. It is 
+a work in progress and incomplete. The 
+{ref}`sec-vcf2zarr-cli-ref` should be complete
+and authoritative, however.
+:::
+
+
 ## Small dataset
 
 The simplest way to convert VCF data to Zarr is to use the
@@ -229,11 +240,33 @@ granularity). You should be careful to use this value in your scripts
 
 
 Once ``dexplode-init`` is done and we know how many partitions we have,
-we need to call ``dexplode-partition``  this number of times.
+we need to call 
+{ref}`dexplode-partition<cmd-vcf2zarr-dexplode-partition>` this number of times:
 
 ```{code-cell}
 vcf2zarr dexplode-partition sample-dist.icf 0
 vcf2zarr dexplode-partition sample-dist.icf 1
 vcf2zarr dexplode-partition sample-dist.icf 2
 ```
 
+This is not how it would be done in practise of course: you would 
+use your cluster scheduler of choice to dispatch these operations.
+:::{todo}
+Document how to do this conveniently over some popular schedulers.
+:::
+
+:::{tip}
+Use the ``--one-based`` argument in cases in which it's more convenient
+to index the partitions from 1 to n, rather than 0 to n - 1.
+:::
+
+Finally we need to call 
+{ref}`dexplode-finalise<cmd-vcf2zarr-dexplode-finalise>`:
+```{code-cell}
+vcf2zarr dexplode-finalise sample-dist.icf
+```
+
+:::{todo}
+Document the process for dencode, noting the information output about 
+memory requirements.
+:::
diff --git a/pyproject.toml b/pyproject.toml
@@ -24,7 +24,7 @@ dependencies = [
 ]
 requires-python = ">=3.9"
 classifiers = [
-  "Development Status :: 3 - Alpha",
+  "Development Status :: 4 - Beta",
   "License :: OSI Approved :: Apache Software License",
   "Operating System :: POSIX",
   "Operating System :: POSIX :: Linux",

Original file line number	Diff line number	Diff line change
`@@ -24,7 +24,7 @@ dependencies = [`
`24`	`24`	`]`
`25`	`25`	`requires-python = ">=3.9"`
`26`	`26`	`classifiers = [`
`27`		`- "Development Status :: 3 - Alpha",`
	`27`	`+ "Development Status :: 4 - Beta",`
`28`	`28`	`"License :: OSI Approved :: Apache Software License",`
`29`	`29`	`"Operating System :: POSIX",`
`30`	`30`	`"Operating System :: POSIX :: Linux",`