Compare commits


16 Commits

SHA1  Message  Date

120b1ab6e5  chore(deps): update dependency pytest-cov to v5  (2024-04-29 01:31:28 +00:00)
  Some checks failed:
    renovate/artifacts: Artifact file update failure
    gitea-physics/deepdog/pipeline/pr-master: There was a failure building this commit

e5f7085324  chore(release): 0.8.1  (2024-04-28 04:27:47 -05:00)
  All checks were successful:
    gitea-physics/deepdog/pipeline/head: This commit looks good
    gitea-physics/deepdog/pipeline/tag: This commit looks good

578481324b  chore(release): 0.8.1  (2024-04-28 04:27:32 -05:00)

bf8ac9850d  release: fixes standard version updater which didn't allow minor version to be multidigit  (2024-04-28 04:27:06 -05:00)

ab408b6412  chore(release): 0.8.1  (2024-04-28 04:19:08 -05:00)

4aa0a6f234  chore(release): 0.8.0  (2024-04-28 04:08:02 -05:00)
  Some checks failed:
    gitea-physics/deepdog/pipeline/head: This commit looks good
    gitea-physics/deepdog/pipeline/tag: There was a failure building this commit

f9646e3386  fix!: fixes the spin qubit frequency phase shift calculation which had an index problem  (2024-04-28 04:07:35 -05:00)

3b612b960e  chore(release): 0.7.10  (2024-04-27 23:06:24 -05:00)
  All checks were successful:
    gitea-physics/deepdog/pipeline/head: This commit looks good
    gitea-physics/deepdog/pipeline/tag: This commit looks good

b0ad4bead0  feat: better management of cli wrapper  (2024-04-27 23:04:33 -05:00)

4b2e573715  feat: adds cli probs  (2024-04-27 18:43:25 -05:00)

12e6916ab2  doc: documentation for myself because i'll forget otherwise  (2024-04-27 12:25:39 -05:00)
  All checks were successful:
    gitea-physics/deepdog/pipeline/head: This commit looks good

1e76f63725  git: ignores local_scripts directory as place to run stuff while developing  (2024-04-25 16:18:14 -05:00)
  All checks were successful:
    gitea-physics/deepdog/pipeline/head: This commit looks good

7aa5ad2eb9  chore(release): 0.7.9  (2024-04-21 11:23:42 -05:00)
  All checks were successful:
    gitea-physics/deepdog/pipeline/head: This commit looks good
    gitea-physics/deepdog/pipeline/tag: This commit looks good

fe331bb544  Merge branch 'filter_compose'  (2024-04-21 11:21:36 -05:00)
  All checks were successful:
    gitea-physics/deepdog/pipeline/head: This commit looks good

03ac85a967  chore: performance enhancement for fmt in justfile  (2024-04-21 11:21:11 -05:00)
  All checks were successful:
    gitea-physics/deepdog/pipeline/head: This commit looks good

96589ff659  adds a filter for future dmc use  (2024-04-21 10:55:44 -05:00)
20 changed files with 682 additions and 8 deletions

.gitignore  (vendored, 2 changed lines)
@@ -143,3 +143,5 @@ dmypy.json
 cython_debug/
 *.csv
+local_scripts/
CHANGELOG.md
@@ -2,6 +2,37 @@
 All notable changes to this project will be documented in this file. See [standard-version](https://github.com/conventional-changelog/standard-version) for commit guidelines.
+
+### [0.8.1](https://gitea.deepak.science:2222/physics/deepdog/compare/0.8.0...0.8.1) (2024-04-28)
+
+### [0.8.1](https://gitea.deepak.science:2222/physics/deepdog/compare/0.8.0...0.8.1) (2024-04-28)
+
+## [0.8.0](https://gitea.deepak.science:2222/physics/deepdog/compare/0.7.10...0.8.0) (2024-04-28)
+
+### ⚠ BREAKING CHANGES
+
+* fixes the spin qubit frequency phase shift calculation which had an index problem
+
+### Bug Fixes
+
+* fixes the spin qubit frequency phase shift calculation which had an index problem ([f9646e3](https://gitea.deepak.science:2222/physics/deepdog/commit/f9646e33868e1a0da8ab663230c0c692ac25bb74))
+
+### [0.7.10](https://gitea.deepak.science:2222/physics/deepdog/compare/0.7.9...0.7.10) (2024-04-28)
+
+### Features
+
+* adds cli probs ([4b2e573](https://gitea.deepak.science:2222/physics/deepdog/commit/4b2e57371546731137b011461849bb849d4d4e0f))
+* better management of cli wrapper ([b0ad4be](https://gitea.deepak.science:2222/physics/deepdog/commit/b0ad4bead0d4762eb7f848f6e557f6d9b61200b9))
+
+### [0.7.9](https://gitea.deepak.science:2222/physics/deepdog/compare/0.7.8...0.7.9) (2024-04-21)
+
+### Features
+
+* adds ability to write custom dmc filters ([ea080ca](https://gitea.deepak.science:2222/physics/deepdog/commit/ea080ca1c7068042ce1e0a222d317f785a6b05f4))
+* adds tarucha phase calculation, using spin qubit precession rate noise ([3ae0783](https://gitea.deepak.science:2222/physics/deepdog/commit/3ae0783d00cbe6a76439c1d671f2cff621d8d0a8))
+
 ### [0.7.8](https://gitea.deepak.science:2222/physics/deepdog/compare/0.7.7...0.7.8) (2024-02-29)

README.md
@@ -5,7 +5,7 @@
 [![Jenkins](https://img.shields.io/jenkins/build?jobUrl=https%3A%2F%2Fjenkins.deepak.science%2Fjob%2Fgitea-physics%2Fjob%2Fdeepdog%2Fjob%2Fmaster&style=flat-square)](https://jenkins.deepak.science/job/gitea-physics/job/deepdog/job/master/)
 ![Jenkins tests](https://img.shields.io/jenkins/tests?compact_message&jobUrl=https%3A%2F%2Fjenkins.deepak.science%2Fjob%2Fgitea-physics%2Fjob%2Fdeepdog%2Fjob%2Fmaster%2F&style=flat-square)
 ![Jenkins Coverage](https://img.shields.io/jenkins/coverage/cobertura?jobUrl=https%3A%2F%2Fjenkins.deepak.science%2Fjob%2Fgitea-physics%2Fjob%2Fdeepdog%2Fjob%2Fmaster%2F&style=flat-square)
-![Maintenance](https://img.shields.io/maintenance/yes/2023?style=flat-square)
+![Maintenance](https://img.shields.io/maintenance/yes/2024?style=flat-square)
 The DiPole DiaGnostic tool.
@@ -13,6 +13,13 @@ The DiPole DiaGnostic tool.
 `poetry install` to start locally
-Commit using [Conventional Commits](https://www.conventionalcommits.org/en/v1.0.0/), and when commits are on master, release with `doo release`.
+Commit using [Conventional Commits](https://www.conventionalcommits.org/en/v1.0.0/), and when commits are on master, release with `just release`.
+In general `just --list` has some of the useful stuff for figuring out what development tools there are.
+Poetry as an installer is good; even better is using Nix (maybe with direnv to automatically pick up the `devShell` from `flake.nix`).
+In either case `just` should handle actually calling things in a way that's agnostic to poetry as a runner or through nix.
+
+### local scripts
+The `local_scripts` folder allows scripts to be run using this code, though that probably isn't the most auditable approach for actual usage.
+The API is still only something I'm using, so there are no guarantees yet that it will be stable; overall, semantic versioning should help with API breaks.

deepdog/cli/__init__.py  (new, empty file)

deepdog/cli/probs/__init__.py  (new file)
@@ -0,0 +1,5 @@
from deepdog.cli.probs.main import wrapped_main

__all__ = [
	"wrapped_main",
]

deepdog/cli/probs/args.py  (new file, 63 lines)
@@ -0,0 +1,63 @@
import argparse
import os


def parse_args() -> argparse.Namespace:
	def dir_path(path):
		if os.path.isdir(path):
			return path
		else:
			raise argparse.ArgumentTypeError(f"readable_dir:{path} is not a valid path")

	parser = argparse.ArgumentParser(
		"probs", description="Calculating probability from finished bayesrun"
	)
	parser.add_argument(
		"--log_file",
		type=str,
		help="A filename for logging to, if not provided will only log to stderr",
		default=None,
	)
	parser.add_argument(
		"--bayesrun-directory",
		"-d",
		type=dir_path,
		help="The directory to search for bayesrun files, defaulting to cwd if not passed",
		default=".",
	)
	parser.add_argument(
		"--indexify-json",
		help="A json file with the indexify config for parsing job indexes. Will skip if not present",
		default="",
	)
	parser.add_argument(
		"--seed-index",
		type=int,
		help='take an integer to append as a "seed" key with range at end of indexify dict. Skip if <= 0',
		default=0,
	)
	parser.add_argument(
		"--seed-fieldname",
		type=str,
		help='if --seed-index is set, the fieldname to append to the indexifier. "seed" by default',
		default="seed",
	)
	parser.add_argument(
		"--coalesced-keys",
		type=str,
		help="A comma separated list of strings over which to coalesce data. By default coalesce over all fields within model names, ignore file level names",
		default="",
	)
	parser.add_argument(
		"--uncoalesced-outfile",
		type=str,
		help="output filename for uncoalesced data. If not provided, will not be written",
		default=None,
	)
	parser.add_argument(
		"--coalesced-outfile",
		type=str,
		help="output filename for coalesced data. If not provided, will not be written",
		default=None,
	)
	return parser.parse_args()
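To make the flag set above concrete, here's a minimal sketch of driving `parse_args` the way the `probs` console script (registered in pyproject.toml further down this diff) would; the argv values are invented for illustration:

```python
# Hypothetical invocation sketch; every path/value here is made up.
import sys
from unittest import mock

import deepdog.cli.probs.args

argv = [
	"probs",
	"-d", ".",  # dir_path() requires an existing directory
	"--seed-index", "100",  # appends {"seed": range(100)} to the indexify config
	"--coalesced-outfile", "coalesced.csv",
]
with mock.patch.object(sys, "argv", argv):
	args = deepdog.cli.probs.args.parse_args()

print(args.bayesrun_directory, args.seed_fieldname)  # ". seed"
```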

deepdog/cli/probs/dicts.py  (new file, 178 lines)
@@ -0,0 +1,178 @@
import typing
from deepdog.results import BayesrunOutput
import logging
import csv
import tqdm

_logger = logging.getLogger(__name__)


def build_model_dict(
	bayes_outputs: typing.Sequence[BayesrunOutput],
) -> typing.Dict[
	typing.Tuple, typing.Dict[typing.Tuple, typing.Dict["str", typing.Any]]
]:
	"""
	Maybe someday do something smarter with the coalescing and stuff but don't want to so i won't
	"""
	# assume that everything is well formatted, that the keys are the same across the entire list, and initialise the dict of keys.
	# model dict will contain a model_key: {calculation_dict} where each calculation_dict represents a single calculation for that model,
	# the uncoalesced version, keyed by the specific file keys
	model_dict: typing.Dict[
		typing.Tuple, typing.Dict[typing.Tuple, typing.Dict["str", typing.Any]]
	] = {}

	_logger.info("building model dict")
	for out in tqdm.tqdm(bayes_outputs, desc="reading outputs", leave=False):
		for model_result in out.results:
			model_key = tuple(v for v in model_result.parsed_model_keys.values())
			if model_key not in model_dict:
				model_dict[model_key] = {}
			calculation_dict = model_dict[model_key]
			calculation_key = tuple(v for v in out.data.values())
			if calculation_key not in calculation_dict:
				calculation_dict[calculation_key] = {
					"_model_key_dict": model_result.parsed_model_keys,
					"_calculation_key_dict": out.data,
					"success": model_result.success,
					"count": model_result.count,
				}
			else:
				raise ValueError(
					f"Got {calculation_key} twice for model_key {model_key}"
				)

	return model_dict


def write_uncoalesced_dict(
	uncoalesced_output_filename: typing.Optional[str],
	uncoalesced_model_dict: typing.Dict[
		typing.Tuple, typing.Dict[typing.Tuple, typing.Dict["str", typing.Any]]
	],
):
	if uncoalesced_output_filename is None or uncoalesced_output_filename == "":
		_logger.warning("Not provided an uncoalesced filename, not going to try")
		return

	first_value = next(iter(next(iter(uncoalesced_model_dict.values())).values()))
	model_field_names = set(first_value["_model_key_dict"].keys())
	calculation_field_names = set(first_value["_calculation_key_dict"].keys())
	if not (set(model_field_names).isdisjoint(calculation_field_names)):
		_logger.info(f"Detected model field names {model_field_names}")
		_logger.info(f"Detected calculation field names {calculation_field_names}")
		raise ValueError(
			f"model field names {model_field_names} and calculation {calculation_field_names} have an overlap, which is possibly a problem"
		)
	collected_fieldnames = list(model_field_names)
	collected_fieldnames.extend(calculation_field_names)
	collected_fieldnames.extend(["success", "count"])
	_logger.info(f"Full uncoalesced fieldnames are {collected_fieldnames}")
	with open(uncoalesced_output_filename, "w", newline="") as uncoalesced_output_file:
		writer = csv.DictWriter(
			uncoalesced_output_file, fieldnames=collected_fieldnames
		)
		writer.writeheader()
		for model_dict in uncoalesced_model_dict.values():
			for calculation in model_dict.values():
				row = calculation["_model_key_dict"].copy()
				row.update(calculation["_calculation_key_dict"].copy())
				row.update(
					{
						"success": calculation["success"],
						"count": calculation["count"],
					}
				)
				writer.writerow(row)


def coalesced_dict(
	uncoalesced_model_dict: typing.Dict[
		typing.Tuple, typing.Dict[typing.Tuple, typing.Dict["str", typing.Any]]
	],
	minimum_count: float = 0.1,
):
	"""
	pass in uncoalesced dict
	the minimum_count field is what we use to make sure our probs are never zero
	"""
	coalesced_dict = {}

	# we are already iterating, so (for no reason, since performance really doesn't matter) let's count the keys ourselves
	num_keys = 0

	# first pass: coalesce
	for model_key, model_dict in uncoalesced_model_dict.items():
		num_keys += 1
		for calculation in model_dict.values():
			if model_key not in coalesced_dict:
				coalesced_dict[model_key] = {
					"_model_key_dict": calculation["_model_key_dict"].copy(),
					"calculations_coalesced": 0,
					"count": 0,
					"success": 0,
				}
			sub_dict = coalesced_dict[model_key]
			sub_dict["calculations_coalesced"] += 1
			sub_dict["count"] += calculation["count"]
			sub_dict["success"] += calculation["success"]

	# second pass: do the probability calculation
	prior = 1 / num_keys
	_logger.info(f"Got {num_keys} model keys, so our prior will be {prior}")

	total_weight = 0
	for coalesced_model_dict in coalesced_dict.values():
		model_weight = (
			max(minimum_count, coalesced_model_dict["success"])
			/ coalesced_model_dict["count"]
		) * prior
		total_weight += model_weight

	total_prob = 0
	for coalesced_model_dict in coalesced_dict.values():
		model_weight = (
			max(minimum_count, coalesced_model_dict["success"])
			/ coalesced_model_dict["count"]
		)
		prob = model_weight * prior / total_weight
		coalesced_model_dict["prob"] = prob
		total_prob += prob

	_logger.debug(
		f"Got a total probability of {total_prob}, which should be close to 1 up to float/rounding error"
	)
	return coalesced_dict


def write_coalesced_dict(
	coalesced_output_filename: typing.Optional[str],
	coalesced_model_dict: typing.Dict[typing.Tuple, typing.Dict["str", typing.Any]],
):
	if coalesced_output_filename is None or coalesced_output_filename == "":
		_logger.warning("Not provided a coalesced filename, not going to try")
		return

	first_value = next(iter(coalesced_model_dict.values()))
	model_field_names = set(first_value["_model_key_dict"].keys())
	_logger.info(f"Detected model field names {model_field_names}")

	collected_fieldnames = list(model_field_names)
	collected_fieldnames.extend(["calculations_coalesced", "success", "count", "prob"])
	with open(coalesced_output_filename, "w", newline="") as coalesced_output_file:
		writer = csv.DictWriter(coalesced_output_file, fieldnames=collected_fieldnames)
		writer.writeheader()
		for model_dict in coalesced_model_dict.values():
			row = model_dict["_model_key_dict"].copy()
			row.update(
				{
					"calculations_coalesced": model_dict["calculations_coalesced"],
					"success": model_dict["success"],
					"count": model_dict["count"],
					"prob": model_dict["prob"],
				}
			)
			writer.writerow(row)
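A toy numeric sketch (numbers invented) of the two probability passes in `coalesced_dict` above: a uniform prior over models, success rates clamped by `minimum_count`, then normalization:

```python
# Two hypothetical models with made-up success/count tallies.
minimum_count = 0.1
models = {"model_a": {"success": 9, "count": 100}, "model_b": {"success": 1, "count": 100}}

prior = 1 / len(models)  # uniform prior, matching prior = 1 / num_keys above
total_weight = sum(
	(max(minimum_count, m["success"]) / m["count"]) * prior for m in models.values()
)
for name, m in models.items():
	weight = max(minimum_count, m["success"]) / m["count"]
	print(name, weight * prior / total_weight)
# model_a 0.9, model_b 0.1: probabilities normalize to 1, and the clamp keeps them nonzero.
```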

deepdog/cli/probs/main.py  (new file, 95 lines)
@@ -0,0 +1,95 @@
import logging
import argparse
import json

import deepdog.cli.probs.args
import deepdog.cli.probs.dicts
import deepdog.results
import deepdog.indexify
import pathlib

import tqdm
import tqdm.contrib.logging

_logger = logging.getLogger(__name__)


def set_up_logging(log_file: str):
	log_pattern = "%(asctime)s | %(levelname)-7s | %(name)s:%(lineno)d | %(message)s"
	if log_file is None:
		handlers = [
			logging.StreamHandler(),
		]
	else:
		handlers = [logging.StreamHandler(), logging.FileHandler(log_file)]
	logging.basicConfig(
		level=logging.DEBUG,
		format=log_pattern,
		# it's okay to ignore this mypy error because who cares about logger handler types
		handlers=handlers,  # type: ignore
	)
	logging.captureWarnings(True)


def main(args: argparse.Namespace):
	"""
	Main function with passed in arguments and no additional logging setup in case we want to extract out later
	"""

	with tqdm.contrib.logging.logging_redirect_tqdm():
		_logger.info(f"args: {args}")

		try:
			if args.coalesced_keys:
				raise NotImplementedError(
					"Currently not supporting coalesced keys, but maybe in future"
				)
		except AttributeError:
			# we don't care if this is missing because we don't actually want it to be there
			pass

		indexifier = None
		if args.indexify_json:
			with open(args.indexify_json, "r") as indexify_json_file:
				indexify_data = json.load(indexify_json_file)
				if args.seed_index > 0:
					indexify_data[args.seed_fieldname] = list(range(args.seed_index))
				# _logger.debug(f"Indexifier data looks like {indexify_data}")
				indexifier = deepdog.indexify.Indexifier(indexify_data)

		bayes_dir = pathlib.Path(args.bayesrun_directory)
		out_files = [f for f in bayes_dir.iterdir() if f.name.endswith("bayesrun.csv")]
		_logger.info(
			f"Reading {len(out_files)} bayesrun.csv files in directory {args.bayesrun_directory}"
		)
		# _logger.info(out_files)
		parsed_output_files = [
			deepdog.results.read_output_file(f, indexifier)
			for f in tqdm.tqdm(out_files, desc="reading files", leave=False)
		]

		_logger.info("building uncoalesced dict")
		uncoalesced_dict = deepdog.cli.probs.dicts.build_model_dict(parsed_output_files)

		if "uncoalesced_outfile" in args and args.uncoalesced_outfile:
			deepdog.cli.probs.dicts.write_uncoalesced_dict(
				args.uncoalesced_outfile, uncoalesced_dict
			)
		else:
			_logger.info("Skipping writing uncoalesced")

		_logger.info("building coalesced dict")
		coalesced = deepdog.cli.probs.dicts.coalesced_dict(uncoalesced_dict)

		if "coalesced_outfile" in args and args.coalesced_outfile:
			deepdog.cli.probs.dicts.write_coalesced_dict(
				args.coalesced_outfile, coalesced
			)
		else:
			_logger.info("Skipping writing coalesced")


def wrapped_main():
	args = deepdog.cli.probs.args.parse_args()
	set_up_logging(args.log_file)
	main(args)
@@ -0,0 +1,14 @@
from typing import Sequence

from deepdog.direct_monte_carlo.direct_mc import DirectMonteCarloFilter
import numpy


class ComposedDMCFilter(DirectMonteCarloFilter):
	def __init__(self, filters: Sequence[DirectMonteCarloFilter]):
		self.filters = filters

	def filter_samples(self, samples: numpy.ndarray) -> numpy.ndarray:
		current_sample = samples
		for filter in self.filters:
			current_sample = filter.filter_samples(current_sample)
		return current_sample
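A usage sketch of the composition; the toy filter below is invented (any object with a compatible `filter_samples` works at runtime, though the real filters live in `deepdog.direct_monte_carlo`), and `ComposedDMCFilter` is assumed to be in scope from the file above:

```python
import numpy


class PositiveFirstCoordinateFilter:
	"""Hypothetical stand-in filter: keeps samples whose first coordinate is positive."""

	def filter_samples(self, samples: numpy.ndarray) -> numpy.ndarray:
		return samples[samples[:, 0] > 0]


samples = numpy.array([[1.0, 2.0], [-1.0, 3.0], [0.5, -4.0]])
composed = ComposedDMCFilter([PositiveFirstCoordinateFilter()])
print(composed.filter_samples(samples))  # rows [1.0, 2.0] and [0.5, -4.0] survive
```

Each filter narrows the surviving sample set, so composing them applies every acceptance criterion in order.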

@@ -0,0 +1,58 @@
"""
Probably should just include a way to handle the indexify function I reuse so much.
All about breaking an integer into a tuple of values from lists, which is useful because of how we do CHTC runs.
"""
import itertools
import typing
import logging
import math

_logger = logging.getLogger(__name__)


# from https://stackoverflow.com/questions/5228158/cartesian-product-of-a-dictionary-of-lists
def _dict_product(dicts):
	"""
	>>> list(dict_product(dict(number=[1,2], character='ab')))
	[{'character': 'a', 'number': 1},
	 {'character': 'a', 'number': 2},
	 {'character': 'b', 'number': 1},
	 {'character': 'b', 'number': 2}]
	"""
	return list(dict(zip(dicts.keys(), x)) for x in itertools.product(*dicts.values()))


class Indexifier:
	"""
	The order of keys is very important, but collections.OrderedDict is no longer needed in python 3.7.
	I think it's okay to rely on that.
	"""

	def __init__(self, list_dict: typing.Dict[str, typing.Sequence]):
		self.dict = list_dict

	def indexify(self, n: int) -> typing.Dict[str, typing.Any]:
		product_dict = _dict_product(self.dict)
		return product_dict[n]

	def _indexify_indices(self, n: int) -> typing.Sequence[int]:
		"""
		legacy indexify from old scripts, copypaste.
		could be used like
		>>> ret = {}
		>>> for k, i in zip(self.dict.keys(), self._indexify_indices):
		>>>     ret[k] = self.dict[k][i]
		>>> return ret
		"""
		weights = [len(v) for v in self.dict.values()]
		N = math.prod(weights)
		curr_n = n
		curr_N = N
		out = []
		for w in weights[:-1]:
			# print(f"current: {curr_N}, {curr_n}, {curr_n // w}")
			curr_N = curr_N // w  # should be int division anyway
			out.append(curr_n // curr_N)
			curr_n = curr_n % curr_N
		out.append(curr_n)  # the final axis index is just the remainder
		return out
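For concreteness, a small sketch of what `indexify` computes, using the same shape of data as the doctest above; index `n` is just a mixed-radix numeral over the list lengths:

```python
import itertools

list_dict = {"number": [1, 2], "character": ["a", "b", "c"]}
product = [
	dict(zip(list_dict.keys(), values))
	for values in itertools.product(*list_dict.values())
]
# weights are [2, 3], so n = 4 decomposes as (4 // 3, 4 % 3) = (1, 1)
print(product[4])  # {'number': 2, 'character': 'b'}
```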

@@ -144,7 +144,7 @@ def get_a_result_fast_filter_tarucha_spin_qubit_pair_phase_only(input) -> int:
 				* numpy.transpose(diffses1)
 			)[:, :, :, 0]
 		)
-		- ps[:, :, 0, numpy.newaxis]
+		- ps[:, numpy.newaxis, :, 0]
 	) / (norms1**3)
 	alphses2 = (
 		(
@@ -156,7 +156,7 @@ def get_a_result_fast_filter_tarucha_spin_qubit_pair_phase_only(input) -> int:
 				* numpy.transpose(diffses2)
 			)[:, :, :, 0]
 		)
-		- ps[:, :, 0, numpy.newaxis]
+		- ps[:, numpy.newaxis, :, 0]
 	) / (norms2**3)
 	bses = (1 / numpy.pi) * (

deepdog/results/__init__.py  (new file, 169 lines)
@@ -0,0 +1,169 @@
import dataclasses
import re
import typing
import logging
import deepdog.indexify
import pathlib
import csv

_logger = logging.getLogger(__name__)

FILENAME_REGEX = r"(?P<timestamp>\d{8}-\d{6})-(?P<filename_slug>.*)\.realdata\.fast_filter\.bayesrun\.csv"

MODEL_REGEXES = [
	r"geom_(?P<xmin>-?\d+)_(?P<xmax>-?\d+)_(?P<ymin>-?\d+)_(?P<ymax>-?\d+)_(?P<zmin>-?\d+)_(?P<zmax>-?\d+)-orientation_(?P<orientation>free|fixedxy|fixedz)-dipole_count_(?P<avg_filled>\d+)_(?P<field_name>\w*)"
]

FILE_SLUG_REGEXES = [
	r"mock_tarucha-(?P<job_index>\d+)",
]


@dataclasses.dataclass
class BayesrunOutputFilename:
	timestamp: str
	filename_slug: str
	path: pathlib.Path


@dataclasses.dataclass
class BayesrunColumnParsed:
	"""
	class for parsing a bayesrun while pulling certain special fields out
	"""

	def __init__(self, groupdict: typing.Dict[str, str]):
		self.column_field = groupdict["field_name"]
		self.model_field_dict = {
			k: v for k, v in groupdict.items() if k != "field_name"
		}

	def __str__(self):
		return f"BayesrunColumnParsed[{self.column_field}: {self.model_field_dict}]"


@dataclasses.dataclass
class BayesrunModelResult:
	parsed_model_keys: typing.Dict[str, str]
	success: int
	count: int


@dataclasses.dataclass
class BayesrunOutput:
	filename: BayesrunOutputFilename
	data: typing.Dict["str", typing.Any]
	results: typing.Sequence[BayesrunModelResult]


def _batch_iterable_into_chunks(iterable, n=1):
	"""
	utility for batching bayesrun files where columns appear in threes
	"""
	for ndx in range(0, len(iterable), n):
		yield iterable[ndx : min(ndx + n, len(iterable))]


def _parse_bayesrun_column(
	column: str,
) -> typing.Optional[BayesrunColumnParsed]:
	"""
	Tries one by one all of a predefined list of regexes that I might have used in the past.
	Returns the groupdict for the first match, or None if no match found.
	"""
	for pattern in MODEL_REGEXES:
		match = re.match(pattern, column)
		if match:
			return BayesrunColumnParsed(match.groupdict())
	else:
		return None


def _parse_bayesrun_row(
	row: typing.Dict[str, str],
) -> typing.Sequence[BayesrunModelResult]:
	results = []
	batched_keys = _batch_iterable_into_chunks(list(row.keys()), 3)
	for model_keys in batched_keys:
		parsed = [_parse_bayesrun_column(column) for column in model_keys]
		values = [row[column] for column in model_keys]
		if parsed[0] is None:
			raise ValueError(f"no viable success row found for keys {model_keys}")
		if parsed[1] is None:
			raise ValueError(f"no viable count row found for keys {model_keys}")
		if parsed[0].column_field != "success":
			raise ValueError(f"The column {model_keys[0]} is not a success field")
		if parsed[1].column_field != "count":
			raise ValueError(f"The column {model_keys[1]} is not a count field")
		parsed_keys = parsed[0].model_field_dict
		success = int(values[0])
		count = int(values[1])
		results.append(
			BayesrunModelResult(
				parsed_model_keys=parsed_keys,
				success=success,
				count=count,
			)
		)
	return results


def _parse_output_filename(file: pathlib.Path) -> BayesrunOutputFilename:
	filename = file.name
	match = re.match(FILENAME_REGEX, filename)
	if not match:
		raise ValueError(f"{filename} was not a valid bayesrun output")
	groups = match.groupdict()
	return BayesrunOutputFilename(
		timestamp=groups["timestamp"], filename_slug=groups["filename_slug"], path=file
	)


def _parse_file_slug(slug: str) -> typing.Optional[typing.Dict[str, str]]:
	for pattern in FILE_SLUG_REGEXES:
		match = re.match(pattern, slug)
		if match:
			return match.groupdict()
	else:
		return None


def read_output_file(
	file: pathlib.Path, indexifier: typing.Optional[deepdog.indexify.Indexifier]
) -> BayesrunOutput:
	parsed_filename = tag = _parse_output_filename(file)
	out = BayesrunOutput(filename=parsed_filename, data={}, results=[])
	out.data.update(dataclasses.asdict(tag))
	parsed_tag = _parse_file_slug(parsed_filename.filename_slug)
	if parsed_tag is None:
		_logger.warning(
			f"Could not parse {tag} against any matching regexes. Going to skip tag parsing"
		)
	else:
		out.data.update(parsed_tag)
		if indexifier is not None:
			try:
				job_index = parsed_tag["job_index"]
				indexified = indexifier.indexify(int(job_index))
				out.data.update(indexified)
			except KeyError:
				# This isn't really that important of an error, apart from the warning
				_logger.warning(
					f"Parsed tag to {parsed_tag}, and attempted to indexify but no job_index key was found. skipping and moving on"
				)

	with file.open() as input_file:
		reader = csv.DictReader(input_file)
		rows = [r for r in reader]
		if len(rows) == 1:
			row = rows[0]
		else:
			raise ValueError(f"Confused about having multiple rows in {file.name}")
	results = _parse_bayesrun_row(row)
	out.results = results
	return out
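As a sketch of the naming convention these regexes expect (the filename below is invented to match `FILENAME_REGEX` and the `mock_tarucha` slug pattern):

```python
import re

import deepdog.results

name = "20240421-112233-mock_tarucha-17.realdata.fast_filter.bayesrun.csv"
match = re.match(deepdog.results.FILENAME_REGEX, name)
print(match.groupdict())
# {'timestamp': '20240421-112233', 'filename_slug': 'mock_tarucha-17'}
print(deepdog.results._parse_file_slug("mock_tarucha-17"))  # {'job_index': '17'}
```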

justfile
@@ -43,7 +43,8 @@ fmt:
 	else
 		poetry run black .
 	fi
-	find . -type f -name "*.py" -exec sed -i -e 's/    /\t/g' {} \;
+	find deepdog -type f -name "*.py" -exec sed -i -e 's/    /\t/g' {} \;
+	find tests -type f -name "*.py" -exec sed -i -e 's/    /\t/g' {} \;

 # release the app, checking that our working tree is clean and ready for release
 release:

poetry.lock  (generated, 2 changed lines)
@@ -1220,4 +1220,4 @@ testing = ["big-O", "jaraco.functools", "jaraco.itertools", "more-itertools", "p
 [metadata]
 lock-version = "2.0"
 python-versions = ">=3.8.1,<3.10"
-content-hash = "b7f33da5b5a2af6bcb2a4c95cf391d04a76047d4f7e5c105b7cc38c73563fa51"
+content-hash = "828610d9447294e707a6df2affb6ee7947e2be3b567371217265a8b94a9768f6"

pyproject.toml
@@ -1,6 +1,6 @@
 [tool.poetry]
 name = "deepdog"
-version = "0.7.8"
+version = "0.8.1"
 description = ""
 authors = ["Deepak Mallubhotla <dmallubhotla+github@gmail.com>"]
@@ -9,6 +9,7 @@ python = ">=3.8.1,<3.10"
 pdme = "^0.9.3"
 numpy = "1.22.3"
 scipy = "1.10"
+tqdm = "^4.66.2"

 [tool.poetry.dev-dependencies]
 pytest = ">=6"
@@ -19,6 +20,9 @@ python-semantic-release = "^7.24.0"
 black = "^22.3.0"
 syrupy = "^4.0.8"

+[tool.poetry.scripts]
+probs = "deepdog.cli.probs:wrapped_main"
+
 [build-system]
 requires = ["poetry-core>=1.0.0"]
 build-backend = "poetry.core.masonry.api"
@@ -38,6 +42,13 @@ module = [
 ]
 ignore_missing_imports = true

+[[tool.mypy.overrides]]
+module = [
+	"tqdm",
+	"tqdm.*"
+]
+ignore_missing_imports = true
+
 [tool.semantic_release]
 version_toml = "pyproject.toml:tool.poetry.version"
 tag_format = "{version}"

standard-version updater (JavaScript)
@@ -1,4 +1,4 @@
-const pattern = /(\[tool\.poetry\]\nname = "deepdog"\nversion = ")(?<vers>\d+\.\d+\.\d)(")/mg;
+const pattern = /(\[tool\.poetry\]\nname = "deepdog"\nversion = ")(?<vers>\d+\.\d+\.\d+)(")/mg;
 module.exports.readVersion = function (contents) {
 	const result = pattern.exec(contents);
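That one-character regex change is what un-breaks multidigit version components; a quick illustration (in Python, since the updater itself is JavaScript) of why `0.7.10` could never match the old pattern:

```python
import re

old_version_part = r"\d+\.\d+\.\d"   # final component limited to a single digit
new_version_part = r"\d+\.\d+\.\d+"  # final component may be multidigit

print(re.fullmatch(old_version_part, "0.7.10"))  # None: "10" has two digits
print(re.fullmatch(new_version_part, "0.7.10"))  # matches '0.7.10'
```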

@@ -0,0 +1,12 @@
import deepdog.indexify
import logging

_logger = logging.getLogger(__name__)


def test_indexifier():
	weight_dict = {"key_1": [1, 2, 3], "key_2": ["a", "b", "c"]}
	indexifier = deepdog.indexify.Indexifier(weight_dict)
	_logger.debug(f"setting up indexifier {indexifier}")
	assert indexifier.indexify(0) == {"key_1": 1, "key_2": "a"}
	assert indexifier.indexify(5) == {"key_1": 2, "key_2": "c"}
@@ -0,0 +1,28 @@
import deepdog.results


def test_parse_groupdict():
	example_column_name = (
		"geom_-20_20_-10_10_0_5-orientation_free-dipole_count_100_success"
	)

	parsed = deepdog.results._parse_bayesrun_column(example_column_name)
	expected = deepdog.results.BayesrunColumnParsed(
		{
			"xmin": "-20",
			"xmax": "20",
			"ymin": "-10",
			"ymax": "10",
			"zmin": "0",
			"zmax": "5",
			"orientation": "free",
			"avg_filled": "100",
			"field_name": "success",
		}
	)
	assert parsed == expected


# def test_parse_no_match_column_name():
# 	parsed = deepdog.results.parse_bayesrun_column("There's nothing here")
# 	assert parsed is None