Bug fix to makeSubjectTaskSummary #226

mharms · 2022-04-19T01:14:38Z

I'm not entirely clear on the purpose of makeSubjectTaskSummary yet, esp. in the context of subjects with multiple runs for a given task and thus a "Level2" analysis, but it seems clear to me that it contained a bug in its treatment of $SummaryName in the situation of a Level2 analysis. Specifically $SummaryName would be of the form "A/A" which will then create problems when creating files with $SummaryName as part of the file name.

I've attempted to resolve this by splitting --summaryname into separate --summarydir and --summaryprefix arguments.

This hasn't been tested. I was hoping that @glasserm could conveniently test it.

…mary.sh

coalsont · 2022-04-19T01:25:27Z

My thought on the recent error is that someone misunderstood the description of that argument (which contains a slash that I think is intended to indicate "or", considering the logic of whether the thing gets run or not), and that it wasn't intended to support an extra subdirectory. Putting a / into, say, --subject would also cause weird "subdirectory" problems.

mharms · 2022-04-19T01:42:18Z

No, it's more than that. If you look at Greg's description of --summaryname, you see that he in fact referred to it as containing both a directory and file prefix component:

opts_AddOptional '--summaryname' 'SummaryName' 'tfMRI_TaskName/DesignName_TaskName' "Naming convention for single-subject summary directory. Will not create summary directory for Level1 analysis if this flag is missing or set to NONE. Naming for Level1 summary directories should match naming of Level2 summary directories. Default when running Level2 analysis is derived from --lvl2task and --lvl2fsf options "tfMRI_TaskName/DesignName_TaskName"" 'NONE'

That interpretation is supported by the fact that the code itself in one condition was setting (previous L208):

SummaryName="${LevelTwofMRIName}/${LevelTwofsfName}";

Basically, I think at some point that @gcburgess got a bit twisted up in terms of whether --summaryname was just a prefix, or a dir/prefix combination.

coalsont · 2022-04-19T01:56:29Z

TaskfMRIAnalysis/TaskfMRIAnalysis.sh

@@ -284,7 +289,8 @@ then
 		--lvl1fsfs=$LevelOnefsfNames \
 		--lvl2task=$LevelTwofMRIName \
 		--lvl2fsf=$LevelTwofsfName \
-		--summaryname=$SummaryName \
+		--summarydir=$SummaryDir \
+        --summaryprefix=$SummaryPrefix \


tabs/spaces mismatch

Also, if someone doesn't read the help, or just forgets --summaryprefix when using --summarydir, it looks like at present it will use a literal NONE along with the provided --summarydir, and makeSubjectTaskSummary.sh appears to use it as-is in that case.

glasserm · 2022-04-19T01:58:20Z

Wouldn't it be easier to just fix the description? The current code works fine if you use it more sensibly. I don't see why we need these additional variables.

mharms · 2022-04-19T02:03:23Z

It doesn't work sensibly in the case of applying it to Level2 outputs. I've emailed @gcburgess for clarification as to what the role of the script is on Level2 data.

coalsont · 2022-04-19T02:05:48Z

TaskfMRIAnalysis/scripts/makeSubjectTaskSummary.sh

@@ -201,19 +206,24 @@ if [ "$VolumeBasedProcessing" = "YES" ] ; then
 	Analyses+="StandardVolumeStats "; # space character at end to separate multiple analyses	
 fi

-if [[ "${SummaryName}" = "NONE" || "${SummaryName}" = "" ]]; then
+if [[ "${SummaryDir}" = "NONE" && "${SummaryPrefix}" != "NONE" ]]; then
+    log_Err_Abort "--summarydir=NONE, so --summaryprefix should be NONE as well."


This case would get overwritten by the below logic anyway, and the converse isn't tested for.

glasserm · 2022-04-19T02:06:23Z

I see it as an either or option. Either summary directory or level 2 directory. We don't need to make this more complicated than that.

coalsont · 2022-04-19T02:12:15Z

These options sound like they are intended to provide what would have been the level 2 information, so that the directory structure will match what doing the level 2 mode would have done. Should the script instead have a separate yes/no argument to toggle between the two modes of running, and make the level 2 information mandatory (even when it doesn't do the "real" level 2 stuff), instead of the current separate arguments to give the same level 2 information in the non-level-2 scenario?

glasserm · 2022-04-19T02:18:33Z

That might be an alternative approach. Not sure I see how that simplifies things or clarifies them though.

coalsont · 2022-04-19T02:27:19Z

If it really is as simple as "the same information, used a different way", it could be less confusing to explain to users than "here are different flags that must be used to provide the same information, but which also change what the script actually does". I'm not clear on the details of when it is applicable without doing the level 2 stuff (only when it was already done previously, such that the flag could be something like --skip-level2 or --use-prior-level2?), and that could affect whether it is actually simpler to explain.

mharms · 2022-04-19T02:36:31Z

What is the purpose of the parts of the script that are checking if the data is from Level2? If I understand that, I might better understand what Greg was doing ... For example, to the extent that we even need to do anything with Level2 data, why isn't it a simple one-liner where we just link to the Level2 feat directory itself? I feel like I must be missing something...

mharms · 2022-04-19T02:48:09Z

@glasserm : Have you actually run this script on any Level2 data, so that we have any actual example of what it is generating?

gcburgess · 2022-04-19T13:52:43Z

The point of makeSubjectTaskSummary.sh was to give a single, smaller, simplified output directory with the same naming convention whether the data came from one scan run or two scan runs.

For HCP-YA data, we had two scan runs for every task. However, when a subject only had one good task run, we excluded their data from public release entirely. One reason for that decision was the different naming conventions for outputs from Level1 for a single run versus Level2 directories from multiple runs.

For Lifespan, most of the tasks have only one run, but some do have two runs. Consequently, the output naming convention can be an issue when you’re trying to understand how to work across two different tasks (e.g., HCD CARIT and EMOTION).

The last reason for including the summary directories was that they don’t include the original or residual time series data, making them a much smaller package option if people want to save disk space.

Regarding the original concern about the $SummaryName option having a slash character ("A/A"), it was intended to capture the Level2 directory naming convention when you don't run a Level2 analyses (don't specify the multiple Level2 options). When that $SummaryName variable contains a slash, it bypasses the need for two arguments (one for parent directory and one for subdirectory). But, yes, it could have been done by requiring two separate arguments for the summary directories.

glasserm · 2022-04-19T14:10:12Z

I don't understand any of this. Somehow I was able to produce comparable layouts for level2 or summary directory without all this complication.

gcburgess · 2022-04-19T14:15:37Z

@glasserm Your Level1 outputs will not have the same filenames as your Level2 outputs. That's the purpose of the summary directory: to run an analysis that includes both Level1 and Level2 outputs.

glasserm · 2022-04-19T14:17:18Z

So I used what I would have used as the argument for level 2 for the summary directory and everything worked without multiple new options or A/A specification.

gcburgess · 2022-04-19T14:25:39Z

That's correct. If you passed all of the Level2 options into the script, it already knows how to name the directory. So, it doesn't need you to pass in the summary directory naming info.

The issue comes when you run only the Level1 analysis for a subject. At that point, it doesn't know how to name the summary directory to be consistent with the Level2 naming convention, because those Level2 options were not provided. So, you need to pass in options for naming the summary directory when you run only Level1 analysis, to ensure the directory name will be the same as the directory created automatically after running Level2 for the other subjects.

glasserm · 2022-04-19T14:27:57Z

Right if I were to run level2 I would call it tfMRI_${TASK} (with no phase encoding direction). And for summary directory I would similarly call it tfMRI_${TASK}. When I did this everything worked as I expected it to and I was surprised that there was all this confusion.

mharms · 2022-04-19T17:19:12Z

@gcburgess : I still don't understand why the MakeSummary script is set up to run on any Level2 data at all ...?

mharms · 2022-04-19T17:31:27Z

And to clarify, the existing script (prior to the PR) works ok on Level1 data if and only if summaryname is a single string with no slash -- the script then creates a ${ResultsFolder}/${SummaryName} directory, and also uses ${SummaryName} as a component of various file names.

The existing script won't work as is when applied to Level2 data. But per above, I'm confused why we are ever running the script on Level2 data in the first place.

glasserm · 2022-04-19T18:26:28Z

I don't understand that either. I thought this was a substitute for level2 and that's how it appeared to work when I used it that way.

mharms · 2022-04-26T19:16:44Z

@gcburgess : If you have a moment to help clarify for us the intended role of the MakeSummary script on Level2 data, that would be great. @glasserm and myself are both confused on why the option for running MakeSummary on Level2 data exists, since at least superficially such a mode of operation doesn't seem necessary if the main purpose of the script is to make Level1 data have a similar naming convention and directory structure to Level2 data.

mharms added 2 commits April 18, 2022 19:20

Changes to usage info, and split 'summaryname' into two distinct flags

018ca4f

Split 'summaryname' into two distinct flags within makeSubjectTaskSum…

201b87c

…mary.sh

coalsont reviewed Apr 19, 2022

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bug fix to makeSubjectTaskSummary #226

Bug fix to makeSubjectTaskSummary #226

mharms commented Apr 19, 2022

coalsont commented Apr 19, 2022

mharms commented Apr 19, 2022

coalsont Apr 19, 2022

coalsont Apr 19, 2022

glasserm commented Apr 19, 2022

mharms commented Apr 19, 2022

coalsont Apr 19, 2022

glasserm commented Apr 19, 2022

coalsont commented Apr 19, 2022

glasserm commented Apr 19, 2022

coalsont commented Apr 19, 2022

mharms commented Apr 19, 2022

mharms commented Apr 19, 2022

gcburgess commented Apr 19, 2022

glasserm commented Apr 19, 2022

gcburgess commented Apr 19, 2022

glasserm commented Apr 19, 2022

gcburgess commented Apr 19, 2022

glasserm commented Apr 19, 2022

mharms commented Apr 19, 2022 •

edited

Loading

mharms commented Apr 19, 2022 •

edited

Loading

glasserm commented Apr 19, 2022

mharms commented Apr 26, 2022

Bug fix to makeSubjectTaskSummary #226

Are you sure you want to change the base?

Bug fix to makeSubjectTaskSummary #226

Conversation

mharms commented Apr 19, 2022

coalsont commented Apr 19, 2022

mharms commented Apr 19, 2022

coalsont Apr 19, 2022

Choose a reason for hiding this comment

coalsont Apr 19, 2022

Choose a reason for hiding this comment

glasserm commented Apr 19, 2022

mharms commented Apr 19, 2022

coalsont Apr 19, 2022

Choose a reason for hiding this comment

glasserm commented Apr 19, 2022

coalsont commented Apr 19, 2022

glasserm commented Apr 19, 2022

coalsont commented Apr 19, 2022

mharms commented Apr 19, 2022

mharms commented Apr 19, 2022

gcburgess commented Apr 19, 2022

glasserm commented Apr 19, 2022

gcburgess commented Apr 19, 2022

glasserm commented Apr 19, 2022

gcburgess commented Apr 19, 2022

glasserm commented Apr 19, 2022

mharms commented Apr 19, 2022 • edited Loading

mharms commented Apr 19, 2022 • edited Loading

glasserm commented Apr 19, 2022

mharms commented Apr 26, 2022

mharms commented Apr 19, 2022 •

edited

Loading

mharms commented Apr 19, 2022 •

edited

Loading