How to Configure Different Aspects of NOEMA_Combine

Folders and File Handling

The main configuration options for NOEMA_Combine are set using configuration files. These files allow you to specify folders, file handling options, and other parameters used throughout the package. The file name is config.ini and an example configuration file is shown below.

[folders]
uvt_dir = D/
uvt_dir_out = D30m/
dir_sd = 30m/
inputdir = raw_data/, additional_data/, more_data/

[catalogues]
line_catalogue = line_catalogue_3mm.csv
source_catalogue = region_catalogue.yml

[file_handling]
ignorefiles_1 = raw_data/FTSOdp20220220.30m
ignorefiles_2 = raw_data/FTSOdp20220731.30m

[file_extensions]
selfcal = _sc
uvsub = _contsub

[single_dish]
telescope = iram30m

The different sections of the configuration file are described below:

  • [folders]: This section specifies the directories used for input and output data.
    • uvt_dir: Folder with calibrated uv-tables.

    • uvt_dir_out: Folder to store calibrated uv-tables output.

    • dir_sd: Folder to store calibrated and gridded single dish data.

    • inputdir: list of directories on which to search for raw single dish data.

  • [catalogues]: This section allows you to specify the line and source catalogue files used in the analysis.
    • line_catalogue: File containing the line catalogue. It is a CSV file with the list of lines, spectroscopic parameters and processing parameters (see below).

    • source_catalogue: File containing the source catalogue. It is a YAML file with the list of sources and their properties (see below).

  • [file_handling]: Here, you can list files to be ignored during processing. This is useful for excluding specific datasets that may not be relevant, or to avoid unreliable scans.

  • [file_extensions]: This section defines custom file extensions for self-calibrated and continuum-subtracted files.

  • [single_dish]: This section specifies the single dish telescope used for the observations. This information is important for correctly processing the single dish data, as different telescopes may have different data formats and characteristics. Currently, the only supported telescopes are IRAM 30m (“iram30m”) and APEX (“apex”), but support for other telescopes may be added in the future.

Avoid Bad Single Dish Scans

In some cases, certain scans from the single dish telescope may be unreliable or contain bad data. To avoid processing these bad scans, you can specify them in the configuration file under the [file_handling] section using the ignorefiles_1 and ignorefiles_2 options. For example:

[file_handling]
ignorefiles_1 = raw_data/FTSOdp20220220.30m

The following CLASS script will create a new single dish file excluding the bad scans. Here from scan 1 until 53, the LO and UO scans are compromised and will be removed. The output file will be named FTSOdp20220220_fix.30m and when combined with [file_handling] this ensures that only the right scans are processed to make the cubes.

!
! script to comment remove compromised scans seen on 2022-feb-21
!
file in FTSOdp20220220.30m
file out FTSOdp20220220_fix.30m single

set scan 1 53
! only inner units are kept
set telescope 30ME0*I-*
find
get zero
sic message class s-i
for i 1 to found
    get n
    write
next

!
! now save all other data
!
set scan 54 *
set telescope *
find
get zero
for i 1 to found
    get n
    write
next
sic message class s+i

Line and Source Catalogues

The line and source catalogues are essential components of the NOEMA_Combine package, providing necessary information for data processing and analysis.

  • Line Catalogue: This is a CSV file that contains a list of spectral lines along with their spectroscopic parameters and processing parameters. Each row in the CSV file corresponds to a different spectral line, and the columns include information such as line frequency, transition details, and any specific processing instructions.

  • Source Catalogue: This is a YAML file that contains a list of sources and their properties. Each entry in the YAML file represents a different source, with details such as source

Source Catalogue

The source catalogue is a YAML file that contains a list of sources and their properties. Each entry in the YAML file represents a different source, and an example of a source catalogue entry in YAML format is shown below:

L1448N:
RA0: "3h25m36.44s"
Dec0: "30d45m18.3s"
height: "33 arcsec"
width: "30 arcsec"
fig_width: 6.0
fig_height: 6.0
Vlsr: 9.0
source_sd: "L1448N"
source_out: "L1448N"
B5-IRS1:
RA0: "03h47m41.591s"
Dec0: "32d51m43.672s"
height: "40 arcsec"
width: "40 arcsec"
fig_width: 6.0
fig_height: 6.0
Vlsr: 9.0
source_sd: "B5-IRS1 B5-Cond*"
source_out: "B5-IRS1"
  • The RA0 and Dec0 fields specify the coordinates of the source to be used for the single dish cubes.

  • The Vlsr is also used for the single dish gridding, and it is in km/s, it is also used to extract uv-tables around the systemic velocity of the source.

  • The source_sd field is used to identify the single dish data files corresponding to the source. It is a string that can include a list of names to be later used in the find /source source_sd within CLASS

  • The source_out field specifies the name to be used for output files related to the source.

Line Catalogue

The line catalogue is a CSV file that contains a list of spectral lines along with their spectroscopic parameters and processing parameters. The first line of the CSV file contains the column headers, and each subsequent row corresponds to a different spectral line.

An example of a line catalogue entry in CSV format is shown below:

#name,QN(filename),freq(GHz),mol(plot),QN(plot),Aul(logs^-1),Eul(K),cat,NOEMAbb,unit,width(km/s),30msetup,30mbb,30mwidth(km/s),30mline(km/s)
SO,5_5_4_4,215.22065300,SO,"5(5)-4(4)",-3.91491,44.10371,JPL,lo,l009l048,10.0,Setup1,LO,20,5.0
DCO+,3_2,216.11258220,DCO\u+,"3-2",-2.62283,20.74365,CDMS,lo,l012l051,10.0,Setup2,LO,20,3.0
c-HCCCH,3_3_0_2_2_1,216.27875600,c-C\d3H\d2,"3(3,0)-2(2,1)",-3.59211,19.46594,CDMS,lo,l013l052,10.0,Setup2,LO,20,3.0
  • The #name column specifies the common name of the spectral line to be used as identifing the molecule and used in the filename.

  • The QN(filename) column indicates the quantum numbers used for the line identification and in the filename.

  • The freq(GHz) column provides the frequency of the line in GHz.

  • The mol(plot) and QN(plot) columns specify the molecule and quantum numbers to be used for the GILDAS header.

  • The Aul(logs^-1) and Eul(K) columns to provide the Einstein A coefficient and upper energy level of the transition, good information to access easily.

  • The cat column indicates the catalogue source (e.g., JPL, CDMS).

  • The NOEMAbb column specifies whether the line is observed with the lower sideband (lo or li) or upper sideband (ui or uo).

  • The unit column indicates the unit for processing. This is the narrow unit name, this is used for the folder and filename.

  • The width(km/s) column specifies the velocity width for processing in km/s. It is used to

  • The 30msetup and 30mbb columns refer to the 30m setup (e.g., Setup1 or Setup2) and the unit that covers the line (LO, LI, UI, or UO). This information is not used currently when serching for the scans covering the lines.

  • The 30mwidth(km/s) and 30mline(km/s) columns provide parameters specific to 30m data processing.
    • 30mwidth(km/s) is the with of the cube to be made around the line in km/s (cubes covers Vlsr - 30mwidth(km/s) until Vlsr + 30mwidth(km/s)).

    • 30mline(km/s) is the velocity width used to define the window in CLASS to avoid the line for baseline fitting. The baseline is using the following range of velocities: from Vlsr - 30mwidth(km/s) until Vlsr - 30mline(km/s) and Vlsr + 30mline(km/s) until Vlsr + 30mwidth(km/s).