What's New

View new features for Accelerator 2022.1.1.

2022.1.1 Release Notes

New Features

Internal Number Products Case Number Description
VOV-7819 Accelerator 21323 Add to vovselect the capability to specify more FIELD.X items. Added RESOURCES.<RESNAME> to estimate the requested resource value of RESNAME. Added GRABBEDRESOURCES.<RESNAME> which returns the current value of RESNAME in a job's grabbed resources. Similarly, SOLUTION.<RESNAME> returns the value of RESNAME in a job's solution.
VOV-9389 Accelerator AAP23445 The doTestHealthTooManyOutOfQueueJobs health check was enhanced to allow @USER@ as a valid recipient.
VOV-11252 Accelerator Plus AAP25064 vovwxd was modified to use the CONFIG(slave,setName) parameter instead of a constant "WXTaskers:<wxQueue>" set name.
VOV-13421 All None In an upcoming major release of Accelerator Products, the vovselect * wildcard select feature will be dropped. To prepare for this change, users should update scripts and REST requests to issue vovselect requests using a specified list of field names.
For example:
nc cmd vovselect statusnc,id,command from jobs
An easy way to find out what fields are in an object type is by using vovselect fieldname. For example:
nc cmd vovselect fieldname from jobs
VOV-13912 All CS0202834 Fixed an issue with daily log rotation of auto-started daemons.
VOV-14232 All CS0238414 Clarified log messaging when the saving of trace database and/or metrics data is taking longer than expected.
VOV-14449 Accelerator Plus CS0222621 Added CONFIG(failedAgentsCooldownPeriod) parameter for vovwxd that allows you to continue to request agents after the specified period for a bucket that had failed agent jobs. Format is a time specification, a value of 0 disables this feature.
VOV-14451 Allocator CS0255747
  1. Resources - Plot: legend will not overlap the following plot. To solve this, will be displayed in more than one column if needed.
  2. Overview: resource name and all sites columns are fixed in case there are horizontal scroll because the entries in the table are bigger than the visible screen
  3. Overview: table sorting in descending order
  4. Resources - Summary: Table sorting in descending order
VOV-14978 Hero None The Licensing section of the online help has been updated and improved with recent Altair License Manager (ALM) licensing changes.
VOV-15010 Hero None The Altair Hero online documentation has been updated to reflect the changes in functionality and features.
VOV-15085 Monitor None Sorting the File Systems & Process Summary table in Monitor by clicking the header of each column, will sort the table in place in the current working tab, instead of creating a new browser tab with the table sorted.
VOV-15104 Accelerator, Accelerator Plus None The SDS Configuration documentation has been updated to reflect the newly added "enable_jobdata" feature.
VOV-15149 Hero None The following targets were added to the Palladium emulators: BRD144, BRD72, BRD48, BRD36, BRD24, BRD18, BRD16, BRD12, BRD9, BRD8, BRD6, BRD5, BRD4, BRD3, BRD2 and BRD1. Thes targets correspond to placements with the corresponding number of boards. For example, if the emulator is configured with a group named PZ1, then a job for a Palladium with 8 racks would use the resource HERO:PZ1_BRD144. The placement rules are restrictive and will evolve with experience.
VOV-15150 Accelerator, Accelerator Plus None The Accelerator and Accelerator Plus subserver feature has been retired. This feature provided one way of optimizing load on vovserver. Other modern optimization methods will be used for vovserver performance and scalability going forward.
VOV-15206 All None Added description of Security Level = ANYBODY in the online documentation.
VOV-15254 All None Implemented a new event to Kafka on job delete based on topic 'vov-jobdata' allowing these events to be correctly ordered with others for the same job.
VOV-15326 All None Bundled the Altair License Manager server components with the Accelerator Products installation media.
VOV-15408 Accelerator None FAIRSHARE_WEIGHTS within job class definitions are no longer supported. Use vovfsgroup modify FSGROUP.user weight W instead.

Resolved Issues

Internal Number Products Case Number Description
VOV-13165 Accelerator CS0127745, CS0172968 With vovreconciled active, there were cases when license usage counts that were growing during a job's execution, but the correct license grab counts were not being set for the job.
VOV-14407 Accelerator, Accelerator Plus None Fixed issue that prevented the preemption of wxagent jobs when using Direct Drive.
VOV-14533 Accelerator None Add more documentation about multiphase jobs and their dependency on autoRescheduleCount.
VOV-14557 Monitor CS0282491 Fixed issue with removing a license checkout from within the web UI.
VOV-14586 Accelerator CS0285073 The default values of the FairShare group "/" weight & window values were restored to the 2019.01 values.
VOV-14663 Accelerator, Accelerator Plus None Add documentation for the TLS 1.2 and 1.3 configuration parameters that affect the protocols supported by the internal webserver for REST and HTTP requests.
VOV-14822 Accelerator, Accelerator Plus CS0309335, CS0314259 Fixed issue causing nc wait -dir to wait for jobs in other directories.
VOV-14824 All None Fix a DOS vulnerability in vovserver's base webserver from nefarious large HTTP requests.
VOV-14874 FlowTracer None Fixed issue that affected the ability to stop jobs running on an indirect tasker:
  1. Jobs may appear stopped, but will actually continue to run.
  2. A second stop request may cause the tasker to go into a BUSY state for several seconds while the tasker reconciles the fact that the job is still running.
VOV-14959 Accelerator Plus CS0267206 Fixed a bug in vovwxd that caused an infinite loop due to a deleted "top job" of a bucket.
VOV-14969 All None Change description of fields in Admin licensing page when ALM is used.
VOV-14976 Accelerator None Fixed issue that caused the primary license feature to be blank on the licensing web UI page for the first hour after starting up a vovserver that is using node-locked ALM licensing.
VOV-15056 Accelerator, Accelerator Plus, FlowTracer None Fixed issue that prevented the -m option from being used to request a specific host in the bsub emulation command.
VOV-15064 All None Improved structure and clarity is added to the documentation about vovserver configuration parameters.
VOV-15088 Accelerator CS0330199 Taskers that lose connection with their vovserver will no longer treat a hostname lookup error as critical and exit, but will continue trying to reconnect to the vovserver on a periodic basis. This adds resilience in networks where vovserver is hosted on a VM or a container that has been restarted, and where dynamic DNS removes hostname entries when the VM or container are down.
VOV-15092 Accelerator None All phases of multiphase jobs now correctly run at the same priority.
VOV-15093 Accelerator None A problem parsing -mpres1 XXX -mpres2 YYY -mpresN ZZZ was fixed when running multiphase jobs.
VOV-15099 Monitor None New feature introduced in Monitor Administrator page. In the actions column of the "Edit Monitors" table, a new action has been introduced. This action is only available for monitor definitions prescribed in the config.tcl file. Clicking the button with the icon file will bring up the config.tcl file in a browser environment. Any modifications made to this file through the browser will be persisted to the actual file as well. Monitors that were originally prescribed in the config.tcl can only be edited or removed, either by directly modifying the config.tcl through a text editor, or by using the "Edit config.tcl" action in the actions column of the table.
VOV-15100 Monitor None The new Monitor web UI front page is enhanced to display the Altair Monitor product name.
VOV-15106 Accelerator Plus CS0325511 Fixed issue causing nc wait to exit with error "Failed subcommand wait: Illegal object id".
VOV-15112 All None Updated Kafka Job events with additional data.
VOV-15122 Monitor None Fix a problem in the Monitor web UI in the Network > Process > Details page whereby sorting by some of the columns in the table would display an error.
VOV-15140 Monitor None Able to sort by the time related column in the processes table in LM web UI.
VOV-15144 Monitor None Change the look and feel of the Host details page in LM web UI to match the recent re-skin.
VOV-15148 Accelerator, Accelerator Plus None Fixed issue causing nc wait to exit with error can't read "jInfo(exit)": no such element in array.
VOV-15151 Accelerator, Allocator None Fixed issue with SetMinQuantity causing jobs to run despite no allocated licenses on NC site.
VOV-15152 FlowTracer None In some extraordinary circumstances, if a dialog box was closed by pressing "x" button, the entire vovconsole would crash. This has been fixed.
VOV-15172 Monitor None The number of columns in the grid view in Current Utilization Overview page will always be compliant with the preferred number of columns that the user has selected.
VOV-15174 Accelerator None Documentation is added in the REST Tutorial to explain when TLS 1.2 support must be enabled for REST python client programs running on CentOS 7.
VOV-15190 Allocator CS0334264 Fixed issue server config 'maxResMap' update is not reflected in the resource map which is resulting following error messages:
vovserver(54840) ERROR Jun 15 16:22:58 Resource map id not found for resource License:111. 
[mqlib/mqres:596] 
vovserver(54840) ERROR Jun 15 16:22:58 LA:countJobs:Resource License:111 020193489 without a resmap 
[trace/tracelicense:1114]
VOV-15194 Accelerator Plus CS0341461 Job limits can now have periods in them. For example, if a user has a username of "test.user", a command such as this is now allowed:
wx -q $WXQ run -r Limit:@USER@_foo_1 -- sleep 0
VOV-15219 All None Auto-trimming of PostgreSQL database is disabled by default.
VOV-15222 All None Fixed the vovshow -licenses output when ALM is enabled.
VOV-15245 Accelerator, Accelerator Plus CS0325511 Updated nc wait to filter event by jobid when waiting on 10 or less jobs.
VOV-15283 Accelerator CS0351032 Support was added for FIPS enabled hosts on CentOS and other RHEL based hosts.
VOV-15292 All None Some issues dealing with HTTP requests with headers larger than 4K have been addressed. The internal HTTP server now accepts HTTP requests with a total header size up to 16K.
VOV-15305 Accelerator CS0349956 HTTP x-www-form-urlencoded POST data that's passed to CGI pages via temp files is now encrypted with a random key that changes for each request.
VOV-15310 Accelerator Plus CS0355127 Fixed a crash where an ambigious "vovquery select id from 9" results in an object lookup that vovquery doesn't support.
VOV-15315 Monitor None Fix wrongful description of action in alerts page of LM.
VOV-15332 Accelerator CS0318230 Raw power calculations for taskers has been increased from 2.1 million to 2147483647. If the calculated raw power is greater than this number, it will be clipped to 2147483647 rather than being allowed to become negative due to an integer overflow. A warning will appear in the tasker's log indicating that the rawpower exceeded this value and was "clipped" to the max allowable value.
VOV-15334 All None Updated Kafka Job events with additional data.
VOV-15398 Monitor None In LM Detailed Plots fixed:
  1. 1. In the treemap now all rectangles are visible, nothing is cut off.
  2. In the treemap legend nothing is cut off, everything is aligned and the records are ordered in decreasing order.
VOV-15482 Monitor CS0332038 Monitor's batch reports are using the same version charts like the ones found in the browser setting by default. The use of the command line utility remains the same. However, in order to extract the charts and convert them to static images new requirements have been set in place. Node v14 or greater is required to have been installed on the machine and be available in the path environment variable. In addition, manually reverting the LM configuration parameter "lookAndFeel" to its previous value will result to image extraction failure.
VOV-15497 All None Fixed issue in single-mode license accounting.

2022.1.0 Release Notes

New Features

Internal Number Products Case Number Description
VOV-7252 Accelerator AAP20379 Added an email icon to the Job/Node Information page in the Accelerator web UI so the user can email the job description and execution information details to the owner of the job. The email action is by a "mailto:" link, so your browser needs to have "mailto:" link handling enabled.
VOV-9004 All 23042 The system health and notification daemon, vovnotifyd, now has the ability to control the timing at which license expiration and other emails are sent. In vovhealthlib.tcl, you can define a TIMEVAR such as:
source $env(VOVDIR)/tcl/vtcl/vovflexlmdlib.tcl
package require vovurlutils

global HEALTHLIB_PRODUCT_MAP HEALTH_PROCS env

set HEALTHLIB_PRODUCT_MAP(doTestHealthMyFeature) "nc"
# registerHealthCheck doTestHealthMyFeature -checkfreq 10 -forceCheckfreq -mailfreq 10 -forceMailfreq
lappend HEALTH_PROCS(list) doTestHealthMyFeature

proc doTestHealthMyFeature { args } {
    global HEALTH_PROCS
    set homeUrl $HEALTH_PROCS(homeURL)
    # VovMessage "now running doTestHealthMyFeature"
    set subject "TestMyHealthFeature"
    set body "\nHi,\n\nWe ran TestMyHealthFeature.\n"
    doMailNotify doTestHealthMyFeature "@ADMIN" $subject $body
}

TIMEVAR doTestHealthMyFeature {
    Tue {
        suppressMail doTestHealthMyFeature 1
    }
    06:00-08:00 {
        suppressMail ALL 0
        suppressMail doTestHealthMyFeature 0
    }
    default {
        suppressMail ALL 1
        suppressMail doTestHealthMyFeature 1
    }
} 
Where calling suppressMail with a "1" for the specified health check will suppress the mailings for that TIMEVAR. suppressMail accepts either the name of a specific health check, or "ALL" to control all defined health check routines.
VOV-9417 Allocator 23693 Added Totals row to Allocator Resource Summary Report page.
VOV-10250 All AAP24412 If nc wait cannot find related job info the related message now more accurately describes the job as forgotten instead of vanished.
VOV-10545 All 24546 An additional option -verify has been added to the existing vovversion CLI which checks current client and server versions for mismatches. If a discrepancy is discovered, an error message is displayed and a non-zero exit value is returned, otherwise a verification message is displayed indicating the current version number and a zero exit value is returned.
VOV-11001 All 24808 The cron.csh scripts used by vovcrontab now log through vovdailylog.
VOV-11202 All 24995 Enhanced ? tip help message on filter inputs and added clarification on filter inputs for 'Taskers' and 'Resource Statistics' UI.
VOV-11536 Accelerator None The Container Configuration help page has been updated for clarity and now provides newer, better examples.
VOV-11975 Accelerator CS0120839 There is a new entry in cleanup.config.tcl to control when tasker startup logfiles get removed when running vovcleanup. set config(cleanup,taskerStartup) 180d. This controls clean-up of the tasker startup logs separate from the tasker logs. Users can use vovcleanup to prune tasker logs without accidentally removing the startup log if they wish to do so.
VOV-12006 Accelerator CS0120876 Added auto-trim database option to Database Configuration web page.
VOV-12242 Accelerator None Added a new "shrink-to-fit" job submission option -reconcilemem to nc run. In shrink-to-fit mode, a job's memory resource usage will be reduced after a time if actual memory usage is less than requested RAM usage that was specified with "-r ram/NN".
VOV-12400 Monitor None Updated HASP monitors to support the Sentinel LDK Admin version 8 API.
VOV-12858 Accelerator CS0142679 Added ability to pass the -cmp_remotefile option when using Calibre.
VOV-13227 Accelerator, Accelerator Plus AAP25139 Distributed Parallel support now allows -dpres+ to append resources to either -r or -dpres resource specifiers on the command line or in jobclasses via VOV_JOB_DESC().
VOV-13505 FlowTracer CS0182760 Enhanced LSF driver script to allow jobs submission on execution host(s)/cluster(s) based on the following configurations in LSF config.tcl.
  1. CONFIG(target) - specify the space separated list of cluster(s)/execution host(s), jobs will be submitted using bsub -m option.

    CONFIG(targetName,env) - specify a different named environment per cluster/execution host.

    LSFtarget/LSFTarget - specify cluster/execution host per job in the resource string regardless of the CONFIG(target).

  2. CONFIG(cluster) - specify the space-separated list of the cluster(s), jobs will be submitted using bsub -clusters option

    CONFIG(clusterName,env) - specify a different named environment per cluster.

    LSFcluster/LSFCluster - specify cluster per job in the resource string regardless of the CONFIG(cluster).

VOV-13788 Accelerator, Accelerator Plus AAP24926 Added cleanup of /data/jobs for Accelerator and Accelerator Plus. The default cleanup interval is 10 years.
VOV-13795 Accelerator, Monitor CS0186684 Eliminated unnecessary HTTP-requests made by ftlm_lmproject utility to Monitor since the non-FQDN-version of the hostname is always included in the FQDN request.
VOV-13851 Accelerator, Accelerator Plus CS0208070 Added RabbitMQ logs for nc wait and vovselect command. Added config 'VOV_CLIENT_LOG_TLS' in ncConfig/<queue name>.tcl to enable TLS for RabbitMQ connection.
VOV-13900 Accelerator None A change was made to the upper right pane within the Accelerator dashboard page, whereby the server daemons status squares are removed. This pane will henceforth display only alerts.
VOV-13911 All None When stopping a job with nc stop, you can now specify -skiptop [1|0] to indicate whether or not to send the signal to the top process. The top process is usually a wrapper such as vw, unless the job was started without a wrapper.
VOV-13983 Accelerator CS0121174, CS0209718 Changed the pre and post job environments to include the following variables: VOV_JOBID, VOV_JOBPROJ, VOV_JOBSLOT, VOV_GRABBED_RESOURCES, and for Accelerator specifically, NC_JOBID.
VOV-14017 Accelerator Plus CS0212504 Improved procedure of dequeuing Accelerator Plus jobs in case of a repeated agent failures. vovnc.tcl script will find the jobs of the current bucket more efficiently.
VOV-14026 All None Introduced a feature called VovScope, which provides insights into network activities (read/writes) of vovserver and clients.
VOV-14033 Monitor None Implemented a more modern look-and-feel for the Monitor web UI.
VOV-14098 Monitor None Added ability to generate Checkout Statistics report by user and host.
VOV-14139 Accelerator None Preemption Plans can now take the form:
TIME:STATE:SIGNAL,includerx,excluderx,skiptop 
VovPreemptMethod can now take the form: 
VovPreemptMethod resource SIGNAL -signal SIGNAME -include patternList -exclude patternList 

nc preempt can now take the form: 
nc preempt -method TIME:STATE:SIGNAL,includerx,excluderx,skiptop 
VOV-14177 Accelerator None nc stop now supports setting either NC_STOP_SIGNALS or VOV_STOP_SIGNALS with an EXT like format that doesn't involve invoking vovjobctrl for its implementation.

If you reference NC_STOP_SIGNALS, the EXT format is : EXT:SIGNAL:INCLUDERX:EXCLUDERX:SKIPTOP

You can now use :SIGNAL:INCLUDERX:EXCLUDERX:SKIPTOP to do the same thing but without vovjobctrl getting invoked. Job control will be handled directly by the tasker. This also works with VOV_STOP_SIGNALS.

VOV-14178 Accelerator None When starting a job via the REST API, an additional field named nowrapper has been added. This is equivalent to the -n option on the CLI command nc run. The default value is False. If nowrapper is False and the wrapper field is blank or unset, it gets set by the API to vw.

You must set nowrapper=True for a job to be run with no wrapper. When stopping a job via the REST API via a "PUT" request with action='stop', the API now accepts an additional field named "skiptop". If False, the specified stop signal may also be sent to the top job. Normally the top job the is vw wrapper.

When preempting a job via the REST API with a "PUT" request with action='preempt', the API now accepts a syntax for the method field of: TIME:STATE:SIGNAL,includerx,excluderx,skiptop.

This is similar to the updated preemption plan format.

VOV-14274 Accelerator, FlowTracer CS0241168 Default value of -singleuser tasker option can be set with vtk_tasker_set_defaults.
VOV-14289 Monitor CS0243425 Enhanced ftlm_parse_flexlm parser to handle additional data on checkout in the form [user_data=<value>], and display the same under the project in Monitor reports.
VOV-14311 All None The new -alm option is added to the batch_install.csh script to allow the host and port of Altair License Manager to be specified at install time. This is useful when the Accelerator products are to be licensed via Altair License Manager licenses.
VOV-14315 Monitor None Increased performance of matching license checkouts to jobs for NRU cases.
VOV-14419 All None Multiphase support is provided by two additional command arguments to nc run:

-multiphase [1|0] -mpres "resource string" -multiphase 1 enables multiphase jobs.

-mpres sets the resources that will be used for each phase.

The '%' is used as a delimeter for the resources of each phase. e.g. -mpres "linux64 foo%linux64 bar:linux64 baz" By specifying the resources of each phase and designating that certain resources are only allocated to certain taskers, one can run different phases of a job on different taskers.

For example: I have two taskers named tasker1 and tasker2. I want to run phase 1 and 3 on tasker1, and phase 2 on tasker2. My resources may look like:
vtk_resourcemap_set License:blue UNLIMITED License:blue_tasker1 
vtk_resourcemap_set License:red UNLIMITED License:red_tasker2 
vtk_resourcemap_set License:blue_tasker1 1 tasker1 
vtk_resourcemap_set License:red_tasker2 1 tasker2
I could then run a multiphase job as:
nc run -multiphase 1 -mpres "linux64 
License:blue%linux64 License:red%linux64 License:blue" -- -e BASE -D /home/jjmcwill/testDir/testMultistage.sh 
A multiphase job will have two new Job Properties set:

MPRESOURCES: Contains the same resources passed in -mpres, and is used to reset the job resources for each phase.

MPCURRENTPHASE: Contains an integer indicating the current job phase. It starts at one, and has a max value of 9.

What the job script sees: The running job script will see an environment variable named VOV_JOB_PHASE which is set to the current phase. The script writer will need to use that to decide what work to do for that phase. If the script exits with an exit code of 216, Accelerator will increment the job phase, change the job resources, and reschedule the job to run again. If the script exits with an exit code of 0, the job is considered "Done", and MPCURRENTPHASE is reset to 1.

Failed jobs: If a job fails during a phase with a code other than 0 or 216, it is considered FAILED and MPCURRENTPHASE will not increment. If the job is invalided and re-run (e.g. nc rerun -f JOBID), the job will re-run starting at MPCURRENTPHASE and further phases will run if the job exits with code 216, as described above.

Logging: After the first phase is run, subsequent phases of the job will have the command rewritten so that the wrappers are passed "-a -A", telling the wrappers to append to the job log. This is so that all phases of the job get their stdout and stderr logged to the same file. If this was not done, each phase of the job would overwrite the log, and the user would only see the output from the last phase that was run. If Accelerator does not detect one of the standard vov wrappers at the beginning of the command line, it will assume the command is not using a wrapper. In this case, it will look for the standard ">;" redirect symbol in the command and replace it with ">>;".

REST Support: In the payload for submitting a job via rest, two new fields are allowed: multiphase and mpres. Setting multiphase = True enables multiphase job support. Setting the mpres field behaves the same as described for the command line argument described above. Re-running a multiphase job that has failed via the REST re-run API will behave similarly to rerunning a failed multiphase job from the command line as described above.

VOV-14420 Accelerator, Accelerator Plus None Updated tclrmq library to (1.4.5) currently (1.3.8) to support TLS connection for RabbitMQ.
VOV-14477 All, Accelerator, Accelerator Plus CS0257852 Taskers running as non-root will no longer get sent jobs unless the job's user matches the non-root tasker's userid. This is to address a situation where a job running on a non-root tasker gets access to the user's data on the filesystem. This policy can be disabled by setting the allowForeignJobsOnUserTaskers configuration parameter to 1.
VOV-14488 Accelerator Plus CS0260943 Added new configuration parameters for fine-grained control of vovwxd.
VOV-14490 All None The newly added VovScope feature has been added to the online help documentation.
VOV-14511 All AAP25265 Increased maximum number of normal clients per user limit to 250K from the existing 100K.
VOV-14536 Accelerator Plus None When using wx, the EXTLINKS property on the project will contain information about each base queue. Previously, the format of EXTLINKS was "queueName1 URL1 queueName2 URL2 ..."

This has been changed so that the queueName also contains a status of whether or not vovwxd sees the status of the basequeue as ONLINE or DOWN.

The new format of EXTLINKS is: "queueName1:status=STAT1 URL1 queueName2:status=STAT2 URL2 ..."

When visiting the web UI for the wx project, the link for a base queue that is down will be shown in red instead of the customary gray.

VOV-14561 Accelerator CS0267219 Reduced process tree log frequency from every ~30 secs to once in 10 minutes.
VOV-14571 Accelerator CS0266023 In order to provide more control over the creation of FairShare subgroups, a new acl type named CREATE was added as well as a new server config param called fairshare.strictNodeCreationChecks. The default for fairshare.strictNodeCreationChecks is 0, which means a user only needs the ATTACH acl to create sub-groups or to run a job on an existing group. If fairshare.strictNodeCreationChecks is set to 1, however, a user will need the CREATE acl in order to create sub-groups. They still need the ATTACH acl to run jobs under an existing group. If you set the debug flags FairShareGroups and acl, server.log will contain messaging about the acl checks being made to allow or deny creating a FairShare sub-group.
VOV-14587 Accelerator Plus None In WX DirectDrive setup, the onBucketProcess Tcl procedure is enhanced to support parameters for the agent job in the base queue. The BUCKETINFO array will be initialized with the following fields.

BUCKETINFO(AUXRESOURCES)

BUCKETINFO(BUCKETID)

BUCKETINFO(COUNT)

BUCKETINFO(FSGROUP)

BUCKETINFO(JOBCLASS)

BUCKETINFO(JOBPROJ)

BUCKETINFO(OSGROUP)

BUCKETINFO(PRIORITY)

BUCKETINFO(RESOURCES)

BUCKETINFO(USER)

BUCKETINFO(XDUR)

BUCKETINFO(maxidle)

BUCKETINFO(maxlife)

BUCKETINFO(taskerAutokill)

BUCKETINFO(taskerCmd)

BUCKETINFO(taskerReservation)

BUCKETINFO(taskerResources)

The user can set values in vovaccel.tcl.

VOV-14589 All None For products other than Hero 2.0, parameters to the resource expression comma operator are required to be Monitor features. The resource expression processing no longer allows resource loops in resource maps.
VOV-14616 Accelerator None The output of nc hosts is changed slightly to make the information about running jobs per tasker clearer. The "JOBS" column has been replaced with 2 columns: "RUN/SWP" and "SLOTS".
VOV-14629 Hero None The command passed to hero_adapter is wrapped by vtool -f Emul:<emul-name> <leaf-count> so that the corresponding feature checkouts and checkins can be tracked using Monitor.
VOV-14704 Monitor None Improved the ability for vovresourced to auto-discover Monitor metadata, such as the SSL and web port settings.
VOV-14853 All None Implemented new events to Kafka on job status updates, property changes, and delete jobs. This functionality is controlled using config 'enable_jobdata' in sds.cfg and this is not supported for array jobs.
VOV-14871 Monitor None If there are alerts present on the system, the favicon of the tab will cycle between a red dot and the default icon. Additionally, presence of alerts are notifiesd to the user by a "triangle alert" icon that gets prepended to the label of the Home tab in the main navigation bar. The color of this matches the alert level, the icon demonstrates a pulsation effect.
VOV-14888 All None The Supported Platforms online help page has been updated to reflect the changes implemented for 2022.1.0.
VOV-14992 Accelerator None In the 2021.2.0 release and later, the processing of jobclass initialization has been taken over by the new autostart script start_init_jobclasses.tcl in SWD/autostart, and the new liveness script live_init_jobclasses.tcl in SWD/tasks.
VOV-15000 Accelerator None The -reconcilemem option of the nc run command was updated in the CLI help.
VOV-15002 Accelerator, FlowTracer None The parameter allowForeignJobsOnUserTaskers has been added to the online help.

Resolved Issues

Internal Number Products Case Number Description
VOV-8079 All 21734 Fixed issue with using Environment Modules from within a sh/bash shell. The "module unload" operation was failing.
VOV-8648 FlowTracer 22241 Added -running option to vovstop to provide the ability to stop running jobs without dequeueing queued jobs.
VOV-8752 Monitor 22547 Fixed use of LDAP Email addresses in Monitor reports when VOVLM(ldapEmail) is set to 1 in <SWD>/config/web.cfg.
VOV-8871 Monitor AAP22766 Fixed legend spillover on Monitor usage comparison plot with many items.
VOV-9166 All 23318 Fixed issue where an invalid slave resource expression would result in a permanent error message on the Slaves web page.
VOV-9638 All CS0288154 Added a -nolog option to the vovgetnetinfo utility that will prevent the creation of a log file in the SWD.
VOV-9776 Accelerator 24061 The form for submitting a slave reservation now checks that the data is valid before actually submitting the reservation.
VOV-9897 All AAP24162 Initialize VOV_JOB_DESC variable to avoid errors when listing job classes.
VOV-10568 All CS0120645 HyperThreading fieldname 'HT' on taskers previously reported incorrectly if HT was available. The field now properly reports if SMT threading (either Intel HyperThreading or AMD SMT) is currently available and enabled by checking value(s) of either the /sys/devices/system/cpu/smt/control, /sys/devices/system/cpu/smt/active pair or /sys/devices/system/cpu/cpu0/topology/thread_siblings_list (or equivalent). directly. For /sys/devices/system/cpu/smt/control based files the values must now be both "on" and "1" or HT field will be reported as disabled. If you are having problems with tasker HT field values on a particular system please contact Altair support with details on your system information such as distribution name, release version, kernel version, method of HT/SMT configuration and expected values.
VOV-10904 Monitor None Fixed issue that prevented the Network > Processes > Details report from displaying the processes for the specified host(s) and/or user(s). Also fixed issue on this report that caused the UI to jump the batch reporting page if table sorting/filtering was used.
VOV-11292 Accelerator Plus AAP25068 When submitting Accelerator Plus jobs where you are specifying the name of the base queue, vovwxd would incorrectly schedule the correct number of base queue jobs based on minQueuedPerBucketPerNCQueue and maxQueuedPerBucketPerNC, often scheduling more than allowed. This has been addressed.
VOV-11971 Allocator None New Altair Allocator procedures LA::MarkVqAsOoq and LA:MarkResourceVqAsOoq have been added to the online help.
VOV-12137 Accelerator CS0120974 nc modify now accepts the syntax -G /group.user to change the FairShare group of a queued job, and will allow the user to be different than the job's original FairShare user so long as the client has the correct ATTACH ACL on the new fsgroup.
VOV-12253 Monitor CS0120851, CS0203350 Enhanced MathLM parser to handle space in feature name.
VOV-12510 Monitor CS0265568 Initialization of the crypto signing key for web auth tokens was moved earlier in the vovserver initialization to address possible issues with trying to log into web pages while vovserver was still starting.
VOV-12852 Monitor CS0132965 Fixed parser error "can't read "feature": no such variable" and incorrect capacity value due to usage info not available.
VOV-13088 Allocator, Monitor CS0155322 Added an alert on server buffer overflow for all the products. Alert message will contain the client name if defined else unnamed client with fd would come.
VOV-13094 Monitor CS0193004 Adjusted the Sentinel parser to handle features with unlimited capacity.
VOV-13142 Allocator CS0160431 If a daemon is already running, starting a daemon from web UI is a no-op (in previous releases daemon was stopped and started).
VOV-13150 Accelerator, Accelerator Plus CS0160883 When visiting a URL in the web UI (such as for a newly created job), a user who has not been authenticated would sometimes not get properly redirected to their original URL after logging in. This has been fixed.
VOV-13165 Accelerator CS0127745, CS0172968 vovreconciled kept on repeatedly adding "NRU resources with same handles" back to the job. This has been fixed.
VOV-13219 Monitor CS0165626 Updated embedded LM widgets URL in lm_widgets.php.
VOV-13294 All CS0170164 Updated support email address.
VOV-13348 FlowTracer CS0172882 Fixed issue with vovconsole -fontsize option for not changing the node label font.
VOV-13389 All None Fixed an error when clicking the Cleanup All Cached Files Used for Tasker Load button on the web page for a tasker.
VOV-13418 Monitor CS0175623 Fixed custom time range issue in Monitor Batch Reporting which is resulting in zero size report.
VOV-13422 All CS0175205 The vovdaemonmgr command must be run on the vovserver host, and if not, will result in an error message.
VOV-13548 Accelerator CS0185445 Fixed issue that caused the equal sign to be lost when copying negated resource requests from a preempted job to its resumer job.
VOV-13579 Accelerator CS0187071, CS0213289 Fixed a web UI bug when viewing multiqueue preemption rules with watched resources.
VOV-13595 Allocator CS0188280 Suppressed the log "Changing site ID of host..." which can be controlled by using the flag MicroCode.
VOV-13669 Monitor CS0192634 Added support for renaming custom groups and custom group types that contain SQL-sensitive characters. Also added the ability to create and rename custom groups and custom group types via the ftlm_accounts CLI utility.
VOV-13671 Monitor CS0193959, CS0244917 The ftlm_deobfuscate utility now resets the obfuscation count in the database for specific files that have been requested to be processed.
VOV-13698 Accelerator Plus None Fixed tasker premature idleness exit.
VOV-13704 Monitor CS0190737, CS0237022 TLS 1.3 support is added in connections to the vovserver webport when the "internal" webprovider is active in this release, and TLS 1.2 is diabled by default. The enablement of TLS 1.2 and specific cipher suites is possible with the addition of new configuration parameters. 4 new configuration parameters have been added to allowing tuning of what SSL/TLS versions the internal webserver supports. You can view and modify these via vovservermgr config and make them permanent inside policy.tcl:

http.minSSLVersion - set the minimum TLS version supported. Defaults to "TLSv1.3". Valid values are TLSv1, TLSv1.1, TLSv1.2, and TLSv1.3

http.maxSSLVersion - set the maximum TLS version supported. Defaults to "TLSv1.3". Valid values are TLSv1, TLSv1.1, TLSv1.2, and TLSv1.3

http.tls12Ciphers - this is a list of ciphers that TLS 1.2 and earlier will be restricted to. The default is an empty string, but it can be set to a string such as "ECDH+AESGCM:ECDH+AES256:ECDH+AES128:DH+3DES:!ADH:!AECDH:!MD5"

For an explanation of the string format, see "Cipher List Format" at https://www.openssl.org/docs/manmaster/man1/openssl-ciphers.html for details on what's acceptable.

http.tls13Ciphers - This is a list of cipher SUITES that is used by TLS 1.3 (and presumably above if newer TLS versions are introduced in the future) See the description of SSL_set_ciphersuites() at https://www.openssl.org/docs/manmaster/man3/SSL_CTX_set_cipher_list.html for an explanation.

By default the value is an empty string, which tells OpenSSL to use the default, which according to the documentation is TLS_AES_256_GCM_SHA384:TLS_CHACHA20_POLY1305_SHA256:TLS_AES_128_GCM_SHA256"

Changing any four of these values via vovservermgr config while the server is running will cause the internal webserver to stop and restart with the new configuration. We recommend using a tool such as sslscan that is part of RHEL 8 to verify your changes.

VOV-13708 Accelerator None The Queued jobs numbers format has been modified to integer format.
VOV-13709 Accelerator None A fix has been added to address the browser refresh issue of the Set Details page.
VOV-13728 Accelerator None In the web UI, the following changes have been made:
  1. The filter is most useful to sort out the jobs/Files. So, it has been moved to down to just above the Jobs/Files table and cause it to search only within the Jobs/Files list.
  2. The "Jobs" list has been renamed to "Jobs and Files" because sets contain jobs and files.
  3. Added the count of items (both the Jobs and Files) in the "Jobs and Files" list. eg: "Jobs and Files ( 237 )".
  4. For files in the Jobs and Files table, omit the Status column has been omitted (leave it blank) and fix has been added to the "Type" column to say "File" instead of the incorrect "Job."
VOV-13729 Accelerator None The Sets names listed in the Recently Visited and Favorites lists in the Set Browser at the left side were made into hot links so that it would be easy to see the corresponding Set Detail page. Added fix to the Jobs Running and Jobs Queued numbers, for the Sets names listed in the Recently Visited and Favorites lists in the Set browser at the left side.
VOV-13733 Accelerator, Accelerator Plus CS0196065 Fixed Running jobs CGI error when taskers are missing.
VOV-13734 Accelerator CS0189976 Fixed issue that prevented the VncPolicyValidateEnvironment procedure's results from being applied to the job environment when using the the -f option to process a command file.
VOV-13738 Accelerator Plus None Fixed a bug that caused a vovwxd error, when adding non existing base queues.
VOV-13783 Accelerator, Monitor None When selecting ports for the web port, vov port, and read-only port, vovserver now does additional checking to ensure the same port number isn't inadvertently used for multiple services. Also, vov client connections now have a default send and receive timeout of 30s, but this can be controlled by the environment variables VOV_SEND_TIMEOUT and VOV_RECV_TIMEOUT. These environment variables were supported on Windows in an earlier release but are also honored on Linux. This should prevent commands like vovwait4server from hanging indefinitely if it accidentally tries to connecting to an listening port that is NOT the vov port.
VOV-13790 Accelerator CS0203368 PTY_ERROR property is set to YES for interactive job when its terminal of has been killed.
VOV-13798 Accelerator CS0204909 Fixed the error "USER ERROR: Failed subcommand info: can't read "compatInfo(685614164,info)": no such element in array" for 'nc info -c <jobid associated with taskerlist>' .
VOV-13843 Monitor CS0208377 Added a new option (-removelockfile) to lmmgr start. This instructs the script to remove the server lock file if it already exists.
VOV-13846 Accelerator CS0202837 Added new -confirmafter option to the vovtaskermgr start operation that waits for a specified amount of time, then confirms whether each tasker that was requested to start did, in fact, start.
VOV-13848 Accelerator Plus None The bottom line of the Project Home > Taskers table was updated to indicate the actual number of taskers displayed and the total in the current selection.
VOV-13855 Accelerator None The http.workerthreads parameter has been marked as Obsolete in the online documentation.
VOV-13868 Accelerator CS0209055 Fixed usage of VOV_MAX_WAIT_TO_RECONNECT and VOV_MAX_WAIT_AFTER_CRASH environment variables so that one does not override/affect the other.
VOV-13869 All None vovtaskermgr/vovslavemgr configure now supports -maxwaittoreconnect.
VOV-13874 Accelerator None The cycle length bullet has been modified from showing 'scheduler time' value to the inverse of 'cycle frequency' value.
VOV-13877 Accelerator, Accelerator Plus None nc hosts -ALL no longer shows duplicate consumables and extras. We added -rl option to show the legacy resource output. Note that -rl only shows the partial list of resources. nc hosts -r now shows a complete list of resources, including their status.
VOV-13910 Accelerator Plus CS0213212, CS0303516 If a job is dispatched to a tasker that is in the process of exiting, the job will be refused by the tasker and automatically rescheduled for execution up to the maximum number of times allowed by the autoRescheduleCount server configuration parameter.
VOV-13914 Accelerator Plus None Added documentation of the startWXLauncher configuration parameter to all vovwxd configuration file examples.
VOV-13925 Allocator None Corrected the resource map UI title to have only the resource name without extra parameters (like quantity).
VOV-13930 Allocator None Fixed error "no such element in array" on Allocator Resource Overview page.
VOV-13945 Accelerator, Accelerator Plus CS0212259, CS0222585 Interactive jobs using the vwi script no longer change the SHELL environment variable.
VOV-13954 Accelerator None The footer of nc -h previously contained unsubstituted Tcl expressions. This has been fixed.
VOV-13961 Accelerator CS0204211 nc run -e "<option>" now supports arbitrary length option, constrained only by the value of the maxEnvLength policy variable.
VOV-14005 Monitor CS0219301 Fixed Expire column content for feature with version pools. Will show:
  • "never" when times are:
    • specially formatted patterns
    • over 12/31/2099 in the future
    • causing errors when processing
  • "expired" - all times are in the past
  • <Actual Date> - earliest expiration time in the future

Will NOT show: "some expired" - deprecated

VOV-14007 Accelerator, Accelerator Plus CS0213217 The SNAPPROP environment now prepends relevant VOVDIR paths to PATH in a manner similar to the SNAPSHOT environment.
VOV-14011 FlowTracer None Indirect tasker improvements:
  1. When using the tasker configuration path (vtk_tasker_define), ensure that the correct tasker-based procedures are defined and called.
  2. Define NC_QUEUE in the indirect tasker's execution environment using the queue name that is specified in the vtk_tasker_nc call, which removes the requirement to create a taskerVNC.tcl in $VOVDIR/local that defines it.
  3. Fix major efficiency issue that prevented the indirect tasker from notifying the server that it is ready to accept another job once one has finished.
  4. Improve messaging to display the queue name that is being interfaced with for debuggability purposes.
VOV-14012 Accelerator Plus CS0221575 Accelerator Plus in direct drive mode will now detect wait reason changes from the base queues for buckets without incoming jobs.
VOV-14052 All None The URL shown in the lmmgr, ncmgr, and vsi outputs now reflect the value of VOV_HOST_HTTP_NAME, if set.
VOV-14073 Accelerator None Fixed issues with bjobs -o option that prevented the specified output format from being generated.
VOV-14077 Accelerator None The "Use Altair Accelerator's REST API to Submit and List Jobs" tutorial has been updated and improved.
VOV-14091 Accelerator CS0227548, CS0296810 Queued Job is properly dispatched when one of the running jobs was suspended with -manualresume flag
VOV-14095 Hero None New implementation of Hero leveraging the Accelerator job submission engine. Native support for calendar based reservations and a connector for metrics reporting via Monitor.
VOV-14112 Accelerator CS0229221 Fixed logic to properly preempt Jobs with no requested License resouces.
VOV-14128 All None Some minor typos were fixed in product help.
VOV-14153 Accelerator, Accelerator Plus None Fixed issue that prevented vovtaskermgr stop -sick from stopping sick taskers that still have a client attached to the server.
VOV-14156 All CS0233956 The vovcleanup utility now removes empty directories which are not expected to be persistent. This includes job profile and wave directories under SWD/data. Additionally, a new configuration item has been added to independently control the cleanup of resource-based wave data files: set config(cleanup,waves,resources) 90d
VOV-14158 Accelerator None Fix truncated output from vovversion -clients.
VOV-14182 Monitor None LM checkout usage, denial statistics, usage comparison, and usage trends along with ftlm_batch_report with same options can be done now "by user/host" in addition to other available options.
VOV-14217 Accelerator Plus CS0237217 An issue that prevented dp jobs from successfully being run via Accelerator Plus has been resolved.
VOV-14228 Accelerator None The documentation for logging interactive jobs has been improved to clarify which options are best suited for specific scenarios.
VOV-14290 All None The output format of of the nc hosts command has changed. Instead of showing suspended jobs as part of the count of running jobs, we now show suspended jobs as a separate category. For example, if a host has 8 running jobs and 1 suspended job with 8 total slots, the "JOBS" section of the output for that host would previously read "9/8". It will now read "1/8/8", formatted as <suspended count>/<running count>/<total slots>. This change also pertains to relevant sections of the HTML UI.
VOV-14307 Accelerator CS0245432 Fixed bug where various web pages were sending lists of items with wrong delineator and also added delayed auto refresh to certain job and project based pages when starting/stopping/removing listed items.
VOV-14359 Accelerator, Accelerator Plus, FlowTracer CS0160432 Fixed issue that could cause jobs that are dispatched to an indirect tasker to be spawned with /tmp as their working directory following a restart of the base queue to which the indirect tasker is connected. Note that this fix applies to indirect taskers defined by vtk_tasker_nc only, with taskers defined by the legacy vtk_slave_nc call remaining the same.
VOV-14362 Accelerator, Accelerator Plus, Monitor CS0248090 Fixes a bug where versions of LM prior to 2021.1.0 could not communicate properly with 2021.1.0 and later versions of NC, and versions of NC prior to 2021.1.0 could not communicate properly with 2021.1.0 and later versions of LM.
VOV-14369 Monitor None Fixed issue that caused ftlm_batch_report to hang when generating reports with static images or when extracting static images from a previously generated report that contains dynamic images.
VOV-14372 Accelerator None Fixed issue on Windows that prevented the specified job environment from being applied for execution.
VOV-14430 Accelerator, Accelerator Plus CS0254035 VOV_JOB_DESC(jobclass) global variable is set for every job class when sourcing its TCL config file.
VOV-14435 FlowTracer None Fixed skip jobs timestamp issue due to which its valid outputs turned invalid.
VOV-14454 Accelerator None Fixed an issue where an SSL cert that contains a CA chain was not handled properly by the internal webserver, causing some SSL clients and libraries (curl, Python) to issue an error about the validity of the SSL certificate being offered by the webserver.
VOV-14455 FlowTracer CS0257984 Fixed local resource accounting error when using array submission with FlowTracer and vovwxd with an LSF base queue that resulted in lingering tasker definitions that consumed local limits.
VOV-14459 All None Fixed issue where specifying an instance name was not being handled correctly by the vovreadlic utility, resulting in the instance not being found in the VOV registry. Also added support for ALM licensing.
VOV-14463 Accelerator Plus CS0259047 Fixed error when tasker cannot reconnect to Accelerator Plus after server was frozen and restarted on different port
VOV-14468 Accelerator, Accelerator Plus None A defect was fixed where stopping and starting a daemon through the web UI could cause the web page to show an error rather than correctly reload the daemons status page. This only occurs when webserver is set to internal. The error was cosmetic in that the user could reload the daemons page or any other page, and vovserver also continued to operate normally.
VOV-14470; VOV-14581; VOV-14639 Accelerator None Fixed a crash in the internal webserver when certain URI requests were received.
VOV-14487 FlowTracer None Fix a problem with FlowTraccer when used with the vovwxd daemon. In certain cases, an insufficient number of demand jobs were launched to the base queue for a bucket.
VOV-14492 Allocator CS0284342 Fixed issue causing double counting of NRU matched tokens, which is resulting in underutilization of licenses.
VOV-14494 Accelerator, Accelerator Plus, Allocator, Monitor CS0259047 Set VOV_SO_REUSEADDR environment variable to set VOV server port option to be reused when VOV server is restarted.
VOV-14495 Accelerator None Restored an alert that is generated by vovresourced when there is no match to a license feature in LM that has been requested specifically by vtk_flexlm_monitor.
VOV-14497 Accelerator, Accelerator Plus CS0262302 Fixed issue which is causing array SNAPPROP property error 'SNAPPROP environment: continue sentinel missing'.
VOV-14498 Accelerator CS0261544 Handled temporary directory write permission error by creating a temporary script in the current directory when /tmp, /var/tmp, /usr/tmp don't have write permission.
VOV-14503 Accelerator, Accelerator Plus CS0261567 Enhanced job modification through the web UI or NC GUI to append the NC_MODIFY_LOG property and log the modification details.
VOV-14535 Accelerator Plus None Fixed a bug in vovwxd that was causing it not to process new buckets in case when a base queue was inaccessible.
VOV-14563 All CS0278895 Fixes an issue where large queries made using vovselect, vtk_select_loop or related commands may attempt to refresh their data infinitely, causing a slowdown in the responsiveness of the server.
VOV-14564 All CS0276473 Removed an unwanted message from the server log that is printed when a license key file is processed.
VOV-14566 Monitor None Fixed an issue with the fatal error handler in ftlm_parse_flexlm that caused the parser to exit uncleanly instead of printing the error and generating an alert.
VOV-14572 Accelerator CS0265596 A bug was fixed that was responsible for errors being generated when running vovnotifyd.
VOV-14599 Accelerator None A bug was fixed that prevented job classes from being created via the web interface.
VOV-14611 Accelerator   In prior releases users were able to specify the "-r slots/N" resource spec on nc run with values of N that were not equal to 1. This behavior was not recommended in the past, and is no longer supported in this release. The SLOTS resource spec in -r will be ignored (set to SLOTS/1) going forward. If a job will use more than one processor, use the CPUS or CORES resource instead of SLOTS.
VOV-14614 FlowTracer CS0277985 Added null pointer safety within the "vrt" job wrapper's instrumentation library.
VOV-14640 FlowTracer CS0294142, CS0323287 Fixed a bug in vovwxd that was causing improper deletion of sick tasker objects
VOV-14683 All None Fixed issues with the vsi, vovbrowser, and nc run commands that prevented them from showing the correct URL to the web UI under certain configurations.
VOV-14693 All None Some web security vulnerabilities were fixed.
VOV-14712 All None The nginx webserver, which is used when the "webserver=nginx" configuration parameter is set, has been updated to the latest version 1.21.0.
VOV-14722 Accelerator 123 An issue with partialTool for DP jobs was identified, where it could incorrectly bind to just the IPV4 or the IPV6 port. It has been fixed to bind to both IPV4 and IPV6 ports simultaneously if the kernel has them enabled.
VOV-14724 FlowTracer CS0301482 Update handling of state reported by bjobs to include PSUSP, SSUSP, USUSP & UNKWN.
VOV-14725 Hero None Hero 2.0 configuration updated to deal with Palladium Z2.
VOV-14735 Accelerator, Accelerator Plus CS0220480 Changed the output of the nc hosts command to improve clarity. Fixed a bug where a tasker with more than (max capacity - capacity) suspended jobs would cause the server to erroneously mark queued jobs as running.
VOV-14737 Accelerator None Corrected some documentation about a distributed parallel job query command example.
VOV-14745 Monitor None Fixed issue that caused permanent server licenses to be reported as expired.
VOV-14790 All None The openssl library used by various parts of Accelerator was upgraded to the latest version to incorporate the latest security patches.
VOV-14822 Accelerator, Accelerator Plus CS0309335, CS0314259 Fixed issue causing nc wait -dir to wait for jobs in other directories.
VOV-14824 All None Fix a DOS vulnerability in vovserver's base webserver from nefarious large HTTP requests.
VOV-14830 FlowTracer CS0310567 Removed the environment variable listing at the beginning of the tasker log file for WX taskers.
VOV-14840 Accelerator None The "group" array field for VOV_JOB_DESC erroneously listed both "-g" and "-G" as options. Only "-g" is appropriate. The "-G" option has been updated to reflect the appropriate field, "group,final."
VOV-14870 Monitor CS0312659 If the output of batch reporting is set to multiple files, the tag filter should be taken under consideration.
VOV-14906 All None Fixed Altair License Manager license checkout failure due to version mismatch.
VOV-14907 Accelerator Plus, Hero None A crash was fixed where vovserver may refer to a bucket ID that's not a valid bucket.
VOV-14911 Accelerator CS0304721 Changed the use of the disable file access parameter in Tcl to be in line with the cpp implementation. When file access is set to 1, no one has access to the contents of a file, including the owner of the queue. If the file access is set to 2, then only the owner of the queue has access to contents of files.
VOV-14960 Accelerator None The environment variable LD_LIBRARY_PATH is no longer set by the BASE environment.
VOV-14997 Monitor None Add a rule in LM CSS that hides any img child element. If the LM CSS is not loaded, that's the case for all other products the wrapping element has no effect as a result the img .gif will get displayed. Basically fallback to old icons if bootstrap is not loaded.

2022.1.0-p1 Release Notes

Resolved Issues

Internal Number Products Case Number Description
VOV-14822 Accelerator, Accelerator Plus CS0309335, CS0314259 Fixed issue causing nc wait -dir to wait for jobs in other directories.
VOV-15106 Accelerator Plus CS0325511 Fixed issue causing nc wait to exit with error "Failed subcommand wait: Illegal object id".
VOV-15115 FlowTracer    
VOV-15148 Accelerator, Accelerator Plus   Fixed issue causing nc wait to exit with error "can't read "jInfo(exit)": no such element in array"
VOV-15245 Accelerator, Accelerator Plus CS0325511 Updated nc wait to filter event by jobid when waiting on 10 or less jobs.

Previous Releases

2021.2.0 Release Notes

New Features

Products Internal Number Case Number Description
Accelerator VOV-12728 CS0133915 Added new usage options to nc modify to allow an Admin to increase/decrease grabbed resourcemaps for running jobs and to allow selection of jobs for all nc modify use cases by specifying a selection rule.
Accelerator VOV-12822 None The web UI dashboard page has a Counters section that shows the number of active Users for an NC queue. This counter had been 0 in past releases. This has been fixed to show the correct value.
Monitor VOV-12955 None Jobclass initialization has been moved into the liveness script live_init_jobclasses.tcl instead of vovresourced.
Accelerator VOV-12956 None Timevar definitions are now processed in two configuration files. Timevar devinitions may be place now in new configuration file SWD/config/timevars.tcl, where they are processed by the new VOV liveness script. This new config file is the preferred place for Timevar definitions going forward. Timevar definitions in SWD/resources.tcl will also continue to be processed by vovresourced for compatibility with prior releases.
Accelerator VOV-13090 CS0141521 Support has been added to allow custom values to be used for PIPELOG related ports and range via VOV_PIPELOG_FIRST and VOV_PIPELOG_RANGE. Also VOV_CONTAINER_NETWORK_PROXY environment variable has been added to better support nested container resources, see example container config file containers/c3-enter.sh for more details.
Accelerator Plus, Hero VOV-13136 None Integrated Accelerator Plus and Hero with Altair License Manager for both node-locked and floating licenses.
Allocator, Accelerator VOV-13137 None Integrated Allocator and Accelerator with Altair License Manager for both node-locked and floating licenses.
Accelerator VOV-13625 None Added new NUMUSERS field to the SERVER object. The new field contains the count of users who are currently connected to vovserver with a web or CLI client or a running job. Vovserver updates this field every 10s.
Accelerator VOV-13654 None In Dashboard UI, the server vital signs widget will have the donut under the bullet graph bars.
Accelerator VOV-13774 None By setting the new server configuration option, vovservermgr config slave.childProcessCleanupExclusions someChildDaemon in conjunction with setting vovservermgr config slave.childProcessCleanup 1, users can now specify by name named process exclusions to the child cleanup process. If set in conjunction with slave.childProcessCleanup, then slaves should kill all of a job's child processes when that job exits, except for those named here for slave.childProcessCleanupExclusions in a comma separated list. The default value if not set is empty string, "".
All VOV-13806 None The web server used to provide Accelerator products' web UI interface and HTTP interface to the main server is changed from the internal web server to nginx in this release. If the internal web server is preferred, it can be selected via the -webprovider option on the ncmgr start command. The impact to users will be that with the new default nginx web server, the REST v3 interface and the Accelerator administrator web UI dashboard page will not be available for use. If either of these capabilities are needed, you should select the internal web server option when the Accelerator queue is started.
Accelerator VOV-13839 None With the webport enabled and the webprovider set to "internal" to use the REST service, worker threads dedicated to servicing REST requests have a label of either "RESTService" or "RESTRequestHandler" This can be seen by calling ps -T -p PID with PID being the process id of vovserver.
All VOV-13841 None Added new liverecorder.mode configuration parameter for taskers that can be used to specify whether the main tasker process, the subtasker process, or both processes should generate a LiveRecorder recording file. Note that recording files will be generated per job that is executed if subtasker recording is enabled. The default mode is for the main tasker process to be the only one to generate a recording.
All VOV-13947 None The Installation Guide has been updated to include the information for the Altair License Manager.
FlowTracer VOV-13372 CS0182759 Enhanced Job status bar to show the colors for the valid and failed jobs based on their exit status. This functionality can be controlled by ::VovGUI::configJobStatusBar in the gui.tcl file. All valid jobs with assigned color will appear before default valid (green color), same for failed jobs.
FlowTracer VOV-13135 None Integrated Altair License Manager with FlowTracer for both node-locked and floating licenses.
Hero VOV-13138 None Integrated Hero with Altair License Manager for both node-locked and floating licenses.
Monitor VOV-13133 None Integrated Altair License Manager with Monitor for both node-locked and floating licenses. Added a config key config(alm.enable) in policy.tcl to enable the ALM licensing. The default license manager for Monitor remains the Reprise License Manager (RLM).
Monitor VOV-13776 None Implemented Grace Period for Altair License Manager.
Accelerator, Accelerator Plus VOV-13778 None Implemented licensing modes 'Full' and 'N' for Altair License Manager for Accelerator and Accelerator Plus. This can be set using config key config(enterpriselicense) in policy.tcl.
Monitor VOV-13777 None Enhanced license UI to show Altair License Manager status and make changes to the current license environment.
Allocator, Monitor, Accelerator VOV-13134 None Updated Allocator, Monitor, and Accelerator with the actual (new) features names for Altair License Manager.
All VOV-13617 None The SSL implementation has been upgraded using the latest third party libraries, OpenSSL version 1.1.1.

Resolved Issues

Products Internal Number Case Number Description
All VOV-12564 None Changed default LiveRecorder log directory from . to /tmp. This mainly affects the default behavior for vovserver, whose working directory is the parent of the SWD, which is often stored in an NFS-based location.
Accelerator VOV-12822 None The web UI dashboard page has a Counters section that shows the number of active Users for an NC queue. This counter had been 0 in past releases. This has been fixed to show the correct value.
All VOV-13180 None Robustness changes to mitigate vovsh backtraces linked to query memory management.
All VOV-12963 CS0143849 Fixed issue that could cause object IDs to improperly recycle after multiple rollovers.
All VOV-13950 None test.check_TIMEVAR.sh was primarily failing due to clock skew. This fixes that.
All VOV-14075 None Fix the internal web provider's HTTP responses to include security headers. Previously, the internal web provider did not correctly return the security headers in the following case: when HTTP requests were made to the vovserver web port, responses did not include the standard HTTP security headers for URLs beginning with "/doc".
All VOV-7887 21377 Clarified documentation of VOV_LIMIT_vmemoryuse.
All VOV-14143 None Added missing vovservermgr.bat and vovclientmgr.bat scripts to the Windows package.
All VOV-13183 None Fixed a potential memory leak when a client running a long-running query is unexpectedly terminated, causing vovserver to permanently mark a query as "in-use".
All VOV-14061 None Fixed HTTP security header setting which prevented some icons to not appear in the Altair online documentation.
Accelerator VOV-13921 None In the past, when you changed an initJobClass procedure, you had to restart vovresourced to put the change into effect. Restarting vovresourced is no longer necessary. The online help has been reflected to show this.
Accelerator VOV-13764 None Changing http.proxytimeout requires the REST service be restarted if it is already running. This can be done by setting the webport to 0, waiting 30 seconds, and then setting it back to the desired webport number.
Accelerator VOV-13380 None If the vovserver web interface is enabled by setting the web port, then under some high load conditions the web port interface would hang up. In the 2021.2.0 release, the default web server reverts to nginx, which is immune to this issue. As a result, the default vovserver configuration will not support REST v3 or the Accelerator dashboard UI page. To enable REST v3 and the dashborad UI page, a new web server option may be specified by the -webprovider internal option on the ncmgr start or vovproject start commands.
Accelerator VOV-13167 None Memory reporting within the tools has changed on Linux to reflect what's reported by the Linux kernel in VmRSS rather than VmSize. The vovserver memory usage information from commands such as vsi, vovselect memorystats from server as well as the memory web page at "http://host:port/server?page=memory" will also report consistent memory use totals based on VmRSS. Also, vovselect memorystats from server and the memory statistics webpage have been enhanced to account for more of the "chunk" based memory pool allocations used within vovserver.
Accelerator VOV-13531 None A minor appearance improvement to the web dashboard UI page was made within the Capacity sub-window.
Accelerator VOV-13326 None Changed the VOV_DISABLE_SHARED_MEMORY_LOOKUP behavior to return the RSS.
Accelerator VOV-13769 None When using the internal REST server (as opposed to the nginx server), the vendor library used to implement it was changed from cpprestsdk to Oat++. The back end no longer allocates a static pool of worker threads to service requests, controlled by the variable http.workerthreads. That parameter is now ignored. The Oat++ backend creates a new thread to service each request and terminates the thread after sending the response to the HTTP/REST client.
Accelerator VOV-13638 CS0191754 The RESD(typeList) parameter in the vovreconciled/config.tcl file can be used to modify the license types handled by vovreconciled. The parameter value is a list of names, by default the value is {License}. The following types are not supported and will be ignored if present: Limit, Policy, User, Group and Priority. The type of License will be added if not specified.
Accelerator VOV-13549 None Fixed script execution issue with message.cgi where user did not have proper permissions to modify the underlying file, UI will now display message indicating the issue and properly disable the submit button.
Accelerator VOV-13561 None Internal bug which may have caused some set statistics to not be reported accurately has been addressed in this release.
Accelerator VOV-11780 AAP24453 This release contains OpenSSL 1.1.1j which does not exhibit the warning message of the previous packaged version OpenSSl 1.0.2q.
Accelerator VOV-13720 None The header section stays sticky at the top and always be visible to the user.
Accelerator VOV-13672 None In the Scheduler Vital Signs widget, the values under the horizontal bars are now in sync with the values shown in tooltips, respectively (tooltips appear on hover over the horizontal bars).
Accelerator VOV-13791 None The REST HTTP server has a new threading model that no longer uses a pool of worker threads to service client connections. It now creates a new thread for each connection and that thread terminates after transmitting a response to the client. The vov variable http.workerthreads is deprecated. It is visible but not changeable.
Accelerator VOV-13739 None The Set Browser link has been modified to point to the classic UI's Set Browser page.
Accelerator VOV-13816 CS0205113 Address issue where license resources sometimes became unavailable when on life support.
Accelerator VOV-14110 AAP24923 Fixed bug where TaskerClass.table based resources did not display properly in web UIs extra resources column.
Accelerator VOV-10345 24403, 24469, 24648 The following system taskers no longer consume a license: vovdbd, WXLauncher & maintainer.
Accelerator VOV-12107 CS0120865 Added cleanup of unknown process IDs which also fixes the flooding of tasker logs with the following error messages: "Must kill late child Pid...", "rakeChildren: Child process...", "does not exist anymore: assuming it is done..."
Accelerator VOV-13510 CS0182762 Fixed issue causing license checkout with empty shared (ISV) string which is resulting in duplicate license checkout.
Accelerator VOV-13880 None In the Schedular vital signs widget, the values shown in the tooltip of the buckets bullet graph will be in-sync with the number jobs submitted.
Accelerator VOV-13860 CS0208413, CS0208823 Fixed issue that caused the tasker to overload vovserver with messages when a job execution attempt failed due to not being able to successfully fork out the subtasker process that is used to shepherd the job.
Accelerator VOV-9031 23103 A description on how to set up a tasker in Windows has been added to the online help.
Accelerator VOV-13861 CS0210064 Fixed issue in which SIGALRM interrupted communications on interactive jobs using VOV_INTERACTIVE_PING keep alive method
Accelerator VOV-13771 None When making job related REST API calls, helpful error information is included in the REST response. If using the vov_rest_v3 Python API wrapper, the content of the error will be thrown inside VovRestException.
Accelerator VOV-13849 CS0208895, CS0218919 Fixed bug where interactive (-I/-Ir) root privileged container jobs potentially resulted in a process group SIGINT being captured and accidentally being sent to systemd, following which bad things may happen, such as a system reboot on subtasker host.
Accelerator VOV-14116 None Fix a problem with changing the vovserver's webserver from "nginx" to "internal" using the vovservermgr config webprovider internal command. It was not possible to make this change without restarting vovserver, but with this fix, the transition from nginx to internal web server can be accomplished by a 3 step process: 1) shut down nginx with this command: vovdaemonmgr stop vovnginxd ; 2) delay 5 seconds with sleep 5; 3) start the internal web server with vovservermgr config webprovider internal.
Accelerator Plus VOV-13956 None Fixed race condition with Accelerator Plus that caused jobs to fail due to placement on taskers reserved for different buckets.
Accelerator Plus VOV-13785 CS0142115 Fixed issue that prevented jobs using a jobclass with VOV_JOB_DESC(interactive,useXdisplay) from succuessfully launching agents when run via Altair Accelerator.
Accelerator Plus VOV-13872 CS0211355 Fixed issue where SICK status Accelerator taskers were not removed after an appropriate amount of time. The underlying cause was that there were still related jobs running in the base queue, and was repaired by passing the -forcerunning option to the NC base queue forget command for taskers with a SICK status.
Accelerator Plus VOV-14020 None Fixed issue with Accelerator Plus in DirectDrive mode that prevented jobs from running when added to a bucket that was empty during a vovwxd daemon restart.
Accelerator Plus VOV-13731 None Fixed issue which prevented Accelerator Plus in Direct Drive mode from launching taskers for preexisting jobs when a base queue is restarted and no further jobs are incoming in the Accelerator Plus queue.
Accelerator Plus VOV-13917 None Improved logging to identify when Altair Accelerator Direct Drive feature is in use. On vovwxd startup, the vovwxd.log will contain: Initializing vovwxd with Direct Drive... ... vovwxd with Direct Drive initialization successful. During operation, the vovwxd.log will also contain the thread identifier "APPluginAccel" when running in Direct Drive mode.
Accelerator Plus VOV-13890 None Fixed issue which prevented WX taskers from reconnecting after server freeze/failover with fastexit enabled.
Accelerator Plus VOV-13650 None vovserver has a new config parameter tasker.authorization.delay that specifies the time in seconds that the server wait before authorizing new taskers.This parameter can be useful for WX, where it can reduce the latency of dispatching a job to a newly requested tasker.
FlowTracer VOV-13749 None Fixed issue in vov_lsf_agent that prevented it from launching a tasker.
Monitor VOV-8715 22305 Send warning about nonexistent user to stderr instead of stdout when ftlm_batch_report with user filter is requested
Monitor VOV-13464 CS0174914 Added ADJUST_CAPACITY periodic maintenance task (Admin > System > Configuration Information).
Monitor VOV-13633 CS0188846 Logs containing, "Queued Programs" were not being recorded as queued requests properly for ftlm_parse_lstc.
Monitor VOV-13712 CS0195003 A denial plot showed incorrect data when the data binning size was 30s and the time span was several months.
Monitor VOV-13927 None Fixed issue that prevented ControlCenter jobs from executing on hosts that have an upper/mixed-case name.
Monitor VOV-13929 None Fixed issue that prevented the process monitoring facility from recording incoming running processes as checkouts.
Monitor VOV-7800 21302, 23839 Fixed issue that caused the default bin interval for the denial plot to default to 30s instead of a dynamically calculated optimum value for the report time range. This caused the denials to be binned incorrectly for reports with time ranges that would result in more than 100k bins unless an explicit interval was specified in the report options. Protections were also added to prevent the acceptance of a bin interval that is too short for the report time range (any value that would result in more than 100k bins).
Monitor VOV-9665 23904 Fixed issue that prevented the ftlm_capacity load operation from finding the data files necessary for loading feature capacity information into the database.
None VOV-10568 CS0120645 HyperThreading fieldname 'HT' on taskers previously reported incorrectly if HT was available. The field now properly reports if SMT threading (either Intel HyperThreading or AMD SMT) is currently available and enabled by checking value of either /sys/devices/system/cpu/smt/control or /sys/devices/system/cpu/cpu0/topology/thread_siblings_list (or equivalent). directly.
All VOV-13611 CS0120932 Resolved issue with parsing and filtering via selection rules values that may contain uint64 fields compared with hardcoded integer values. Additionally, users may now explicitly declare uint64 hardcoded values such as vovselect name,totalspace,freespace -from filesystems -where 'totalspace>1U AND freespace>1U'
Accelerator VOV-13689 None The Running-jobs axis scale numbers have been placed with an even space.
Accelerator VOV-14209 CS0236878 Fixed an issue with temporary loss of key file license registration by vovserver when the enterpriselicense configuration parameter was not explicitly set in policy.tcl. In these cases a vovproject sanity resulted in a temporary switch to RLM licensing. Some specific low level changes made were as follows:
  1. Check for the presence of a license key file at server startup so that the default enterprise license mode will be "Off" if a key file is found and the policy.tcl file does not have an explicit setting for the mode.
  2. Prevent the above check upon every request to check licensing so that the mode doesn't switch unless manually requested.
  3. Allow for switching the mode out of "Off".
  4. Restore the mode control and license request sections in license.cgi when the mode is "Off".
  5. Rename the mode selection in the license.cgi drop-down menu to "Off (use license key file)" so that the actual value is represented along with the intent.
Accelerator, Monitor VOV-14180   Fixed a bug that arose when SSL certificate files were added by the admin with file names correctly derived from the fully qualified host name from VOV_HOST_HTTP_NAME. The Accelerator products would not initialize the webport and the web UI URL properly when the "internal" webprovider was activated.
Accelerator VOV-14220 None Fixed issue that could result in a leak of a file descriptor in the tasker for an interactive job that has ended.
Accelerator VOV-14221 CS0221756, CS0238663 Fixed issue that could result in a leak of a file descriptor in the tasker for an interactive job that has ended. The resulting build up of old file descriptors was making the tasker (vovtaskerroot process) go into a "sick" state.

2021.2.0-p4 Release Notes

Resolved Issues
Internal Number Products Case Number Description
VOV-15310 Accelerator Plus CSO355127 Fixed a crash where an ambiguous "vovquery select id from 9" results in an object lookup that vovquery doesn't support.

2021.2.0-p3 Release Notes

Resolved Issues
Internal Number Products Case Number Description
VOV-15245 Accelerator, Accelerator Plus CS0325511 Updated nc wait to filter event by jobid when waiting on 10 or less jobs.

2021.2.0-p2 Release Notes

Resolved Issues
Internal Number Products Case Number Description
VOV-14362 Accelerator, Accelerator Plus, Monitor CS0248090 Fixes a bug where versions of LM prior to 2021.1.0 could not communicate properly with 2021.1.0 and later versions of NC, and versions of NC prior to 2021.1.0 could not communicate properly with 2021.1.0 and later versions of LM.

2021.2.0-p1 Release Notes

Resolved Issues
Internal Number Products Case Number Description
VOV-13910 Accelerator Plus CS0213212, CS0303516 If a job is dispatched to a tasker that is in the process of exiting, the job will be refused by the tasker and automatically rescheduled for execution up to the maximum number of times allowed by the autoRescheduleCount server configuration parameter.

2021.2.1 Release Notes

New Features

Internal Number Products Case Number Description
VOV-13249 All CS0120865 A configuration parameter "logs,tasker,compress" (a boolean that defaults to 0) was added that, if set, will cause the daily logs to be compressed at the end of the day. This is similar to the existing "logs,server,compress" parameter. This parameter should be set in the policy.tcl configuration file.
VOV-14345 Accelerator Plus None Direct drive now supports vovwxd/config.tcl parameters slave,max and client,derate by pausing base queues when limits are reached. Note that due to the asynchronous nature of Direct Drive, these are soft limits and some overrun is expected.
VOV-14171 Accelerator Plus None vovwxd in direct drive mode can now use a driver script for bucket filtering. vovaccel.tcl is an example script and it can be specified in CONFIG(drive_script).

Resolved Issues

Internal Number Products Case Number Description
VOV-14253 All CS0145466, CS0240541 Fixes an issue where in some circumstances, vtk_select_loop would cause a crash in vovsh when requesting large result sets.
VOV-14279 Accelerator, Accelerator Plus CS0241809 Fixed issue causing error 'child process exited abnormally' for forgotten jobs.
VOV-14287 Accelerator None Corrected the script name that is called by the vovclientmgr.bat wrapper script.
VOV-14286 Accelerator None Fixed vovlicensemgr errors with ALM licensing.
VOV-14218 Accelerator None Fixed a crash that occurred when the Internal webprovider's SSL configuration and/or webport was changed.
VOV-13510 Accelerator CS0182762 Fixed issue causing license checkout with empty shared (ISV) string which is resulting in duplicate license checkout.
VOV-13732 Accelerator CS0134814 Fixed an issue with vovclientmgr closedeadinteractive where an error was thrown if jobs were forgotten while the command was running. Also added a -dry-run option to the above command.
VOV-14118 Accelerator CS0230408 An issue was fixed where vovgetgroups would fail if VOV_USE_VOVGETGROUPS was set to 1, and the user in question belonged to more than 128 groups.
VOV-14262 Accelerator None Some compatibility issues were found between the openssl library we use and the one provided on CentOS 8 that negatively affects our ability to validate user information on CentOS 8 when NIS is configured. The compatibility issue has been addressed.
VOV-14302 FlowTracer None Fixed a bug causing a TCL error in vovlsfd daemon when, for any reason, it cannot parse the output of bsub command.
VOV-14068 Monitor CS0219474 Fixed incorrect summary calculation in Monitor tables.
VOV-13693 Monitor CS0183043 Fixed license parser issue causing the error - Illegal number of args: vtk_feature_get_or_create daemon name total version isv

2021.2.1-p3 Release Notes

Resolved Issues
Internal Number Products Case Number Description
VOV-14822 Accelerator, Accelerator Plus CS0309335, CS0314259 Fixed issue causing nc wait -dir to wait for jobs in other directories.
VOV-15106 Accelerator Plus CS0325511 Fixed issue causing nc wait to exit with error "Failed subcommand wait: Illegal object id".
VOV-15148 Accelerator, Accelerator Plus   Fixed issue causing nc wait to exit with error "can't read "jInfo(exit)": no such element in array"
VOV-15245 Accelerator, Accelerator Plus CS0325511 Updated nc wait to filter event by jobid when waiting on 10 or less jobs.
VOV-15310 Accelerator Plus CS0355127 Fixed a crash where an ambigious vovquery select id from 9 results in an object lookup that vovquery doesn't support.

2021.2.1-p2 Release Notes

Resolved Issues
Internal Number Products Case Number Description
VOV-14362 Accelerator, Accelerator Plus, Monitor CS0248090 Fixes a bug where versions of LM prior to 2021.1.0 could not communicate properly with 2021.1.0 and later versions of NC, and versions of NC prior to 2021.1.0 could not communicate properly with 2021.1.0 and later versions of LM.

2021.2.1-p1 Release Notes

New Features
Internal Number Products Case Number Description
VOV-14477 All CS0257852 Taskers running as non-root will no longer get sent jobs unless the job's user matches the non-root tasker's userid. This is to address a situation where a job running on a non-root Tasker gets access to the user's data on the filesystem. This can be disabled by setting allowForeignJobsOnUserTaskers to 1.
VOV-14419 All None Multiphase support is provided by two additional command arguments to nc run: -multiphase [1|0] and -mpres "resource string"

-multiphase 1 enables multiphase jobs.

-mpres sets the resources that will be used for each phase. The '%' is used as a delimeter for the resources of each phase, for example, -mpres "linux64 foo%linux64 bar:linux64 baz.

By specifying the resources of each phase and designating that certain resources are only allocated to certain taskers, you can run different phases of a job on different taskers.

For example, I have two taskers named tasker1 and tasker2. I want to run phase 1 and 3 on tasker1, and phase 2 on tasker2. My resources may look like:
 vtk_resourcemap_set License:blue UNLIMITED License:blue_tasker1 
vtk_resourcemap_set License:red UNLIMITED License:red_tasker2 
vtk_resourcemap_set License:blue_tasker1 1 tasker1 
vtk_resourcemap_set License:red_tasker2 1 tasker2 
I could then run a multiphase job as:
nc run -multiphase 1 -mpres "linux64 License:blue%linux64 License:red%linux64 License:blue" -- -e BASE -D /home/jjmcwill/testDir/testMultistage.sh
A multiphase job will have two new Job Properties set:
  • MPRESOURCES: Contains the same resources passed in -mpres, and is used to reset the job resources for each phase.
  • MPCURRENTPHASE: Contains an integer indicating the current job phase. It starts at one, and has a max value of 9.
The running job script will see an environment variable named VOV_JOB_PHASE which is set to the current phase. The script writer will need to use that to decide what work to do for that phase.

If the script exits with an exit code of 216, nc will increment the job phase, change the job resources, and reschedule the job to run again. If the script exits with an exit code of 0, the job is considered "Done", and MPCURRENTPHASE is reset to 1.

Failed jobs:

If a job fails during a phase with a code other than 0 or 216, it is considered FAILED and MPCURRENTPHASE will not increment. If the job is invalided and re-run (for example, nc rerun -f JOBID), the job will re-run starting at MPCURRENTPHASE and further phases will run if the job exits with code 216, as described above.

Logging:

After the first phase is run, subsequent phases of the job will have the command rewritten so that the wrappers are passed "-a -A", telling the wrappers to append to the job log. This is so that all phases of the job get their stdout and stderr logged to the same file. If this was not done, each phase of the job would overwrite the log, and the user would only see the output from the last phase that was run. If nc does not detect one of the standard vov wrappers at the beginning of the command line, it will assume the command is not using a wrapper. In this case, it will look for the standard ">" redirect symbol in the command and replace it with ">>".

REST Support:

In the payload for submitting a job via rest, two new fields are allowed: multiphase and mpres. Setting multiphase = True enables multiphase job support. Setting the mpres field behaves the same as described for the command line argument described above. Re-running a multiphase job that has failed via the REST re-run API will behave similarly to rerunnign a failed multiphase job from the command line as described above.

Resolved Issues
Internal Number Products Case Number Description
VOV-14465 All CS0257852 Taskers running as non-root will no longer get sent jobs unless the job's user matches the non-root tasker's userid. This is to address a situation where a job running on a non-root tasker gets access to the user's data on the filesystem. This can be disabled by setting allowForeignJobsOnUserTaskers to 1.
VOV-14217 Accelerator Plus CS0237217 An issue prevented dp jobs from successfully being run via Accelerator Plus has been resolved.
VOV-13910 Accelerator Plus CS0213212 If a job is dispatched to a tasker that is in the process of exiting, the job will be refused by the tasker and automatically rescheduled for execution up to the maximum number of times allowed by the autoRescheduleCount server configuration parameter.

2021.1.0 Release Notes

New Features and Enhancements

The following new features and enhancements were introduced this software release:

Table 1.
Product Issue Number Case Number Description
All VOV-9454 23741 Most of the VOV Tcl files from the installation package now contain the proper Altair copyright statement and version number.
All VOV-12801   Introduce the new "tasker" lexicon for Accelerator product environment variables. Environment variable names containing old term "SLAVE" will get new names containing "TASKER" in the place of "SLAVE". The old environment variable name will be honored unless the new name is being used. This compatibility measure will ease the transition for administrators.
All VOV-12797   New Tcl VTK function names are added to move to the new "tasker" lexicon. Old VTK function names containing the string "slave" are deprecated, and new VTK function names containing the string "tasker" are added and should be transitioned to from this release forward.
All VOV-9297   Support for SuSE Linux Enterprise Server (SLES) 15 has been added to the Altair Accelerator products.
All VOV-12111   Support for CentOS and RHEL 8 has been added to the Altair Accelerator products.
All VOV-13321   Support for Ubuntu 14.04 has been dropped.
All VOV-13306   This release discontinues support for the SLES 11 operating system.
All VOV-13127   All references to the term "slave" have been replaced with the new term "tasker" throughout the online help documentation.
Accelerator VOV-12708   A REST API guide and tutorial document is added to the Accelerator documentation bookshelf reader.
Accelerator VOV-12537   SlaveLists are deprecated and replaced with TaskerLists with the following additional functionality:
  • The Taskerlists now contain the tasker names instead of only NC-internal IDs. As a result, the taskerlist does not have te be recreated when a tasker is stopped and then restarted, as had been the case with slavelists in previous releases.
  • The TaskerLists definition will be stored in the PR so they will survive vovserver stop-and-restart sequences.
  • Added support for resource 'TaskerList:XXX' in nc run
  • SlaveList:XXX' resources will be automatically replaced by 'TaskerList:XXX' resources.
Accelerator VOV-12458   Implemented "Dialpad" or "Waffle" menu for mobile screens and at high zoom levels on the dashboard UI page.
Accelerator VOV-12743   Implemented Subsets table in Set Detailed View page.
Accelerator VOV-12747   An actions dropdown menu has been added, enabling the actions (delete, run with priority) to be performed on selected jobs. Added a search bar, allowing the user to filter the jobs by entering search strings.
Accelerator, Accelerator Plus VOV-12150   The documentation shown by "ncmgr start -h" is expanded to explain some additional features that require an Accelerator queue to have the webport enabled. In the 2020.1.0 release, the following new features require the webport: 1) REST v3 API and 2) the new administrator dashboard UI page.
Accelerator VOV-12964   Fixed CSS issues in bulk actions drop down.
Accelerator VOV-12812   Enabled client activity logging for nc cmd commands.
Accelerator VOV-12744   Added a Details section for the selected set in the Set Detailed View page.
Accelerator VOV-12739   Added UI functionality in the Dashboard > Set Browser screen. Users can now perform actions like retrace, and delete on sets, and also filter the displayed list of sets by text string.
Accelerator VOV-12460   Added NC queue color to the dashboard user interface.
Accelerator VOV-12762   Implemented storybooks for the Table component user interface.
Accelerator VOV-9778 23068, 23767, 23914, 24923 Irrelevant alerts are no longer generated. Addressed some implementation issues with vtk_flexlm_exclude_tags. Note that calls to vtk_flexlm_exclude_tags are cumulative and override any tags added with vtk_flexlm_monitor and vtk_flexlm_monitor_all. The -noooq parameter for vtk_flexlm_monitor has no impact at present, please use vovresSetFlags instead. The -order parameter to vtk_flexlm_monitor and vtk_flexlm_monitor_all only orders any specified tags, it no longer adds tags (use -tags to add tags). The optional parameters vovResource, vovMap to vtk_flexlm_monitor are now handled correctly.
Accelerator VOV-12277   A new command option for nc run has been added called -dpinitialport N which allows the user to specify the starting port that partialTool will use to find an open port to communicate among the subtasks in the cohort. This will be reflected in a new job property named DP_INITIAL_PORT that can be observed being set on the job.
Accelerator VOV-12736   The React dashboard has now implemented a Sets List view.
Accelerator VOV-12742   Implemented Jobs table in Set Detailed View page.
Accelerator VOV-13292   Two new server configuration parameters have been added: http.workerthreads and http.proxytimeout. http.workerthreads specifies the number of worker threads that the new REST HTTP server will start when vovserver starts with a valid webport. http.proxytimeout enables you to specify the timeout in seconds, used when the main webserver forwards some requests, like CGI pages, to the older http server listening on the VOV port.
Accelerator VOV-11930 CS0120821 A new command line parameter was added for nc run for dp jobs called -nocohortwait. This instructs partialTool for each cohort task to finish when its subtask process has finished rather than wait for the primary job to complete (which is the default behavior). Passing -nochortwait to nc run sets a new property named DP_COHORTWAIT to 0. By default, this is set to 1 when -nocohortwait is not passed, and partialTool will behave like it always has.
Accelerator, FlowTracer, Monitor VOV-10198   Add support for Windows Server 2019.
Accelerator VOV-12947   Added breadcrumb navigation to the Sets page, through which the user can navigate to the hierarchical sets.
Accelerator VOV-12452   Add a sub-window for scheduler health monitoring and vital signs in the Accelerator admin dashboard web UI.
Accelerator VOV-12733   Support is added for Accelerator on ARM64 systems running Centos 7, Centos 8, or Amazon Linux 2. This support is for execution hosts and submit hosts only. The "armv8" hardware resource name is added for this architecture.
Accelerator Plus VOV-12295 CS0121114
  1. Fixed an issue where the NUMA_AFFINITY property on jobs always showed it as being bound to specific CPUs, even though it was bound to the entire NUMA node.
  2. Fixed an issue where WX vovtaskers were overriding the affinity set by the NC vovtasker, causing WX jobs to be bound to only a subset of the available CPUs.
FlowTracer VOV-12556   Add support for FlowTracer on Windows.
Hero VOV-12891   Added a -P <NAME=VALUE> parameter to the hero submit command (similar to the NC -P parameter).
Hero VOV-12932   Added the -modules, -stagein, -stageout parameters to the hero -zebu submit command. The DeclareEmulator specification now includes the following parameters: -type (for future use), -environment, z-ebu_system_dir, -zebu_root. The commands specified in the -stagein, -stageout parameters depend on the resources Limit:zebu_stagein_load, Limit:zebu_stageout_load.
Monitor VOV-5671   The vtk_feature_add_or_create API now expects an additional parameter for the associated ISV string. See the documentation for the new syntax. The vtk_featureuser_* APIs have had their names changed to vtk_checkout_*. The old vtk_featureuser_* names are still supported but vtk_checkout_* will be the official documented names.
Accelerator VOV-6572   This should be fixed as a side effect of implementing the mutator API. See release notes for VOV-8899.
All VOV-9298   Support for Ubuntu 18.04 and 20.04 is added.
All VOV-13364   Starting with the 2021.1.0 release, the Accelerator Products images come with digitally signed certificates that can be used to reliably confirm authenticity of the installation media images.

Resolved Issues

The following issues were resolved in this release.

Table 2.
Product Issue Number Case Number Description
All VOV-13252   Web server improvements (when the web port is configured as non-zero):
  1. Added the following headers to all HTTP responses:
    • Strict-Transport-Security
    • X-Frame-Options
    • X-Xss-Protection
    • X-Content-Type-Options
    • X-Content-Security-Policy
    • Content-Security-Policy
  2. Added support for web ports higher than 32000.
All VOV-13181   Some stability improvements were made in the Webserver code to avoid potential crashes of vovserver.
All VOV-12989 CS0145649 Fixed an issue where stopping more than 1 vovtasker by name (vovtaskermgr stop <tasker1> <tasker2>...) was renaming only the last named tasker to <taskername>_stopped_<timestamp>.
All VOV-9560 23740 Fixed an issue with vovnotifyd using only the first RAM value for jobs with multiple RAM requests (e.g.: -r+ RAM/100 -r+ RAM/200) to determine if the job is exceeding requested RAM usage (health check of requested RAM).
Accelerator VOV-11388 25153 Fixed an issue with the vovserver failing to start when epoll is enabled (set config(useepoll) 1) in policy.tcl.
All VOV-12582   The vovtaskermgr start command will now only utilize the configured rshcmd (one of: inetd/rsh/ssh/vovtsd) for starting remote taskers. Prior to this change, the inetd method was always attempted, and the vovtsd method would be attempted if the configured vovtsd port was non-zero.
All VOV-12512   All references to PBS Works support have been updated to direct the user to the new Altair One website.
All VOV-6287 20738 Fixed the issue in the error message "too many elements in array" where the max array was not getting updated as per the config(maxJobArray).
All VOV-13247   Network security testing port scans had in certain cases caused vovserver to hang up in an infinite loop.
Accelerator, Accelerator Plus, Monitor VOV-13009 CS0133888 In the help information displayed by nc cmd vovdaemonmgr -h, a note was added indicating that the -f (force) option applies only to the start subcommand, and only when a daemon list is specified.
Accelerator VOV-12418 CS0121215 By default, interactive jobs will also write to a logfile just like normal jobs do. You can also specify the log file with the -l parameter to nc run like normal jobs. If you do not want an interactive job to write to the log file, use -nolog as a parameter to nc run
Accelerator VOV-12135   The axis labels in the jobs histogram in the dashboard UI page showed repeated "1" labels with a small number of running or queueed jobs.
Accelerator VOV-13000 CS0128274 Fixed an issue where incrementing grabbed resources was not incrementing the count of used resources in some cases.
Accelerator VOV-13039 CS0146315 The network data sent as a result of nc info was made more compact, which will make running the command more efficient.
Accelerator VOV-13113   Fixed CSS issues in Set Browser page.
Accelerator VOV-13026 CS0149221 Handle window/weight inheritance for new FairShare groups that are being created during job submission. The window will be inherited from the parent. Both the window and weight will be inherited from a sibling group named "default".
Accelerator VOV-12714   Fixed the following issues with job container support:
  1. The VOV_CONTAINER_<resource> (where <resource> is CORES, RAM, or TMP) environment variable was not being set in the hook environments for containers that were configured to use the job's resource specification as container limits.
  2. The VOV_CONTAINER_JOB_RUNDIR environment variable was not being set in the hook environments if the job's run directory had been configured as an equivalence, as would be the case for jobs submitted from within the user's home directory.
Accelerator VOV-13108   As per the new API response, changes made to the footer version text.
Accelerator VOV-13272   Increased virtual memory limit for nc run.
Accelerator, FlowTracer VOV-12908   The Accelerator new dashboard UI for administrators, when accessed, will increase the vovserver memory "Size" metric printed by the vsi command. The large reported memory size is virtual memory address space size, with only modest associated increase in actual memory usage. The number of worker threads used by the web server can be controlled with config(http.workerthreads) N in policy.tcl. Changing this value will require a vovserver restart, because it can only be set once before the multithreaded webserver is initialized. Also, a timeout value for when the multi-threaded webserver has to delegate some requests, such as CGI pages, to the old vovserver web server, can be configured by setting config(http.proxytimeout). This value can be changed at any time.
Accelerator VOV-12977 CS0143428 Fixed an issue with interactive jobs (nc run -I) failing with the error message "Job has problems with PTY. Bad pipes".
Accelerator VOV-12811 CS0129987 vtk_resourcemap_set now requires that the user either own the resource, or the user have ADMIN security rights for it to take effect.
Accelerator VOV-10921 24781, 24803 To better clarify jobs that have been queued due to reserved taskers, additional information has been added to the output of the nc why command. Under the "Per-slave/per-tasker analysis" section, a count of taskers that would have been compatible but are reserved will be shown as: 'n is currently reserved by others'
Accelerator VOV-5980 21105 The nc modify command has been modified to exit with a status of 1 if any part of the modification request fails.
Accelerator VOV-12892   With certain types of product install methods, the Accelerator documentation bookshelf link in the Web UI had not been functional. This only impacted customers who download and un-tar both common.tar and win64.tar with the intent of installing both linux64 and win64 into the same master installation directory. The workaround was to un-tar and install win64 first, then go back and un-tar common.tar and linux64.tar, and then install linux64 only. If you had an existing installation the workaround was to un-tar common.tar and reinstall linux64 only.
Accelerator VOV-12547   Fixed an issue with the -Il option for interactive jobs that prevented the user from typing in the terminal window and interacting with the job.
Accelerator VOV-11452   Added -orphanreservations option to the vovforget command for forgetting the reservations not attached to any tasker. Behaviour is modified to allow overlapping reservation in the system, but it will never be in effect unless the dominant reservation is deleted. Fixed an issue where the tasker reservation gets duplicated after server restart. Also, changed the tasker instance reservation (created using vtk_tasker_define (-reserve option) or by passing the -e option to vovtasker) to be non-persistent by default. No change in behavior for tasker reservations created using vtk_reservation_create.
Accelerator VOV-11662 29869 Fixed "no such variable 'killTimePP'" alerts when health checks are enabled for stuck jobs with -stuckKillTime.
Accelerator VOV-9254 23430 Fixed issue that prevented child FairShare groups from being displayed when viewing the top-level group via vovfsgroup show.
Accelerator VOV-11261   Addressed issue where delays were encountered due to vovserver not being immediately notified of an update.
Accelerator VOV-7490 20070, 24363 Fixed an issue leading to "URGENT vovnotifyd Cannot send mail. can't read "code": no such variable" alerts. Reduced the severity to WARN, in case of failures to send mail. Also, added an alert if the list of recipients for notification emails is empty.
Accelerator VOV-12464 CS0122942 Requests for CGROUP:RAM with more than 1 RAM specification will now limit RAM usage to the total amount requested by all RAM specifications rather than the last one.  For example the command: nc run -r CGROUP:RAM RAM/60 RAM/40 -- sleep 0 will limit ram usage to 100 megabytes rather than 40.
Accelerator, Accelerator Plus VOV-13030 CS0149277 Fixed an issue with Ctrl-C not working as expected with interactive jobs (nc run -I/-Il/-Ir).
Accelerator, Accelerator Plus, FlowTracer VOV-4998 CS0143832 For all products, strict job name checking has been enabled and invalid job name characters will cause an error. For Accelerator and Accelerator Plus, this can be overridden by putting the following in $VOVDIR/local/vncrun.config.tcl
set ::jobname_lexicon legacy
or
set ::jobname_lexicon replace
Legacy will use the more lax job naming rules from earlier releases. Replace will identify invalid characters in the job name, replace them with "_", and issue a warning to the console An issue with vsm being enabled to handle some invalid job name characters was addressed.
Accelerator, Accelerator Plus VOV-13051 CS0121039 Fixed an issue with interactive jobs (nc run -I) failing with error messages similar to "Error=98: Address already in use [vovttyserver2:244]" and "FATAL ERROR: Cannot open PTY port (with remote signal handling): Cannot open pty server sockets [vncrun.tcl:2257]". This is accompanied by job errors similar to "Cannot connect to PTY server on submission host lava1 13316 Z@:x=XGa56cT_Hd6 from lava5".
Accelerator VOV-12519   For consistency across CLI and web UI, the default values for the following VovPremptRule options have changed in some cases from previous versions.
  • The option -resumedelay default is now 5 seconds previously the CLI default was 10 seconds.
  • The default pool name is now mainpool, previously the web UI default was MainPool.
  • The default for -mqthresh is now 0.1=10%, before the web UI default was 0.01.
  • The -killage option default is now 60, previous version of the web UI had a default of 0.
  • For -skipresumedjob option default is 120, previous versions of the web UI had a default of 0.
  • For -preemptslavenum new default will always be 1, some variants of CLI had a default value of 2 previously.
Accelerator VOV-10558 23924 Empty job class sets are not deleted, thereby preserving all properties for future submissions.
Accelerator VOV-12807   When you hover the mouse over the job graph line in the dashboard UI window, a small pop-up displays the Y unit, the Y number, and a time. The Y number is actually an average over a surrounding time window, and not an instantaneous value as implied by the information shown.
Accelerator VOV-13169   When vovserver was configured with webport and failover was configured, it was found that vovserver could lose access to the webport and get restarted with the webport disabled. This has been fixed. Also, if Accelerator was configured to use License Monitor, it was observed that failover could leave an extra copy of the voveventmon process launched by vovresourced running every time vovserver crashes and restarted. vovresourced has been modified to properly shut down voveventmon in the case where vovserver has crashed.
Accelerator VOV-13036   Fixed a server crash caused by memory corruption when running queries from a 2016.09 client.
Allocator VOV-10997 CS0156459 Suppressed the log, "Could not add FTResJob...", as it is not impacting the functionality.
Allocator VOV-12724 CS0130000 Fixed issues with the allocation of resource groups in Allocator. Prior implementations based the allocation on demand for the component resources only. The new implementation bases the allocation on demand for the resource group and all of its component resources.
Allocator VOV-11883 CS0120726 Fixed issue when the same feature serviced by different daemons with different tags which makes -ExcludeTagRx ignored.
Allocator VOV-11894 CS0120781 Fixed CSV export of the Home > Overview and Home > Resources > Matches tabular reports.
Allocator VOV-11224 24981 Fixed an issue with Allocator showing incorrect "Distributable" values when 'SetReserverForUser' is used with * (all users).
Allocator VOV-13025 CS0145466 Added a config key MQ(pjProbeKillTimeout) for the maximum time that the vovlad daemon should wait for existing probes to be killed at startup.
FlowTracer VOV-7956 21595 Addressed issue that prevented alert text from being displayed in the vovconsole alerts window.
FlowTracer VOV-12960 CS0143848 Fixed issue with keyword substitution for array job submissions that caused arbitrary matches to the array reference job's ID and IDINT values to be substituted with the ID and IDINT values from the individual array jobs in the job metadata. For example, an array job submission of "echo 000001070" where the reference job was coincidentally job 1070 would result in the command being changed in each individual array job to reflect its own ID, such as "echo 000001072" for job 1, "echo 000001074" for job 2, and so on.
FlowTracer VOV-12815 CS0137268 Evaluation of resources when used with an indirect tasker (taskerVNC) now applies the jobclass followed by the resource list which is the opposite of what was done previously.
FlowTracer VOV-12203 CS0120999 Re-evaluation of a job class to compute the union of resources when used with an indirect tasker(taskerVNC) is no longer done. This is typically relevant for FlowTracer integration with either Accelerator (NC) or Accelerator Plus (WX). To restore the old behavior, please contact Altair support.
FlowTracer VOV-12901   Fixed an issue where the user may see PIPELOG related errors in the console on Windows, when running a FlowTracer job directly from the command line, such as : vov cmd.exe /c echo "Hello"
FlowTracer VOV-12918 CS0142609 Improved the behavior of the Taskers > NC Interface: vovwxd vovconsole menu option, which configures and starts the vovwxd daemon. The daemon will be configured to use the default queue name (vnc) unless the NC_QUEUE environment variable is present.
FlowTracer VOV-10189   Schedule priority and execution priority are now saved in the persistent representation.
FlowTracer VOV-12813 CS0137660 systemjob state is now saved in the representation so that it's persistent across FlowTracer restarts.
Hero VOV-12931   Wrapper daemon now runs on the emulator vovtasker associated with the emulator. Previously it ran from wherever the autostart command was executed. Fixed an issue that prevented the command hero -zebu stop_all_wrappers from working correctly in some instances.
Monitor VOV-9774 24058, CS0121121 Fixed problem parsing MathLM licenses when one of HH MM SS time values starts with 08 and 09.
Monitor VOV-12253 CS0120851 Fixed MathLM parser for features with "Sub" and space prefix.
Monitor VOV-12634 CS0126701 Fixed Altair Monitor GUI to correctly show expiration date if one of the licenses expired.
Monitor VOV-12324 CS0121132, 21139 The output format for more recent versions of Sentinel RMS has changed. The new format caused ftlm_parse_sentinel to incorrectly calculate capacity. It now recognizes the new format and only counts instances of capacity appearing inside a feature block. Support for older formats has been retained.
Monitor VOV-9100 23225 Fixed LM report plotting with "Breakdown By Feature" option.

2021.1.1 Release Notes

New Features
Products Internal Number Case Number Description
Accelerator VOV-13210 None Added functionality to perform actions on sets in the Set Browser page.
Accelerator VOV-12810 None The dashboard UI job graph shows actual Y values when user hover the mouse over the graphed lines, and at the intersection points of the graph lines in job plot graph, the tooltip will show the Y-axis values which are intersected.
Accelerator VOV-12780 24397 The commands nc run, vovset resources, and nc modify -res support binary unit conversion for all memory based resources as a convenience from Petabytes (PB), Terabyte (TB), or Gigabyte (GB) to Megabytes (MB), which is still used internally and reported by all commands. The input conversion will accept either decimal or integer form and are all case-insensitive, so for example both nc run -r SWAP/1GB sleep 0, and nc run -r RAM/0.1Tb sleep 0 are supported. The currently supported parameter names for which this conversion is supported are RAM/, RAM#, RAMFREE#, RAMFREE/, RAMTOTAL#, RAMFREE/, SWAP/, SWAP#, SWAPFREE#, SWAPFREE/, SWAPTOTAL#, SWAPTOTAL/ and TMP# or TMP/. By default the unit is MB (Megabytes), where 1MB is 1<<20 bytes.
Accelerator Plus, FlowTracer VOV-12409 None Elastic taskers launched via vovwxd will detect and exit mote quickly when their designated bucket is empty or deleted. Accelerator Plus queues using Direct Drive will detect empty queues and stop launching taskers for those buckets more quickly. This functionality can be disabled by setting vovwxd.fastexit server parameter to 0. This is on by default.
Resolved Issues
Products Internal Number Case Number Description
All VOV-13517 None Fixed issue that prevented the Property Editor GUI utility in vovconsole from launching successfully.
All VOV-13183 None Fixed a potential memory leak when a client running a long-running query is unexpectedly terminated, causing vovserver to permanently mark a query as "in-use".
All VOV-7887 21377 Clarified documentation of VOV_LIMIT_vmemoryuse.
All VOV-13523 None Fixed an issue that caused vovshow -queries to fail when trying to access a non-existent array element.
All VOV-13568 None Fixed an issue where comma list resource maps were not selectable in the Web UI.
All VOV-13743 None Corrected format of output for vtk_time_psp that was returning hh:mm format when the given date/time was on a different day/year. Now returns hh:mm only for current day, otherwise month abbreviation and day (Apr 14) for the same year, else Year month abbreviation (Dec 2020).
All VOV-13706 None The vovtasker binary was missing from 2021.1.0. As a workaround, in previous product versions where it may be missing in the installation, copy vovslave to vovtasker.
All VOV-13610 None Fixed issue with displaying working taskers in Altair Accelerator 2021.1.0
All VOV-13351 CS0173855 Fixed some cases where the Server Working Directory (SWD) contained slaveClass.table or taskerClass.table, and these configuration files were ignored by taskers/slaves. Scenarios that had been broken were:
  • slaves.tcl and slaveClass.table present in SWD
  • slaves.tcl and taskers.tcl and taskerClass.table present in SWD
  • taskers.tcl and taskerClass.table present in SWD
All VOV-13439 CS0175205, CS0187053 Code related to vovps has been modified to be more robust in handling of non-fatal errors emanating from vovps command and to be more inline with typical ps command output.
All VOV-12897 CS0140687, CS170776 On some network configurations, a warning about IPV6 could be issued when running an INTERACTIVE job that could not be filtered out by turning down the verbosity. This has been fixed. The -v switch to nc run on an interactive job will turn off the warning.
All VOV-13014 None With multi-platform product installs, the Accelerator documentation bookshelf link in the Web UI was nonfunctional. This only impacted customers who download and un-tar both common.tar and win64.tar with the intent of installing both linux64 and win64 into the same master installation directory.
All VOV-13171 CS0164654 The timeout duration for PR saves can now be controlled via a server configuration parameter.
All VOV-13837 None Fixed issue that caused some vtx-wrapper links to point to an incorrect absolute path.
All VOV-13502 None This fix provides for mitigation of a hang in the http(s) service. It reinstates the nginx service found in earlier releases. The use of the patch is required for production systems using the https service - typically Accelerator. The patch prevents the use of the REST v3 API and the new web based Dashboard, which is dependent on the REST v3 API. A subsequent release will address this shortcoming. The use of nginx should be seen as temporary and a subsequent release is expected to provide integrated https within vovserver. To start nginx, pass -webprovider nginx to ncmgr start, lmmgr start, etc. You should see vovnginxd start as one of the vov daemons. vovservermgr config will also show the webprovider setting as being either 'internal' or 'nginx' depending on how you have configured the system.
All VOV-13416 None Added configuration section to the sds.cfg file to allow inclusion kafka producer configuration properties such as those needed to enable ssl communication.
Accelerator VOV-13162 None The "Match Jobs to Handles" HTML topic now reflects the code colors that coincide with the software.
Accelerator VOV-9353 23568 In the past, a stopped tasker and a newly started tasker were not aware of each other's NUMA usage, and so could assign CPU or Node affinity that overlaps. Taskers using NUMA on the same machine, with the same vovhost and queue name, will now share NUMA usage to avoid over allocating NUMA resources on the same machine.
Accelerator VOV-7487 20455, CS0120837 Passing bash functions through snapprop is a fragile operation that only works when the following conditions hold:
  1. The function must have been explicitly exported, for example export -f f.
  2. The bash and sh shells must be the same on the submission and execution sides (this holds on CentOS 6-8 but not on Ubuntu).
  3. The bash function encoding must be the same on the submission and execution sides (this depends on the bash version, CentOS 6,7 have the same encoding, but CentOS 8 has different encoding).
Note: Bash encodes functions in two ways (subsequent to the 2014 shellshock vunerability):
  1. BASH_FUNC_<function-name>()=() (CentOS 6,7)
  2. BASH_FUNC_<function-name>%%=() (CentOS 8, MacOS, Ubuntu, albeit the latter does not satisfy 2. above)
Accelerator VOV-7736 21176 Fixed building of resource maps from resources with OR and AND words in resource names.
Accelerator VOV-13629 CS0178114, CS0186671, CS0192772, CS0194045, CS0196466 Fixed an issue where a failed PTY connection for a job would cause subsequent jobs on the tasker to fail as long as the original job was still running, and in some cases, the tasker could become unresponsive.
Accelerator, Accelerator Plus VOV-13651 None Fixed issue in node.cgi which resulted in the CPU Time displayed for job being multiplied by 1000.
Accelerator, Accelerator Plus VOV-13152 CS0159375 Fixed spurious error message when receiving (RESMAP,CHANGE) events in some clients.
Accelerator, Accelerator Plus VOV-13293 CS0169911 Added check for ADMIN privilege which blocks the regular user from stopping the job using NC command nc stop -allusers if the requesting user is not ADMIN.
Accelerator VOV-13161 CS0163181 Fixed issue where setting a project or site message in /cgi/messages.cgi would not result in a message being registered.
Accelerator VOV-13324 None Tasker based support has been added for the following vovselect fieldnames:

CHOSENTASKERID, LASTTASKERID, LASTTASKERNAME, TASKERGROUP, TASKERID, TASKERLIST, TASKERNAME, TASKERSLOTSSUSPENDABLE, TASKERSLOTSSUSPENDED, TASKERSLOTSUSED, TASKERSTATUS for jobs, TASKERID for clients, TASKERGROUP, TASKERHOST, TASKERNAME, TASKERSLOTSSUSPENDABLE, TASKERSLOTSSUSPENDED, TASKERSLOTSUSED, TASKERTYPE for slaves or taskers.

The same are available for use as symbolics such as @LASTTASKERID@ or @TASKERNAME@, etc.

Accelerator VOV-13398 None An error in the online help regarding vtk_server_config suddenshutdown <server-pid> has been addressed.
Accelerator VOV-13388 None Fixed issue that can cause vovserver to crash upon receipt of a REST request when thread.service.max and thread.service.enable.query are both greater than zero in the vovserver policy.
Accelerator VOV-13363 None Fixed ncupgrade abort by changing the vovserver stdout message to Vovmessage ( stderr )
Accelerator VOV-13346 CS0120637, CS0164333, CS0186238 Fixed an issue where a redirect in the nginx configuration would cause vovresourced to crash
Accelerator VOV-13413 CS0175672 Fixed issue where vwn incorrectly attempted to contact the server after the VOV_VW_PING interval.
Accelerator VOV-13438 None Removed vovproject enable command from ncupgrade so that it can read from stdin and can be used for testing automation.
Accelerator, Accelerator Plus VOV-13424 CS0176272 The handling of the resource parameter to vtk_flexlm_monitor has been improved. If the a resource name is specified then this name is the actual resource name used (in a previous release it was always prepended by License:). If a resource name is not specified, it defaults to License:<feature>. This is consistent with vtk_flexlm_monitor_all behavior.
Accelerator, Accelerator Plus VOV-13465 CS0180449 Resolved issue when using vtk_tasker_define with -tsdport.
Accelerator VOV-13458 CS0180796 During jobclass initialization the VovUserError proc does not exit or generate any output.
Accelerator Plus VOV-13563 CS0186685 Fixed an issue which resulted in stalled WX buckets reporting a waitreason of License:xyz when really just waiting on HW in the base queue. Fixed an issue which prevented the setting of resmap.sw.types in the policy.tcl file.
Accelerator Plus VOV-13560 None The custom vnc_policy.tcl file for PBS integration is no longer required and should be removed upon upgrading to 2021.1.1. This file is located in $SWD/vnc_policy.tcl and was originally copied from $VOVDIR/../common/etc/config/vovwxd/vnc_policy_pbs.tcl.
Accelerator Plus VOV-13484 None Fixed an issue which could result in some slaves not being recognized as vovwxd slaves resulting in them not being counted toward max,slaves. Optimized scheduling for WX/PBS jobs by enabling the bucket shortcut and removing per slave resources.
Accelerator Plus VOV-13772 None The Accelerator Plus online help has been updated to reflect the addition of the Direct Drive functionality.
Allocator VOV-13562 CS0187051 Fixed a race-condition in Allocator that resulted in random crashes in complex configurations.
FlowTracer VOV-10426 24499 NodeEditor has renamed 'In Queue' to 'Queue' and times shown are now based upon buckettime to give more accurate breakdown of the job's timeline.
FlowTracer VOV-12348 None A threshold of 4 is now applied before issuing warnings about WXLauncher is not running.
FlowTracer VOV-13419 CS0172898 Fixed vovconsole performance degradation for drawing the sets and switching between horizontal and vertical view.
Monitor VOV-13323 None Fixed issue that caused remote LM parser to fail and return no data.
Monitor VOV-13066 CS0151123 The registering of multiple hosts defined through env variable VOV_LICMON is now handled correctly. However its use should be minimized due to the additional overhead involved and consequential impact on job start up time.
Monitor VOV-13390 None Fixed an issue that prevented licenses provided by Altair license key files from being monitored.
Monitor VOV-13446 None Fixed rare issue that caused an "Unexpected return -90" message in the vovserver log when the top job of a bucket cannot be dispatched to a tasker at the time the dispatch function is called.
Monitor VOV-13440 None Fixed issue that prevented licensing detail tables from being displayed on the licensing administration web UI page on Windows.
Monitor VOV-13426 CS0154262 Some valid Accelerator and Monitor license key files with vovversion set for the early part of year 2021 were not working because of a bug in license keyfile validation code.
Monitor VOV-12980 CS0144723 Fixed ftlm_batch_report for checkouts that have not been moved yet to Altair Monitor database.

2021.1.1-p1 Release Notes

Resolved Issues
Products Internal Number Case Number Description
Accelerator VOV-13861 CS0210064 Fixed issue in which SIGALRM interrupted communications on interactive jobs using VOV_INTERACTIVE_PING keep alive method.
Accelerator VOV-13860 CS0208413, CS0208823 Fixed issue that caused the tasker to overload vovserver with messages when a job execution attempt failed due to not being able to successfully fork out the subtasker process that is used to shepherd the job.
Accelerator VOV-13816 CS0205113 Addressed an issue where license resources sometimes became unavailable when on life support.
Accelerator VOV-13849 CS0208895, CS0218919 Fixed bug where interactive (-I/-Ir) root privileged container jobs potentially resulted in a process group SIGINT being captured and accidentally being sent to systemd, following which bad things may happen, such as a system reboot on subtasker host.
Accelerator Plus VOV-13890 None Fixed issue which prevented wx taskers from reconnecting after server freeze/failover with fastexit enabled.
Accelerator Plus VOV-13872 CS0211355 Fixed issue where SICK status Accelerator taskers were not removed after an appropriate amount of time. The underlying cause was that there were still related jobs running in the base queue, and was repaired by passing the -forcerunning option to the NC base queue forget command for taskers with a SICK status.

2021.1.0-rs1 Patch Release Notes

The following new features and resolved issues were introduced this software release:

Product Issue Number Case Number Description
Accelerator VOV-13399   Added Rapid Scaling - a feature that provides high-throughput, cost-conscious scheduling in the cloud.
Accelerator VOV-13470   The Rapid Scaling PDF has been updated to reflect the changes relative to the 2021.1.0-rs1 patch.

2021.1.0-p1 Patch 1 Release Notes

New Features
Products Internal Number Case Number Description
All VOV-13502 None This fix provides for mitigation of a hang in the http(s) service. It reinstates the nginx service found in earlier releases. The use of the patch is required for production systems using the https service - typically Accelerator. The patch prevents the use of the REST v3 API, a subsequent release will address this shortcoming. The use of nginx should be seen as temporary and a subsequent release is expected to provide integrated https within vovserver. To start nginx, pass -webprovider nginx to ncmgr start, lmmgr start, etc. You should see vovnginxd start as one of the vov daemons. vovservermgr config will also show the webprovider setting as being either 'internal' or 'nginx' depending on how you have configured the system.

2020.1.0 Release

New Features and Enhancements

The following new features and enhancements were introduced this software release:

Product(s) Internal Number Case Number Description
All VOV-11377 A new document viewer for the Accelerator product family is provided in the web UI. This document viewer provides a modernized interface with new client-side search capability.
All VOV-11059 Field descriptions have been populated for all supported fields. These can be queried via the "fieldesc" metadata field, available for each object.
Accelerator, Accelerator Plus VOV-12279 CS0121103 The output of nc info and wx info now includes the project/queue name.
Accelerator VOV-11378 A modernized web UI dashboard page has been added to the Accelerator web UI. To access the page, browse to the URL generated by this command: nc cmd vovbrowser -url /dashboard. ADMIN privileged users may access this dashboard.
Accelerator VOV-10124 24397 The following commands nc run, vovset resources, nc modify -res, now support binary unit conversion for all memory based resources as a convenience from Petabytes (PB), Terabyte (TB), or Gigabyte (GB) to Megabytes (MB), which is still used internally and reported by all commands. The input conversion will accept either decimal or integer form and are all case-insensitive, so for example both nc run -r RAM/0.1Tb — sleep 0 and nc run -r SWAP/1GB — sleep 0 are supported. The currently supported parameter names for which this conversion is supported are RAM/, RAM#, RAMFREE#, RAMFREE/, RAMTOTAL#, RAMFREE/, SWAP/, SWAP#, SWAPFREE#, SWAPFREE/, SWAPTOTAL#, SWAPTOTAL/ and TMP# or TMP/. By default the unit is MB (Megabytes), where 1MB is 1<<20 bytes.
Accelerator VOV-9830 24092, 24093 Added confirmation WebUI dialogs for MakeDefault and ClearDefault job class - Writing MakeDefault and ClearDefault job class events to server log
Accelerator VOV-9778 23068, 23767, 23914, 24923 Irrelevant alerts are no longer generated. Addressed some implementation issues with vtk_flexlm_exclude_tags. Note that calls to vtk_flexlm_exclude_tags are cumulative and override any tags added with vtk_flexlm_monitor and vtk_flexlm_monitor_all. The -noooq parameter for vtk_flexlm_monitor has no impact at present, please use vovresSetFlags instead. The -order parameter to vtk_flexlm_monitor and vtk_flexlm_monitor_all only orders any specified tags, it no longer adds tags (use -tags to add tags). The optional parameters vovResource, vovMap to vtk_flexlm_monitor are now handled correctly.
Accelerator VOV-8055 21777 Added a new capability to limit the number of interactive jobs that can run concurrently, both at the global and user levels. This is accomplished by creating a limit resource and setting is as the interactive job limit in the vncrun.config.tcl configuration file. For example: set VOV_JOB_DESC(interactive,limit) Limit:interactive or for per-user a limit: set VOV_JOB_DESC(interactive,limit) Limit:interactive_@USER@ The resource must exist prior to adding these lines to the file.
Accelerator VOV-9017 23067 MAXCPUPROGRESS now provides the maximum sampled percentage of CPU time as an integer used by a job, including all its children, which can be greater than 100 if the job is utilizing multiple cores.
Accelerator VOV-9130 23265 Added the -sickslave parameter to the vovinvalidate command, to invalidate running/retracing jobs that are no longer attached to a slave.
Accelerator VOV-7811 21342 Added ability to provide a stop reason when stopping jobs via the web UI.
Accelerator VOV-12458 Implemented Dialpad menu for mobile screens.
All VOV-11323 vovdoc CLI utility is retired
All VOV-11454 vov_rest_v3.py is the new Python module used to make v3 REST API requests against vovserver.
All VOV-11251 Accelerated processing of Crash Recovery file.
All VOV-10844 Provide a REST API addition to allow job control. The following operations can be performed via the v3 REST API: 1. Dispatch 2. Forget 3. Preempt 4. Rerun 5. Resume 6. Suspend
All VOV-10964 Job attributes can now be modified via the v3 REST API in ways that are also possible via the command line with nc modify.

Resolved Issues

The following issues were resolved in this release.

Product(s) Internal Number Case Number Description
All VOV-5570 Fixed behavior of the ROWCOUNT field used by vovselect and related commands; vovselect will print "0" instead of an empty string when there are no rows in the query, and will print a correct count instead of 0 for "vovselect rowcount from objects".
All VOV-12721   Fixed an issue with the 2020.1.0 beta that caused a user to be logged out of a web session for one product when the user logged into a different product session in the same browser.
All VOV-9853 24114 When determining if a shell is configured for a project, the environment variables VOV_PROJECT_NAME and VOV_HOST_NAME must be set. New checks were added to ensure the values of these variables may not be empty strings and may not be set to "unknown".
All VOV-12027 CS0120819 Add the previously missing documentation for the vovlicensemgr command.
All VOV-12583 Fixed an issue where querying for "maxnumacores" was returning the total number of cores in the system instead of the maximum number of cores in a NUMA node.
Accelerator VOV-11515 25220 Added a new server configuration parameter, tasker.uninterruptableSignalCascades, that controls whether taskers should process incoming job control requests when there is an existing signal cascade being processed. If TRUE, taskers will ignore incoming job control requests if an existing request is in-process and a message will be printed to the slave log indicating as such. Default is FALSE.
Accelerator VOV-10850 NUMA support has been updated to work correctly on systems configured with the number of sockets unequal to the number of NUMA nodes (Click here for an example). On such systems, the earlier implementation would report the number of NUMA nodes and associated memory incorrectly. The earlier implementation could also result in degraded memory performance if a socket has more than 1 NUMA node.
Accelerator VOV-7947 21578, 24177 Fix a failure to remove an Accelerator job's pre-command and post-command script log files by the command nc clean.
Accelerator VOV-8975 22946, 29901 The output of nc getfield JOB cputime with a lowercase cputime is changed to be the integer accumulated CPU time for a job scaled in milliseconds. This is now consistent with the output of nc getfield JOB CPUTIME with an uppercase CPUTIME field name.
Accelerator VOV-5439 21073 Execution information shows, "job has never run" instead of epoch dates for jobs that have not run yet.
Accelerator VOV-11041 24894 Large v2 REST queries that produced no results have been fixed in the v3 REST API.
Note: The problem still persists with v2, the fix is in v3 only.
Accelerator VOV-11261 Addressed issue where delays were encountered due to vovserver not being immediately notified of an update.
Accelerator VOV-12337 Fixed timing issue with nc wait command when run immediately after submitting multiple jobs in background mode using nc run command
Accelerator, Accelerator Plus VOV-12305 CS0120716 Fixed issue that prevented the wxagent job in an Accelerator base queue from reflecting the job placement policy and priority of the user's job in an Accelerator Plus queue.
Accelerator VOV-11657 25247 Performance of vovselect and related Tcl commands has been improved in general, as well as specific enhancements for the ID field in all tables, and the WHY and FairShare-related fields in JOBS.
Accelerator VOV-12023 CS0120864 Jobs that have been autokilled no longer show duplicate autokill reasons in the job status explanation.
Accelerator VOV-12032 The integrated REST HTTP server now has a separate log under SWD/logs/httpserver.log.
Accelerator VOV-12030 CS0120906, CS0121020 Fixed issue that caused taskers to be killed with the message “Tasker instructed to exit brutally”. This also fixes server messages like “Cannot find tasker rdc-cad-svr12 (illegal id 365667285) pid=32830"
Accelerator VOV-12560 Fixed nc stop -after option, which was not waiting for the specified number of seconds before issuing the stop request.
Accelerator VOV-12628 CS0127402 Fixed a bug where License: was prepended the resource name if the resource parameter was specified in vtk_flexlm_monitor, even if the resource name already started with License:.
Accelerator VOV-12543 Fixed a crash in vovserver caused by trying to select a job graph or node graph field from a set by name, e.g. "vovselect NODEGRAPH from System:jobs".
Accelerator, Accelerator Plus, Monitor VOV-10682 24282 Features names such as set via vtkle_feature_set can now include the '+' character and will be handled properly via the web UI.
Accelerator, Accelerator Plus VOV-12403 CS0121177 Fixed an issue where NUMA jobs that span multiple NUMA nodes would not return all cores used by the job to the free pool on job completion.
None VOV-11221 25011 Monitor email notifications set in the Admin->Notifications UI page using legacy mode email delivery had failed to successfully deliver email to the recipient.
All VOV-10844 Provide a REST API addition to allow job control. The following operations can be performed via the v3 REST API: 1. Dispatch 2. Forget 3. Preempt 4. Rerun 5. Resume 6. Suspend
All VOV-10913 Fixed a bug that caused vovselect to issue an error when requesting the field "env" in all lowercase.
All VOV-9988 Made the WHY property more prominent in the Main Reasons section of the output of vsy and related commands for FAILED jobs.
Accelerator VOV-8599 22097 See VOV-11848.

2019.01 Release

Enhancements

The following new features and enhancements were introduced this software release:
Product Internal Number Case Number Description
All VOV-8981 22923 Slave objects have a new hardware resource HT that specifies whether hyper-threading is enabled on the host where the slave is running (query example: vovselect ht from slaves).
All VOV-8823 22590 The -O option on nc hosts has been enhanced to support display of the @RAMUSED@ hardware resource.
All VOV-8808   Added the field "SLOTSAVAIL" to the SAllocatorVES table for use with vovselect and the vtk_select APIs.
All VOV-8809   The following queryable fields have been added to the RESOURCEMAPS object: PREEMPTION, OTHERS, RESRES, NOLOG, NOOOQ, NOMATCH, MATCHRECENT, LEFTOEXPIRE, UTILIZATION, OOQDUMP, RESDUMP, and JOBDUMP.
All VOV-8320   CLI lmmgr, ncmgr/wxmgr now allows you to specify the database port option when start a new project (in consistence with the web UI database configuration tool and the CLI vovdb_util configure tool).
All VOV-8810   Several new queryable fields have been added to the JOBS and FILES objects; see the output of vovselect fieldname from JOBS and vovselect fieldname from FIELDS. In addition there is a new queryable object named "IOS". This object allows querying from the inputs and/or outputs of a specific node. For example to get the ID field from all of the inputs of node 12345, the query would be vovselect id from ios.12345 where isinput.
All VOV-7917 20080
Important: The license grace period functionality has been removed due to a technical limitation.
The software issues alerts 30/14/7/1d prior to the earliest expiration detected (in case there are multiple lines of the same license feature in the same license file).

Licensing robustness and network fault-tolerance have been significantly improved. The software can operate in "disconnect mode", which means that it connects, gets a checkout, then disconnects from the license server. Checkout is refreshed every hour. The software will not fail for until 5 full days have passed since the last successful checkout. Alerts are issued for this as well.

As before, license servers may be specified as a colon-separated list via the RMonitor_LICENSE environment variable

All VOV-8766   When copying the <swd>/vovnginxd/conf/nginx.conf.template file to an nginx.conf file, a message is output to the server log indicating that a copy is being made, and also specifying the source and destination files.
Accelerator VOV-8789   The ncupgrade script has been implemented at beta level. This script allows customers to upgrade their Accelerator installation to a different (newer) version of software without taking down the queue. The script prompts users for decisions (such as queue name, location of the new version, etc.) and is intended to perform the upgrade without losing any jobs, and without having client job submissions fail (they just take longer to register).

Please provide feedback on this feature during the beta period.

Accelerator VOV-7408 20586 FairShare configure page now displays actions for a group based on user's permission on that group.
Accelerator VOV-7239 20343 When vovserver issues errors or alerts about running out of disk space and having trouble writing a file, always tell the full path of the file.
Accelerator VOV-8536 22170 The nc why command output for a queued job now includes a count of dependent jobs, if any, that must finish before the queued job runs.
Accelerator VOV-7529 20618 Reservation is now a VovObject, which means reservations can be queried with vovselect, are saved persistently(PR), and go into crash recovery as well(CR). One slave can have multiple reservations where previously there could be only one reservation per slave. vovslavemgr reserveshow is improved. vovshow -reservations, vovforget -allreservations, vovforget reservationId could be used to see and handle reservations. vtk_reservation_create, vtk_reservation_delete, vtk_reservation_get, vtk_reservation_update APIs are added as well.
Accelerator VOV-8419 22067 Resourcemaps now use ACLs for access control. The default access control permissions for a resourcemap are:
OWNER:     ATTACH DETACH EDIT VIEW FORGET DELEGATE EXISTS     
ADMIN:     ATTACH DETACH EDIT VIEW FORGET DELEGATE EXISTS     
EVERYBODY: ATTACH DETACH VIEW EXISTS     
Accelerator VOV-9049   Accelerator can now be configured to automatically determine the timeout for license usage data gathered from Monitor based on the time it takes to gather the data.
Accelerator VOV-9494 23807 EXT JobControl signals are now logged in the slave log
Accelerator VOV-9293 23475 Added field for job called SLAVESTATUS; this field can be used for example in preemption rules to avoid preempting jobs that are running on suspended slaves (slaves with status SUSP).
Accelerator VOV-8655 22254 Added support for the '-noheader' option in bjobs (LSF emulation). This removes the column headings from the output.
Accelerator VOV-8897   Added a new environment variable VOV_GROUP_PURGE_PERIOD that can be used to instruct vovslave to periodically purge its internal cache of UNIX group information, including gid/name maps and user group lists. Specified in seconds (minimum 60). Default: 0 (disabled).
Accelerator VOV-8501   Slaves and resource maps can be reserved on the web page representing a corresponding object. The list of reservations for an object will be shown on the page.
Accelerator VOV-8331 22802 Added -xdur flag to nc run for specifying the expected duration of the job. This is identical to the -x flag.
Accelerator VOV-7888 21475 The vovslave now defaults to using the vovgetgroups utility for obtaining user group information from UNIX-based operating systems. Previously, this was enabled by setting the VOV_USE_VOVGETGROUPS environment variable to 1.
Accelerator VOV-7886 21455 Improved readability of messages generated with vov_diagnostic_no_start.
Accelerator VOV-7198 20252 Added new "STARTTIME" field for slaves that contains the UNIX timestamp of the slave's start time.
Accelerator VOV-7207 20224 In Accelerator it is now possible to use nc wait -start to wait for jobs to start running.
Accelerator VOV-8784   Improved RESERVE_SAllocatorVES preemption ruletype to take -preemptslavespec parameters to take a slavelist and slave spec. The preemption puts reservations on slaves that are compatible with preempting jobs.
Accelerator VOV-7388 20587 nc hosts and vovslavemgr reserveshow commands are upgraded according to the new reservation features.
Accelerator VOV-5392 20367 Resource expressions now support lists of typed resources in place of OR, e.g.: nc run -r User:abc,User:def -- sleep 10 .

Additionally the AND, OR and NOT keywords are now case-insensitive.

Accelerator VOV-5455   The number of cpus(cores) is configurable through vovslavemgr configure -cpu.
Accelerator VOV-4823 20583 Added new hog protection for users with more currently-managed jobs than a configurable threshold, that when breached, will result in a configurable service delay for that user. Currently-managed jobs are those younger than the autoforget thresholds, as defined in the policy.tcl. All job states are considered.

This protection can be enabled and configured via the following server configuration parameters defined in the policy.tcl file:

hog.protection.enable (default=0, which is off)
hog.protection.jobcountthreshold (default=100000, min=1000, max=999999)
hog.protection.clientdelay (default=1, min=1, max=600, specified in seconds)
Accelerator VOV-8973 22958 Accelerator now correctly generates an alert if the Monitor instance being tracked by vovresourced is not available at first contact attempt. Previously this alert would appear only if contact was made and then lost.
Accelerator VOV-9763   A new vovshow command option -innerlooptimers provides profiling information about the vovserver innerloop. It can help identify performance issues or performance effects of certain job workloads.
Accelerator VOV-9720 23696 The severity level of vovwxd alerts can be configured. Please refer to $VOVDIR/../common/etc/config/vovwxd/config.tcl for details about each alert. Look for parameters CONFIG(alert,*)
Accelerator   23619 Users can now specify the NUMA-related resources MAXNUMACORES and MAXNUMAFREECORES when running jobs. The former refers to the maximum number of NUMA cores in a single node on the slave, the latter to the maximum number of free NUMA cores on the slave. For example: nc run -jpp pack -r+ MAXNUMAFREECORES#4 will request a job on any slave with at least 4 free NUMA cores on the same node.

In addition, the fields MAXNUMACORES and MAXFREENUMACORES have been added to the SAllocatorVES object for the purposes of vtk_select_loop, vovselect and related commands.

Accelerator VOV-8703 23470 Added support for Linux container integration (beta). See the Accelerator Administrator Guide for information on configuring container support, and the Accelerator User Guide for information on requesting container support in job submissions. Example configurations and hook scripts are provided in $VOVDIR/etc/config/containers.

Resolved Issues

The following issues were resolved in this software release:
Product Internal Number Case Number Description
All VOV-9363   Fixed server crash when the Detach slave from server" link was clicked on the Slave Details page in the web UI.
All VOV-9333   Fixed an issue with vovserver failover that could lead to a corrupt VOV_HOST_HTTP_NAME setting if that value did not already contain a fully-qualified domain name.
All VOV-9333 23318 Fixed issue where an invalid slave resource expression would result in a permanent error message on the "slaves" web page.
All VOV-8578 21858 The following queryable fields have been added to the SAllocatorVES table for consistency with the vovslavemgr configure command: RAMSENTRYFAllocatorG, MINRAMFREE, EFFlowTracerOTALRAM, RETRYCHDIR, RETRYCHDIRSLEEP, RETRYCHDIRSLEEPBACKOFF, MAXWAITNOSTART, ALLOWCOREDUMP, DEBUGJOBCONTROL, AUTOKILMonitorETHOD, MINDISKSPACE.
All VOV-8685 22253 Fixed issues in D and U environments (which are used to Define and Undefine specific env vars) when they are both used, or they are nested (such as having multiple D calls in the env). Previously, it was possible to define something in the environment but have that item undefined in the "END" environment script. Now, it preserves the state using temporary environment variables so that each "START" and "END" environment script has consistent environment variables defined.
All VOV-9501 23821 Rebuilt binaries dependent on OpenSSL to remove a bad default OpenSSL configuration path.
All VOV-9485 23695 Fix a rare crash during slave startup caused by an intermittent failure in the Linux proc filesystem.
Accelerator VOV-9423 23709, 24842 Modified nc list core algorithm to cap output at 2GB. The command will return error if output string gets over limit.
Accelerator VOV-9498 23813 Fixed behavior of vovstop and related commands with regard to EXT:KILL - include and exclude specifications should work as expected now.
Accelerator VOV-9246   Improve vovserver memory allocation for better performance
Accelerator VOV-9245   On Linux-based systems, there are now two client servicing modes from which to choose: poll (default) and epoll. The mode chosen specifies which POSIX mechanism the vovserver will use to determine which client file descriptors are ready for use. The mode can be specified by setting config(useepoll) to 0 (poll, default) or 1 (epoll) in the SWD/policy.tcl file. Generally, the epoll mode should result in more efficient processing of service requests. As of this version, epoll mode is a new feature and is therefore disabled by default.
Accelerator VOV-9098   For nested FairShare groups with same string pattern in the name for example: "/somename" and "/topgroup/somename". The latter one is nested, child of topgroup. When user went on fairshare.cgi web page and clicked to show info for "/somename" group they saw list of both above groups: "somename" and "topgroup". Fixed: second group does not show up when "/somename" is selected
Accelerator VOV-5892 22302 WX jobs occasionally use global Tcl libraries for environment instead of VOVDIRs Tcl.
Accelerator VOV-9163   Improved FreeSlave rule type to update existing reservations if the rule still fires for a waiting bucket. ReserveSlave rule type is improved to skip slaves in bad status such as SUSPENDED.
Accelerator VOV-9076 23182 No longer print a file sourcing error when issuing an unknown nc subcommand.
Accelerator VOV-8041 21757 Eliminated the need for host-based equivalence cache files. The vovserver now caches equivalence entries and sends them to clients when needed.

For this reason, when using this new default behavior, equivalences that reference an environment variable should not resolve the variable in this file, in environments that will have both Unix and Windows clients. Instead, they will need to be resolved by the client upon receipt. This is done by enclosing the equivalence value inside curly braces and referring the environment variable as $VARNAME as opposed to the Tcl format of $env(VARNAME).

The previous behavior can be restored by setting VOVEQUIV_CACHE_FILE to "legacy" in the $SWD/setup.tcl file for a given instance or globally for all instances via a $VOVDIR/local/vovsetup script (vovsetup.csh or vovsetup.sh).

Older clients connecting to a new vovserver will work without any changes as long as the $SWD/equiv.caches directory exists and has world-write permissions. Variables in the new form will resolve correctly.

New clients connecting to an older vovserver will require legacy mode to be enabled.

Accelerator VOV-7407 20607 Corrected the logfile field value in the nc getfield output when a job is submitted with the -nolog option. The logfile field now displays /dev/null in this scenario.
Accelerator VOV-9028 23100, 23463, 23464 Increase performance of vtk_resourcemap_set_limit, the Tcl API command that is used to set limit resources by expanding a @KEYWORD@ to find matches to the main resource being passed in.
Accelerator VOV-7242 20265 FairShare groups now have a "flatten" setting. When a group is flattened, all of its child groups will be treated as though they were on the same level of hierarchy; that is, all leaf-level groups will be assigned weights as though they were direct children of the top-level flattened group, ignoring weights assigned to any non-leaf-level groups in between. This setting can be enabled by the vovfsgroup modify command. Note that disabling this setting for a group that has a flattened parent will have no effect; the group will still be flattened.
Accelerator VOV-8533 21902 Allow for the preemption debug property to exceed the currently configured max value length, up to the highest possible length that can be configured.

Also, for all properties that exceed the max value length, instead of setting the value to an error message only, the value is now set to the same error message as before, followed by a truncated version of the value intended, ending with a truncation indicator.

Accelerator VOV-9064 23004 Added ability to query slave resources individually using the vovselect utility.
% vovselect resources.X from slaves 
% vovselect resources.X from 012345678 where 'resources.X>100' 
This works for both standard and user-defined resources.
Accelerator VOV-8233 21983 Messaging in vsy, nc info and related commands has been changed to clarify which resources are actually missing when the user specifies OR clauses in the resource request.
Accelerator VOV-7946 21549 Prevent the server from checking the existence of SNAPSHOT environment files for Accelerator and Accelerator Plus to prevent long file access times from affecting server performance.
Accelerator VOV-7882 21473 Fixed issue that caused hosts to lose information such as frequency, cores, and RAM after doing a sanity. Now the data is recomputed after sanity when needed. Sanity still removes hosts older than 30 minutes (sort of like a 30-minute cache on host info), but now it gets the information back properly.
Accelerator VOV-7630 21021 <br /> in the Accelerator jobs for user page is for a separate line for each tool info, now is made work as intended.
Accelerator VOV-8352 22090 A case where the partialTool script could exit unexpectedly has been fixed.
Accelerator VOV-8198 21965 Improved the "why" message for jobs that are invalid or failed due to a dependency to be more meaningful. For Accelerator, it now includes information about the dependent job instead of the PHANTOM file used to create the dependency. For FlowTracer, it includes information about the input file as well as the upcone job which has the file as an output.
Accelerator VOV-7509 20789 The vovclientmgr utility now has a "closedeadinteractive" action, which allows the user to close clients that have interactive jobs that are no longer running. Measures have also been implemented to prevent such clients and their matching vovsh processes from lingering in the first place. Now, when an interactive job is either spawned or placed in the background, both the client and vovsh process will close upon job exit.
Accelerator VOV-8730 22315 Fixed an error where editing a queued interactive job in the Accelerator web interface could potentially cause the job to fail.
Accelerator VOV-8059 21780 Cancel creating new preemption rule goes back to the main preemption page.
Accelerator VOV-7639 20590 The command line tool vovslavemgr no longer attempts to start a slave that is already running.
Accelerator VOV-8170 21929 Added parameter resmap.sw.types with default value "License Limit Policy User Group Priority"
Accelerator VOV-6186 20995 The nc getfield command now works correctly with properties that have lower-case letters in their names.
Accelerator VOV-3765 20136 The nc run command now supports an option to control the number of times a job can be rescheduled. Thus: -maxresched <N> -- Maximum number of times the job can be rescheduled. Must be >= 1 and <= 10 (default 10). This implemented via the MAX_RESCHEDULE property on the job.

Errata

The following issues and defects are known to exist in this software release.

Product Internal Issue Description
All VOV-9759 The config subcommand of vovservermgr misprints confirmation messages
All VOV-9827 The vtk_slave_define Tcl command no longer supports automatic resources based on the vovslave name.
All VOV-9419 Setting config(useepoll) 1 in the policy.tcl config file has no effect.
All VOV-9771 The vtk_select_get command is not honoring the case of field names.
All VOV-9742 There is a new vovservermgr command that has been added for system administrators. This command has several subcommands that provide an easier way to set vovserver configuration and environment variables, and interface to memory chunking and scheduler tuning controls.
All VOV-9623 eventserver and epoll() are not starting with vovservermgr
All VOV-9748 Elaborate the vovservermgr -h help screen with more complete usage information.
Accelerator VOV-9792 Containers should allow user IDs from NIS or LDAP.
Accelerator VOV-9793 Containers hooks can be made more portable for Centos host OS.
Accelerator VOV-9806 A container resource shows up 2 times in vovslavemgr show -resourceonly output.

2019.01 Update 7 Release Notes

Product(s) Internal Issue Case Number Description
All VOV-12584   Session and process group IDs 0 and 1 will not cause cause all process with those ids to be included in the process graph used for job statistics.
All VOV-13023  
  • Prevent vovslave from growing beyond the configured maximum capacity whenthe job in the last slot becomes suspended.
  • Allow for maximum capacity to be configured on a running slave even when thecapacity value is not changing.
  • Support the full capacity specification format in wxagent via the CONFIG(slave,slots) parameter.
  • Do not automatically mark a wxagent job as suspended if one of its child processes become suspended.
Accelerator VOV-12707 CS0131356 Curly braces can now be used in environment specifications as a means of supporting characters that would normally be sensitive to the processing of the specification. For example, to pass a comma into an environment variable value using the D environment, use D(FOO={bar,baz}), or via the alternative syntax of D,FOO={bar,baz}.
Accelerator VOV-12628 CS0127402 Fixed a bug where License: was prepended the resource name if the resource parameter was specified in vtk_flexlm_monitor, even if the resource name already started with License:.
Accelerator VOV-12726 CS0133891 The vovcleanup utility has been updated to work with resource data files using the .res suffix.
Accelerator VOV-12489   Fix for cleaning up the files created today when vovcleanup is run before noon and cleantime is < 24h. Previously, these files were not being removed by vovcleanup.
Accelerator VOV-12977 CS0143428 Fixed an issue with interactive jobs (nc run -I) failing with the error message "Job has problems with PTY. Bad pipes".
Accelerator VOV-12030 CS0120906,CS0121020 Fixed issue that caused slaves to be killed with the message "Slave instructed to exit brutally". This also fixes server messages like "Cannot find slave rdc-cad-svr12 (illegal id 365667285) pid=32830"
Accelerator VOV-12396 CS0121165 Multiple CORES, RAM, and TMP resources requested for NC job are properly combinednow when passed to container. For example "nc run -r+ RAM/100 -r+ RAM/200 ..." will result in setting "VOV_CONTAINER_RAM=300" available from Container Hook scripts.
Accelerator VOV-9776 24061 The form for submitting a slave reservation now checks that the data is valid before actually submitting the reservation.
Accelerator VOV-8012 21662, CS0121063 VOV_LM_VARNAMES functionality will now be available for interactive jobs and will support multiple license servers in colon or semicolon (windows) separated list instead of space separated list.
Accelerator,Accelerator Plus VOV-13051 CS0121039 Fixed an issue with interactive jobs (nc run -I) failing with error messages similar to"Error=98: Address already in use [vovttyserver2:244]" and "FATAL ERROR: Cannot open PTY port (with remote signal handling): Cannot open pty server sockets [vncrun.tcl:2257]". This is accompanied by job errors similar to "Cannot connect to PTY server on submission host lava1 13316 Z@:x=XGa56cT_Hd6 from lava5".
Accelerator,Accelerator Plus VOV-13038 CS0145428
  • Fixed issue that prevented shared memory lookup from being disabled if the VOV_SHARED_MEMORY_LOOKUP_DISABLED environment variable was set.
  • Support configurability of job statistics calculation interval via the pre-existingcomposite-value -U option to vovslave, as well as the vtk_slave_* calls. The composite value format is a CSV list of timespecs, in order of resource updates (slave heartbeat), slave statistics, and job statistics.
  • Support the composite-value for the -U option in the vovwxd configuration via the CONFIG(slave,update) parameter.
Accelerator,Accelerator Plus VOV-13030 CS0149277 Fixed an issue with Ctrl-C not working as expected with interactive jobs (nc run -I/-Il/-Ir).
Accelerator VOV-11226 24205, CS0128626 Fixed issue that prevented pre and post commands from running as the job user in containerized jobs that make use of container hooks that run as root.
Accelerator, Accelerator Plus VOV-12629 CS0127516 Removed duplicate detection logic from -r+ and -dpres+ submission options. As a result, the -r+ option now properly handles more complex resource specifications.
Accelerator VOV-11000 24853 This change allows DP jobs to assign a separate jobclass to the master and component jobs. It works by setting VOV_JOB_DESC(dp,jobclasses) to a comma separated list of jobclass names the same way that VOV_JOB_DESC(dp,resources) can be set to specify resources. The jobclasses are treated as strings. They are not evaluated as you are already setting these from within a jobclass definition. See the updated Accelerator documentation for further explanation and examples.
Accelerator Plus VOV-12635 CS0128164 Fixed a bug causing a memory access violation in vovserver when multiple slaves are being stopped with running jobs.
Accelerator Plus VOV-12658 CS0124441 vovwxd will no longer print warnings about slaves being in "state 1" (pending).
Accelerator Plus VOV-11836 CS0120725

nc forget using the options -mine, -dir, or -subdir, will no longer forget system jobs bydefault. A new option, -system, will include system jobs to match old behavior. In general, the option -system should not be used when forgetting user jobs in a WX setting because forgetting launcher jobs for slave agents that are not runnable in the batch system before they are processed by vovwxd couild result in lingering queued agent jobs. Using nc forget

-minewithout the -system flag will forget the user submitted jobs and allow vovwxd to more

efficientlyclean up slave agents.

Accelerator Plus VOV-12316 AAP25172

Fixed an issue that caused vovserver memory to grow over time as "nc wait" and "nc run

-w"commands were issued.

Accelerator Plus VOV-13037 CS0147601
  • Prevent vovslave from growing beyond the configured maximum capacity whenthe job in the last slot becomes suspended.
  • Allow for maximum capacity to be configured on a running slave even when thecapacity value is not changing.
  • Support the full capacity specification format in wxagent via the CONFIG(slave,slots) parameter.
  • Do not automatically mark a wxagent job as suspended if one of its child
FlowTracer VOV-12571   vovwxd configuration parameters CONFIG(launchers,autoForgetSuccessful) and CONFIG(aunchers,autoForgetFailed) will now work regardless of the CONFIG(log,level) setting. The defaults for these settings has been changed to "0s". Either of these values should only be set to non-zero when debugging slave launch issues to avoid additional server overhead.
FlowTracer VOV-12570   Using local resources in FlowTracer with an LSF Base queue will now correctly account for local resources when using array launcher submission.
FlowTracer VOV-12569   FlowTracer Local resources will now be properly released in the case of an LSF launch failure.
FlowTracer VOV-12455 CS0121195 This fix addresses issues handling files larger than 2G in size in the flow.
FlowTracer VOV-12620   vovwxd now supports LSF array jobs when getting the status of launcher jobs submitted to the queue.
FlowTracer VOV-12261   License:*resources will now be properly passed to the base queue when using FlowTracer with vovwxd.
FlowTracer VOV-12203 CS0120999 Re-evaluation of a job class to compute the union of resources when used with an indirect slave (slaveVNC) is no longer done. This is typically relevant for FlowTracer integration with either Accelerator (NC) or AcceleratorPlus (WX). To restore the old behavior, please contact Altair support.
FlowTracer VOV-11873   To enable resources to be managed locally by FlowTracer when using an LSF backend via vovwxd enable the vovwxd.localresources parameter in the policy file and create the resources in FlowTracer using the vtk_resourcemap_set -local parameter.
Monitor VOV-12575   Fixed handling of mixed-case user names on Windows to always honor the case reportedby the OS. On Windows, the case used at login time will be the case used when obtaining the active user from the OS and when applying security. It is therefore recommended for users to always use the same case when logging into Windows. Otherwise, a separate vtk_security entry will be required for each case used (such as for joe, Joe, and JOE), and those user names will be considered as different users. Prior to this change, for a mixed-case username, the default security entry was for the user name in lower-case only, resulting in a security mismatch if the user logged into the web UI with the same case used to login to Windows itself.
Monitor VOV-12294 CS0121092 Suppressed below server log for loopback IP addresses. vovserver(3956) ERROR Jun 1200:04:28 Found host with different ip: 127.0.0.1 instead of 10.10.1.34 [host:903]
2019.01 Update 7 Patch 4

Product(s) Internal Issue Case Number Description
All VOV-14220   FIFO read pipes accumulate in vovtasker root when interactive jobs terminate.
2019.01 Update 7 Patch 3

Product(s) Internal Issue Case Number Description
All VOV-13872 CS0211355 Fixed issue where SICK status Accelerator slaves were not removed after an appropriate amount of time. The underlying cause was that there were still related jobs running in the base queue, and was repaired by passing the -forcerunning option to the NC base queue forget command for slaves with a SICK status.
All VOV-13861 CS0210064 Fixed issue in which SIGALRM interrupted communications on interactive jobs using VOV_INTERACTIVE_PING keep alive method.
All VOV-13816 CS0205113 Address issue where license resources sometimes became unavailable when on life support.
All VOV-13860 CS0208413, CS0208823 Fixed issue that caused the slave to overload vovserver with messages when a job execution attempt failed due to not being able to successfully fork out the subslave process that is used to shepherd the job.
All VOV-12989 CS0145649 Fixed an issue where stopping more than 1 vovslave by name (vovslavemgr stop <slave1> <slave2>...) was renaming only the last named tasker to <slavename>_stopped_<timestamp>.
All VOV-12812   Enabled client activity logging for nc cmd commands.
2019.01 Update 7 Patch 2

Product(s) Internal Issue Case Number Description
All VOV-13629 CS0186671, CS0192772, CS0194045 Fixed an issue where a failed PTY connection for a job would cause subsequent jobs on the slave to fail as long as the original job was still running, and in some cases, the slave could become unresponsive.
All VOV-13346 CS0164333, CS0186238 Fixed an issue where a redirect in the nginx configuration would cause vovresourced to crash.
All VOV-13553 CS0185082, CS0191010 Remove the need to call vtk_flexlm_monitor_all -reset).

No longer make redundant calls to process license data.

No longer start voveventmon when vovresourced is called with the -initjobclass parameter.

2019.01 Update 7 Patch 1

Product(s) Internal Issue Case Number Description
All VOV-12714   Fixed the following issues with job container support:
  1. The VOV_CONTAINER_<resource> (where <resource> is CORES, RAM, or TMP) environment variable was not being set in the hook environments for containers that were configured to use the job's resource specification as container limits.
  2. The VOV_CONTAINER_JOB_RUNDIR environment variable was not being set in the hook environments if the job's run directory had been configured as an equivalence, as would be the case for jobs submitted from within the user's home directory.
All VOV-13025 CS0145466 Added a config key "MQ(pjProbeKillTimeout)" for the maximum time that the vovlad daemon should wait for existing probes to be killed at startup.

2019.01 Update 6 Release

New Features and Enhancements

The following new features and enhancements were introduced this software release:

Product Internal Number Case Number Description
All VOV-9777 24039 Added "Last Dispatch" into the page jobqueue?page=buckets&. Prior to fix, it was showing "age", which is confusing as it gets updated on submission and job dispatch.
Accelerator VOV-11617   Added support for SIGTSTP to vtool.

When vtool receives SIGTSTP, it releases its license tokens and suspends itself. Upon receiving SIGCONT, vtool resumes and eventually checks out the tokens again.

Accelerator VOV-10801 24711 Added support for keeping an interactive job's (nc run -I, -Ir, etc.) network connection with the remote job alive by periodically "pinging" the remote job. This option is enabled by setting the VOV_INTERACTIVE_PING environment variable to the desired ping interval (timespec format).
Accelerator VOV-11740 25241 Duplicate quantitative resource requests are no longer rejected when using the -r+ submission option. For consumable resource requests, the sum duplicate values is used. For all others, the maximum value is used.
Examples:
"-r+ RAMFREE#100 -r+ RAMFREE#200" = RAMFREE#200
"-r+ RAM/100 -r+ RAM/200" = RAM/300

Note that nc info shows the resource string that was specified in the submission. The solution field and property of the job will contain the final result.

Accelerator VOV-11025 23337, 24800 Enhance Accelorator's tracking of the user processes that are part of a job by using UNIX session IDs and also by examining the process environment for the VOV_JOBID variable to associate it with the right NC job.
Accelerator VOV-11640 24715 Fixed issue where more than 10 rows of running jobs cannot be exported as CSV.
Accelerator VOV-11694   Fixed issue with parsing of -dpresN option in nc run
Accelerator VOV-5418 21072 In the output of the nc info command, if the slave for the job being inspected has a current status of SICK, the status will be indicated in parentheses after the name of the slave.
Accelerator VOV-11351 25139 The -dpres+ option is now supported in DP jobs.
Accelerator VOV-11155   The ALLPIDS property has been added to the online help.
Resolved Issues

The following issues were resolved in this release.

Product Internal Number Case Number Description
All VOV-12236   Fixed an issue leading to the following server logs:
"vovserver(3260) ERROR May 24 00:00:53 
Found host with different ip: 10.102.50.114 instead of 10.102.50.114 [host:903]"
All VOV-11253 25045 Improved the error messaging for the vtk_resourcemap_forget API command.
Accelerator, Monitor VOV-11487   A bug fix in 2019.01u5 could result in issues when starting a server with the same requested port id specified in rapid succession, or when requesting a range of ports such as -port 500:501:502.
Accelerator VOV-11223   A typo in the example of the vovreconciled/config.tcl in the Altair Accelerator Administrator's Guide has been fixed.
Accelerator VOV-11515 25220 Added a new server configuration parameter, tasker.uninterruptableSignalCascades, that controls whether slaves should process incoming job control requests when there is an existing signal cascade being processed. If TRUE, slaves will ignore incoming job control requests if an existing request is in-process and a message will be printed to the slave log indicating as such. Default is FALSE.
Accelerator VOV-11650 29856 Added an alert with error message to notify user when create reservation fails due to reservation period overlap with an existing reservation.
Accelerator VOV-12043 CS0120714 Calling vtk_slave_reserve will now cause a purge of all existing reservations for the given slave prior to new reservation creation.

Any vovslave or vovslavemgr start/restart, vtk_slave_define , etc invocation involving -e will also apply the same purge of all existing slave reservations before creation of a new reservation on the slave.

Users must use vtk_reservation_create to create multiple reservations on a slave, or via the web based UI.

Accelerator VOV-7591 20937 Documentation has been updated to not use RAM# in redundant quantitative resources example.
Accelerator VOV-4514 20957 Duplicate values in resource requests for consumables do not cause the job to be rejected in nc run.
Accelerator VOV-12003 CS0120875 Fix for cleaning up files under data/slaveload
Accelerator VOV-12095 AAP24206 Fixed "passing env variable to NC job inside container" issue.

User can call env FOO=BAR nc run ... and get correct value of $FOO inside the job.

Accelerator VOV-11892 CS0120652, CS0120732 Fixed a bug in vtk_flexlm_monitor when it would not create a summary resource map for a feature with multiple tags. The -order parameter to vtk_flexlm_monitor only orders tags, it no longer adds tags (use the -tags to add the tags).
Accelerator VOV-11014 24851 Fixed a timing issue where NC failed to properly reconnect to LM after an LM failover.
Accelerator VOV-11469 25174 Fixed an error that affected containerized jobs where at least one of the container hook scripts required root privileges. In this case, non-root privileged scripts as well as the job itself could potentially elevate themselves to root.
Accelerator VOV-11485 24206 Fixed an issue in the OS container support that could trigger an error message in the vovslave log about it being unable to "clear environment."
Accelerator, Accelerator Plus VOV-12200   Fixed issue with job arrays that are dependent upon a job or set, where the array jobs will not run due to an invalid input.
Accelerator VOV-11543 25205 Fixed issue with plotting workload wait reasons
Accelerator VOV-11699 29895 Fixed the issue where jobs are staying forever in pending state waiting for Limit:@USER@_N resource when both 'CGROUP' and 'limit' options are given in the job's input resource list.
Accelerator VOV-6077 21235 Fixed vovslavemgr restart issue when slave has jobs running.
Accelerator VOV-11895 AAP24898, CS0120786 Inequality comparisons using at-delimited field names in preemption rules (e.g. "PRIORITY<@PRIORITY@") will no longer give an error message about non-numeric comparisons.
Accelerator VOV-12125 CS0120945 Moved processing of interactive jobs (nc run -I, -Ir, etc.) to a separate thread for each job. This should improve the responsiveness of interactive jobs.
Accelerator VOV-11805 29905 Reported intermittent slowness in NC queue has been addressed in an earlier release.
Accelerator VOV-10356   Stderr will now be included in the output log for default windows jobs.
Accelerator VOV-11923   Stopping a job while using VOV_SLAVE_SID_DISABLE should no longer result in stopping processes in the same SID that are not part of the job.
Accelerator VOV-11298   The nc getfield command previously reported lowercase fields "curram", "slaveid", and "retracingid" with 0 values although the corresponding uppercase fields "CURRAM", "SLAVEID", and "RETRACINGID" were correctly reported. This fix corrects the reporting of the lowercase field values.
Accelerator VOV-12016 CS0120897 The duration of the reservation for the vovdbd vovslave is changed from 365d to 'forever', so that no regular jobs may be placed on it after expiration.
Accelerator VOV-8026 21721 The issue where vovslavemgr start needs to wait until vovfoster jobs has completed was addressed in an earlier release.
Accelerator VOV-6082 24694 This fix improves reliability of the -wl option to nc run by avoiding timing-sensitive collisions with other jobs on the ports used.
Accelerator VOV-12046 CS0120935 vovserver will manage the availability of resources on a slave without vovfoster jobs. When slave is requested to exit while it has a running job, it changes its state to DONE and renames itself to "name_stopped_id". Starting the same slave, at this moment, will result it to adjust the numbers of available resources based on their consumption in the previous instance.
2019.01 Update 6 Patch 1

Internal Number Products Case Number Description
VOV-12261 FlowTracer   License:* resources will now be properly passed to the base queue when using FlowTracer with vovwxd.
VOV-12620 FlowTracer   vovwxd now supports LSF array jobs when getting the status of launcher jobs submitted to the queue.
VOV-12570 FlowTracer   Using local resources in FlowTracer with an LSF Base queue will now correctly account for local resources when using array launcher submission.
VOV-12569 FlowTracer   FlowTracer Local resources will now be properly released in the case of an LSF launch failure.
VOV-11873 FlowTracer   To enable resources to be managed locally by FlowTracer when using an LSF backend via vovwxd enable the vovwxd.local resources parameter in the policy file and create the resources in FlowTracer using the vtk_resourcemap_set -local parameter.
VOV-12030 Accelerator

0120906, 0121020

Fixed issue that caused slaves to be killed with the message "Slave instructed to exit brutally". This also fixes server messages like "Cannot find slave rdc-cad-svr12 (illegal id 365667285) pid=32830".
VOV-12628 Accelerator 0127402 Fixed a bug where License: was prepended the resource name if the resource parameter was specified in vtk_flexlm_monitor, even if the resource name already started with License:.
VOV-12647 Accelerator 0128385 Fixed issues with interactive jobs failing with the following log message "Timed out waiting forauthentication request from pty server". Also, added a new environment variable, VOV_INTERACTIVE_AUTH_TIMEOUT, to configure the authentication timeout on the vovslave.
VOV-12679 Accelerator 0124318, 0128385 Fixed an issue with interactive jobs failing with the incorrect message "Client has responded to authentication request... with key &apos; &apos;". This happens when the client has closed the connection. This is also accompanied with the following vovslave log message "Timed out waitingfor authentication request from pty server".

VOV-11836

Accelerator Plus

0120725

nc forget using the options -mine,-dir, or -subdir, will no longer forget system jobs by default. A new option, -system, will include system jobs to match old behavior. In general, the option -system should not be used when forgetting user jobs in a WX setting because forgetting launcher jobs for slave agents that are not runnable in the batch system before they are processed by vovwxd couild result in lingering queued agent jobs. Using nc forget -mine without the -system flag will forget the user submitted jobs and allow vovwxd to more efficiently clean up slave agents.
VOV-12635 Accelerator Plus 0128164 Fixed a bug causing a memory access violation in vovserver when multiple slaves are being stopped with running jobs.
VOV-12316 Accelerator Plus   Fixed an issue that caused vovserver memory to grow over time as "nc wait" and "nc run -w" commands were issued.
VOV-12647 Accelerator Plus 0128385 Fixed issues with interactive jobs failing with the following log message "Timed out waiting for authentication request from pty server". Also, added a new environment variable, VOV_INTERACTIVE_AUTH_TIMEOUT, to configure the authentication timeout on the vovslave.
VOV-12679 Accelerator Plus 0124318, 0128385 Fixed an issue with interactive jobs failing with the incorrect message "Client has responded toauthentication request... with key &apos; &apos;". This happens when the client has closed the connection. This is also accompanied with the following vovslave log message "Timed out waiting for authentication request from pty server".

2019.01 Update 5 Release

New Features and Enhancements

The following new features and enhancements were introduced this software release:

Table 3.
Product Internal Number Case Number Description
All VOV-11299 25102 Fixed error that resulted in a "Server is operating on a non-internal object" error to be printed in the server log. This error is linked to querying for the "why" status of a job that has an input dependency.
All VOV-11350   Fixed statistics for some hierarchical sets.
Accelerator VOV-7535 20833 The proc VovGetRevokeDelay {} can now be added and customized by redefining it in vovresourced/config.tcl under the SWD directory to allow users to customize the revoke delay to be used in vovreconciled. This allows users to have the revoke delay from their job classes override the default value of RESD(revokeDelay). The proc definition has been added to the Altair Accelerator Administrator Guide. In addition, the verbosity levels of various messages have been modified per customer requests.
Accelerator VOV-3765 20136 The nc run command now supports an option to control the number of times a job can be rescheduled. Thus:
  • maxresched <N> -- Maximum number of times the job can be rescheduled.
  • Must be >= 1 and <= 10 (default 10).
This is implemented via the MAX_RESCHEDULE property on the job.
Resolved Issues

The following issues were resolved in this release.

Table 4.
Product Internal Number Case Number Description
All VOV-10110   vovwxd cleaner log files will be preserved for the time spec specified by the delCleanerLog,older config parameter in vovwxd/config.tcl.
All VOV-11326   Slave slot licenses will be released when a slave exits in Auto Licensing mode.
All VOV-11294   The /local/registry/system-accelerator folder may have not always been writable because it was created with user's umask permissions. Now created with 777.
All VOV-11350   Fixed statistics for some hierarchical sets.
Accelerator VOV-8012 21662 VOV_LM_VARNAMES functionality will now be available for interactive jobs.
Accelerator VOV-11347 25138 Fixed an issue in vovfsgroup loadconfig where the weight and window values of the FairShare group were not getting set to the values in the config file.
Accelerator VOV-11242 25036 Broken HTML links in the Altair Accelerator Training Guide have been fixed.
Accelerator VOV-10926 24819 Fixed issue with interactive jobs (nc -I) failing when run with a PRECMD that reschedules the job. This also fixes the issue of the PTY overriding the exit code from the PRECMD.
Accelerator VOV-11307 25103 Fixed a race condition in the job fostering system, which is used to properly account for jobs running on a host that has had its vovslave restarted, that could cause the foster jobs to fail and the restarted vovslave to refuse any future stop requests.
Accelerator VOV-11305 25109 Prevent a new autokill sequence from initiating if an existing sequence is already being processed. Prior to this change, an autokill sequence that took more than 5m to process would result in a new sequence starting without the existing one completing. This would result in the slave entering a looping condition that may never end.
Accelerator VOV-11296 25093 A change to the command nc hosts that displayed all reservations a slave may have was backed out due to adverse performance effects on the server. The command nc hosts will now only show the "dominant" reservation", i.e. the oldest unexpired reservation.
Accelerator, Accelerator Plus VOV-11338 25079 Fixed issues with job resource usage reporting by including detached processes with unique gpids and session ids by matching VOV_JOBID and VOV_SLAVE_PID. The VOV_JOBID to be matched will be taken from the transaction object rather than depending on the subslave environment. Also added NC_JOBID and NC_SLAVE_PID env variables so that WX and NC slaves can both correctly track processes.
Accelerator VOV-11358   Fixed an issue where running Altair Accelerator in interactive mode (nc run -I) with both the input and output redirected would result in lost output.
Accelerator VOV-11336 CS0120656, CS0120663 Fixed issue where preempted jobs may have been prematurely resumed preventing the preempting job from running.
Accelerator VOV-11180   Handled invalid values for these 3 resources: "RAM" "CORES" "SLOTS". Numeric within this range [0 - 2147483647] is allowed.
Accelerator VOV-11160   The output of nc getfield JOB CPUTIME with an uppercase "CPUTIME" will now accurately show time in milliseconds instead of 0.
Accelerator VOV-11210   Changed 'cputime' type from integer to integer64 in vovshow -fields command output
Accelerator VOV-11222   Underscores have been removed from the Node Field Names help topic to reflect the updated behavior.
Accelerator VOV-11799 CS0120663 Fixed issue where preempted jobs may have been prematurely resumed preventing the preempting job from running.
Accelerator VOV-11828 CS0120715 The output of "nc getfield JOB CPUTIME" with an uppercase "CPUTIME" will now accurately show time in milliseconds instead of 0.
2019.01 Update 5 Patch 1

Resolved Issues
The following issues were resolved in this software release:
Table 5.
Product Internal Number Case Number Description
All VOV-12236   Fixed an issue leading to the following server logs:
vovserver(3260) ERROR May 24 00:00:53 Found host with different ip: 10.102.50.114 
instead of 10.102.50.114 [host:903]
Accelerator VOV-10801 24711 Added support for keeping an interactive job's (nc run -I, -Ir, etc.) network connection with the remote job alive by periodically "pinging" the remote job. This option is enabled by setting the VOV_INTERACTIVE_PING environment variable to the desired ping interval (timespec format).
Accelerator VOV-11895 0120786 Inequality comparisons using at-delimited field names in preemption rules (e.g. "PRIORITY<@PRIORITY@") will no longer give an error message about non-numeric comparisons.
Accelerator VOV-12030 0120906, 0121020 Fixed issue that caused slaves to be killed with the message "Slave instructed to exit brutally". This also fixes server messages like "Cannot find slave rdc-cad-svr12 (illegal id 365667285) pid=32830".
Accelerator VOV-12200   Fixed issue with job arrays that are dependent upon a job or set, where the array jobs will not run due to an invalid input.

2019.01 Update 4 Release

New Features and Enhancements

The following new features and enhancements were introduced this software release:

Product Internal Number Case Number Description
All VOV-10969   The command vovclientmgr show no longer incorrectly labels clients without "nicknames" as HTTP clients. In addition, the nc command now properly sets a client nickname in all scenarios to allow it to be more easily identified in the output of both vovclientmgr show and vovshow -clients.
All VOV-10864   Added a new trace parameter, enterpriselicense.burst to enable burst licensing for NC/WX in Auto mode. 1=enabled 0=disabled, defaults to disabled.

Cleanup of various presentation errors in web UI license page when switching modes and show more details on current usage/availability for all modes.

Disabled choices for Full and N for the licensing mode in the web UI for non-NC/WX servers.

Accelerator VOV-10520   Accelerator will now support the use of burst licenses. If license file is provisioned with nc_slots_burst type licenses, Accelerator will allocate licenses first out of the base nc_slots licenses, and then allocate additional slots as required from the nc _slots_burst license pool.
Accelerator VOV-4900 21070 TIMEVAR time slot specifications are expanded to allow the second time item in the range HH:MM-HH:MM to be a prior time, as is the case when spanning an overnight time. For example, 6 PM to 6 AM may now be specified using a 24 hour clock range as follows: 18:00-6:00.
Accelerator VOV-8188 21251 Previously, when vovgetgroups timed out or did not return groups info correctly, the job would run with the incorrect groups. Following this change, under those conditions, the job will fail. Also previously, the VOV_ALARM timeout for vovgetgroups was limited to not exceed 60 seconds. The 60 second limit has been eliminated.
Accelerator VOV-10812   The show/hide cgroups link on the Slave Resources web UI page is no longer required to show the CGROUP:RAM slave resource, and therefore has been removed.
Accelerator VOV-9555   Modified vovslave log messages to be more clear and actionable.
Accelerator VOV-10905 24801 nc why output for DP jobs will omit the confusing internal DP:SLOTS_N resource and show subjob IDs and statuses
Accelerator VOV-5294   A new capability to improve job RAM and CPUTIME accounting for jobs with detached processes is implemented on Linux systems. In addition to collecting PIDs that share a PGID or are within the process tree for a job, various types of detached processes are found if they are in the same Session ID or if the VOV_JOBID environment variable matches the values for the running job.
Accelerator VOV-10795   Support for hourly charging in the cloud has been implemented. If a slave is started with the environment variable VOV_INSTANCE_LAUNCH_TS set to the launch time of the instance on which the slave is running, then the slave will be kept alive until we approach the hour-boundary to within a few minutes.

This is an unsupported feature.

Resolved Issues
The following issues were resolved in 2019.01 Update 4 software release.
Product Internal Number Case Number Description
All VOV-10999   Fixed issue that prevented the show all rows link from working on the buckets web UI page. Previously, using this link would result in an empty table as opposed to showing all available rows.
All VOV-10350   All installers/SFDs now reject installation paths that contain spaces.
All VOV-10427 24510 This ticket addressed three issues that affected crash recovery.

The first was a race condition that occurred when a vovslave connected to a restarted server. If the vovslave license authorization happened to be checked during a very small interval the result was that the vovslave was destroyed.

The second was that 'hog protection' was inadvertently applied to vovslaves during crash recovery with the result that reconnection of vovslaves after a serve restart could be delayed until crash recovery period had ended. (This compounded the first issue during crash recovery.)

The third issue was cosmetic and resulted an a Tcl stack trace if the vovserver took too long to respond while restarting. The database queries (vtk_select_loop) parameters were adjusted to lengthen the response period.

Also see the release notes for VOV-10444 for pertinent crash recovery parameters.

All VOV-10221 24265, 24417 vovserver failover recovery has been enhanced to try for the recovery on all the configured server candidates.
All VOV-9902   Prevent vovserver and child processes from exiting when Ctrl-C is pressed in the Windows command prompt from which the server was started.
All VOV-11126 24961 The description of the RAMUSED slave resource was updated for better clarity on usage.
Accelerator VOV-10833 24749 Fixed an issue with vovreconciled not revoking component resource as per the revocation delay set for summary resource, when the component resource revocation delay is not set.
Accelerator VOV-9123 23250 Accelerator issuing "stop" from the web interface is now sending the right exit signals.
Accelerator VOV-9194 24593, 24922 Interactive jobs now use the fully-qualified domain name, if available, of the submission host to ensure the execution host can find and connect to the submission host.
Accelerator VOV-9557 23850 Added new vovslavemgr stop -sick <TIMESPEC> function that can be used to forget slaves that are older than the specified timespec-based threshold.
Accelerator VOV-10028 24238
  1. In back-compat mode, increase verbosity level to 3 for message relating to using the signal list and delay obtained from the NC_STOP_* properties.
  2. Do not show <default> in the level 3 Job Control message when using properties.
  3. In that same message, add an indicator of the signal/delay origin when not <default>. Example: "delay=4 (from property)". The indicator will be one of: property, environment, or option.
Accelerator VOV-7862 21237 Correct some edge cases in the job CPU utilization graph. Phantom CPU usage spikes were being seen.
Accelerator VOV-11046 24897 Fixed issue where the vovslave would continuously log "Killing subslave with pid = <pid>" leading to eventual exhaustion of disk space. This fix requires a restart of all vovslaves.
Accelerator VOV-10393   Slaves now start automatically on Windows.
Accelerator VOV-9794   Fixed issue where the terminal appears to freeze when the output of an interactive Accelerator job (NC -Ir) is piped to tee (tee, for example, would report "tee:write error"), cat, etc.
Accelerator VOV-10108 24207 New API containerHooksRunDir is available to specify location where to run container hook scripts. The requested job running directory will be passed to Enter hook script as env(VOV_CONTAINER_JOB_RUNDIR). Please see sample files at /etc/config/containers
Accelerator VOV-10341   Added a description of the RESV_<license> job property, which is a counter of how many times <license> has been revoked by the vovreconciled daemon (if configured).
2019.01 Update 4 Patch 2

Product Internal Number Case Number Description
All VOV-12679 CS0124318, CS0128385 Fixed an issue with interactive jobs failing with the incorrect message "Client has responded to authentication request... with key ''". This happens when the client has closed the connection. This is also accompanied with the following vovslave log message "Timed out waiting for authentication request from pty server".
All VOV-12647 CS0128385 Fixed issues with interactive jobs failing with the following log message "Timed out waiting for authentication request from pty server". Also, added a new environment variable, VOV_INTERACTIVE_AUTH_TIMEOUT, to configure the authentication timeout on the vovslave.
All VOV-12316 AAP25172 Fixed an issue that caused vovserver memory to grow over time as "nc wait" and "nc run -w" commands were issued.

2019.01 Update 4 Patch 1

Resolved Issues
Product Internal Issue Description
Accelerator VOV-11347 Fixed an issue in "vovfsgroup loadconfig" where the weight and window values of the FairShare group were not getting set to the values in the config file.
Accelerator VOV-11305 Prevent a new autokill sequence from initiating if an existing sequence is already being processed. Prior to this change, an autokill sequence that took more than 5m to process would result in a new sequence starting without the existing one completing. This would result in the slave entering a looping condition that may never end.
Accelerator VOV-11307 Fixed a race condition in the job fostering system, which is used to properly account for jobs running on a host that has had its vovslave restarted, that could cause the foster jobs to fail and the restarted vovslave to refuse any future stop requests.
Accelerator VOV-11296 A change to the command nc hosts that displayed all reservations a slave may have was backed out due to adverse performance effects on the server. The command nc hosts will now only show the "dominant" reservation, i.e. the oldest unexpired reservation.

2019.01 Update 3 Release

New Features and Enhancements
The following new features and enhancements were introduced this software release:
Product Internal Number Case Number Description
All VOV-10550   Added inline documentation for the default vovcleanup config file, cleanup.config.tcl.
All VOV-6577   A new configuration parameter (liverecorder.logdir) has been added to allow the Live Recorder recording file directory to be specified. This can be used with both the vovservermgr configure and vovslavemgr configure utilities.

The default location for the server remains $SWD/../. The default location for the slave remains /tmp.

All VOV-9639, VOV-9640   Improved the functional relationship between the refresh operation and the reporting operations of the vovprocessmgr utility, improved the help text for all options that needed clarification, added warnings about timing and accuracy, and added host filter support to the refresh operation for orphans.
All VOV-9454 23741 Most of the VOV Tcl files from the installation package now contain a proper Altair copyright statement.
Accelerator VOV-10392   Added an SFD (single-file distributable) package for Accelerator. Also, the window for all SFDs can no longer be resized.
Accelerator VOV-9596 24133 Added -json and jsondoc arguments to vsy to return job information in JSON format.
Accelerator VOV-9909   Added an option (-splitstderr) to write stderr output of the job to the terminal's stderr when running interactive jobs (nc run -I). The default is to write the job's stderr output to the terminal's stdout. Note that using this option will probably result in garbled terminal output due to interleaving of stdout and stderr outputs.

This feature requires an update of the vovsh, vovslave and vovserver binaries.

Accelerator VOV-10488   Added new STATISTICS field to the SET object that contains the break-down of jobs in a set, per job state, in JSON format.
Accelerator VOV-10489   Added new fields to SERVER object:
  • INNERLOOPTIMERS - json object containing percent of time spent in various phases within the server inner loop
  • NUMBUCKETS - total number of buckets
  • SERVERSIZE - the ram size of the server data
  • SCHEDMAXEFFORT - current setting of the schedMaxEffort trace parameter
  • CYCLEFREQ - avg server cycles/sec over the previous ~10s interval
Accelerator VOV-8857 22729 Added new slave resource: OSNAME. OSNAME is the name of the OS vendor/distribution (CentOS, Ubuntu, Windows, MacOS. etc).

Changed the value for the OSVERSION resource to be more useful. OSVERSION is the common version number associated with the release, such as 18.10, 6-2.

OSVERSION can be compared with =, !=, >, <, >=, and <= . A shorthand version such as 18, will match all 18.x.y versions. 18.4 will match all 18.4.y versions.

"nc run -r osname=Ubuntu 'osversion>=6.2' 'osversion<=7.6' -- command" will run on a slave running Ubuntu versions 6.2 - 7.6 inclusive.

Accelerator VOV-10223   Added wording to lmmgr usage help detailing how to disable the guest access port.
Accelerator VOV-8925 22887 Jobs placed under the spread policy will now favor the socket with the most free memory when the cores used count is equal between all sockets. Additionally, the NUMA_LAYOUT slave property has been augmented to show the memory currently used by all jobs on the slave that requested NUMA support (e.g. "Socket: 0 RAM= 500/32089 **oooooooooo").
Accelerator VOV-9021 23089 Allow for VovUserError to be called from within a job class as a means of exiting the job class without affecting the job class web UI page or the nc jobclass command.
Accelerator VOV-4897 20586, 21030 FairShare configuration page now permits direct weight entry without resetting the window to 0.
Accelerator VOV-9364 23619 Users can now specify the NUMA-related resources MAXNUMACORES and MAXNUMACORESFREE when running jobs. The former refers to the maximum number of NUMA cores in a single node on the slave, the latter to the maximum number of free NUMA cores on any single node on the slave.
For example:
nc run -jpp pack -r+ MAXNUMACORESFREE#4
will request a job on any slave with at least 4 free NUMA cores on the same node.

In addition, the fields MAXNUMACORES and MAXNUMACORESFREE have been added to the SLAVES object for the purposes of vtk_select_loop, vovselect and related commands.

Accelerator VOV-10363 24474 Fixed requested RAM value shown by ram_usage.cgi
Accelerator VOV-10540   Licenses are now checked in upon ncmgr stop -freeze.
Accelerator VOV-10478   Metrics are now enabled by default. Added new scheduler metrics:
  • scheduler/cores/capacity
  • scheduler/cores/inuse
  • scheduler/hosts/ready
  • scheduler/hosts/full
  • scheduler/hosts/overloaded
  • scheduler/hosts/sick
  • scheduler/hosts/noslot
Internally calculate moving averages for the above metrics as well as all existing metrics:
  • scheduler/ram/capacity
  • scheduler/ram/inuse
  • scheduler/slots/capacity
  • scheduler/slots/inuse

Added new CAPACITYMETRICS field to the SERVER object that contains the latest value and moving averages for these metrics in json format.

Accelerator VOV-7408 20586 Original: FairShare configure page now displays actions for a group based on user's permission on that group.

This fix: Randomly, the fairshare.cgi web UI page would receive an error from the API that prevented it from rendering.

Accelerator VOV-9647 24675 Simplified the controls and behavior of the limit functionality of the tabular reports found in all products.

For reports that deal with a potentially high number of records, the separate "records to retrieve" control has been removed and consolidated into the table's built-in limit control. In such tables, the table no longer attempts to show the total number of rows available, nor the number of rows that were matched by the back-end query. This is done for both performance and accuracy reasons, as getting these numbers can take a significant amount of time from the back-end database.

Also fixed an issue that caused the "show all rows" link to not appear in some web UI tables that were being limited.

The behavior of this link has also changed slightly, in that when showing all rows, instead of passing the total number of rows the table is cognizant of back into the table as the new limit, a limit of 0 is passed. This triggers the table functionality to show all rows available. To prevent confusion, an empty string is displayed as the current limit value instead of 0. Before this change, the total number of rows was only current at the time the table was build, so the limit passed was often stale and still did not show all rows.

Accelerator VOV-10531   The "default" keyword is now supported for the log file specification for UNIX-based slaves that are managed by vovssd. This results in a composed log file specification of @SWD@/logs/slaves/<slaveName>/<slaveName> that will be passed into the vovslave -l option.
Accelerator VOV-9593 21346, 23449 The NC integration code will now load correctly with versions of Cadence ADE environment that have alphanumeric characters in their version strings (for example, "ICADVM18.1-64b").
Accelerator VOV-9779 23926 The vovcleanup command has been enhanced to remove old logfiles generated by vovwxd. The guiding age threshold is controlled by the new vonfig(cleanup,vovwxd) setting in the cleanup.config.tcl configuration file. Default value is 30d. This value can be changed by modifying (or adding, as necessary) the value in the cleanup.config.tcl file in the project's server working directory. For example set config(cleanup,vovwxd) 15d.
Accelerator VOV-6173 21118 Reservation by negation for vovslave is now accepted. For example:
nc cmd vovslavemgr reserve -jobclass \!normal -duration 2h a003
Resolved Issues
The following issues were resolved in this software release:
Product Internal Number Case Number Description
All   24362 Fixed a bug when vovserver incorrectly reports about license expiration for perpetual license.
All   23689 Added a new time value trace parameter: preemption.rule.cooldown. Preemption rules that are disabled due to exceeding the max preemption processing time (preemption.max.time.rule) will observe a cooldown period equal to the value specified by preemption.rule.cooldown before being automatically re-enabled. The minimum value is 0 which indicates that an alert should be generated but the rule will not be disabled. The default and max value is 1y = off (rules will not be automatically re-enabled).
All VOV-10317   Fixed an error that corrupted non-default values for the allowUidForSecurityFile parameter, which in turn could prevent the vovserver from honoring settings in the security file.
All VOV-10665   Fixed an issue in the deprecated vtk_resourcemap_reserve Tcl command that caused an error when attempting to cancel existing reservations by updating the duration to 0 seconds.
All VOV-10239   Simplified switches to avoid user confusion.
All VOV-10726   Fixed an issue that caused the "show all rows" link to not appear in some web UI tables that were being limited. The behavior of this link has also changed slightly, in that when showing all rows, instead of passing the total number of rows the table is cognizant of back into the table as the new limit, a limit of 0 is passed. This triggers the table functionality to show all rows available. To prevent confusion, an empty string is displayed as the current limit value instead of 0. Before this change, the total number of rows was only current at the time the table was build, so the limit passed was often stale and still did not show all rows.
Accelerator VOV-10119   A low swap space alert will no longer be generated for servers with 0 total swap or total swap that is lower than the threshold.
Accelerator VOV-10403   A separate Windows user name is no longer required for user impersonation in Windows slaves. It is still required, however, if the Windows user name is different from the user name obtained from the job submission, as might be the case in a cross-platform submission environment.
Accelerator VOV-9883   AAP command line tools no longer throw errors when the output is piped through another command line program that exits before allowing the input command to finish, such as the UNIX "head" command.
Accelerator VOV-10072   An alert will be generated if all jobs on a host are stuck (consuming no CPU time). Optional alerts are available for setting a threshold for creating alerts based on a number of jobs being stuck per host, tool, or user. See the Jobstuck Procedure of vovnotifyd.
Accelerator VOV-10142 24314, 24325 An alert will now be generated if the process spawned to save the PR file fails.
Accelerator VOV-9482   Enhanced stripTclComments filter to retain repository IDs
Accelerator VOV-10784 24660 Fixed an issue with vovreconciled not revoking sum resources when the component resource that was revoked contains zeros in its name.
Accelerator VOV-10748   Fixed an undesired growth in memory consumption for jobs that utilize license-based resources.
Accelerator VOV-10401   Fixed issue that prevented the vovauxpasswd utility working in the Windows command prompt.
Accelerator VOV-10412   Fixed numerous issues regarding user impersonation on Windows. Jobs should now execute correctly as the target user on slave and job output should be available in the log file.
Accelerator VOV-10217   Fixed an issue with "default" slavelist.
Accelerator VOV-10410   Fixed issue with SNAPSHOT environment that caused UNIX-style path delimiters (colons) to be used on Windows instead of semi-colons for the VOVDIR/bat, VOVDIR/bin, and VOVDIR/scripts segments of the PATH environment variable.
Accelerator VOV-10781 24680 Fixes a crash related to nc info, vtk_explain_status and related commands when querying a job with pending dependencies (e.g. if the job was run with nc run -dep).
Accelerator VOV-7570, VOV-9885 20881, 24152 If $NC_LOGDIR is a non-existent directory, nc run failed in prior releases. This change causes nc run to create the $NC_LOGDIR directory if it does not exist and successfully launch the job.
Accelerator VOV-10616 24589 In jobclass editor, the autokill field is now free-form.
Accelerator VOV-10914 20586 Randomly, the fairshare.cgi web UI page would receive an error from the API that prevented it from rendering
Accelerator VOV-8980, VOV-9841 22986 Removed appearance of DOS newline characters (^M) when the output of nc run -I is piped.
Accelerator VOV-9877   Show the Accelerator-specific job status (eg Running) instead of the FlowTracer-specific status (RETRACING) in the UIs for all products other than FlowTracer.
Accelerator VOV-10356   Stderr will now be included in the output log for default windows jobs.
Accelerator VOV-9619   The network port opened on the submission host when running an interactive job now authenticates the incoming connection to prevent accidental connections, such as those made by port scanners, from causing the interactive job to fail.
Accelerator VOV-10408   The user profile directory is now created upon starting a job on a Windows slave if the directory does not already exist on the slave host.
Accelerator VOV-10107   Updated minMemFreeOnServer and minSwapFreeOnServer configuration parameter descriptions with the unit of measure, which is MB.
Accelerator VOV-10792 24689 Fixed issue with vovresourced failing to start when vtk_flexlm_monitor -tags { } is used with more than one tag.
Accelerator VOV-10831   A couple of issues with running Accelerator in interactive mode (nc run -I) have been fixed. The first issue would occasionally result in the initial output of the job being lost. The second issue would, in rare cases, cause interactive mode to time out and fail.
Accelerator VOV-10989 24514 Post execution script handling is changed for the case where a job body script terminates due to a signal or nc stop. In these cases, the post-execution script specified by -post on the nc run command will run to completion. A second signal or nc stop executed while the post-execution script runs can be used to terminate it early if that is desired.
2019.01 Update 3 Patch 2

Resolved Issues

The following issues were resolved in this software release:

Product Case Number Internal Number Description
Accelerator 24897 VOV-11046 Jobs are failing with fork errno 0, filling up the disk with error logs.
2019.01 Update 3 Patch 1

Resolved Issues
The following issues were resolved in this software release:
Product Internal Issue Description
FlowTracer VOV-10990 "Generic Error" in vovlsfd log file

2019.01 Update 2 Release

New Features and Enhancements
The following new features and enhancements were introduced this software release:
Product Issue Number Case Number Description
All VOV-10021   The Bookshelf page has been simplified for easier reading.
All VOV-9742   The new vovservermgr command is added for system administrators.

This command has several subcommands. Three of the subcommands provide an easier syntax to set vovserver configuration and environment variables than vtk_server_config, et al. Other subcommands interface to memory chunking and scheduler tuning controls.

Accelerator VOV-7248 20366 Added the BUCKETTIME field to the JOBS object for vovselect, vtk_select_loop and related commands. This field indicates the timestamp at which the job was most recently assigned to a bucket. For jobs that are not yet assigned to buckets, this field will be 0.
Accelerator VOV-8994 22987 The Accelerator administrator may now restrict access to the nc modify subcommand. If a VovUserGroup named "System:jobmodify" exists, only USERS who are in that group may modify jobs.
Accelerator VOV-9535   There will be two additional trace parameters visible: preemption.load and preemption.module. Both may be set in the policy file or via the vovservermgr config command.

preemption.load - a value of 1 will cause the preemption module to be loaded, a value of zero will cause it to be unloaded. During startup the value indicates the desired state, afterwards it reflects the current loaded/unloaded state. Setting the value to 0 in the policy.tcl file will disable the loading of the preemption module on startup.

preemption.module - indicates the preemption module to be loaded. The default (and only currently available) value is libpreemption.so.

In the future these parameters will allow us to unload preemption and load a new version without taking down vovserver.

Resolved Issues
The following issues were resolved in 2019.01 Update 2 software release:
Product Internal Number Case Number Description
All VOV-9998   Eliminated the appearance of errors similar to the following in the logs:
7,275 vovserver(19661) 
ERROR May 20 15:59:55 Ending a visit that is already ended (searchId=81179) [object:411] 
7,277 vovserver(19661) 
ERROR May 20 16:14:01 Nested visit with searchId=82117 [object:388] 
All VOV-10103   Fixed issue in calculating the total memory used by jobs, where the reported amount did not take into account memory that is shared between multiple processes in a job's process hierarchy.
All VOV-9827   Fixed a bug in the handling of resource names that were the same as the beginning of a slave name. For example a slave named "foo1" with a resource "foo" would see its "foo1" resource disappear and be replaced by a second "foo" resource. Both the disappearing slave name and the resource duplication have been fixed.
All VOV-10003   Fixed a bug involving missing field names in result arrays returned by vtk_select_get when specifying parameterized fields, e.g. properties.
All VOV-10071   Relaxed the restrictions around the user-specified slave health check script.
All VOV-9852   vovserver will not start any configured daemon as part of the server start up process; this task will be handled by the auto start scripts.

One exception to this behavior is vovnginxd; the vovserver will explicitly start vovnginxd if the web port is changed from zero to a valid port while the server is running.

Accelerator VOV-9895   Added documentation page in the Accelerator Admin Guide for job fostering and improved command help.
Accelerator VOV-9128 23263 Jobs will have exit status 1 if for any reason they cannot be started on vovslave.
Accelerator VOV-9996 24208 Fix docs to correctly describe user-provided vovslave health script
Accelerator VOV-10100 24275 Correct the job NUMA placement doc in the Accelerator Manuals.
Accelerator VOV-9879   Changed slave error logging messages to make them clearer and more user-friendly.
Accelerator VOV-8837   Corrected job why property to display the actual time taken on start.
Accelerator VOV-9806   Extras and consumables are no longer listed twice in vovslavemgr show -resourceonly output.
Accelerator VOV-9679   Fixed issue where TIMEVAR procedures defined in job classes were not being re-evaluated when needed.
Accelerator VOV-9511   Fixed issue where slaves in 'ready' state were shown as 'overloaded' on vovslavemgr update.
Accelerator VOV-9198   Fixed issue with the "-O @CLOCK@" output format option to nc hosts where the leading digit of the clock was being truncated.
Accelerator VOV-10104 24273 Fixed the issue where vovreconciled continues to reconcile (or vice versa) even after "Handle-job matching" is turned off in Allocator.
Accelerator VOV-9881   Improved messaging from nc stop command when signals are provided via environment variables or when using the default stop signal cascade. In addition, CHECK is now recognized as a valid signal.
Accelerator VOV-9908   Accelerator's vnc_policy.tcl file will be applied to Accelerator jobs coming from FlowTracer scripted indirect slaves.
Accelerator VOV-9768   The default vovresourced configuration file no longer assumes Monitor to be available, preventing unwanted alerts concerning Monitor being unreachable in environments where it is not being used or has not yet been configured.

Additionally, the call to vtk_flexlm_monitor_all is now disabled by default to prevent the creation of Accelerator resources for all licenses reported by Monitor. For performance reasons, the use of vtk_flexlm_monitor_all is only recommended in environments where a small total number of licenses are reported by Monitor, or when the majority of the licenses reported are actually requested by Accelerator jobs as a resource requirement. In all other cases, it is recommended to individually choose which licenses will be used to create Accelerator resources by calling vtk_flexlm_monitor for each license.

2019.01 Update 1 Release

New Features and Enhancements
The following new features and enhancements were introduced this software release:
Product Internal Number Case Number Description
All VOV-9448   Cached queries created via vtk_select_create or related commands can no longer expire while the results are still being processed.
All VOV-8080 22230 Startup scripts for use with systemd are now included in $VOVDIR/etc/boot.
Accelerator VOV-7248 20366 Added the BUCKETTIME field to the JOBS object for vovselect, vtk_select_loop and related commands. This field indicates the time in seconds that a job spent after being assigned to a bucket, but before being dispatched to a slave. For jobs that are not yet assigned to buckets, this field will be 0.
Accelerator VOV-9588   Added IPv6 support.
Resolved Issues
The following issues were resolved in 2019.01 Update 1 software release:
Product Internal Number Case Number Description
All VOV-10040   This fixes some issues with vov services on Windows not listening to both IPV4 and IPV6 network ports. Without this fix, the user may experience hangs while starting and stopping Monitor or other projects, and with nginx being unresponsive.
All VOV-9771   Fixed an issue where all field names specified by vtk_select_create and related commands would be converted to upper-case in the returned Tcl array. The field names will now be returned with the same capitalization as requested by the user.
All VOV-9749   Changed the behavior of the stop button in the vovssd SFD to stop the vovssd daemon and all child processes to prevent orphaned child processes from being present after exiting the SFD.
Accelerator VOV-9511 23730, 23867 Fixed issue where slaves in 'ready' state were shown as 'overloaded' on vovslavemgr update
Accelerator VOV-9681 23977 Fixed issues with the vovssd SFD (single-file distributable) that prevented it from working properly as a Windows service. Slaves are now stopped when vovssd is stopped from within the SFD GUI, the SFD is exited entirely, or its Windows service is stopped. This change prevents the situation where orphan slave processes are left behind, preventing the SFD from extracting into the same directory as before, which would be the case in a Windows service.
Accelerator VOV-9051 23098 Changed vovresourced to use more efficient data URLs for querying Monitor for data.
Accelerator VOV-8928   The parameter schedSkip has been replaced with schedMaxEffort. The documentation has been updated to reflect this change.
Accelerator VOV-9184   Fixed a bug that was causing performance problems due to generation of redundant NODESET events when submitting jobs nc run -fb 1 -f <filename>.
Accelerator VOV-9170   Metrics system no longer requires a license.
Accelerator     Changed internal representation of filesystem sizes to accommodate large filesystems. Customers with TMP filesystems up to 9x10^24 bytes should see correct values for the TMP# resource

2016.09 Release Notes

New Features and Enhancements

Products Internal Number Case Number Description
All 6258   There is now a Python interface for the vovserver REST API. The Python interface returns a JSON formatted string as a result of posted query. For more information about request URLs and response format please refer to REST API documentation. As vovserver requires the user to be authenticated, the Python interface will automatically prompt the user for username and password if the current login session is expired.
Example scripts:
from common.scripts.rtda_rest import RTDARest
rtda_client = RTDARest()
obj = rtda_client.getUrl(&http://localhost:11913/api/v1/jobs/12345/status&)
print (&Return type is&, type(obj)) print (&Return val: &, obj)
All 2272   Selection rule syntax now supports several addtional features.

Logical "or" operations in selection rules are available via the | operator.

Logical operations may be grouped with parentheses.

AND operations always take precedence over OR operations in the absence of parentheses.

The words and, or and not can now be used in place of &, | and ! respectively; this makes shell-level scripting easier. These words are case-insensitive; they may also be typed as AND, OR and NOT.

The = operator is now supported, which is equivalent to the == operator.

Spaces are now allowed in non-quoted string values if they are preceded by a backslash '\' character. Example: command^sleep\ 60 is now a valid rule.

All 4218   Improved protocol for more compact packing.

To send 16-bit integer, packing size is changed from 8 bytes to 3 bytes. to send 32-bit integer, packing size is changed from 8 bytes to 5 bytes.

To send 64bit integer, packing size is changed from 16 bytes to 10 bytes.

By removing 4-byte alignment packing and reducing data type encoding method, encoding of string and double types is also more efficient.

All 4980   Daily log files generated by system daemons are now automatically compressed.
Note: This applies only to non-Windows platforms.
All 5349   Vovselect now supports the '*' wildcard to signify all fields of a particular object.
Note: Ensure the '*' character is quoted as required by your shell. Example: vovselect '*' from jobs where idint==12345
All 5595 13053 The increased web server security now prevents local file inclusion via a path relative to an open URL, such as /gif.
All 5935   Added waitreasons as from source to vovselect, which provides access to the data available in vovshow -waitreasons.
All 5954   Jobs submitted to NetworkComputer by the administrator can no longer be accidentally dispatched to the VOV database support slave.
All 5985   Added coresused and corestotal fields to vovselect when querying slaves.
All 6071   The interface to the vovversion shell command has been changed.
The new command interface:
% vovversion -h 
vovversion: Usage Message
 DESCRIPTION: 
 Print the VOV version. Optionally show more details.
USAGE:
 vovversion [options] 
OPTIONS: 
 -help -- Print this message 
 -all -- Print all available version information 
 -clients -- List clients and their version numbers 
 -install -- Print the VOV version with patch names, if any 
 -patches -- Print patch names, if any 
 -project -- Print the version of vovserver associated with the current project [Can abbreviate options provided they are unambiguous]
All 6107   Added timestamp for FILE type and start and end time for JOB job in vovconsole Navigator. This allows users to easily see the dependencies of a timestamp.
All 6119   For easier editing, Cut/Copy/Paste menus have been added to Text fields
All 6168   Added support for doing daily maintenance on the VOV database, which is configurable through the database administration web interface.
All 6196   A new configuration parameter, liverecorder.logsize, now controls the size of Live Recording log files.
The parameter, liverecorder.logsize, can be added to the policy.tcl file, or set on-demand for the server via:
 vovsh -x &vtk_server_config liverecorder.logsize N

or for the slave via: vovslavemgr configure -liverecorder.logsize N SLAVENAME where N is any number between 256 and 8192, which represents the size of the log file in megabytes. This setting must be applied before turning Live Recorder on.

All 6400   The protection against deleting a resource when a job is queued against is now improved; it no longer slows the vovresourced-based expiration extender.
LicenseMonitor 5394   Integrated licensing capabilities now support server lists. This feature is useful for unlicensed tool wrapping.
NetworkComputer 5766   New features have been added to the vovprocessmgr utility for finding processes that are descendents of vovslave, orphans of vovslave and external processes. There is an option to create foster jobs for discovered orphans; orphans can now be accounted for by a slave on the same host, and tracked for the rest of their lifetime.
NetworkComputer 5904   On Linux slaves, a job can now be requested to run in one or more cgroups.The syntax is similar to requesting any other resource, with the resource name consisting of the prefix CGROUP: followed by the path to the cgroup on the filesystem.
Example, to use /sys/fs/cgroup/cpuset/my_cgroup1 /sys/fs/cgroup/memory/my_cgroup2:
nc run -r CGROUP:/sys/fs/cgroup/cpuset/my_cgroup1 -r
                  CGROUP:/sys/fs/cgroup/memory/my_cgroup2 -- sleep 120

will assign a "sleep 120" job to the cgroups my_cgroup1 and my_cgroup2

If the user specifies multiple conflicting cgroups (such as 2 cgroups under the /memory hierarchy), the cgroup that is specified last is the one that the process will be assigned to.

The special resource CGROUP:RAM can be used to limit memory usage of a job within a cgroup.

Example:

nc run -r CGROUP:RAM -r RAM/2000 -- sleep 120 will assign the job to a default cgroup and limit that cgroup to 2000 megabytes of RAM. Since we only place one job in each default cgroup, we can effectively limit RAM usage on a per-job level. The path to this default cgroup will be:
<path to cgroup root directory >/memory/ <queue name>_ <slave name>_ <job slot number>
Note: CGROUP:RAM cannot be used with a non-default cgroup; if both CGROUP:RAM and a non-default cgroup are specified, the job will be placed in the specified cgroup without changing that cgroup's RAM usage limit. We strongly recommend specifying a RAM resource when using CGROUP:RAM, as the default value is low (currently 20 megabytes).

To see slave resource for cgroups, use nc hosts -r and look for CGROUP: entries. If they are not present, ensure that they are set up on the slaves with lscgroups and check the slave logs for errors.

NetworkComputer 5909   The new dynamic server tuning feature enhances performance when the server is under heavy load conditions: a unique maximum value is set per bucket.
NetworkComputer 5080   Slaves now account for jobs running on a stopped slave on the same host. When a slave is started, if there is a matching slave in the stopped condition (waiting on its jobs to finish), the new slave will adopt any jobs on the stopped slave by using foster jobs. This feature helps prevent host overloading.
NetworkComputer 5472 12544 There is a new server configuration parameter for statistics: cpuprogressWindowSize.

This parameter can be set in the policy.tcl configuration file.

The default value (1) provides identical behavior to previous releases. Increasing the parameter increases the number of samples to be used in the calculation of the cpuprogress statistic.

The accepted range of values is 1-1440. 1440 signifies one full day, assuming the sample time is 60 seconds.

NetworkComputer 5689   The fields REQCORES, REQCPUS, REQPERCENT, REQRAM, REQSLOTS, and REQSWAP are now valid on SCHEDULED jobs. This can be useful in writing -preempting clauses on preemption rules.
NetworkComputer 6144 14414 Validation is improved with two new options and a new health check.

Summary:

The new -failover option to vtk_slave_define now sets the capacity to 0, which prevents the slave from accepting jobs and pulling a license. This also acts as a flag to perform some failover configuration testing, such as checking servercandidates.tcl to ensure the slave host is in the list.

Passing in the new -descriptors as well as the -failover option to vovslavestartup triggers a check to ensure the host has at least as many descriptors as vovserver, which ensures it can operate at full capacity in the event of failover. It also checks that the server_election directory is empty.

In addition, a new health check ensures that a slave is running on each host listed in the servercandidates.tcl file. By default, the health check is on; it can be disabled in the health check configuration of the web UI.
Note: The health check features and options are listed on the health check configuration of the web UI.
NetworkComputer 6195   Update calls to ftlm_lmproject from vw every 20m. The period is controlled by the variable VOV_VW_PING.
NetworkComputer 6204   The PERCENT slave resource can now be configured in the slave resource specification. This is specified in consumable form, such as PERCENT/50. Previously, PERCENT was always initialized to 100 and could not be changed.
  6312   New parameter resuserDisableMatchingThreshold allows matching to be disabled for license resources from a license server: in such cases, the matching process can take a very long time.

resuserDisableMatchingThreshold provides a threshold that disables matching if the sum of LicenseMonitor handles and FlowTracer jobs exceed that threshold.

The range is 0 - 10000; the default value is 1000.

This parameter can be specifed in policy.tcl.

Example:
set config(resuserDisableMatchingThreshold) 1000

This parameter can also be set using the VTK API.

Example:
vtk_server_config resuserDisableMatchingThreshold 1000
NetworkComputer 6722 14704 The vovserver now responds to multiple incoming client connections during the same cycle.
This capability is enabled by default, and can be controlled with the following congfiguration parameters via the policy.tcl file:
set config(clientConnectionQueue.mode) multi
set config(clientConnectionQueue.size) 512

The mode can be "single" or "multi" (default=multi). The size can be any integer in the range of 25-1024 (default=512).

Workload Accelerator 6558   The ncmgr and wxmgr utilities now check for a minimum number of file descriptors on Unix-based platforms.

Summary:

The default value can be over-riden with the VOV_MIN_DESCRIPTORS environment variable.

An attempt is made to raise the default limit if needed.

If the target's limit is higher than the system's hard limit, an error is displayed and the product start will be aborted.

LicenseAllocator 5311 12150 License resources can now be allocated from multiple servers or excluded.

Summary:

Resource groups can be defined that are a logical OR of components that are hosted on different license servers.

Licenses from specific servers can be excluded from distribution to specific sites.

FlowTracer 6358   A SNAPSHOT of the server environment is now automatically captured on FlowTracer vovproject start. This new feature enables users to run a job in the same environment as the server. A shell environment snapshot can also be saved on demand from any project enabled shell.

Summary:

The automatically captured environment is saved as a file under the server working directory, which can be used to set the SNAPSHOT environment at any time.

The new standalone utility vovenvcapture is for saving the environment on demand, either into a file or as a property on a Vov object.

FlowTracer 4879   A periodic job can be paused by setting a property "PERIOD_PAUSE" as 1. This is done through web interface, vovconsole NodeEditor, vtk_prop_set tcl function, or vovprop cli. If a paused job is running when this property is set, the current run will continue to completion, but a new run will not be started at the next period; it will remain paused until PERIOD_PAUSE property is reset to 0.
FlowTracer 5428   If EventOverflow occurs from server, vovconsole will empty all events, wait, reconnect and refresh all set viewers. When this occurs, a message is displayed: Updating ... please wait
FlowTracer 5617 13082 Upcone Set and Downcone set menus are now added to the popup menu of vovconsole when a set is selected. If no node is selected, Connectivity popup menus of Downcone, Upcone and Expand will apply to the current set displayed in the SetViewer.
FlowTracer 6018   Details about the condition of a schedule job that is stuck due to barrier invalid are now available.
Summary:
When a job is stuck due to barrier invalid node in upcone, a balloon message appears and shows "barrier invalid".
On NodeEditor, the "Why" tab shows barrier invalid messages on the associated node(s).
Trouble-shooting methods:
(1) An aggressive retrace flag can be done in the main console window or by using vsr.
(2) Find the invalid jobs in the upcone and retrace them to validate the barrier.
If there are no invalid jobs in the upcone, fix the barrier invalid flag by running sanity.
FlowTracer 6122   The Navigator column now sorts per the severity of the node status.
FlowTracer 6156   SERIAL and PARALLEL commands in FDL now work with any level of nesting: you can nest parallel in serial and vice versa, for as many levels deep as desired. SERIAL and PARALLEL commands now work with S (set), T (task), and J (job) commands, which can be nested under one another in any order.
FlowTracer 6440   vovlsfd now utilizes an agent script (vov_lsf_agent) in the installation to launch vovslave in the batch system. Previuosly, shell scripts had to be written to the launchers/<hourly sub-directory>.
FlowTracer 6571 15109 The vw command now has the option to apply a delay that allows latency on outputs as well as inputs.

Previously, if the environment variable VOV_VW_NFS_PROTECTION was set to 1 (or if the -n option is used with vw), vw added a delay when waiting for inputs to allow for Network File System (NFS) latency. By default, the wait duration was 60 seconds that could be overridden by setting the environment variable VOV_VW_NFS_WAIT.

In this release, in addition to the above behavior, if VOV_VW_NFS_PROTECTION is specified as 2 (or the -N option is used), vw also applies the same delay to allow for latency on outputs.

Overview:
Under certain conditions, for both inputs and outputs, vw will attempt to trigger a cache update sooner if possible. Note: for this to work, nfsdelay must be enabled in the vovserver settings in the policy.tcl file, and also applied to any currently running vovserver (by doing vovproject sanity). Example: set config(nfsdelay) 60
It is also recommended to specify a small time tolerance in policy.tcl to compensate for minor time discrepancies among machines and/or the filer. Example: set config(timeTolerance) 3
Note: This should also be applied to any currently running vovserver using vovproject sanity.

Resolved Issues

Products Internal Number Case Number Description
All 3120 13056 Increased web server security, which prevents cross-site scripting.
All 6132   Added arch field support for slave objects to VovQuery: vovselect, vtk_select.
All 6193   An issue has been fixed with field-only selection rules for string fields. By passing the field name without an operator or a value, you can now query for objects that have a non-empty value in the specified field. For integer fields, this form of selection rule queries for objects that have a non-zero value in the specified field.
All 6253   Scrolling has been added to the alerts dialog, which makes long lists of alerts easier to access and consumes less space on the monitor.
All 6394   Viewing the files of sets has been made easier:

Summary:

For default sets that only contain files, such as System:files, System:filesToCheck, System:zippable, Predefined:missing files, and Predefined:blocking files, files are always displayed regardless of the setting of the flag.

However, for user defined sets, Show files must be turned on to view the files.

LicenseMonitor 6103 13577 New format is now recognized, and added support for additional date format in Feature line.
LicenseMonitor 6268   Detailed plots are now reasier to read. The visibility of plot lines was increased for checkouts and queued requests with shorter durations compared to the report time range.
NetworkComputer 5253   The vovfsgroup create command now copies the parent ACL when creating a subgroup. Example: vovfsgroup create /abc/def will create a new group /abc/def, with ACL permissions copied from the group /abc. If there is no applicable parent group, the default ACLs will be used.
NetworkComputer 5699 14154 fairshare.cgi now takes fstokens into account. Previously, fstokens were ignored.
NetworkComputer 5715 13420 The autokill function now takes suspension time into account when determining if a job has exceeded the autokill time threshold.
NetworkComputer 5993 13705 License resources are no longer overbooked. Previously, the configured threshold was honored the first time the overbooking procedure was called for a specific feature.
NetworkComputer 6125 13164 vovset list can now display more than 600k sets.
NetworkComputer 6617   The failover process now works correctly.
Note: It can take over 2 minutes for the failover process to complete.
NetworkComputer 6618   The error message for a specific port issue is better handled: where ncmgr is being used to start or stop an NetworkComputer instance, but the specified port matches that of an NetworkComputer instance other than the one being controlled.
NetworkComputer 6619 15159 The DISPLAY environemnt is unset during the failover process, which prevents the process from failing due to an invalid value.
LicenseAllocator 6414   ADMIN permission is no longer needed to read LicenseMonitor or NetworkComputer data. That data is now accessible by all USERs. However, updating the NetworkComputer instances with new allocations still requires ADMIN privileges.
LicenseAllocator 6601 12150 The minimum number of tokens to run any queued job on a site are allocated when possible. This ensures jobs will be run.
LicenseAllocator 6752 15243 When a new resource is declared in the LA config.tcl file, restarting LicenseAllocator or vovlad is no longer needed to get the resource tokens.
LicenseAllocator 6753 15245 LicenseAllocator now uses the expression specified through LA::SetMapForResourceInSite to set the resource map expression in NetworkComputer; components are in the resource map in the correct order.
LicenseAllocator 6756 15244 LicenseAllocator now sets the weight of the components of a resource group from the weight of the resource group, unless the weight of the component has been set explicitly.
FlowTracer 4826 11903 When a new set is created in the Set Browser, the sets can now be placed in alphabetical order: right-click any node and select the Update and Sort option. Previously, new sets were appended to the bottom, with no option to update the order.
FlowTracer 4901 10461 vovproject enable now returns a non-zero exit when it fails.
FlowTracer 4945   Detailed information of "why" is now provided if a job fails with no outputs.
FlowTracer 5198   Improved Slave Monitor: Slave LED Monitor on vovconsole and Floating Slave Monitor.
Summary:
When there are many slaves, Slave Monitor will not occupy the vovconsole area and show only one row and rest of them can be seen at floating Slave Monitor
If there are more than 20 slaves, grouping options are displayed.
Users can group slaves by state, arch, group(=slavegroup)
FlowTracer 5231 3186 4562 5409 Predefined sets on the console can now be deleted or modified; they are not permanently deleted.
Summary of the changes:
After a predefined set is deleted, the vovconsole keeps the set names.
When the predefined sets are double-clicked, the predefined sets are recreated: the predefined set is recomputed and the up-to-date set is drawn on the set viewer.
FlowTracer 5424   Manually overriding vovslave cores and capacity is now improved.
Summary of features:
By default, the capacity follows the core count.
The capacity can be manually set via the -T option, or by defining the SLOTS/N consumable resource via the -r option. N is a positive integer.
In all cases, the capacity directly affects the number of slot licenses that will be requested.
FlowTracer 5537   Sticky attachment attributes on inputs are now preserved when creating job arrays. Previously, sticky attributes were not copied for inputs.
FlowTracer 6101 14221 vovconsole no longer crashes when running in READ-ONLY mode. Menus and buttons that are not applicable for read-only security are now disabled.
FlowTracer 6129 13269 A race condition that occurred between resource reservations and resource grabbing by a job has been fixed. Previously, this occurred when a job was dispatched by the server and resource was reserved between the time of the dispatch and the time the job started running on a slave.
FlowTracer 6262   The Disconnect popup menu of the Navigator is now fixed. Previously, error messages were displayed.
FlowTracer 6364   In vovconsole, jobs in a set are no longer deleted when the set is flattened then rebuilt from FDL.
FlowTracer 6438   The following commands that were previously deprecated are no longer available: vovstopjobs vovleader -K retraces
FlowTracer 6668   The numbers of projects for other users shown in registry.cgi page has been corrected.
FlowTracer 6683   Files no longer appear in sets that should not contain files such as "Predefined:stuff to do" and "System:jobs".

2016.09 Update 20

New Features and Enhancements
Products Internal Number Case Number Description
LicenseMonitor VOV-8169 21933 The vovdb_util 'showcfg' sub-command now shows information about DB backups, if configured.
LicenseMonitor VOV-9133 23261 The Altium log file parser now handles dates of a few new formats, such as yyyy-mm-dd, in addition to those using slash and dot as separators.
LicenseMonitor VOV-9045 23136 ftlm_parse_altiumlog now is able to parse the log line record of the new format of time log: Product Name;License Name;User;Role;Action;Action Time or format of usage log: Product Name;Activation Code;User;Role;Version;Start Time;Returned Time
NetworkComputer VOV-9075 23176 For preemption rule debugging purposes, a separate preemption log file is written if enabled. The preemption log can be turned on through server config parameters in policy.tcl. set preemption.log.verbosity to 1 or up to 10. For example, add the following to the <swd>/policy.tcl:set preemption.log.verbosity 3 Turn on debug flag for each preemption rule of interest through WebUI or preemption rule definition in <swd>/vovpreemptd/config.tcl. To enable all preemption rules debugging, add the following to <swd>/policy.tcl:set preemption.log.allrules 1 Verbosity level needs to be chosen carefully. Higher verbosity number will create many messages and may cause the log file size to grow quickly. In case the current preemption log file is removed, it can be recreated with:vovsh -x 'vtk_server_config rotate_server_preemption_log 1'

Resolved Issues
Products Internal Number Case Number Description
NetworkComputer VOV-8018 23130 Removed unnecessary "Load" button for config.tcl. config.tcl is loaded automatically by vovpreemptd daemon.
  VOV-8347 23028 The "slave.childProcessCleanup" parameter in policy.tcl no longer requires cgroups; it does still require a Linux platform. Additionally the method of process cleanup has been changed to be more thorough. This setting may cause vovslave to be less responsive in heavy usage scenarios.
  VOV-9096 23174 Fixed a bug that when slave names get updated in RESERVE_SLAVES preempt rules, previous slaves still get reserved.
  VOV-9097 23175 FREE_SLAVES rule type preempts slaves only in IDLE(READY), WORKING, or FULLLOAD.
       
  VOV-9118   vovserver doesn't check file existence with lstat for nc run -l <logfilename> .
  VOV-9025 23101 Initialize job classes in a separate vovresourced process by default.
  VOV-9026 23102 Initialize job classes in a separate vovresourced process by default.
  VOV-9036 23149 Prevent misleading message in node.cgi about the job being executed on an invalid slave when the job has been forgotten from server memory and job data is being loaded from the database.
  VOV-8923 22882 Added an environment variable, VOV_BJOBS_JOBID_WIDTH, that can be used to override the default job ID column width in the bjobs output.
  VOV-8986 22899 Fixed incorrect value of cputime field displayed at job completion.

2016.09 Update 19

New Features and Enhancements
Products Internal Number Case Number Description
All 7453 20684 The vovbrowser command now shows the value of VOV_HOST_HTTP_NAME, if this is set, in the project URL.
NetworkComputer 8973 22958 NetworkComputer now correctly generates an alert if the LicenseMonitor instance being tracked by vovresourced is not available at first contact attempt. Previously this alert would appear only if contact was made and then lost.
NetworkComputer 7108 20078 nc hosts will show the full list of reservations for a slave in the following format: U:<list of users> C:<list of job classes> G:<list of groups> etc...
NetworkComputer 7126 20116 vtk_slave_define has a new argument -expiredate that specifies the date and time after which the definition of this slave is expired, and it cannot be started with vovslavemgr command. The format of this parameter is year_month_day_hour_min_sec. Example: 2018_12_31_23_59_00
NetworkComputer 7970 21621 The job output URL protocol (http or https) matches that from the output of vovbrowser. i.e. It is determined by setting of sslenabled from the project.
NetworkComputer 8014 21705 Added CLOCKTURBO field to slave objects in VovQuery that represents the CPU turbo speed of the host.
NetworkComputer 8028 21725 The command vovslavemgr stop -force with an empty slave list now prints a warning and does not stop any vovslaves. Use with -all if you really mean it.
NetworkComputer 8996   FreeStyle preemption ruletype no longer preempts more jobs than necessary to run jobs in preempting bucket.

Resolved Issues
Products Internal Number Case Number Description
LicenseMonitor 8967 22940 Now cvs export function works properly on the following pages:
  • Admin > Tags > Server
  • Admin > Tags > Database
  • Admin > Features > Server
  • Admin > Features > Database
  • Admin > Tasks
  • Admin > Licensing (VovSlaves table)
NetworkComputer 8911 22868 Fix premature termination of nc hosts command when a slave exits during the running of the command. This may have been encountered with large or volatile slave counts and is seen when the -O format option is used. Fixed in 2016.09 Update 19 for the case of volatile slave counts.
NetworkComputer 8917   Jobs that have reached WITHDRAWN state later than preemption plan spec. are no longer left as WITHDRAWN and get rescheduled.
NetworkComputer 8925 22887 Added accountability of RAM used by jobs that are submitted with NUMA supported requested. The total RAM for each socket will be decremented by a job's RAM request when the job is bound to the socket. This allows the next job requesting NUMA support to be placed according to the RAM that is expected to be available instead of the RAM that is currently available. Additionally, the NUMA_LAYOUT slave property has been augmented to show the memory currently used by all jobs on the slave that requested NUMA support (e.g. "Socket: 0 RAM= 500/32089 oooooooooo").
NetworkComputer 8964 22922 Interactive jobs no longer consume 100% CPU when attempting to reconnect to a downed NetworkComputer queue.
NetworkComputer 4453 20991 Added sorting, filtering, and searching capabilities to legacy FairShare page.
NetworkComputer 8730 22315 Fixed an error where editing a queued interactive job in the NetworkComputer web interface could potentially cause the job to fail.
NetworkComputer 8978 22965 Removed redundant "Excluded from being declared as input" warnings on nc run -f command.
NetworkComputer 9047   The URL shown at the end of the NC startup routine now takes into account the VOV_HOST_HTTP_NAME value and whether SSL is enabled.

2016.09 Update 18

New Features and Enhancements
Products Internal Number Case Number Description
All 8766 22020 When copying the <swd>/vovnginxd/conf/nginx.conf.template file to an nginx.conf file, a message is output to the server log indicating that a copy is being made, and also specifying the source and destination files.
LicenseMonitor 8574 22192 LicenseMonitor now includes support for monitoring licenses of QF-Test software.
LicenseMonitor 8954 22878 Moved the icons for exporting png/csv up in a separate row, so chart area can be highlighted excluding those icons for sreenshots.
LicenseMonitor 8959 22897 The FLEXlm parser has been enhanced to correctly handle version 11.14+ lmstat output.
NetworkComputer 8789   The ncupgrade script has been implemented at beta level. This script allows customers to upgrade their NC installation to a different (newer) version of software without taking down the queue. The script prompts users for decisions (such as queue name, location of the new version, etc.) and is intended to perform the upgrade without losing any jobs, and without having client job submissions fail (they just take longer to register). Please provide feedback on this feature during the beta period.
NetworkComputer 7466 20598 Reservations may now be negated to specify who or what is NOT allowed to use a slave. For example a reservation of "USER !john,mary" will allow all users except "john" and "mary" to run jobs on the slave for the specified duration.
NetworkComputer 7748 21236 Generate both an event and a server log entry when a FairShare group is changed. Event example:

27 FSGROUP CHANGE 000553805 1507566658 {/test, by joe@localhost SERVER: Window 1h00m to 2h00m }

Log example:

vovserver(9805) Oct 09 13:26:20 FairShare change: /test, by joe@localhost SERVER: Window 1h00m to 2h00m
NetworkComputer 8695   Slave reservation that is created by preemption type FreeSlave is now reserving the preempted slave by bucket id. The reservation duration is 5 seconds by default and it can be configured using preempt rule field reservetime.
NetworkComputer 8735 22516 Speeds up jobclasses.cgi by showing detailed job statistics per jobclass only on demand, by clicking the basic job count initially displayed.
NetworkComputer 6199 20959 Documented the boolean jobclass variable "classEditable", which controls whether a jobclass can be edited using the jobclass web UI page. Example: set classDescription "my jobclass" set classEditable false ...rest of jobclass script...

Resolved Issues
Products Internal Number Case Number Description
All 8826 22666 Prevent vovslave from caching empty group lists returned from OS.
All 8947   The AGE field of the SETS object will now return the correct age, defined as time in seconds since the set was last updated. In addition, the LASTUPDATE field will always return a valid timestamp instead of 0 for non-smart sets.
FlowTracer 8854 22728 Fixed vovconsole crash that could occur in flow graphs when using Orthogonal/Manhattan arrows.
FlowTracer 8984 22999 Fixed bug which caused a Tcl error in node editor when the expected duration field had a time value with a leading zero, and then changes were made to the node and saved. The time spec parser was failing to trim the leading zeroes and producing an error. Now, the time spec parser does it correctly. An example time spec that could cause the error is "1h05m" due to the "05" value for the minutes.
LicenseMonitor 8133 21860 The "shift combo" was fixed in previous 2016.09 release. The test case added to ensure this error will got covered in our regression tests.
LicenseMonitor 9011 23060 Removed the double html-encoding to prevent license editing corruption.
LicenseMonitor 8834 22690 ftlm_batch_report should honor the switch with "Hide" parameter (e.g.-UtilPlotHideAverage) with empty value (default value as 1).
LicenseMonitor 8852   Configure group access to a tab should not prevent the guests view page using readonly port.
LicenseMonitor 8859 22682 Several robustness improvements to vovresourced with respect to obtaining data from LicenseMonitor:
  • Improved handling of buffer overflow condition in /raw/checkouts and /raw/vovresourced URLs when a large number of checkouts is present.
  • Improved debuggability of HTTP errors by printing errors and timeout occurrences in vovresourced log.
  • Fixed bug in LM lookup procedure when SSL is specified as true in the vovresourced configuration but LM is not running with SSL.
  • Add handling for HTTP timeout in LM lookup procedure so that 3 retries are attempted before marking LM as down.
  • Improved resource life support by entering the mode only for the affected license-based resource when the data request to LM fails only for that license.
  • Improved life support debuggability with improved messaging in the vovresourced log.
  • Improved performance of /raw/checkouts and /raw/vovresourced URLs.
  • Added configurable HTTP timeout to vovresourced configuration to accomodate heavily loaded LM instances. The setting, RESD(timeout,http), accepts timespec values and is set to 30s by default.
  • Disable vtk_flexlm_monitor_all by default for new WX installations.
LicenseMonitor 8615 22208 ftlm_parse_flexlmlic to handle nodelocked licenses (e.g. MATLAB:ID=720099) with correct capacity
NetworkComputer 7242 20265 FairShare groups now have a "flatten" setting. When a group is flattened, all of its child groups will be treated as though they were on the same level of hierarchy; that is, all leaf-level groups will be assigned weights as though they were direct children of the top-level flattened group, ignoring weights assigned to any non-leaf-level groups in between. This setting can be enabled by the vovfsgroup modify command. Note that disabling this setting for a group that has a flattened parent will have no effect; the group will still be flattened.
NetworkComputer 7806 21608 Fixed FreeSlave preemption type not to preempt slaves that are not compatible with preempting jobs in HW requests such as CORES, RAM.
NetworkComputer 8824 22655 Do not process commas as option delimiters in SNAPSHOT environment calls so that a snapshot file path can include commas.
NetworkComputer 8865 22693 Improved error messages for wrong parameters in nc preempt command.
NetworkComputer 8686   Improved the preemption rule view page to include all of the fields that are displayed in the new rule page. This mainly includes improvements to the FREE_SLAVES, RESERVE_SLAVES, and RESERVE_RESOURCES rule views.
NetworkComputer 8697 21608 Improved the performance and reliability of the FREE_SLAVES preemption rule behavior. Hosts with non-preemptable jobs can be preempted if the machine can still run preempting jobs.
NetworkComputer 8914 22857
  1. Implemented "spread" NUMA placement. Jobs using this placement type will be bound to the least-loaded sockets available on the chosen slave.
  2. Simplified NUMA usage. Requesting the "NUMA" resource is no longer required. NUMA is controlled exclusively by the -jpp option to nc run. The "NUMA" resource is no longer required to be provided by slaves.
  3. Improved NUMA visibility. Fixed issue with the NUMA field that prevented the NUMA type from being displayed with vovselect, or when using the @NUMA@ keyword in the nc list output format string. Additionally, the NUMA_AFFINITY job property has been augmented to show the placement type (e.g. "NUMA pack: 0 1 * *").
  4. Improved debug capability. Added ability to use vovslavemgr configure to control the debugnuma setting for slaves. Detailed NUMA information will be printed to the slave log when this setting is set to 1. Also added control in vovslavemgr configure for binding to entire sockets (default) or to individual cores (experimental). See vovslavemgr usage help for details.
WorkloadXelerator 8878 22773 Fixed problem with SNAPSHOT environments that caused environment variables to get added in subsequent jobs but never removed. Now it uses the correct environment from the shell that launched the job, plus the environment specific to each job. The bug impacted vovslave but not vovslaveroot.

2016.09 Update 17

New Features and Enhancements
Products Internal Number Case Number Description
All 8720 22285 Added patchlevel display and patch sorting for various options of vovversion command.
FlowTracer 8756 22556 Added 2 config parameters for vovlsfd:
VOVLSFD(bsubErrorsBlackList) specifies a list of bsub errors that will be ignored by vovlsfd.
For all other errors vovlsfd will dequeue the jobs that are causing unsuccessful bsub requeuests.
                
VOVLSFD(dequeueOnNumBsubErrors) specifies a number of sequential bsub failures that should force
vovlsfd to dequeue the corresponding FT jobs.
Default value: 2
LicenseMonitor 7006 5636 Added a pair of new configuration variables in the vovlmd configuration file to control the threshold of elapsed time without an update at which point a license server is to be considered down. That is, you can set the following in the licmon.swd/vovlmd/config.tcl file:
LSset VOVLM(downthreshold) 10m
LSset VOVLM(checkfrequency) 5m
LicenseMonitor 8743 22692 Added "Show capacity with queued" and "Show peak with queued" options in the utilization plot batch report UI and corresponding options for CLI.
NetworkComputer 8701 22294 The bjobs LSF emulation utility has been modified to support the -o option to control the job listing output format. Column width and delimiter specifiers are also supported. See bjobs usage syntax (bjobs -h) for details.
NetworkComputer 8796 22674 A new subcommand of vovslavemgr, rotatelog, creates new log files and new log directories if missing. Startup log files are not recreated.

Resolved Issues
Products Internal Number Case Number Description
All 7809 21345 Fixed issue that caused the patch status link in certain web UI pages to be rendered as plain text instead of HTML.
All 8630 22215 Improved vovnginxd robustness when reconfiguring the web port, and corrected the documentation on how to modify the web port while the server is still running.
LicenseMonitor 8580 22181 LicenseMonitor can now serve batch reports greater than 1GB in size.
LicenseMonitor 5898   License server emulate thru vtkle_feature_set can take y (year) as time unit.
LicenseMonitor 8451 22083 Remove the VIEW (and other relevant) permissions for EVERYBODY when restricting tag access to specific users.
LicenseMonitor 8846 22701 Fixed "extra switch pattern with no body" Tcl error in the vovdb_util dump function.
NetworkComputer 8805 22639 Fixed issue where the highest possible value for the maxload setting of a vovslave was silently capped at 100.0. This prevented slaves configured with more than 100 job slots from running more than 100 jobs. The new highest possible value for the maxload setting is 10000.0. Along with this change, jobs will need to be submitted with PERCENT/0, or the default minimum HW request will need to be modified so that PERCENT/0 is requested, or no PERCENT resource at all. The default minimum HW request can be configured in the SWD/policy.tcl file, using the "minhw" setting (default is shown):
set config(minhw) "RAM/20 CORES/1 SLOTS/1 PERCENT/1"
If this setting is changed, the policy.tcl file will need to be reread by the vovserver, which can be accomplished with the following command:
nc cmd vovproject reread
NetworkComputer 8075 21678 The server parameter schedskip is now obsolete. We recommend using the parameter schedMaxEffort to limit the time usage of the scheduler. See the documentation for schedMaxEffort for more details.
NetworkComputer 8244 22004 A new server configuration parameter is added to limit the maximum number of ORs in a resource expression. For example in policy.tcl:
set config(maxORsInResourceExpressions) 10

Or from the shell:
                
nc cmd vovsh -x 'vtk_server_config maxORsInResourceExpressions 10'
                
If a job is submitted and the number of ORs in its resource expression exceeds the limit, the job fails with
a message in the Why info.
                
The default value for the limit is 20.
NetworkComputer 8373 22020 Added documentation that describes vovnginxd configuration files and procedure.
Note: This ticket is composed of 2 parts. The first part has been addressed. The second part, "nginx sometimes holds up the port indefinitely when the back end NC vovserver has failed over", will be addressed in a future release.
NetworkComputer 8475 22006 Jobs with a negated soft resource containing a colon or underscore character were unexpectedly dispatched instead of queueing. This has been corrected.
NetworkComputer 7864 21442 Fixed to set correct fail codes for stopped jobs by Ext method.
NetworkComputer 8833 22689 The SOLUTION property on jobs will now use "!" to indicate negated resources rather than "!=".
NetworkComputer 8872 22713 Fixed the problem that one of preempt rules is not sorted by order. Improved error message.

2016.09 Update 16

New Features and Enhancements
Products Internal Number Case Number Description
All 8641 22235 When a job is modified, do not mark the job unsafe when the host has been changed. Also do not invalidate the job when the resources, aux resources, jobclass, job project, or FairShare group changed.
NetworkComputer 7878 21366 Added option -leaf to vovfsgroup genconfig to also include weights for the leaf nodes of the complete FairShare tree.
NetworkComputer 8644 22239 To prevent excessive server load due to too many OR clauses in resource map sums, calls to vtk_resourcemap_sum from <swd>vovresourced/config.tcl are now limited to expressions of up to 5 ORs (by default). The default limit can be adjusted by setting RESD(maxORsInResourceSum) in <swd>/vovresourced/config.tcl. If a resource sum with excessive ORs is seen, an alert is generated and an error message is output once per day in the vovresourced logfile.
NetworkComputer 8749   A new option, -writeprdir <directory_path> has been added to the ncmgr stop command. This permits the PR file to be written, uncompressed, to the specified directory instead of the trace.db directory upon server shutdown.

Resolved Issues
Products Internal Number Case Number Description
All 8658 22287 Fixed issue that randomly caused a "page too large" error to be displayed for certain pages in the web UI that contain a significant amount of data.
FlowTracer 8709   Fixed vovfileready bug that existed in 2016.09u12-2016.0u15. The bug caused the vovfileready job to become invalid instead of becoming valid and running the downcone immediately. Now, it behaves correctly. This is a 1-line Tcl change and is a trivial patch to any release.
FlowTracer 8626   An alert for not enough file descriptors from vovlsfd.tcl was confusing users and has been removed. Warnings about this are still issued in the logfile.
FlowTracer 8773   Fixed issue in Tcl procedures that attempt to validate nodes (waive exit code, force run, force validate). These procedures were broken in recent 2016.09 versions (validation did not always occur and error messages were seen). This fix allows these functions to successfully validate the requested node, which also includes the output of jobs.
LicenseMonitor 8162 21794 Made sure the data series has the same order as the legend series so that there is no mismatch between them.
LicenseMonitor 8610   Use bold font for the label and legend in pie charts, and use bold font for tick and legend in bar/histogram chart. This change should provide improved look for presentation views.
LicenseMonitor 8611   Adopted a new pie chart color selection algorithm that produces a range of lighter colors that look good with black fonts on top of them in most cases.
LicenseMonitor 8209   Fixed issue that prevented the lmmgr reset function from working.
LicenseMonitor 8571   Make sure correct where clause for the SQL is used when retrieve data from database to produce reports.
NetworkComputer 8492 22142 Fixed issue with vovslaveroot reconnecting after a 30+ minute network interruption. Now, the slaves will properly reconnect and recover the jobs when the network connection to the server is re-established even after a delay of over 30 minutes. Fixed issue with vovslaveroot when the process exits with an error to make sure that the message gets logged (previously the "fatal error" message was not shown or logged anywhere). Fixed issue with the sleep time between reconnection attempts to be accurate based on wall clock time. Previously, when a vovslave was reconnecting, the timeout would sometimes be very far off from the requested wall clock time.
NetworkComputer 8739 22515 In certain cases, when a job finishes at the same time as an internal alarm, a job can hang, or be auto-killed, so appears to fail. This problem has been resolved.
NetworkComputer 8753 22291 Fixed error in jobclass.cgi, where a jobclass using the variable VOV_JOB_DESC(jobclass) would trigger a Tcl error.
NetworkComputer 7343 20113 When there is an error opening, writing, or closing a dailylog file, such as the log used for vovresourced, catch the error and report it. The error message, including the decoded string from C++, will get printed, as well as the message that was being written. In addition, generate an alert when such an error occurs with the same information (except for the message being written).
NetworkComputer 8744 22529 Format the value of the WX_BUCKET_LINK property when viewed from within node.cgi.
NetworkComputer 7562 20821 Made some fixes to input and output declarations to address "Server is operating on a non-internal object" error.
NetworkComputer 7911 21526 The command vovslavemgr restart now behaves identically to vovslavemgr stop followed by vovslavemgr start, allowing vovslavemgr restart to properly restart busy slaves.
NetworkComputer 8623 22166
  • vtk_fsgroup_update is added. Calling this function triggers the server to update FairShare statistics such as target share, actual share, excess share.
  • FairShare statistics updated timestamp is showing on the page fairshare.cgi.
  • Sanity function does not restart vovresourced daemon any longer.
  • If running counts is wrong and gets fixed by sanity, the server log will have a message that looks like "Fixing count of running jobs old=1 new=2".
  • Running count became wrong when a job was invalidated, and this case is fixed.
NetworkComputer 8625 22226 Queries for requested-resource fields will now return correct values for jobs with non-running statuses.
NetworkComputer 8673 22267 Added "USERXDUR" and "USERXDURPP" fields to vovselect (for FlowTracer) and nc getfields (for NetworkComputer) to reflect the expected duration of the job as specified by the user. The existing fields "XDUR" and "XDURPP" will continue to be updated to reflect actual duration when the job completes successfully.
NetworkComputer 8678 22273 A slave that is in the process of stopping but is still running jobs will now have _stopped_<timestamp> appended to its name, whether stopped from the command line or through the server's browser-based UI. This allows a slave to be restarted multiple times. In addition, a request to a slave to stop can only be canceled if a replacement slave hasn't already started.
NetworkComputer 8697 21608 Improved the performance and reliability of the FREE_SLAVES preemption rule behavior.

2016.09 Update 16.1

Resolved Issues
Products Internal Number Case Number Description
NetworkComputer 8833 22689 Have vovfosterjob detect the malformed negated resource and translate it into the proper syntax (work-around prior to server-side fix).
NetworkComputer 8753 22291 Fix error in jobclasses.cgi, where a jobclass using the variable VOV_JOB_DESC(jobclass) would trigger a Tcl error.

2016.09 Update 15

Resolved Issues
Products Internal Number Case Number Description
All 8529 22165 Attempts to use vtk_resourcemap_set to create a self-referential resource map are now rejected with an error message and an alert.
All 8570   NetworkComputer and WorkloadXelerator queue configuration files are no longer corrupted when enabling server failover.
All 8592   Timespec values are now supported in selection rules, in all places that accept integer values.
FlowTracer 8261 22036 Fixed a problem with editing jobs from the browser UI when the command line uses single quote characters.
FlowTracer 8569 22173 We have added a new environment variable, VOV_SLAVE_SID_DISABLE. If this variable is set to 1, vovslave and vovslaveroot will not create new sessions (and consequently will not create new process groups) on startup.
FlowTracer 8595   When calling ForceValidate, queued jobs that are validated are removed from the buckets.
FlowTracer 8146 21921 Fixed the occasional failure of vovlsfd to start when run in single user mode for multi-user FT usage. Removed redundant checks for whether it is safe to run or not.
LicenseAllocator 8585 22193 Critical jobs check will now work with wait reasons that include number of tokens in the resource specification.
LicenseAllocator 8635   Improved presentation of "group matching" in LA user interface.
LicenseMonitor 8561 22112 Fix issues with LM SFD that prevented it from running as a Windows service under the "system" user. Running under the "system" user requires the user name AND password to be "system" in the account information portion of the SFD GUI. The "system" user is now added to the default security configuration in security.tcl to allow it to run normally. Because the "system" user has no password, at least one real user will need to be granted ADMIN privileges in the security.tcl file as well. After doing so, restart the service for the change to take affect. Please allow for 30 seconds between stop and start of the service. Note that the service must be uninstalled and reinstalled or it will fail to start with this version of the SFD. This can be done using the Install/Delete controls in the service portion of the SFD GUI.
LicenseMonitor 8580 22181 Fixed error in LicenseMonitor that prevented the viewing of very large batch reports.
LicenseMonitor 8620 22225 Now the ftlm_parse_flexlm program correctly outputs the parsed version for the queued for license records to the *.chk file.
LicenseMonitor 8526   Clarified messaging regarding unrecognized slave configuration parameters; this is now a warning rather than an error, as it can occur for example when using a newer server with an older slave.
NetworkComputer 8468 22121 NetworkComputer no longer "forgets" a resource map that has been in use for longer than the cutoff provided or is currently in use by a preempted job.
NetworkComputer 7643 21098 Fixed issue that caused jobs submitted through the LSF emulation layer to result in a SLEEPING state due to an output conflict caused by the use of the -o option to bsub.
NetworkComputer 8038 21752 Added the ability to grant a user the privilege to stop another user's job. This can be accomplished by granting the STOP ACL privilege to the other user by using the vovacl command line utility.
vovacl -agent "USER joe" -append -actions STOP 008210915 
NetworkComputer 8089 21856 Ensure VOV_JOBCLASS_DIRS is honored in CLI and web UI, regardless whether it is defined in the queue setup file in vncConfig or the vnc_policy.tcl file. This includes a new line in the output of the nc jobclass -ll command that shows the location of the jobclass script. For the web UI specifically, jobclasses that are located in a user-defined directory are also marked as editable.
NetworkComputer 8233 21983 Messaging in vsy, nc info and related commands has been changed to clarify which resources are actually missing when the user specifies OR clauses in the resource request.
NetworkComputer 8347 22087 The slave.childProcessCleanup parameter in policy.tcl no longer requires cgroups; it does still require a Linux platform. Additionally the method of process cleanup has been changed to be more thorough; this setting may cause vovslave to be less responsive in heavy usage scenarios.
NetworkComputer 8439 22088 The snapshot environment used by the nc run -ep option is now protected against stale system environment variables, such as VOV_PORT_NUMBER.
NetworkComputer 8587 22194 Fixed handling of selection rules for preemptable job for MULTIQUEUE preemption rules.
NetworkComputer 8631   Fixed resource leak caused by hyper-active preemption of same job.
NetworkComputer 8412   NetworkComputer now reconnects properly to a LicenseMonitor instance that has been restarted.
NetworkComputer 8588   Improved reliability of nc stop -dir xxx which now makes sure all stops have been performed.
NetworkComputer 8629   A job preempted with EXT and then killed correctly ends up in the FAILED status rather than WITHDRAWN.
NetworkComputer 8636   Allow editing of mqthresh parameter in MULTIQUEUE preemption rules.
HERO 8164   Fixed problem with exit-status-based auto-rescheduling that prevent the job from running upon being auto-rescheduled.

2016.09 Update 14

New Features and Enhancements
Products Internal Number Case Number Description
NetworkComputer 8248   NetworkComputer will now update license information from LicenseMonitor the minimum of either every 5 minutes or half the configured expiration time.

Resolved Issues
Products Internal Number Case Number Description
FlowTracer 8479   Fixed "Force Validate", which uses a procedure called MakeDashT to emulate the "-t" option to Make. This feature updates the start and end times of jobs, which was being prevented due to tightened protections that were added. This change also fixes a similar problem with "RECONCILE_WITH_FILE_SYSTEM", which also needs to update start and end times of jobs.
FlowTracer 8519   Added back addflow.cgi and removed the broken web link to it in the web interface. This script is not expected to be used but the broken web link needed to be removed.
LicenseMonitor 8487   Handles correctly the case "No checkouts found for the specified report period." for division calculation when excluding idle time.
NetworkComputer 8401 22101 An incorrect "why" message produced by "nc info" when jobs had been stopped was corrected.
NetworkComputer 8503 22148 Added support for entire list of legacy field names in "nc hosts -O", which in turn, fixes issues with the named groups, such as "nc hosts -SLOTS".
NetworkComputer 8187 21130 Fixed issue with auto-rescheduling of failed jobs so that the downcone dependencies do not get descheduled. Now, the failed job can re-run and the downcone will run if and when the job passes on the second (or later) attempt.
NetworkComputer 8505 22150 Restored sensitivity to NC_STOP_SIGNALS and NC_STOP_SIG_DELAY job properties.
NetworkComputer 7952 21588 Reverted FairShare history graph to original width of 600px in order to restore x-axis label visibility for graph windows down to 5d.
WorkloadXelerator 8334 22085 vovelasticd will now use auxiliary resources of WX jobs when submitting slaves to NC. In case of a previously failed slave, using auxiliary resource, WX will try to submit a slave on a different host in NC.
WorkloadXelerator 8074 21997 Resource expressions with | operator in job classes should be combined without spaces. e.g. set VOV_JOB_DESC(resources) "(general|PD) Limit:..."

2016.09 Update 13

New Features and Enhancements
Products Internal Number Case Number Description
All 8346 22069 Changed default value of thread.service.max from 2 to 0, per customer request.
All 8420 22111 Allow control of the maximum number of maps in a job resource expression, previously hard-coded at 300. Now this can be set with the parameter resources.max.maps.
NetworkComputer 5947 22069 Added ability to shut down threads. This can be done now with % vovsh -x 'vtk_server_config thread.service.max 0'. Other parameters called thread.service.... are accessible to control when to use threads.
NetworkComputer 7535 20833 The Tcl procedure VovGetRevokeDelay {} can now be added and customized by redefining it in vovresourced/config.tcl under the SWD directory to allow users to customize the revoke delay to be used in vovreconciled. This allows users to have the revoke delay from their job classes override the default value of RESD(revokeDelay). The proc definition has been added to the documentation. In addition, the verbosity levels of various messages have been modified per customer request.
NetworkComputer 7455 20674 The optional live_keepfor_jobs.tcl task script has been improved to reduce the load on the NC vovserver.
NetworkComputer 7959 21585 The -maxload option of vtk_slave_define now accepts simple expressions relative to the capacity value, represented by the keyword 'CAPACITY'. For example, -maxload CAPACITY*1.5 would set the maxload to 1.5 times the number of slots. Supported operators are: +-/* .

Resolved Issues
Products Internal Number Case Number Description
All 8031 21729 As of 2016.09 update 11, there is a new environment variable VOV_PAUSE_CHILD_SIGNAL, which controls the signalling of child processes on Linux platforms when a slave enters the PAUSED state (usually this happens when a slave receives a SIGTSTP signal). The valid values of this variable are: "STOP" - send SIGSTOP to all child processes. This is the default behavior. "TSTP" - send SIGTSTP to all child processes. SIGTSTP is not guaranteed to suspend child processes. "NONE" or ""(empty string) - do not signal child processes. Any value other than the valid values will result in the default behavior (SIGSTOP). As of 2016.09 Update 13, the environment variable VOV_PAUSE_VW_ON_TSTP can be set to 1 to cause vw jobs to pause when they receive the SIGTSTP signal. The vw jobs will continue if they receive either SIGCONT or SIGALRM.
All 8486   Fixed a bug that was introduced in 2016.09u10 wherein the server would crash if it failed to obtain a lock on the server.info file (on server startup).
FlowTracer 8294 22059 Prevent Tcl errors for FDL procedures that apply to the most recent job, set, or file when the FDL has not yet declared the necessary item. Issue a warning that the item was ignored for properties, annotations, and job names. For inputs, outputs, capsules, adding items to sets, IFJOB, and X11_DISPLAY, stop the vovbuild with a fatal error when this problem occurs.
LicenseMonitor 7863 1099 vovsql_load_checkouts now is able to process checkout log with handle# > limit of INTEGER data type in one of the following ways:
  • For newly created checkouts table, the handle's data type is BIGINT, can deal with integer up to 263-1 (vs. the INTEGER can deal with integer up to 231-1=2147483647).
  • For existing tables, if you set the environment variable ALTER_HANDLE_TO_BIGINT before you upgrade the software, it will convert handle from INTEGER to BIGINT.
  • For those keeping the handle as INTEGER type, we will do some processing in the temporary table when loading the data to checkouts table, i.e. handle=handle%2000000000 so that it will not over the limit.
LicenseMonitor 7977 21594 The efficiency stats report now has option "Include idle time". If selected, the percentile calculation will based duration, match the right-side (Exactly Used) chart in the "Feature Efficiency Histogram".
LicenseMonitor 8054   The .png export for report chart will work correctly even for high DPI canvas.
LicenseMonitor 8251 21817 Updated LicenseMonitor's FlexLM parser to combine reservations for the same feature and user or group that are spread over multiple declarations.
LicenseMonitor 8173 21934 The vovserver policy.tcl file options checkoutHostLowerCase and checkoutUserLowercase were inoperative, but now correctly control case-sensitivity.
LicenseMonitor 8201 21958 Fixed issue in the usage comparison plot report where an error is produced when dropped queued requests are present.
NetworkComputer 8187 21130 Added a post_retrace_downcone post-processing script to schedule the down cone of a job, to cover the case in which a job with dependencies may be automatically resubmitted multiple times.
NetworkComputer 8241 21985 Fixed a problem with @JOBID@ not being expanded in the environment specification. This was causing a problem with cross queue submission due to environment not being captured correctly.
NetworkComputer 8362 22086 Fixed issue with new dependencies for existing jobs in NC that caused the job that is a dependency for another job to turn invalid, which also invalidated the downcone. Now, it does not turn invalid.
NetworkComputer 7982 21646 Changed documentation to suggest usage of the option -rule rather than -hw for multiple constraints in an nc hosts command. Corrected the help for nc hosts. An issue was also fixed in the nc hosts -hw command that prevented multiple elements from being evaluated.
NetworkComputer 8188 21251 Previously, when vovgetgroups timed out or did not return groups info correctly, the job would run with the incorrect groups. Following this change, if vovgetgroups fails or times out, the job will fail.
NetworkComputer 8219 22004 No longer perform detailed slave analysis unless there is a confirmed HW resource wait reason.
NetworkComputer 8244   A new server configuration parameter is added to limit the maximum number of ORs in a resource expression. For example in policy.tcl:
set config(maxORsInResourceExpressions) 10 
Or from the shell:
nc cmd vovsh -x 'vtk_server_config
              maxORsInResourceExpressions 10' 
If a job is submitted and the number of ORs in its resource expression exceeds the limit, the job fails with a message in the Why info. The default value for the limit is 20.
NetworkComputer 8458   Synchronized the nc hosts utility with supported slave fields so that all fields are supported by the -O option.
NetworkComputer 8089 21856 NC jobclasses defined via VOV_JOBCLASS_DIRS now are visible in web and command line.
NetworkComputer 8111 21869 vovserver no longer issues incorrect license status GRACE messages in its log file.
NetworkComputer 8218   Fixed error message concerning undefined errorCode variable in the nc why help message.
NetworkComputer 8330 22081 Fixed a bug in which nc hosts -f did not return an accurate list of field names.
NetworkComputer