BEGIN:VCALENDAR
VERSION:2.0
PRODID:Linklings LLC
BEGIN:VTIMEZONE
TZID:America/Chicago
X-LIC-LOCATION:America/Chicago
BEGIN:DAYLIGHT
TZOFFSETFROM:-0600
TZOFFSETTO:-0500
TZNAME:CDT
DTSTART:19700308T020000
RRULE:FREQ=YEARLY;BYMONTH=3;BYDAY=2SU
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0500
TZOFFSETTO:-0600
TZNAME:CST
DTSTART:19701101T020000
RRULE:FREQ=YEARLY;BYMONTH=11;BYDAY=1SU
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTAMP:20181221T160903Z
LOCATION:C2/3/4 Ballroom
DTSTART;TZID=America/Chicago:20181115T083000
DTEND;TZID=America/Chicago:20181115T170000
UID:submissions.supercomputing.org_SC18_sess324_post109@linklings.com
SUMMARY:Redesigning The Absorbing Boundary Algorithm for Asynchronous High
  Performance Acoustic Wave Propagation
DESCRIPTION:Poster\nTech Program Reg Pass, Exhibits Reg Pass\n\nRedesignin
 g The Absorbing Boundary Algorithm for Asynchronous High Performance Acous
 tic Wave Propagation\n\nAbdelkhalak, Akbudak, Etienne, Tonellot\n\nExploit
 ing high concurrency, relaxing the synchrony of existing algorithms, and i
 ncreasing data reuse have immense effect in performance. We integrate the 
 Multicore-optimized Wavefront Diamond (MWD) tiling approach by Malas et al
 . [SIAM SISC, 2015, ACM Trans. Parallel Comput. 2017],  which takes into a
 ccount the three aforementioned ingredients, into the industrial project c
 odenamed ExaWave framework beside the traditional spatial blocking (SB) te
 chnique for stencil computations. However, the fine-grained asynchronous h
 andling of the Convolution Perfectly Matched Layer (CPML) for absorbing bo
 undary conditions turns out to be a challenging open research problem, due
  to severe inherent data dependencies constraints, which impedes MWD perfo
 rmance impact. We propose techniques of loop fusion to reduce memory traff
 ic and sliding windows to cut down the engendered extra flops, in order to
  consolidate CPML integration with the overall asynchronous MWD technique.
  The experimental results on Intel's latest processors show the effectiven
 ess of the proposed techniques.
URL:https://sc18.supercomputing.org/presentation/?id=post109&sess=sess324
END:VEVENT
END:VCALENDAR

