ac6-training, un département d'Ac6 SAS
 
Site displayed in English (USA)
View the site in FrenchSite affiché en English (USA)Voir le site en English (GB)
+ +
- -
Online Training
 
Calendar  Details
Operating Systems
 
Calendar  Details
Programming
Calendar  Details
Processors
 
Calendar  Details
Communication
 
 
 
Calendar  Details
+ +
> >
- -

ac6 >> ac6-training >> Processors >> NXP Power CPUs >> P102X QorIQ implementation Download as PDF Write us

FCQ11 P102X QorIQ implementation

This course covers NXP QorIQ P1020/P1011, P1021/P1012, P1022/P1013, P1023/P1017, P1024/P1015, P1025/P1016

formateur
Objectives
  • The course clarifies the architecture of the P102X, particularly the operation of the coherency module that interconnects the e500s to memory and high-speed interfaces.
  • Cache coherency protocol is introduced in increasing depth.
  • The e500 core is viewed in detail, especially the SPE unit that enable vector processing.
  • The boot sequence and the clocking are explained.
  • The course focuses on the hardware implementation of the P102X.
  • A long introduction to DDR SDRAM operation is done before studying the DDR2/3 SDRAM controller.
  • An in-depth description of the PCI-Express port is done.
  • The course highlights both hardware and software implementation of gigabit / fast / Ethernet controllers.
  • Communication interfaces are explained according to the exact reference of the SoC: either TDM or QuiccEngine or DPAA.

  • AC6 has developed an optimized SPE based FFT coded in assembler language.
  • Performance for 1024 complex floating point single precision samples is:
    • - 91_386 core clock cycles without reverse ordering, 94_124 with reverse ordering
  • Performance for 4096 complex floating point single precision samples is:
    • - 470_778 core clock cycles without reverse ordering, 511_227 with reverse ordering
  • For any information contact training@ac6-training.com
A more detailed course description is available on request at training@ac6-training.com
  • Theoretical course
    • PDF course material (in English) supplemented by a printed version for face-to-face courses.
    • Online courses are dispensed using the Teams video-conferencing system.
    • The trainer answers trainees' questions during the training and provide technical and pedagogical assistance.
  • At the start of each session the trainer will interact with the trainees to ensure the course fits their expectations and correct if needed
  • Any embedded systems engineer or technician with the above prerequisites.
  • The prerequisites indicated above are assessed before the training by the technical supervision of the traineein his company, or by the trainee himself in the exceptional case of an individual trainee.
  • Trainee progress is assessed by quizzes offered at the end of various sections to verify that the trainees have assimilated the points presented
  • At the end of the training, each trainee receives a certificate attesting that they have successfully completed the course.
    • In the event of a problem, discovered during the course, due to a lack of prerequisites by the trainee a different or additional training is offered to them, generally to reinforce their prerequisites,in agreement with their company manager if applicable.

Course Outline

  • Internal data flows, OCEAN switch fabric, packet reordering
  • Implementation examples
  • Address map, ATMU, OCEAN configuration
  • Local vs external address spaces, inbound and outbound address decoding
  • Dual-issue superscalar operation
  • Execution units
  • Dynamic branch prediction
  • The Core Complex Bus
  • Store miss merging and store gathering
  • Memory access ordering
  • Lock acquisition and import barriers
  • The first level MMU and the second level MMU, consistency between L1 and L2 TLBs
  • TLB software reload
  • Process protection
  • 36-bit real addressing
  • The L1 caches
  • Software cache coherency
  • Level 2 cache
  • Allocation of data transferred by external masters into the cache: stashing
  • e500 coherency module
  • Snooping mechanism
  • Stashing mechanism
  • L2 cache locking
  • Differences between the new Book E architecture and the classic PowerPC architecture
  • Floating Point units, Double-Precision FP
  • Signal Processing APU (SPE)
  • Book E exception handling
  • Syndrome registers
  • Core timers
  • Performance monitoring
  • JTAG emulation
  • Watchpoint logic
  • Platform clock
  • Voltage configuration selection
  • Power-on reset sequence, using the I2C interface to access serial ROM
  • Power management
  • eSDHC boot
  • eSPI boot ROM
  • I/O arbiter
  • CCB arbiter
  • CCB interface
  • On-Die termination
  • Calibration mechanism
  • Mode registers initialization, bank selection and precharge
  • Command truth table
  • Hardware interface
  • Bank activation, read, write and precharge timing diagrams, page mode
  • ECC error correction
  • Initialization routine
  • Multiplexed or non-multiplexed address and data buses
  • Dynamic bus sizing
  • GPCM, UPMs states machines
  • NAND flash controller
  • 4-lane PCI Express interface
  • Modes of operation, Root Complex / Endpoint
  • Transaction ordering rules
  • Programming inbound and outbound ATMUs
  • PIC in multiple-processor implementation
  • Understanding interrupt masking
  • Interprocessor interrupts
  • Per-CPU register usage, message registers
  • Nesting implementation
  • Support for cascading descriptor chains
  • Scatter / gathering
  • Selectable hardware enforced coherency
  • Threshold events
  • Chaining, triggering
  • Watchpoint facility
  • Trace buffer
  • Address recognition, pattern matching
  • Buffer descriptors management
  • Physical interfaces
  • Buffer descriptor management
  • 256-entry hash table for unicast and multicast
  • Management of VLAN tags and priority, VLAN insertion and deletion
  • Quality of service, managing several transmit and receive queues
  • TCP/IP offload engine, filer programming
  • IEEE1588 compliant time-stamping
  • Storing and executing commands targeting the external card
  • Multi-block transfers
  • Moving data by using the dedicated DMA controller
  • Dividing large data transfers
  • EHCI implementation
  • Periodic Frame List
  • ULPI interfaces to the transceiver
  • OTG support
  • Endpoints configuration
  • Crypto channels
  • Sequence to subcontract a crypto job to SEC
  • Link tables
  • Managing interrupts
  • Description of the NS16552 compliant Uarts
  • I2C controller
  • Enhanced SPI controller
  • Serial interface
  • Network mode of operation with up to 128 time-slots
  • DMA configuration
  • End-of-frame interrupt
  • Configuring the TDM for I2S Operation
  • Display interfaces
  • Display color depth
  • Pixel structure, alpha-blending
  • Utilization of area descriptor
  • Moving images through the dedicated DMA channel
  • Integrated RISC CPU
  • Communication between Host CPU and QE RISC CPU
  • Priority management
  • Steering the interrupt source to either Low priority or High priority input of the platform PIC
  • Serial DMA
  • QUICC engine external requests
  • NMSI vs TDM
  • Enabling connections to TSA or NMSI
  • Utilization of Buffer Descriptors
  • Chaining descriptors into rings
  • Parameter RAM independent of protocol
  • UCC as slow communications controllers, UART mode
  • UCC for fast protocols, virtual FIFOs
  • Flow control
  • Setting global parameters
  • Describing the parameter RAM
  • Transparent data encapsulation, frame sync and frame CRC
  • Describing the parameter RAM
  • Connecting TDM lines
  • Parameterizing the timings related to Rx/Tx clock, sync and data signals
  • Connecting the TDM line to UCC using Rx/Tx routing tables
  • Comparison with MCC and QMC
  • Connecting time-slots to logical channels through Rx/Tx routing tables
  • Implementing Rx/Tx channel buffers
  • Interrupt management
  • Channel-specific HDLC parameters
  • Per channel exception management
  • UMCC host commands
  • Definitions: buffer, buffer pool, frame, frame queue, work queue, channel
  • Data formats
  • Frame formats
  • Packet walk through
  • Objectives if this accelerator
  • Frame description
  • Structure of frame queues
  • Frame queue state machine
  • Multiway resource arbiter
  • Work queues and channels
  • Enqueue and dequeue portals
  • Class and intra-class scheduling rules
  • Dequeue dispatcher operation
  • Message ring
  • Stash transaction flow control and scheduling
  • Congestion avoidance
  • CoreNet initiator scheduling and priority
  • Objectives if this accelerator
  • Software portals
  • Direct connect portals
  • Software interface, Command register, Management Response registers
  • Buffer Pool State Change Notifications
  • Buffer pool size programming
  • Performance Monitor
  • Objectives if this accelerator, parsing, classifying and distributing in-line/off-line packet
  • FMAN submodules
  • Rx BMI features
  • Tx BMI features
  • Offline parsing, host command features
  • Frame processing manager
  • FMan controller
  • Parser
  • Key generator
  • Policer
  • MAC address recognition
  • 256-entry hash table for unicast and multicast
  • Suspending the transmitter, handling pause packets
  • RMON statistic counters, carry registers
  • Client IEEE1588 timers
  • Job management using QMan interface
  • Input / output rings
  • Job descriptor parsing
  • Sharing descriptors
  • Selecting the authentication / cryptographic algorithm
  • Public Key Hardware Accelerator (PKHA)
  • SNOW 3G Accelerator
  • Example, implementing IPSec