189773 ⎘
Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs; Arrangements for executing machine instructions, e.g. instruction decode; Concurrent instruction execution, e.g. pipeline, look ahead Instruction issuing, e.g. dynamic instruction scheduling, out of order instruction execution
Enhanced performance-aware instruction scheduling
#1202METHOD AND APPARATUS FOR FLUSHING INSTRUCTIONS FROM RESERVATION STATIONS
#1203SELECT IN-ORDER INSTRUCTION PICK USING AN OUT OF ORDER INSTRUCTION PICKER
#1204PROVIDING CODE SECTIONS FOR MATRIX OF ARITHMETIC LOGIC UNITS IN A PROCESSOR
#1205Data read-write scheduler and reservation station for vector operations
#1206Data read-write scheduler and reservation station for vector operations
#1207Entanglement of pages and guest threads
#1208Apparatus and methods for matrix multiplication
#1209Token-based data dependency protection for memory access
#1210Instruction and logic for tracking fetch performance bottlenecks
#1211Efficient mitigation of side-channel based attacks against speculative execution processing architectures
#1212Flash Storage Devices Executing ECC in Parallel and Methods Thereof
#1213Memory device, a dual inline memory module, a storage device, an apparatus for storing, a method for storing, a computer program, a machine readable storage, and a machine readable medium
#1214Wide vector execution in single thread mode for an out-of-order processor
#1215Wide vector execution in single thread mode for an out-of-order processor
#1216Managing an issue queue for fused instructions and paired instructions in a microprocessor
#1217Managing an issue queue for fused instructions and paired instructions in a microprocessor
#1218Processor with hybrid pipeline capable of operating in out-of-order and in-order modes
#1219TECHNIQUES TO MITIGATE HIGH LATENCY INSTRUCTIONS IN HIGH FREQUENCY EXECUTION PATHS
#1220Fast multi-width instruction issue in parallel slice processor
#1221Parallel slice processor shadowing states of hardware threads across execution slices
#1222Dual data streams sharing dual level two cache access ports to maximize bandwidth utilization
#1223CACHE RETURN ORDER OPTIMIZATION
#1224Cache return order optimization
#1225Cracked execution of move-to-FPSCR instructions
#1226Managing backend resources via frontend steering or stalls
#1227Vector processing using loops of dynamic vector length
#1228Stream processor with decoupled crossbar for cross lane operations
#1229Instructions for remote atomic operations
#1230System and method of merging partial write results for resolving renaming size issues
#1231Selective updating of floating point controls
#1232Efficient enforcement of barriers with respect to memory move sequences
#1233Asymmetric multi-core heterogeneous parallel processing system
#1234Scheduling tasks using work fullness counter
#1235Scheduling tasks using targeted pipelines
#1236Scheduling tasks using swap flags
#1237Systems and methods for automatic computer code parallelization
#1238Multicore on-die memory microcontroller
#1239Continuation analysis tasks for GPU task scheduling
#1240Performance scaling for binary translation
#1241Arithmetic processing device and method of controlling arithmetic processing device
#1242Virtual machine coprocessor for accelerating software execution
#1243Multifunction vector processor circuits
#1244Tensor processor instruction set architecture
#1245On-demand GPR ECC error detection and scrubbing for a multi-slice microprocessor
#1246Linkable issue queue parallel execution slice processing method
#1247Parallel slice processor having a recirculating load-store queue for fast deallocation of issue queue entries
#1248Implementing barriers to efficiently support cumulativity in a weakly ordered memory system
#1249Banking register renaming to reduce power
#1250MULTI-NULLIFICATION
#1251System and method for processing and arbitrating submission and completion queues
#1252Processing operation issue control
#1253Intelligent thread dispatch and vectorization of atomic operations
#1254Techniques for comprehensively synchronizing execution threads
#1255Mixed inference using low and high precision
#1256Instructions having support for floating point and integer data types in the same register
#1257System and method of reducing processor pipeline stall caused by full load queue
#1258Early predicate look-up
#1259Enhancing processing performance of artificial intelligence/machine hardware by data sharing and distribution as well as reuse of data in neuron buffer/line buffer
#1260Dynamically partitioning workload in a deep neural network module to reduce power consumption
#1261Power-efficient deep neural network module configured for parallel kernel and parallel input processing
#1262Power-efficient deep neural network module configured for executing a layer descriptor list
#1263Minimizing memory reads and increasing performance by leveraging aligned blob data in a processing unit of a neural network environment
#1264Neural network processor using compression and decompression of activation data to reduce memory bandwidth utilization
#1265Reducing power consumption in a neural network processor by skipping processing operations
#1266Power-efficient deep neural network module configured for layer and operation fencing and dependency management
#1267Dynamic sequencing of data partitions for optimizing memory utilization and performance of neural networks
#1268Control system for process data and method for controlling process data
#1269Predicting cache misses using data access behavior and instruction address
#1270SYSTEM AND METHOD OF EXECUTING CACHE LINE UNALIGNED LOAD INSTRUCTIONS
#1271Graphics processing integrated circuit package
#1272Operation of a multi-slice processor with an expanded merge fetching queue
#1273Engine to enable high speed context switching via on-die storage
#1274Parallel input/output via multipath software
#1275Reconfigurable processor with load-store slices supporting reorder and controlling access to cache slices
#1276Method of detecting repetition of an out-of-order execution schedule, apparatus and computer-readable medium
#1277Processor and task processing method therefor, and storage medium
#1278Hints for shared store pipeline and multi-rate targets
#1279Determine whether to fuse move prefix instruction and immediately following instruction independently of detecting identical destination registers
#1280Spin loop delay instruction
#1281Apparatus and method to generate trace data in response to transactional execution
#1282Spin loop delay instruction
#1283Operation instruction response control method and terminal for human-machine interface
#1284Resource scheduling system and method under graphics processing unit virtualization based on instant feedback of application effect
#1285Multifunctional hexadecimal instruction form system and program product
#1286Hardware mechanism to mitigate stalling of a processor core
#1287Indicating instruction scheduling mode for processing wavefront portions
#1288Method and apparatus for providing accelerated access to a memory system
#1289Method and apparatus for efficient scheduling for asymmetrical execution units
#1290Hardware processors and methods for tightly-coupled heterogeneous computing
#1291METHOD FOR POPULATING AND INSTRUCTION VIEW DATA STRUCTURE BY USING REGISTER TEMPLATE SNAPSHOTS
#1292Infinite processor thread balancing
#1293Method and apparatus for asynchronous scheduling
#1294Parallel slice processor shadowing states of hardware threads across execution slices
#1295Data processing systems
#1296IMPLEMENTATION OF REGISTER RENAMING, CALL-RETURN PREDICTION AND PREFETCH
#1297Load/store unit for a processor, and applications thereof
#1298Unambiguous proxying of interface methods
#1299Methods and apparatus for adjacency network delivery of operands to instruction specified destinations that reduces storage of temporary variables
#1300Issuing instructions to multiple execution units
#1301DISPATCH OF PROCESSOR READ RESULTS
#1302Operation cache
#1303General purpose register allocation in streaming processor
#1304Memory fragments for supporting code block execution by using virtual cores instantiated by partitionable engines
#1305Bufferless communication for redundant multithreading using register permutation
#1306Deterministic code fingerprinting for program flow monitoring
#1307Out-of-order processor that avoids deadlock in processing queues by designating a most favored instruction
#1308Hardware acceleration method, compiler, and device
#1309Pipeline including separate hardware data paths for different instruction types
#1310Computer instruction processing method, coprocessor, and system
#1311Compact linked-list-based multi-threaded instruction graduation buffer
#1312Accessing data in multi-dimensional tensors
#1313Method, device, and single-tasking system for implementing multi-tasking in single-tasking system
#1314Computer architecture with a hardware accumulator reset
#1315System and method for load balancing in out-of-order clustered decoding
#1316Method and system for managing data access in storage system
#1317Electronic apparatus, processor and control method including a compiler scheduling instructions to reduce unused input ports
#1318Reducing power consumption in a multi-slice computer processor
#1319Processor with improved alias queue and store collision detection to reduce memory violations and load replays
#1320Effectiveness and prioritization of prefetches
#1321Arithmetic processing device and control method for arithmetic processing device
#1322Reducing power consumption in a multi-slice computer processor
#1323Multi-thread processor with rescheduling when threads are nondispatchable
#1324Independent mapping of threads
#1325Signaling interface with phase and framing calibration
#1326Method for implementing a reduced size register view data structure in a microprocessor
#1327Memory dependence prediction
#1328Efficient enforcement of barriers with respect to memory move sequences
#1329System and method for load and store queue allocations at address generation time
#1330Highly efficient inexact computing storage device
#1331Starting and stopping instruction dispatch to execution unit queues in a multi-pipeline processor
#1332Microprocessor that fuses if-then instructions
#1333Scheduling independent and dependent operations for processing
#1334Controlling the operating speed of stages of an asynchronous pipeline
#1335Method for performing dual dispatch of blocks and half blocks
#1336Method for executing multithreaded instructions grouped into blocks
#1337Techniques for hybrid computer thread creation and management
#1338Advanced processor architecture
#1339OPERATION OF A MULTI-SLICE PROCESSOR IMPLEMENTING PRIORITIZED DEPENDENCY CHAIN RESOLUTION
#1340Effectiveness and prioritization of prefeteches
#1341PROGRAM INFORMATION GENERATION SYSTEM, METHOD, AND COMPUTER PROGRAM PRODUCT
#1342LOAD-STORE QUEUE FOR MULTIPLE PROCESSOR CORES
#1343LOAD-STORE QUEUE FOR BLOCK-BASED PROCESSOR
#1344Core pairing in multicore systems
#1345Techniques for hybrid computer thread creation and management
#1346INSTRUCTION AND LOGIC TO PROVIDE VECTOR SCATTER-OP AND GATHER-OP FUNCTIONALITY
#1347Broadcasting messages between execution slices for issued instructions indicating when execution results are ready
#1348Executing memory requests out of order
#1349Operation of a multi-slice processor implementing datapath steering
#1350Operation of a multi-slice processor implementing datapath steering
#1351Fetched data in an ultra-short piped load store unit
#1352System and method for associative power and clock management with instruction governed operation for power efficient computing
#1353Direct register restore mechanism for distributed history buffers
#1354Operation of a multi-slice processor implementing tagged geometric history length (TAGE) branch prediction
#1355Method and apparatus for reordering in a non-uniform compute device
#1356Graphic processor unit providing reduced storage costs for similar operands
#1357Age based fast instruction issue
#1358Implementing barriers to efficiently support cumulativity in a weakly-ordered memory system
#1359Implementing barriers to efficiently support cumulativity in a weakly-ordered memory system
#1360Hybrid block-based processor and custom function blocks
#1361Out-of-order block-based processors and instruction schedulers using ready state data indexed by instruction position identifiers
#1362Incremental scheduler for out-of-order block ISA processors
#1363PARALLEL INSTRUCTION SCHEDULER FOR BLOCK ISA PROCESSOR
#1364Merging status and control data in a reservation station
#1365Sliding time window control mechanism for parallel execution of multiple processor core models in a virtual platform simulation
#1366Annotation logic for dynamic instruction lookahead distance determination
#1367FPSCR STICKY BIT HANDLING FOR OUT OF ORDER INSTRUCTION EXECUTION
#1368Method, apparatus and instructions for parallel data conversions
#1369Method and apparatus for implementing a dynamic out-of-order processor pipeline
#1370Preventing hazard flushes in an instruction sequencing unit of a multi-slice processor
#1371ARITHMETIC PROCESSING DEVICE, METHOD, AND SYSTEM
#1372Execution slice with supplemental instruction port for an instruction using a source operand from another instruction port
#1373Operation of a multi-slice processor implementing a hardware level transfer of an execution thread
#1374APPARATUS AND METHOD FOR RE-EXECUTION OF FAULTING OPERATIONS
#1375Multiple instruction issuance with parallel inter-group and intra-group picking
#1376Executing a composite VLIW instruction having a scalar atom that indicates an iteration of execution
#1377Apparatus and method for dynamic power reduction in a unified scheduler
#1378Operation of a multi-slice processor with an expanded merge fetching queue
#1379Relaxed execution of overlapping mixed-scalar-vector instructions
#1380Age based fast instruction issue
#1381Preventing premature reads from a general purpose register
#1382Command scheduler for a display device
#1383Single cycle multi-branch prediction including shadow cache for early far branch prediction
#1384Controlling transition between using first and second processing circuitry
#1385Thread migration using a microcode engine of a multi-slice processor
#1386Coalescing adjacent gather/scatter operations
#1387Operation of a multi-slice processor preventing early dependent instruction wakeup
#1388Operation of a multi-slice processor implementing dynamic switching of instruction issuance order
#1389Energy efficient processor core architecture for image processor
#1390Dual data streams sharing dual level two cache access ports to maximize bandwidth utilization
#1391Dynamic pipeline throttling using confidence-based weighting of in-flight branch instructions
#1392System and method for contextual vectorization of instructions at runtime
#1393Systems, apparatuses, and methods for performing a double blocked sum of absolute differences
#1394Instructions for managing a parallel cache hierarchy
#1395Operation of a multi-slice processor implementing a mechanism to overcome a system hang
#1396Parallel dispatching of multi-operation instructions in a multi-slice computer processor
#1397Controlling processing of instructions in a processing pipeline
#1398Controlling processing of instructions in a processing pipeline
#1399Asynchronous instruction execution apparatus with execution modules invoking external calculation resources
#1400MFENCE AND LFENCE MICRO-ARCHITECTURAL IMPLEMENTATION METHOD AND SYSTEM
#1401Execution of load instructions in a processor
#1402Predictive fetching and decoding for selected instructions
#1403Allocating a register to an instruction using register index information
#1404Providing task-triggered deterministic operational mode for simultaneous multi-threaded superscalar processor
#1405Scheduling execution of instructions on a processor having multiple hardware threads with different execution resources
#1406Techniques for hybrid computer thread creation and management
#1407Techniques for hybrid computer thread creation and management
#1408INSTRUCTION EXECUTION METHOD AND PROCESSOR
#1409Computer system
#1410Lightweight interrupts for condition checking
#1411Method and apparatus for detecting memory conflicts using distinguished memory addresses
#1412Processing of multiple instruction streams in a parallel slice processor
#1413Determining of validity of speculative load data after a predetermined period of time in a multi-slice processor
#1414Multi-slice processor issue of a dependent instruction in an issue queue based on issue of a producer instruction
#1415Multi-slice processor issue of a dependent instruction in an issue queue based on issue of a producer instruction
#1416Energy efficient source operand issue
#1417Operation of a multi-slice processor with reduced flush and restore latency
#1418Determining of validity of speculative load data after a predetermined period of time in a multi-slice processor
#1419Operation of a multi-slice processor with reduced flush and restore latency
#1420Reducing power consumption in a multi-slice computer processor
#1421Reducing power consumption in a multi-slice computer processor
#1422Multilevel conversion table cache for translating guest instructions to native instructions
#1423Processor with an expandable instruction set architecture for dynamically configuring execution resources
#1424Run-time code parallelization with independent speculative committing of instructions per segment
#1425Conversion system for a processor with an expandable instruction set architecture for dynamically configuring execution resources
#1426Compiler system for a processor with an expandable instruction set architecture for dynamically configuring execution resources
#1427Instruction And Logic For Programmable Fabric Hierarchy And Cache
#1428System and method of speculative parallel execution of cache line unaligned load instructions
#1429Cache system and method using track table and branch information
#1430Computer architecture with resistive processing units
#1431Fusion of instructions by delaying handling of a partial subset of a fusible group of instructions
#1432Method for dependency broadcasting through a source organized source view data structure
#1433Method for populating register view data structure by using register template snapshots
#1434Method for populating a source view data structure by using register template snapshots
#1435Tightly coupled processor arrays using coarse grained reconfigurable architecture with iteration level commits
#1436Tightly coupled processor arrays using coarse grained reconfigurable architecture with iteration level commits
#1437Method for forming constant extensions in the same execute packet in a VLIW processor
#1438Multi-thread processor with rescheduling when threads are nondispatchable
#1439Data processing apparatus and method for executing a stream of instructions out of order with respect to original program order
#1440Processing instructions in parallel with waw hazards and via a distributed history buffer in a microprocessor having a multi-execution slice architecture
#1441SIMD variable shift and rotate using control manipulation
#1442Neural network unit employing user-supplied reciprocal for normalizing an accumulated value
#1443Neural network unit with shared activation function units
#1444Neural processing unit that selectively writes back to neural memory either activation function output or accumulator value
#1445Neural network unit with output buffer feedback and masking capability with processing unit groups that operate as recurrent neural network LSTM cells
#1446Neural network unit that performs convolutions using collective shift register among array of neural processing units
#1447Multi-operation neural network unit
#1448Processor with hybrid coprocessor/execution unit neural network unit
#1449Neural network unit with neural memory and array of neural processing units and sequencer that collectively shift row of data received from neural memory
#1450Neural network unit with output buffer feedback for performing recurrent neural network computations
#1451Neural network unit with neural processing units dynamically configurable to process multiple data sizes
#1452Processor with architectural neural network execution unit
#1453Tri-configuration neural network unit
#1454Mechanism for communication between architectural program running on processor and non-architectural program running on execution unit of the processor regarding shared resource
#1455Processor with variable rate execution unit
#1456System and method for predicting latency of a variable-latency instruction
#1457Direct execution by an execution unit of a micro-operation loaded into an architectural register file by an architectural instruction of a processor
#1458Variable length execution pipeline
#1459Neural network unit with output buffer feedback and masking capability
#1460Neural network unit with neural memory and array of neural processing units that collectively shift row of data received from neural memory
#1461Neural network unit that performs stochastic rounding
#1462System and method for managing static divergence in a SIMD computing architecture
#1463Chained split execution of fused compound arithmetic operations
#1464Instruction and Logic for Indirect Accesses
#1465Microprocessor with a reservation stations structure including primary and secondary reservation stations and a bypass system
#1466Reconfiguring execution pipelines of out-of-order (OOO) computer processors based on phase training and prediction
#1467Debug support for block-based processor
#1468Instruction block address register
#1469Prefetching instruction blocks
#1470Broadcast channel architectures for block-based processors
#1471Block-based processor including topology and control registers to indicate resource sharing and size of logical processor
#1472Multi-nullification
#1473Write nullification
#1474Store nullification in the target field
#1475Implicit program order
#1476Register read/write ordering
#1477Dynamic generation of null instructions
#1478Generation and use of memory access instruction order encodings
#1479Multimodal targets in a block-based processor
#1480Dense read encoding for dataflow ISA
#1481Distinct system registers for logical processors
#1482Block-based processor core composition register
#1483Initiating instruction block execution using a register access instruction
#1484Task execution in a SIMD processing unit with parallel groups of processing lanes
#1485Multiple scheduling schemes for handling read requests
#1486Fine-grained heterogeneous computing
#1487Allocation of a segmented interconnect to support the execution of instruction sequences by a plurality of engines
#1488Expedited servicing of store operations in a data processing system
#1489Method of dispatching instruction data when a number of available resource credits meets a resource requirement
#1490Handling Instruction Data and Shared resources in a Processor Having an Architecture Including a Pre-Execution Pipeline and a Resource and a Resource Tracker Circuit Based on Credit Availability
#1491System and method of accelerating arbitration by approximating relative ages
#1492Device and processing architecture for resolving execution pipeline dependencies without requiring no operation instructions in the instruction memory
#1493Fusible instructions and logic to provide or-test and and-test functionality using multiple test sources
#1494Techniques for implementing barriers to efficiently support cumulativity in a weakly-ordered memory system
#1495Techniques for enhancing progress for hardware transactional memory
#1496Computer with hybrid Von-Neumann/dataflow execution architecture
#1497System and method for variable lane architecture
#1498Age based fast instruction issue
#1499Method and device for allocating a VLIW instruction based on slot information stored in a database by a calculation allocation instruction
#1500Method and apparatus for implementing dynamic portbinding within a reservation station