Wednesday, May 21, 2014

20 TOP DataStage Interview Questions and Answers pdf

The below List of top 20 DataStage Interview Questions and Answers for freshers and experienced pdf free download
1. How did you handle reject data?
Typically a Reject-link is defined and the rejected data is loaded back into data warehouse. So Reject link has to be defined every Output link you wish to collect rejected data. Rejected data is typically bad data like duplicates of Primary keys or null-rows where data is expected.

2. If worked with DS6.0 and latest versions what are Link-Partitioner and Link-Collector used for? 
Link Partitioner - Used for partitioning the data.
Link Collector - Used for collecting the partitioned data.

3. What are Routines and where/how are they written and have you written any routines before?
Routines are stored in the Routines branch of the DataStage Repository, where you can create, view or edit. The following are different types of routines:
1) Transform functions
2) Before-after job subroutines
3) Job Control routines

4. What are OConv () and Iconv () functions and where are they used?
IConv() - Converts a string to an internal storage format
OConv() - Converts an expression to an output format.

5. How did you connect to DB2 in your last project?
Using DB2 ODBC drivers.

6. Explain METASTAGE?
MetaStage is used to handle the Metadata which will be very useful for data lineage and data analysis later on. Meta Data defines the type of data we are handling. This Data Definitions are stored in repository and can be accessed with the use of MetaStage.

7. Do you know about INTEGRITY/QUALITY stage?
Qulaity Stage can be integrated with DataStage, In Quality Stage we have many stages like investigate, match, survivorship like that so that we can do the Quality related works and we can integrate with datastage we need Quality stage plugin to achieve the task.

8. Explain the differences between Oracle8i/9i?
Oracle 8i does not support pseudo column sysdate but 9i supports
Oracle 8i we can create 256 columns in a table but in 9i we can upto 1000 columns(fields)

9. How do you merge two files in DS?
Either use Copy command as a Before-job subroutine if the metadata of the 2 files are same or create a job to concatenate the 2 files into one if the metadata is different.

10. What is DS Designer used for?
You use the Designer to build jobs by creating a visual design that models the flow and transformation of data from the data source through to the target warehouse. The Designer graphical interface lets you select stage icons, drop them onto the Designer work area, and add links.

11. What is DS Administrator used for?
The Administrator enables you to set up DataStage users, control the purging of the Repository, and, if National Language Support (NLS) is enabled, install and manage maps and locales.

12. What is DS Director used for?
datastage director is used to run the jobs and validate the jobs.
we can go to datastage director from datastage designer it self.

13. What is DS Manager used for?
The Manager is a graphical tool that enables you to view and manage the contents of the DataStage Repository

14. What are Static Hash files and Dynamic Hash files?
As the names itself suggest what they mean. In general we use Type-30 dynamic Hash files. The Data file has a default size of 2Gb and the overflow file is used if the data exceeds the 2GB size.

15. What is Hash file stage and what is it used for?
Used for Look-ups. It is like a reference table. It is also used in-place of ODBC, OCI tables for better performance.

16. How are the Dimension tables designed?
Find where data for this dimension are located.
Figure out how to extract this data.
Determine how to maintain changes to this dimension.
Change fact table and DW population routines.

17. What are conformed dimensions?
A conformed dimension is a single, coherent view of the same piece of data throughout the organization. The same dimension is used in all subsequent star schemas defined. This enables reporting across the complete data warehouse in a simple format.

18. Why fact table is in normal form?
Basically the fact table consists of the Index keys of the dimension/ook up tables and the measures.
so when ever we have the keys in a table .that itself implies that the table is in the normal form.

19. What is a linked cube?
A cube can be stored on a single analysis server and then defined as a linked cube on other Analysis servers. End users connected to any of these analysis servers can then access the cube. This arrangement avoids the more costly alternative of storing and maintaining copies of a cube on multiple analysis servers. linked cubes can be connected using TCP/IP or HTTP. To end users a linked cube looks like a regular cube.

20. What is degenerate dimension table?
The values of dimension which is stored in fact table is called degenerate dimensions. these dimensions doesn't have its own dimensions.

More DataStage Interview Questions: Click Here

No comments: