·
Types of Stages in DS? Explain with
Examples
·
What are active stages and
passive stages?
·
Can you filter data in hashed
file? (No)
·
Difference between sequential
and hashed file?
·
How do you populate time
dimension?
·
Can we use target hashed file
as lookup? (Yes)
·
What is Merge Stage?
·
What is Job Sequencer?
·
What are stages in sequences?
·
How do you pass parameters?
·
What parameters you used in
your project?
·
What are log tables?
·
What is job controlling?
·
Facts and dimension tables?
·
Confirmed dimensions?
·
Difference between OLTP and
OLAP?
·
Difference between star schema
and snow flake schema?
·
What are hierarchies? Examples?
·
What are materialized views?
·
What is aggregation?
·
What is surrogate key? Is it
used for both fact and dimension tables?
·
Why do you go for oracle
sequence generator rather than datastage routine?
·
Flow of data in datastage?
·
Initial loading and incremental
loading?
·
What is SCD? Types?
·
How do you develop SCD type2 in
your project?
·
How do you load dimension data
and fact data? Which is first?
·
Difference between oracle
function and procedure?
·
Difference between unique and
primary key?
·
Difference between union and
union all?
·
What is minus operator?
·
What is audit table?
·
If there is a large hash file
and a smaller oracle table and if you are looking up from
·
transformer in different jobs
which will be faster?
·
Tell me about SCD’s?
·
How did you implement SCD in
your project?
·
What are derivations in
transformer?
·
How do you use surrogate key in
reporting?
·
Logs view in datastage, logs in
Informatica which is clear?
·
How does pivot stage work?
·
What is surrogate key? What is
the importance of it? How did you implement it in your
·
project?
·
Totally how many jobs did you
developed and how many lookups did you use totally?
·
How do constraint in
transformer work?
·
How will you declare a
constraint in datastage?
·
How will you handle rejected
data?
·
Give me some performance tips
in datastage?
·
Can we use sequential file as a
lookup?
·
How does hash file stage
lookup?
·
Why can’t we use sequential
file as a lookup?
·
What is data warehouse?
·
What is ‘Star-Schema’?
·
What is ‘Snowflake-Schema’?
·
What is difference between
Star-Schema and Snowflake-Schema?
·
What is mean by surrogate key?
·
What is ‘Conformed Dimension’?
·
What is Factless Fact Table?
·
When will we use connected and
unconnected lookup?
·
Which cache supports connected
and unconnected lookup?
·
What is the difference between
SCD Type2 and SCD Type3?
·
What is difference between data
mart and data warehouse?
·
What is composite key?
·
What is surrogate key? When you
will go for it?
·
What is dimensional modeling?
·
What are SCD and SGT?
Difference between them? Example of SGT from your project.
·
How do you import your source
and targets? What are the types of sources and targets?
·
What is Active Stages and
Passive Stages means in datastage?
·
What is difference between
Informatica and DataStage? Which do you think is best?
·
What are the stages you used in
your project?
·
What do you mean by parallel
processing?
·
What is difference between
Merge Stage and Join Stage?
·
What is difference between Copy
Stage and Transformer Stage?
·
What is difference between ODBC
Stage and OCI Stage?
·
What is difference between
Lookup Stage and Join Stage?
·
What is difference between
Change Capture Stage and Difference Stage?
·
What is difference between
Hashed file and Sequential File?
·
What are different Joins used
in Join Stage?
·
How you decide when to go for
join stage and lookup stage?
·
What is partition key? Which
key is used in round robin partition?
·
How do you handle SCD in
datastage?
·
What are Change Capture Stage
and Change Apply Stages?
·
How many streams to the
transformer you can give?
·
What is primary link and
reference link?
·
What is routine? What is before
and after subroutines? These are run after/before job or
·
stage?
·
What is Config File? Each job
having its own config file or one is needed?
·
What is Node?
·
What is IPC Stage? What it
increase performance?
·
What is Sequential buffer?
·
What are Link Partioner and
Link Collector?
·
What are the performance
tunning you have done in your project?
·
Did you done scheduling? How?
Can you schedule a job at the every end date of month?
·
How?
·
What is job sequence? Had you
run any jobs?
·
What is status view? Why you
clear this? If you clear the status view what internally
·
done?
·
What is hashed file? What are
the types of hashed file? Which you use? What is default?
·
What is main advantage of
hashed file? Difference between them. (static and dynamic)
·
What are containers? Give
example from your project.
·
What are parameters and
parameter file?
·
How do you convert columns to
rows and rows to columns in datastage? (Using Pivot
·
Stage).
·
What is Pivot Stage?
·
What is execution flow of
constraints, derivations and variables in transformer stage?
·
What are these?
·
How do you eliminate duplicates
in datastage? Can you use hash file for it?
·
If 1st and 8th record
is duplicate then which will be skipped? Can you configure it?
·
How do you import and export
datastage jobs? What is the file extension? (See each
·
component while importing and
exporting).
·
How do you rate yourself in
DataStage?
·
Explain DataStage Architecture?
·
What is repository? What are
the repository items?
·
What is difference between
routine and transform?
·
When you write the routines?
·
What is the complex situation
you faced in DataStage?
·
System variable, what are
system variables used your project?
·
What are the different
datastage functions used in your project?
·
Difference between star schema
and snow flake schema?
·
What is confirmed, degenerated
and junk dimension?
·
What are confirmed facts?
·
Different type of facts and
their examples?
·
What are approaches in
developing data warehouse?
·
Different types of hashed
files?
·
What are routines and
transforms? How you used in your project?
·
Difference between Data Mart
and Data Warehouse?
·
What is surrogate key? How do
you generate it?
·
What are environment variables
and global variables?
·
How do you improve the
performance of the job?
·
What is SCD? How do you
developed SCD type1 and SCD type2?
·
How do you generate surrogate
key in datastage?
·
What is job sequence?
·
What are plug-ins?
·
How much data you can get every
day?
·
What is the biggest table and
size in your schema or in your project?
·
What is the size of data
warehouse (by loading data)?
·
How do you improve the
performance of the hashed file?
·
What is IPC Stage?
·
What are the different types of
stages and used in your project?
·
What are the operations you can
do in IPC Stage and transformer stage?
·
What is merge stage? How do you
merge two flat files?
·
What is difference between ODBC
and ORACLE OCI stage?
·
What difference between
sequential file and hashed file?
·
Can you use sequential file as
source to hashed file? Have you done it? What error it will
·
give?
·
Why hashed file improve the
performance?
·
Can aggregator and transformer
stage used for sorting data? How
·
How many input links you can
give to transformer?
·
Definition of Slowly Changing
Dimensions? Types?
·
What is iconv and oconv
functions?
·
What is the advantage of using
OCI stage as compared to ODBC stage
·
What is the difference between
Interprocess and inprocess? Which one is the best?
0 comments: