; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G22490 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G22490
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Descriptionrho-N domain-containing protein 1, chloroplastic-like
Genome locationChr3:19063987..19068003
RNA-Seq ExpressionCSPI03G22490
SyntenyCSPI03G22490
Gene Ontology termsGO:0006353 - DNA-templated transcription, termination (biological process)
InterPro domainsIPR011112 - Rho termination factor, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004151986.1 rho-N domain-containing protein 1, chloroplastic isoform X1 [Cucumis sativus]2.4e-22399.76Show/hide
Query:  MSQAIHLLPHNPTGFGLSDSRCIPCSGVSGRAASFSFHSLCAEHRINVPVKFRPLNCTSLGESFTCKASSGGHRRNPDFPKQNRHGFSRSRNRQNEERES
        MSQAIHLLPHNPTGFGLSDSRCIPCSGVSGRAASFSFHSLCAEHRINVPVKFRPLNCTSLGESFTCKASSGGHRRNPDFPKQNRHGFSRSRNRQNEERES
Subjt:  MSQAIHLLPHNPTGFGLSDSRCIPCSGVSGRAASFSFHSLCAEHRINVPVKFRPLNCTSLGESFTCKASSGGHRRNPDFPKQNRHGFSRSRNRQNEERES

Query:  LDNVDESDLLLSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKVESQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGSSN
        LDNVDESDLLLSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKVE+QGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGSSN
Subjt:  LDNVDESDLLLSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKVESQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGSSN

Query:  KDISFNHVKENGPYDEGRGSSFFGLSPNLREKAQRPVSNFQRRSPVPRVKYQPIYPGESIVNSTNGMNSKGVKPNGTDTGSQLKGKVWTRQESEREHWEE
        KDISFNHVKENGPYDEGRGSSFFGLSPNLREKAQRPVSNFQRRSPVPRVKYQPIYPGESIVNSTNGMNSKGVKPNGTDTGSQLKGKVWTRQESEREHWEE
Subjt:  KDISFNHVKENGPYDEGRGSSFFGLSPNLREKAQRPVSNFQRRSPVPRVKYQPIYPGESIVNSTNGMNSKGVKPNGTDTGSQLKGKVWTRQESEREHWEE

Query:  LQSQREAEQEPEPDQEFELEPEAETYDLEHEGDEMEPELVNLLGVSSDVDDTFEDDVKDNEEFAKHGEQEHEDLNSLKLAELRAIAKSRSLRGFSKMKKS
        LQSQREAEQEPEPDQEFELEPEAETYDLEHEGDEMEPELVNLLGVSSDVDDTFEDDVKDNEEFAKHGEQEHEDLNSLKLAELRAIAKSRSLRGFSKMKKS
Subjt:  LQSQREAEQEPEPDQEFELEPEAETYDLEHEGDEMEPELVNLLGVSSDVDDTFEDDVKDNEEFAKHGEQEHEDLNSLKLAELRAIAKSRSLRGFSKMKKS

Query:  ELVQLLSNGQ
        ELVQLLSNGQ
Subjt:  ELVQLLSNGQ

XP_004151987.1 rho-N domain-containing protein 1, chloroplastic isoform X2 [Cucumis sativus]1.3e-21898.54Show/hide
Query:  MSQAIHLLPHNPTGFGLSDSRCIPCSGVSGRAASFSFHSLCAEHRINVPVKFRPLNCTSLGESFTCKASSGGHRRNPDFPKQNRHGFSRSRNRQNEERES
        MSQAIHLLPHNPT     DSRCIPCSGVSGRAASFSFHSLCAEHRINVPVKFRPLNCTSLGESFTCKASSGGHRRNPDFPKQNRHGFSRSRNRQNEERES
Subjt:  MSQAIHLLPHNPTGFGLSDSRCIPCSGVSGRAASFSFHSLCAEHRINVPVKFRPLNCTSLGESFTCKASSGGHRRNPDFPKQNRHGFSRSRNRQNEERES

Query:  LDNVDESDLLLSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKVESQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGSSN
        LDNVDESDLLLSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKVE+QGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGSSN
Subjt:  LDNVDESDLLLSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKVESQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGSSN

Query:  KDISFNHVKENGPYDEGRGSSFFGLSPNLREKAQRPVSNFQRRSPVPRVKYQPIYPGESIVNSTNGMNSKGVKPNGTDTGSQLKGKVWTRQESEREHWEE
        KDISFNHVKENGPYDEGRGSSFFGLSPNLREKAQRPVSNFQRRSPVPRVKYQPIYPGESIVNSTNGMNSKGVKPNGTDTGSQLKGKVWTRQESEREHWEE
Subjt:  KDISFNHVKENGPYDEGRGSSFFGLSPNLREKAQRPVSNFQRRSPVPRVKYQPIYPGESIVNSTNGMNSKGVKPNGTDTGSQLKGKVWTRQESEREHWEE

Query:  LQSQREAEQEPEPDQEFELEPEAETYDLEHEGDEMEPELVNLLGVSSDVDDTFEDDVKDNEEFAKHGEQEHEDLNSLKLAELRAIAKSRSLRGFSKMKKS
        LQSQREAEQEPEPDQEFELEPEAETYDLEHEGDEMEPELVNLLGVSSDVDDTFEDDVKDNEEFAKHGEQEHEDLNSLKLAELRAIAKSRSLRGFSKMKKS
Subjt:  LQSQREAEQEPEPDQEFELEPEAETYDLEHEGDEMEPELVNLLGVSSDVDDTFEDDVKDNEEFAKHGEQEHEDLNSLKLAELRAIAKSRSLRGFSKMKKS

Query:  ELVQLLSNGQ
        ELVQLLSNGQ
Subjt:  ELVQLLSNGQ

XP_008447421.1 PREDICTED: rho-N domain-containing protein 1, chloroplastic isoform X7 [Cucumis melo]2.2e-19790.12Show/hide
Query:  MSQAIHLLPHNPTGFGLSDSRCIPCSGVSGRAASFSFHSLCAEHRINVPVKFRPLNCTSLGESFTCKASSGGHRRNPDFPKQNRHGFSRSRNRQNEERES
        MSQAIHLLP NPTGFGLSDSRC+PCSGVSGRAASFSF SLCAEH INVPVKFRPLNCTSLG SFTCKASS GHRRNPDFPKQNR G+SRSRNRQNEERES
Subjt:  MSQAIHLLPHNPTGFGLSDSRCIPCSGVSGRAASFSFHSLCAEHRINVPVKFRPLNCTSLGESFTCKASSGGHRRNPDFPKQNRHGFSRSRNRQNEERES

Query:  LDNVDESDLLLSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKVESQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGSSN
        L+NVDESDLL S+NGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKK+E+QGQTKGSETVDSLLKLLRKH+VEQGKRSSG GG SN
Subjt:  LDNVDESDLLLSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKVESQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGSSN

Query:  KDISFNHVKENGPYDEGRGSSFFGLSPNLREKAQ-------RPVSNFQRRSPVPRVKYQPIYPGESIVNSTNGMNSKGVKPNGTDTGSQLKGKVWTRQES
        KDISFNHVKENGPYDEGRGSS FGLSPNLREKAQ       RP SNFQRRSPVPRVKYQPIYPGESIV+STNGMNSKG+K NGT+TGSQLK KVWTRQES
Subjt:  KDISFNHVKENGPYDEGRGSSFFGLSPNLREKAQ-------RPVSNFQRRSPVPRVKYQPIYPGESIVNSTNGMNSKGVKPNGTDTGSQLKGKVWTRQES

Query:  EREHWEELQSQREAEQEPEPDQEFELEPEAETYDLEHEGDEMEPELVNLLGVSSDVDDTFEDDVKDNEEFAKHGEQEHEDLNSLKLAELRAIAKSRSLRG
        EREHWEELQSQR+ EQEPE DQEFE+EPEAETYDLEHE DEMEPELVNLLGVSSD+DDTFEDD+KDNEEF+KHG  EHE+LNSLKLAELRAIAKSRSLRG
Subjt:  EREHWEELQSQREAEQEPEPDQEFELEPEAETYDLEHEGDEMEPELVNLLGVSSDVDDTFEDDVKDNEEFAKHGEQEHEDLNSLKLAELRAIAKSRSLRG

Query:  FSKMKKSELVQLLSN
        FSKMKKSELVQLLSN
Subjt:  FSKMKKSELVQLLSN

XP_016900423.1 PREDICTED: rho-N domain-containing protein 1, chloroplastic isoform X1 [Cucumis melo]8.8e-19485.98Show/hide
Query:  MSQAIHLLPHNPT--------------------GFGLSDSRCIPCSGVSGRAASFSFHSLCAEHRINVPVKFRPLNCTSLGESFTCKASSGGHRRNPDFP
        MSQAIHLLP NPT                    GFGLSDSRC+PCSGVSGRAASFSF SLCAEH INVPVKFRPLNCTSLG SFTCKASS GHRRNPDFP
Subjt:  MSQAIHLLPHNPT--------------------GFGLSDSRCIPCSGVSGRAASFSFHSLCAEHRINVPVKFRPLNCTSLGESFTCKASSGGHRRNPDFP

Query:  KQNRHGFSRSRNRQNEERESLDNVDESDLLLSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKVESQGQTKGSETVDSLLKL
        KQNR G+SRSRNRQNEERESL+NVDESDLL S+NGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKK+E+QGQTKGSETVDSLLKL
Subjt:  KQNRHGFSRSRNRQNEERESLDNVDESDLLLSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKVESQGQTKGSETVDSLLKL

Query:  LRKHSVEQGKRSSGGGGSSNKDISFNHVKENGPYDEGRGSSFFGLSPNLREKAQ-------RPVSNFQRRSPVPRVKYQPIYPGESIVNSTNGMNSKGVK
        LRKH+VEQGKRSSG GG SNKDISFNHVKENGPYDEGRGSS FGLSPNLREKAQ       RP SNFQRRSPVPRVKYQPIYPGESIV+STNGMNSKG+K
Subjt:  LRKHSVEQGKRSSGGGGSSNKDISFNHVKENGPYDEGRGSSFFGLSPNLREKAQ-------RPVSNFQRRSPVPRVKYQPIYPGESIVNSTNGMNSKGVK

Query:  PNGTDTGSQLKGKVWTRQESEREHWEELQSQREAEQEPEPDQEFELEPEAETYDLEHEGDEMEPELVNLLGVSSDVDDTFEDDVKDNEEFAKHGEQEHED
         NGT+TGSQLK KVWTRQESEREHWEELQSQR+ EQEPE DQEFE+EPEAETYDLEHE DEMEPELVNLLGVSSD+DDTFEDD+KDNEEF+KHG  EHE+
Subjt:  PNGTDTGSQLKGKVWTRQESEREHWEELQSQREAEQEPEPDQEFELEPEAETYDLEHEGDEMEPELVNLLGVSSDVDDTFEDDVKDNEEFAKHGEQEHED

Query:  LNSLKLAELRAIAKSRSLRGFSKMKKSELVQLLSN
        LNSLKLAELRAIAKSRSLRGFSKMKKSELVQLLSN
Subjt:  LNSLKLAELRAIAKSRSLRGFSKMKKSELVQLLSN

XP_016900425.1 PREDICTED: rho-N domain-containing protein 1, chloroplastic isoform X4 [Cucumis melo]1.8e-19487.18Show/hide
Query:  MSQAIHLLPHNPT--------------GFGLSDSRCIPCSGVSGRAASFSFHSLCAEHRINVPVKFRPLNCTSLGESFTCKASSGGHRRNPDFPKQNRHG
        MSQAIHLLP NPT              GFGLSDSRC+PCSGVSGRAASFSF SLCAEH INVPVKFRPLNCTSLG SFTCKASS GHRRNPDFPKQNR G
Subjt:  MSQAIHLLPHNPT--------------GFGLSDSRCIPCSGVSGRAASFSFHSLCAEHRINVPVKFRPLNCTSLGESFTCKASSGGHRRNPDFPKQNRHG

Query:  FSRSRNRQNEERESLDNVDESDLLLSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKVESQGQTKGSETVDSLLKLLRKHSV
        +SRSRNRQNEERESL+NVDESDLL S+NGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKK+E+QGQTKGSETVDSLLKLLRKH+V
Subjt:  FSRSRNRQNEERESLDNVDESDLLLSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKVESQGQTKGSETVDSLLKLLRKHSV

Query:  EQGKRSSGGGGSSNKDISFNHVKENGPYDEGRGSSFFGLSPNLREKAQ-------RPVSNFQRRSPVPRVKYQPIYPGESIVNSTNGMNSKGVKPNGTDT
        EQGKRSSG GG SNKDISFNHVKENGPYDEGRGSS FGLSPNLREKAQ       RP SNFQRRSPVPRVKYQPIYPGESIV+STNGMNSKG+K NGT+T
Subjt:  EQGKRSSGGGGSSNKDISFNHVKENGPYDEGRGSSFFGLSPNLREKAQ-------RPVSNFQRRSPVPRVKYQPIYPGESIVNSTNGMNSKGVKPNGTDT

Query:  GSQLKGKVWTRQESEREHWEELQSQREAEQEPEPDQEFELEPEAETYDLEHEGDEMEPELVNLLGVSSDVDDTFEDDVKDNEEFAKHGEQEHEDLNSLKL
        GSQLK KVWTRQESEREHWEELQSQR+ EQEPE DQEFE+EPEAETYDLEHE DEMEPELVNLLGVSSD+DDTFEDD+KDNEEF+KHG  EHE+LNSLKL
Subjt:  GSQLKGKVWTRQESEREHWEELQSQREAEQEPEPDQEFELEPEAETYDLEHEGDEMEPELVNLLGVSSDVDDTFEDDVKDNEEFAKHGEQEHEDLNSLKL

Query:  AELRAIAKSRSLRGFSKMKKSELVQLLSN
        AELRAIAKSRSLRGFSKMKKSELVQLLSN
Subjt:  AELRAIAKSRSLRGFSKMKKSELVQLLSN

TrEMBL top hitse value%identityAlignment
A0A0A0L7X8 Rho_N domain-containing protein1.1e-22399.76Show/hide
Query:  MSQAIHLLPHNPTGFGLSDSRCIPCSGVSGRAASFSFHSLCAEHRINVPVKFRPLNCTSLGESFTCKASSGGHRRNPDFPKQNRHGFSRSRNRQNEERES
        MSQAIHLLPHNPTGFGLSDSRCIPCSGVSGRAASFSFHSLCAEHRINVPVKFRPLNCTSLGESFTCKASSGGHRRNPDFPKQNRHGFSRSRNRQNEERES
Subjt:  MSQAIHLLPHNPTGFGLSDSRCIPCSGVSGRAASFSFHSLCAEHRINVPVKFRPLNCTSLGESFTCKASSGGHRRNPDFPKQNRHGFSRSRNRQNEERES

Query:  LDNVDESDLLLSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKVESQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGSSN
        LDNVDESDLLLSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKVE+QGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGSSN
Subjt:  LDNVDESDLLLSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKVESQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGSSN

Query:  KDISFNHVKENGPYDEGRGSSFFGLSPNLREKAQRPVSNFQRRSPVPRVKYQPIYPGESIVNSTNGMNSKGVKPNGTDTGSQLKGKVWTRQESEREHWEE
        KDISFNHVKENGPYDEGRGSSFFGLSPNLREKAQRPVSNFQRRSPVPRVKYQPIYPGESIVNSTNGMNSKGVKPNGTDTGSQLKGKVWTRQESEREHWEE
Subjt:  KDISFNHVKENGPYDEGRGSSFFGLSPNLREKAQRPVSNFQRRSPVPRVKYQPIYPGESIVNSTNGMNSKGVKPNGTDTGSQLKGKVWTRQESEREHWEE

Query:  LQSQREAEQEPEPDQEFELEPEAETYDLEHEGDEMEPELVNLLGVSSDVDDTFEDDVKDNEEFAKHGEQEHEDLNSLKLAELRAIAKSRSLRGFSKMKKS
        LQSQREAEQEPEPDQEFELEPEAETYDLEHEGDEMEPELVNLLGVSSDVDDTFEDDVKDNEEFAKHGEQEHEDLNSLKLAELRAIAKSRSLRGFSKMKKS
Subjt:  LQSQREAEQEPEPDQEFELEPEAETYDLEHEGDEMEPELVNLLGVSSDVDDTFEDDVKDNEEFAKHGEQEHEDLNSLKLAELRAIAKSRSLRGFSKMKKS

Query:  ELVQLLSNGQ
        ELVQLLSNGQ
Subjt:  ELVQLLSNGQ

A0A1S3BHF2 rho-N domain-containing protein 1, chloroplastic isoform X71.1e-19790.12Show/hide
Query:  MSQAIHLLPHNPTGFGLSDSRCIPCSGVSGRAASFSFHSLCAEHRINVPVKFRPLNCTSLGESFTCKASSGGHRRNPDFPKQNRHGFSRSRNRQNEERES
        MSQAIHLLP NPTGFGLSDSRC+PCSGVSGRAASFSF SLCAEH INVPVKFRPLNCTSLG SFTCKASS GHRRNPDFPKQNR G+SRSRNRQNEERES
Subjt:  MSQAIHLLPHNPTGFGLSDSRCIPCSGVSGRAASFSFHSLCAEHRINVPVKFRPLNCTSLGESFTCKASSGGHRRNPDFPKQNRHGFSRSRNRQNEERES

Query:  LDNVDESDLLLSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKVESQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGSSN
        L+NVDESDLL S+NGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKK+E+QGQTKGSETVDSLLKLLRKH+VEQGKRSSG GG SN
Subjt:  LDNVDESDLLLSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKVESQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGSSN

Query:  KDISFNHVKENGPYDEGRGSSFFGLSPNLREKAQ-------RPVSNFQRRSPVPRVKYQPIYPGESIVNSTNGMNSKGVKPNGTDTGSQLKGKVWTRQES
        KDISFNHVKENGPYDEGRGSS FGLSPNLREKAQ       RP SNFQRRSPVPRVKYQPIYPGESIV+STNGMNSKG+K NGT+TGSQLK KVWTRQES
Subjt:  KDISFNHVKENGPYDEGRGSSFFGLSPNLREKAQ-------RPVSNFQRRSPVPRVKYQPIYPGESIVNSTNGMNSKGVKPNGTDTGSQLKGKVWTRQES

Query:  EREHWEELQSQREAEQEPEPDQEFELEPEAETYDLEHEGDEMEPELVNLLGVSSDVDDTFEDDVKDNEEFAKHGEQEHEDLNSLKLAELRAIAKSRSLRG
        EREHWEELQSQR+ EQEPE DQEFE+EPEAETYDLEHE DEMEPELVNLLGVSSD+DDTFEDD+KDNEEF+KHG  EHE+LNSLKLAELRAIAKSRSLRG
Subjt:  EREHWEELQSQREAEQEPEPDQEFELEPEAETYDLEHEGDEMEPELVNLLGVSSDVDDTFEDDVKDNEEFAKHGEQEHEDLNSLKLAELRAIAKSRSLRG

Query:  FSKMKKSELVQLLSN
        FSKMKKSELVQLLSN
Subjt:  FSKMKKSELVQLLSN

A0A1S3BIA8 rho-N domain-containing protein 1, chloroplastic isoform X61.6e-19387.5Show/hide
Query:  MSQAIHLLPHNPTG---------FGLSDSRCIPCSGVSGRAASFSFHSLCAEHRINVPVKFRPLNCTSLGESFTCKASSGGHRRNPDFPKQNRHGFSRSR
        MSQAIHLLP NPTG         F + DSRC+PCSGVSGRAASFSF SLCAEH INVPVKFRPLNCTSLG SFTCKASS GHRRNPDFPKQNR G+SRSR
Subjt:  MSQAIHLLPHNPTG---------FGLSDSRCIPCSGVSGRAASFSFHSLCAEHRINVPVKFRPLNCTSLGESFTCKASSGGHRRNPDFPKQNRHGFSRSR

Query:  NRQNEERESLDNVDESDLLLSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKVESQGQTKGSETVDSLLKLLRKHSVEQGKR
        NRQNEERESL+NVDESDLL S+NGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKK+E+QGQTKGSETVDSLLKLLRKH+VEQGKR
Subjt:  NRQNEERESLDNVDESDLLLSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKVESQGQTKGSETVDSLLKLLRKHSVEQGKR

Query:  SSGGGGSSNKDISFNHVKENGPYDEGRGSSFFGLSPNLREKAQ-------RPVSNFQRRSPVPRVKYQPIYPGESIVNSTNGMNSKGVKPNGTDTGSQLK
        SSG GG SNKDISFNHVKENGPYDEGRGSS FGLSPNLREKAQ       RP SNFQRRSPVPRVKYQPIYPGESIV+STNGMNSKG+K NGT+TGSQLK
Subjt:  SSGGGGSSNKDISFNHVKENGPYDEGRGSSFFGLSPNLREKAQ-------RPVSNFQRRSPVPRVKYQPIYPGESIVNSTNGMNSKGVKPNGTDTGSQLK

Query:  GKVWTRQESEREHWEELQSQREAEQEPEPDQEFELEPEAETYDLEHEGDEMEPELVNLLGVSSDVDDTFEDDVKDNEEFAKHGEQEHEDLNSLKLAELRA
         KVWTRQESEREHWEELQSQR+ EQEPE DQEFE+EPEAETYDLEHE DEMEPELVNLLGVSSD+DDTFEDD+KDNEEF+KHG  EHE+LNSLKLAELRA
Subjt:  GKVWTRQESEREHWEELQSQREAEQEPEPDQEFELEPEAETYDLEHEGDEMEPELVNLLGVSSDVDDTFEDDVKDNEEFAKHGEQEHEDLNSLKLAELRA

Query:  IAKSRSLRGFSKMKKSELVQLLSN
        IAKSRSLRGFSKMKKSELVQLLSN
Subjt:  IAKSRSLRGFSKMKKSELVQLLSN

A0A1S4DWS1 rho-N domain-containing protein 1, chloroplastic isoform X14.2e-19485.98Show/hide
Query:  MSQAIHLLPHNPT--------------------GFGLSDSRCIPCSGVSGRAASFSFHSLCAEHRINVPVKFRPLNCTSLGESFTCKASSGGHRRNPDFP
        MSQAIHLLP NPT                    GFGLSDSRC+PCSGVSGRAASFSF SLCAEH INVPVKFRPLNCTSLG SFTCKASS GHRRNPDFP
Subjt:  MSQAIHLLPHNPT--------------------GFGLSDSRCIPCSGVSGRAASFSFHSLCAEHRINVPVKFRPLNCTSLGESFTCKASSGGHRRNPDFP

Query:  KQNRHGFSRSRNRQNEERESLDNVDESDLLLSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKVESQGQTKGSETVDSLLKL
        KQNR G+SRSRNRQNEERESL+NVDESDLL S+NGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKK+E+QGQTKGSETVDSLLKL
Subjt:  KQNRHGFSRSRNRQNEERESLDNVDESDLLLSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKVESQGQTKGSETVDSLLKL

Query:  LRKHSVEQGKRSSGGGGSSNKDISFNHVKENGPYDEGRGSSFFGLSPNLREKAQ-------RPVSNFQRRSPVPRVKYQPIYPGESIVNSTNGMNSKGVK
        LRKH+VEQGKRSSG GG SNKDISFNHVKENGPYDEGRGSS FGLSPNLREKAQ       RP SNFQRRSPVPRVKYQPIYPGESIV+STNGMNSKG+K
Subjt:  LRKHSVEQGKRSSGGGGSSNKDISFNHVKENGPYDEGRGSSFFGLSPNLREKAQ-------RPVSNFQRRSPVPRVKYQPIYPGESIVNSTNGMNSKGVK

Query:  PNGTDTGSQLKGKVWTRQESEREHWEELQSQREAEQEPEPDQEFELEPEAETYDLEHEGDEMEPELVNLLGVSSDVDDTFEDDVKDNEEFAKHGEQEHED
         NGT+TGSQLK KVWTRQESEREHWEELQSQR+ EQEPE DQEFE+EPEAETYDLEHE DEMEPELVNLLGVSSD+DDTFEDD+KDNEEF+KHG  EHE+
Subjt:  PNGTDTGSQLKGKVWTRQESEREHWEELQSQREAEQEPEPDQEFELEPEAETYDLEHEGDEMEPELVNLLGVSSDVDDTFEDDVKDNEEFAKHGEQEHED

Query:  LNSLKLAELRAIAKSRSLRGFSKMKKSELVQLLSN
        LNSLKLAELRAIAKSRSLRGFSKMKKSELVQLLSN
Subjt:  LNSLKLAELRAIAKSRSLRGFSKMKKSELVQLLSN

A0A1S4DXI1 rho-N domain-containing protein 1, chloroplastic isoform X48.6e-19587.18Show/hide
Query:  MSQAIHLLPHNPT--------------GFGLSDSRCIPCSGVSGRAASFSFHSLCAEHRINVPVKFRPLNCTSLGESFTCKASSGGHRRNPDFPKQNRHG
        MSQAIHLLP NPT              GFGLSDSRC+PCSGVSGRAASFSF SLCAEH INVPVKFRPLNCTSLG SFTCKASS GHRRNPDFPKQNR G
Subjt:  MSQAIHLLPHNPT--------------GFGLSDSRCIPCSGVSGRAASFSFHSLCAEHRINVPVKFRPLNCTSLGESFTCKASSGGHRRNPDFPKQNRHG

Query:  FSRSRNRQNEERESLDNVDESDLLLSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKVESQGQTKGSETVDSLLKLLRKHSV
        +SRSRNRQNEERESL+NVDESDLL S+NGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKK+E+QGQTKGSETVDSLLKLLRKH+V
Subjt:  FSRSRNRQNEERESLDNVDESDLLLSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKVESQGQTKGSETVDSLLKLLRKHSV

Query:  EQGKRSSGGGGSSNKDISFNHVKENGPYDEGRGSSFFGLSPNLREKAQ-------RPVSNFQRRSPVPRVKYQPIYPGESIVNSTNGMNSKGVKPNGTDT
        EQGKRSSG GG SNKDISFNHVKENGPYDEGRGSS FGLSPNLREKAQ       RP SNFQRRSPVPRVKYQPIYPGESIV+STNGMNSKG+K NGT+T
Subjt:  EQGKRSSGGGGSSNKDISFNHVKENGPYDEGRGSSFFGLSPNLREKAQ-------RPVSNFQRRSPVPRVKYQPIYPGESIVNSTNGMNSKGVKPNGTDT

Query:  GSQLKGKVWTRQESEREHWEELQSQREAEQEPEPDQEFELEPEAETYDLEHEGDEMEPELVNLLGVSSDVDDTFEDDVKDNEEFAKHGEQEHEDLNSLKL
        GSQLK KVWTRQESEREHWEELQSQR+ EQEPE DQEFE+EPEAETYDLEHE DEMEPELVNLLGVSSD+DDTFEDD+KDNEEF+KHG  EHE+LNSLKL
Subjt:  GSQLKGKVWTRQESEREHWEELQSQREAEQEPEPDQEFELEPEAETYDLEHEGDEMEPELVNLLGVSSDVDDTFEDDVKDNEEFAKHGEQEHEDLNSLKL

Query:  AELRAIAKSRSLRGFSKMKKSELVQLLSN
        AELRAIAKSRSLRGFSKMKKSELVQLLSN
Subjt:  AELRAIAKSRSLRGFSKMKKSELVQLLSN

SwissProt top hitse value%identityAlignment
Q8L4E7 SAP-like protein BP-738.6e-3535.93Show/hide
Query:  SFTCKASSGGHR-RNPDFPKQNRHGFSRSRNRQNEERESLDNVDE--SDLLLSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEE
        S  C A+   HR R+ D  +  + G +R +++  +E++  +N+DE  +D++ SKNGP +S++S  + QAT+ PG REKEIVELF++VQAQLR R   KEE
Subjt:  SFTCKASSGGHR-RNPDFPKQNRHGFSRSRNRQNEERESLDNVDE--SDLLLSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEE

Query:  KKVESQGQTKGSE-TVDSLLKLLRKHSVEQGKRSSGGGGSSNKDISFNHVKENGPYDEGRGSSFFGLSPNLREKA-------QRPVSNFQRRSPVPRVKY
        KK E Q + +G   +VDSLL LLRKHSV+Q ++S        K+ S +  K +      + SS F  +    E+        +RP SNF+RRSPVP VK+
Subjt:  KKVESQGQTKGSE-TVDSLLKLLRKHSVEQGKRSSGGGGSSNKDISFNHVKENGPYDEGRGSSFFGLSPNLREKA-------QRPVSNFQRRSPVPRVKY

Query:  QPI--YPGESIVNSTNGMNSKGVKPNGTDTGSQLKGKVWTRQESEREHWEELQSQREAEQEPEPDQEFELEPEAETYDLEHEGDEMEPELVNLLGVSSDV
        QP+     E ++N+ N    +  KP        L+ K  T  E +     E  S  E E     D +   + E +  D +    E +   + +  V   +
Subjt:  QPI--YPGESIVNSTNGMNSKGVKPNGTDTGSQLKGKVWTRQESEREHWEELQSQREAEQEPEPDQEFELEPEAETYDLEHEGDEMEPELVNLLGVSSDV

Query:  DDTFEDDVKDNEEFAKHGEQEHEDLNSLKLAELRAIAKSRSLRGFSKMKKSELVQLLSN
        D++ +  +K +            DL++LK+ ELR +AKSR ++G+SKMKK++LV+LLSN
Subjt:  DDTFEDDVKDNEEFAKHGEQEHEDLNSLKLAELRAIAKSRSLRGFSKMKKSELVQLLSN

Q94K75 Rho-N domain-containing protein 1, chloroplastic1.1e-6144.05Show/hide
Query:  MSQAIHLLPHNPTGFGLSDSRCIPCSGVSGRAASFSFHSLCAEHRINVPVKFRPLNCTSLGESFTCKASSGGHRRNPDFPKQNRHGFSRSRNRQNEERES
        MS   HL      G+ LSDSRC   S VS R  +    S C +H+ N  +K  P        SF C+ASSGG+RRNPDF + N+HG+ R  NRQ+  RE 
Subjt:  MSQAIHLLPHNPTGFGLSDSRCIPCSGVSGRAASFSFHSLCAEHRINVPVKFRPLNCTSLGESFTCKASSGGHRRNPDFPKQNRHGFSRSRNRQNEERES

Query:  LDNVDESDLLLSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRER-AAMKEEKKVE--SQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGG
         D ++ SD+L S+NGPL ++SS+PK QAT++PGPREKEIVELFRKVQAQLR R AA KEEKK+E  S+GQ K SETVDSLLKLLRKHS EQ KR      
Subjt:  LDNVDESDLLLSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRER-AAMKEEKKVE--SQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGG

Query:  SSNKDISFNHVKENGPYDEGRGSSFFGLSPNLREKAQRPVSNFQRRSPVPRVKYQPIYPGESIVNSTNGMNSKGVKPNGTDTGSQLKGKVWTRQESEREH
        SS  ++  + V +    D        G   N      RP S+F+R+SPVPR +  P Y  E+  + ++  +          T +Q K  V    E E E 
Subjt:  SSNKDISFNHVKENGPYDEGRGSSFFGLSPNLREKAQRPVSNFQRRSPVPRVKYQPIYPGESIVNSTNGMNSKGVKPNGTDTGSQLKGKVWTRQESEREH

Query:  WEELQSQREAEQEPEP-----DQEFELEPEAETYDLEHEGDEMEPELVN----LLGVSSDVDDTFEDDVKDNEEFAKHGEQEHEDLNSLKLAELRAIAKS
          E + + E E EP P     + + EL+PE+ ++  E E D++  ++++    +L V SD D++ +D  +D++E     E+  +DL+ LKL ELR IAKS
Subjt:  WEELQSQREAEQEPEP-----DQEFELEPEAETYDLEHEGDEMEPELVN----LLGVSSDVDDTFEDDVKDNEEFAKHGEQEHEDLNSLKLAELRAIAKS

Query:  RSLRGFSKMKKSELVQLLSN
        R L+G SKMKK+ELV+LL +
Subjt:  RSLRGFSKMKKSELVQLLSN

Arabidopsis top hitse value%identityAlignment
AT1G06190.1 Rho termination factor7.7e-6344.05Show/hide
Query:  MSQAIHLLPHNPTGFGLSDSRCIPCSGVSGRAASFSFHSLCAEHRINVPVKFRPLNCTSLGESFTCKASSGGHRRNPDFPKQNRHGFSRSRNRQNEERES
        MS   HL      G+ LSDSRC   S VS R  +    S C +H+ N  +K  P        SF C+ASSGG+RRNPDF + N+HG+ R  NRQ+  RE 
Subjt:  MSQAIHLLPHNPTGFGLSDSRCIPCSGVSGRAASFSFHSLCAEHRINVPVKFRPLNCTSLGESFTCKASSGGHRRNPDFPKQNRHGFSRSRNRQNEERES

Query:  LDNVDESDLLLSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRER-AAMKEEKKVE--SQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGG
         D ++ SD+L S+NGPL ++SS+PK QAT++PGPREKEIVELFRKVQAQLR R AA KEEKK+E  S+GQ K SETVDSLLKLLRKHS EQ KR      
Subjt:  LDNVDESDLLLSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRER-AAMKEEKKVE--SQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGG

Query:  SSNKDISFNHVKENGPYDEGRGSSFFGLSPNLREKAQRPVSNFQRRSPVPRVKYQPIYPGESIVNSTNGMNSKGVKPNGTDTGSQLKGKVWTRQESEREH
        SS  ++  + V +    D        G   N      RP S+F+R+SPVPR +  P Y  E+  + ++  +          T +Q K  V    E E E 
Subjt:  SSNKDISFNHVKENGPYDEGRGSSFFGLSPNLREKAQRPVSNFQRRSPVPRVKYQPIYPGESIVNSTNGMNSKGVKPNGTDTGSQLKGKVWTRQESEREH

Query:  WEELQSQREAEQEPEP-----DQEFELEPEAETYDLEHEGDEMEPELVN----LLGVSSDVDDTFEDDVKDNEEFAKHGEQEHEDLNSLKLAELRAIAKS
          E + + E E EP P     + + EL+PE+ ++  E E D++  ++++    +L V SD D++ +D  +D++E     E+  +DL+ LKL ELR IAKS
Subjt:  WEELQSQREAEQEPEP-----DQEFELEPEAETYDLEHEGDEMEPELVN----LLGVSSDVDDTFEDDVKDNEEFAKHGEQEHEDLNSLKLAELRAIAKS

Query:  RSLRGFSKMKKSELVQLLSN
        R L+G SKMKK+ELV+LL +
Subjt:  RSLRGFSKMKKSELVQLLSN

AT1G06190.2 Rho termination factor3.1e-4848.88Show/hide
Query:  MSQAIHLLPHNPTGFGLSDSRCIPCSGVSGRAASFSFHSLCAEHRINVPVKFRPLNCTSLGESFTCKASSGGHRRNPDFPKQNRHGFSRSRNRQNEERES
        MS   HL      G+ LSDSRC   S VS R  +    S C +H+ N  +K  P        SF C+ASSGG+RRNPDF + N+HG+ R  NRQ+  RE 
Subjt:  MSQAIHLLPHNPTGFGLSDSRCIPCSGVSGRAASFSFHSLCAEHRINVPVKFRPLNCTSLGESFTCKASSGGHRRNPDFPKQNRHGFSRSRNRQNEERES

Query:  LDNVDESDLLLSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRER-AAMKEEKKVE--SQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGG
         D ++ SD+L S+NGPL ++SS+PK QAT++PGPREKEIVELFRKVQAQLR R AA KEEKK+E  S+GQ K SETVDSLLKLLRKHS EQ KR      
Subjt:  LDNVDESDLLLSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRER-AAMKEEKKVE--SQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGG

Query:  SSNKDISFNHVKENGPYDEGRGSSFFGLSPNLREKAQRPVSNFQRRSPVPRVKYQPIYPGESIVNSTN
        SS  ++  + V +    D        G   N      RP S+F+R+SPVPR +  P Y  E+  + ++
Subjt:  SSNKDISFNHVKENGPYDEGRGSSFFGLSPNLREKAQRPVSNFQRRSPVPRVKYQPIYPGESIVNSTN

AT2G31150.1 ATP binding;ATPases, coupled to transmembrane movement of ions, phosphorylative mechanism1.0e-2239.81Show/hide
Query:  HRR-NPDFPKQNRHGFSRSRNRQNEERESL-DNVDESDLLLSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRER-AAMKEEKKVE--SQGQ
        HRR NPDF + N+HGF R RNR+NE+++ L D   E D+L SKN                     EKEIVELF+KVQ QLR R AA KEEKK E  S+GQ
Subjt:  HRR-NPDFPKQNRHGFSRSRNRQNEERESL-DNVDESDLLLSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRER-AAMKEEKKVE--SQGQ

Query:  -TKGSETVDSLLKLLRKHSVEQGKRSSGGGGSSNKDISFNHVKENGPYDEGRGSSFFGLSPNLREKA---QRPVSNFQRRSPVPRVKYQPIYPGESIVNS
          K SETVDSLLKLLRKHS EQ K+      S  +    +   E     +   SS F  S N    A    RP S+F+R SPVPR K Q  Y  E+I + 
Subjt:  -TKGSETVDSLLKLLRKHSVEQGKRSSGGGGSSNKDISFNHVKENGPYDEGRGSSFFGLSPNLREKA---QRPVSNFQRRSPVPRVKYQPIYPGESIVNS

Query:  TNGMNSKGVKPNGTDTGSQLKGKVWTRQESEREHWEELQSQREAEQEPEPDQEFELEPEAETYDLE-----------HEGDEMEPELVNLLGVSSDVDDT
         +  +          T +Q K +V +R E E E   E +S  E + EPEP+ E+E E E E   LE            E DE E +   ++   SD D++
Subjt:  TNGMNSKGVKPNGTDTGSQLKGKVWTRQESEREHWEELQSQREAEQEPEPDQEFELEPEAETYDLE-----------HEGDEMEPELVNLLGVSSDVDDT

Query:  FEDDVKDNE
           D +  +
Subjt:  FEDDVKDNE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCAAGCCATTCATCTTCTTCCTCACAACCCTACAGGCTTTGGACTGTCAGATAGCAGATGCATACCTTGCTCTGGAGTTTCAGGACGAGCAGCCTCGTTCTCTTT
TCACTCTTTATGTGCTGAACATAGAATCAATGTACCAGTCAAATTTAGACCTCTAAACTGTACTTCGTTGGGGGAGTCTTTTACGTGCAAAGCCAGCTCAGGAGGTCACA
GGAGAAACCCAGATTTTCCAAAGCAAAATAGGCATGGCTTCTCAAGAAGCAGAAATAGGCAAAATGAGGAGAGAGAGAGTCTTGACAATGTTGATGAATCTGATTTATTA
TTGTCTAAGAATGGACCATTACTTTCCATCTCTAGCACCCCAAAATCCCAGGCCACTGCTACCCCAGGACCAAGGGAAAAGGAAATTGTTGAACTTTTCAGAAAGGTTCA
AGCTCAGCTTCGGGAGCGAGCAGCAATGAAAGAAGAGAAGAAAGTGGAATCTCAAGGACAAACGAAAGGGAGTGAGACAGTAGATTCTCTTCTTAAGCTATTGCGAAAGC
ATTCGGTGGAACAAGGGAAGAGAAGCAGTGGTGGGGGTGGCAGCAGCAACAAGGACATCAGTTTTAACCATGTCAAAGAGAATGGTCCTTATGATGAAGGAAGAGGCTCA
AGCTTTTTTGGACTAAGTCCCAATTTGAGGGAGAAGGCCCAAAGACCCGTATCAAATTTCCAACGTAGATCCCCCGTGCCTCGGGTGAAGTACCAACCAATTTACCCTGG
GGAAAGTATTGTCAACTCGACCAATGGTATGAATTCAAAAGGTGTGAAACCTAATGGAACTGATACAGGTTCTCAACTGAAAGGAAAGGTATGGACTCGGCAAGAGTCGG
AACGAGAGCACTGGGAAGAGCTGCAATCACAAAGGGAGGCAGAACAGGAGCCAGAGCCGGACCAAGAGTTCGAATTGGAACCAGAGGCTGAAACATATGATCTAGAGCAT
GAAGGTGATGAAATGGAGCCTGAACTTGTTAATTTATTAGGTGTGTCATCAGACGTCGATGACACGTTTGAGGATGATGTTAAAGACAATGAGGAATTTGCAAAGCATGG
TGAACAAGAACATGAGGACTTGAACTCATTGAAACTTGCTGAACTGAGGGCGATTGCCAAATCTCGTAGTTTGAGAGGGTTTTCGAAGATGAAGAAGAGCGAGCTCGTGC
AATTGTTAAGCAACGGTCAGTAA
mRNA sequenceShow/hide mRNA sequence
CAATGATAAATTCACAATTACAGGCCAGCAGGACAAAGAAGAACGAACCGGAAAAGCGAAATTCTTCAAAGAAATGCAAATAAAAAACCCTTCCTTTCCTCACTAAAACC
TTTCCTTCCTTCCAGGTGTCTTAACCAGCAATGTCTCAAGCCATTCATCTTCTTCCTCACAACCCTACAGGCTTTGGACTGTCAGATAGCAGATGCATACCTTGCTCTGG
AGTTTCAGGACGAGCAGCCTCGTTCTCTTTTCACTCTTTATGTGCTGAACATAGAATCAATGTACCAGTCAAATTTAGACCTCTAAACTGTACTTCGTTGGGGGAGTCTT
TTACGTGCAAAGCCAGCTCAGGAGGTCACAGGAGAAACCCAGATTTTCCAAAGCAAAATAGGCATGGCTTCTCAAGAAGCAGAAATAGGCAAAATGAGGAGAGAGAGAGT
CTTGACAATGTTGATGAATCTGATTTATTATTGTCTAAGAATGGACCATTACTTTCCATCTCTAGCACCCCAAAATCCCAGGCCACTGCTACCCCAGGACCAAGGGAAAA
GGAAATTGTTGAACTTTTCAGAAAGGTTCAAGCTCAGCTTCGGGAGCGAGCAGCAATGAAAGAAGAGAAGAAAGTGGAATCTCAAGGACAAACGAAAGGGAGTGAGACAG
TAGATTCTCTTCTTAAGCTATTGCGAAAGCATTCGGTGGAACAAGGGAAGAGAAGCAGTGGTGGGGGTGGCAGCAGCAACAAGGACATCAGTTTTAACCATGTCAAAGAG
AATGGTCCTTATGATGAAGGAAGAGGCTCAAGCTTTTTTGGACTAAGTCCCAATTTGAGGGAGAAGGCCCAAAGACCCGTATCAAATTTCCAACGTAGATCCCCCGTGCC
TCGGGTGAAGTACCAACCAATTTACCCTGGGGAAAGTATTGTCAACTCGACCAATGGTATGAATTCAAAAGGTGTGAAACCTAATGGAACTGATACAGGTTCTCAACTGA
AAGGAAAGGTATGGACTCGGCAAGAGTCGGAACGAGAGCACTGGGAAGAGCTGCAATCACAAAGGGAGGCAGAACAGGAGCCAGAGCCGGACCAAGAGTTCGAATTGGAA
CCAGAGGCTGAAACATATGATCTAGAGCATGAAGGTGATGAAATGGAGCCTGAACTTGTTAATTTATTAGGTGTGTCATCAGACGTCGATGACACGTTTGAGGATGATGT
TAAAGACAATGAGGAATTTGCAAAGCATGGTGAACAAGAACATGAGGACTTGAACTCATTGAAACTTGCTGAACTGAGGGCGATTGCCAAATCTCGTAGTTTGAGAGGGT
TTTCGAAGATGAAGAAGAGCGAGCTCGTGCAATTGTTAAGCAACGGTCAGTAAGGATGTCATACTGGACAGGAACATCTGGATTTAGGTTGTTTAGGTGTTTTTTTGTAC
TTATTTTTGTTGTGCTTATCAATCTGTTTGACTAGTTTTGAGTTAGAAATTGGTATTAGCTTTTTACAAACTCTTTCCCAGATTGTATATCAGGCTAATCTTCGCAGGTT
CAGAGTACAAGTCAACAACAATCTCTTCTTTTAGTTTTTTCTCTTCTATTCAGACTCTTGTTCACTTATAGGAACCATTTTTTAAAGGGC
Protein sequenceShow/hide protein sequence
MSQAIHLLPHNPTGFGLSDSRCIPCSGVSGRAASFSFHSLCAEHRINVPVKFRPLNCTSLGESFTCKASSGGHRRNPDFPKQNRHGFSRSRNRQNEERESLDNVDESDLL
LSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKVESQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGSSNKDISFNHVKENGPYDEGRGS
SFFGLSPNLREKAQRPVSNFQRRSPVPRVKYQPIYPGESIVNSTNGMNSKGVKPNGTDTGSQLKGKVWTRQESEREHWEELQSQREAEQEPEPDQEFELEPEAETYDLEH
EGDEMEPELVNLLGVSSDVDDTFEDDVKDNEEFAKHGEQEHEDLNSLKLAELRAIAKSRSLRGFSKMKKSELVQLLSNGQ