; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC07G126080 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC07G126080
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
Descriptionrho-N domain-containing protein 1, chloroplastic-like
Genome locationCiama_Chr07:1210984..1214544
RNA-Seq ExpressionCaUC07G126080
SyntenyCaUC07G126080
Gene Ontology termsGO:0006353 - DNA-templated transcription, termination (biological process)
InterPro domainsIPR011112 - Rho termination factor, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008447421.1 PREDICTED: rho-N domain-containing protein 1, chloroplastic isoform X7 [Cucumis melo]4.5e-18378.32Show/hide
Query:  MSQAIHLLPNNLTVRLRDSWICFAASCSRLLGLKKIYVFSFHQFVEEGSRCNSFGLSDSICLPCSGVSGRAASVSSRTLCAEHKINAPVKFRPLNCTSLG
        MSQAIHLLP N T                                        FGLSDS CLPCSGVSGRAAS S R+LCAEH IN PVKFRPLNCTSLG
Subjt:  MSQAIHLLPNNLTVRLRDSWICFAASCSRLLGLKKIYVFSFHQFVEEGSRCNSFGLSDSICLPCSGVSGRAASVSSRTLCAEHKINAPVKFRPLNCTSLG

Query:  LSFTCKASSGGHRRNPDFSKQNRHGFSRSRNRHNEERESLDNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKK
        +SFTCKASS GHRRNPDF KQNR G+SRSRNR NEERESL+NVDESDLLSS+NGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKK
Subjt:  LSFTCKASSGGHRRNPDFSKQNRHGFSRSRNRHNEERESLDNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKK

Query:  MEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGSSKDFSFNHVKENGPYDEGKGTSIFGLSANTRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIY
        MEAQGQTKGSETVDSLLKLLRKH+VEQGKRSSG GGS+KD SFNHVKENGPYDEG+G+SIFGLS N R+KAQEP GSF RP SNFQR+SPVPRVKYQPIY
Subjt:  MEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGSSKDFSFNHVKENGPYDEGKGTSIFGLSANTRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIY

Query:  PGESVVNSTDGVNAKGVKLNGTEAGSQLKAKVWTRQESEREHWEELQSQGETEQEPEPDQEYELEPEAESYELDHEPDEMEPELVNLLGVSSDIDDKIED
        PGES+V+ST+G+N+KG+KLNGTE GSQLKAKVWTRQESEREHWEELQSQ +TEQEPE DQE+E+EPEAE+Y+L+HE DEMEPELVNLLGVSSDIDD  ED
Subjt:  PGESVVNSTDGVNAKGVKLNGTEAGSQLKAKVWTRQESEREHWEELQSQGETEQEPEPDQEYELEPEAESYELDHEPDEMEPELVNLLGVSSDIDDKIED

Query:  NVKDNEKFAKHDEHVDLNSLKLAELRAIAKSRSLKGFSKMKKSELVRLLSDA
        ++KDNE+F+KH EH +LNSLKLAELRAIAKSRSL+GFSKMKKSELV+LLS++
Subjt:  NVKDNEKFAKHDEHVDLNSLKLAELRAIAKSRSLKGFSKMKKSELVRLLSDA

XP_016900423.1 PREDICTED: rho-N domain-containing protein 1, chloroplastic isoform X1 [Cucumis melo]9.7e-18679.87Show/hide
Query:  MSQAIHLLPNNLTVRLRDSWICFAASCSRLLGLKKIYVFSFHQFVEEGSRCNSFGLSDSICLPCSGVSGRAASVSSRTLCAEHKINAPVKFRPLNCTSLG
        MSQAIHLLP N TV               ++G  K  V     FVE       FGLSDS CLPCSGVSGRAAS S R+LCAEH IN PVKFRPLNCTSLG
Subjt:  MSQAIHLLPNNLTVRLRDSWICFAASCSRLLGLKKIYVFSFHQFVEEGSRCNSFGLSDSICLPCSGVSGRAASVSSRTLCAEHKINAPVKFRPLNCTSLG

Query:  LSFTCKASSGGHRRNPDFSKQNRHGFSRSRNRHNEERESLDNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKK
        +SFTCKASS GHRRNPDF KQNR G+SRSRNR NEERESL+NVDESDLLSS+NGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKK
Subjt:  LSFTCKASSGGHRRNPDFSKQNRHGFSRSRNRHNEERESLDNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKK

Query:  MEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGSSKDFSFNHVKENGPYDEGKGTSIFGLSANTRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIY
        MEAQGQTKGSETVDSLLKLLRKH+VEQGKRSSG GGS+KD SFNHVKENGPYDEG+G+SIFGLS N R+KAQEP GSF RP SNFQR+SPVPRVKYQPIY
Subjt:  MEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGSSKDFSFNHVKENGPYDEGKGTSIFGLSANTRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIY

Query:  PGESVVNSTDGVNAKGVKLNGTEAGSQLKAKVWTRQESEREHWEELQSQGETEQEPEPDQEYELEPEAESYELDHEPDEMEPELVNLLGVSSDIDDKIED
        PGES+V+ST+G+N+KG+KLNGTE GSQLKAKVWTRQESEREHWEELQSQ +TEQEPE DQE+E+EPEAE+Y+L+HE DEMEPELVNLLGVSSDIDD  ED
Subjt:  PGESVVNSTDGVNAKGVKLNGTEAGSQLKAKVWTRQESEREHWEELQSQGETEQEPEPDQEYELEPEAESYELDHEPDEMEPELVNLLGVSSDIDDKIED

Query:  NVKDNEKFAKHDEHVDLNSLKLAELRAIAKSRSLKGFSKMKKSELVRLLSDA
        ++KDNE+F+KH EH +LNSLKLAELRAIAKSRSL+GFSKMKKSELV+LLS++
Subjt:  NVKDNEKFAKHDEHVDLNSLKLAELRAIAKSRSLKGFSKMKKSELVRLLSDA

XP_016900425.1 PREDICTED: rho-N domain-containing protein 1, chloroplastic isoform X4 [Cucumis melo]1.4e-18479.42Show/hide
Query:  MSQAIHLLPNNLTVRLRDSWICFAASCSRLLGLKKIYVFSFHQFVEEGSRCNSFGLSDSICLPCSGVSGRAASVSSRTLCAEHKINAPVKFRPLNCTSLG
        MSQAIHLLP N T                     K  V     FVE       FGLSDS CLPCSGVSGRAAS S R+LCAEH IN PVKFRPLNCTSLG
Subjt:  MSQAIHLLPNNLTVRLRDSWICFAASCSRLLGLKKIYVFSFHQFVEEGSRCNSFGLSDSICLPCSGVSGRAASVSSRTLCAEHKINAPVKFRPLNCTSLG

Query:  LSFTCKASSGGHRRNPDFSKQNRHGFSRSRNRHNEERESLDNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKK
        +SFTCKASS GHRRNPDF KQNR G+SRSRNR NEERESL+NVDESDLLSS+NGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKK
Subjt:  LSFTCKASSGGHRRNPDFSKQNRHGFSRSRNRHNEERESLDNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKK

Query:  MEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGSSKDFSFNHVKENGPYDEGKGTSIFGLSANTRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIY
        MEAQGQTKGSETVDSLLKLLRKH+VEQGKRSSG GGS+KD SFNHVKENGPYDEG+G+SIFGLS N R+KAQEP GSF RP SNFQR+SPVPRVKYQPIY
Subjt:  MEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGSSKDFSFNHVKENGPYDEGKGTSIFGLSANTRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIY

Query:  PGESVVNSTDGVNAKGVKLNGTEAGSQLKAKVWTRQESEREHWEELQSQGETEQEPEPDQEYELEPEAESYELDHEPDEMEPELVNLLGVSSDIDDKIED
        PGES+V+ST+G+N+KG+KLNGTE GSQLKAKVWTRQESEREHWEELQSQ +TEQEPE DQE+E+EPEAE+Y+L+HE DEMEPELVNLLGVSSDIDD  ED
Subjt:  PGESVVNSTDGVNAKGVKLNGTEAGSQLKAKVWTRQESEREHWEELQSQGETEQEPEPDQEYELEPEAESYELDHEPDEMEPELVNLLGVSSDIDDKIED

Query:  NVKDNEKFAKHDEHVDLNSLKLAELRAIAKSRSLKGFSKMKKSELVRLLSDA
        ++KDNE+F+KH EH +LNSLKLAELRAIAKSRSL+GFSKMKKSELV+LLS++
Subjt:  NVKDNEKFAKHDEHVDLNSLKLAELRAIAKSRSLKGFSKMKKSELVRLLSDA

XP_038887988.1 rho-N domain-containing protein 1, chloroplastic-like isoform X1 [Benincasa hispida]6.9e-19282.14Show/hide
Query:  MSQAIHLLPNNLTVRLRDSWICFAASCSRLLGLKKIYVFSFHQFVEEGSRCNSFGLSDSICLPCSGVSGRAASVSSRTLCAEHKINAPVKFRPLNCTSLG
        MSQAIHLLPNNLT                                       +FGLSDS CLPCSGVSGRAASVSSR+LCAEH+I+A VKFRPLNCTSLG
Subjt:  MSQAIHLLPNNLTVRLRDSWICFAASCSRLLGLKKIYVFSFHQFVEEGSRCNSFGLSDSICLPCSGVSGRAASVSSRTLCAEHKINAPVKFRPLNCTSLG

Query:  LSFTCKASSGGHRRNPDFSKQNRHGFSRSRNRHNEERESLDNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKK
         SFTCKASSGGHRRNPDFSKQNRHGFSRSRNR NEERESLDNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKK
Subjt:  LSFTCKASSGGHRRNPDFSKQNRHGFSRSRNRHNEERESLDNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKK

Query:  MEAQGQTKGSETVDSLLKLLRKHSVEQGKRS----SGGGGSSKDFSFNHVKENGPYDEGKGTSIFGLSANTRDKAQEPTGSFNRPVSNFQRKSPVPRVKY
        +EAQGQTKGSETVDSLLKLLRKHSVEQGKRS    SGGG SSKDF+FNHVKENG YDEGKGTSIFGLSAN R+KAQEPTGSF+RPVSNFQRKSPVPRVKY
Subjt:  MEAQGQTKGSETVDSLLKLLRKHSVEQGKRS----SGGGGSSKDFSFNHVKENGPYDEGKGTSIFGLSANTRDKAQEPTGSFNRPVSNFQRKSPVPRVKY

Query:  QPIYPGESVVNSTDGVNAKGVKLNGTEAGSQLKAKVWTRQES-EREHWEELQSQGETEQEPEPDQEYELEPEAESYELDHEPDEMEPELVNLLGVSSDID
        QPI+PGES+V+STDGVN+KGVKLNGTE  SQLKAKVWTRQES ER HWEELQSQGET+QEPE DQEYELEPEAESYEL+H+PDE E ELVNLLGVSSD+D
Subjt:  QPIYPGESVVNSTDGVNAKGVKLNGTEAGSQLKAKVWTRQES-EREHWEELQSQGETEQEPEPDQEYELEPEAESYELDHEPDEMEPELVNLLGVSSDID

Query:  DKIEDNVKDNEKFAKHDEHVDLNSLKLAELRAIAKSRSLKGFSKMKKSELVRLLSDAQV
        D  +D+VKDNEKFAKHDEH DLNSLK+AELRAIAKSRSLKGFSKMKKSELV+LLSD  V
Subjt:  DKIEDNVKDNEKFAKHDEHVDLNSLKLAELRAIAKSRSLKGFSKMKKSELVRLLSDAQV

XP_038887989.1 rho-N domain-containing protein 1, chloroplastic-like isoform X2 [Benincasa hispida]5.5e-18981.26Show/hide
Query:  MSQAIHLLPNNLTVRLRDSWICFAASCSRLLGLKKIYVFSFHQFVEEGSRCNSFGLSDSICLPCSGVSGRAASVSSRTLCAEHKINAPVKFRPLNCTSLG
        MSQAIHLLPNNLT                                            DS CLPCSGVSGRAASVSSR+LCAEH+I+A VKFRPLNCTSLG
Subjt:  MSQAIHLLPNNLTVRLRDSWICFAASCSRLLGLKKIYVFSFHQFVEEGSRCNSFGLSDSICLPCSGVSGRAASVSSRTLCAEHKINAPVKFRPLNCTSLG

Query:  LSFTCKASSGGHRRNPDFSKQNRHGFSRSRNRHNEERESLDNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKK
         SFTCKASSGGHRRNPDFSKQNRHGFSRSRNR NEERESLDNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKK
Subjt:  LSFTCKASSGGHRRNPDFSKQNRHGFSRSRNRHNEERESLDNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKK

Query:  MEAQGQTKGSETVDSLLKLLRKHSVEQGKRS----SGGGGSSKDFSFNHVKENGPYDEGKGTSIFGLSANTRDKAQEPTGSFNRPVSNFQRKSPVPRVKY
        +EAQGQTKGSETVDSLLKLLRKHSVEQGKRS    SGGG SSKDF+FNHVKENG YDEGKGTSIFGLSAN R+KAQEPTGSF+RPVSNFQRKSPVPRVKY
Subjt:  MEAQGQTKGSETVDSLLKLLRKHSVEQGKRS----SGGGGSSKDFSFNHVKENGPYDEGKGTSIFGLSANTRDKAQEPTGSFNRPVSNFQRKSPVPRVKY

Query:  QPIYPGESVVNSTDGVNAKGVKLNGTEAGSQLKAKVWTRQES-EREHWEELQSQGETEQEPEPDQEYELEPEAESYELDHEPDEMEPELVNLLGVSSDID
        QPI+PGES+V+STDGVN+KGVKLNGTE  SQLKAKVWTRQES ER HWEELQSQGET+QEPE DQEYELEPEAESYEL+H+PDE E ELVNLLGVSSD+D
Subjt:  QPIYPGESVVNSTDGVNAKGVKLNGTEAGSQLKAKVWTRQES-EREHWEELQSQGETEQEPEPDQEYELEPEAESYELDHEPDEMEPELVNLLGVSSDID

Query:  DKIEDNVKDNEKFAKHDEHVDLNSLKLAELRAIAKSRSLKGFSKMKKSELVRLLSDAQV
        D  +D+VKDNEKFAKHDEH DLNSLK+AELRAIAKSRSLKGFSKMKKSELV+LLSD  V
Subjt:  DKIEDNVKDNEKFAKHDEHVDLNSLKLAELRAIAKSRSLKGFSKMKKSELVRLLSDAQV

TrEMBL top hitse value%identityAlignment
A0A1S3BHD7 rho-N domain-containing protein 1, chloroplastic isoform X32.9e-18378.98Show/hide
Query:  MSQAIHLLPNNLTVRLRDSWICFAASCSRLLGLKKIYVFSFHQFVEEGSRCNSFGLSDSICLPCSGVSGRAASVSSRTLCAEHKINAPVKFRPLNCTSLG
        MSQAIHLLP N TV               ++G  K  V     FVE           DS CLPCSGVSGRAAS S R+LCAEH IN PVKFRPLNCTSLG
Subjt:  MSQAIHLLPNNLTVRLRDSWICFAASCSRLLGLKKIYVFSFHQFVEEGSRCNSFGLSDSICLPCSGVSGRAASVSSRTLCAEHKINAPVKFRPLNCTSLG

Query:  LSFTCKASSGGHRRNPDFSKQNRHGFSRSRNRHNEERESLDNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKK
        +SFTCKASS GHRRNPDF KQNR G+SRSRNR NEERESL+NVDESDLLSS+NGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKK
Subjt:  LSFTCKASSGGHRRNPDFSKQNRHGFSRSRNRHNEERESLDNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKK

Query:  MEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGSSKDFSFNHVKENGPYDEGKGTSIFGLSANTRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIY
        MEAQGQTKGSETVDSLLKLLRKH+VEQGKRSSG GGS+KD SFNHVKENGPYDEG+G+SIFGLS N R+KAQEP GSF RP SNFQR+SPVPRVKYQPIY
Subjt:  MEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGSSKDFSFNHVKENGPYDEGKGTSIFGLSANTRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIY

Query:  PGESVVNSTDGVNAKGVKLNGTEAGSQLKAKVWTRQESEREHWEELQSQGETEQEPEPDQEYELEPEAESYELDHEPDEMEPELVNLLGVSSDIDDKIED
        PGES+V+ST+G+N+KG+KLNGTE GSQLKAKVWTRQESEREHWEELQSQ +TEQEPE DQE+E+EPEAE+Y+L+HE DEMEPELVNLLGVSSDIDD  ED
Subjt:  PGESVVNSTDGVNAKGVKLNGTEAGSQLKAKVWTRQESEREHWEELQSQGETEQEPEPDQEYELEPEAESYELDHEPDEMEPELVNLLGVSSDIDDKIED

Query:  NVKDNEKFAKHDEHVDLNSLKLAELRAIAKSRSLKGFSKMKKSELVRLLSDA
        ++KDNE+F+KH EH +LNSLKLAELRAIAKSRSL+GFSKMKKSELV+LLS++
Subjt:  NVKDNEKFAKHDEHVDLNSLKLAELRAIAKSRSLKGFSKMKKSELVRLLSDA

A0A1S3BHF2 rho-N domain-containing protein 1, chloroplastic isoform X72.2e-18378.32Show/hide
Query:  MSQAIHLLPNNLTVRLRDSWICFAASCSRLLGLKKIYVFSFHQFVEEGSRCNSFGLSDSICLPCSGVSGRAASVSSRTLCAEHKINAPVKFRPLNCTSLG
        MSQAIHLLP N T                                        FGLSDS CLPCSGVSGRAAS S R+LCAEH IN PVKFRPLNCTSLG
Subjt:  MSQAIHLLPNNLTVRLRDSWICFAASCSRLLGLKKIYVFSFHQFVEEGSRCNSFGLSDSICLPCSGVSGRAASVSSRTLCAEHKINAPVKFRPLNCTSLG

Query:  LSFTCKASSGGHRRNPDFSKQNRHGFSRSRNRHNEERESLDNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKK
        +SFTCKASS GHRRNPDF KQNR G+SRSRNR NEERESL+NVDESDLLSS+NGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKK
Subjt:  LSFTCKASSGGHRRNPDFSKQNRHGFSRSRNRHNEERESLDNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKK

Query:  MEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGSSKDFSFNHVKENGPYDEGKGTSIFGLSANTRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIY
        MEAQGQTKGSETVDSLLKLLRKH+VEQGKRSSG GGS+KD SFNHVKENGPYDEG+G+SIFGLS N R+KAQEP GSF RP SNFQR+SPVPRVKYQPIY
Subjt:  MEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGSSKDFSFNHVKENGPYDEGKGTSIFGLSANTRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIY

Query:  PGESVVNSTDGVNAKGVKLNGTEAGSQLKAKVWTRQESEREHWEELQSQGETEQEPEPDQEYELEPEAESYELDHEPDEMEPELVNLLGVSSDIDDKIED
        PGES+V+ST+G+N+KG+KLNGTE GSQLKAKVWTRQESEREHWEELQSQ +TEQEPE DQE+E+EPEAE+Y+L+HE DEMEPELVNLLGVSSDIDD  ED
Subjt:  PGESVVNSTDGVNAKGVKLNGTEAGSQLKAKVWTRQESEREHWEELQSQGETEQEPEPDQEYELEPEAESYELDHEPDEMEPELVNLLGVSSDIDDKIED

Query:  NVKDNEKFAKHDEHVDLNSLKLAELRAIAKSRSLKGFSKMKKSELVRLLSDA
        ++KDNE+F+KH EH +LNSLKLAELRAIAKSRSL+GFSKMKKSELV+LLS++
Subjt:  NVKDNEKFAKHDEHVDLNSLKLAELRAIAKSRSLKGFSKMKKSELVRLLSDA

A0A1S4DWR7 rho-N domain-containing protein 1, chloroplastic isoform X24.9e-18385.11Show/hide
Query:  RCNSFGLSDSICLPCSGVSGRAASVSSRTLCAEHKINAPVKFRPLNCTSLGLSFTCKASSGGHRRNPDFSKQNRHGFSRSRNRHNEERESLDNVDESDLL
        +   FGLSDS CLPCSGVSGRAAS S R+LCAEH IN PVKFRPLNCTSLG+SFTCKASS GHRRNPDF KQNR G+SRSRNR NEERESL+NVDESDLL
Subjt:  RCNSFGLSDSICLPCSGVSGRAASVSSRTLCAEHKINAPVKFRPLNCTSLGLSFTCKASSGGHRRNPDFSKQNRHGFSRSRNRHNEERESLDNVDESDLL

Query:  SSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGSSKDFSFNHVKEN
        SS+NGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKH+VEQGKRSSG GGS+KD SFNHVKEN
Subjt:  SSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGSSKDFSFNHVKEN

Query:  GPYDEGKGTSIFGLSANTRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIYPGESVVNSTDGVNAKGVKLNGTEAGSQLKAKVWTRQESEREHWEELQSQ
        GPYDEG+G+SIFGLS N R+KAQEP GSF RP SNFQR+SPVPRVKYQPIYPGES+V+ST+G+N+KG+KLNGTE GSQLKAKVWTRQESEREHWEELQSQ
Subjt:  GPYDEGKGTSIFGLSANTRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIYPGESVVNSTDGVNAKGVKLNGTEAGSQLKAKVWTRQESEREHWEELQSQ

Query:  GETEQEPEPDQEYELEPEAESYELDHEPDEMEPELVNLLGVSSDIDDKIEDNVKDNEKFAKHDEHVDLNSLKLAELRAIAKSRSLKGFSKMKKSELVRLL
         +TEQEPE DQE+E+EPEAE+Y+L+HE DEMEPELVNLLGVSSDIDD  ED++KDNE+F+KH EH +LNSLKLAELRAIAKSRSL+GFSKMKKSELV+LL
Subjt:  GETEQEPEPDQEYELEPEAESYELDHEPDEMEPELVNLLGVSSDIDDKIEDNVKDNEKFAKHDEHVDLNSLKLAELRAIAKSRSLKGFSKMKKSELVRLL

Query:  SDA
        S++
Subjt:  SDA

A0A1S4DWS1 rho-N domain-containing protein 1, chloroplastic isoform X14.7e-18679.87Show/hide
Query:  MSQAIHLLPNNLTVRLRDSWICFAASCSRLLGLKKIYVFSFHQFVEEGSRCNSFGLSDSICLPCSGVSGRAASVSSRTLCAEHKINAPVKFRPLNCTSLG
        MSQAIHLLP N TV               ++G  K  V     FVE       FGLSDS CLPCSGVSGRAAS S R+LCAEH IN PVKFRPLNCTSLG
Subjt:  MSQAIHLLPNNLTVRLRDSWICFAASCSRLLGLKKIYVFSFHQFVEEGSRCNSFGLSDSICLPCSGVSGRAASVSSRTLCAEHKINAPVKFRPLNCTSLG

Query:  LSFTCKASSGGHRRNPDFSKQNRHGFSRSRNRHNEERESLDNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKK
        +SFTCKASS GHRRNPDF KQNR G+SRSRNR NEERESL+NVDESDLLSS+NGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKK
Subjt:  LSFTCKASSGGHRRNPDFSKQNRHGFSRSRNRHNEERESLDNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKK

Query:  MEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGSSKDFSFNHVKENGPYDEGKGTSIFGLSANTRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIY
        MEAQGQTKGSETVDSLLKLLRKH+VEQGKRSSG GGS+KD SFNHVKENGPYDEG+G+SIFGLS N R+KAQEP GSF RP SNFQR+SPVPRVKYQPIY
Subjt:  MEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGSSKDFSFNHVKENGPYDEGKGTSIFGLSANTRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIY

Query:  PGESVVNSTDGVNAKGVKLNGTEAGSQLKAKVWTRQESEREHWEELQSQGETEQEPEPDQEYELEPEAESYELDHEPDEMEPELVNLLGVSSDIDDKIED
        PGES+V+ST+G+N+KG+KLNGTE GSQLKAKVWTRQESEREHWEELQSQ +TEQEPE DQE+E+EPEAE+Y+L+HE DEMEPELVNLLGVSSDIDD  ED
Subjt:  PGESVVNSTDGVNAKGVKLNGTEAGSQLKAKVWTRQESEREHWEELQSQGETEQEPEPDQEYELEPEAESYELDHEPDEMEPELVNLLGVSSDIDDKIED

Query:  NVKDNEKFAKHDEHVDLNSLKLAELRAIAKSRSLKGFSKMKKSELVRLLSDA
        ++KDNE+F+KH EH +LNSLKLAELRAIAKSRSL+GFSKMKKSELV+LLS++
Subjt:  NVKDNEKFAKHDEHVDLNSLKLAELRAIAKSRSLKGFSKMKKSELVRLLSDA

A0A1S4DXI1 rho-N domain-containing protein 1, chloroplastic isoform X46.8e-18579.42Show/hide
Query:  MSQAIHLLPNNLTVRLRDSWICFAASCSRLLGLKKIYVFSFHQFVEEGSRCNSFGLSDSICLPCSGVSGRAASVSSRTLCAEHKINAPVKFRPLNCTSLG
        MSQAIHLLP N T                     K  V     FVE       FGLSDS CLPCSGVSGRAAS S R+LCAEH IN PVKFRPLNCTSLG
Subjt:  MSQAIHLLPNNLTVRLRDSWICFAASCSRLLGLKKIYVFSFHQFVEEGSRCNSFGLSDSICLPCSGVSGRAASVSSRTLCAEHKINAPVKFRPLNCTSLG

Query:  LSFTCKASSGGHRRNPDFSKQNRHGFSRSRNRHNEERESLDNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKK
        +SFTCKASS GHRRNPDF KQNR G+SRSRNR NEERESL+NVDESDLLSS+NGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKK
Subjt:  LSFTCKASSGGHRRNPDFSKQNRHGFSRSRNRHNEERESLDNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKK

Query:  MEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGSSKDFSFNHVKENGPYDEGKGTSIFGLSANTRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIY
        MEAQGQTKGSETVDSLLKLLRKH+VEQGKRSSG GGS+KD SFNHVKENGPYDEG+G+SIFGLS N R+KAQEP GSF RP SNFQR+SPVPRVKYQPIY
Subjt:  MEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGSSKDFSFNHVKENGPYDEGKGTSIFGLSANTRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIY

Query:  PGESVVNSTDGVNAKGVKLNGTEAGSQLKAKVWTRQESEREHWEELQSQGETEQEPEPDQEYELEPEAESYELDHEPDEMEPELVNLLGVSSDIDDKIED
        PGES+V+ST+G+N+KG+KLNGTE GSQLKAKVWTRQESEREHWEELQSQ +TEQEPE DQE+E+EPEAE+Y+L+HE DEMEPELVNLLGVSSDIDD  ED
Subjt:  PGESVVNSTDGVNAKGVKLNGTEAGSQLKAKVWTRQESEREHWEELQSQGETEQEPEPDQEYELEPEAESYELDHEPDEMEPELVNLLGVSSDIDDKIED

Query:  NVKDNEKFAKHDEHVDLNSLKLAELRAIAKSRSLKGFSKMKKSELVRLLSDA
        ++KDNE+F+KH EH +LNSLKLAELRAIAKSRSL+GFSKMKKSELV+LLS++
Subjt:  NVKDNEKFAKHDEHVDLNSLKLAELRAIAKSRSLKGFSKMKKSELVRLLSDA

SwissProt top hitse value%identityAlignment
Q8L4E7 SAP-like protein BP-731.6e-3737.88Show/hide
Query:  LSFTCKASSGGHR-RNPDFSKQNRHGFSRSRNRHNEERESLDNVDE--SDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKE
        LS  C A+   HR R+ D ++  + G +R +++  +E++  +N+DE  +D++SSKNGP +S++S  + QAT+ PG REKEIVELF++VQAQLR R   KE
Subjt:  LSFTCKASSGGHR-RNPDFSKQNRHGFSRSRNRHNEERESLDNVDE--SDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKE

Query:  EKKMEAQGQTKGSE-TVDSLLKLLRKHSVEQGKRSSGGGGSSKDFSFNHVKENGPYDEGKGTSIFGLSANTRDKAQEP-TGSFNRPVSNFQRKSPVPRVK
        EKK E Q + +G   +VDSLL LLRKHSV+Q ++S    G  K+ S +  K +      + +SIF +  +T+++ ++P   +F RP SNF+R+SPVP VK
Subjt:  EKKMEAQGQTKGSE-TVDSLLKLLRKHSVEQGKRSSGGGGSSKDFSFNHVKENGPYDEGKGTSIFGLSANTRDKAQEP-TGSFNRPVSNFQRKSPVPRVK

Query:  YQPI--YPGESVVNS-TDGVNAKGVKLNGTEAGSQLKAKVWTRQESEREHWEELQSQGETEQEPEPDQEYELEPEAESYELDHEPDEMEPELVNLLGVSS
        +QP+     E V+N+  D V          EA   L+ K  T  E +     E  S  E E     D ++  + E ++ + D    E +   + +  V  
Subjt:  YQPI--YPGESVVNS-TDGVNAKGVKLNGTEAGSQLKAKVWTRQESEREHWEELQSQGETEQEPEPDQEYELEPEAESYELDHEPDEMEPELVNLLGVSS

Query:  DIDDKIEDNVKDNEKFAKHDEHVDLNSLKLAELRAIAKSRSLKGFSKMKKSELVRLLSD
         ID+  +  +K +          DL++LK+ ELR +AKSR +KG+SKMKK++LV LLS+
Subjt:  DIDDKIEDNVKDNEKFAKHDEHVDLNSLKLAELRAIAKSRSLKGFSKMKKSELVRLLSD

Q94K75 Rho-N domain-containing protein 1, chloroplastic1.7e-6044.61Show/hide
Query:  FGLSDSICLPCSGVSGRAASVSSRTLCAEHKINAPVKFRPLNCTSLGLSFTCKASSGGHRRNPDFSKQNRHGFSRSRNRHNEERESLDNVDESDLLSSKN
        + LSDS C   S VS R  ++   + C +HK N  +K  P        SF C+ASSGG+RRNPDFS+ N+HG+ R  NR +  RE  D ++ SD+LSS+N
Subjt:  FGLSDSICLPCSGVSGRAASVSSRTLCAEHKINAPVKFRPLNCTSLGLSFTCKASSGGHRRNPDFSKQNRHGFSRSRNRHNEERESLDNVDESDLLSSKN

Query:  GPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRER-AAMKEEKKME--AQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGSSKDFSFNHVKENG
        GPL ++SS+PK QAT++PGPREKEIVELFRKVQAQLR R AA KEEKK+E  ++GQ K SETVDSLLKLLRKHS EQ KR          FS     +  
Subjt:  GPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRER-AAMKEEKKME--AQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGSSKDFSFNHVKENG

Query:  PYDEGKGTSIFGLSANTRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIYPGESVVNSTDGVNAKGVKLNGTEAGSQLKAKVWTRQESEREHWEELQSQG
          D+   T     S N  + A     SF RP S+F+RKSPVPR +  P Y  E+  + +   +            +Q K  V    E E E   E + + 
Subjt:  PYDEGKGTSIFGLSANTRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIYPGESVVNSTDGVNAKGVKLNGTEAGSQLKAKVWTRQESEREHWEELQSQG

Query:  ETEQEPEP-----DQEYELEPEAESYELDHEPDEMEPELVN----LLGVSSDIDDKIEDNVKDNEKFAKHDEHVDLNSLKLAELRAIAKSRSLKGFSKMK
        E E EP P     + + EL+PE+ S+  + E D++  ++++    +L V SD D+ ++D  +D+++ A+ +   DL+ LKL ELR IAKSR LKG SKMK
Subjt:  ETEQEPEP-----DQEYELEPEAESYELDHEPDEMEPELVN----LLGVSSDIDDKIEDNVKDNEKFAKHDEHVDLNSLKLAELRAIAKSRSLKGFSKMK

Query:  KSELVRLL
        K+ELV LL
Subjt:  KSELVRLL

Arabidopsis top hitse value%identityAlignment
AT1G06190.1 Rho termination factor1.2e-6144.61Show/hide
Query:  FGLSDSICLPCSGVSGRAASVSSRTLCAEHKINAPVKFRPLNCTSLGLSFTCKASSGGHRRNPDFSKQNRHGFSRSRNRHNEERESLDNVDESDLLSSKN
        + LSDS C   S VS R  ++   + C +HK N  +K  P        SF C+ASSGG+RRNPDFS+ N+HG+ R  NR +  RE  D ++ SD+LSS+N
Subjt:  FGLSDSICLPCSGVSGRAASVSSRTLCAEHKINAPVKFRPLNCTSLGLSFTCKASSGGHRRNPDFSKQNRHGFSRSRNRHNEERESLDNVDESDLLSSKN

Query:  GPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRER-AAMKEEKKME--AQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGSSKDFSFNHVKENG
        GPL ++SS+PK QAT++PGPREKEIVELFRKVQAQLR R AA KEEKK+E  ++GQ K SETVDSLLKLLRKHS EQ KR          FS     +  
Subjt:  GPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRER-AAMKEEKKME--AQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGSSKDFSFNHVKENG

Query:  PYDEGKGTSIFGLSANTRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIYPGESVVNSTDGVNAKGVKLNGTEAGSQLKAKVWTRQESEREHWEELQSQG
          D+   T     S N  + A     SF RP S+F+RKSPVPR +  P Y  E+  + +   +            +Q K  V    E E E   E + + 
Subjt:  PYDEGKGTSIFGLSANTRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIYPGESVVNSTDGVNAKGVKLNGTEAGSQLKAKVWTRQESEREHWEELQSQG

Query:  ETEQEPEP-----DQEYELEPEAESYELDHEPDEMEPELVN----LLGVSSDIDDKIEDNVKDNEKFAKHDEHVDLNSLKLAELRAIAKSRSLKGFSKMK
        E E EP P     + + EL+PE+ S+  + E D++  ++++    +L V SD D+ ++D  +D+++ A+ +   DL+ LKL ELR IAKSR LKG SKMK
Subjt:  ETEQEPEP-----DQEYELEPEAESYELDHEPDEMEPELVN----LLGVSSDIDDKIEDNVKDNEKFAKHDEHVDLNSLKLAELRAIAKSRSLKGFSKMK

Query:  KSELVRLL
        K+ELV LL
Subjt:  KSELVRLL

AT1G06190.2 Rho termination factor1.7e-4749.81Show/hide
Query:  FGLSDSICLPCSGVSGRAASVSSRTLCAEHKINAPVKFRPLNCTSLGLSFTCKASSGGHRRNPDFSKQNRHGFSRSRNRHNEERESLDNVDESDLLSSKN
        + LSDS C   S VS R  ++   + C +HK N  +K  P        SF C+ASSGG+RRNPDFS+ N+HG+ R  NR +  RE  D ++ SD+LSS+N
Subjt:  FGLSDSICLPCSGVSGRAASVSSRTLCAEHKINAPVKFRPLNCTSLGLSFTCKASSGGHRRNPDFSKQNRHGFSRSRNRHNEERESLDNVDESDLLSSKN

Query:  GPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRER-AAMKEEKKME--AQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGSSKDFSFNHVKENG
        GPL ++SS+PK QAT++PGPREKEIVELFRKVQAQLR R AA KEEKK+E  ++GQ K SETVDSLLKLLRKHS EQ KR          FS     +  
Subjt:  GPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRER-AAMKEEKKME--AQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGSSKDFSFNHVKENG

Query:  PYDEGKGTSIFGLSANTRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIYPGESVVNST
          D+   T     S N  + A     SF RP S+F+RKSPVPR +  P Y  E+  + +
Subjt:  PYDEGKGTSIFGLSANTRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIYPGESVVNST

AT2G31150.1 ATP binding;ATPases, coupled to transmembrane movement of ions, phosphorylative mechanism6.4e-2639.93Show/hide
Query:  HRR-NPDFSKQNRHGFSRSRNRHNEERESL-DNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRER-AAMKEEKKMEAQGQTK
        HRR NPDFS+ N+HGF R RNR NE+++ L D   E D+LSSKN                     EKEIVELF+KVQ QLR R AA KEEKK E   + +
Subjt:  HRR-NPDFSKQNRHGFSRSRNRHNEERESL-DNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRER-AAMKEEKKMEAQGQTK

Query:  G---SETVDSLLKLLRKHSVEQGKRSSGGGGSSKDFSFNHVKENGPYDEGKGTSIFGLSANTRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIYPGESV
        G   SETVDSLLKLLRKHS EQ K+      S K      ++ +    E +  S     +  +D    P   F RP S+F+R SPVPR K Q  Y  E++
Subjt:  G---SETVDSLLKLLRKHSVEQGKRSSGGGGSSKDFSFNHVKENGPYDEGKGTSIFGLSANTRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIYPGESV

Query:  VNSTDGVNAKGVKLNGTEAGSQLKAKVWTRQESEREHWEELQSQGETEQEPEPDQ--EY-ELEPEAESYELDHEPDEMEPELVNLLGVSSDIDDKIEDNV
         +               EA S   +  WT+++      ++++S+ E E EPEP+   EY E EPEAE YE + EP+    E V+ L + S   ++ ED  
Subjt:  VNSTDGVNAKGVKLNGTEAGSQLKAKVWTRQESEREHWEELQSQGETEQEPEPDQ--EY-ELEPEAESYELDHEPDEMEPELVNLLGVSSDIDDKIEDNV

Query:  KDN
          N
Subjt:  KDN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCAAGCCATTCATCTTCTTCCTAACAACCTTACAGTCAGATTACGTGACTCTTGGATTTGTTTTGCTGCATCATGTTCTCGCCTTCTTGGATTGAAGAAG
ATATATGTGTTTTCTTTTCATCAGTTTGTGGAGGAAGGTTCAAGATGTAATAGCTTTGGACTGTCAGATAGCATATGCCTACCATGCTCTGGAGTTTCAGGACGA
GCAGCCTCGGTCTCATCTCGCACTTTATGTGCTGAACATAAAATCAATGCACCGGTCAAATTCAGACCCCTAAACTGTACTTCGTTGGGGTTGTCTTTTACGTGC
AAAGCCAGCTCAGGAGGTCATAGGAGAAACCCAGATTTTTCAAAGCAAAATAGGCATGGCTTCTCAAGAAGCAGAAATAGGCACAATGAGGAGAGAGAGAGCCTT
GACAATGTTGATGAATCTGATTTATTATCATCTAAGAATGGACCCTTACTTTCCATCTCTAGCACACCAAAATCCCAAGCCACAGCTACCCCAGGCCCGAGGGAG
AAGGAAATTGTTGAACTTTTCAGGAAGGTTCAAGCTCAGCTTCGGGAGCGAGCTGCAATGAAAGAAGAGAAGAAAATGGAAGCTCAAGGACAAACGAAAGGGAGC
GAGACAGTGGATTCTCTTCTCAAGCTATTGAGAAAGCATTCGGTGGAGCAAGGGAAGAGAAGCAGTGGTGGTGGCGGCAGCAGCAAGGACTTCAGTTTTAACCAT
GTCAAAGAGAATGGTCCTTATGATGAAGGAAAAGGCACAAGCATTTTTGGCCTAAGTGCCAATACGAGAGACAAGGCCCAAGAACCAACAGGTTCTTTCAATAGA
CCCGTATCAAATTTTCAACGTAAATCCCCCGTGCCTCGGGTGAAGTACCAGCCAATTTACCCCGGGGAAAGTGTTGTCAACTCGACTGATGGTGTGAATGCAAAG
GGAGTGAAACTTAATGGAACTGAGGCAGGTTCTCAACTGAAGGCAAAGGTATGGACTCGGCAGGAGTCAGAACGAGAGCACTGGGAAGAGCTGCAATCACAAGGA
GAGACTGAGCAGGAGCCAGAGCCAGATCAAGAGTATGAGTTGGAGCCAGAGGCTGAATCATATGAGCTAGACCATGAGCCTGATGAAATGGAGCCTGAACTTGTA
AATTTATTAGGCGTGTCATCAGACATCGATGACAAGATTGAAGACAATGTTAAAGACAATGAGAAATTTGCAAAGCATGATGAACATGTGGATTTGAACTCATTG
AAGCTTGCTGAACTGAGGGCGATTGCCAAATCTCGCAGTTTGAAAGGGTTCTCGAAGATGAAGAAGAGTGAACTCGTGCGGTTGTTAAGCGACGCTCAGGTATGA
mRNA sequenceShow/hide mRNA sequence
ACTACATCCTCTTCGCCCGAGGAAGCTACGGCATTTCTCAAGCCAAACCAATCTCTCCCTTTTATCAGTGATATTTACAATAACAGGCCAGCAGGGGAAAGAAGA
ACAAACCGGAAAAGCGAAAATCCTCAAAGAAATGCAAATCCAAAACCCTTCTTTTCCTCACTAAAACCTTTCCTTCGAGCTGTCTAAACAAGCAATGTCTCAAGC
CATTCATCTTCTTCCTAACAACCTTACAGTCAGATTACGTGACTCTTGGATTTGTTTTGCTGCATCATGTTCTCGCCTTCTTGGATTGAAGAAGATATATGTGTT
TTCTTTTCATCAGTTTGTGGAGGAAGGTTCAAGATGTAATAGCTTTGGACTGTCAGATAGCATATGCCTACCATGCTCTGGAGTTTCAGGACGAGCAGCCTCGGT
CTCATCTCGCACTTTATGTGCTGAACATAAAATCAATGCACCGGTCAAATTCAGACCCCTAAACTGTACTTCGTTGGGGTTGTCTTTTACGTGCAAAGCCAGCTC
AGGAGGTCATAGGAGAAACCCAGATTTTTCAAAGCAAAATAGGCATGGCTTCTCAAGAAGCAGAAATAGGCACAATGAGGAGAGAGAGAGCCTTGACAATGTTGA
TGAATCTGATTTATTATCATCTAAGAATGGACCCTTACTTTCCATCTCTAGCACACCAAAATCCCAAGCCACAGCTACCCCAGGCCCGAGGGAGAAGGAAATTGT
TGAACTTTTCAGGAAGGTTCAAGCTCAGCTTCGGGAGCGAGCTGCAATGAAAGAAGAGAAGAAAATGGAAGCTCAAGGACAAACGAAAGGGAGCGAGACAGTGGA
TTCTCTTCTCAAGCTATTGAGAAAGCATTCGGTGGAGCAAGGGAAGAGAAGCAGTGGTGGTGGCGGCAGCAGCAAGGACTTCAGTTTTAACCATGTCAAAGAGAA
TGGTCCTTATGATGAAGGAAAAGGCACAAGCATTTTTGGCCTAAGTGCCAATACGAGAGACAAGGCCCAAGAACCAACAGGTTCTTTCAATAGACCCGTATCAAA
TTTTCAACGTAAATCCCCCGTGCCTCGGGTGAAGTACCAGCCAATTTACCCCGGGGAAAGTGTTGTCAACTCGACTGATGGTGTGAATGCAAAGGGAGTGAAACT
TAATGGAACTGAGGCAGGTTCTCAACTGAAGGCAAAGGTATGGACTCGGCAGGAGTCAGAACGAGAGCACTGGGAAGAGCTGCAATCACAAGGAGAGACTGAGCA
GGAGCCAGAGCCAGATCAAGAGTATGAGTTGGAGCCAGAGGCTGAATCATATGAGCTAGACCATGAGCCTGATGAAATGGAGCCTGAACTTGTAAATTTATTAGG
CGTGTCATCAGACATCGATGACAAGATTGAAGACAATGTTAAAGACAATGAGAAATTTGCAAAGCATGATGAACATGTGGATTTGAACTCATTGAAGCTTGCTGA
ACTGAGGGCGATTGCCAAATCTCGCAGTTTGAAAGGGTTCTCGAAGATGAAGAAGAGTGAACTCGTGCGGTTGTTAAGCGACGCTCAGGTATGAGAATGTCACGC
TGGACAGGAACCTCTGGGTTTAGGATGTTTAGGTGT
Protein sequenceShow/hide protein sequence
MSQAIHLLPNNLTVRLRDSWICFAASCSRLLGLKKIYVFSFHQFVEEGSRCNSFGLSDSICLPCSGVSGRAASVSSRTLCAEHKINAPVKFRPLNCTSLGLSFTC
KASSGGHRRNPDFSKQNRHGFSRSRNRHNEERESLDNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGS
ETVDSLLKLLRKHSVEQGKRSSGGGGSSKDFSFNHVKENGPYDEGKGTSIFGLSANTRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIYPGESVVNSTDGVNAK
GVKLNGTEAGSQLKAKVWTRQESEREHWEELQSQGETEQEPEPDQEYELEPEAESYELDHEPDEMEPELVNLLGVSSDIDDKIEDNVKDNEKFAKHDEHVDLNSL
KLAELRAIAKSRSLKGFSKMKKSELVRLLSDAQV