CuGenDBv2

Spg038294 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg038294
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: rho-N domain-containing protein 1, chloroplastic-like
Genome location: scaffold12:42413710..42417364
RNA-Seq Expression: Spg038294
Synteny: Spg038294
Gene Ontology terms: GO:0006353 - DNA-templated transcription, termination (biological process)
InterPro domains: IPR011112 - Rho termination factor, N-terminal
                  IPR036269 - Rho termination factor, N-terminal domain superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAA0038048.1 rho-N domain-containing protein 1 [Cucumis melo var. makuwa] | e value: 4.6e-180 | % identity: 83.9
Query:  MEFCFDIGFGLSDSRCLPCSGVSGRAATVSSRSLCAEHRIKAQVKFPPLNCTSLGASFTCKASSGGHRRNPDFSKQNRHGFSRSRNRQNEERESLENLDE
        ME  FDIGFGLSDSRCLPCSGVSGRAA+ S RSLCAEH I   VKF PLNCTSLG SFTCKASS GHRRNPDF KQNR G+SRSRNRQNEERESLEN+DE
Subjt:  MEFCFDIGFGLSDSRCLPCSGVSGRAATVSSRSLCAEHRIKAQVKFPPLNCTSLGASFTCKASSGGHRRNPDFSKQNRHGFSRSRNRQNEERESLENLDE

Query:  SDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGSGGSSSSSKDFS
        SDLLSS+NGPLLS+SSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKH+VEQGKRSSG+GG   S+KD S
Subjt:  SDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGSGGSSSSSKDFS

Query:  FNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSFSRPVSSFQRKSPVPRVKYQSIYPGESIVDCTDGVSSKGVKLNGTETGSHLKAKVWTRQESEREA
        FNHVKENGPYDEG+G+SIFGLS +LREKAQEP GSF RP S+FQR+SPVPRVKYQ IYPGESIVD T+G++SKG+KLNGTETGS LKAKVWTRQESERE 
Subjt:  FNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSFSRPVSSFQRKSPVPRVKYQSIYPGESIVDCTDGVSSKGVKLNGTETGSHLKAKVWTRQESEREA

Query:  WEELQSQRVTGQEPEPDQEFELEAEPESYELEHEPDEMEPELVNLLGVSSDVDDTFDDNVKDNEKFAKHEDHEDLNSLKLAELRAMAKSRGMKGFSKMKK
        WEELQSQR T QEPE DQEFE+E E E+Y+LEHE DEMEPELVNLLGVSSD+DDTF+D++KDNE+F+KH +HE+LNSLKLAELRA+AKSR ++GFSKMKK
Subjt:  WEELQSQRVTGQEPEPDQEFELEAEPESYELEHEPDEMEPELVNLLGVSSDVDDTFDDNVKDNEKFAKHEDHEDLNSLKLAELRAMAKSRGMKGFSKMKK

Query:  SELVQLLSEA
        SELVQLLS +
Subjt:  SELVQLLSEA

XP_016900423.1 PREDICTED: rho-N domain-containing protein 1, chloroplastic isoform X1 [Cucumis melo] | e value: 1.4e-181 | % identity: 80.05
Query:  MSQAIHLLPNNVTALKSLFSLHTENSVNFINMEFCFDIGFGLSDSRCLPCSGVSGRAATVSSRSLCAEHRIKAQVKFPPLNCTSLGASFTCKASSGGHRR
        MSQAIHLLP N T    +     E     +   F F  GFGLSDSRCLPCSGVSGRAA+ S RSLCAEH I   VKF PLNCTSLG SFTCKASS GHRR
Subjt:  MSQAIHLLPNNVTALKSLFSLHTENSVNFINMEFCFDIGFGLSDSRCLPCSGVSGRAATVSSRSLCAEHRIKAQVKFPPLNCTSLGASFTCKASSGGHRR

Query:  NPDFSKQNRHGFSRSRNRQNEERESLENLDESDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVD
        NPDF KQNR G+SRSRNRQNEERESLEN+DESDLLSS+NGPLLS+SSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVD
Subjt:  NPDFSKQNRHGFSRSRNRQNEERESLENLDESDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVD

Query:  SLLKLLRKHSVEQGKRSSGSGGSSSSSKDFSFNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSFSRPVSSFQRKSPVPRVKYQSIYPGESIVDCTDG
        SLLKLLRKH+VEQGKRSSG+GG   S+KD SFNHVKENGPYDEG+G+SIFGLS +LREKAQEP GSF RP S+FQR+SPVPRVKYQ IYPGESIVD T+G
Subjt:  SLLKLLRKHSVEQGKRSSGSGGSSSSSKDFSFNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSFSRPVSSFQRKSPVPRVKYQSIYPGESIVDCTDG

Query:  VSSKGVKLNGTETGSHLKAKVWTRQESEREAWEELQSQRVTGQEPEPDQEFELEAEPESYELEHEPDEMEPELVNLLGVSSDVDDTFDDNVKDNEKFAKH
        ++SKG+KLNGTETGS LKAKVWTRQESERE WEELQSQR T QEPE DQEFE+E E E+Y+LEHE DEMEPELVNLLGVSSD+DDTF+D++KDNE+F+KH
Subjt:  VSSKGVKLNGTETGSHLKAKVWTRQESEREAWEELQSQRVTGQEPEPDQEFELEAEPESYELEHEPDEMEPELVNLLGVSSDVDDTFDDNVKDNEKFAKH

Query:  EDHEDLNSLKLAELRAMAKSRGMKGFSKMKKSELVQLLSEA
         +HE+LNSLKLAELRA+AKSR ++GFSKMKKSELVQLLS +
Subjt:  EDHEDLNSLKLAELRAMAKSRGMKGFSKMKKSELVQLLSEA

XP_016900425.1 PREDICTED: rho-N domain-containing protein 1, chloroplastic isoform X4 [Cucumis melo] | e value: 4.1e-181 | % identity: 80.05
Query:  MSQAIHLLPNNVTALKSLFSLHTENSVNFINMEFCFDIGFGLSDSRCLPCSGVSGRAATVSSRSLCAEHRIKAQVKFPPLNCTSLGASFTCKASSGGHRR
        MSQAIHLLP N T  K             +   F F  GFGLSDSRCLPCSGVSGRAA+ S RSLCAEH I   VKF PLNCTSLG SFTCKASS GHRR
Subjt:  MSQAIHLLPNNVTALKSLFSLHTENSVNFINMEFCFDIGFGLSDSRCLPCSGVSGRAATVSSRSLCAEHRIKAQVKFPPLNCTSLGASFTCKASSGGHRR

Query:  NPDFSKQNRHGFSRSRNRQNEERESLENLDESDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVD
        NPDF KQNR G+SRSRNRQNEERESLEN+DESDLLSS+NGPLLS+SSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVD
Subjt:  NPDFSKQNRHGFSRSRNRQNEERESLENLDESDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVD

Query:  SLLKLLRKHSVEQGKRSSGSGGSSSSSKDFSFNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSFSRPVSSFQRKSPVPRVKYQSIYPGESIVDCTDG
        SLLKLLRKH+VEQGKRSSG+GG   S+KD SFNHVKENGPYDEG+G+SIFGLS +LREKAQEP GSF RP S+FQR+SPVPRVKYQ IYPGESIVD T+G
Subjt:  SLLKLLRKHSVEQGKRSSGSGGSSSSSKDFSFNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSFSRPVSSFQRKSPVPRVKYQSIYPGESIVDCTDG

Query:  VSSKGVKLNGTETGSHLKAKVWTRQESEREAWEELQSQRVTGQEPEPDQEFELEAEPESYELEHEPDEMEPELVNLLGVSSDVDDTFDDNVKDNEKFAKH
        ++SKG+KLNGTETGS LKAKVWTRQESERE WEELQSQR T QEPE DQEFE+E E E+Y+LEHE DEMEPELVNLLGVSSD+DDTF+D++KDNE+F+KH
Subjt:  VSSKGVKLNGTETGSHLKAKVWTRQESEREAWEELQSQRVTGQEPEPDQEFELEAEPESYELEHEPDEMEPELVNLLGVSSDVDDTFDDNVKDNEKFAKH

Query:  EDHEDLNSLKLAELRAMAKSRGMKGFSKMKKSELVQLLSEA
         +HE+LNSLKLAELRA+AKSR ++GFSKMKKSELVQLLS +
Subjt:  EDHEDLNSLKLAELRAMAKSRGMKGFSKMKKSELVQLLSEA

XP_038887988.1 rho-N domain-containing protein 1, chloroplastic-like isoform X1 [Benincasa hispida] | e value: 6.3e-190 | % identity: 83.82
Query:  MSQAIHLLPNNVTALKSLFSLHTENSVNFINMEFCFDIGFGLSDSRCLPCSGVSGRAATVSSRSLCAEHRIKAQVKFPPLNCTSLGASFTCKASSGGHRR
        MSQAIHLLPNN+TA                         FGLSDSRCLPCSGVSGRAA+VSSRSLCAEHRI A+VKF PLNCTSLGASFTCKASSGGHRR
Subjt:  MSQAIHLLPNNVTALKSLFSLHTENSVNFINMEFCFDIGFGLSDSRCLPCSGVSGRAATVSSRSLCAEHRIKAQVKFPPLNCTSLGASFTCKASSGGHRR

Query:  NPDFSKQNRHGFSRSRNRQNEERESLENLDESDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVD
        NPDFSKQNRHGFSRSRNRQNEERESL+N+DESDLLSSKNGPLLS+SSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKK+EAQGQTKGSETVD
Subjt:  NPDFSKQNRHGFSRSRNRQNEERESLENLDESDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVD

Query:  SLLKLLRKHSVEQGKRSSGSG-GSSSSSKDFSFNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSFSRPVSSFQRKSPVPRVKYQSIYPGESIVDCTD
        SLLKLLRKHSVEQGKRSSG G G  SSSKDF+FNHVKENG YDEGKGTSIFGLSA+LREKAQEPTGSFSRPVS+FQRKSPVPRVKYQ I+PGESIVD TD
Subjt:  SLLKLLRKHSVEQGKRSSGSG-GSSSSSKDFSFNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSFSRPVSSFQRKSPVPRVKYQSIYPGESIVDCTD

Query:  GVSSKGVKLNGTETGSHLKAKVWTRQES-EREAWEELQSQRVTGQEPEPDQEFELEAEPESYELEHEPDEMEPELVNLLGVSSDVDDTFDDNVKDNEKFA
        GV+SKGVKLNGTET S LKAKVWTRQES ER  WEELQSQ  T QEPE DQE+ELE E ESYELEH+PDE E ELVNLLGVSSD+DDTFDD+VKDNEKFA
Subjt:  GVSSKGVKLNGTETGSHLKAKVWTRQES-EREAWEELQSQRVTGQEPEPDQEFELEAEPESYELEHEPDEMEPELVNLLGVSSDVDDTFDDNVKDNEKFA

Query:  KHEDHEDLNSLKLAELRAMAKSRGMKGFSKMKKSELVQLLSEARV
        KH++HEDLNSLK+AELRA+AKSR +KGFSKMKKSELVQLLS+  V
Subjt:  KHEDHEDLNSLKLAELRAMAKSRGMKGFSKMKKSELVQLLSEARV

XP_038887989.1 rho-N domain-containing protein 1, chloroplastic-like isoform X2 [Benincasa hispida] | e value: 1.5e-186 | % identity: 82.7
Query:  MSQAIHLLPNNVTALKSLFSLHTENSVNFINMEFCFDIGFGLSDSRCLPCSGVSGRAATVSSRSLCAEHRIKAQVKFPPLNCTSLGASFTCKASSGGHRR
        MSQAIHLLPNN+T                              DSRCLPCSGVSGRAA+VSSRSLCAEHRI A+VKF PLNCTSLGASFTCKASSGGHRR
Subjt:  MSQAIHLLPNNVTALKSLFSLHTENSVNFINMEFCFDIGFGLSDSRCLPCSGVSGRAATVSSRSLCAEHRIKAQVKFPPLNCTSLGASFTCKASSGGHRR

Query:  NPDFSKQNRHGFSRSRNRQNEERESLENLDESDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVD
        NPDFSKQNRHGFSRSRNRQNEERESL+N+DESDLLSSKNGPLLS+SSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKK+EAQGQTKGSETVD
Subjt:  NPDFSKQNRHGFSRSRNRQNEERESLENLDESDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVD

Query:  SLLKLLRKHSVEQGKRSSGSG-GSSSSSKDFSFNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSFSRPVSSFQRKSPVPRVKYQSIYPGESIVDCTD
        SLLKLLRKHSVEQGKRSSG G G  SSSKDF+FNHVKENG YDEGKGTSIFGLSA+LREKAQEPTGSFSRPVS+FQRKSPVPRVKYQ I+PGESIVD TD
Subjt:  SLLKLLRKHSVEQGKRSSGSG-GSSSSSKDFSFNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSFSRPVSSFQRKSPVPRVKYQSIYPGESIVDCTD

Query:  GVSSKGVKLNGTETGSHLKAKVWTRQES-EREAWEELQSQRVTGQEPEPDQEFELEAEPESYELEHEPDEMEPELVNLLGVSSDVDDTFDDNVKDNEKFA
        GV+SKGVKLNGTET S LKAKVWTRQES ER  WEELQSQ  T QEPE DQE+ELE E ESYELEH+PDE E ELVNLLGVSSD+DDTFDD+VKDNEKFA
Subjt:  GVSSKGVKLNGTETGSHLKAKVWTRQES-EREAWEELQSQRVTGQEPEPDQEFELEAEPESYELEHEPDEMEPELVNLLGVSSDVDDTFDDNVKDNEKFA

Query:  KHEDHEDLNSLKLAELRAMAKSRGMKGFSKMKKSELVQLLSEARV
        KH++HEDLNSLK+AELRA+AKSR +KGFSKMKKSELVQLLS+  V
Subjt:  KHEDHEDLNSLKLAELRAMAKSRGMKGFSKMKKSELVQLLSEARV

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BHD7 rho-N domain-containing protein 1, chloroplastic isoform X3 | e value: 9.3e-179 | % identity: 78.83
Query:  MSQAIHLLPNNVTALKSLFSLHTENSVNFINMEFCFDIG---FGLSDSRCLPCSGVSGRAATVSSRSLCAEHRIKAQVKFPPLNCTSLGASFTCKASSGG
        MSQAIHLLP N T             V+F  +   F++G   F + DSRCLPCSGVSGRAA+ S RSLCAEH I   VKF PLNCTSLG SFTCKASS G
Subjt:  MSQAIHLLPNNVTALKSLFSLHTENSVNFINMEFCFDIG---FGLSDSRCLPCSGVSGRAATVSSRSLCAEHRIKAQVKFPPLNCTSLGASFTCKASSGG

Query:  HRRNPDFSKQNRHGFSRSRNRQNEERESLENLDESDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSE
        HRRNPDF KQNR G+SRSRNRQNEERESLEN+DESDLLSS+NGPLLS+SSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSE
Subjt:  HRRNPDFSKQNRHGFSRSRNRQNEERESLENLDESDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSE

Query:  TVDSLLKLLRKHSVEQGKRSSGSGGSSSSSKDFSFNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSFSRPVSSFQRKSPVPRVKYQSIYPGESIVDC
        TVDSLLKLLRKH+VEQGKRSSG+GG   S+KD SFNHVKENGPYDEG+G+SIFGLS +LREKAQEP GSF RP S+FQR+SPVPRVKYQ IYPGESIVD 
Subjt:  TVDSLLKLLRKHSVEQGKRSSGSGGSSSSSKDFSFNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSFSRPVSSFQRKSPVPRVKYQSIYPGESIVDC

Query:  TDGVSSKGVKLNGTETGSHLKAKVWTRQESEREAWEELQSQRVTGQEPEPDQEFELEAEPESYELEHEPDEMEPELVNLLGVSSDVDDTFDDNVKDNEKF
        T+G++SKG+KLNGTETGS LKAKVWTRQESERE WEELQSQR T QEPE DQEFE+E E E+Y+LEHE DEMEPELVNLLGVSSD+DDTF+D++KDNE+F
Subjt:  TDGVSSKGVKLNGTETGSHLKAKVWTRQESEREAWEELQSQRVTGQEPEPDQEFELEAEPESYELEHEPDEMEPELVNLLGVSSDVDDTFDDNVKDNEKF

Query:  AKHEDHEDLNSLKLAELRAMAKSRGMKGFSKMKKSELVQLLSEA
        +KH +HE+LNSLKLAELRA+AKSR ++GFSKMKKSELVQLLS +
Subjt:  AKHEDHEDLNSLKLAELRAMAKSRGMKGFSKMKKSELVQLLSEA

A0A1S3BHF2 rho-N domain-containing protein 1, chloroplastic isoform X7 | e value: 6.4e-180 | % identity: 79.37
Query:  MSQAIHLLPNNVTALKSLFSLHTENSVNFINMEFCFDIGFGLSDSRCLPCSGVSGRAATVSSRSLCAEHRIKAQVKFPPLNCTSLGASFTCKASSGGHRR
        MSQAIHLLP N T                         GFGLSDSRCLPCSGVSGRAA+ S RSLCAEH I   VKF PLNCTSLG SFTCKASS GHRR
Subjt:  MSQAIHLLPNNVTALKSLFSLHTENSVNFINMEFCFDIGFGLSDSRCLPCSGVSGRAATVSSRSLCAEHRIKAQVKFPPLNCTSLGASFTCKASSGGHRR

Query:  NPDFSKQNRHGFSRSRNRQNEERESLENLDESDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVD
        NPDF KQNR G+SRSRNRQNEERESLEN+DESDLLSS+NGPLLS+SSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVD
Subjt:  NPDFSKQNRHGFSRSRNRQNEERESLENLDESDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVD

Query:  SLLKLLRKHSVEQGKRSSGSGGSSSSSKDFSFNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSFSRPVSSFQRKSPVPRVKYQSIYPGESIVDCTDG
        SLLKLLRKH+VEQGKRSSG+GG   S+KD SFNHVKENGPYDEG+G+SIFGLS +LREKAQEP GSF RP S+FQR+SPVPRVKYQ IYPGESIVD T+G
Subjt:  SLLKLLRKHSVEQGKRSSGSGGSSSSSKDFSFNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSFSRPVSSFQRKSPVPRVKYQSIYPGESIVDCTDG

Query:  VSSKGVKLNGTETGSHLKAKVWTRQESEREAWEELQSQRVTGQEPEPDQEFELEAEPESYELEHEPDEMEPELVNLLGVSSDVDDTFDDNVKDNEKFAKH
        ++SKG+KLNGTETGS LKAKVWTRQESERE WEELQSQR T QEPE DQEFE+E E E+Y+LEHE DEMEPELVNLLGVSSD+DDTF+D++KDNE+F+KH
Subjt:  VSSKGVKLNGTETGSHLKAKVWTRQESEREAWEELQSQRVTGQEPEPDQEFELEAEPESYELEHEPDEMEPELVNLLGVSSDVDDTFDDNVKDNEKFAKH

Query:  EDHEDLNSLKLAELRAMAKSRGMKGFSKMKKSELVQLLSEA
         +HE+LNSLKLAELRA+AKSR ++GFSKMKKSELVQLLS +
Subjt:  EDHEDLNSLKLAELRAMAKSRGMKGFSKMKKSELVQLLSEA

A0A1S4DWS1 rho-N domain-containing protein 1, chloroplastic isoform X1 | e value: 6.9e-182 | % identity: 80.05
Query:  MSQAIHLLPNNVTALKSLFSLHTENSVNFINMEFCFDIGFGLSDSRCLPCSGVSGRAATVSSRSLCAEHRIKAQVKFPPLNCTSLGASFTCKASSGGHRR
        MSQAIHLLP N T    +     E     +   F F  GFGLSDSRCLPCSGVSGRAA+ S RSLCAEH I   VKF PLNCTSLG SFTCKASS GHRR
Subjt:  MSQAIHLLPNNVTALKSLFSLHTENSVNFINMEFCFDIGFGLSDSRCLPCSGVSGRAATVSSRSLCAEHRIKAQVKFPPLNCTSLGASFTCKASSGGHRR

Query:  NPDFSKQNRHGFSRSRNRQNEERESLENLDESDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVD
        NPDF KQNR G+SRSRNRQNEERESLEN+DESDLLSS+NGPLLS+SSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVD
Subjt:  NPDFSKQNRHGFSRSRNRQNEERESLENLDESDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVD

Query:  SLLKLLRKHSVEQGKRSSGSGGSSSSSKDFSFNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSFSRPVSSFQRKSPVPRVKYQSIYPGESIVDCTDG
        SLLKLLRKH+VEQGKRSSG+GG   S+KD SFNHVKENGPYDEG+G+SIFGLS +LREKAQEP GSF RP S+FQR+SPVPRVKYQ IYPGESIVD T+G
Subjt:  SLLKLLRKHSVEQGKRSSGSGGSSSSSKDFSFNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSFSRPVSSFQRKSPVPRVKYQSIYPGESIVDCTDG

Query:  VSSKGVKLNGTETGSHLKAKVWTRQESEREAWEELQSQRVTGQEPEPDQEFELEAEPESYELEHEPDEMEPELVNLLGVSSDVDDTFDDNVKDNEKFAKH
        ++SKG+KLNGTETGS LKAKVWTRQESERE WEELQSQR T QEPE DQEFE+E E E+Y+LEHE DEMEPELVNLLGVSSD+DDTF+D++KDNE+F+KH
Subjt:  VSSKGVKLNGTETGSHLKAKVWTRQESEREAWEELQSQRVTGQEPEPDQEFELEAEPESYELEHEPDEMEPELVNLLGVSSDVDDTFDDNVKDNEKFAKH

Query:  EDHEDLNSLKLAELRAMAKSRGMKGFSKMKKSELVQLLSEA
         +HE+LNSLKLAELRA+AKSR ++GFSKMKKSELVQLLS +
Subjt:  EDHEDLNSLKLAELRAMAKSRGMKGFSKMKKSELVQLLSEA

A0A1S4DXI1 rho-N domain-containing protein 1, chloroplastic isoform X4 | e value: 2.0e-181 | % identity: 80.05
Query:  MSQAIHLLPNNVTALKSLFSLHTENSVNFINMEFCFDIGFGLSDSRCLPCSGVSGRAATVSSRSLCAEHRIKAQVKFPPLNCTSLGASFTCKASSGGHRR
        MSQAIHLLP N T  K             +   F F  GFGLSDSRCLPCSGVSGRAA+ S RSLCAEH I   VKF PLNCTSLG SFTCKASS GHRR
Subjt:  MSQAIHLLPNNVTALKSLFSLHTENSVNFINMEFCFDIGFGLSDSRCLPCSGVSGRAATVSSRSLCAEHRIKAQVKFPPLNCTSLGASFTCKASSGGHRR

Query:  NPDFSKQNRHGFSRSRNRQNEERESLENLDESDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVD
        NPDF KQNR G+SRSRNRQNEERESLEN+DESDLLSS+NGPLLS+SSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVD
Subjt:  NPDFSKQNRHGFSRSRNRQNEERESLENLDESDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVD

Query:  SLLKLLRKHSVEQGKRSSGSGGSSSSSKDFSFNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSFSRPVSSFQRKSPVPRVKYQSIYPGESIVDCTDG
        SLLKLLRKH+VEQGKRSSG+GG   S+KD SFNHVKENGPYDEG+G+SIFGLS +LREKAQEP GSF RP S+FQR+SPVPRVKYQ IYPGESIVD T+G
Subjt:  SLLKLLRKHSVEQGKRSSGSGGSSSSSKDFSFNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSFSRPVSSFQRKSPVPRVKYQSIYPGESIVDCTDG

Query:  VSSKGVKLNGTETGSHLKAKVWTRQESEREAWEELQSQRVTGQEPEPDQEFELEAEPESYELEHEPDEMEPELVNLLGVSSDVDDTFDDNVKDNEKFAKH
        ++SKG+KLNGTETGS LKAKVWTRQESERE WEELQSQR T QEPE DQEFE+E E E+Y+LEHE DEMEPELVNLLGVSSD+DDTF+D++KDNE+F+KH
Subjt:  VSSKGVKLNGTETGSHLKAKVWTRQESEREAWEELQSQRVTGQEPEPDQEFELEAEPESYELEHEPDEMEPELVNLLGVSSDVDDTFDDNVKDNEKFAKH

Query:  EDHEDLNSLKLAELRAMAKSRGMKGFSKMKKSELVQLLSEA
         +HE+LNSLKLAELRA+AKSR ++GFSKMKKSELVQLLS +
Subjt:  EDHEDLNSLKLAELRAMAKSRGMKGFSKMKKSELVQLLSEA

A0A5D3DAG1 Rho-N domain-containing protein 1 | e value: 2.2e-180 | % identity: 83.9
Query:  MEFCFDIGFGLSDSRCLPCSGVSGRAATVSSRSLCAEHRIKAQVKFPPLNCTSLGASFTCKASSGGHRRNPDFSKQNRHGFSRSRNRQNEERESLENLDE
        ME  FDIGFGLSDSRCLPCSGVSGRAA+ S RSLCAEH I   VKF PLNCTSLG SFTCKASS GHRRNPDF KQNR G+SRSRNRQNEERESLEN+DE
Subjt:  MEFCFDIGFGLSDSRCLPCSGVSGRAATVSSRSLCAEHRIKAQVKFPPLNCTSLGASFTCKASSGGHRRNPDFSKQNRHGFSRSRNRQNEERESLENLDE

Query:  SDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGSGGSSSSSKDFS
        SDLLSS+NGPLLS+SSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKH+VEQGKRSSG+GG   S+KD S
Subjt:  SDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGSGGSSSSSKDFS

Query:  FNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSFSRPVSSFQRKSPVPRVKYQSIYPGESIVDCTDGVSSKGVKLNGTETGSHLKAKVWTRQESEREA
        FNHVKENGPYDEG+G+SIFGLS +LREKAQEP GSF RP S+FQR+SPVPRVKYQ IYPGESIVD T+G++SKG+KLNGTETGS LKAKVWTRQESERE 
Subjt:  FNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSFSRPVSSFQRKSPVPRVKYQSIYPGESIVDCTDGVSSKGVKLNGTETGSHLKAKVWTRQESEREA

Query:  WEELQSQRVTGQEPEPDQEFELEAEPESYELEHEPDEMEPELVNLLGVSSDVDDTFDDNVKDNEKFAKHEDHEDLNSLKLAELRAMAKSRGMKGFSKMKK
        WEELQSQR T QEPE DQEFE+E E E+Y+LEHE DEMEPELVNLLGVSSD+DDTF+D++KDNE+F+KH +HE+LNSLKLAELRA+AKSR ++GFSKMKK
Subjt:  WEELQSQRVTGQEPEPDQEFELEAEPESYELEHEPDEMEPELVNLLGVSSDVDDTFDDNVKDNEKFAKHEDHEDLNSLKLAELRAMAKSRGMKGFSKMKK

Query:  SELVQLLSEA
        SELVQLLS +
Subjt:  SELVQLLSEA

SwissProt top hits (e value, % identity, alignment)
Q8L4E7 SAP-like protein BP-73 | e value: 5.5e-35 | % identity: 35.39
Query:  KFPPLNCTSLGASFTCKASSGGHR-RNPDFSKQNRHGFSRSRNRQNEERESLENLDE--SDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQ
        K P L    L  S  C A+   HR R+ D ++  + G +R +++  +E++  EN+DE  +D++SSKNGP +SL+S  + QAT+ PG REKEIVELF++VQ
Subjt:  KFPPLNCTSLGASFTCKASSGGHR-RNPDFSKQNRHGFSRSRNRQNEERESLENLDE--SDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQ

Query:  AQLRERAAMKEEKKMEAQGQTKGSE-TVDSLLKLLRKHSVEQGKRSSGSGGSSSSSKDFSFNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSFSRPV
        AQLR R   KEEKK E Q + +G   +VDSLL LLRKHSV+Q ++       S   K+ S +  K +      + +SIF  + +  E+ +    +F RP 
Subjt:  AQLRERAAMKEEKKMEAQGQTKGSE-TVDSLLKLLRKHSVEQGKRSSGSGGSSSSSKDFSFNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSFSRPV

Query:  SSFQRKSPVPRVKYQSIYPGESIVDCTDGVSSKGVKLNGTETGSHLKAKVWTRQES-EREAWEELQSQRVTGQEPE----PDQEFELEAEPESYELEHEP
        S+F+R+SPVP VK+Q +            V ++ V  N  +     K  +  +  + E ++    +   V   EPE     D +   + EP++ + +   
Subjt:  SSFQRKSPVPRVKYQSIYPGESIVDCTDGVSSKGVKLNGTETGSHLKAKVWTRQES-EREAWEELQSQRVTGQEPE----PDQEFELEAEPESYELEHEP

Query:  DEMEPELVNLLGVSSDVDDTFDDNVKDNEKFAKHEDHEDLNSLKLAELRAMAKSRGMKGFSKMKKSELVQLLS
         E +   + +  V   +D++ D  +K +          DL++LK+ ELR +AKSRG+KG+SKMKK++LV+LLS
Subjt:  DEMEPELVNLLGVSSDVDDTFDDNVKDNEKFAKHEDHEDLNSLKLAELRAMAKSRGMKGFSKMKKSELVQLLS

Q94K75 Rho-N domain-containing protein 1, chloroplastic | e value: 1.7e-60 | % identity: 44.66
Query:  GFGLSDSRCLPCSGVSGRAATVSSRSLCAEHRIKAQVKFPPLNCTSLGASFTCKASSGGHRRNPDFSKQNRHGFSRSRNRQNEERESLENLDESDLLSSK
        G+ LSDSRC   S VS R   +   S C +H+   ++K  P       +SF C+ASSGG+RRNPDFS+ N+HG+ R  NRQ+  RE  + ++ SD+LSS+
Subjt:  GFGLSDSRCLPCSGVSGRAATVSSRSLCAEHRIKAQVKFPPLNCTSLGASFTCKASSGGHRRNPDFSKQNRHGFSRSRNRQNEERESLENLDESDLLSSK

Query:  NGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRER-AAMKEEKKME--AQGQTKGSETVDSLLKLLRKHSVEQGKRSSGSGGSSSSSKDFSFNHV
        NGPL +LSS+PK QAT++PGPREKEIVELFRKVQAQLR R AA KEEKK+E  ++GQ K SETVDSLLKLLRKHS EQ KR             FS    
Subjt:  NGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRER-AAMKEEKKME--AQGQTKGSETVDSLLKLLRKHSVEQGKRSSGSGGSSSSSKDFSFNHV

Query:  KENGPYDEGKGTSIFGLSASLREKAQEPTGSFSRPVSSFQRKSPVPRVKYQSIYPGESIVDCTDGVSSKGVKLNGTETGSHLKAKVWTRQESEREAWEEL
         +    D+   T     S +    A     SF+RP SSF+RKSPVPR +    Y  E+  D +   S          T +  K  V    E E E   E 
Subjt:  KENGPYDEGKGTSIFGLSASLREKAQEPTGSFSRPVSSFQRKSPVPRVKYQSIYPGESIVDCTDGVSSKGVKLNGTETGSHLKAKVWTRQESEREAWEEL

Query:  QSQRVTGQEPEP-----DQEFELEAEPESYELEHEPDEMEPELVN----LLGVSSDVDDTFDDNVKDNEKFAKHEDHEDLNSLKLAELRAMAKSRGMKGF
        + +     EP P     + + EL+ E  S+  E E D++  ++++    +L V SD D++ DD  +D+++ A+ E  +DL+ LKL ELR +AKSRG+KG 
Subjt:  QSQRVTGQEPEP-----DQEFELEAEPESYELEHEPDEMEPELVN----LLGVSSDVDDTFDDNVKDNEKFAKHEDHEDLNSLKLAELRAMAKSRGMKGF

Query:  SKMKKSELVQLL
        SKMKK+ELV+LL
Subjt:  SKMKKSELVQLL

Arabidopsis top hits (e value, % identity, alignment)
AT1G06190.1 Rho termination factor | e value: 1.2e-61 | % identity: 44.66
Query:  GFGLSDSRCLPCSGVSGRAATVSSRSLCAEHRIKAQVKFPPLNCTSLGASFTCKASSGGHRRNPDFSKQNRHGFSRSRNRQNEERESLENLDESDLLSSK
        G+ LSDSRC   S VS R   +   S C +H+   ++K  P       +SF C+ASSGG+RRNPDFS+ N+HG+ R  NRQ+  RE  + ++ SD+LSS+
Subjt:  GFGLSDSRCLPCSGVSGRAATVSSRSLCAEHRIKAQVKFPPLNCTSLGASFTCKASSGGHRRNPDFSKQNRHGFSRSRNRQNEERESLENLDESDLLSSK

Query:  NGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRER-AAMKEEKKME--AQGQTKGSETVDSLLKLLRKHSVEQGKRSSGSGGSSSSSKDFSFNHV
        NGPL +LSS+PK QAT++PGPREKEIVELFRKVQAQLR R AA KEEKK+E  ++GQ K SETVDSLLKLLRKHS EQ KR             FS    
Subjt:  NGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRER-AAMKEEKKME--AQGQTKGSETVDSLLKLLRKHSVEQGKRSSGSGGSSSSSKDFSFNHV

Query:  KENGPYDEGKGTSIFGLSASLREKAQEPTGSFSRPVSSFQRKSPVPRVKYQSIYPGESIVDCTDGVSSKGVKLNGTETGSHLKAKVWTRQESEREAWEEL
         +    D+   T     S +    A     SF+RP SSF+RKSPVPR +    Y  E+  D +   S          T +  K  V    E E E   E 
Subjt:  KENGPYDEGKGTSIFGLSASLREKAQEPTGSFSRPVSSFQRKSPVPRVKYQSIYPGESIVDCTDGVSSKGVKLNGTETGSHLKAKVWTRQESEREAWEEL

Query:  QSQRVTGQEPEP-----DQEFELEAEPESYELEHEPDEMEPELVN----LLGVSSDVDDTFDDNVKDNEKFAKHEDHEDLNSLKLAELRAMAKSRGMKGF
        + +     EP P     + + EL+ E  S+  E E D++  ++++    +L V SD D++ DD  +D+++ A+ E  +DL+ LKL ELR +AKSRG+KG 
Subjt:  QSQRVTGQEPEP-----DQEFELEAEPESYELEHEPDEMEPELVN----LLGVSSDVDDTFDDNVKDNEKFAKHEDHEDLNSLKLAELRAMAKSRGMKGF

Query:  SKMKKSELVQLL
        SKMKK+ELV+LL
Subjt:  SKMKKSELVQLL

AT1G06190.2 Rho termination factor | e value: 9.8e-48 | % identity: 49.44
Query:  GFGLSDSRCLPCSGVSGRAATVSSRSLCAEHRIKAQVKFPPLNCTSLGASFTCKASSGGHRRNPDFSKQNRHGFSRSRNRQNEERESLENLDESDLLSSK
        G+ LSDSRC   S VS R   +   S C +H+   ++K  P       +SF C+ASSGG+RRNPDFS+ N+HG+ R  NRQ+  RE  + ++ SD+LSS+
Subjt:  GFGLSDSRCLPCSGVSGRAATVSSRSLCAEHRIKAQVKFPPLNCTSLGASFTCKASSGGHRRNPDFSKQNRHGFSRSRNRQNEERESLENLDESDLLSSK

Query:  NGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRER-AAMKEEKKME--AQGQTKGSETVDSLLKLLRKHSVEQGKRSSGSGGSSSSSKDFSFNHV
        NGPL +LSS+PK QAT++PGPREKEIVELFRKVQAQLR R AA KEEKK+E  ++GQ K SETVDSLLKLLRKHS EQ KR             FS    
Subjt:  NGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRER-AAMKEEKKME--AQGQTKGSETVDSLLKLLRKHSVEQGKRSSGSGGSSSSSKDFSFNHV

Query:  KENGPYDEGKGTSIFGLSASLREKAQEPTGSFSRPVSSFQRKSPVPRVKYQSIYPGESIVDCTDGVS
         +    D+   T     S +    A     SF+RP SSF+RKSPVPR +    Y  E+  D +   S
Subjt:  KENGPYDEGKGTSIFGLSASLREKAQEPTGSFSRPVSSFQRKSPVPRVKYQSIYPGESIVDCTDGVS

AT2G31150.1 ATP binding;ATPases, coupled to transmembrane movement of ions, phosphorylative mechanism | e value: 6.9e-25 | % identity: 38.63
Query:  HRR-NPDFSKQNRHGFSRSRNRQNEERESL-ENLDESDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRER-AAMKEEKKMEAQGQTK
        HRR NPDFS+ N+HGF R RNR+NE+++ L +   E D+LSSKN                     EKEIVELF+KVQ QLR R AA KEEKK E   + +
Subjt:  HRR-NPDFSKQNRHGFSRSRNRQNEERESL-ENLDESDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRER-AAMKEEKKMEAQGQTK

Query:  G---SETVDSLLKLLRKHSVEQGKRSSGSGGSSSSSKDFSFNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSFSRPVSSFQRKSPVPRVKYQSIYPG
        G   SETVDSLLKLLRKHS EQ K+              +FN  K+    D+         S             F+RP SSF+R SPVPR K Q+ Y  
Subjt:  G---SETVDSLLKLLRKHSVEQGKRSSGSGGSSSSSKDFSFNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSFSRPVSSFQRKSPVPRVKYQSIYPG

Query:  ESIVDCTDGVSSKGVKLNGTETGSHLKAKVWTRQESEREAWEELQSQRVTGQEPEPDQEFELEAEPESYELEHEPDEMEPELVNLLGVSSDVDDTF---D
        E+I D               E  S+  +  WT+++ + E+ +E +       EPEP+   E + EPE  E E+EP E EPEL  L  VS    ++F   +
Subjt:  ESIVDCTDGVSSKGVKLNGTETGSHLKAKVWTRQESEREAWEELQSQRVTGQEPEPDQEFELEAEPESYELEHEPDEMEPELVNLLGVSSDVDDTF---D

Query:  DNVKDNEKFAKHEDHEDLNSL
        D  +++  F   E  +D  SL
Subjt:  DNVKDNEKFAKHEDHEDLNSL


Sequences
CDS sequence
ATGTCTCAAGCCATACATCTTCTTCCTAACAACGTTACAGCGTTGAAGTCCCTATTTTCGCTTCATACTGAAAATTCGGTAAATTTTATAAATATGGAATTTTGTTTTGA
TATAGGCTTTGGACTGTCAGATAGCAGATGCCTACCATGTTCTGGAGTTTCAGGACGAGCAGCCACTGTCTCTTCTCGCTCTTTATGTGCTGAACATAGAATCAAAGCAC
AGGTTAAATTCCCACCCCTAAACTGTACATCGTTGGGGGCTTCTTTTACGTGCAAAGCCAGCTCGGGAGGTCATAGGAGAAACCCAGACTTCTCAAAGCAAAATAGGCAT
GGCTTCTCAAGAAGCAGAAATAGGCAAAATGAGGAGAGAGAGAGCCTTGAAAATCTTGATGAATCTGATTTATTATCGTCTAAGAATGGACCATTACTTTCCCTCTCTAG
CACCCCAAAATCCCAGGCCACTGCTACCCCAGGCCCAAGGGAGAAGGAAATTGTTGAACTTTTCAGAAAGGTTCAAGCTCAGCTTCGGGAGCGCGCTGCGATGAAAGAAG
AGAAGAAAATGGAAGCACAAGGACAAACGAAAGGGAGCGAAACGGTGGATTCTCTTCTTAAGCTATTGAGAAAGCATTCAGTTGAGCAAGGGAAGAGAAGCAGTGGTAGT
GGTGGCAGCAGCAGCAGCAGCAAGGACTTCAGTTTTAACCATGTCAAAGAGAATGGTCCATATGATGAAGGAAAAGGCACAAGCATTTTTGGCCTAAGTGCCAGCTTGAG
AGAGAAGGCCCAAGAACCAACAGGATCTTTTAGTAGACCCGTATCGAGTTTTCAACGTAAATCCCCCGTGCCTCGGGTGAAATACCAATCAATTTACCCCGGGGAAAGTA
TTGTTGACTGCACCGATGGCGTGAGTTCAAAGGGGGTGAAACTTAATGGAACCGAGACAGGTTCTCACCTGAAGGCAAAGGTATGGACTCGACAAGAGTCAGAACGAGAG
GCCTGGGAAGAGCTGCAATCACAACGAGTGACAGGGCAGGAGCCAGAGCCAGACCAAGAGTTCGAATTGGAGGCAGAGCCTGAATCATATGAGCTAGAGCATGAACCTGA
TGAGATGGAGCCTGAACTCGTTAATTTATTAGGCGTATCTTCAGACGTCGATGACACATTTGACGACAATGTTAAAGACAACGAGAAATTTGCAAAGCATGAGGATCATG
AGGACTTGAACTCATTGAAGCTTGCTGAACTGAGGGCAATGGCCAAATCTCGCGGTATGAAAGGCTTCTCAAAGATGAAGAAGAGCGAGCTCGTGCAGTTGCTAAGCGAG
GCTCGAGTATGA
mRNA sequence
ATGTCTCAAGCCATACATCTTCTTCCTAACAACGTTACAGCGTTGAAGTCCCTATTTTCGCTTCATACTGAAAATTCGGTAAATTTTATAAATATGGAATTTTGTTTTGA
TATAGGCTTTGGACTGTCAGATAGCAGATGCCTACCATGTTCTGGAGTTTCAGGACGAGCAGCCACTGTCTCTTCTCGCTCTTTATGTGCTGAACATAGAATCAAAGCAC
AGGTTAAATTCCCACCCCTAAACTGTACATCGTTGGGGGCTTCTTTTACGTGCAAAGCCAGCTCGGGAGGTCATAGGAGAAACCCAGACTTCTCAAAGCAAAATAGGCAT
GGCTTCTCAAGAAGCAGAAATAGGCAAAATGAGGAGAGAGAGAGCCTTGAAAATCTTGATGAATCTGATTTATTATCGTCTAAGAATGGACCATTACTTTCCCTCTCTAG
CACCCCAAAATCCCAGGCCACTGCTACCCCAGGCCCAAGGGAGAAGGAAATTGTTGAACTTTTCAGAAAGGTTCAAGCTCAGCTTCGGGAGCGCGCTGCGATGAAAGAAG
AGAAGAAAATGGAAGCACAAGGACAAACGAAAGGGAGCGAAACGGTGGATTCTCTTCTTAAGCTATTGAGAAAGCATTCAGTTGAGCAAGGGAAGAGAAGCAGTGGTAGT
GGTGGCAGCAGCAGCAGCAGCAAGGACTTCAGTTTTAACCATGTCAAAGAGAATGGTCCATATGATGAAGGAAAAGGCACAAGCATTTTTGGCCTAAGTGCCAGCTTGAG
AGAGAAGGCCCAAGAACCAACAGGATCTTTTAGTAGACCCGTATCGAGTTTTCAACGTAAATCCCCCGTGCCTCGGGTGAAATACCAATCAATTTACCCCGGGGAAAGTA
TTGTTGACTGCACCGATGGCGTGAGTTCAAAGGGGGTGAAACTTAATGGAACCGAGACAGGTTCTCACCTGAAGGCAAAGGTATGGACTCGACAAGAGTCAGAACGAGAG
GCCTGGGAAGAGCTGCAATCACAACGAGTGACAGGGCAGGAGCCAGAGCCAGACCAAGAGTTCGAATTGGAGGCAGAGCCTGAATCATATGAGCTAGAGCATGAACCTGA
TGAGATGGAGCCTGAACTCGTTAATTTATTAGGCGTATCTTCAGACGTCGATGACACATTTGACGACAATGTTAAAGACAACGAGAAATTTGCAAAGCATGAGGATCATG
AGGACTTGAACTCATTGAAGCTTGCTGAACTGAGGGCAATGGCCAAATCTCGCGGTATGAAAGGCTTCTCAAAGATGAAGAAGAGCGAGCTCGTGCAGTTGCTAAGCGAG
GCTCGAGTATGA
Protein sequence
MSQAIHLLPNNVTALKSLFSLHTENSVNFINMEFCFDIGFGLSDSRCLPCSGVSGRAATVSSRSLCAEHRIKAQVKFPPLNCTSLGASFTCKASSGGHRRNPDFSKQNRH
GFSRSRNRQNEERESLENLDESDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGS
GGSSSSSKDFSFNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSFSRPVSSFQRKSPVPRVKYQSIYPGESIVDCTDGVSSKGVKLNGTETGSHLKAKVWTRQESERE
AWEELQSQRVTGQEPEPDQEFELEAEPESYELEHEPDEMEPELVNLLGVSSDVDDTFDDNVKDNEKFAKHEDHEDLNSLKLAELRAMAKSRGMKGFSKMKKSELVQLLSE
ARV
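
The CDS and protein sequences listed for Spg038294 can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch of such a check, not part of the database record: the function name `translate` and the choice of the standard nuclear code (NCBI translation table 1) are assumptions made for illustration, and only the first and last codons of the CDS are used as sample input.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order: first base varies slowest, third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS (hypothetical helper) up to the first stop codon.

    Whitespace and line breaks are removed first, so a sequence pasted
    from the wrapped listing above can be passed in directly.
    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the open reading frame
            break
        protein.append(aa)
    return "".join(protein)


# First 14 codons and last 4 codons of the CDS sequence listed above.
cds_start = "ATGTCTCAAGCCATACATCTTCTTCCTAACAACGTTACAGCG"
cds_end = "GCTCGAGTATGA"

print(translate(cds_start))  # MSQAIHLLPNNVTA
print(translate(cds_end))    # ARV
```

Passing the full CDS to `translate` and comparing the result with the protein sequence (after joining its wrapped lines) would confirm that the two listings agree over their whole length.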