; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh11G011370 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh11G011370
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationCmo_Chr11:6423587..6424516
RNA-Seq ExpressionCmoCh11G011370
SyntenyCmoCh11G011370
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022926214.1 uncharacterized protein LOC111433394 [Cucurbita moschata]6.9e-15193.73Show/hide
Query:  LKEKKKMTEQNIQQIELGAQLNREFENLAMMPNQERITANAIHLADDRERAIQAYAHPAVEELNSCIIRPEMQATTLEIKPVMFQMLQTIGQFHGLPSED
        +KEKKKMTEQNIQQIELGAQLNREFEN AMM NQERITANAIHLADDRERAI+AYAHPAVEELN CIIRPEMQATT E+KPVMFQMLQTIGQFHGLPSED
Subjt:  LKEKKKMTEQNIQQIELGAQLNREFENLAMMPNQERITANAIHLADDRERAIQAYAHPAVEELNSCIIRPEMQATTLEIKPVMFQMLQTIGQFHGLPSED

Query:  PHLHLKSFLGVSDSFRFQGVDKDVIRLSLFPYSLRDGAKSWLNTLAPGTIDSWNSLVEKFFIKYFPPTRNSRFRNEIVAFQQFEDETLSEAWERFKEMLR
        PHLHLKSFLGVSDSFRFQ VDKDVIRLSLFPYSLRDGAKSWLNTLA GTIDSWNSLVEKF IKYFPPTRN+RFRNEIV FQQFED+TLSEAWERFKEMLR
Subjt:  PHLHLKSFLGVSDSFRFQGVDKDVIRLSLFPYSLRDGAKSWLNTLAPGTIDSWNSLVEKFFIKYFPPTRNSRFRNEIVAFQQFEDETLSEAWERFKEMLR

Query:  KCPYYGLSHCIQMETFYNGLNIATKQVVDASANGAILSKTYNEAYEILERIASNNCQWDDVRNNPGRKTRGVLEVDALSSINAQLAS
        KCP++GL HCIQMETFYNGLNIATKQVVDASANGAILSKTYNEAYEILERIASNNCQW DVR+NPGRKTRGVLEVDALSSINAQLAS
Subjt:  KCPYYGLSHCIQMETFYNGLNIATKQVVDASANGAILSKTYNEAYEILERIASNNCQWDDVRNNPGRKTRGVLEVDALSSINAQLAS

XP_022929949.1 uncharacterized protein LOC111436411 [Cucurbita moschata]5.8e-15889.56Show/hide
Query:  MNPPTGLEFILDPEIERTFRRRLKEKKKMTEQNIQQIELGAQLNREFENLAMMPNQERITANAIHLADDRERAIQAYAHPAVEELNSCIIRPEMQATTLE
        MNPPTGLEFILDPEIERTFRRRLK+KKKMTEQNIQQIELGAQLNREFEN AMM NQERITAN IHLADDRERAI+AYAHPAVEELN CIIRPE+Q TT E
Subjt:  MNPPTGLEFILDPEIERTFRRRLKEKKKMTEQNIQQIELGAQLNREFENLAMMPNQERITANAIHLADDRERAIQAYAHPAVEELNSCIIRPEMQATTLE

Query:  IKPVMFQMLQTIGQFHGLPSEDPHLHLKSFLGV-------SDSFRFQGVDKDVIRLSLFPYSLRDGAKSWLNTLAPGTIDSWNSLVEKFFIKYFPPTRNS
        +KPVMFQMLQTIGQFHGLP EDPHLHLKSFLGV       SDSFRFQGVDKD+IRLSLFPY LRDGAKSWLNTLAPGTIDSWNSL E F IKYFPPTRN+
Subjt:  IKPVMFQMLQTIGQFHGLPSEDPHLHLKSFLGV-------SDSFRFQGVDKDVIRLSLFPYSLRDGAKSWLNTLAPGTIDSWNSLVEKFFIKYFPPTRNS

Query:  RFRNEIVAFQQFEDETLSEAWERFKEMLRKCPYYGLSHCIQMETFYNGLNIATKQVVDASANGAILSKTYNEAYEILERIASNNCQWDDVRNNPGRKTRG
        RF+NEIV FQQFEDETLSEA ERFKEMLRKCP++GL HCIQMETFYNGLNI TKQVVDASANGAILSKTYNEAYEILERIASNNCQW DVR+NPGRKTRG
Subjt:  RFRNEIVAFQQFEDETLSEAWERFKEMLRKCPYYGLSHCIQMETFYNGLNIATKQVVDASANGAILSKTYNEAYEILERIASNNCQWDDVRNNPGRKTRG

Query:  VLEVDALSSINAQLAS
        VLEVDALSSINAQLAS
Subjt:  VLEVDALSSINAQLAS

XP_022947838.1 uncharacterized protein LOC111451598 [Cucurbita moschata]1.5e-12687.94Show/hide
Query:  MPNQERITANAIHLADDRERAIQAYAHPAVEELNSCIIRPEMQATTLEIKPVMFQMLQTIGQFHGLPSEDPHLHLKSFLGVSDSFRFQGVDKDVIRLSLF
        M NQERITANAIH+ADDRERAI+AYAHPAVEELN CIIRPEMQATT E+KPVMFQMLQTIGQFHGL S+DPHLHLKSFLGVSDSFRFQGVDKDVIRLS F
Subjt:  MPNQERITANAIHLADDRERAIQAYAHPAVEELNSCIIRPEMQATTLEIKPVMFQMLQTIGQFHGLPSEDPHLHLKSFLGVSDSFRFQGVDKDVIRLSLF

Query:  PYSLRDGAKSWLNTLAPGTIDSWNSLVEKFFIKYFPPTRNSRFRNEIVAFQQFEDETLSEAWERFKEMLRKCPYYGLSHCIQMETFYNGLNIATKQVVDA
         YSLRDGAKSWLN LA G IDSWNSL EKF  KYFPPTR++RFRNEIVAFQ+FE+ETLSEAWERFKE LRKCP++GL HCIQ+ETFYNGLN ATKQVVDA
Subjt:  PYSLRDGAKSWLNTLAPGTIDSWNSLVEKFFIKYFPPTRNSRFRNEIVAFQQFEDETLSEAWERFKEMLRKCPYYGLSHCIQMETFYNGLNIATKQVVDA

Query:  SANGAILSKTYNEAYEILERIASNNCQWDDVRNNPGRKTRGVLEVDALSSINAQLAS
        SANG ILSKTYNEAYEILERIASNNCQW DVR+NPG+KTR VLEVDALSSINAQLAS
Subjt:  SANGAILSKTYNEAYEILERIASNNCQWDDVRNNPGRKTRGVLEVDALSSINAQLAS

XP_022960432.1 uncharacterized protein LOC111461168 [Cucurbita moschata]1.0e-14190.39Show/hide
Query:  MTEQNIQQIELGAQLNREFENLAMMPNQERITANAIHLADDRERAIQAYAHPAVEELNSCIIRPEMQATTLEIKPVMFQMLQTIGQFHGLPSEDPHLHLK
        M EQNIQQ+   AQLN+EFEN  MM NQERI ANAI LADDRERAI+AYAHPAV+ELN CIIRPEMQATT E+KPVMFQMLQTIGQFHGLPSEDPHLHLK
Subjt:  MTEQNIQQIELGAQLNREFENLAMMPNQERITANAIHLADDRERAIQAYAHPAVEELNSCIIRPEMQATTLEIKPVMFQMLQTIGQFHGLPSEDPHLHLK

Query:  SFLGVSDSFRFQGVDKDVIRLSLFPYSLRDGAKSWLNTLAPGTIDSWNSLVEKFFIKYFPPTRNSRFRNEIVAFQQFEDETLSEAWERFKEMLRKCPYYG
        SFLGVSDSFRFQGVDKDVIRLSLFPYSLRDGAKSWLNTLAP TIDSWNSL EKF IKYFPPTRN+RFRNEIVAFQQFEDETLSEAWERFKEMLRKCP++G
Subjt:  SFLGVSDSFRFQGVDKDVIRLSLFPYSLRDGAKSWLNTLAPGTIDSWNSLVEKFFIKYFPPTRNSRFRNEIVAFQQFEDETLSEAWERFKEMLRKCPYYG

Query:  LSHCIQMETFYNGLNIATKQVVDASANGAILSKTYNEAYEILERIASNNCQWDDVRNNPGRKTRGVLEVDALSSINAQLAS
        L HCIQMETFYNGLNIATKQVVDASANGA+LSKTYNEAYEILERIASNNCQW DVR+NPG+KTRGVLEVDALSSINAQLAS
Subjt:  LSHCIQMETFYNGLNIATKQVVDASANGAILSKTYNEAYEILERIASNNCQWDDVRNNPGRKTRGVLEVDALSSINAQLAS

XP_023511572.1 uncharacterized protein LOC111776371 [Cucurbita pepo subsp. pepo]1.9e-13286.83Show/hide
Query:  MTEQNIQQIELGAQLNREFENLAMMPNQERITANAIHLADDRERAIQAYAHPAVEELNSCIIRPEMQATTLEIKPVMFQMLQTIGQFHGLPSEDPHLHLK
        M +QNIQQ+   AQLNREFEN AMM NQ+RITANAIHLADDRERAI+AYAH AVEEL  CIIRPE Q TT E+KPVMFQMLQTIG+FHGL SEDPHLHLK
Subjt:  MTEQNIQQIELGAQLNREFENLAMMPNQERITANAIHLADDRERAIQAYAHPAVEELNSCIIRPEMQATTLEIKPVMFQMLQTIGQFHGLPSEDPHLHLK

Query:  SFLGVSDSFRFQGVDKDVIRLSLFPYSLRDGAKSWLNTLAPGTIDSWNSLVEKFFIKYFPPTRNSRFRNEIVAFQQFEDETLSEAWERFKEMLRKCPYYG
        SFLGVSDSFRFQGVDKDVIRLSLFPYSLRDGAKSWLNTLA GTIDSWNSL +KFFIKYF PTRN+RFRNEIVAFQQFEDETLSEAWERFKEMLRKCP++G
Subjt:  SFLGVSDSFRFQGVDKDVIRLSLFPYSLRDGAKSWLNTLAPGTIDSWNSLVEKFFIKYFPPTRNSRFRNEIVAFQQFEDETLSEAWERFKEMLRKCPYYG

Query:  LSHCIQMETFYNGLNIATKQVVDASANGAILSKTYNEAYEILERIASNNCQWDDVRNNPGRKTRGVLEVDALSSINAQLAS
           CIQMETFY+GLNIATKQVVDASANGAILSKT NE YEILERIASNNCQW DVR+NPG+KTR  LEVDALSSINAQLAS
Subjt:  LSHCIQMETFYNGLNIATKQVVDASANGAILSKTYNEAYEILERIASNNCQWDDVRNNPGRKTRGVLEVDALSSINAQLAS

TrEMBL top hitse value%identityAlignment
A0A6J1EEI2 uncharacterized protein LOC1114333943.3e-15193.73Show/hide
Query:  LKEKKKMTEQNIQQIELGAQLNREFENLAMMPNQERITANAIHLADDRERAIQAYAHPAVEELNSCIIRPEMQATTLEIKPVMFQMLQTIGQFHGLPSED
        +KEKKKMTEQNIQQIELGAQLNREFEN AMM NQERITANAIHLADDRERAI+AYAHPAVEELN CIIRPEMQATT E+KPVMFQMLQTIGQFHGLPSED
Subjt:  LKEKKKMTEQNIQQIELGAQLNREFENLAMMPNQERITANAIHLADDRERAIQAYAHPAVEELNSCIIRPEMQATTLEIKPVMFQMLQTIGQFHGLPSED

Query:  PHLHLKSFLGVSDSFRFQGVDKDVIRLSLFPYSLRDGAKSWLNTLAPGTIDSWNSLVEKFFIKYFPPTRNSRFRNEIVAFQQFEDETLSEAWERFKEMLR
        PHLHLKSFLGVSDSFRFQ VDKDVIRLSLFPYSLRDGAKSWLNTLA GTIDSWNSLVEKF IKYFPPTRN+RFRNEIV FQQFED+TLSEAWERFKEMLR
Subjt:  PHLHLKSFLGVSDSFRFQGVDKDVIRLSLFPYSLRDGAKSWLNTLAPGTIDSWNSLVEKFFIKYFPPTRNSRFRNEIVAFQQFEDETLSEAWERFKEMLR

Query:  KCPYYGLSHCIQMETFYNGLNIATKQVVDASANGAILSKTYNEAYEILERIASNNCQWDDVRNNPGRKTRGVLEVDALSSINAQLAS
        KCP++GL HCIQMETFYNGLNIATKQVVDASANGAILSKTYNEAYEILERIASNNCQW DVR+NPGRKTRGVLEVDALSSINAQLAS
Subjt:  KCPYYGLSHCIQMETFYNGLNIATKQVVDASANGAILSKTYNEAYEILERIASNNCQWDDVRNNPGRKTRGVLEVDALSSINAQLAS

A0A6J1EQ90 uncharacterized protein LOC1114364112.8e-15889.56Show/hide
Query:  MNPPTGLEFILDPEIERTFRRRLKEKKKMTEQNIQQIELGAQLNREFENLAMMPNQERITANAIHLADDRERAIQAYAHPAVEELNSCIIRPEMQATTLE
        MNPPTGLEFILDPEIERTFRRRLK+KKKMTEQNIQQIELGAQLNREFEN AMM NQERITAN IHLADDRERAI+AYAHPAVEELN CIIRPE+Q TT E
Subjt:  MNPPTGLEFILDPEIERTFRRRLKEKKKMTEQNIQQIELGAQLNREFENLAMMPNQERITANAIHLADDRERAIQAYAHPAVEELNSCIIRPEMQATTLE

Query:  IKPVMFQMLQTIGQFHGLPSEDPHLHLKSFLGV-------SDSFRFQGVDKDVIRLSLFPYSLRDGAKSWLNTLAPGTIDSWNSLVEKFFIKYFPPTRNS
        +KPVMFQMLQTIGQFHGLP EDPHLHLKSFLGV       SDSFRFQGVDKD+IRLSLFPY LRDGAKSWLNTLAPGTIDSWNSL E F IKYFPPTRN+
Subjt:  IKPVMFQMLQTIGQFHGLPSEDPHLHLKSFLGV-------SDSFRFQGVDKDVIRLSLFPYSLRDGAKSWLNTLAPGTIDSWNSLVEKFFIKYFPPTRNS

Query:  RFRNEIVAFQQFEDETLSEAWERFKEMLRKCPYYGLSHCIQMETFYNGLNIATKQVVDASANGAILSKTYNEAYEILERIASNNCQWDDVRNNPGRKTRG
        RF+NEIV FQQFEDETLSEA ERFKEMLRKCP++GL HCIQMETFYNGLNI TKQVVDASANGAILSKTYNEAYEILERIASNNCQW DVR+NPGRKTRG
Subjt:  RFRNEIVAFQQFEDETLSEAWERFKEMLRKCPYYGLSHCIQMETFYNGLNIATKQVVDASANGAILSKTYNEAYEILERIASNNCQWDDVRNNPGRKTRG

Query:  VLEVDALSSINAQLAS
        VLEVDALSSINAQLAS
Subjt:  VLEVDALSSINAQLAS

A0A6J1G7Q6 uncharacterized protein LOC1114515987.5e-12787.94Show/hide
Query:  MPNQERITANAIHLADDRERAIQAYAHPAVEELNSCIIRPEMQATTLEIKPVMFQMLQTIGQFHGLPSEDPHLHLKSFLGVSDSFRFQGVDKDVIRLSLF
        M NQERITANAIH+ADDRERAI+AYAHPAVEELN CIIRPEMQATT E+KPVMFQMLQTIGQFHGL S+DPHLHLKSFLGVSDSFRFQGVDKDVIRLS F
Subjt:  MPNQERITANAIHLADDRERAIQAYAHPAVEELNSCIIRPEMQATTLEIKPVMFQMLQTIGQFHGLPSEDPHLHLKSFLGVSDSFRFQGVDKDVIRLSLF

Query:  PYSLRDGAKSWLNTLAPGTIDSWNSLVEKFFIKYFPPTRNSRFRNEIVAFQQFEDETLSEAWERFKEMLRKCPYYGLSHCIQMETFYNGLNIATKQVVDA
         YSLRDGAKSWLN LA G IDSWNSL EKF  KYFPPTR++RFRNEIVAFQ+FE+ETLSEAWERFKE LRKCP++GL HCIQ+ETFYNGLN ATKQVVDA
Subjt:  PYSLRDGAKSWLNTLAPGTIDSWNSLVEKFFIKYFPPTRNSRFRNEIVAFQQFEDETLSEAWERFKEMLRKCPYYGLSHCIQMETFYNGLNIATKQVVDA

Query:  SANGAILSKTYNEAYEILERIASNNCQWDDVRNNPGRKTRGVLEVDALSSINAQLAS
        SANG ILSKTYNEAYEILERIASNNCQW DVR+NPG+KTR VLEVDALSSINAQLAS
Subjt:  SANGAILSKTYNEAYEILERIASNNCQWDDVRNNPGRKTRGVLEVDALSSINAQLAS

A0A6J1H7E4 uncharacterized protein LOC1114611684.8e-14290.39Show/hide
Query:  MTEQNIQQIELGAQLNREFENLAMMPNQERITANAIHLADDRERAIQAYAHPAVEELNSCIIRPEMQATTLEIKPVMFQMLQTIGQFHGLPSEDPHLHLK
        M EQNIQQ+   AQLN+EFEN  MM NQERI ANAI LADDRERAI+AYAHPAV+ELN CIIRPEMQATT E+KPVMFQMLQTIGQFHGLPSEDPHLHLK
Subjt:  MTEQNIQQIELGAQLNREFENLAMMPNQERITANAIHLADDRERAIQAYAHPAVEELNSCIIRPEMQATTLEIKPVMFQMLQTIGQFHGLPSEDPHLHLK

Query:  SFLGVSDSFRFQGVDKDVIRLSLFPYSLRDGAKSWLNTLAPGTIDSWNSLVEKFFIKYFPPTRNSRFRNEIVAFQQFEDETLSEAWERFKEMLRKCPYYG
        SFLGVSDSFRFQGVDKDVIRLSLFPYSLRDGAKSWLNTLAP TIDSWNSL EKF IKYFPPTRN+RFRNEIVAFQQFEDETLSEAWERFKEMLRKCP++G
Subjt:  SFLGVSDSFRFQGVDKDVIRLSLFPYSLRDGAKSWLNTLAPGTIDSWNSLVEKFFIKYFPPTRNSRFRNEIVAFQQFEDETLSEAWERFKEMLRKCPYYG

Query:  LSHCIQMETFYNGLNIATKQVVDASANGAILSKTYNEAYEILERIASNNCQWDDVRNNPGRKTRGVLEVDALSSINAQLAS
        L HCIQMETFYNGLNIATKQVVDASANGA+LSKTYNEAYEILERIASNNCQW DVR+NPG+KTRGVLEVDALSSINAQLAS
Subjt:  LSHCIQMETFYNGLNIATKQVVDASANGAILSKTYNEAYEILERIASNNCQWDDVRNNPGRKTRGVLEVDALSSINAQLAS

U5CUI2 Retrotrans_gag domain-containing protein1.2e-9565.5Show/hide
Query:  MMPNQERITANAIHLADDRERAIQAYAHPAVEELNSCIIRPEMQATTLEIKPVMFQMLQTIGQFHGLPSEDPHLHLKSFLGVSDSFRFQGVDKDVIRLSL
        MM    +   N I LADDR RAI+ YA P   ELN  I+RPE+QA   E+KPVMFQMLQT+GQF G+P+EDPHLHL+SFL VSDSF+ QGV ++V+RL L
Subjt:  MMPNQERITANAIHLADDRERAIQAYAHPAVEELNSCIIRPEMQATTLEIKPVMFQMLQTIGQFHGLPSEDPHLHLKSFLGVSDSFRFQGVDKDVIRLSL

Query:  FPYSLRDGAKSWLNTLAPGTIDSWNSLVEKFFIKYFPPTRNSRFRNEIVAFQQFEDETLSEAWERFKEMLRKCPYYGLSHCIQMETFYNGLNIATKQVVD
        FP+SLRD A+SWLNTL P ++ +WN L EKF  KYFPPTRN++FR+EI++FQQ EDE+ S+AWERFKE+LRKCP++G+ HCIQMETFYNGLN A++ V+D
Subjt:  FPYSLRDGAKSWLNTLAPGTIDSWNSLVEKFFIKYFPPTRNSRFRNEIVAFQQFEDETLSEAWERFKEMLRKCPYYGLSHCIQMETFYNGLNIATKQVVD

Query:  ASANGAILSKTYNEAYEILERIASNNCQWDDVRNNPGRKTRGVLEVDALSSINAQLAS
        ASANGAILSK+YNEA+EILE IASNN QW + R    RK  GVLEVDA++++ AQ+AS
Subjt:  ASANGAILSKTYNEAYEILERIASNNCQWDDVRNNPGRKTRGVLEVDALSSINAQLAS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACCCACCTACGGGTTTAGAATTTATTTTGGATCCTGAAATAGAGAGAACATTTAGACGAAGGTTGAAGGAGAAAAAGAAAATGACAGAACAGAATATACAACAAAT
AGAACTTGGAGCCCAGTTGAATCGGGAGTTTGAAAATCTAGCGATGATGCCTAATCAAGAGAGAATTACTGCCAATGCCATTCATTTAGCAGATGACAGAGAAAGAGCGA
TCCAAGCATATGCACACCCAGCTGTGGAAGAGCTAAATTCGTGCATCATTAGACCCGAAATGCAAGCAACCACGCTTGAGATAAAACCTGTGATGTTTCAAATGTTGCAA
ACCATTGGGCAATTTCATGGACTACCGTCGGAAGATCCTCACCTACACCTAAAGTCATTTTTGGGAGTTAGTGACTCATTTCGATTCCAAGGAGTGGATAAAGATGTGAT
TAGACTGAGCTTATTCCCATATTCATTGAGGGATGGTGCTAAATCATGGTTGAATACCTTAGCACCGGGAACAATTGATTCGTGGAATAGTCTAGTAGAGAAATTTTTTA
TCAAGTATTTCCCACCCACTAGAAATTCACGGTTCAGAAATGAGATTGTTGCTTTTCAACAGTTTGAAGATGAGACACTAAGTGAAGCTTGGGAGAGATTTAAGGAGATG
CTTCGAAAGTGCCCTTACTATGGACTATCTCATTGTATACAAATGGAGACTTTCTACAATGGATTAAATATTGCTACTAAACAAGTAGTTGATGCTTCCGCCAATGGAGC
TATTTTGTCAAAGACATACAATGAAGCATATGAGATTTTAGAGAGAATAGCATCTAACAATTGTCAATGGGACGATGTGAGAAACAACCCAGGAAGGAAGACTCGAGGAG
TACTTGAAGTTGATGCTTTGTCCTCTATCAATGCTCAACTAGCTTCGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAACCCACCTACGGGTTTAGAATTTATTTTGGATCCTGAAATAGAGAGAACATTTAGACGAAGGTTGAAGGAGAAAAAGAAAATGACAGAACAGAATATACAACAAAT
AGAACTTGGAGCCCAGTTGAATCGGGAGTTTGAAAATCTAGCGATGATGCCTAATCAAGAGAGAATTACTGCCAATGCCATTCATTTAGCAGATGACAGAGAAAGAGCGA
TCCAAGCATATGCACACCCAGCTGTGGAAGAGCTAAATTCGTGCATCATTAGACCCGAAATGCAAGCAACCACGCTTGAGATAAAACCTGTGATGTTTCAAATGTTGCAA
ACCATTGGGCAATTTCATGGACTACCGTCGGAAGATCCTCACCTACACCTAAAGTCATTTTTGGGAGTTAGTGACTCATTTCGATTCCAAGGAGTGGATAAAGATGTGAT
TAGACTGAGCTTATTCCCATATTCATTGAGGGATGGTGCTAAATCATGGTTGAATACCTTAGCACCGGGAACAATTGATTCGTGGAATAGTCTAGTAGAGAAATTTTTTA
TCAAGTATTTCCCACCCACTAGAAATTCACGGTTCAGAAATGAGATTGTTGCTTTTCAACAGTTTGAAGATGAGACACTAAGTGAAGCTTGGGAGAGATTTAAGGAGATG
CTTCGAAAGTGCCCTTACTATGGACTATCTCATTGTATACAAATGGAGACTTTCTACAATGGATTAAATATTGCTACTAAACAAGTAGTTGATGCTTCCGCCAATGGAGC
TATTTTGTCAAAGACATACAATGAAGCATATGAGATTTTAGAGAGAATAGCATCTAACAATTGTCAATGGGACGATGTGAGAAACAACCCAGGAAGGAAGACTCGAGGAG
TACTTGAAGTTGATGCTTTGTCCTCTATCAATGCTCAACTAGCTTCGTGA
Protein sequenceShow/hide protein sequence
MNPPTGLEFILDPEIERTFRRRLKEKKKMTEQNIQQIELGAQLNREFENLAMMPNQERITANAIHLADDRERAIQAYAHPAVEELNSCIIRPEMQATTLEIKPVMFQMLQ
TIGQFHGLPSEDPHLHLKSFLGVSDSFRFQGVDKDVIRLSLFPYSLRDGAKSWLNTLAPGTIDSWNSLVEKFFIKYFPPTRNSRFRNEIVAFQQFEDETLSEAWERFKEM
LRKCPYYGLSHCIQMETFYNGLNIATKQVVDASANGAILSKTYNEAYEILERIASNNCQWDDVRNNPGRKTRGVLEVDALSSINAQLAS