; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc03g0066481 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc03g0066481
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr03:8440489..8441139
RNA-Seq ExpressionCmc03g0066481
SyntenyCmc03g0066481
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008194 - UDP-glycosyltransferase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025917.1 pol protein [Cucumis melo var. makuwa]1.2e-10996.14Show/hide
Query:  MQDFDVILGMDWLSTNHATIDCFNKEVVFNPPSGDKFKFRGAGMVGIPKVISAMKASKLLSQGTWGILASVVDVAEPEVFLSSEPVVREYPDVFPDELPG
        MQDFDVILGMDWLSTNHATIDCFNKEVVFNPPSGDKFKFRGAGMVGIPKVISAMKASKLLSQGTWGILA+VVD+AEPEV LSSEPVVREYPDVFPDELPG
Subjt:  MQDFDVILGMDWLSTNHATIDCFNKEVVFNPPSGDKFKFRGAGMVGIPKVISAMKASKLLSQGTWGILASVVDVAEPEVFLSSEPVVREYPDVFPDELPG

Query:  LPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIEDLFDQLQ
        LPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIEDLFDQLQ
Subjt:  LPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIEDLFDQLQ

Query:  GPPSFPR
        G   F +
Subjt:  GPPSFPR

KAA0025998.1 pol protein [Cucumis melo var. makuwa]5.2e-10593.24Show/hide
Query:  MQDFDVILGMDWLSTNHATIDCFNKEVVFNPPSGDKFKFRGAGMVGIPKVISAMKASKLLSQGTWGILASVVDVAEPEVFLSSEPVVREYPDVFPDELPG
        MQDFDVILGMDWLS NHA IDCF KEVVFNPPSG  FKFRGAGMV IPKVISAMKASKLLSQGTWGILASVVDV EPEV LSSEPVVREYPDVFPDELPG
Subjt:  MQDFDVILGMDWLSTNHATIDCFNKEVVFNPPSGDKFKFRGAGMVGIPKVISAMKASKLLSQGTWGILASVVDVAEPEVFLSSEPVVREYPDVFPDELPG

Query:  LPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIEDLFDQLQ
        LPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRI+DLFDQLQ
Subjt:  LPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIEDLFDQLQ

Query:  GPPSFPR
        G   F +
Subjt:  GPPSFPR

KAA0059723.1 pol protein [Cucumis melo var. makuwa]6.8e-10592.75Show/hide
Query:  MQDFDVILGMDWLSTNHATIDCFNKEVVFNPPSGDKFKFRGAGMVGIPKVISAMKASKLLSQGTWGILASVVDVAEPEVFLSSEPVVREYPDVFPDELPG
        MQDFDVILGMDWLS NHA IDCF KEVVFNPPSG  FKFRGAGMV IPKVISAMKASKLLSQGTWGILASVVD+ EPEV LSSEPVVREYPDVFPDELPG
Subjt:  MQDFDVILGMDWLSTNHATIDCFNKEVVFNPPSGDKFKFRGAGMVGIPKVISAMKASKLLSQGTWGILASVVDVAEPEVFLSSEPVVREYPDVFPDELPG

Query:  LPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIEDLFDQLQ
        LPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRI+DLFDQLQ
Subjt:  LPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIEDLFDQLQ

Query:  GPPSFPR
        G   F +
Subjt:  GPPSFPR

KAA0060745.1 pol protein [Cucumis melo var. makuwa]1.5e-10492.27Show/hide
Query:  MQDFDVILGMDWLSTNHATIDCFNKEVVFNPPSGDKFKFRGAGMVGIPKVISAMKASKLLSQGTWGILASVVDVAEPEVFLSSEPVVREYPDVFPDELPG
        MQDFDVILGMDWLS NHA IDCF KEVVFNPPSG  FKFRGAGMV IPKVISAMKASKLLSQGTWGILASVVD+ EPEV LSSEPVVREYPDVFPDELPG
Subjt:  MQDFDVILGMDWLSTNHATIDCFNKEVVFNPPSGDKFKFRGAGMVGIPKVISAMKASKLLSQGTWGILASVVDVAEPEVFLSSEPVVREYPDVFPDELPG

Query:  LPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIEDLFDQLQ
        LPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLP+I+DLFDQLQ
Subjt:  LPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIEDLFDQLQ

Query:  GPPSFPR
        G   F +
Subjt:  GPPSFPR

TYK01306.1 pol protein [Cucumis melo var. makuwa]6.8e-10592.75Show/hide
Query:  MQDFDVILGMDWLSTNHATIDCFNKEVVFNPPSGDKFKFRGAGMVGIPKVISAMKASKLLSQGTWGILASVVDVAEPEVFLSSEPVVREYPDVFPDELPG
        MQDFDVILGMDWLS NHA IDCF KEVVFNPPSG  FKFRGAGMV IPKVISAMKASKLLSQGTWGILASVVD+ EPEV LSSEPVVREYPDVFPDELPG
Subjt:  MQDFDVILGMDWLSTNHATIDCFNKEVVFNPPSGDKFKFRGAGMVGIPKVISAMKASKLLSQGTWGILASVVDVAEPEVFLSSEPVVREYPDVFPDELPG

Query:  LPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIEDLFDQLQ
        LPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRI+DLFDQLQ
Subjt:  LPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIEDLFDQLQ

Query:  GPPSFPR
        G   F +
Subjt:  GPPSFPR

TrEMBL top hitse value%identityAlignment
A0A5A7SIJ5 Reverse transcriptase2.5e-10593.24Show/hide
Query:  MQDFDVILGMDWLSTNHATIDCFNKEVVFNPPSGDKFKFRGAGMVGIPKVISAMKASKLLSQGTWGILASVVDVAEPEVFLSSEPVVREYPDVFPDELPG
        MQDFDVILGMDWLS NHA IDCF KEVVFNPPSG  FKFRGAGMV IPKVISAMKASKLLSQGTWGILASVVDV EPEV LSSEPVVREYPDVFPDELPG
Subjt:  MQDFDVILGMDWLSTNHATIDCFNKEVVFNPPSGDKFKFRGAGMVGIPKVISAMKASKLLSQGTWGILASVVDVAEPEVFLSSEPVVREYPDVFPDELPG

Query:  LPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIEDLFDQLQ
        LPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRI+DLFDQLQ
Subjt:  LPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIEDLFDQLQ

Query:  GPPSFPR
        G   F +
Subjt:  GPPSFPR

A0A5A7TP01 Reverse transcriptase5.8e-11096.14Show/hide
Query:  MQDFDVILGMDWLSTNHATIDCFNKEVVFNPPSGDKFKFRGAGMVGIPKVISAMKASKLLSQGTWGILASVVDVAEPEVFLSSEPVVREYPDVFPDELPG
        MQDFDVILGMDWLSTNHATIDCFNKEVVFNPPSGDKFKFRGAGMVGIPKVISAMKASKLLSQGTWGILA+VVD+AEPEV LSSEPVVREYPDVFPDELPG
Subjt:  MQDFDVILGMDWLSTNHATIDCFNKEVVFNPPSGDKFKFRGAGMVGIPKVISAMKASKLLSQGTWGILASVVDVAEPEVFLSSEPVVREYPDVFPDELPG

Query:  LPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIEDLFDQLQ
        LPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIEDLFDQLQ
Subjt:  LPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIEDLFDQLQ

Query:  GPPSFPR
        G   F +
Subjt:  GPPSFPR

A0A5A7UUX8 Reverse transcriptase3.3e-10592.75Show/hide
Query:  MQDFDVILGMDWLSTNHATIDCFNKEVVFNPPSGDKFKFRGAGMVGIPKVISAMKASKLLSQGTWGILASVVDVAEPEVFLSSEPVVREYPDVFPDELPG
        MQDFDVILGMDWLS NHA IDCF KEVVFNPPSG  FKFRGAGMV IPKVISAMKASKLLSQGTWGILASVVD+ EPEV LSSEPVVREYPDVFPDELPG
Subjt:  MQDFDVILGMDWLSTNHATIDCFNKEVVFNPPSGDKFKFRGAGMVGIPKVISAMKASKLLSQGTWGILASVVDVAEPEVFLSSEPVVREYPDVFPDELPG

Query:  LPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIEDLFDQLQ
        LPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRI+DLFDQLQ
Subjt:  LPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIEDLFDQLQ

Query:  GPPSFPR
        G   F +
Subjt:  GPPSFPR

A0A5A7V4E4 Reverse transcriptase7.4e-10592.27Show/hide
Query:  MQDFDVILGMDWLSTNHATIDCFNKEVVFNPPSGDKFKFRGAGMVGIPKVISAMKASKLLSQGTWGILASVVDVAEPEVFLSSEPVVREYPDVFPDELPG
        MQDFDVILGMDWLS NHA IDCF KEVVFNPPSG  FKFRGAGMV IPKVISAMKASKLLSQGTWGILASVVD+ EPEV LSSEPVVREYPDVFPDELPG
Subjt:  MQDFDVILGMDWLSTNHATIDCFNKEVVFNPPSGDKFKFRGAGMVGIPKVISAMKASKLLSQGTWGILASVVDVAEPEVFLSSEPVVREYPDVFPDELPG

Query:  LPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIEDLFDQLQ
        LPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLP+I+DLFDQLQ
Subjt:  LPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIEDLFDQLQ

Query:  GPPSFPR
        G   F +
Subjt:  GPPSFPR

A0A5D3BSV9 Reverse transcriptase3.3e-10592.75Show/hide
Query:  MQDFDVILGMDWLSTNHATIDCFNKEVVFNPPSGDKFKFRGAGMVGIPKVISAMKASKLLSQGTWGILASVVDVAEPEVFLSSEPVVREYPDVFPDELPG
        MQDFDVILGMDWLS NHA IDCF KEVVFNPPSG  FKFRGAGMV IPKVISAMKASKLLSQGTWGILASVVD+ EPEV LSSEPVVREYPDVFPDELPG
Subjt:  MQDFDVILGMDWLSTNHATIDCFNKEVVFNPPSGDKFKFRGAGMVGIPKVISAMKASKLLSQGTWGILASVVDVAEPEVFLSSEPVVREYPDVFPDELPG

Query:  LPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIEDLFDQLQ
        LPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRI+DLFDQLQ
Subjt:  LPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIEDLFDQLQ

Query:  GPPSFPR
        G   F +
Subjt:  GPPSFPR

SwissProt top hitse value%identityAlignment
P0CT41 Transposon Tf2-12 polyprotein3.2e-1231.62Show/hide
Query:  VAEPEVFLSSEPVVREYPDVFPD-ELPGLPPP-REVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRL
        V EPE+      + +E+ D+  +     LP P + ++F +EL      +    Y + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+
Subjt:  VAEPEVFLSSEPVVREYPDVFPD-ELPGLPPP-REVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRL

Query:  CIDYRELNKVTVKNRYPLPRIEDLFDQLQGPPSFPR
         +DY+ LNK    N YPLP IE L  ++QG   F +
Subjt:  CIDYRELNKVTVKNRYPLPRIEDLFDQLQGPPSFPR

P10394 Retrovirus-related Pol polyprotein from transposon 4126.4e-1331.82Show/hide
Query:  REYPDVFPDELPGLPPPREVDFAIELEPGTA--------------PISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDG------
        + +P++F  +L  +       FA+E EP T               P+    YR   ++++E++ Q+Q+L+    + PSVS + +P+L V KK        
Subjt:  REYPDVFPDELPGLPPPREVDFAIELEPGTA--------------PISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDG------

Query:  SMRLCIDYRELNKVTVKNRYPLPRIEDLFDQL
          RL IDYR++NK  + +++PLPRI+D+ DQL
Subjt:  SMRLCIDYRELNKVTVKNRYPLPRIEDLFDQL

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein3.3e-1733.33Show/hide
Query:  KASKLLSQGTWGILASVVDVAEPEVFLSSEP---------VVREYPDVFPDELPGLPPPREVD-----FAIELEPGTAPISRAPYRMAPAELKELKVQLQ
        +AS L   G +  + S +   EP     S           + ++Y ++  ++LP  P P +++       IE++PG       PY +     +E+   +Q
Subjt:  KASKLLSQGTWGILASVVDVAEPEVFLSSEP---------VVREYPDVFPDELPGLPPPREVD-----FAIELEPGTAPISRAPYRMAPAELKELKVQLQ

Query:  ELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIEDLFDQL
        +LLD  FI PS SP  +PV+ V KKDG+ RLC+DYR LNK T+ + +PLPRI++L  ++
Subjt:  ELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIEDLFDQL

Q99315 Transposon Ty3-G Gag-Pol polyprotein3.3e-1733.33Show/hide
Query:  KASKLLSQGTWGILASVVDVAEPEVFLSSEP---------VVREYPDVFPDELPGLPPPREVD-----FAIELEPGTAPISRAPYRMAPAELKELKVQLQ
        +AS L   G +  + S +   EP     S           + ++Y ++  ++LP  P P +++       IE++PG       PY +     +E+   +Q
Subjt:  KASKLLSQGTWGILASVVDVAEPEVFLSSEP---------VVREYPDVFPDELPGLPPPREVD-----FAIELEPGTAPISRAPYRMAPAELKELKVQLQ

Query:  ELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIEDLFDQL
        +LLD  FI PS SP  +PV+ V KKDG+ RLC+DYR LNK T+ + +PLPRI++L  ++
Subjt:  ELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIEDLFDQL

Q9UR07 Transposon Tf2-11 polyprotein3.2e-1231.62Show/hide
Query:  VAEPEVFLSSEPVVREYPDVFPD-ELPGLPPP-REVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRL
        V EPE+      + +E+ D+  +     LP P + ++F +EL      +    Y + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+
Subjt:  VAEPEVFLSSEPVVREYPDVFPD-ELPGLPPP-REVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRL

Query:  CIDYRELNKVTVKNRYPLPRIEDLFDQLQGPPSFPR
         +DY+ LNK    N YPLP IE L  ++QG   F +
Subjt:  CIDYRELNKVTVKNRYPLPRIEDLFDQLQGPPSFPR

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAGATTTCGATGTAATTTTAGGCATGGATTGGTTGTCCACTAACCATGCAACCATAGACTGTTTCAATAAGGAAGTAGTCTTTAACCCTCCTTCAGGGGATAAGTT
TAAATTTAGAGGAGCAGGCATGGTAGGTATACCCAAGGTCATCTCAGCAATGAAGGCGAGTAAGCTACTTAGCCAGGGTACTTGGGGTATCTTGGCAAGCGTAGTAGATG
TGGCAGAACCAGAAGTTTTCCTATCTTCTGAACCAGTAGTAAGGGAGTACCCTGACGTTTTCCCCGACGAACTCCCAGGACTTCCGCCTCCTAGGGAGGTAGACTTCGCC
ATCGAGTTAGAGCCCGGCACCGCCCCTATCTCTAGAGCTCCTTACAGAATGGCCCCAGCCGAATTAAAGGAGTTGAAAGTCCAGTTACAGGAGCTGTTGGACAAGGGCTT
TATCCGGCCCAGTGTATCACCGTGGGGAGCCCCAGTGTTGTTCGTGAAGAAGAAGGATGGGTCGATGCGCCTTTGCATTGACTACCGAGAGTTGAACAAGGTGACAGTTA
AAAACCGCTACCCCTTACCCAGGATTGAGGACTTGTTTGACCAGTTACAGGGGCCACCGTCTTTTCCAAGATCGACCTGCGATCAGGCTATCACCAGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCAAGATTTCGATGTAATTTTAGGCATGGATTGGTTGTCCACTAACCATGCAACCATAGACTGTTTCAATAAGGAAGTAGTCTTTAACCCTCCTTCAGGGGATAAGTT
TAAATTTAGAGGAGCAGGCATGGTAGGTATACCCAAGGTCATCTCAGCAATGAAGGCGAGTAAGCTACTTAGCCAGGGTACTTGGGGTATCTTGGCAAGCGTAGTAGATG
TGGCAGAACCAGAAGTTTTCCTATCTTCTGAACCAGTAGTAAGGGAGTACCCTGACGTTTTCCCCGACGAACTCCCAGGACTTCCGCCTCCTAGGGAGGTAGACTTCGCC
ATCGAGTTAGAGCCCGGCACCGCCCCTATCTCTAGAGCTCCTTACAGAATGGCCCCAGCCGAATTAAAGGAGTTGAAAGTCCAGTTACAGGAGCTGTTGGACAAGGGCTT
TATCCGGCCCAGTGTATCACCGTGGGGAGCCCCAGTGTTGTTCGTGAAGAAGAAGGATGGGTCGATGCGCCTTTGCATTGACTACCGAGAGTTGAACAAGGTGACAGTTA
AAAACCGCTACCCCTTACCCAGGATTGAGGACTTGTTTGACCAGTTACAGGGGCCACCGTCTTTTCCAAGATCGACCTGCGATCAGGCTATCACCAGTTGA
Protein sequenceShow/hide protein sequence
MQDFDVILGMDWLSTNHATIDCFNKEVVFNPPSGDKFKFRGAGMVGIPKVISAMKASKLLSQGTWGILASVVDVAEPEVFLSSEPVVREYPDVFPDELPGLPPPREVDFA
IELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIEDLFDQLQGPPSFPRSTCDQAITS