; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh15G011110 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh15G011110
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionReverse transcriptase
Genome locationCmo_Chr15:7604404..7605003
RNA-Seq ExpressionCmoCh15G011110
SyntenyCmoCh15G011110
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR032567 - LDOC1-related
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK07954.1 reverse transcriptase [Cucumis melo var. makuwa]3.4e-9081.91Show/hide
Query:  MAPPELAELSKQLDELLTAGFIRPIKVPYRSPVLFQKKKDGTLLLCIDYRALNKMTVHNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK
        MAPPELAEL KQLDELL AGFIRP K PY +PVLFQKKKDG+L LCIDYRALNK+TV NKYPLPII+DLFD+LHGAKYF+KLDLRSGYYQVRIAEGDEPK
Subjt:  MAPPELAELSKQLDELLTAGFIRPIKVPYRSPVLFQKKKDGTLLLCIDYRALNKMTVHNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK

Query:  TKCVTRYGAFEFLVMPFRLTNAPATFCTLMNQIFYKYLDQFVVVYLNNIVVYSTALEEHKVHLKLVFDKLCQNQLYVKKEKCALAQTCINFLGHVISCG
        T CVTRYGAFEFLVMPF LTNAPATFCTLMNQ+F++YLD+FVVVYL++IVVYST +EEH+ HL+ VF KL +NQLYVK+EKC+ AQ  INFLGHVI CG
Subjt:  TKCVTRYGAFEFLVMPFRLTNAPATFCTLMNQIFYKYLDQFVVVYLNNIVVYSTALEEHKVHLKLVFDKLCQNQLYVKKEKCALAQTCINFLGHVISCG

XP_022150099.1 uncharacterized protein LOC111018360 [Momordica charantia]2.6e-9083.42Show/hide
Query:  MAPPELAELSKQLDELLTAGFIRPIKVPYRSPVLFQKKKDGTLLLCIDYRALNKMTVHNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK
        MAPPELAEL KQLDELL AGFIRP K P+ +PVLFQKKKDGTL LCIDYRALNK+TV NKYPLPII+DLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK
Subjt:  MAPPELAELSKQLDELLTAGFIRPIKVPYRSPVLFQKKKDGTLLLCIDYRALNKMTVHNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK

Query:  TKCVTRYGAFEFLVMPFRLTNAPATFCTLMNQIFYKYLDQFVVVYLNNIVVYSTALEEHKVHLKLVFDKLCQNQLYVKKEKCALAQTCINFLGHVISCG
        T CVTRYGAFEFLVM F LTNAPATFCT+MNQ+F++YLDQFVVVYL++IVVYS  L+EH++HL+LVFDKL QNQLYVKKEKCA AQ  I FLGHVI  G
Subjt:  TKCVTRYGAFEFLVMPFRLTNAPATFCTLMNQIFYKYLDQFVVVYLNNIVVYSTALEEHKVHLKLVFDKLCQNQLYVKKEKCALAQTCINFLGHVISCG

XP_023524533.1 uncharacterized protein LOC111788429 [Cucurbita pepo subsp. pepo]4.8e-9788.44Show/hide
Query:  MAPPELAELSKQLDELLTAGFIRPIKVPYRSPVLFQKKKDGTLLLCIDYRALNKMTVHNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK
        MAPPELAEL KQLDELL AGFIRP K PY +PVLFQKKKDGTL LCIDYRALNK+TV NKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRI EGDEPK
Subjt:  MAPPELAELSKQLDELLTAGFIRPIKVPYRSPVLFQKKKDGTLLLCIDYRALNKMTVHNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK

Query:  TKCVTRYGAFEFLVMPFRLTNAPATFCTLMNQIFYKYLDQFVVVYLNNIVVYSTALEEHKVHLKLVFDKLCQNQLYVKKEKCALAQTCINFLGHVISCG
        T CVTRYGAFEFLVMPF LTNAPATF TLMNQ+FY+YLDQFV+VYL++IVVYST LEEHKVHLKLVFDKL QNQLYVKKEKCA AQTCINFLGHV+ CG
Subjt:  TKCVTRYGAFEFLVMPFRLTNAPATFCTLMNQIFYKYLDQFVVVYLNNIVVYSTALEEHKVHLKLVFDKLCQNQLYVKKEKCALAQTCINFLGHVISCG

XP_023526180.1 uncharacterized protein LOC111789739 [Cucurbita pepo subsp. pepo]3.3e-9888.94Show/hide
Query:  MAPPELAELSKQLDELLTAGFIRPIKVPYRSPVLFQKKKDGTLLLCIDYRALNKMTVHNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK
        MAPPELAEL KQLDELL AGFIRP K PY +PVLFQKKKDGTL LCIDYRALNK+TV NKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK
Subjt:  MAPPELAELSKQLDELLTAGFIRPIKVPYRSPVLFQKKKDGTLLLCIDYRALNKMTVHNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK

Query:  TKCVTRYGAFEFLVMPFRLTNAPATFCTLMNQIFYKYLDQFVVVYLNNIVVYSTALEEHKVHLKLVFDKLCQNQLYVKKEKCALAQTCINFLGHVISCG
        T CVTRYGAFEFLVMPF LTNAPATFCTLMNQ+FY+YLDQFV+VYL++IVVYST LEEHKVHLKLVFDKL QNQLYVKKEKCA AQTCI+FLGHV+ CG
Subjt:  TKCVTRYGAFEFLVMPFRLTNAPATFCTLMNQIFYKYLDQFVVVYLNNIVVYSTALEEHKVHLKLVFDKLCQNQLYVKKEKCALAQTCINFLGHVISCG

XP_023537907.1 uncharacterized protein LOC111798805 [Cucurbita pepo subsp. pepo]3.3e-9888.94Show/hide
Query:  MAPPELAELSKQLDELLTAGFIRPIKVPYRSPVLFQKKKDGTLLLCIDYRALNKMTVHNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK
        MAPPELAEL KQLDELL AGFIRP K PY +PVLFQKKKDGTL LCIDYRALNK+TV NKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK
Subjt:  MAPPELAELSKQLDELLTAGFIRPIKVPYRSPVLFQKKKDGTLLLCIDYRALNKMTVHNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK

Query:  TKCVTRYGAFEFLVMPFRLTNAPATFCTLMNQIFYKYLDQFVVVYLNNIVVYSTALEEHKVHLKLVFDKLCQNQLYVKKEKCALAQTCINFLGHVISCG
        T CVTRYGAFEFLVMPF LTNAPATFCTLMNQ+FY+YLDQFV+VYL++IVVYST LEEHKVHLKLVFDKL QNQLYVKKEKCA AQTCI+FLGHV+ CG
Subjt:  TKCVTRYGAFEFLVMPFRLTNAPATFCTLMNQIFYKYLDQFVVVYLNNIVVYSTALEEHKVHLKLVFDKLCQNQLYVKKEKCALAQTCINFLGHVISCG

TrEMBL top hitse value%identityAlignment
A0A5A7UXR6 Reverse transcriptase3.6e-9081.41Show/hide
Query:  MAPPELAELSKQLDELLTAGFIRPIKVPYRSPVLFQKKKDGTLLLCIDYRALNKMTVHNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK
        MAPPELAEL KQLDELL AGFIRP K PY +PVLFQ+KKDG+L LCIDYRALNK+TV NKYPLPII+DLFD+LHGAKYF+KLDLRSGYYQVRIAEGDEPK
Subjt:  MAPPELAELSKQLDELLTAGFIRPIKVPYRSPVLFQKKKDGTLLLCIDYRALNKMTVHNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK

Query:  TKCVTRYGAFEFLVMPFRLTNAPATFCTLMNQIFYKYLDQFVVVYLNNIVVYSTALEEHKVHLKLVFDKLCQNQLYVKKEKCALAQTCINFLGHVISCG
        T CVTRYGAFEFLVMPF LTNAPATFCTLMNQ+F++YLD+FVVVYL++IVVYST +EEH+ HL+ VF KL +NQLYVK+EKC+ AQ  INFLGHVI CG
Subjt:  TKCVTRYGAFEFLVMPFRLTNAPATFCTLMNQIFYKYLDQFVVVYLNNIVVYSTALEEHKVHLKLVFDKLCQNQLYVKKEKCALAQTCINFLGHVISCG

A0A5D3BRZ6 Reverse transcriptase3.6e-9081.41Show/hide
Query:  MAPPELAELSKQLDELLTAGFIRPIKVPYRSPVLFQKKKDGTLLLCIDYRALNKMTVHNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK
        MAPPELAEL KQLDELL AGFIRP K PY +PVLFQ+KKDG+L LCIDYRALNK+TV NKYPLPII+DLFD+LHGAKYF+KLDLRSGYYQVRIAEGDEPK
Subjt:  MAPPELAELSKQLDELLTAGFIRPIKVPYRSPVLFQKKKDGTLLLCIDYRALNKMTVHNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK

Query:  TKCVTRYGAFEFLVMPFRLTNAPATFCTLMNQIFYKYLDQFVVVYLNNIVVYSTALEEHKVHLKLVFDKLCQNQLYVKKEKCALAQTCINFLGHVISCG
        T CVTRYGAFEFLVMPF LTNAPATFCTLMNQ+F++YLD+FVVVYL++IVVYST +EEH+ HL+ VF KL +NQLYVK+EKC+ AQ  INFLGHVI CG
Subjt:  TKCVTRYGAFEFLVMPFRLTNAPATFCTLMNQIFYKYLDQFVVVYLNNIVVYSTALEEHKVHLKLVFDKLCQNQLYVKKEKCALAQTCINFLGHVISCG

A0A5D3C4R1 Reverse transcriptase3.6e-9081.41Show/hide
Query:  MAPPELAELSKQLDELLTAGFIRPIKVPYRSPVLFQKKKDGTLLLCIDYRALNKMTVHNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK
        MAPPELAEL KQLDELL AGFIRP K PY +PVLFQ+KKDG+L LCIDYRALNK+TV NKYPLPII+DLFD+LHGAKYF+KLDLRSGYYQVRIAEGDEPK
Subjt:  MAPPELAELSKQLDELLTAGFIRPIKVPYRSPVLFQKKKDGTLLLCIDYRALNKMTVHNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK

Query:  TKCVTRYGAFEFLVMPFRLTNAPATFCTLMNQIFYKYLDQFVVVYLNNIVVYSTALEEHKVHLKLVFDKLCQNQLYVKKEKCALAQTCINFLGHVISCG
        T CVTRYGAFEFLVMPF LTNAPATFCTLMNQ+F++YLD+FVVVYL++IVVYST +EEH+ HL+ VF KL +NQLYVK+EKC+ AQ  INFLGHVI CG
Subjt:  TKCVTRYGAFEFLVMPFRLTNAPATFCTLMNQIFYKYLDQFVVVYLNNIVVYSTALEEHKVHLKLVFDKLCQNQLYVKKEKCALAQTCINFLGHVISCG

A0A5D3C9P8 Reverse transcriptase1.6e-9081.91Show/hide
Query:  MAPPELAELSKQLDELLTAGFIRPIKVPYRSPVLFQKKKDGTLLLCIDYRALNKMTVHNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK
        MAPPELAEL KQLDELL AGFIRP K PY +PVLFQKKKDG+L LCIDYRALNK+TV NKYPLPII+DLFD+LHGAKYF+KLDLRSGYYQVRIAEGDEPK
Subjt:  MAPPELAELSKQLDELLTAGFIRPIKVPYRSPVLFQKKKDGTLLLCIDYRALNKMTVHNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK

Query:  TKCVTRYGAFEFLVMPFRLTNAPATFCTLMNQIFYKYLDQFVVVYLNNIVVYSTALEEHKVHLKLVFDKLCQNQLYVKKEKCALAQTCINFLGHVISCG
        T CVTRYGAFEFLVMPF LTNAPATFCTLMNQ+F++YLD+FVVVYL++IVVYST +EEH+ HL+ VF KL +NQLYVK+EKC+ AQ  INFLGHVI CG
Subjt:  TKCVTRYGAFEFLVMPFRLTNAPATFCTLMNQIFYKYLDQFVVVYLNNIVVYSTALEEHKVHLKLVFDKLCQNQLYVKKEKCALAQTCINFLGHVISCG

A0A6J1D906 Reverse transcriptase1.2e-9083.42Show/hide
Query:  MAPPELAELSKQLDELLTAGFIRPIKVPYRSPVLFQKKKDGTLLLCIDYRALNKMTVHNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK
        MAPPELAEL KQLDELL AGFIRP K P+ +PVLFQKKKDGTL LCIDYRALNK+TV NKYPLPII+DLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK
Subjt:  MAPPELAELSKQLDELLTAGFIRPIKVPYRSPVLFQKKKDGTLLLCIDYRALNKMTVHNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK

Query:  TKCVTRYGAFEFLVMPFRLTNAPATFCTLMNQIFYKYLDQFVVVYLNNIVVYSTALEEHKVHLKLVFDKLCQNQLYVKKEKCALAQTCINFLGHVISCG
        T CVTRYGAFEFLVM F LTNAPATFCT+MNQ+F++YLDQFVVVYL++IVVYS  L+EH++HL+LVFDKL QNQLYVKKEKCA AQ  I FLGHVI  G
Subjt:  TKCVTRYGAFEFLVMPFRLTNAPATFCTLMNQIFYKYLDQFVVVYLNNIVVYSTALEEHKVHLKLVFDKLCQNQLYVKKEKCALAQTCINFLGHVISCG

SwissProt top hitse value%identityAlignment
P0CT42 Transposon Tf2-7 polyprotein1.6e-3437.56Show/hide
Query:  MAPPELAELSKQLDELLTAGFIRPIKVPYRSPVLFQKKKDGTLLLCIDYRALNKMTVHNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK
        + P ++  ++ ++++ L +G IR  K     PV+F  KK+GTL + +DY+ LNK    N YPLP+I  L  ++ G+  FTKLDL+S Y+ +R+ +GDE K
Subjt:  MAPPELAELSKQLDELLTAGFIRPIKVPYRSPVLFQKKKDGTLLLCIDYRALNKMTVHNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK

Query:  TKCVTRYGAFEFLVMPFRLTNAPATFCTLMNQIFYKYLDQFVVVYLNNIVVYSTALEEHKVHLKLVFDKLCQNQLYVKKEKCALAQTCINFLGHVIS
               G FE+LVMP+ ++ APA F   +N I  +  +  VV Y++NI+++S +  EH  H+K V  KL    L + + KC   Q+ + F+G+ IS
Subjt:  TKCVTRYGAFEFLVMPFRLTNAPATFCTLMNQIFYKYLDQFVVVYLNNIVVYSTALEEHKVHLKLVFDKLCQNQLYVKKEKCALAQTCINFLGHVIS

P31843 RNA-directed DNA polymerase homolog3.8e-4467.19Show/hide
Query:  TLLLCIDYRALNKMTVHNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTKCVTRYGAFEFLVMPFRLTNAPATFCTLMNQIFYKYLDQF
        +L +CIDYRAL K+T+ NKYP+P + DLFD+L  A +FTKLDLRSGY+QVRIA+GDEPKT CVTRYG+FEF VMPF LTNA ATFC LMN + Y+YLD F
Subjt:  TLLLCIDYRALNKMTVHNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTKCVTRYGAFEFLVMPFRLTNAPATFCTLMNQIFYKYLDQF

Query:  VVVYLNNIVV---YSTALEEHKVHLKLV
        VVVYL+++VV   YS +L EH  HL++V
Subjt:  VVVYLNNIVV---YSTALEEHKVHLKLV

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein9.0e-3844.97Show/hide
Query:  ELSKQLDELLTAGFIRPIKVPYRSPVLFQKKKDGTLLLCIDYRALNKMTVHNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTKCVTRY
        E++K + +LL   FI P K P  SPV+   KKDGT  LC+DYR LNK T+ + +PLP I +L  ++  A+ FT LDL SGY+Q+ +   D  KT  VT  
Subjt:  ELSKQLDELLTAGFIRPIKVPYRSPVLFQKKKDGTLLLCIDYRALNKMTVHNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTKCVTRY

Query:  GAFEFLVMPFRLTNAPATFCTLMNQIFYKYLDQFVVVYLNNIVVYSTALEEHKVHLKLVFDKLCQNQLYVKKEKCALAQTCINFLGHVI
        G +E+ VMPF L NAP+TF   M   F     +FV VYL++I+++S + EEH  HL  V ++L    L VKK+KC  A     FLG+ I
Subjt:  GAFEFLVMPFRLTNAPATFCTLMNQIFYKYLDQFVVVYLNNIVVYSTALEEHKVHLKLVFDKLCQNQLYVKKEKCALAQTCINFLGHVI

Q99315 Transposon Ty3-G Gag-Pol polyprotein9.0e-3844.97Show/hide
Query:  ELSKQLDELLTAGFIRPIKVPYRSPVLFQKKKDGTLLLCIDYRALNKMTVHNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTKCVTRY
        E++K + +LL   FI P K P  SPV+   KKDGT  LC+DYR LNK T+ + +PLP I +L  ++  A+ FT LDL SGY+Q+ +   D  KT  VT  
Subjt:  ELSKQLDELLTAGFIRPIKVPYRSPVLFQKKKDGTLLLCIDYRALNKMTVHNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTKCVTRY

Query:  GAFEFLVMPFRLTNAPATFCTLMNQIFYKYLDQFVVVYLNNIVVYSTALEEHKVHLKLVFDKLCQNQLYVKKEKCALAQTCINFLGHVI
        G +E+ VMPF L NAP+TF   M   F     +FV VYL++I+++S + EEH  HL  V ++L    L VKK+KC  A     FLG+ I
Subjt:  GAFEFLVMPFRLTNAPATFCTLMNQIFYKYLDQFVVVYLNNIVVYSTALEEHKVHLKLVFDKLCQNQLYVKKEKCALAQTCINFLGHVI

Q9UR07 Transposon Tf2-11 polyprotein1.6e-3437.56Show/hide
Query:  MAPPELAELSKQLDELLTAGFIRPIKVPYRSPVLFQKKKDGTLLLCIDYRALNKMTVHNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK
        + P ++  ++ ++++ L +G IR  K     PV+F  KK+GTL + +DY+ LNK    N YPLP+I  L  ++ G+  FTKLDL+S Y+ +R+ +GDE K
Subjt:  MAPPELAELSKQLDELLTAGFIRPIKVPYRSPVLFQKKKDGTLLLCIDYRALNKMTVHNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK

Query:  TKCVTRYGAFEFLVMPFRLTNAPATFCTLMNQIFYKYLDQFVVVYLNNIVVYSTALEEHKVHLKLVFDKLCQNQLYVKKEKCALAQTCINFLGHVIS
               G FE+LVMP+ ++ APA F   +N I  +  +  VV Y++NI+++S +  EH  H+K V  KL    L + + KC   Q+ + F+G+ IS
Subjt:  TKCVTRYGAFEFLVMPFRLTNAPATFCTLMNQIFYKYLDQFVVVYLNNIVVYSTALEEHKVHLKLVFDKLCQNQLYVKKEKCALAQTCINFLGHVIS

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCCCCCTGAGCTAGCCGAATTGAGTAAACAACTAGATGAGTTGTTGACGGCAGGATTCATCCGCCCGATAAAGGTGCCTTATAGATCCCCCGTACTATTT
CAGAAAAAGAAGGATGGGACGTTGCTTCTGTGCATAGATTATAGAGCCTTAAACAAGATGACGGTACACAACAAATACCCACTGCCAATAATATCCGACTTGTTT
GACCAACTTCACGGGGCCAAATACTTCACGAAGTTGGACTTACGATCAGGGTACTACCAAGTACGTATCGCCGAAGGGGACGAACCCAAGACAAAGTGTGTAACA
AGATATGGGGCCTTTGAGTTCCTGGTAATGCCCTTTCGCTTGACAAACGCCCCAGCTACGTTTTGCACGTTAATGAACCAGATTTTCTACAAATACTTGGATCAG
TTCGTCGTAGTATACCTCAACAACATAGTTGTATACAGCACAGCCTTAGAGGAACACAAGGTGCACCTGAAGCTAGTATTTGACAAGCTGTGTCAAAATCAGTTG
TACGTCAAGAAAGAAAAATGTGCTCTCGCACAAACATGTATCAACTTCCTTGGACATGTCATCAGTTGTGGATAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTCCCCCTGAGCTAGCCGAATTGAGTAAACAACTAGATGAGTTGTTGACGGCAGGATTCATCCGCCCGATAAAGGTGCCTTATAGATCCCCCGTACTATTT
CAGAAAAAGAAGGATGGGACGTTGCTTCTGTGCATAGATTATAGAGCCTTAAACAAGATGACGGTACACAACAAATACCCACTGCCAATAATATCCGACTTGTTT
GACCAACTTCACGGGGCCAAATACTTCACGAAGTTGGACTTACGATCAGGGTACTACCAAGTACGTATCGCCGAAGGGGACGAACCCAAGACAAAGTGTGTAACA
AGATATGGGGCCTTTGAGTTCCTGGTAATGCCCTTTCGCTTGACAAACGCCCCAGCTACGTTTTGCACGTTAATGAACCAGATTTTCTACAAATACTTGGATCAG
TTCGTCGTAGTATACCTCAACAACATAGTTGTATACAGCACAGCCTTAGAGGAACACAAGGTGCACCTGAAGCTAGTATTTGACAAGCTGTGTCAAAATCAGTTG
TACGTCAAGAAAGAAAAATGTGCTCTCGCACAAACATGTATCAACTTCCTTGGACATGTCATCAGTTGTGGATAG
Protein sequenceShow/hide protein sequence
MAPPELAELSKQLDELLTAGFIRPIKVPYRSPVLFQKKKDGTLLLCIDYRALNKMTVHNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTKCVT
RYGAFEFLVMPFRLTNAPATFCTLMNQIFYKYLDQFVVVYLNNIVVYSTALEEHKVHLKLVFDKLCQNQLYVKKEKCALAQTCINFLGHVISCG