; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi05G011340 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi05G011340
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionReverse transcriptase
Genome locationchr05:19112156..19113213
RNA-Seq ExpressionLsi05G011340
SyntenyLsi05G011340
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0016020 - membrane (cellular component)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0068077.1 reverse transcriptase [Cucumis melo var. makuwa]3.6e-10380.88Show/hide
Query:  LRPRIPEETDGFPPMEELEAYKIVFETYTF-GSEQTPY-GDDDPEVEVEVDFQEPMEHFPEEIQILPENHLLDEIEAKTEELKEAQIENRDESQDSSKAM
        LR R PEE  GFPP+EELEAYKIVFETYTF GSEQ PY GDD+P  E+EVDFQEPME+FP+E QILP     DE EAKTEE KEAQIENR          
Subjt:  LRPRIPEETDGFPPMEELEAYKIVFETYTF-GSEQTPY-GDDDPEVEVEVDFQEPMEHFPEEIQILPENHLLDEIEAKTEELKEAQIENRDESQDSSKAM

Query:  EKEMTKDLRKITEESSISSRSESSPWSSPGSFSSREYNS---LGSYGSMRKEKEWRRTLACKLFDERHNSEGTEGMDSLWETYENSESKTLQKKEKINGK
        E EM KDLRK+ EESSISSR+ESSPWSSPGSFSSREYNS   LGSYGSMRKEKEWRRTLACKLF+ERHNSEGTEGMDSLWETYENSESK LQKKEK+NGK
Subjt:  EKEMTKDLRKITEESSISSRSESSPWSSPGSFSSREYNS---LGSYGSMRKEKEWRRTLACKLFDERHNSEGTEGMDSLWETYENSESKTLQKKEKINGK

Query:  TKKGKKIQNKKEEDEEEEEEDREGQLCCLQALKFSAGKMNLGMGRPNLLKMTKALKGFGWLSRNGSRKRLIH
         KKGKKIQ KK +D++EEEED EGQLCCLQALKFSAGKMNLGMG+PNLLKMTKALKGFGWL+RNGSRKRLIH
Subjt:  TKKGKKIQNKKEEDEEEEEEDREGQLCCLQALKFSAGKMNLGMGRPNLLKMTKALKGFGWLSRNGSRKRLIH

KAG7036245.1 hypothetical protein SDJN02_03047, partial [Cucurbita argyrosperma subsp. argyrosperma]4.5e-9875.74Show/hide
Query:  LRPRIPEETDGFPPMEELEAYKIVFETYTFGSEQTPYGDDD-----PEVEVEVDFQEPMEHFPEEIQILPENHLLDEIEAKTEELKEAQIENRDESQDSS
        LRPR PEE D F PMEELEAYKIVFETYTFG+EQ  YG +D     PEVEVEVD +E ME+FP +++  PEN L  E EAKT E +E   E ++E +DSS
Subjt:  LRPRIPEETDGFPPMEELEAYKIVFETYTFGSEQTPYGDDD-----PEVEVEVDFQEPMEHFPEEIQILPENHLLDEIEAKTEELKEAQIENRDESQDSS

Query:  KAMEKEMTKDLRKITEESSISSRSESSPWSSPGSFSSREYNSLGSYGSMRKEKEWRRTLACKLFDERHNSEGTEGMDSLWETYENSESKTLQKKEKINGK
        KA+E EM K+LRK+TEESSISSRSESSPWSSPGSF SR+Y SLGSYGSMRKEKEWRRTLACKLF+ERHNSEGTEGMDSLWETYE SE  TLQK EKINGK
Subjt:  KAMEKEMTKDLRKITEESSISSRSESSPWSSPGSFSSREYNSLGSYGSMRKEKEWRRTLACKLFDERHNSEGTEGMDSLWETYENSESKTLQKKEKINGK

Query:  TKKGKKIQNKKEEDEEEEEEDREGQLCCLQALKFSAGKMNLGMGRPNLLKMTKALKGFGWLSRNGSRKRLIH
        +KKGKKI+ K+E+++E+E EDREGQLCCLQALKFSAGKMNLGMGRPNL+KM+KALKGFGWLSR+GSRKR +H
Subjt:  TKKGKKIQNKKEEDEEEEEEDREGQLCCLQALKFSAGKMNLGMGRPNLLKMTKALKGFGWLSRNGSRKRLIH

XP_004144391.1 uncharacterized protein LOC101214978 [Cucumis sativus]3.3e-10178.23Show/hide
Query:  LRPRIPEETDGFPPMEELEAYKIVFETYTF-GSEQTPYGDDDPEVEVEVDFQEPMEHFPEEIQILPENHLLDEIEAKTEELKEAQIENRDESQDSSKAME
        LR R PEE  GFPP+EELEAYKIVFETYTF GSEQ PYG DD   E+EVDFQE ME+FP+E QILP     DE EAKTEE KEAQI NR          E
Subjt:  LRPRIPEETDGFPPMEELEAYKIVFETYTF-GSEQTPYGDDDPEVEVEVDFQEPMEHFPEEIQILPENHLLDEIEAKTEELKEAQIENRDESQDSSKAME

Query:  KEMTKDLRKITEESSISSRSESSPWSSPGSFSSREYN---SLGSYGSMRKEKEWRRTLACKLFDERHNSEGTEGMDSLWETYENSESKTLQKKEKINGKT
         EM KDLRK+TEESSISSR+ESSPWSSPGSFSSREYN   +LGSYGSMRKEKEWRRTLACKLF+ERHNSEGTEGMDSLWETYENSESK LQKKEK+NGK+
Subjt:  KEMTKDLRKITEESSISSRSESSPWSSPGSFSSREYN---SLGSYGSMRKEKEWRRTLACKLFDERHNSEGTEGMDSLWETYENSESKTLQKKEKINGKT

Query:  KKGKKIQNKKEEDEEEEEEDREGQLCCLQALKFSAGKMNLGMGRPNLLKMTKALKGFGWLSRNGSRKRLIH
         KGKKIQ K ++D+EEEE+  +GQLCCLQALKFSAGKMNLGMG+PNLLKMTKALKGFGWL+RNGSRK+LIH
Subjt:  KKGKKIQNKKEEDEEEEEEDREGQLCCLQALKFSAGKMNLGMGRPNLLKMTKALKGFGWLSRNGSRKRLIH

XP_016902526.1 PREDICTED: uncharacterized protein LOC103499217 [Cucumis melo]3.6e-10380.88Show/hide
Query:  LRPRIPEETDGFPPMEELEAYKIVFETYTF-GSEQTPY-GDDDPEVEVEVDFQEPMEHFPEEIQILPENHLLDEIEAKTEELKEAQIENRDESQDSSKAM
        LR R PEE  GFPP+EELEAYKIVFETYTF GSEQ PY GDD+P  E+EVDFQEPME+FP+E QILP     DE EAKTEE KEAQIENR          
Subjt:  LRPRIPEETDGFPPMEELEAYKIVFETYTF-GSEQTPY-GDDDPEVEVEVDFQEPMEHFPEEIQILPENHLLDEIEAKTEELKEAQIENRDESQDSSKAM

Query:  EKEMTKDLRKITEESSISSRSESSPWSSPGSFSSREYNS---LGSYGSMRKEKEWRRTLACKLFDERHNSEGTEGMDSLWETYENSESKTLQKKEKINGK
        E EM KDLRK+ EESSISSR+ESSPWSSPGSFSSREYNS   LGSYGSMRKEKEWRRTLACKLF+ERHNSEGTEGMDSLWETYENSESK LQKKEK+NGK
Subjt:  EKEMTKDLRKITEESSISSRSESSPWSSPGSFSSREYNS---LGSYGSMRKEKEWRRTLACKLFDERHNSEGTEGMDSLWETYENSESKTLQKKEKINGK

Query:  TKKGKKIQNKKEEDEEEEEEDREGQLCCLQALKFSAGKMNLGMGRPNLLKMTKALKGFGWLSRNGSRKRLIH
         KKGKKIQ KK +D++EEEED EGQLCCLQALKFSAGKMNLGMG+PNLLKMTKALKGFGWL+RNGSRKRLIH
Subjt:  TKKGKKIQNKKEEDEEEEEEDREGQLCCLQALKFSAGKMNLGMGRPNLLKMTKALKGFGWLSRNGSRKRLIH

XP_038887002.1 stress response protein NST1 [Benincasa hispida]1.3e-11687.05Show/hide
Query:  LRPRIPEETDGF-PPMEELEAYKIVFETYTFGSEQTPYG-DDDPEVEVEVDFQEPMEHFPEEIQILPENHLLDEIEAKTEELKEAQIEN--------RDE
        LRPR PEETDGF PPMEELEAYKIVFETYTFGSEQTPYG DDDPEVEVE DFQEPME+FPEEIQIL EN LL  IE KTEELKEAQIEN        RDE
Subjt:  LRPRIPEETDGF-PPMEELEAYKIVFETYTFGSEQTPYG-DDDPEVEVEVDFQEPMEHFPEEIQILPENHLLDEIEAKTEELKEAQIEN--------RDE

Query:  SQDSSKAMEKEMTKDLRKITEESSISSRSESSPWSSPGSFSSREYNSLGSYGSMRKEKEWRRTLACKLFDERHNSEGTEGMDSLWETYENSESKTLQKKE
        ++DSSKAME EM KDLRKITEESSISSR+ESSPWSSPGSFSSREYNSLGSYGSMRKEKEWRRTLACKLF+ERHNSEGTEGMDSLWETYE SESK LQK+ 
Subjt:  SQDSSKAMEKEMTKDLRKITEESSISSRSESSPWSSPGSFSSREYNSLGSYGSMRKEKEWRRTLACKLFDERHNSEGTEGMDSLWETYENSESKTLQKKE

Query:  KINGKTKKGKKIQNKKEEDEEEEEEDREGQLCCLQALKFSAGKMNLGMGRPNLLKMTKALKGFGWLSRNGSRK-RLIH
        KINGK KKGKKIQ K+EED  EEEED EGQLCCLQALKFSAGKMNLGMGRPNLLKMTKALKGFGWLSRNGSR+ RLIH
Subjt:  KINGKTKKGKKIQNKKEEDEEEEEEDREGQLCCLQALKFSAGKMNLGMGRPNLLKMTKALKGFGWLSRNGSRK-RLIH

TrEMBL top hitse value%identityAlignment
A0A0A0L955 Uncharacterized protein1.6e-10178.23Show/hide
Query:  LRPRIPEETDGFPPMEELEAYKIVFETYTF-GSEQTPYGDDDPEVEVEVDFQEPMEHFPEEIQILPENHLLDEIEAKTEELKEAQIENRDESQDSSKAME
        LR R PEE  GFPP+EELEAYKIVFETYTF GSEQ PYG DD   E+EVDFQE ME+FP+E QILP     DE EAKTEE KEAQI NR          E
Subjt:  LRPRIPEETDGFPPMEELEAYKIVFETYTF-GSEQTPYGDDDPEVEVEVDFQEPMEHFPEEIQILPENHLLDEIEAKTEELKEAQIENRDESQDSSKAME

Query:  KEMTKDLRKITEESSISSRSESSPWSSPGSFSSREYN---SLGSYGSMRKEKEWRRTLACKLFDERHNSEGTEGMDSLWETYENSESKTLQKKEKINGKT
         EM KDLRK+TEESSISSR+ESSPWSSPGSFSSREYN   +LGSYGSMRKEKEWRRTLACKLF+ERHNSEGTEGMDSLWETYENSESK LQKKEK+NGK+
Subjt:  KEMTKDLRKITEESSISSRSESSPWSSPGSFSSREYN---SLGSYGSMRKEKEWRRTLACKLFDERHNSEGTEGMDSLWETYENSESKTLQKKEKINGKT

Query:  KKGKKIQNKKEEDEEEEEEDREGQLCCLQALKFSAGKMNLGMGRPNLLKMTKALKGFGWLSRNGSRKRLIH
         KGKKIQ K ++D+EEEE+  +GQLCCLQALKFSAGKMNLGMG+PNLLKMTKALKGFGWL+RNGSRK+LIH
Subjt:  KKGKKIQNKKEEDEEEEEEDREGQLCCLQALKFSAGKMNLGMGRPNLLKMTKALKGFGWLSRNGSRKRLIH

A0A1S4E2S0 uncharacterized protein LOC1034992171.7e-10380.88Show/hide
Query:  LRPRIPEETDGFPPMEELEAYKIVFETYTF-GSEQTPY-GDDDPEVEVEVDFQEPMEHFPEEIQILPENHLLDEIEAKTEELKEAQIENRDESQDSSKAM
        LR R PEE  GFPP+EELEAYKIVFETYTF GSEQ PY GDD+P  E+EVDFQEPME+FP+E QILP     DE EAKTEE KEAQIENR          
Subjt:  LRPRIPEETDGFPPMEELEAYKIVFETYTF-GSEQTPY-GDDDPEVEVEVDFQEPMEHFPEEIQILPENHLLDEIEAKTEELKEAQIENRDESQDSSKAM

Query:  EKEMTKDLRKITEESSISSRSESSPWSSPGSFSSREYNS---LGSYGSMRKEKEWRRTLACKLFDERHNSEGTEGMDSLWETYENSESKTLQKKEKINGK
        E EM KDLRK+ EESSISSR+ESSPWSSPGSFSSREYNS   LGSYGSMRKEKEWRRTLACKLF+ERHNSEGTEGMDSLWETYENSESK LQKKEK+NGK
Subjt:  EKEMTKDLRKITEESSISSRSESSPWSSPGSFSSREYNS---LGSYGSMRKEKEWRRTLACKLFDERHNSEGTEGMDSLWETYENSESKTLQKKEKINGK

Query:  TKKGKKIQNKKEEDEEEEEEDREGQLCCLQALKFSAGKMNLGMGRPNLLKMTKALKGFGWLSRNGSRKRLIH
         KKGKKIQ KK +D++EEEED EGQLCCLQALKFSAGKMNLGMG+PNLLKMTKALKGFGWL+RNGSRKRLIH
Subjt:  TKKGKKIQNKKEEDEEEEEEDREGQLCCLQALKFSAGKMNLGMGRPNLLKMTKALKGFGWLSRNGSRKRLIH

A0A5D3DQW2 Reverse transcriptase1.7e-10380.88Show/hide
Query:  LRPRIPEETDGFPPMEELEAYKIVFETYTF-GSEQTPY-GDDDPEVEVEVDFQEPMEHFPEEIQILPENHLLDEIEAKTEELKEAQIENRDESQDSSKAM
        LR R PEE  GFPP+EELEAYKIVFETYTF GSEQ PY GDD+P  E+EVDFQEPME+FP+E QILP     DE EAKTEE KEAQIENR          
Subjt:  LRPRIPEETDGFPPMEELEAYKIVFETYTF-GSEQTPY-GDDDPEVEVEVDFQEPMEHFPEEIQILPENHLLDEIEAKTEELKEAQIENRDESQDSSKAM

Query:  EKEMTKDLRKITEESSISSRSESSPWSSPGSFSSREYNS---LGSYGSMRKEKEWRRTLACKLFDERHNSEGTEGMDSLWETYENSESKTLQKKEKINGK
        E EM KDLRK+ EESSISSR+ESSPWSSPGSFSSREYNS   LGSYGSMRKEKEWRRTLACKLF+ERHNSEGTEGMDSLWETYENSESK LQKKEK+NGK
Subjt:  EKEMTKDLRKITEESSISSRSESSPWSSPGSFSSREYNS---LGSYGSMRKEKEWRRTLACKLFDERHNSEGTEGMDSLWETYENSESKTLQKKEKINGK

Query:  TKKGKKIQNKKEEDEEEEEEDREGQLCCLQALKFSAGKMNLGMGRPNLLKMTKALKGFGWLSRNGSRKRLIH
         KKGKKIQ KK +D++EEEED EGQLCCLQALKFSAGKMNLGMG+PNLLKMTKALKGFGWL+RNGSRKRLIH
Subjt:  TKKGKKIQNKKEEDEEEEEEDREGQLCCLQALKFSAGKMNLGMGRPNLLKMTKALKGFGWLSRNGSRKRLIH

A0A6J1ES68 uncharacterized protein LOC111437254 isoform X12.2e-8251.51Show/hide
Query:  LRPRIPEETDGFPPMEELEAYKIVFETYTFGSEQTPYGDDD-----PEVEVEVDF---------------------------------------------
        LRPR PEE D F PMEELEAYKIVFETYTFG+EQ  YG +D     PEVEVEVD+                                             
Subjt:  LRPRIPEETDGFPPMEELEAYKIVFETYTFGSEQTPYGDDD-----PEVEVEVDF---------------------------------------------

Query:  ---------------------------------------------------------------------------------QEPMEHFPEEIQILPENHL
                                                                                         +E ME+FP +++  PEN L
Subjt:  ---------------------------------------------------------------------------------QEPMEHFPEEIQILPENHL

Query:  LDEIEAKTEELKEAQIENRDESQDSSKAMEKEMTKDLRKITEESSISSRSESSPWSSPGSFSSREYNSLGSYGSMRKEKEWRRTLACKLFDERHNSEGTE
          E EAKT E +E   E ++E +DSSKA+E EM KDLRK+TEESSISSRSESSPWSSPGSF SR+Y SLGSYGS+RKEKEWRRTLACKLF+ERHNSEGTE
Subjt:  LDEIEAKTEELKEAQIENRDESQDSSKAMEKEMTKDLRKITEESSISSRSESSPWSSPGSFSSREYNSLGSYGSMRKEKEWRRTLACKLFDERHNSEGTE

Query:  GMDSLWETYENSESKTLQKKEKINGKTKKGKKIQNKKEEDEEEEEEDREGQLCCLQALKFSAGKMNLGMGRPNLLKMTKALKGFGWLSRNGSRKRLIH
        GMDSLWETYE SE   LQK EKINGK+KKGKKI+ K+E+++E+E EDREGQLCCLQALKFSAGKMNLGMGRPNL+KM+KALKGFGWLSR+GSRKR +H
Subjt:  GMDSLWETYENSESKTLQKKEKINGKTKKGKKIQNKKEEDEEEEEEDREGQLCCLQALKFSAGKMNLGMGRPNLLKMTKALKGFGWLSRNGSRKRLIH

A0A6J1HI04 uncharacterized protein LOC1114638236.8e-8471.38Show/hide
Query:  LRPRIPEETDGFPPMEELEAYKIVFETYTFGSEQTPYGDDDPEVEVEVDFQEPMEHFPEEIQILPENHL-LDEIEAKTEELKEAQIENRDESQDSSKAME
        LRPR P+E+    PMEELEAYKIVFE YTFGSEQ PY     EVEVEVDFQEPMEHFPE+I+ LPEN L + E+EAKTEEL+EA+ ENRDE        +
Subjt:  LRPRIPEETDGFPPMEELEAYKIVFETYTFGSEQTPYGDDDPEVEVEVDFQEPMEHFPEEIQILPENHL-LDEIEAKTEELKEAQIENRDESQDSSKAME

Query:  KEMTKDLRKITEESSISSRSESS-PWSSPGSFSSREYNSLGSYGSMRKEKEWRRTLACKLFDERHNSEGTEGMDSLWETYENSESKTLQKKEKINGKTKK
         E+  DL+    ESS SSRSESS PWSSPGSF  R+Y+SLGSYGSMRKEKEWRRTLACKLF+ERH+SE TEGMDSLWETYE        KKEK N K   
Subjt:  KEMTKDLRKITEESSISSRSESS-PWSSPGSFSSREYNSLGSYGSMRKEKEWRRTLACKLFDERHNSEGTEGMDSLWETYENSESKTLQKKEKINGKTKK

Query:  GKKIQNKKEEDEEEEEEDREGQLCCLQALKFSAGKMNLGMGRPNLLKMTKALKGFGWLSRNGSRKRLIH
             +KKEE+EEEEEE+ EGQLCCLQALKFSAGKMNLGM RPNL+KMTKALKGFGWLSR GSRKRLIH
Subjt:  GKKIQNKKEEDEEEEEEDREGQLCCLQALKFSAGKMNLGMGRPNLLKMTKALKGFGWLSRNGSRKRLIH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G25130.1 unknown protein9.6e-3037.85Show/hide
Query:  LRPRIPEETDGFPPMEELEAYKIVFETYTFG--------SEQTPYGDDDPEVEVEVDFQEPMEHFPEEIQILP---ENHLLDEIEAKTEELKEAQIENRD
        L  +     +GF  +EELEAYK+V E  +          S++  + D     E  V      E   E+++I P   E+ ++ E E +T++ ++ ++E + 
Subjt:  LRPRIPEETDGFPPMEELEAYKIVFETYTFG--------SEQTPYGDDDPEVEVEVDFQEPMEHFPEEIQILP---ENHLLDEIEAKTEELKEAQIENRD

Query:  ESQDSSKAME--KEMTKDLRKITEESSI-SSRSESSPWSSPGSF-------------SSREYN-SLGSYGSMRKEKEWRRTLACKLFDERHNSEGTEGMD
            S   ++  +E TK+  K  +   +  S +ES       +F             +  E N SL S+GSMRKEKEWRRTLACKLF+ERHN++  +GMD
Subjt:  ESQDSSKAME--KEMTKDLRKITEESSI-SSRSESSPWSSPGSF-------------SSREYN-SLGSYGSMRKEKEWRRTLACKLFDERHNSEGTEGMD

Query:  SLWETYENSESK---TLQKKEKINGKTK---KGKKIQNKKEEDEEEEEEDREGQLCCLQALKFSAGKMNLGMGRPNLLKMTKALKGFG
         LWETYE    K   T ++K+K+  KTK   K K I+ +   +EE+++     QLCCLQALKFS GKM+LG+ RPNLLK++KA KG G
Subjt:  SLWETYENSESK---TLQKKEKINGKTK---KGKKIQNKKEEDEEEEEEDREGQLCCLQALKFSAGKMNLGMGRPNLLKMTKALKGFG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTTGAGACCAAGAATACCAGAAGAAACCGATGGATTTCCCCCCATGGAAGAGCTTGAAGCTTATAAAATCGTGTTTGAGACTTACACTTTTGGCTCTGAACAAAC
CCCATATGGCGATGATGACCCAGAAGTGGAAGTGGAAGTCGATTTTCAAGAACCCATGGAGCATTTTCCCGAGGAAATCCAAATTCTCCCAGAAAATCATCTGCTCGATG
AAATTGAAGCAAAAACAGAGGAATTAAAAGAAGCCCAAATCGAAAACAGAGACGAAAGCCAAGACTCATCGAAGGCGATGGAGAAGGAAATGACGAAAGATTTGAGAAAA
ATCACAGAAGAATCATCGATTTCTTCAAGATCAGAATCGAGTCCATGGAGTTCACCAGGGAGTTTCAGCAGTAGAGAGTATAATTCATTAGGAAGCTATGGATCGATGAG
GAAGGAGAAAGAATGGCGAAGAACACTCGCTTGTAAGCTCTTTGACGAGCGGCATAATTCAGAGGGAACAGAAGGAATGGATTCGCTATGGGAAACATACGAGAATAGTG
AATCAAAGACGTTGCAGAAGAAAGAGAAAATCAATGGAAAAACGAAGAAAGGAAAGAAAATTCAAAACAAAAAAGAAGAAGATGAAGAAGAAGAAGAAGAAGATAGAGAA
GGGCAACTTTGCTGTTTACAAGCACTGAAATTCTCAGCAGGGAAGATGAATTTGGGAATGGGAAGACCAAATCTTTTGAAAATGACTAAAGCTTTGAAGGGATTTGGATG
GTTGAGCAGAAATGGAAGTAGAAAGAGATTGATCCATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGTTTGAGACCAAGAATACCAGAAGAAACCGATGGATTTCCCCCCATGGAAGAGCTTGAAGCTTATAAAATCGTGTTTGAGACTTACACTTTTGGCTCTGAACAAAC
CCCATATGGCGATGATGACCCAGAAGTGGAAGTGGAAGTCGATTTTCAAGAACCCATGGAGCATTTTCCCGAGGAAATCCAAATTCTCCCAGAAAATCATCTGCTCGATG
AAATTGAAGCAAAAACAGAGGAATTAAAAGAAGCCCAAATCGAAAACAGAGACGAAAGCCAAGACTCATCGAAGGCGATGGAGAAGGAAATGACGAAAGATTTGAGAAAA
ATCACAGAAGAATCATCGATTTCTTCAAGATCAGAATCGAGTCCATGGAGTTCACCAGGGAGTTTCAGCAGTAGAGAGTATAATTCATTAGGAAGCTATGGATCGATGAG
GAAGGAGAAAGAATGGCGAAGAACACTCGCTTGTAAGCTCTTTGACGAGCGGCATAATTCAGAGGGAACAGAAGGAATGGATTCGCTATGGGAAACATACGAGAATAGTG
AATCAAAGACGTTGCAGAAGAAAGAGAAAATCAATGGAAAAACGAAGAAAGGAAAGAAAATTCAAAACAAAAAAGAAGAAGATGAAGAAGAAGAAGAAGAAGATAGAGAA
GGGCAACTTTGCTGTTTACAAGCACTGAAATTCTCAGCAGGGAAGATGAATTTGGGAATGGGAAGACCAAATCTTTTGAAAATGACTAAAGCTTTGAAGGGATTTGGATG
GTTGAGCAGAAATGGAAGTAGAAAGAGATTGATCCATTGATGAATTGTTTTGGGTTTTTCTTTTTCTTTTGGTTCTTCATTTTACAGTTCTTCTATGTTCTTCCTCATCC
TAATTCTCCATTTTTTTTTTCTTTGTTGGTTTGGTTTTTTCTCTTTGATTAAACTGTTCAAATAGTGAAAGTAAATATTTTTGTTATGGTTTGCTAATTGGAGATCTATC
TAATCTTAACCCATTTTACTTTCTTCTTTCAA
Protein sequenceShow/hide protein sequence
MGLRPRIPEETDGFPPMEELEAYKIVFETYTFGSEQTPYGDDDPEVEVEVDFQEPMEHFPEEIQILPENHLLDEIEAKTEELKEAQIENRDESQDSSKAMEKEMTKDLRK
ITEESSISSRSESSPWSSPGSFSSREYNSLGSYGSMRKEKEWRRTLACKLFDERHNSEGTEGMDSLWETYENSESKTLQKKEKINGKTKKGKKIQNKKEEDEEEEEEDRE
GQLCCLQALKFSAGKMNLGMGRPNLLKMTKALKGFGWLSRNGSRKRLIH