; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0001357 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0001357
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionDNA-directed DNA polymerase
Genome locationchr4:30132438..30136981
RNA-Seq ExpressionLag0001357
SyntenyLag0001357
Gene Ontology termsNA
InterPro domainsIPR004332 - Transposase, MuDR, plant
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_017227899.1 PREDICTED: uncharacterized protein LOC108203467 [Daucus carota subsp. sativus]1.1e-7044.29Show/hide
Query:  LGHSLLILLPKSRIAVGQLANEIKARPQGKLPSDIEHP--------RREELESGKGARGSNNDAGASDSV-----------PDVEPPYVPPP-------P
        L HS    L      VGQLANE++ RP G L SD E P        +   L+SGK    +  +A   DSV            + E   V PP        
Subjt:  LGHSLLILLPKSRIAVGQLANEIKARPQGKLPSDIEHP--------RREELESGKGARGSNNDAGASDSV-----------PDVEPPYVPPP-------P

Query:  YVPPLPLPQRQRPKNRDGQFKKFLEIPKQFHINIPLVEAIEQMPNYAKFLKDIVTKKKRLGEFEIVSLTEECSAILKNGLPTKAKDPGSFTIPVLIGGKD
          P  P PQR + + +D QFKKFL++ KQ HINIPLVEA+EQMPNY KF+KDI+TKK+RLGEFE V+LT+ECS+ L++ LPTK KDPGSFTIP  IG   
Subjt:  YVPPLPLPQRQRPKNRDGQFKKFLEIPKQFHINIPLVEAIEQMPNYAKFLKDIVTKKKRLGEFEIVSLTEECSAILKNGLPTKAKDPGSFTIPVLIGGKD

Query:  LGRALCDLGASINLMPLSVYRKLGIGDVRPTTTTLQLADRSITYPE--------------------------DKDVPIILGRPFLATGRTLIDVPKGIL-
         G ALCDLGASINLMP+SV+RKLGIG+VRPTT TLQLADRS+ +PE                          D++VPIILGRPFLATGRTLIDV  G L 
Subjt:  LGRALCDLGASINLMPLSVYRKLGIGDVRPTTTTLQLADRSITYPE--------------------------DKDVPIILGRPFLATGRTLIDVPKGIL-

Query:  ----------------------ENTIVETTMEDLTNEHLEDHGK--------ISIEDLEVCSLE----RKSEKEVFRCEDVFESLDLNERMVPPMKPSLI
                              E+    T  ++L ++ L+ +          +++   E  ++E     ++  +V R    FESLDL+ R     K S+ 
Subjt:  ----------------------ENTIVETTMEDLTNEHLEDHGK--------ISIEDLEVCSLE----RKSEKEVFRCEDVFESLDLNERMVPPMKPSLI

Query:  EAPTLDLKPLPDHLKYVYLG
        E P L+LK LP HLKY YLG
Subjt:  EAPTLDLKPLPDHLKYVYLG

XP_023522102.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111785979 [Cucurbita pepo subsp. pepo]9.9e-7243.29Show/hide
Query:  IAVGQLANEIKARPQGKLPSDIEHPRRE--------ELESGK--GARG------------------------------SNNDAGAS-------DSVPDVE
        + VGQLANE++ RP  KLP+D E P+RE        EL SGK   +RG                              + NDA A+       +    V+
Subjt:  IAVGQLANEIKARPQGKLPSDIEHPRRE--------ELESGK--GARG------------------------------SNNDAGAS-------DSVPDVE

Query:  PPYVPPPP--------YVPPLPLPQRQRPKNRDGQFKKFLEIPKQFHINIPLVEAIEQMPNYAKFLKDIVTKKKRLGEFEIVSLTEECSAILKNGLPTKA
        PP              Y P  P PQR + K  +  F+KF++I K+ HINIPLVEA++QMPNY KFLKD++T +++  EF++V L EECSAILKN +P K 
Subjt:  PPYVPPPP--------YVPPLPLPQRQRPKNRDGQFKKFLEIPKQFHINIPLVEAIEQMPNYAKFLKDIVTKKKRLGEFEIVSLTEECSAILKNGLPTKA

Query:  KDPGSFTIPVLIGGKDLGRALCDLGASINLMPLSVYRKLGIGDVRPTTTTLQLADRSITYPE--------------------------DKDVPIILGRPF
        KDPGSFTIP+ IGGK LGRALCDLG+SINLMPLS+Y+KLGIG+ RPTT TLQLADRS TYPE                          D DVPIILGRPF
Subjt:  KDPGSFTIPVLIGGKDLGRALCDLGASINLMPLSVYRKLGIGDVRPTTTTLQLADRSITYPE--------------------------DKDVPIILGRPF

Query:  LATGRTLIDVPKGILENTIVETTMEDLTNEHLE-----DHGKISIEDLEVCSLERKSEKEVFRCED------------------VFESLDLNERMVPPMK
        L TGRTL+DV KG +   + +  +E   N+ ++     +      E  E  + E   + E  + ED                   FESL+   R   PM+
Subjt:  LATGRTLIDVPKGILENTIVETTMEDLTNEHLE-----DHGKISIEDLEVCSLERKSEKEVFRCED------------------VFESLDLNERMVPPMK

Query:  PSLIEAPTLDLKPLPDHLKYVYLGE
        PS+ EAP LDLKPLP +LKY YLG+
Subjt:  PSLIEAPTLDLKPLPDHLKYVYLGE

XP_024028757.1 uncharacterized protein LOC112093792 [Morus notabilis]2.6e-7243.6Show/hide
Query:  VGQLANEIKARPQGKLPSDIEHPRRE------------ELESGKGARGSNNDAGASDSVPDVEPPYVPPP--------------------PYVPPLPLPQ
        VGQLAN +  RPQG LPSD ++PRR+             L++G+          A++           PP                    P  PP P PQ
Subjt:  VGQLANEIKARPQGKLPSDIEHPRRE------------ELESGKGARGSNNDAGASDSVPDVEPPYVPPP--------------------PYVPPLPLPQ

Query:  RQRPKNRDGQFKKFLEIPKQFHINIPLVEAIEQMPNYAKFLKDIVTKKKRLGEFEIVSLTEECSAILKNGLPTKAKDPGSFTIPVLIGGKDLGRALCDLG
        R + + +D QF++FL++ KQ HINIPLVEA+EQMP+Y KF+KDI+TKK+RLGEFE V+LTEECSAILKN LP K KDPGSFTIP  IG + +G+ALCDLG
Subjt:  RQRPKNRDGQFKKFLEIPKQFHINIPLVEAIEQMPNYAKFLKDIVTKKKRLGEFEIVSLTEECSAILKNGLPTKAKDPGSFTIPVLIGGKDLGRALCDLG

Query:  ASINLMPLSVYRKLGIGDVRPTTTTLQLADRSITYPE--------------------------DKDVPIILGRPFLATGRTLIDVPKGIL----------
        ASINLMP+S++RKLGIG+V PTT TLQLADRS  +PE                          DK+VPIILGRPFLATG+TLIDV KG L          
Subjt:  ASINLMPLSVYRKLGIGDVRPTTTTLQLADRSITYPE--------------------------DKDVPIILGRPFLATGRTLIDVPKGIL----------

Query:  ------------------------------ENTIVETTM--EDLTNEHL-EDHGKISIEDLEVCSLERKSEKEVFRCEDVFESLDLNERMVPPMKPSLIE
                                      E T  E  M  EDL +  + ED+    +  LE      KS +        FESLDL+   +   KPS+ E
Subjt:  ------------------------------ENTIVETTM--EDLTNEHL-EDHGKISIEDLEVCSLERKSEKEVFRCEDVFESLDLNERMVPPMKPSLIE

Query:  APTLDLKPLPDHLKYVYLGEGD
         P L+L+PLP HL+Y YLG+ D
Subjt:  APTLDLKPLPDHLKYVYLGEGD

XP_030502183.1 uncharacterized protein LOC115717351 [Cannabis sativa]1.1e-7051.24Show/hide
Query:  PPLPLPQRQRPKNRDGQFKKFLEIPKQFHINIPLVEAIEQMPNYAKFLKDIVTKKKRLGEFEIVSLTEECSAILKNGLPTKAKDPGSFTIPVLIGGKDLG
        PPLP PQR      DGQFKKFL++ KQ HINIPLVEA+EQM NY KFLKDI+TKK+RLGEFE V+LTE CSA+LK+ +P K KDPGSFTIP  IGG+D+G
Subjt:  PPLPLPQRQRPKNRDGQFKKFLEIPKQFHINIPLVEAIEQMPNYAKFLKDIVTKKKRLGEFEIVSLTEECSAILKNGLPTKAKDPGSFTIPVLIGGKDLG

Query:  RALCDLGASINLMPLSVYRKLGIGDVRPTTTTLQLADRSITYPE--------------------------DKDVPIILGRPFLATGRTLIDVPKGILENT
        RALCDLGASINLMP+S+++KLGIG+ RPTT TLQLADRS+ +PE                          D+DVPIILGR FLATGRTLIDV    L   
Subjt:  RALCDLGASINLMPLSVYRKLGIGDVRPTTTTLQLADRSITYPE--------------------------DKDVPIILGRPFLATGRTLIDVPKGILENT

Query:  IVETTME-DLTN-----EHLEDHGKISIED--------LEVC----------SLERKSEKE------------VFRCEDVFESLDLNERMVPPMKPSLIE
        + +  +  ++ N     + +E+  +IS+ D         E C           LE  SE E            + + +  FESL+L E    P KPS  E
Subjt:  IVETTME-DLTN-----EHLEDHGKISIED--------LEVC----------SLERKSEKE------------VFRCEDVFESLDLNERMVPPMKPSLIE

Query:  APTLDLKPLPDHLKYVYLGEGD
         P L+LKPLP HLKY YLGE D
Subjt:  APTLDLKPLPDHLKYVYLGEGD

XP_030509265.1 uncharacterized protein LOC115723943 [Cannabis sativa]6.0e-7746.91Show/hide
Query:  IAVGQLANEIKARPQGKLPSDIEHPRRE--------ELESGKGARGSNNDAGAS-----------------DSVPDVEP-----------PYVPPPPYVP
        + +G LANE+KARPQG LPSD E+PRR+         L SGK  + S  +   S                   + D  P               P    P
Subjt:  IAVGQLANEIKARPQGKLPSDIEHPRRE--------ELESGKGARGSNNDAGAS-----------------DSVPDVEP-----------PYVPPPPYVP

Query:  PLPLPQRQRPKNRDGQFKKFLEIPKQFHINIPLVEAIEQMPNYAKFLKDIVTKKKRLGEFEIVSLTEECSAILKNGLPTKAKDPGSFTIPVLIGGKDLGR
        PLP PQR R + +DGQFKKFL++ KQ HINIPLVEA+EQMPNY KFLKDI+TKK+RLGEFE V+LTE CSA+LK+ +P K KDPGSFTIP  IGG+D+GR
Subjt:  PLPLPQRQRPKNRDGQFKKFLEIPKQFHINIPLVEAIEQMPNYAKFLKDIVTKKKRLGEFEIVSLTEECSAILKNGLPTKAKDPGSFTIPVLIGGKDLGR

Query:  ALCDLGASINLMPLSVYRKLGIGDVRPTTTTLQLADRSITYPE--------------------------DKDVPIILGRPFLATGRTLIDVPKGILE---
        ALCDLGASINLMP+S+++KLGIG+ RPTT TLQLADRS+ +PE                          D+DVPIILGRPFLATGRTLIDV  G L    
Subjt:  ALCDLGASINLMPLSVYRKLGIGDVRPTTTTLQLADRSITYPE--------------------------DKDVPIILGRPFLATGRTLIDVPKGILE---

Query:  ----------NTIVETTMEDLTNEHLEDHGKISIEDLEVCSLERKSEKEVFRCEDV---------FESLDLNERMVPPMKPSLIEAPTLDLKPLPDHLKY
                  + I     E    E   D   IS  D E+  L    + +V   E +         FESL+L E    P KPS+ E P L+LKPLP H   
Subjt:  ----------NTIVETTMEDLTNEHLEDHGKISIEDLEVCSLERKSEKEVFRCEDV---------FESLDLNERMVPPMKPSLIEAPTLDLKPLPDHLKY

Query:  VYLGE
           GE
Subjt:  VYLGE

TrEMBL top hitse value%identityAlignment
A0A2G9GK35 Reverse transcriptase1.4e-6643.07Show/hide
Query:  VGQLANEIKARPQGKLPSDIE-HPRRE--------ELESGK--------GARGSNNDAGASDSVPDVEPPYVPPPPYVPPLPLPQRQRPKNRDGQFKKFL
        +GQLAN I +RPQG LPS+ E +PR++         L +G+          +    +  + +   +VE P     P     P PQR + +  + QF KFL
Subjt:  VGQLANEIKARPQGKLPSDIE-HPRRE--------ELESGK--------GARGSNNDAGASDSVPDVEPPYVPPPPYVPPLPLPQRQRPKNRDGQFKKFL

Query:  EIPKQFHINIPLVEAIEQMPNYAKFLKDIVTKKKRLGEFEIVSLTEECSAILKNGLPTKAKDPGSFTIPVLIGGKDLGRALCDLGASINLMPLSVYRKLG
        E+ K+ HINIP  EA+EQMP+Y KF+KDI++KK+RLG++E V+LTEECSAI++N LP K KDPGSFTIP  IG    GRALCDLGASINLMP S+YR LG
Subjt:  EIPKQFHINIPLVEAIEQMPNYAKFLKDIVTKKKRLGEFEIVSLTEECSAILKNGLPTKAKDPGSFTIPVLIGGKDLGRALCDLGASINLMPLSVYRKLG

Query:  IGDVRPTTTTLQLADRSITYPE--------------------------DKDVPIILGRPFLATGRTLIDVPKGIL-----------------------EN
        +G+ +PT+ TLQLADRS+TYP+                          D +VPIILGRPFLATGRTLIDV KG L                       + 
Subjt:  IGDVRPTTTTLQLADRSITYPE--------------------------DKDVPIILGRPFLATGRTLIDVPKGIL-----------------------EN

Query:  TIVETTMEDLT-NEHLEDHGKISIED--LEVCSLERKSEKEVFRCEDV---FES--LDLNERMVPP--MKPSLIEAPTLDLKPLPDHLKYVYLGEGD
               ++L  NE + +     +E   L++   E + + EV +  D    F+S  ++  ER+ P   +KPS+ E PTL+LKPLP HL Y YLGE D
Subjt:  TIVETTMEDLT-NEHLEDHGKISIED--LEVCSLERKSEKEVFRCEDV---FES--LDLNERMVPP--MKPSLIEAPTLDLKPLPDHLKYVYLGEGD

A0A2G9HSD1 DNA-directed DNA polymerase2.3e-6644.74Show/hide
Query:  VGQLANEIKARPQGKLPSDIEHPRREELESGKGARGSNNDAGASDSVPDVEPPYVPPPPYVPPLPLPQRQRPKNRDGQFKKFLEIPKQFHINIPLVEAIE
        +GQLAN I +RPQG LPS+ E   R++           N+  + +   ++E P     P     P PQR + +  + QF KFLE+ K+ HINIP  EA+E
Subjt:  VGQLANEIKARPQGKLPSDIEHPRREELESGKGARGSNNDAGASDSVPDVEPPYVPPPPYVPPLPLPQRQRPKNRDGQFKKFLEIPKQFHINIPLVEAIE

Query:  QMPNYAKFLKDIVTKKKRLGEFEIVSLTEECSAILKNGLPTKAKDPGSFTIPVLIGGKDLGRALCDLGASINLMPLSVYRKLGIGDVRPTTTTLQLADRS
        QMP+Y KF+KDI++KK+RLG++E V LTEECSAI++N LP K KDPGSFTIP  IG    GRALCDL ASINLMP S+YR LG+G+ +PT+ TLQLADRS
Subjt:  QMPNYAKFLKDIVTKKKRLGEFEIVSLTEECSAILKNGLPTKAKDPGSFTIPVLIGGKDLGRALCDLGASINLMPLSVYRKLGIGDVRPTTTTLQLADRS

Query:  ITYPE--------------------------DKDVPIILGRPFLATGRTLIDVPKG----------ILENTIVETTMEDLTNE----HLEDH--GKISIE
        +TYP+                          D +VPIILGRPFLATGRTLIDV KG          I+ N        + ++E    +L D+  G  SI 
Subjt:  ITYPE--------------------------DKDVPIILGRPFLATGRTLIDVPKG----------ILENTIVETTMEDLTNE----HLEDH--GKISIE

Query:  D----------LEVCSLERKSEKEVFRCEDV---FES--LDLNERMVPP--MKPSLIEAPTLDLKPLPDHLKYVYLGEGD
        +          L++   E + ++EV +  D    F+S  ++  ER  P   +KPS+ E PTL LKPLP HL YVYLG+ D
Subjt:  D----------LEVCSLERKSEKEVFRCEDV---FES--LDLNERMVPP--MKPSLIEAPTLDLKPLPDHLKYVYLGEGD

A0A2G9HYA0 Reverse transcriptase5.1e-6643.07Show/hide
Query:  VGQLANEIKARPQGKLPSDIE-HPRRE--------ELESGK--------GARGSNNDAGASDSVPDVEPPYVPPPPYVPPLPLPQRQRPKNRDGQFKKFL
        +GQLAN I +RPQG LPS+ E +PR++         L +G+          +    +  + +   +VE P     P     P PQR + +  + QF KFL
Subjt:  VGQLANEIKARPQGKLPSDIE-HPRRE--------ELESGK--------GARGSNNDAGASDSVPDVEPPYVPPPPYVPPLPLPQRQRPKNRDGQFKKFL

Query:  EIPKQFHINIPLVEAIEQMPNYAKFLKDIVTKKKRLGEFEIVSLTEECSAILKNGLPTKAKDPGSFTIPVLIGGKDLGRALCDLGASINLMPLSVYRKLG
        E+ K+ HINIP  EA+EQMP+Y KF+KDI++KK+RLG++E V+LTEECSAI++N LP K KDPGSFTIP  IG    GRALCDLGASINLMP S+YR LG
Subjt:  EIPKQFHINIPLVEAIEQMPNYAKFLKDIVTKKKRLGEFEIVSLTEECSAILKNGLPTKAKDPGSFTIPVLIGGKDLGRALCDLGASINLMPLSVYRKLG

Query:  IGDVRPTTTTLQLADRSITYPE--------------------------DKDVPIILGRPFLATGRTLIDVPKGILENTIVETTME-------DLTNEHLE
        +G+ +PT+ TLQLADRS+TYP+                          D +VPIILGRPFLATGRTLIDV KG L   + +  +           NE  E
Subjt:  IGDVRPTTTTLQLADRSITYPE--------------------------DKDVPIILGRPFLATGRTLIDVPKGILENTIVETTME-------DLTNEHLE

Query:  DH---------GKISIED----------LEVCSLERKSEKEVFRCEDVFESL-----DLNERMVPP--MKPSLIEAPTLDLKPLPDHLKYVYLGEGD
                   G  SI +          L++   E + + EV +  D  + L     +  ER  P   +KPS+ + PTL+LKPLP HL Y YLGE D
Subjt:  DH---------GKISIED----------LEVCSLERKSEKEVFRCEDVFESL-----DLNERMVPP--MKPSLIEAPTLDLKPLPDHLKYVYLGEGD

A0A2G9HYD8 Reverse transcriptase3.0e-6643.07Show/hide
Query:  VGQLANEIKARPQGKLPSDIE-HPRRE--------ELESG--------KGARGSNNDAGASDSVPDVEPPYVPPPPYVPPLPLPQRQRPKNRDGQFKKFL
        +GQLAN I +RPQG LPS+ E +PR++         L +G        K  +    +  + +   +VE P     P     P PQ+ + +  + QF KFL
Subjt:  VGQLANEIKARPQGKLPSDIE-HPRRE--------ELESG--------KGARGSNNDAGASDSVPDVEPPYVPPPPYVPPLPLPQRQRPKNRDGQFKKFL

Query:  EIPKQFHINIPLVEAIEQMPNYAKFLKDIVTKKKRLGEFEIVSLTEECSAILKNGLPTKAKDPGSFTIPVLIGGKDLGRALCDLGASINLMPLSVYRKLG
        E+ K+ HINIP  EA+EQMP+Y KF+KDI++KK+RLG++E  +LTEEC+AI++N LP K KDPGSFTIP  IG    GRALCDLGASINLMP S+YR LG
Subjt:  EIPKQFHINIPLVEAIEQMPNYAKFLKDIVTKKKRLGEFEIVSLTEECSAILKNGLPTKAKDPGSFTIPVLIGGKDLGRALCDLGASINLMPLSVYRKLG

Query:  IGDVRPTTTTLQLADRSITYPE--------------------------DKDVPIILGRPFLATGRTLIDVPKGILENTIVETTM--------------ED
        +G+ +PT+ TLQLADRS+TYP+                          D +VPIILGRPFLATGRTLIDV KG L   + +  +              ++
Subjt:  IGDVRPTTTTLQLADRSITYPE--------------------------DKDVPIILGRPFLATGRTLIDVPKGILENTIVETTM--------------ED

Query:  LTNEHLEDH--GKISIEDLEVCSLER----------KSEKEVFR---CEDVFES--LDLNERMVPP--MKPSLIEAPTLDLKPLPDHLKYVYLGEGD
          +  L D+  G  SI +  + SLER          + + EV +       F+S  ++  ER  P   +KPS+ + PTL+LKPLP+HL YVYLGE D
Subjt:  LTNEHLEDH--GKISIEDLEVCSLER----------KSEKEVFR---CEDVFES--LDLNERMVPP--MKPSLIEAPTLDLKPLPDHLKYVYLGEGD

A0A6J1DV77 uncharacterized protein LOC1110238183.2e-6853.2Show/hide
Query:  VPPPPYVPPLPLPQRQRPKNRDGQFKKFLEIPKQFHINIPLVEAIEQMPNYAKFLKDIVTKKKRLGEFEIVSLTEECSAILKNGLPTKAKDPGSFTIPVL
        +P    VPP P PQR + KN+D QF +FLE+ KQ HINIPL+EA+EQMPNY KFLKDI+ KK+RLGEFEIV+LT+E SAIL   LP K  DPGSFTIPVL
Subjt:  VPPPPYVPPLPLPQRQRPKNRDGQFKKFLEIPKQFHINIPLVEAIEQMPNYAKFLKDIVTKKKRLGEFEIVSLTEECSAILKNGLPTKAKDPGSFTIPVL

Query:  IGGKDLGRALCDLGASINLMPLSVYRKLGIGDVRPTTTTLQLADRSITYPE--------------------------DKDVPIILGRPFLATGRTLIDVP
        IGGK++G ALCDLGASINLMPLSVY+KLGIG+ RP T TLQLADRSITY E                          DK++PIILGRPFL+TGR LIDV 
Subjt:  IGGKDLGRALCDLGASINLMPLSVYRKLGIGDVRPTTTTLQLADRSITYPE--------------------------DKDVPIILGRPFLATGRTLIDVP

Query:  KGILENTIVETTME-DLTNE-----HLEDHGKISIEDLEVCSLERKSEKEVFRCEDVFESLDLNERMVPPMKPSLIEAPTLDLKPLPDHLKYVYLGE
         G L   + +  +   + N       +E+   + I D ++ S E ++E+ + + ED    + + +R+  P++PS+++AP L+LK LP HLKY YLGE
Subjt:  KGILENTIVETTME-DLTNE-----HLEDHGKISIEDLEVCSLERKSEKEVFRCEDVFESLDLNERMVPPMKPSLIEAPTLDLKPLPDHLKYVYLGE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTTTCGAGAGAGCTTGTTCCTTGGCTGGACTGATTGACAGAATTCTTTTGTTGGGGCATAGTCTGCTGATTTTGCTTCCCAAATCCAGAATTGCAGTAGGCCAACT
AGCTAATGAGATAAAGGCAAGGCCTCAAGGGAAACTTCCTTCAGATATTGAGCACCCTAGAAGGGAAGAGTTGGAGTCTGGTAAAGGTGCTAGAGGCAGCAATAATGATG
CTGGAGCATCTGATTCTGTTCCAGATGTGGAACCACCTTATGTGCCGCCCCCACCTTATGTACCACCTCTACCTCTTCCACAAAGGCAAAGGCCTAAGAATAGGGATGGT
CAATTCAAGAAATTTTTAGAAATTCCTAAGCAATTCCATATAAATATCCCTTTAGTAGAAGCTATTGAGCAAATGCCTAATTATGCTAAATTTCTCAAGGATATTGTAAC
TAAAAAGAAGAGGTTAGGTGAATTTGAAATTGTGTCTCTTACTGAGGAATGTAGTGCTATTCTTAAGAATGGGCTACCAACCAAGGCTAAGGATCCAGGGTCATTCACTA
TTCCTGTCTTAATAGGTGGAAAGGACTTAGGAAGAGCACTTTGTGACTTAGGCGCAAGTATTAACCTTATGCCTCTTTCGGTTTATCGAAAGTTAGGTATTGGTGACGTT
AGGCCTACCACAACCACACTCCAATTAGCTGATAGGTCTATCACTTATCCAGAAGATAAAGATGTCCCAATTATTCTTGGTCGTCCATTTTTGGCTACTGGTAGAACATT
GATAGATGTTCCAAAAGGGATATTGGAGAACACAATTGTTGAGACAACAATGGAGGATTTGACAAACGAGCATTTGGAAGATCATGGAAAGATTAGTATAGAAGATTTAG
AAGTTTGTTCTTTAGAAAGAAAAAGTGAAAAAGAAGTGTTTAGGTGTGAGGATGTTTTTGAGTCTTTAGATTTGAATGAAAGGATGGTTCCTCCTATGAAGCCATCCCTG
ATTGAGGCACCCACTTTAGATTTGAAGCCCTTGCCGGATCATCTAAAGTATGTGTATCTTGGGGAGGGTGATGAAGGAGTCTACAGTGGGTTCAAGTCCGGCATTGTGAC
ATGCCCATGTGATTCTAACCTCTCCCAACTAAAGGGACTGTTGCTTAGTTGTCGACGTGCAGGGGTAGATACTAATATACACAACGGGGGAGGGGCAGTTGCAAGTTCAT
CCTGGGGGGAGAAAAATTTTGAGTTTGTCAACTCAGATGGGATTGATGCATTGACCGACATAGTATCCGTAACAGAGGGGTGCGTATTTTCATGCAAAGAACATCTCAAG
AAGGTAGTCTCTTCACTGGCTCTGAAAGGGAGTTTCCAATTCAAGACAATTAAATCCAATAGCATACAGTATACAGTGACATGCATAGACAACTCATGCCAATGTCATGA
CAAAGCATGGAGGGGTAGAGAAAAGGCGTTAAATGAGTTGAGAGGATCTCCTGAAGTGTCTTATGCTCAAATTTCGTTGTTTGCTGCCAGGTTGATCGAAAGGAATTCAG
GTACGTACACCACCCAGGAAATTGATTCAAATGACAGGTTCAAGTTCTTCTTCATGAGTATTGCAGCATCCATAAACGGGTGGAAACACTGTCTCCCAGTTATTTCGGTA
GATGGTACATCTTTGAAGAATAAATTCAGTGGCACCTTTTTGACGGCCTGCACATTTGATGGGTTGGCGGGAGGAGGTCTACACGAAGGCGATGGCTGGAGGTTGGATCG
GCGGGAGGTGGGGGTCGGCGGTCTGTACGAAGAAGGACTGGCGTTTGTGGGAAGATGGAGGGTTTTGCTTGTCTTGAAGAAGGCTGTTTGTGGGAGATGGAGGGTTTTGC
TGATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGATTTTCGAGAGAGCTTGTTCCTTGGCTGGACTGATTGACAGAATTCTTTTGTTGGGGCATAGTCTGCTGATTTTGCTTCCCAAATCCAGAATTGCAGTAGGCCAACT
AGCTAATGAGATAAAGGCAAGGCCTCAAGGGAAACTTCCTTCAGATATTGAGCACCCTAGAAGGGAAGAGTTGGAGTCTGGTAAAGGTGCTAGAGGCAGCAATAATGATG
CTGGAGCATCTGATTCTGTTCCAGATGTGGAACCACCTTATGTGCCGCCCCCACCTTATGTACCACCTCTACCTCTTCCACAAAGGCAAAGGCCTAAGAATAGGGATGGT
CAATTCAAGAAATTTTTAGAAATTCCTAAGCAATTCCATATAAATATCCCTTTAGTAGAAGCTATTGAGCAAATGCCTAATTATGCTAAATTTCTCAAGGATATTGTAAC
TAAAAAGAAGAGGTTAGGTGAATTTGAAATTGTGTCTCTTACTGAGGAATGTAGTGCTATTCTTAAGAATGGGCTACCAACCAAGGCTAAGGATCCAGGGTCATTCACTA
TTCCTGTCTTAATAGGTGGAAAGGACTTAGGAAGAGCACTTTGTGACTTAGGCGCAAGTATTAACCTTATGCCTCTTTCGGTTTATCGAAAGTTAGGTATTGGTGACGTT
AGGCCTACCACAACCACACTCCAATTAGCTGATAGGTCTATCACTTATCCAGAAGATAAAGATGTCCCAATTATTCTTGGTCGTCCATTTTTGGCTACTGGTAGAACATT
GATAGATGTTCCAAAAGGGATATTGGAGAACACAATTGTTGAGACAACAATGGAGGATTTGACAAACGAGCATTTGGAAGATCATGGAAAGATTAGTATAGAAGATTTAG
AAGTTTGTTCTTTAGAAAGAAAAAGTGAAAAAGAAGTGTTTAGGTGTGAGGATGTTTTTGAGTCTTTAGATTTGAATGAAAGGATGGTTCCTCCTATGAAGCCATCCCTG
ATTGAGGCACCCACTTTAGATTTGAAGCCCTTGCCGGATCATCTAAAGTATGTGTATCTTGGGGAGGGTGATGAAGGAGTCTACAGTGGGTTCAAGTCCGGCATTGTGAC
ATGCCCATGTGATTCTAACCTCTCCCAACTAAAGGGACTGTTGCTTAGTTGTCGACGTGCAGGGGTAGATACTAATATACACAACGGGGGAGGGGCAGTTGCAAGTTCAT
CCTGGGGGGAGAAAAATTTTGAGTTTGTCAACTCAGATGGGATTGATGCATTGACCGACATAGTATCCGTAACAGAGGGGTGCGTATTTTCATGCAAAGAACATCTCAAG
AAGGTAGTCTCTTCACTGGCTCTGAAAGGGAGTTTCCAATTCAAGACAATTAAATCCAATAGCATACAGTATACAGTGACATGCATAGACAACTCATGCCAATGTCATGA
CAAAGCATGGAGGGGTAGAGAAAAGGCGTTAAATGAGTTGAGAGGATCTCCTGAAGTGTCTTATGCTCAAATTTCGTTGTTTGCTGCCAGGTTGATCGAAAGGAATTCAG
GTACGTACACCACCCAGGAAATTGATTCAAATGACAGGTTCAAGTTCTTCTTCATGAGTATTGCAGCATCCATAAACGGGTGGAAACACTGTCTCCCAGTTATTTCGGTA
GATGGTACATCTTTGAAGAATAAATTCAGTGGCACCTTTTTGACGGCCTGCACATTTGATGGGTTGGCGGGAGGAGGTCTACACGAAGGCGATGGCTGGAGGTTGGATCG
GCGGGAGGTGGGGGTCGGCGGTCTGTACGAAGAAGGACTGGCGTTTGTGGGAAGATGGAGGGTTTTGCTTGTCTTGAAGAAGGCTGTTTGTGGGAGATGGAGGGTTTTGC
TGATTTGA
Protein sequenceShow/hide protein sequence
MIFERACSLAGLIDRILLLGHSLLILLPKSRIAVGQLANEIKARPQGKLPSDIEHPRREELESGKGARGSNNDAGASDSVPDVEPPYVPPPPYVPPLPLPQRQRPKNRDG
QFKKFLEIPKQFHINIPLVEAIEQMPNYAKFLKDIVTKKKRLGEFEIVSLTEECSAILKNGLPTKAKDPGSFTIPVLIGGKDLGRALCDLGASINLMPLSVYRKLGIGDV
RPTTTTLQLADRSITYPEDKDVPIILGRPFLATGRTLIDVPKGILENTIVETTMEDLTNEHLEDHGKISIEDLEVCSLERKSEKEVFRCEDVFESLDLNERMVPPMKPSL
IEAPTLDLKPLPDHLKYVYLGEGDEGVYSGFKSGIVTCPCDSNLSQLKGLLLSCRRAGVDTNIHNGGGAVASSSWGEKNFEFVNSDGIDALTDIVSVTEGCVFSCKEHLK
KVVSSLALKGSFQFKTIKSNSIQYTVTCIDNSCQCHDKAWRGREKALNELRGSPEVSYAQISLFAARLIERNSGTYTTQEIDSNDRFKFFFMSIAASINGWKHCLPVISV
DGTSLKNKFSGTFLTACTFDGLAGGGLHEGDGWRLDRREVGVGGLYEEGLAFVGRWRVLLVLKKAVCGRWRVLLI