; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0001515 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0001515
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionUnknown protein
Genome locationchr4:32221997..32223790
RNA-Seq ExpressionLag0001515
SyntenyLag0001515
Gene Ontology termsGO:0043457 - regulation of cellular respiration (biological process)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF1855781.1 hypothetical protein Lal_00045049 [Lupinus albus]1.4e-5435.64Show/hide
Query:  MLTLEPFSEDQGRSVVQPTRGSHQSLPCALRVYSPVNSHTCQTPWSVFQDGSNGEPTGRCQERADVEARPRARAASHDRDDDVSTGVTKAWALAAVSIRI
        ML+LEPF+EDQGRS VQPTR     LPCALR   P+     +     F+ G  G P    +     +    ARA++H+RDDDVST ++ A A A ++IR+
Subjt:  MLTLEPFSEDQGRSVVQPTRGSHQSLPCALRVYSPVNSHTCQTPWSVFQDGSNGEPTGRCQERADVEARPRARAASHDRDDDVSTGVTKAWALAAVSIRI

Query:  GPCPKSIGGPALT------------------------------------------------------PDWGCIPKQLDSLTAPRGATGSERNGALTLSGA
        GPCP+SIGGPAL                                                       PDWGCIPKQ DS TAPRGATGS  +GALTLSGA
Subjt:  GPCPKSIGGPALT------------------------------------------------------PDWGCIPKQLDSLTAPRGATGSERNGALTLSGA

Query:  PFQGTCARSAAEDASPDYNSDAEGADSQAGLFPRVVPPDLGSRREHRLWGRRLEGRHRSPRSLAGGVAQTEVATRYRGWINHRSVATTGAEDSNLSHPHD
        PFQGT ARSA EDASPDYNSD EG                      R   R  +      R   GG        R R      S+      DS  + P  
Subjt:  PFQGTCARSAAEDASPDYNSDAEGADSQAGLFPRVVPPDLGSRREHRLWGRRLEGRHRSPRSLAGGVAQTEVATRYRGWINHRSVATTGAEDSNLSHPHD

Query:  GAHGRPACARTCATNPKGVAWGAAMRDTQADVPSARRLRAQLAFKDSVVRGILQFTPSIAFRYVLHRCESRDIRCRS-------------------RCID
         A G  A   TC     G       RD      SA   + +++ + S ++      P + FR  + +    + R R+                       
Subjt:  GAHGRPACARTCATNPKGVAWGAAMRDTQADVPSARRLRAQLAFKDSVVRGILQFTPSIAFRYVLHRCESRDIRCRS-------------------RCID

Query:  NDPSAGSPTETLLRLLLPLNDKVQWTSRAGQRTAHVAAIRTLHRTVQSVGATDGVYKGQGRSQRELMDSEPIAMIYPHHDEISKITRACRP
         + + GSPTETLLRLLLPLNDKVQWTS                    +V  ++     Q        +   IAMIYPHHDEISKITRACRP
Subjt:  NDPSAGSPTETLLRLLLPLNDKVQWTSRAGQRTAHVAAIRTLHRTVQSVGATDGVYKGQGRSQRELMDSEPIAMIYPHHDEISKITRACRP

KAF7112640.1 hypothetical protein RHSIM_RhsimUnG0208700 [Rhododendron simsii]1.5e-4839.31Show/hide
Query:  PRARAASHDRDDDVSTGVTKAWALAAVSIRIGPCPKSIGGPA-------------------------------------------------------LTP
        P+A AASHDR + VSTG+T +WALA+V IR GP P+SIGGP                                                        L P
Subjt:  PRARAASHDRDDDVSTGVTKAWALAAVSIRIGPCPKSIGGPA-------------------------------------------------------LTP

Query:  DWGCIPKQLDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDAEGADSQAGLFPRVVPPDLGSRREHRLWGRRLEGRHRSPRSLAGGVA
        DWGCIPKQ DS TAPRGATGSE +GALTLSGAPFQGT ARS AEDASPDYNS+ E A                                           
Subjt:  DWGCIPKQLDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDAEGADSQAGLFPRVVPPDLGSRREHRLWGRRLEGRHRSPRSLAGGVA

Query:  QTEVATRYRGWINHRSVATTGAEDSNLSHPHDGAHGRPACARTCATNPKGVAWGAAMRDTQADVPSARRLRAQLAFKDSVVRGILQFTPSIAFRYVLHRC
              R+  W    S+A T                                               R + AQLAFKDS+V GILQFTPSIAFRYVLHRC
Subjt:  QTEVATRYRGWINHRSVATTGAEDSNLSHPHDGAHGRPACARTCATNPKGVAWGAAMRDTQADVPSARRLRAQLAFKDSVVRGILQFTPSIAFRYVLHRC

Query:  ESRDIRCRS-----------------------------------RC---IDNDPSAGSPTETLLRLLLPLNDKVQWTSR
        +SRDIRCR                                    RC    DNDPSAGSPTETLLRLLLPLNDKVQWTSR
Subjt:  ESRDIRCRS-----------------------------------RC---IDNDPSAGSPTETLLRLLLPLNDKVQWTSR

KEH17348.1 hypothetical protein MTR_0021s0160 [Medicago truncatula]1.4e-5439.29Show/hide
Query:  MLTLEPFSEDQGRSVVQPTRGSHQSLPCAL---------------RVYSPVNSHTCQTP-----WSVFQDGSNGEPTGRCQERADVEARPRARAASHD--
        +L LEPF+EDQGR  V+P R S  S    +               R Y P      + P     W      +   P+    ++     RP   A  H   
Subjt:  MLTLEPFSEDQGRSVVQPTRGSHQSLPCAL---------------RVYSPVNSHTCQTP-----WSVFQDGSNGEPTGRCQERADVEARPRARAASHD--

Query:  --RDDDVSTGVTKAWAL--AAVSIRIGPCPKSIGGPALTPDWGCIPKQLDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDAEGADSQ
          +   +   + K   +  +     IG  P    G +L PD GCIPKQ DSLTAPRGATGS  +GALTL GAPFQGT ARS AEDASPDYNSD   A   
Subjt:  --RDDDVSTGVTKAWAL--AAVSIRIGPCPKSIGGPALTPDWGCIPKQLDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDAEGADSQ

Query:  AGLFP----------RVVPPDLGSRREHRLWGRRLEGRHRSPRSLAGGVAQTEVATRYRGWINHRSVATTGAEDSNLSHPHDGAHGRPACAR---TCATN
        +   P          RVVPPDLGSR            + R+ RS+ G       +TR    I+     T              AH RP         +  
Subjt:  AGLFP----------RVVPPDLGSRREHRLWGRRLEGRHRSPRSLAGGVAQTEVATRYRGWINHRSVATTGAEDSNLSHPHDGAHGRPACAR---TCATN

Query:  PKGVAWGAAMRDTQADVPSARRLRAQLAFKDSVVRGILQFTPSIAFRYVLHRCESRDIRCRSRC------------------------------------
        PK V      RD QA VPSA   RAQLAFKDS+VRGILQFTP IAFRYVLHRCESRDIRCR  C                                    
Subjt:  PKGVAWGAAMRDTQADVPSARRLRAQLAFKDSVVRGILQFTPSIAFRYVLHRCESRDIRCRSRC------------------------------------

Query:  -----------------------IDNDPSAGSPTETLLRLLLPLNDKVQWTSR
                                DNDPSAGSPTETLLRLLLPLNDKVQWTSR
Subjt:  -----------------------IDNDPSAGSPTETLLRLLLPLNDKVQWTSR

OIV90747.1 hypothetical protein TanjilG_21878 [Lupinus angustifolius]8.3e-5243.97Show/hide
Query:  LPCALRVYSPVNSHTCQTPWSVFQDGSNGEPTGRCQERADVEARPRARAASHDRDDDVSTGVTKAWALAAVSIRIGPCPKSIGGPALTPDWGCIPKQLDS
        LPCALRVY P +SHTCQTPWSVFQDG NGEP GR  E A  EA   ARA++H+RDDDVST ++ A A A ++IR+        G  L PDWGCIPKQ DS
Subjt:  LPCALRVYSPVNSHTCQTPWSVFQDGSNGEPTGRCQERADVEARPRARAASHDRDDDVSTGVTKAWALAAVSIRIGPCPKSIGGPALTPDWGCIPKQLDS

Query:  LTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDAEGADSQAGLFPRVVPPDLGSRREHRLWGRRLEGRHRSPRSLAGGVAQTEVATRYRGW
         TAPRGATGS  +GALTLSGAPFQGT ARSAAEDASPDYNSD EG                    +   W  R    H S    AGG    +   R R  
Subjt:  LTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDAEGADSQAGLFPRVVPPDLGSRREHRLWGRRLEGRHRSPRSLAGGVAQTEVATRYRGW

Query:  INHRSVATTGAEDSNLSHPHDGAHGRPACARTCATNPKGVAWGAAMRDTQADVPSARRLRAQLAFKDSVVRGILQFTPSIAFRYVLHRCESRDIRCRS--
            S+      DS  + P   A G  A   TC     G       RD      SA   + +++ + S ++      P + FR  + +    + R R+  
Subjt:  INHRSVATTGAEDSNLSHPHDGAHGRPACARTCATNPKGVAWGAAMRDTQADVPSARRLRAQLAFKDSVVRGILQFTPSIAFRYVLHRCESRDIRCRS--

Query:  -----------------RCIDNDPSAGSPTETLLRLLLPLNDKVQWTS
                              D + GSPTETLLRLLLPLNDKVQWTS
Subjt:  -----------------RCIDNDPSAGSPTETLLRLLLPLNDKVQWTS

TXG67213.1 hypothetical protein EZV62_008488 [Acer yangbiense]5.7e-5341.46Show/hide
Query:  GMLTLEPFSEDQGRSVVQPTRGSHQSLPCALRVYSPVNSHTCQTPWSVFQDGSNGEPTGRCQERADVEARPRARAASHDRDDDVSTGVTKAWALAAVSIR
        GML+LEPFSEDQGRS VQP R     LPCALRVY PV+SHTCQTPWSVFQDG NGE  GRCQERA +   P                       +     
Subjt:  GMLTLEPFSEDQGRSVVQPTRGSHQSLPCALRVYSPVNSHTCQTPWSVFQDGSNGEPTGRCQERADVEARPRARAASHDRDDDVSTGVTKAWALAAVSIR

Query:  IGPCPKSIGGPALTPDWGCIPKQLDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDAEGADSQAGLFPRVVPPDLGSRREHRLWGRRL
        IG  P    G  L PD GCIPKQ DS TAPR A GS R+GA TLSGAPFQGT ARSAAEDASPDYNS+ E A                            
Subjt:  IGPCPKSIGGPALTPDWGCIPKQLDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDAEGADSQAGLFPRVVPPDLGSRREHRLWGRRL

Query:  EGRHRSPRSLAGGVAQTEVATRYRGWINHRSVATTGAEDSNLSHPHDGAHGRPACARTCATNPKGVAWGAAMRDTQADVPSARRLRAQLAFKDSVVRGIL
                             R+  W    S+A T                R   A T +   +G A                     L+F  +V  G+ 
Subjt:  EGRHRSPRSLAGGVAQTEVATRYRGWINHRSVATTGAEDSNLSHPHDGAHGRPACARTCATNPKGVAWGAAMRDTQADVPSARRLRAQLAFKDSVVRGIL

Query:  QFTPSIAFRYVLHRCESRDIRCRSRCIDNDPSAGSPTETLLRLLLPLNDK---VQWTSRAGQRTAHVAAIRTLHRTVQSVGATDGVYKGQGRSQRELM
                     RC +   R  +     DP+A    E     +LP +DK   V ++ R  QRTA VAAIRTLHRT+QS+GATDGVYKGQGRSQRELM
Subjt:  QFTPSIAFRYVLHRCESRDIRCRSRCIDNDPSAGSPTETLLRLLLPLNDK---VQWTSRAGQRTAHVAAIRTLHRTVQSVGATDGVYKGQGRSQRELM

TrEMBL top hitse value%identityAlignment
A0A6N2K0Y8 Uncharacterized protein4.4e-5941.32Show/hide
Query:  FQDGSNGEPTGRCQERADVEARPRARAASHDRDDDVSTGVTKAWALAAVSIRIGPCPKSIGGPA------------------------------------
        F+ G  G P    +          ARAA HDR D +STG       AA +IRIGP P+ IGGPA                                    
Subjt:  FQDGSNGEPTGRCQERADVEARPRARAASHDRDDDVSTGVTKAWALAAVSIRIGPCPKSIGGPA------------------------------------

Query:  ------------------LTPDWGCIPKQLDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDAEGADSQAGLFPRVVPPDLGSRREHR
                          L PDWGCIPKQ DS TAPRGA GS  +GALTLSGAPFQGT A SAAEDASPDYNS+A GA   +  FP  +    G      
Subjt:  ------------------LTPDWGCIPKQLDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDAEGADSQAGLFPRVVPPDLGSRREHR

Query:  LWGRRLEGRHRS-PRSLAGGVAQTEVATRYRGWINH--RSVATTGAEDSNLS-HPHDGAHGRPACARTCATNPKGVAWGAAMRDTQADVPSARRLRAQLA
        +  R   GR+ S   S   GV +   AT  R  ++    + A  G     L   P  G   R    R     P+GV  GA MRDTQADVPS RR RAQLA
Subjt:  LWGRRLEGRHRS-PRSLAGGVAQTEVATRYRGWINH--RSVATTGAEDSNLS-HPHDGAHGRPACARTCATNPKGVAWGAAMRDTQADVPSARRLRAQLA

Query:  FKDSVVRGILQFTPSIAFRYVLHRCESRDIRCRS-----------------------------------------RCI----------------------
        FKDS+V GILQFTPSIAFRYVLHRCESRDIRCR                                          RC+                      
Subjt:  FKDSVVRGILQFTPSIAFRYVLHRCESRDIRCRS-----------------------------------------RCI----------------------

Query:  ---------DNDPSAGSPTETLLRLLLPLNDKVQWTSR
                 DNDPSAGSPTETLLRLLLPLNDKVQWTSR
Subjt:  ---------DNDPSAGSPTETLLRLLLPLNDKVQWTSR

A0A6N2KB50 Uncharacterized protein (Fragment)2.3e-5547.29Show/hide
Query:  IGPCPKSIGGPALTPDWGCIPKQLDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDAEGADSQAGLFPRVVPPDLGSRREHRLWGRRL
        IG  P    G  L PDWGCIPKQ DS TAPRGA GS  +GALTLSGAPFQGT A SAAEDASPDYNS+A GA   +  FP  +    G      +  R  
Subjt:  IGPCPKSIGGPALTPDWGCIPKQLDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDAEGADSQAGLFPRVVPPDLGSRREHRLWGRRL

Query:  EGRHRS-PRSLAGGVAQTEVATRYRGWINH--RSVATTGAEDSNLS-HPHDGAHGRPACARTCATNPKGVAWGAAMRDTQADVPSARRLRAQLAFKDSVV
         GR+ S   S   GV +   AT  R  ++    + A  G     L   P  G   R    R     P+GV  GA MRDTQADVPS RR RAQLAFKDS+V
Subjt:  EGRHRS-PRSLAGGVAQTEVATRYRGWINH--RSVATTGAEDSNLS-HPHDGAHGRPACARTCATNPKGVAWGAAMRDTQADVPSARRLRAQLAFKDSVV

Query:  RGILQFTPSIAFRYVLHRCESRDIRCRS-----------------------------------------RCI----------------------------
         GILQFTPSIAFRYVLHRCESRDIRCR                                          RC+                            
Subjt:  RGILQFTPSIAFRYVLHRCESRDIRCRS-----------------------------------------RCI----------------------------

Query:  ---DNDPSAGSPTETLLRLLLPLNDKVQWTSR
           DNDPSAGSPTETLLRLLLPLNDKVQWTSR
Subjt:  ---DNDPSAGSPTETLLRLLLPLNDKVQWTSR

A0A6N2MTU4 Uncharacterized protein4.3e-6243.57Show/hide
Query:  TPWSVFQDGSNGEPTGRCQERADVEAR----------------------------------PRARAASHDRDDDVSTGVTKAWALAAVSIR---------
        TPWSVFQDG+NGEPTGRC ERA   AR                                  PRA   +  R      G ++   L +   +         
Subjt:  TPWSVFQDGSNGEPTGRCQERADVEAR----------------------------------PRARAASHDRDDDVSTGVTKAWALAAVSIR---------

Query:  -----------IGPCPKSIGGPALTPDWGCIPKQLDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDAEGADSQAGLFP------RVV
                   IG  P    G  L PDWGCIPKQ DS TAPRGA GS  +GALTLSGAPFQGT A SAAEDASPDYNS+A GA   +  FP      R +
Subjt:  -----------IGPCPKSIGGPALTPDWGCIPKQLDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDAEGADSQAGLFP------RVV

Query:  PPDLGSRREHRLWGRRLEGRHRSPRSLAGGVAQTEVATRYRGWINHRSVATTGAEDSNLSHPHDGAHGRPACARTCATNPKGVAWGAAMRDTQADVPSAR
         P LG        G  + G  R  R    G++ T      R  ++ R              P  G   R    R     P+GV  GA MRDTQADVPS R
Subjt:  PPDLGSRREHRLWGRRLEGRHRSPRSLAGGVAQTEVATRYRGWINHRSVATTGAEDSNLSHPHDGAHGRPACARTCATNPKGVAWGAAMRDTQADVPSAR

Query:  RLRAQLAFKDSVVRGILQFTPSIAFRYVLHRCESRDIRCR-----------------------------SRC------------------IDNDPSAGSP
        R RAQLAFKDS+V GILQFTPSIAFRYVLHRCESRDIRCR                             SR                    DNDPSAGSP
Subjt:  RLRAQLAFKDSVVRGILQFTPSIAFRYVLHRCESRDIRCR-----------------------------SRC------------------IDNDPSAGSP

Query:  TETLLRLLLPLNDKVQWTSR
        TETLLRLLLPLNDKVQWTSR
Subjt:  TETLLRLLLPLNDKVQWTSR

A0A6N2NG36 Uncharacterized protein4.4e-5942.42Show/hide
Query:  TPWSVFQDGSNGEPTGRCQERADVEAR-------PRA-RAASHDR--DDDVSTGVTKAWALAAVSIRIGPC---PKSIGGP-------------------
        TPWSVFQDG+NGEPTGRC ERA   AR       PR+ R   H R     +  G     A +  + R+ P    P  I GP                   
Subjt:  TPWSVFQDGSNGEPTGRCQERADVEAR-------PRA-RAASHDR--DDDVSTGVTKAWALAAVSIRIGPC---PKSIGGP-------------------

Query:  ---------------------ALTPDWGCIPKQLDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDAEGADSQAGLFPRVVPPDLGSR
                              L PDWGCIPKQ DS TAPRGA GS  +GALTLSGAPFQGT A SAAEDASPDYNS+A GA   +  FP       GS 
Subjt:  ---------------------ALTPDWGCIPKQLDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDAEGADSQAGLFPRVVPPDLGSR

Query:  REHRLWGRRLEGRHRSPRSLAGGVAQTEVATRYRGWINHRSVATTGAEDSNLSHPHDGAHGRPACARTCATNPKGVAWGAAMRDTQADVPSARRLRAQLA
           R  G  + G   +  +++ G++ T      R  ++ R      A D     P +               P+GV  GA MRDTQADVPS RR RAQLA
Subjt:  REHRLWGRRLEGRHRSPRSLAGGVAQTEVATRYRGWINHRSVATTGAEDSNLSHPHDGAHGRPACARTCATNPKGVAWGAAMRDTQADVPSARRLRAQLA

Query:  FKDSVVRGILQFTPSIAFRYVLHRCESRDIRCR------------------------------SRCI---------------------------------
        FKDS+V GILQFTPSIAFRYVLHRCESRDIRCR                              SR +                                 
Subjt:  FKDSVVRGILQFTPSIAFRYVLHRCESRDIRCR------------------------------SRCI---------------------------------

Query:  DNDPSAGSPTETLLRLLLPLNDKVQWTSR
        DNDPSAGSPTETLLRLLLPLNDKVQWTSR
Subjt:  DNDPSAGSPTETLLRLLLPLNDKVQWTSR

A0A7N2RFC1 Uncharacterized protein9.5e-6251.03Show/hide
Query:  TPWSVFQDGSNGEPTGRCQERADVEARPRARAASHDRDDDVSTGVT--KAW----------------ALAAVSIRIG------PCPK-------------
        TPWSVFQDG NGEPTGRCQERA+ EAR  AR A+HDR +DVSTG+T  +AW                 L+   IR G      P P              
Subjt:  TPWSVFQDGSNGEPTGRCQERADVEARPRARAASHDRDDDVSTGVT--KAW----------------ALAAVSIRIG------PCPK-------------

Query:  ----------SIG-------GPALTPDWGCIPKQLDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSD--AEGADSQAGLFPRVVPPDL
                  +IG       G  L PDWGCIPKQ DS TAPRGATGS  +GALTLSGAPFQGT ARSAAEDASPDYNS+      DSQAGLFP   P   
Subjt:  ----------SIG-------GPALTPDWGCIPKQLDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSD--AEGADSQAGLFPRVVPPDL

Query:  GSRREHRLWGRRLEGRHRSPRSLAGGVAQTEVATRYRGWINHRSVATTGAEDSNLSH---PHDGAHGRPACARTCATNPKGVAWGAAMRDTQADVPSARR
               L G  L+ R R   + +G  A+T   TR  G    ++    GA    L+       G HGRP  A      P+    GA MRDTQADVPSA  
Subjt:  GSRREHRLWGRRLEGRHRSPRSLAGGVAQTEVATRYRGWINHRSVATTGAEDSNLSH---PHDGAHGRPACARTCATNPKGVAWGAAMRDTQADVPSARR

Query:  LRAQLAFKDSVVRGILQFTPSIAFRYVLHRCESRDIRCR
        LRAQLAFK+S++RGILQFTPSIAFRYVLHRCES DIRCR
Subjt:  LRAQLAFKDSVVRGILQFTPSIAFRYVLHRCESRDIRCR

SwissProt top hitse value%identityAlignment
P0C5Q0 Putative uncharacterized protein YLR154W-F5.1e-0491.3Show/hide
Query:  NDPSAGSPTETLLRLLLPLNDKV
        NDPSAGSPTETLLRLL+PLND+V
Subjt:  NDPSAGSPTETLLRLLLPLNDKV

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACGATTTGCACGTCAGTATCGCTGCGGGCCTCCACCCGAGTTTCCTCTGGCTTCGCCCCGCTCAGGCATATTTCACCATCTTTCGGGTCCCGACAGGCATGCTCAC
ACTCGAACCCTTCTCAGAAGATCAAGGTCGGTCGGTGGTGCAACCCACGAGGGGATCCCACCAGTCACTTCCTTGCGCCTTACGGGTTTACTCGCCCGTTAACTCGCACA
CATGTCAGACTCCTTGGTCCGTGTTTCAAGACGGGTCGAATGGGGAGCCCACAGGCCGATGCCAGGAGCGCGCAGATGTCGAAGCACGTCCGAGGGCGCGCGCTGCCAGC
CACGATCGGGACGACGACGTCTCCACAGGCGTAACAAAGGCCTGGGCTTTGGCCGCCGTCTCAATCCGCATCGGTCCATGCCCCAAGTCGATCGGCGGACCGGCTCTCAC
CCCCGATTGGGGCTGCATTCCCAAACAACTCGACTCGTTGACAGCGCCTCGTGGTGCGACAGGGTCCGAGCGCAACGGGGCTCTCACCCTCTCCGGCGCCCCCTTCCAGG
GGACTTGTGCCCGGTCTGCCGCTGAGGACGCTTCTCCAGACTACAATTCGGACGCCGAGGGCGCCGATTCTCAAGCTGGGCTCTTCCCGCGGGTAGTCCCGCCTGACCTG
GGGTCGCGTCGAGAGCATCGTCTTTGGGGACGACGTTTAGAGGGTCGACATAGAAGTCCTCGCTCGCTCGCGGGAGGAGTTGCGCAGACTGAGGTCGCGACGCGGTACCG
AGGTTGGATCAACCACCGTAGTGTCGCGACGACAGGCGCCGAGGACTCGAATTTAAGCCATCCGCACGACGGTGCGCACGGGAGGCCAGCGTGTGCCCGCACCTGCGCAA
CCAACCCGAAGGGGGTTGCGTGGGGGGCAGCGATGCGTGACACCCAGGCAGACGTGCCCTCGGCCAGAAGGCTCCGGGCGCAACTTGCATTCAAAGACTCGGTGGTTCGC
GGGATCCTGCAATTCACACCAAGTATCGCATTTCGCTACGTTCTTCATCGATGCGAGAGCCGAGATATCCGTTGCCGAAGTCGTTGCATCGACAATGATCCTTCCGCAGG
TTCACCTACGGAAACCTTGTTACGACTTCTCCTTCCTCTAAATGATAAGGTTCAGTGGACTTCTCGCGCGGGGCAGCGAACCGCCCACGTCGCCGCGATCCGAACACTTC
ATCGGACCGTTCAATCGGTAGGAGCGACAGACGGTGTGTACAAAGGGCAGGGACGTAGTCAACGCGAGTTAATGGACTCGGAACCAATTGCAATGATCTATCCCCATCAC
GATGAAATTTCAAAGATTACCCGGGCCTGTCGGCCAAGGCTATAG
mRNA sequenceShow/hide mRNA sequence
ATGAACGATTTGCACGTCAGTATCGCTGCGGGCCTCCACCCGAGTTTCCTCTGGCTTCGCCCCGCTCAGGCATATTTCACCATCTTTCGGGTCCCGACAGGCATGCTCAC
ACTCGAACCCTTCTCAGAAGATCAAGGTCGGTCGGTGGTGCAACCCACGAGGGGATCCCACCAGTCACTTCCTTGCGCCTTACGGGTTTACTCGCCCGTTAACTCGCACA
CATGTCAGACTCCTTGGTCCGTGTTTCAAGACGGGTCGAATGGGGAGCCCACAGGCCGATGCCAGGAGCGCGCAGATGTCGAAGCACGTCCGAGGGCGCGCGCTGCCAGC
CACGATCGGGACGACGACGTCTCCACAGGCGTAACAAAGGCCTGGGCTTTGGCCGCCGTCTCAATCCGCATCGGTCCATGCCCCAAGTCGATCGGCGGACCGGCTCTCAC
CCCCGATTGGGGCTGCATTCCCAAACAACTCGACTCGTTGACAGCGCCTCGTGGTGCGACAGGGTCCGAGCGCAACGGGGCTCTCACCCTCTCCGGCGCCCCCTTCCAGG
GGACTTGTGCCCGGTCTGCCGCTGAGGACGCTTCTCCAGACTACAATTCGGACGCCGAGGGCGCCGATTCTCAAGCTGGGCTCTTCCCGCGGGTAGTCCCGCCTGACCTG
GGGTCGCGTCGAGAGCATCGTCTTTGGGGACGACGTTTAGAGGGTCGACATAGAAGTCCTCGCTCGCTCGCGGGAGGAGTTGCGCAGACTGAGGTCGCGACGCGGTACCG
AGGTTGGATCAACCACCGTAGTGTCGCGACGACAGGCGCCGAGGACTCGAATTTAAGCCATCCGCACGACGGTGCGCACGGGAGGCCAGCGTGTGCCCGCACCTGCGCAA
CCAACCCGAAGGGGGTTGCGTGGGGGGCAGCGATGCGTGACACCCAGGCAGACGTGCCCTCGGCCAGAAGGCTCCGGGCGCAACTTGCATTCAAAGACTCGGTGGTTCGC
GGGATCCTGCAATTCACACCAAGTATCGCATTTCGCTACGTTCTTCATCGATGCGAGAGCCGAGATATCCGTTGCCGAAGTCGTTGCATCGACAATGATCCTTCCGCAGG
TTCACCTACGGAAACCTTGTTACGACTTCTCCTTCCTCTAAATGATAAGGTTCAGTGGACTTCTCGCGCGGGGCAGCGAACCGCCCACGTCGCCGCGATCCGAACACTTC
ATCGGACCGTTCAATCGGTAGGAGCGACAGACGGTGTGTACAAAGGGCAGGGACGTAGTCAACGCGAGTTAATGGACTCGGAACCAATTGCAATGATCTATCCCCATCAC
GATGAAATTTCAAAGATTACCCGGGCCTGTCGGCCAAGGCTATAG
Protein sequenceShow/hide protein sequence
MNDLHVSIAAGLHPSFLWLRPAQAYFTIFRVPTGMLTLEPFSEDQGRSVVQPTRGSHQSLPCALRVYSPVNSHTCQTPWSVFQDGSNGEPTGRCQERADVEARPRARAAS
HDRDDDVSTGVTKAWALAAVSIRIGPCPKSIGGPALTPDWGCIPKQLDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDAEGADSQAGLFPRVVPPDL
GSRREHRLWGRRLEGRHRSPRSLAGGVAQTEVATRYRGWINHRSVATTGAEDSNLSHPHDGAHGRPACARTCATNPKGVAWGAAMRDTQADVPSARRLRAQLAFKDSVVR
GILQFTPSIAFRYVLHRCESRDIRCRSRCIDNDPSAGSPTETLLRLLLPLNDKVQWTSRAGQRTAHVAAIRTLHRTVQSVGATDGVYKGQGRSQRELMDSEPIAMIYPHH
DEISKITRACRPRL