; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0027181 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0027181
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionProtein of unknown function (DUF604)
Genome locationchr10:45640562..45641303
RNA-Seq ExpressionLag0027181
SyntenyLag0027181
Gene Ontology termsGO:0016020 - membrane (cellular component)
GO:0008375 - acetylglucosaminyltransferase activity (molecular function)
InterPro domainsIPR006740 - Protein of unknown function DUF604


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF3961948.1 hypothetical protein CMV_013484 [Castanea mollissima]1.5e-7361.18Show/hide
Query:  FCSIAFLSLLFSF---NQKPNSPNRHRL---QNSVHRRTTWTRNNPTNISHLLFGIAGSTKTWRKRQSYCELWWKPNVTRGFVWVDEKP--NATWPATSP
        FC+IA +S +F F   +Q    P+ HR+   Q   +   T    N TNISH++FGIAGSTKTW KR++Y ELWWKPN+TRGFVW+DEKP  N TWP TSP
Subjt:  FCSIAFLSLLFSF---NQKPNSPNRHRL---QNSVHRRTTWTRNNPTNISHLLFGIAGSTKTWRKRQSYCELWWKPNVTRGFVWVDEKP--NATWPATSP

Query:  PYRVSGDTSEFRYTCWYGSRSAIRLARIVKESFELGEENVRWFVMGDDDTVFFVENLRG--ARFD------------------PFVWT-AYGGGGYAISY
        PY+VSGD S F+YTCWYG+RSA+RLARIVKE+FELG  NVRWFVMGDDDTVFF ENL    +++D                   F +T AYGGGG+AISY
Subjt:  PYRVSGDTSEFRYTCWYGSRSAIRLARIVKESFELGEENVRWFVMGDDDTVFFVENLRG--ARFD------------------PFVWT-AYGGGGYAISY

Query:  PLAVELVRILDGCLDRYASLYGGDQKVQACVSEVGVP
        PLA  LVRILDGC+DRYA+LYG DQK+QAC+SE+GVP
Subjt:  PLAVELVRILDGCLDRYASLYGGDQKVQACVSEVGVP

KAG6580525.1 hypothetical protein SDJN03_20527, partial [Cucurbita argyrosperma subsp. sororia]1.5e-9473.39Show/hide
Query:  MKIQNSLKSRKFFKISLAFCSIAFLSLLFSFNQK-PNSPNRHRLQNSVHRRTTWTRNNPTNISHLLFGIAGSTKTWRKRQSYCELWWKPNVTRGFVWVDE
        ++ QNS+KS KFFKISL  CSIAFLSLLFSFNQK PN  N HR    +    +     PTN+SHLLFGIAGSTKTW+KRQSYCELWW PNVTRGFVWVDE
Subjt:  MKIQNSLKSRKFFKISLAFCSIAFLSLLFSFNQK-PNSPNRHRLQNSVHRRTTWTRNNPTNISHLLFGIAGSTKTWRKRQSYCELWWKPNVTRGFVWVDE

Query:  KPNATWPATSPPYRVSGDTSEFRYTCWYGSRSAIRLARIVKESFELGEENVRWFVMGDDDTVFFVENLRG--ARFD-------------------PFVWT
        KPNATWPATSPPYRVS DTSEF YTCWYGSRSA+RLARIVKESFELG ENVRWFVMGDDDTVFFVENL     ++D                       T
Subjt:  KPNATWPATSPPYRVSGDTSEFRYTCWYGSRSAIRLARIVKESFELGEENVRWFVMGDDDTVFFVENLRG--ARFD-------------------PFVWT

Query:  AYGGGGYAISYPLAVELVRILDGCLDRYASLYGGDQKVQACVSEVGVP
        AYGGGGYAISY LAVELVRILDGCLDRYASLYGGDQKVQACV+E+GVP
Subjt:  AYGGGGYAISYPLAVELVRILDGCLDRYASLYGGDQKVQACVSEVGVP

XP_022935184.1 uncharacterized protein LOC111442138 [Cucurbita moschata]1.5e-9473.39Show/hide
Query:  MKIQNSLKSRKFFKISLAFCSIAFLSLLFSFNQK-PNSPNRHRLQNSVHRRTTWTRNNPTNISHLLFGIAGSTKTWRKRQSYCELWWKPNVTRGFVWVDE
        ++ QNS+KS KFFKISL  CSIAFLSLLFSFNQK PN  N HR    +    +     PTN+SHLLFGIAGSTKTW+KRQSYCELWW PNVTRGFVWVDE
Subjt:  MKIQNSLKSRKFFKISLAFCSIAFLSLLFSFNQK-PNSPNRHRLQNSVHRRTTWTRNNPTNISHLLFGIAGSTKTWRKRQSYCELWWKPNVTRGFVWVDE

Query:  KPNATWPATSPPYRVSGDTSEFRYTCWYGSRSAIRLARIVKESFELGEENVRWFVMGDDDTVFFVENLRG--ARFD-------------------PFVWT
        KPNATWPATSPPYRVS DTSEF YTCWYGSRSA+RLARIVKESFELG ENVRWFVMGDDDTVFFVENL     ++D                       T
Subjt:  KPNATWPATSPPYRVSGDTSEFRYTCWYGSRSAIRLARIVKESFELGEENVRWFVMGDDDTVFFVENLRG--ARFD-------------------PFVWT

Query:  AYGGGGYAISYPLAVELVRILDGCLDRYASLYGGDQKVQACVSEVGVP
        AYGGGGYAISY LAVELVRILDGCLDRYASLYGGDQKVQACV+E+GVP
Subjt:  AYGGGGYAISYPLAVELVRILDGCLDRYASLYGGDQKVQACVSEVGVP

XP_022983299.1 uncharacterized protein LOC111481920 [Cucurbita maxima]3.0e-9574.8Show/hide
Query:  MKI--QNSLKSRKFFKISLAFCSIAFLSLLFSFNQK-PNSPNRHRLQNSVHRRTTWTRNNPTNISHLLFGIAGSTKTWRKRQSYCELWWKPNVTRGFVWV
        MKI  QNSLKSRKFFKISL  CSIAFLSLLFSFNQK PN  + HR    +    +     PTNISHLLFGIAGSTKTW+KRQSYCELWW PNVTRGFVWV
Subjt:  MKI--QNSLKSRKFFKISLAFCSIAFLSLLFSFNQK-PNSPNRHRLQNSVHRRTTWTRNNPTNISHLLFGIAGSTKTWRKRQSYCELWWKPNVTRGFVWV

Query:  DEKPNATWPATSPPYRVSGDTSEFRYTCWYGSRSAIRLARIVKESFELGEENVRWFVMGDDDTVFFVENLRG--ARFD-------------------PFV
        DEKPNATWPATSPPYRVS DTSEF YTCWYGSRSA+RLARIVKESFELG ENVRWFVMGDDDTVFFVENL     ++D                      
Subjt:  DEKPNATWPATSPPYRVSGDTSEFRYTCWYGSRSAIRLARIVKESFELGEENVRWFVMGDDDTVFFVENLRG--ARFD-------------------PFV

Query:  WTAYGGGGYAISYPLAVELVRILDGCLDRYASLYGGDQKVQACVSEVGVP
         TAYGGGGYAISY LAVELVRILDGCLDRYASLYGGDQKVQACV+E+GVP
Subjt:  WTAYGGGGYAISYPLAVELVRILDGCLDRYASLYGGDQKVQACVSEVGVP

XP_023910744.1 uncharacterized protein LOC112022386 [Quercus suber]1.9e-7361.18Show/hide
Query:  FCSIAFLSLLFSF---NQKPNSPNRHRL---QNSVHRRTTWTRNNPTNISHLLFGIAGSTKTWRKRQSYCELWWKPNVTRGFVWVDEKP--NATWPATSP
        FC+IA +S +F F   +Q    P+  R+   Q   +   T    N TNISH++FGIAGSTKTW KR++Y ELWWKPN+TRGFVW DEKP  N TWP TSP
Subjt:  FCSIAFLSLLFSF---NQKPNSPNRHRL---QNSVHRRTTWTRNNPTNISHLLFGIAGSTKTWRKRQSYCELWWKPNVTRGFVWVDEKP--NATWPATSP

Query:  PYRVSGDTSEFRYTCWYGSRSAIRLARIVKESFELGEENVRWFVMGDDDTVFFVENLRG--ARFDP-------------------FVWTAYGGGGYAISY
        PY+VSGD S F+YTCWYGSRSA+RLARIVKESFELG  NVRWFVMGDDDTVFF ENL    +++D                       TAYGGGG+AISY
Subjt:  PYRVSGDTSEFRYTCWYGSRSAIRLARIVKESFELGEENVRWFVMGDDDTVFFVENLRG--ARFDP-------------------FVWTAYGGGGYAISY

Query:  PLAVELVRILDGCLDRYASLYGGDQKVQACVSEVGVP
        PLA +LVRILDGC+DRYA+LYG DQK+QAC+SE+GVP
Subjt:  PLAVELVRILDGCLDRYASLYGGDQKVQACVSEVGVP

TrEMBL top hitse value%identityAlignment
A0A5C7HNJ7 Uncharacterized protein1.7e-7256.59Show/hide
Query:  SLKSRKFF----KISLAFCSIAFLSLLFSFN-----QKPNSPN-----RHRLQNSVHRRTTWTRNNPTNISHLLFGIAGSTKTWRKRQSYCELWWKPNVT
        ++KS   F    K +L   ++ ++S+ F ++     Q P+ P+      HR  + +H   +   +  TNISH++FGIAGS KTW  R+S+ ELWWKPNVT
Subjt:  SLKSRKFF----KISLAFCSIAFLSLLFSFN-----QKPNSPN-----RHRLQNSVHRRTTWTRNNPTNISHLLFGIAGSTKTWRKRQSYCELWWKPNVT

Query:  RGFVWVDEKP--NATWPATSPPYRVSGDTSEFRYTCWYGSRSAIRLARIVKESFELGEENVRWFVMGDDDTVFFVENLRG--ARFDPFVW----------
        RGFVW+DEKP  N TWP TSP Y+VS DTS F YTCWYGSRSA+RL RIVKESFELG +NVRWFVMGDDDTVFFVENL    A++D              
Subjt:  RGFVWVDEKP--NATWPATSPPYRVSGDTSEFRYTCWYGSRSAIRLARIVKESFELGEENVRWFVMGDDDTVFFVENLRG--ARFDPFVW----------

Query:  ---------TAYGGGGYAISYPLAVELVRILDGCLDRYASLYGGDQKVQACVSEVGVP
                 TAYGGGG+AISYPLA ELVRILDGCLDRYASLYG DQK+QAC++E+GVP
Subjt:  ---------TAYGGGGYAISYPLAVELVRILDGCLDRYASLYGGDQKVQACVSEVGVP

A0A6J1F4P4 uncharacterized protein LOC1114421387.2e-9573.39Show/hide
Query:  MKIQNSLKSRKFFKISLAFCSIAFLSLLFSFNQK-PNSPNRHRLQNSVHRRTTWTRNNPTNISHLLFGIAGSTKTWRKRQSYCELWWKPNVTRGFVWVDE
        ++ QNS+KS KFFKISL  CSIAFLSLLFSFNQK PN  N HR    +    +     PTN+SHLLFGIAGSTKTW+KRQSYCELWW PNVTRGFVWVDE
Subjt:  MKIQNSLKSRKFFKISLAFCSIAFLSLLFSFNQK-PNSPNRHRLQNSVHRRTTWTRNNPTNISHLLFGIAGSTKTWRKRQSYCELWWKPNVTRGFVWVDE

Query:  KPNATWPATSPPYRVSGDTSEFRYTCWYGSRSAIRLARIVKESFELGEENVRWFVMGDDDTVFFVENLRG--ARFD-------------------PFVWT
        KPNATWPATSPPYRVS DTSEF YTCWYGSRSA+RLARIVKESFELG ENVRWFVMGDDDTVFFVENL     ++D                       T
Subjt:  KPNATWPATSPPYRVSGDTSEFRYTCWYGSRSAIRLARIVKESFELGEENVRWFVMGDDDTVFFVENLRG--ARFD-------------------PFVWT

Query:  AYGGGGYAISYPLAVELVRILDGCLDRYASLYGGDQKVQACVSEVGVP
        AYGGGGYAISY LAVELVRILDGCLDRYASLYGGDQKVQACV+E+GVP
Subjt:  AYGGGGYAISYPLAVELVRILDGCLDRYASLYGGDQKVQACVSEVGVP

A0A6J1J6Y4 uncharacterized protein LOC1114819201.5e-9574.8Show/hide
Query:  MKI--QNSLKSRKFFKISLAFCSIAFLSLLFSFNQK-PNSPNRHRLQNSVHRRTTWTRNNPTNISHLLFGIAGSTKTWRKRQSYCELWWKPNVTRGFVWV
        MKI  QNSLKSRKFFKISL  CSIAFLSLLFSFNQK PN  + HR    +    +     PTNISHLLFGIAGSTKTW+KRQSYCELWW PNVTRGFVWV
Subjt:  MKI--QNSLKSRKFFKISLAFCSIAFLSLLFSFNQK-PNSPNRHRLQNSVHRRTTWTRNNPTNISHLLFGIAGSTKTWRKRQSYCELWWKPNVTRGFVWV

Query:  DEKPNATWPATSPPYRVSGDTSEFRYTCWYGSRSAIRLARIVKESFELGEENVRWFVMGDDDTVFFVENLRG--ARFD-------------------PFV
        DEKPNATWPATSPPYRVS DTSEF YTCWYGSRSA+RLARIVKESFELG ENVRWFVMGDDDTVFFVENL     ++D                      
Subjt:  DEKPNATWPATSPPYRVSGDTSEFRYTCWYGSRSAIRLARIVKESFELGEENVRWFVMGDDDTVFFVENLRG--ARFD-------------------PFV

Query:  WTAYGGGGYAISYPLAVELVRILDGCLDRYASLYGGDQKVQACVSEVGVP
         TAYGGGGYAISY LAVELVRILDGCLDRYASLYGGDQKVQACV+E+GVP
Subjt:  WTAYGGGGYAISYPLAVELVRILDGCLDRYASLYGGDQKVQACVSEVGVP

A0A7N2M4D5 Uncharacterized protein4.6e-7360.76Show/hide
Query:  FCSIAFLSLLFSF---NQKPNSPNRHRL---QNSVHRRTTWTRNNPTNISHLLFGIAGSTKTWRKRQSYCELWWKPNVTRGFVWVDEKP--NATWPATSP
        FC+IA +S +F F   +Q    P+  R+   Q   +   T    N TNISH++FGIAGSTKTW KR++Y ELWW+PN+TRGFVW+DEKP  N TWP TSP
Subjt:  FCSIAFLSLLFSF---NQKPNSPNRHRL---QNSVHRRTTWTRNNPTNISHLLFGIAGSTKTWRKRQSYCELWWKPNVTRGFVWVDEKP--NATWPATSP

Query:  PYRVSGDTSEFRYTCWYGSRSAIRLARIVKESFELGEENVRWFVMGDDDTVFFVENLRG--ARFD------------------PFVWT-AYGGGGYAISY
        PY+VSGD S F+YTCWYGSRSA+RLARIVKESFELG  NVRWFVMGDDDTVFF ENL    +++D                   F +T AYGGGG+AISY
Subjt:  PYRVSGDTSEFRYTCWYGSRSAIRLARIVKESFELGEENVRWFVMGDDDTVFFVENLRG--ARFD------------------PFVWT-AYGGGGYAISY

Query:  PLAVELVRILDGCLDRYASLYGGDQKVQACVSEVGVP
        PLA +LVRILDGC++RYA+LYG DQK+QAC+SE+GVP
Subjt:  PLAVELVRILDGCLDRYASLYGGDQKVQACVSEVGVP

A0A7N2M4F9 Uncharacterized protein2.7e-7361.18Show/hide
Query:  FCSIAFLSLLFSF---NQKPNSPNRHRL---QNSVHRRTTWTRNNPTNISHLLFGIAGSTKTWRKRQSYCELWWKPNVTRGFVWVDEKP--NATWPATSP
        FC+IA +S +F F   +Q    P+  R+   Q   +   T T    TNISH++FGIAGSTKTW KR++Y ELWWKPN+TRGFVW+DEKP  N TWP TSP
Subjt:  FCSIAFLSLLFSF---NQKPNSPNRHRL---QNSVHRRTTWTRNNPTNISHLLFGIAGSTKTWRKRQSYCELWWKPNVTRGFVWVDEKP--NATWPATSP

Query:  PYRVSGDTSEFRYTCWYGSRSAIRLARIVKESFELGEENVRWFVMGDDDTVFFVENLRG--ARFD----------------PFVWT---AYGGGGYAISY
        PY+VSGD S F+YTCWYGSRSA+RLARIVKESFELG  NVRWFVMGDDDTVFF ENL    +++D                  V++   AYGGGG+AISY
Subjt:  PYRVSGDTSEFRYTCWYGSRSAIRLARIVKESFELGEENVRWFVMGDDDTVFFVENLRG--ARFD----------------PFVWT---AYGGGGYAISY

Query:  PLAVELVRILDGCLDRYASLYGGDQKVQACVSEVGVP
        PLA +LVRILDGC+DRYA+LYG DQK+QAC+SE+GVP
Subjt:  PLAVELVRILDGCLDRYASLYGGDQKVQACVSEVGVP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G07850.1 Protein of unknown function (DUF604)2.3e-4544.97Show/hide
Query:  TNISHLLFGIAGSTKTWRKRQSYCELWWKPNVTRGFVWVDEKPNATWPATSPPYRVSGDTSEFRYTCWYGSRSAIRLARIVKESFELGEENVRWFVMGDD
        T + H++FGIA S+  W  R+ Y + WW+P  TRG VW+D++         P  R+S DTS FRYT   G RSA+R++R+V E+  LG++ VRWFVMGDD
Subjt:  TNISHLLFGIAGSTKTWRKRQSYCELWWKPNVTRGFVWVDEKPNATWPATSPPYRVSGDTSEFRYTCWYGSRSAIRLARIVKESFELGEENVRWFVMGDD

Query:  DTVFFVENLRG--ARFD--PFVWT-----------------AYGGGGYAISYPLAVELVRILDGCLDRYASLYGGDQKVQACVSEVGVP
        DTVF V+N+    +++D   F +                  A+GGGG+AISY LA+EL+R+ D C+ RY  LYG D ++QAC++E+GVP
Subjt:  DTVFFVENLRG--ARFD--PFVWT-----------------AYGGGGYAISYPLAVELVRILDGCLDRYASLYGGDQKVQACVSEVGVP

AT1G33250.1 Protein of unknown function (DUF604)2.2e-4346.56Show/hide
Query:  NISHLLFGIAGSTKTWRKRQSYCELWWKPNVTRGFVWVDEKPNATWPATS-PPYRVSGDTSEFRYTCWYGSRSAIRLARIVKESFELGEENVRWFVMGDD
        +++HL+FGIAGS++ W +R+    LWWKP+  RG VW++E+ +      S PP  VS D+S FRYT   G  S +R++RI  ESF L   NVRWFV+GDD
Subjt:  NISHLLFGIAGSTKTWRKRQSYCELWWKPNVTRGFVWVDEKPNATWPATS-PPYRVSGDTSEFRYTCWYGSRSAIRLARIVKESFELGEENVRWFVMGDD

Query:  DTVFFVENLRG--ARFDPFVWT-------------------AYGGGGYAISYPLAVELVRILDGCLDRYASLYGGDQKVQACVSEVGVP
        DT+F V NL    +++DP                       A+GGGG AISYPLA  L RI D CLDRY  LYG D ++ AC++E+GVP
Subjt:  DTVFFVENLRG--ARFDPFVWT-------------------AYGGGGYAISYPLAVELVRILDGCLDRYASLYGGDQKVQACVSEVGVP

AT2G37730.1 Protein of unknown function (DUF604)7.4e-6852.51Show/hide
Query:  SLKSRKFFKISLAFCSIAFLSLLFSF-----------NQKPNSPNRHRLQNSVHRRTTWTRNN----PTNISHLLFGIAGSTKTWRKRQSYCELWWKPNV
        SL ++ FF I + F S+A +S    F            ++    N +   NS    +T  R N     T+ISH+ FGI GS +TWR R  Y ELWW+PNV
Subjt:  SLKSRKFFKISLAFCSIAFLSLLFSF-----------NQKPNSPNRHRLQNSVHRRTTWTRNN----PTNISHLLFGIAGSTKTWRKRQSYCELWWKPNV

Query:  TRGFVWVDEKP--NATWPATSPPYRVSGDTSEFRYTCWYGSRSAIRLARIVKESFELGEENVRWFVMGDDDTVFFVENLRGA--RFD-------------
        TRGF+W+DE+P  N TW +TSPPY+VS DTS F YTCWYGSRSAIR+ARI+KE+FELG  +VRWF+MGDDDTVFFV+NL     ++D             
Subjt:  TRGFVWVDEKP--NATWPATSPPYRVSGDTSEFRYTCWYGSRSAIRLARIVKESFELGEENVRWFVMGDDDTVFFVENLRGA--RFD-------------

Query:  ------PFVWTAYGGGGYAISYPLAVELVRILDGCLDRYASLYGGDQKVQACVSEVGVP
                   AYGGGG AISYPLAVELV++LDGC+DRYASLYG DQK++AC+SE+GVP
Subjt:  ------PFVWTAYGGGGYAISYPLAVELVRILDGCLDRYASLYGGDQKVQACVSEVGVP

AT4G23490.1 Protein of unknown function (DUF604)1.7e-4339.08Show/hide
Query:  SLAFCSIAFLSLLFSFNQKPNSPN-RHRLQNSVHRRTTWTRNNPTNISHLLFGIAGSTKTWRKRQSYCELWWKPNVTRGFVWVDEKPNATWPATS-----
        S +F +++ LS   S N    S +   R +N          +  T+++H++FGIA S+K W++R+ Y ++W+KP   RG+VW+D++   +          
Subjt:  SLAFCSIAFLSLLFSFNQKPNSPN-RHRLQNSVHRRTTWTRNNPTNISHLLFGIAGSTKTWRKRQSYCELWWKPNVTRGFVWVDEKPNATWPATS-----

Query:  PPYRVSGDTSEFRYTCWYGSRSAIRLARIVKESFELGEENVRWFVMGDDDTVFFVEN----LRGARFDPFVW-----------------TAYGGGGYAIS
        PP ++SG T+ F YT   G RSA+R++RIV E+  LG +NVRWFVMGDDDTVF ++N    LR    +   +                  AYGGGG+AIS
Subjt:  PPYRVSGDTSEFRYTCWYGSRSAIRLARIVKESFELGEENVRWFVMGDDDTVFFVEN----LRGARFDPFVW-----------------TAYGGGGYAIS

Query:  YPLAVELVRILDGCLDRYASLYGGDQKVQACVSEVGVP
        YPLA  L ++ D C+ RY +LYG D ++QAC++E+GVP
Subjt:  YPLAVELVRILDGCLDRYASLYGGDQKVQACVSEVGVP

AT5G41460.1 Protein of unknown function (DUF604)1.7e-4345.6Show/hide
Query:  TNISHLLFGIAGSTKTWRKRQSYCELWWKPNVTRGFVWVDEKP----NATWPATSPPYRVSGDTSEFRYTCWYGSRSAIRLARIVKESFELGEENVRWFV
        T   H++FGIA S + W++R+ Y ++W+KPN  R +VW+ EKP    +     + PP ++SGDTS+F Y    G RSAIR++RIV E+ +LG ++VRWFV
Subjt:  TNISHLLFGIAGSTKTWRKRQSYCELWWKPNVTRGFVWVDEKP----NATWPATSPPYRVSGDTSEFRYTCWYGSRSAIRLARIVKESFELGEENVRWFV

Query:  MGDDDTVFFVENL--------------RGARFDPFVWT-------AYGGGGYAISYPLAVELVRILDGCLDRYASLYGGDQKVQACVSEVGVP
        MGDDDTVF  ENL               G+  +  +         AYGGGG+AISYPLAV L ++ D C+ RY +LYG D ++QAC++E+GVP
Subjt:  MGDDDTVFFVENL--------------RGARFDPFVWT-------AYGGGGYAISYPLAVELVRILDGCLDRYASLYGGDQKVQACVSEVGVP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGATTCAAAACTCATTGAAATCTCGTAAATTCTTCAAAATCTCACTTGCTTTCTGCTCCATAGCTTTCCTTTCCCTCCTGTTTTCCTTCAACCAGAAACCCAACTC
TCCCAACCGCCACCGCCTCCAAAACTCCGTCCACCGGAGAACCACGTGGACTCGCAACAACCCGACGAACATATCCCACCTTCTATTCGGCATAGCGGGGTCCACCAAGA
CATGGAGAAAGCGTCAAAGCTACTGCGAGCTCTGGTGGAAGCCCAACGTCACCCGCGGCTTCGTCTGGGTCGACGAGAAGCCCAACGCCACGTGGCCGGCCACGTCCCCG
CCGTACAGAGTCTCCGGCGACACGTCGGAGTTCAGGTACACGTGCTGGTACGGGTCCCGGTCGGCGATAAGGCTTGCGAGGATCGTGAAGGAGAGCTTCGAACTGGGAGA
AGAGAACGTGCGGTGGTTCGTGATGGGCGACGACGACACCGTTTTCTTTGTGGAGAACTTGCGTGGAGCAAGATTTGATCCATTTGTATGGACGGCGTACGGCGGCGGCG
GATATGCGATAAGTTATCCGCTGGCGGTGGAGCTGGTGAGGATTTTGGACGGCTGTCTTGATCGGTACGCCAGTCTCTACGGCGGCGATCAGAAAGTCCAGGCTTGCGTT
AGCGAGGTTGGTGTCCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGATTCAAAACTCATTGAAATCTCGTAAATTCTTCAAAATCTCACTTGCTTTCTGCTCCATAGCTTTCCTTTCCCTCCTGTTTTCCTTCAACCAGAAACCCAACTC
TCCCAACCGCCACCGCCTCCAAAACTCCGTCCACCGGAGAACCACGTGGACTCGCAACAACCCGACGAACATATCCCACCTTCTATTCGGCATAGCGGGGTCCACCAAGA
CATGGAGAAAGCGTCAAAGCTACTGCGAGCTCTGGTGGAAGCCCAACGTCACCCGCGGCTTCGTCTGGGTCGACGAGAAGCCCAACGCCACGTGGCCGGCCACGTCCCCG
CCGTACAGAGTCTCCGGCGACACGTCGGAGTTCAGGTACACGTGCTGGTACGGGTCCCGGTCGGCGATAAGGCTTGCGAGGATCGTGAAGGAGAGCTTCGAACTGGGAGA
AGAGAACGTGCGGTGGTTCGTGATGGGCGACGACGACACCGTTTTCTTTGTGGAGAACTTGCGTGGAGCAAGATTTGATCCATTTGTATGGACGGCGTACGGCGGCGGCG
GATATGCGATAAGTTATCCGCTGGCGGTGGAGCTGGTGAGGATTTTGGACGGCTGTCTTGATCGGTACGCCAGTCTCTACGGCGGCGATCAGAAAGTCCAGGCTTGCGTT
AGCGAGGTTGGTGTCCCTTGA
Protein sequenceShow/hide protein sequence
MKIQNSLKSRKFFKISLAFCSIAFLSLLFSFNQKPNSPNRHRLQNSVHRRTTWTRNNPTNISHLLFGIAGSTKTWRKRQSYCELWWKPNVTRGFVWVDEKPNATWPATSP
PYRVSGDTSEFRYTCWYGSRSAIRLARIVKESFELGEENVRWFVMGDDDTVFFVENLRGARFDPFVWTAYGGGGYAISYPLAVELVRILDGCLDRYASLYGGDQKVQACV
SEVGVP