; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0026726 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0026726
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionProtein of unknown function (DUF620)
Genome locationchr10:41055777..41058808
RNA-Seq ExpressionLag0026726
SyntenyLag0026726
Gene Ontology termsNA
InterPro domainsIPR006873 - Protein of unknown function DUF620


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6580785.1 hypothetical protein SDJN03_20787, partial [Cucurbita argyrosperma subsp. sororia]3.9e-9785.32Show/hide
Query:  MQLQRMNRLAPLSEEPIDEHDGRIRNRNRNRSATTA----------SGGRSWRNWIRTHLSILSFGKKSDGLNVLLSVLGCPLFPVSVQPNTFVSSTNQV
        MQLQRM+RLAPLSEEPIDE DGRIRNRNR+ S + +           GGRSWRNWIRTHLSILS GKKSDGLNVLLSVLGCPLFPVSV+PN FVSS NQV
Subjt:  MQLQRMNRLAPLSEEPIDEHDGRIRNRNRNRSATTA----------SGGRSWRNWIRTHLSILSFGKKSDGLNVLLSVLGCPLFPVSVQPNTFVSSTNQV

Query:  SSSSQYIIEHFAAATGCRKLKGGVKNIFATGKLTMGMVDEV---GSGGSGGGGPTGGVAQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPW
        SSSSQYIIEHFAAATGCRKL G VKNIFATGKLTMG+VDEV   G GG GGGGPTGGV QKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPW
Subjt:  SSSSQYIIEHFAAATGCRKLKGGVKNIFATGKLTMGMVDEV---GSGGSGGGGPTGGVAQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPW

Query:  LGSHAAKGAVRPLRRAFQ
        LGSHAAKGAVRPLRRAFQ
Subjt:  LGSHAAKGAVRPLRRAFQ

KAG7017537.1 hypothetical protein SDJN02_19402, partial [Cucurbita argyrosperma subsp. argyrosperma]1.0e-9785Show/hide
Query:  MQLQRMNRLAPLSEEPIDEHDGRIRNRNRNRSATTA----------SGGRSWRNWIRTHLSILSFGKKSDGLNVLLSVLGCPLFPVSVQPNTFVSSTNQV
        MQLQRM+RLAPLSEEPIDE DGRIRNRNR+ S + +           GGRSWRNWIRTHLSILS GKKSDGLNVLLSVLGCPLFPVSV+PN FVSS NQV
Subjt:  MQLQRMNRLAPLSEEPIDEHDGRIRNRNRNRSATTA----------SGGRSWRNWIRTHLSILSFGKKSDGLNVLLSVLGCPLFPVSVQPNTFVSSTNQV

Query:  SSSSQYIIEHFAAATGCRKLKGGVKNIFATGKLTMGMVDEV---GSGGSGGGGPTGGVAQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPW
        SSSSQYIIEHFAAATGCRKL G VKNIFATGKLTMG+VDEV   G GG GGGGPTGGV QKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPW
Subjt:  SSSSQYIIEHFAAATGCRKLKGGVKNIFATGKLTMGMVDEV---GSGGSGGGGPTGGVAQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPW

Query:  LGSHAAKGAVRPLRRAFQAN
        LGSHAAKGAVRPLRRAFQA+
Subjt:  LGSHAAKGAVRPLRRAFQAN

XP_022934078.1 uncharacterized protein LOC111441359 [Cucurbita moschata]6.7e-9787.26Show/hide
Query:  MQLQRMNRLAPLSEEPIDEHDGRIRNRNRNRSAT-TASGGRSWRNWIRTHLSILSFGKKSDGLNVLLSVLGCPLFPVSVQPNTFVSSTNQVSSSSQYIIE
        MQLQRM+RLAPLSEEPIDE DGR RNRNR+ S +    GGRSWRNWIRTHLSILS GKKSDGLNVLLSVLGCPLFPVSV+PN FVSS NQVSSSSQYIIE
Subjt:  MQLQRMNRLAPLSEEPIDEHDGRIRNRNRNRSAT-TASGGRSWRNWIRTHLSILSFGKKSDGLNVLLSVLGCPLFPVSVQPNTFVSSTNQVSSSSQYIIE

Query:  HFAAATGCRKLKGGVKNIFATGKLTMGMVDEVGSG------GSGGGGPTGGVAQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAA
        HFAAATGCRKL G VKNIFATGKLTMG+VDEV SG      G GGGGPTGGV QKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAA
Subjt:  HFAAATGCRKLKGGVKNIFATGKLTMGMVDEVGSG------GSGGGGPTGGVAQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAA

Query:  KGAVRPLRRAFQ
        KGAVRPLRRAFQ
Subjt:  KGAVRPLRRAFQ

XP_022982999.1 uncharacterized protein LOC111481672 [Cucurbita maxima]4.2e-9989.52Show/hide
Query:  MQLQRMNRLAPLSEEPIDEHDGRIRNRNRNRSATTASGGRSWRNWIRTHLSILSFGKKSDGLNVLLSVLGCPLFPVSVQPNTFVSSTNQVSSSSQYIIEH
        MQLQRMNRLAPLSEEPIDEHDGR RNRNR+ S +   GGRSWRNWIRTHLSILS GK+SDGLNVLLSVLGCPLFPVSVQPN FVSS NQVSSSSQYIIEH
Subjt:  MQLQRMNRLAPLSEEPIDEHDGRIRNRNRNRSATTASGGRSWRNWIRTHLSILSFGKKSDGLNVLLSVLGCPLFPVSVQPNTFVSSTNQVSSSSQYIIEH

Query:  FAAATGCRKLKGGVKNIFATGKLTMGMVDEVGSGG--SGG---GGPTGGVAQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAAKG
        FAAATGCRKL G VKNIFATGKLTMG+VDEV SGG  SGG   GGPTGGV QKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAAKG
Subjt:  FAAATGCRKLKGGVKNIFATGKLTMGMVDEVGSGG--SGG---GGPTGGVAQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAAKG

Query:  AVRPLRRAFQ
        AVRPLRRAFQ
Subjt:  AVRPLRRAFQ

XP_023526366.1 uncharacterized protein LOC111789880 [Cucurbita pepo subsp. pepo]1.0e-9787.38Show/hide
Query:  MQLQRMNRLAPLSEEPIDEHDGRIRNRNRNRSAT---TASGGRSWRNWIRTHLSILSFGKKSDGLNVLLSVLGCPLFPVSVQPNTFVSSTNQVSSSSQYI
        MQLQRMNRLAPLSEEPIDE DGR RNRNR+ S +    + GGRSWRNWIRTHLSILS GKKSDGLNVLLSVLGCPLFPVSVQPN FVSS NQVSSSSQYI
Subjt:  MQLQRMNRLAPLSEEPIDEHDGRIRNRNRNRSAT---TASGGRSWRNWIRTHLSILSFGKKSDGLNVLLSVLGCPLFPVSVQPNTFVSSTNQVSSSSQYI

Query:  IEHFAAATGCRKLKGGVKNIFATGKLTMGMVDEVGSGG------SGGGGPTGGVAQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSH
        IEHFAAATGCRKL G VKNIFATGKLTMG+VDEV SGG       GGGGPTGGV QKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSH
Subjt:  IEHFAAATGCRKLKGGVKNIFATGKLTMGMVDEVGSGG------SGGGGPTGGVAQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSH

Query:  AAKGAVRPLRRAFQ
        AAKGAVRPLRRAFQ
Subjt:  AAKGAVRPLRRAFQ

TrEMBL top hitse value%identityAlignment
A0A0A0LBR2 Uncharacterized protein2.2e-9386.27Show/hide
Query:  MNRLAPLSEEPIDEHDGRIRNRNRNR-SATTASGGRSWRNWIRTHLSILSFGKKSDGLNVLLSVLGCPLFPVSVQPNTFVSSTNQVSSSSQYIIEHFAAA
        MNRLAPLSEEPIDEHD R R RNRNR +A    GGRSWRNWIRTH SILS  KKSDGLNVLLSVLGCPLFPVS+QPN+ VS TNQVSSSSQYIIEHFAAA
Subjt:  MNRLAPLSEEPIDEHDGRIRNRNRNR-SATTASGGRSWRNWIRTHLSILSFGKKSDGLNVLLSVLGCPLFPVSVQPNTFVSSTNQVSSSSQYIIEHFAAA

Query:  TGCRKLKGGVKNIFATGKLTMGMVDEVGS---GGSGGGGPTGGVAQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLR
        TGCRKL+G VKNIFATGK+TMGM +EV S   GG GGGGPTGGV QKGCFVMWQMIPNKWLIEL+VGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLR
Subjt:  TGCRKLKGGVKNIFATGKLTMGMVDEVGS---GGSGGGGPTGGVAQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLR

Query:  RAFQ
        RAFQ
Subjt:  RAFQ

A0A6J1F6M9 uncharacterized protein LOC1114413593.2e-9787.26Show/hide
Query:  MQLQRMNRLAPLSEEPIDEHDGRIRNRNRNRSAT-TASGGRSWRNWIRTHLSILSFGKKSDGLNVLLSVLGCPLFPVSVQPNTFVSSTNQVSSSSQYIIE
        MQLQRM+RLAPLSEEPIDE DGR RNRNR+ S +    GGRSWRNWIRTHLSILS GKKSDGLNVLLSVLGCPLFPVSV+PN FVSS NQVSSSSQYIIE
Subjt:  MQLQRMNRLAPLSEEPIDEHDGRIRNRNRNRSAT-TASGGRSWRNWIRTHLSILSFGKKSDGLNVLLSVLGCPLFPVSVQPNTFVSSTNQVSSSSQYIIE

Query:  HFAAATGCRKLKGGVKNIFATGKLTMGMVDEVGSG------GSGGGGPTGGVAQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAA
        HFAAATGCRKL G VKNIFATGKLTMG+VDEV SG      G GGGGPTGGV QKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAA
Subjt:  HFAAATGCRKLKGGVKNIFATGKLTMGMVDEVGSG------GSGGGGPTGGVAQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAA

Query:  KGAVRPLRRAFQ
        KGAVRPLRRAFQ
Subjt:  KGAVRPLRRAFQ

A0A6J1FR88 uncharacterized protein LOC1114477151.8e-9588.5Show/hide
Query:  MNRLAPLSEEPIDEHDGRIRNRNRNRSATTASGGRSWRNWIRTHLSILSFGKKSDGLNVLLSVLGCPLFPVSVQPNTFVSSTNQVSSSSQYIIEHFAAAT
        MNRLAPLSEEPIDE+DGR R+RNR+ +  +  GGRSWRNWIRTHLSIL  GKKSD LNVLLSVLGCPLFPVSVQPNT VSS NQVSSSSQYIIEHF AAT
Subjt:  MNRLAPLSEEPIDEHDGRIRNRNRNRSATTASGGRSWRNWIRTHLSILSFGKKSDGLNVLLSVLGCPLFPVSVQPNTFVSSTNQVSSSSQYIIEHFAAAT

Query:  GCRKLKGGVKNIFATGKLTMGMVDEVGSGGSGGGGPTGGVAQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLRRAFQ
        GCRKLKG VKNIF TGKLTMGM DEV SGG GGGGPT GVAQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLRRAFQ
Subjt:  GCRKLKGGVKNIFATGKLTMGMVDEVGSGGSGGGGPTGGVAQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLRRAFQ

A0A6J1J631 uncharacterized protein LOC1114816722.0e-9989.52Show/hide
Query:  MQLQRMNRLAPLSEEPIDEHDGRIRNRNRNRSATTASGGRSWRNWIRTHLSILSFGKKSDGLNVLLSVLGCPLFPVSVQPNTFVSSTNQVSSSSQYIIEH
        MQLQRMNRLAPLSEEPIDEHDGR RNRNR+ S +   GGRSWRNWIRTHLSILS GK+SDGLNVLLSVLGCPLFPVSVQPN FVSS NQVSSSSQYIIEH
Subjt:  MQLQRMNRLAPLSEEPIDEHDGRIRNRNRNRSATTASGGRSWRNWIRTHLSILSFGKKSDGLNVLLSVLGCPLFPVSVQPNTFVSSTNQVSSSSQYIIEH

Query:  FAAATGCRKLKGGVKNIFATGKLTMGMVDEVGSGG--SGG---GGPTGGVAQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAAKG
        FAAATGCRKL G VKNIFATGKLTMG+VDEV SGG  SGG   GGPTGGV QKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAAKG
Subjt:  FAAATGCRKLKGGVKNIFATGKLTMGMVDEVGSGG--SGG---GGPTGGVAQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAAKG

Query:  AVRPLRRAFQ
        AVRPLRRAFQ
Subjt:  AVRPLRRAFQ

A0A6J1KUK6 uncharacterized protein LOC1114988045.7e-9489Show/hide
Query:  MNRLAPLSEEPIDEHDGRIRNRNRNRSATTASGGRSWRNWIRTHLSILSFGKKSDGLNVLLSVLGCPLFPVSVQPNTFVSSTNQVSSSSQYIIEHFAAAT
        MNRLAPLSEEPIDE DGR R+RNR+ +A +  GGRSWRNWIRTHLSIL  GKKSD LNVLLSVLGCPLFPVSVQPNT VSSTNQVSSSSQYIIEHF AAT
Subjt:  MNRLAPLSEEPIDEHDGRIRNRNRNRSATTASGGRSWRNWIRTHLSILSFGKKSDGLNVLLSVLGCPLFPVSVQPNTFVSSTNQVSSSSQYIIEHFAAAT

Query:  GCRKLKGGVKNIFATGKLTMGMVDEVGSGGSGGGGPTGGVAQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLRRAFQ
        GCRKLKG VKNIF TGKLTMGMVDEV    SGGGGPT GVAQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLRRAFQ
Subjt:  GCRKLKGGVKNIFATGKLTMGMVDEVGSGGSGGGGPTGGVAQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLRRAFQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G27690.1 Protein of unknown function (DUF620)3.3e-3343.26Show/hide
Query:  RSWRNWIRTHLSIL--SFGKKSD----GLNVLLSVLGCPLFPVSVQ-----PNTFVSSTNQVSSSSQYIIEHFAAATGCRKLKGGVKNIFATGKL-TMGM
        R W NW++  L +   S    SD     L +LL VLG PL PV V      P+  + +T   +SS+QYI++ + AA+G +KL   V+N +  G++ TM  
Subjt:  RSWRNWIRTHLSIL--SFGKKSD----GLNVLLSVLGCPLFPVSVQ-----PNTFVSSTNQVSSSSQYIIEHFAAATGCRKLKGGVKNIFATGKL-TMGM

Query:  VDEVGSGGSGGGGPTGGVAQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLRRAFQ
          E GS GS     +    + G FV+W M P+ W +EL +GG  ++AG DG + WRHTPWLG HAAKG VRPLRRA Q
Subjt:  VDEVGSGGSGGGGPTGGVAQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLRRAFQ

AT1G49840.1 Protein of unknown function (DUF620)7.5e-3039.17Show/hide
Query:  RMNRLAPLSEEPIDEHDGRIRNRNRNRSATTASGGRSW--RNWIR--THLSILSFGKKSDGLNVLLSVLGCPLFPVSVQPNTF-----VSSTNQVSSSSQ
        R + L P+ E P D  +G +   +  R     SG   W    W R  +  S     +KSD L +LL V+G PL P++V  ++      +  +   +SS+Q
Subjt:  RMNRLAPLSEEPIDEHDGRIRNRNRNRSATTASGGRSW--RNWIR--THLSILSFGKKSDGLNVLLSVLGCPLFPVSVQPNTF-----VSSTNQVSSSSQ

Query:  YIIEHFAAATGCRKLKGGVKNIFATGKLTMGMVDEVGSGGSGGGGPTGGV-------AQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWL
        YI++ + AA G  KL   +KN +A GKL M +  E+ +       PTG V       ++ G FV+WQM P+ W +EL+VGG  + AG +G + WRHTPWL
Subjt:  YIIEHFAAATGCRKLKGGVKNIFATGKLTMGMVDEVGSGGSGGGGPTGGV-------AQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWL

Query:  GSHAAKGAVRPLRRAFQ
        GSH AKG VRPLRRA Q
Subjt:  GSHAAKGAVRPLRRAFQ

AT1G79420.1 Protein of unknown function (DUF620)1.6e-3245.4Show/hide
Query:  KSDGLNVLLSVLGCPLFPVSVQ-----PNTFVSSTNQV------SSSSQYIIEHFAAATGCRKLKGGVKNIFATGKLTMGMVD-EVGSGGSG---GGGPT
        K   L +LL VLGCPL P+SV      P+  +  + Q+      +S++ YII+ + AATGC K     KN++ATG + M   + E+ +G S    GGG  
Subjt:  KSDGLNVLLSVLGCPLFPVSVQ-----PNTFVSSTNQV------SSSSQYIIEHFAAATGCRKLKGGVKNIFATGKLTMGMVD-EVGSGGSG---GGGPT

Query:  GGVAQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLRRAFQ
        G     GCFV+WQM P  W +EL +GG  +++GSDG   WRHTPWLG+HAAKG  RPLRR  Q
Subjt:  GGVAQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLRRAFQ

AT3G19540.1 Protein of unknown function (DUF620)6.8e-3136.97Show/hide
Query:  RMNRLAPLSEEPIDEHDGRIRNRNRNRSATTASGGRSWRNWIRTHLS-----ILSFGKKSDGLNVLLSVLGCPLFPVSVQ-----PNTFVSSTNQVSSSS
        R   L P+ E P  +  G   N   ++   +  G     +W++  LS       +   + + L +LL V+G PL P+ V      P+  + +T   +SS+
Subjt:  RMNRLAPLSEEPIDEHDGRIRNRNRNRSATTASGGRSWRNWIRTHLS-----ILSFGKKSDGLNVLLSVLGCPLFPVSVQ-----PNTFVSSTNQVSSSS

Query:  QYIIEHFAAATGCRKLKGGVKNIFATGKLTMGMVDEVGSGGSGGGGPTGGVAQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAAK
        QYI++ + AA+G +KL+  +KN +A GKL M +  E+ +            A+ G FV+WQM P+ W +ELAVGG  + AG +G + WRHTPWLGSH AK
Subjt:  QYIIEHFAAATGCRKLKGGVKNIFATGKLTMGMVDEVGSGGSGGGGPTGGVAQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAAK

Query:  GAVRPLRRAFQ
        G VRPLRR  Q
Subjt:  GAVRPLRRAFQ

AT5G06610.1 Protein of unknown function (DUF620)5.5e-5757Show/hide
Query:  MNRLAPLSEEPIDEHDGRIRNRNRNRSATTASGGRSWRNWIRTHLSILSFGKKSDGLNVLLSVLGCPLFPVSVQPNTFVSSTNQVSSSSQYIIEHFAAAT
        M RLAPL EEPIDE D       R  S  +    +SW+ WI+T L  + F KK D + +LLSV+GCPLFPV   P     S  QVSSS+QYII+ FAAAT
Subjt:  MNRLAPLSEEPIDEHDGRIRNRNRNRSATTASGGRSWRNWIRTHLSILSFGKKSDGLNVLLSVLGCPLFPVSVQPNTFVSSTNQVSSSSQYIIEHFAAAT

Query:  GCRKLKGGVKNIFATGKLTMGMVDEVGSGGSGGGGPTGGVAQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLRRAFQ
        GC+KL G +KN F TGK+TM MV ++ S  S        V+ KGCFVMWQM+P KWLIEL  GGH + AGSDG + WR+TPWLG HAAKGA+RPLRRA Q
Subjt:  GCRKLKGGVKNIFATGKLTMGMVDEVGSGGSGGGGPTGGVAQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLRRAFQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGTTGCAGAGGATGAATCGTCTCGCGCCGCTATCGGAGGAGCCGATCGACGAGCACGACGGCCGGATTCGAAATCGCAATCGCAACCGCAGCGCCACCACGGCCAG
CGGCGGACGATCGTGGCGGAACTGGATCAGAACTCATCTCTCCATCCTTTCTTTTGGAAAAAAGTCCGATGGCCTTAACGTTCTCCTCAGCGTCCTCGGCTGCCCTCTCT
TTCCGGTCTCCGTTCAACCTAATACCTTCGTCTCCTCTACCAATCAGGTTTCCTCGTCGTCTCAATATATCATAGAGCATTTTGCGGCGGCCACGGGTTGTCGGAAGTTG
AAAGGGGGAGTGAAGAACATATTTGCGACGGGGAAATTAACGATGGGGATGGTGGACGAGGTCGGCTCCGGCGGCAGCGGCGGAGGAGGACCGACGGGCGGGGTGGCACA
AAAAGGATGCTTTGTGATGTGGCAAATGATTCCGAATAAGTGGCTGATAGAGCTGGCAGTGGGAGGCCACAGCATTGTGGCCGGCAGCGATGGCAACGTCGCTTGGAGGC
ACACGCCTTGGCTTGGCTCTCACGCCGCTAAGGGCGCCGTCCGCCCTCTCCGCCGTGCTTTTCAGGCAAATTTTGACCATAGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGCAGTTGCAGAGGATGAATCGTCTCGCGCCGCTATCGGAGGAGCCGATCGACGAGCACGACGGCCGGATTCGAAATCGCAATCGCAACCGCAGCGCCACCACGGCCAG
CGGCGGACGATCGTGGCGGAACTGGATCAGAACTCATCTCTCCATCCTTTCTTTTGGAAAAAAGTCCGATGGCCTTAACGTTCTCCTCAGCGTCCTCGGCTGCCCTCTCT
TTCCGGTCTCCGTTCAACCTAATACCTTCGTCTCCTCTACCAATCAGGTTTCCTCGTCGTCTCAATATATCATAGAGCATTTTGCGGCGGCCACGGGTTGTCGGAAGTTG
AAAGGGGGAGTGAAGAACATATTTGCGACGGGGAAATTAACGATGGGGATGGTGGACGAGGTCGGCTCCGGCGGCAGCGGCGGAGGAGGACCGACGGGCGGGGTGGCACA
AAAAGGATGCTTTGTGATGTGGCAAATGATTCCGAATAAGTGGCTGATAGAGCTGGCAGTGGGAGGCCACAGCATTGTGGCCGGCAGCGATGGCAACGTCGCTTGGAGGC
ACACGCCTTGGCTTGGCTCTCACGCCGCTAAGGGCGCCGTCCGCCCTCTCCGCCGTGCTTTTCAGGCAAATTTTGACCATAGCTGA
Protein sequenceShow/hide protein sequence
MQLQRMNRLAPLSEEPIDEHDGRIRNRNRNRSATTASGGRSWRNWIRTHLSILSFGKKSDGLNVLLSVLGCPLFPVSVQPNTFVSSTNQVSSSSQYIIEHFAAATGCRKL
KGGVKNIFATGKLTMGMVDEVGSGGSGGGGPTGGVAQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLRRAFQANFDHS