; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10019994 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10019994
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionProtein of unknown function (DUF620)
Genome locationChr04:27694534..27696996
RNA-Seq ExpressionHG10019994
SyntenyHG10019994
Gene Ontology termsNA
InterPro domainsIPR006873 - Protein of unknown function DUF620


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7017537.1 hypothetical protein SDJN02_19402, partial [Cucurbita argyrosperma subsp. argyrosperma]1.7e-9882.17Show/hide
Query:  MQLQRMNRLAPLSEEPIDEHDGRTRNRNRN-------------TAGGGGRSWRNWIRTHFSILSFGKKSDGLNVLLSVLGCPLFPVSLQPNSVVSDTNQV
        MQLQRM+RLAPLSEEPIDE DGR RNRNR+               GGGGRSWRNWIRTH SILS GKKSDGLNVLLSVLGCPLFPVS++PN+ VS  NQV
Subjt:  MQLQRMNRLAPLSEEPIDEHDGRTRNRNRN-------------TAGGGGRSWRNWIRTHFSILSFGKKSDGLNVLLSVLGCPLFPVSLQPNSVVSDTNQV

Query:  SSSSQYIIEHFAAATGCRKLKGRVKNIFATGKITMGMADEVSS--GGGSGGGGPTGGVTQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPW
        SSSSQYIIEHFAAATGCRKL GRVKNIFATGK+TMG+ DEVSS  GGG GGGGPTGGVTQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPW
Subjt:  SSSSQYIIEHFAAATGCRKLKGRVKNIFATGKITMGMADEVSS--GGGSGGGGPTGGVTQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPW

Query:  LGSHAAKGAVRPLRRAFQAS--LISIIQNY
        LGSHAAKGAVRPLRRAFQAS  L+ +I +Y
Subjt:  LGSHAAKGAVRPLRRAFQAS--LISIIQNY

XP_008443008.1 PREDICTED: uncharacterized protein LOC103486737 [Cucumis melo]1.2e-10193.97Show/hide
Query:  MNRLAPLSEEPIDEHDGRTRNRNRNTAGGGGRSWRNWIRTHFSILSFGKKSDGLNVLLSVLGCPLFPVSLQPNSVVSDTNQVSSSSQYIIEHFAAATGCR
        MNRLAPLSEEPIDEHD RTRNR+R TAGGGGRSWRNWIRTHFSILS GKKSDGLNVLLSVLGCPLFPVSLQPNS VS TNQVSSSSQYIIEHFAAATGCR
Subjt:  MNRLAPLSEEPIDEHDGRTRNRNRNTAGGGGRSWRNWIRTHFSILSFGKKSDGLNVLLSVLGCPLFPVSLQPNSVVSDTNQVSSSSQYIIEHFAAATGCR

Query:  KLKGRVKNIFATGKITMGMADEVSS-GGGSGGGGPTGGVTQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLRRAFQ
        KL+GRVKNIFATGKITMGMA+EVSS GGG GGGGPTGGVT+KGCFVMWQMIPNKWLIEL+VGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLRRAFQ
Subjt:  KLKGRVKNIFATGKITMGMADEVSS-GGGSGGGGPTGGVTQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLRRAFQ

XP_022982999.1 uncharacterized protein LOC111481672 [Cucurbita maxima]8.9e-10088.57Show/hide
Query:  MQLQRMNRLAPLSEEPIDEHDGRTRNRNRN---TAGGGGRSWRNWIRTHFSILSFGKKSDGLNVLLSVLGCPLFPVSLQPNSVVSDTNQVSSSSQYIIEH
        MQLQRMNRLAPLSEEPIDEHDGRTRNRNR+   + GGGGRSWRNWIRTH SILS GK+SDGLNVLLSVLGCPLFPVS+QPN+ VS  NQVSSSSQYIIEH
Subjt:  MQLQRMNRLAPLSEEPIDEHDGRTRNRNRN---TAGGGGRSWRNWIRTHFSILSFGKKSDGLNVLLSVLGCPLFPVSLQPNSVVSDTNQVSSSSQYIIEH

Query:  FAAATGCRKLKGRVKNIFATGKITMGMADEVSSGGGSGG----GGPTGGVTQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAAKG
        FAAATGCRKL GRVKNIFATGK+TMG+ DEVSSGGG+ G    GGPTGGVTQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAAKG
Subjt:  FAAATGCRKLKGRVKNIFATGKITMGMADEVSSGGGSGG----GGPTGGVTQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAAKG

Query:  AVRPLRRAFQ
        AVRPLRRAFQ
Subjt:  AVRPLRRAFQ

XP_023526366.1 uncharacterized protein LOC111789880 [Cucurbita pepo subsp. pepo]4.4e-9987.38Show/hide
Query:  MQLQRMNRLAPLSEEPIDEHDGRTRNRNRNTAG------GGGRSWRNWIRTHFSILSFGKKSDGLNVLLSVLGCPLFPVSLQPNSVVSDTNQVSSSSQYI
        MQLQRMNRLAPLSEEPIDE DGRTRNRNR+ +G      GGGRSWRNWIRTH SILS GKKSDGLNVLLSVLGCPLFPVS+QPN+ VS  NQVSSSSQYI
Subjt:  MQLQRMNRLAPLSEEPIDEHDGRTRNRNRNTAG------GGGRSWRNWIRTHFSILSFGKKSDGLNVLLSVLGCPLFPVSLQPNSVVSDTNQVSSSSQYI

Query:  IEHFAAATGCRKLKGRVKNIFATGKITMGMADEVSSGGG-----SGGGGPTGGVTQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSH
        IEHFAAATGCRKL GRVKNIFATGK+TMG+ DEVSSGGG      GGGGPTGGVTQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSH
Subjt:  IEHFAAATGCRKLKGRVKNIFATGKITMGMADEVSSGGG-----SGGGGPTGGVTQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSH

Query:  AAKGAVRPLRRAFQ
        AAKGAVRPLRRAFQ
Subjt:  AAKGAVRPLRRAFQ

XP_038906374.1 uncharacterized protein LOC120092207 [Benincasa hispida]2.8e-10193.24Show/hide
Query:  MQLQRMNRLAPLSEEPIDEHDGRTRNRNRNTAGGGG---RSWRNWIRTHFSILSFGKKSDGLNVLLSVLGCPLFPVSLQPNSVVSDTNQVSSSSQYIIEH
        M LQRMNRLAPLSEEPIDEHD RTRNRNR++ GGGG   RSWRNWIRTHFSILS GKKSDGLNVLLSVLGCPLFPVSLQPNS VSDTNQVSSSSQYIIEH
Subjt:  MQLQRMNRLAPLSEEPIDEHDGRTRNRNRNTAGGGG---RSWRNWIRTHFSILSFGKKSDGLNVLLSVLGCPLFPVSLQPNSVVSDTNQVSSSSQYIIEH

Query:  FAAATGCRKLKGRVKNIFATGKITMGMADEVSS-GGGSGGGGPTGGVTQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVR
        FAAATGCRKLKGRVKNIF TGKITMGMADEVSS GGGSGGGG TGGVTQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAAKG VR
Subjt:  FAAATGCRKLKGRVKNIFATGKITMGMADEVSS-GGGSGGGGPTGGVTQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVR

Query:  PLRRAFQ
        PLRRAFQ
Subjt:  PLRRAFQ

TrEMBL top hitse value%identityAlignment
A0A0A0LBR2 Uncharacterized protein1.6e-9991.67Show/hide
Query:  MNRLAPLSEEPIDEHDGRTRNRNRNT----AGGGGRSWRNWIRTHFSILSFGKKSDGLNVLLSVLGCPLFPVSLQPNSVVSDTNQVSSSSQYIIEHFAAA
        MNRLAPLSEEPIDEHD RTR RNRN     AGGGGRSWRNWIRTHFSILS  KKSDGLNVLLSVLGCPLFPVSLQPNS VS TNQVSSSSQYIIEHFAAA
Subjt:  MNRLAPLSEEPIDEHDGRTRNRNRNT----AGGGGRSWRNWIRTHFSILSFGKKSDGLNVLLSVLGCPLFPVSLQPNSVVSDTNQVSSSSQYIIEHFAAA

Query:  TGCRKLKGRVKNIFATGKITMGMADEVSS--GGGSGGGGPTGGVTQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLR
        TGCRKL+GRVKNIFATGKITMGMA+EVSS  GGG GGGGPTGGVTQKGCFVMWQMIPNKWLIEL+VGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLR
Subjt:  TGCRKLKGRVKNIFATGKITMGMADEVSS--GGGSGGGGPTGGVTQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLR

Query:  RAFQ
        RAFQ
Subjt:  RAFQ

A0A1S3B7U5 uncharacterized protein LOC1034867376.0e-10293.97Show/hide
Query:  MNRLAPLSEEPIDEHDGRTRNRNRNTAGGGGRSWRNWIRTHFSILSFGKKSDGLNVLLSVLGCPLFPVSLQPNSVVSDTNQVSSSSQYIIEHFAAATGCR
        MNRLAPLSEEPIDEHD RTRNR+R TAGGGGRSWRNWIRTHFSILS GKKSDGLNVLLSVLGCPLFPVSLQPNS VS TNQVSSSSQYIIEHFAAATGCR
Subjt:  MNRLAPLSEEPIDEHDGRTRNRNRNTAGGGGRSWRNWIRTHFSILSFGKKSDGLNVLLSVLGCPLFPVSLQPNSVVSDTNQVSSSSQYIIEHFAAATGCR

Query:  KLKGRVKNIFATGKITMGMADEVSS-GGGSGGGGPTGGVTQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLRRAFQ
        KL+GRVKNIFATGKITMGMA+EVSS GGG GGGGPTGGVT+KGCFVMWQMIPNKWLIEL+VGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLRRAFQ
Subjt:  KLKGRVKNIFATGKITMGMADEVSS-GGGSGGGGPTGGVTQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLRRAFQ

A0A6J1F6M9 uncharacterized protein LOC1114413591.1e-9887.26Show/hide
Query:  MQLQRMNRLAPLSEEPIDEHDGRTRNRNRNTA----GGGGRSWRNWIRTHFSILSFGKKSDGLNVLLSVLGCPLFPVSLQPNSVVSDTNQVSSSSQYIIE
        MQLQRM+RLAPLSEEPIDE DGRTRNRNR+ +    GGGGRSWRNWIRTH SILS GKKSDGLNVLLSVLGCPLFPVS++PN+ VS  NQVSSSSQYIIE
Subjt:  MQLQRMNRLAPLSEEPIDEHDGRTRNRNRNTA----GGGGRSWRNWIRTHFSILSFGKKSDGLNVLLSVLGCPLFPVSLQPNSVVSDTNQVSSSSQYIIE

Query:  HFAAATGCRKLKGRVKNIFATGKITMGMADEVSSGG-----GSGGGGPTGGVTQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAA
        HFAAATGCRKL GRVKNIFATGK+TMG+ DEVSSGG     G GGGGPTGGVTQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAA
Subjt:  HFAAATGCRKLKGRVKNIFATGKITMGMADEVSSGG-----GSGGGGPTGGVTQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAA

Query:  KGAVRPLRRAFQ
        KGAVRPLRRAFQ
Subjt:  KGAVRPLRRAFQ

A0A6J1FR88 uncharacterized protein LOC1114477152.5e-9589.05Show/hide
Query:  MNRLAPLSEEPIDEHDGRTRNRNRNTA---GGGGRSWRNWIRTHFSILSFGKKSDGLNVLLSVLGCPLFPVSLQPNSVVSDTNQVSSSSQYIIEHFAAAT
        MNRLAPLSEEPIDE+DGRTR+RNR+TA    GGGRSWRNWIRTH SIL  GKKSD LNVLLSVLGCPLFPVS+QPN++VS  NQVSSSSQYIIEHF AAT
Subjt:  MNRLAPLSEEPIDEHDGRTRNRNRNTA---GGGGRSWRNWIRTHFSILSFGKKSDGLNVLLSVLGCPLFPVSLQPNSVVSDTNQVSSSSQYIIEHFAAAT

Query:  GCRKLKGRVKNIFATGKITMGMADEVSSGGGSGGGGPTGGVTQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLRRAF
        GCRKLKGRVKNIF TGK+TMGMADEVSSGGG GGGGPT GV QKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLRRAF
Subjt:  GCRKLKGRVKNIFATGKITMGMADEVSSGGGSGGGGPTGGVTQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLRRAF

Query:  Q
        Q
Subjt:  Q

A0A6J1J631 uncharacterized protein LOC1114816724.3e-10088.57Show/hide
Query:  MQLQRMNRLAPLSEEPIDEHDGRTRNRNRN---TAGGGGRSWRNWIRTHFSILSFGKKSDGLNVLLSVLGCPLFPVSLQPNSVVSDTNQVSSSSQYIIEH
        MQLQRMNRLAPLSEEPIDEHDGRTRNRNR+   + GGGGRSWRNWIRTH SILS GK+SDGLNVLLSVLGCPLFPVS+QPN+ VS  NQVSSSSQYIIEH
Subjt:  MQLQRMNRLAPLSEEPIDEHDGRTRNRNRN---TAGGGGRSWRNWIRTHFSILSFGKKSDGLNVLLSVLGCPLFPVSLQPNSVVSDTNQVSSSSQYIIEH

Query:  FAAATGCRKLKGRVKNIFATGKITMGMADEVSSGGGSGG----GGPTGGVTQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAAKG
        FAAATGCRKL GRVKNIFATGK+TMG+ DEVSSGGG+ G    GGPTGGVTQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAAKG
Subjt:  FAAATGCRKLKGRVKNIFATGKITMGMADEVSSGGGSGG----GGPTGGVTQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAAKG

Query:  AVRPLRRAFQ
        AVRPLRRAFQ
Subjt:  AVRPLRRAFQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G27690.1 Protein of unknown function (DUF620)1.1e-3140.19Show/hide
Query:  LAPLSE--EPIDEHDGRTRNRNRNTAGGGGRSWRNWIRTHFSIL--SFGKKSD----GLNVLLSVLGCPLFPV-----SLQPNSVVSDTNQVSSSSQYII
        LAP+ E  +P  E  G + + +R       R W NW++    +   S    SD     L +LL VLG PL PV      L P+  + +T   +SS+QYI+
Subjt:  LAPLSE--EPIDEHDGRTRNRNRNTAGGGGRSWRNWIRTHFSIL--SFGKKSD----GLNVLLSVLGCPLFPV-----SLQPNSVVSDTNQVSSSSQYII

Query:  EHFAAATGCRKLKGRVKNIFATGKITMGMADEVSSGG-GSGGGGPTGGVTQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAAKGA
        + + AA+G +KL   V+N +  G+I   MA E  +G  GS     +    + G FV+W M P+ W +EL +GG  ++AG DG + WRHTPWLG HAAKG 
Subjt:  EHFAAATGCRKLKGRVKNIFATGKITMGMADEVSSGG-GSGGGGPTGGVTQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAAKGA

Query:  VRPLRRAFQ
        VRPLRRA Q
Subjt:  VRPLRRAFQ

AT1G49840.1 Protein of unknown function (DUF620)2.5e-3139.07Show/hide
Query:  RMNRLAPLSEEPIDEHDGRTRNRNRNTAGGGGRSW--RNWIR--THFSILSFGKKSDGLNVLLSVLGCPLFPVSLQPNS-----VVSDTNQVSSSSQYII
        R + L P+ E P D  +G     +    G G   W    W R  +  S     +KSD L +LL V+G PL P+++  +S      + D+   +SS+QYI+
Subjt:  RMNRLAPLSEEPIDEHDGRTRNRNRNTAGGGGRSW--RNWIR--THFSILSFGKKSDGLNVLLSVLGCPLFPVSLQPNS-----VVSDTNQVSSSSQYII

Query:  EHFAAATGCRKLKGRVKNIFATGKITMGMADEVSSGGGSGGGGPTGGV-------TQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGS
        + + AA G  KL   +KN +A GK+ M +  E+ +        PTG V       ++ G FV+WQM P+ W +EL+VGG  + AG +G + WRHTPWLGS
Subjt:  EHFAAATGCRKLKGRVKNIFATGKITMGMADEVSSGGGSGGGGPTGGV-------TQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGS

Query:  HAAKGAVRPLRRAFQ
        H AKG VRPLRRA Q
Subjt:  HAAKGAVRPLRRAFQ

AT1G79420.1 Protein of unknown function (DUF620)1.0e-3238.6Show/hide
Query:  LAPLSEEP-IDEHDGRTRNRNRNTAGGGGRSW---RNWIRTHFSI---------------LSFGKKSDGLNVLLSVLGCPLFPVS-----LQPNSVVSDT
        L PL E P  D  D RT+  +         SW   R W + H  I                    K   L +LL VLGCPL P+S     L P+  +  +
Subjt:  LAPLSEEP-IDEHDGRTRNRNRNTAGGGGRSW---RNWIRTHFSI---------------LSFGKKSDGLNVLLSVLGCPLFPVS-----LQPNSVVSDT

Query:  NQV------SSSSQYIIEHFAAATGCRKLKGRVKNIFATGKITMGMADEVSSGGGSG---GGGPTGGVTQKGCFVMWQMIPNKWLIELAVGGHSIVAGSD
         Q+      +S++ YII+ + AATGC K     KN++ATG + M   +   + G S    GGG  G     GCFV+WQM P  W +EL +GG  +++GSD
Subjt:  NQV------SSSSQYIIEHFAAATGCRKLKGRVKNIFATGKITMGMADEVSSGGGSG---GGGPTGGVTQKGCFVMWQMIPNKWLIELAVGGHSIVAGSD

Query:  GNVAWRHTPWLGSHAAKGAVRPLRRAFQ
        G   WRHTPWLG+HAAKG  RPLRR  Q
Subjt:  GNVAWRHTPWLGSHAAKGAVRPLRRAFQ

AT3G19540.1 Protein of unknown function (DUF620)2.3e-2936.36Show/hide
Query:  RMNRLAPLSEEPIDEHDGRTRNRNRNTAGGGGRSWRNWIRTHFS-----ILSFGKKSDGLNVLLSVLGCPLFPVSLQ-----PNSVVSDTNQVSSSSQYI
        R   L P+ E P  +  G   N   +   G G    +W++   S       +   + + L +LL V+G PL P+ +      P+  + +T   +SS+QYI
Subjt:  RMNRLAPLSEEPIDEHDGRTRNRNRNTAGGGGRSWRNWIRTHFS-----ILSFGKKSDGLNVLLSVLGCPLFPVSLQ-----PNSVVSDTNQVSSSSQYI

Query:  IEHFAAATGCRKLKGRVKNIFATGKITMGMADEVSSGGGSGGGGPTGGVTQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAAKGA
        ++ + AA+G +KL+  +KN +A GK+ M  ++  ++        P+   T  G FV+WQM P+ W +ELAVGG  + AG +G + WRHTPWLGSH AKG 
Subjt:  IEHFAAATGCRKLKGRVKNIFATGKITMGMADEVSSGGGSGGGGPTGGVTQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAAKGA

Query:  VRPLRRAFQ
        VRPLRR  Q
Subjt:  VRPLRRAFQ

AT5G06610.1 Protein of unknown function (DUF620)1.0e-5657.07Show/hide
Query:  MNRLAPLSEEPIDEHDGRTRNRNRNTAGGGGRSWRNWIRTHFSILSFGKKSDGLNVLLSVLGCPLFPVSLQPNSVVSDTNQVSSSSQYIIEHFAAATGCR
        M RLAPL EEPIDE D +   R         +SW+ WI+T    + F KK D + +LLSV+GCPLFPV   P S +S   QVSSS+QYII+ FAAATGC+
Subjt:  MNRLAPLSEEPIDEHDGRTRNRNRNTAGGGGRSWRNWIRTHFSILSFGKKSDGLNVLLSVLGCPLFPVSLQPNSVVSDTNQVSSSSQYIIEHFAAATGCR

Query:  KLKGRVKNIFATGKITMGMADEVSSGGGSGGGGPTGGVTQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLRRAFQ
        KL G +KN F TGKITM M  +++S   S        V+ KGCFVMWQM+P KWLIEL  GGH + AGSDG + WR+TPWLG HAAKGA+RPLRRA Q
Subjt:  KLKGRVKNIFATGKITMGMADEVSSGGGSGGGGPTGGVTQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLRRAFQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGTTGCAGAGGATGAATCGTTTAGCGCCACTATCGGAGGAGCCGATCGACGAACACGACGGCCGCACTCGTAATCGCAACCGCAACACCGCTGGAGGAGGAGGACG
ATCGTGGCGGAACTGGATCAGAACTCATTTCTCCATCCTTTCTTTTGGAAAGAAGTCCGATGGCCTTAATGTTCTCCTCAGCGTCCTCGGATGCCCTCTCTTTCCGGTCT
CCCTTCAACCTAACTCCGTCGTCTCCGATACAAATCAGGTTTCGTCATCGTCTCAATATATCATAGAGCATTTCGCGGCAGCGACGGGGTGCCGGAAGTTGAAGGGGAGA
GTGAAGAACATATTTGCGACGGGAAAAATAACGATGGGGATGGCGGATGAGGTTAGCTCCGGCGGCGGAAGTGGCGGAGGAGGACCCACCGGCGGTGTAACACAAAAAGG
TTGCTTTGTGATGTGGCAAATGATTCCGAATAAGTGGCTGATAGAGCTGGCTGTGGGAGGCCACAGCATTGTGGCCGGCAGCGATGGCAACGTCGCTTGGAGGCACACGC
CTTGGCTTGGCTCTCACGCCGCTAAGGGCGCCGTTCGCCCTCTCCGCCGTGCTTTTCAGGCAAGTCTGATCTCCATTATCCAAAATTATTTATTTTTCACTATTTTACCA
ATATCTGCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGCAGTTGCAGAGGATGAATCGTTTAGCGCCACTATCGGAGGAGCCGATCGACGAACACGACGGCCGCACTCGTAATCGCAACCGCAACACCGCTGGAGGAGGAGGACG
ATCGTGGCGGAACTGGATCAGAACTCATTTCTCCATCCTTTCTTTTGGAAAGAAGTCCGATGGCCTTAATGTTCTCCTCAGCGTCCTCGGATGCCCTCTCTTTCCGGTCT
CCCTTCAACCTAACTCCGTCGTCTCCGATACAAATCAGGTTTCGTCATCGTCTCAATATATCATAGAGCATTTCGCGGCAGCGACGGGGTGCCGGAAGTTGAAGGGGAGA
GTGAAGAACATATTTGCGACGGGAAAAATAACGATGGGGATGGCGGATGAGGTTAGCTCCGGCGGCGGAAGTGGCGGAGGAGGACCCACCGGCGGTGTAACACAAAAAGG
TTGCTTTGTGATGTGGCAAATGATTCCGAATAAGTGGCTGATAGAGCTGGCTGTGGGAGGCCACAGCATTGTGGCCGGCAGCGATGGCAACGTCGCTTGGAGGCACACGC
CTTGGCTTGGCTCTCACGCCGCTAAGGGCGCCGTTCGCCCTCTCCGCCGTGCTTTTCAGGCAAGTCTGATCTCCATTATCCAAAATTATTTATTTTTCACTATTTTACCA
ATATCTGCTTAG
Protein sequenceShow/hide protein sequence
MQLQRMNRLAPLSEEPIDEHDGRTRNRNRNTAGGGGRSWRNWIRTHFSILSFGKKSDGLNVLLSVLGCPLFPVSLQPNSVVSDTNQVSSSSQYIIEHFAAATGCRKLKGR
VKNIFATGKITMGMADEVSSGGGSGGGGPTGGVTQKGCFVMWQMIPNKWLIELAVGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLRRAFQASLISIIQNYLFFTILP
ISA