; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi02G003780 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi02G003780
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionUVR domain-containing protein/DUF525 domain-containing protein
Genome locationchr02:3260754..3271246
RNA-Seq ExpressionLsi02G003780
SyntenyLsi02G003780
Gene Ontology termsNA
InterPro domainsIPR007474 - ApaG domain
IPR036767 - ApaG domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7012250.1 apaG [Cucurbita argyrosperma subsp. argyrosperma]3.6e-10375.52Show/hide
Query:  MHSLSFKVVGDFNAGSRMNLPRRGSFCFPEMEVGAMRCFGGRRAFGRTCRIVACASERDGDGGGGQSQSQSASTSRSRSFLSRSETYALLKQQLEVAAKS
        MHSLSFKV GDFNAGSRMNLP RGSFC PEMEVG MRCFGGRRAFGR CRIVACASER+GDGGG  SQSQSASTSRSRSFLSRSETYALLKQQLEVAAKS
Subjt:  MHSLSFKVVGDFNAGSRMNLPRRGSFCFPEMEVGAMRCFGGRRAFGRTCRIVACASERDGDGGGGQSQSQSASTSRSRSFLSRSETYALLKQQLEVAAKS

Query:  EKLGVKKLGEETFTRSLIPSFSLDLWPEWVRDYEEAARIRDLLKLFEEEEPVLRLRRLMKEAISSERFE---------------------------GIRV
        E                              DYEEAARIRD LKLFEEEEPVLRLRRLMKEAISSERFE                           GIRV
Subjt:  EKLGVKKLGEETFTRSLIPSFSLDLWPEWVRDYEEAARIRDLLKLFEEEEPVLRLRRLMKEAISSERFE---------------------------GIRV

Query:  QVRSVYIEGRSQPSKNQYFFAYRIRITNNSNRPVQLLRRHWIITDANGKTENVWGVGVIGEQPVILPRTGFEYSSACPLSTANGRM
        QVRSVYIEGRSQP KNQYFFAYRIRITNNSNRPVQLLRRHWIITDANGKTENV GVGVIGEQPVILP+TGFEYSSACPL+TANGRM
Subjt:  QVRSVYIEGRSQPSKNQYFFAYRIRITNNSNRPVQLLRRHWIITDANGKTENVWGVGVIGEQPVILPRTGFEYSSACPLSTANGRM

XP_022954598.1 uncharacterized protein LOC111456816 [Cucurbita moschata]7.3e-10475.87Show/hide
Query:  MHSLSFKVVGDFNAGSRMNLPRRGSFCFPEMEVGAMRCFGGRRAFGRTCRIVACASERDGDGGGGQSQSQSASTSRSRSFLSRSETYALLKQQLEVAAKS
        MHSLSFKV GDFNAGSRMNLP RGSFC PEMEVG MRCFGGRRAFGR CRIVACASER+GDGGG  SQSQSASTSRSRSFLSRSETYALLKQQLEVAAKS
Subjt:  MHSLSFKVVGDFNAGSRMNLPRRGSFCFPEMEVGAMRCFGGRRAFGRTCRIVACASERDGDGGGGQSQSQSASTSRSRSFLSRSETYALLKQQLEVAAKS

Query:  EKLGVKKLGEETFTRSLIPSFSLDLWPEWVRDYEEAARIRDLLKLFEEEEPVLRLRRLMKEAISSERFE---------------------------GIRV
        E                              DYEEAARIRD LKLFEEEEPVLRLRRLMKEAISSERFE                           GIRV
Subjt:  EKLGVKKLGEETFTRSLIPSFSLDLWPEWVRDYEEAARIRDLLKLFEEEEPVLRLRRLMKEAISSERFE---------------------------GIRV

Query:  QVRSVYIEGRSQPSKNQYFFAYRIRITNNSNRPVQLLRRHWIITDANGKTENVWGVGVIGEQPVILPRTGFEYSSACPLSTANGRM
        QVRSVYIEGRSQPSKNQYFFAYRIRITNNSNRPVQLLRRHWIITDANGKTENV GVGVIGEQPVILP+TGFEYSSACPL+TANGRM
Subjt:  QVRSVYIEGRSQPSKNQYFFAYRIRITNNSNRPVQLLRRHWIITDANGKTENVWGVGVIGEQPVILPRTGFEYSSACPLSTANGRM

XP_023000699.1 uncharacterized protein LOC111495061 [Cucurbita maxima]8.1e-10374.48Show/hide
Query:  MHSLSFKVVGDFNAGSRMNLPRRGSFCFPEMEVGAMRCFGGRRAFGRTCRIVACASERDGDGGGGQSQSQSASTSRSRSFLSRSETYALLKQQLEVAAKS
        MHSLSFKV+GDFNA SRMNLP RGSFC PEMEVGAMRCFG  RAFGRTCRIVACASER+GDGGGG+ QSQS+STS SRSFLSRSETYALLKQQLEVAAKS
Subjt:  MHSLSFKVVGDFNAGSRMNLPRRGSFCFPEMEVGAMRCFGGRRAFGRTCRIVACASERDGDGGGGQSQSQSASTSRSRSFLSRSETYALLKQQLEVAAKS

Query:  EKLGVKKLGEETFTRSLIPSFSLDLWPEWVRDYEEAARIRDLLKLFEEEEPVLRLRRLMKEAISSERFE---------------------------GIRV
        E                              DYEEAARIRD LKLFEEEEPVLRLRRLMKEAISSERFE                           GIRV
Subjt:  EKLGVKKLGEETFTRSLIPSFSLDLWPEWVRDYEEAARIRDLLKLFEEEEPVLRLRRLMKEAISSERFE---------------------------GIRV

Query:  QVRSVYIEGRSQPSKNQYFFAYRIRITNNSNRPVQLLRRHWIITDANGKTENVWGVGVIGEQPVILPRTGFEYSSACPLSTANGRM
        QVRSVYIEGRSQPSKN YFFAYRIRITNNS+RP+QLLRRHWIITDANG+TENVWGVGVIGEQPVILPRTGFEYSSACPLSTANGRM
Subjt:  QVRSVYIEGRSQPSKNQYFFAYRIRITNNSNRPVQLLRRHWIITDANGKTENVWGVGVIGEQPVILPRTGFEYSSACPLSTANGRM

XP_031736034.1 polymerase delta-interacting protein 2 isoform X2 [Cucumis sativus]1.1e-10482.24Show/hide
Query:  MHSLSFKVVGDFNAGSRMNLPRRGSFCFPEMEVGAMRCFGGRRAFGRTCRIVACASERDGDGGGGQSQSQSASTSRSRSFLSRSETYALLKQQLEVAAKS
        MHSLSFKVVG FNAGSRMNLPRR     PEMEVGAMRCFGGRRAFGR+CRIVACASER+  G GG+ QSQSASTSRSRSFLSRSETYALLKQQLEVAAKS
Subjt:  MHSLSFKVVGDFNAGSRMNLPRRGSFCFPEMEVGAMRCFGGRRAFGRTCRIVACASERDGDGGGGQSQSQSASTSRSRSFLSRSETYALLKQQLEVAAKS

Query:  EKLGVKKLGEETFTRSLIPSFSLDLWPEWVRDYEEAARIRDLLKLFEEEEPVLRLRRLMKEAISSERFEGIRVQVRSVYIEGRSQPSKNQYFFAYRIRIT
        E                              DYEEAARIRD LKLFEEEEPVLRLRRLMKEAISSERFEGIRVQVRSVYIEGRSQPSKNQYFFAYRIRIT
Subjt:  EKLGVKKLGEETFTRSLIPSFSLDLWPEWVRDYEEAARIRDLLKLFEEEEPVLRLRRLMKEAISSERFEGIRVQVRSVYIEGRSQPSKNQYFFAYRIRIT

Query:  NNSNRPVQLLRRHWIITDANGKTENVWGVGVIGEQPVILPRTGFEYSSACPLSTANGRM
        NNSNRPVQLLRRHWIITDANGKTENVWGVGVIGEQPVILP+TGFEYSSACPL+TANGRM
Subjt:  NNSNRPVQLLRRHWIITDANGKTENVWGVGVIGEQPVILPRTGFEYSSACPLSTANGRM

XP_038895353.1 uncharacterized protein LOC120083605 [Benincasa hispida]7.8e-10676.57Show/hide
Query:  MHSLSFKVVGDFNAGSRMNLPRRGSFCFPEMEVGAMRCFGGRRAFGRTCRIVACASERDGDGGGGQSQSQSASTSRSRSFLSRSETYALLKQQLEVAAKS
        MHSLSFKVVGDFNAGSRMNLPRRGSFCFPEMEVGAMR  GGRRAFGR CRIVAC SER+GDGGGG  QSQSASTSRSRSFLSRSETYALLKQQLEVAAKS
Subjt:  MHSLSFKVVGDFNAGSRMNLPRRGSFCFPEMEVGAMRCFGGRRAFGRTCRIVACASERDGDGGGGQSQSQSASTSRSRSFLSRSETYALLKQQLEVAAKS

Query:  EKLGVKKLGEETFTRSLIPSFSLDLWPEWVRDYEEAARIRDLLKLFEEEEPVLRLRRLMKEAISSERFE---------------------------GIRV
        E                              DYEEAARIRD LKLFEEEEPVLRLRRLMKEAISSERFE                           GIRV
Subjt:  EKLGVKKLGEETFTRSLIPSFSLDLWPEWVRDYEEAARIRDLLKLFEEEEPVLRLRRLMKEAISSERFE---------------------------GIRV

Query:  QVRSVYIEGRSQPSKNQYFFAYRIRITNNSNRPVQLLRRHWIITDANGKTENVWGVGVIGEQPVILPRTGFEYSSACPLSTANGRM
        QVRS+YIEGRSQPSKNQYFFAYRIRITNNSNRPVQLLRRHWIITDANGKTENVWGVGVIGEQPVILP+TGFEYSSACPLSTANGRM
Subjt:  QVRSVYIEGRSQPSKNQYFFAYRIRITNNSNRPVQLLRRHWIITDANGKTENVWGVGVIGEQPVILPRTGFEYSSACPLSTANGRM

TrEMBL top hitse value%identityAlignment
A0A0A0LWB4 Uncharacterized protein1.4e-10074.48Show/hide
Query:  MHSLSFKVVGDFNAGSRMNLPRRGSFCFPEMEVGAMRCFGGRRAFGRTCRIVACASERDGDGGGGQSQSQSASTSRSRSFLSRSETYALLKQQLEVAAKS
        MHSLSFKVVG FNAGSRMNLPRR     PEMEVGAMRCFGGRRAFGR+CRIVACASER+  G GG+ QSQSASTSRSRSFLSRSETYALLKQQLEVAAKS
Subjt:  MHSLSFKVVGDFNAGSRMNLPRRGSFCFPEMEVGAMRCFGGRRAFGRTCRIVACASERDGDGGGGQSQSQSASTSRSRSFLSRSETYALLKQQLEVAAKS

Query:  EKLGVKKLGEETFTRSLIPSFSLDLWPEWVRDYEEAARIRDLLKLFEEEEPVLRLRRLMKEAISSERFE---------------------------GIRV
        E                              DYEEAARIRD LKLFEEEEPVLRLRRLMKEAISSERFE                           GIRV
Subjt:  EKLGVKKLGEETFTRSLIPSFSLDLWPEWVRDYEEAARIRDLLKLFEEEEPVLRLRRLMKEAISSERFE---------------------------GIRV

Query:  QVRSVYIEGRSQPSKNQYFFAYRIRITNNSNRPVQLLRRHWIITDANGKTENVWGVGVIGEQPVILPRTGFEYSSACPLSTANGRM
        QVRSVYIEGRSQPSKNQYFFAYRIRITNNSNRPVQLLRRHWIITDANGKTENVWGVGVIGEQPVILP+TGFEYSSACPL+TANGRM
Subjt:  QVRSVYIEGRSQPSKNQYFFAYRIRITNNSNRPVQLLRRHWIITDANGKTENVWGVGVIGEQPVILPRTGFEYSSACPLSTANGRM

A0A6J1EAQ4 uncharacterized protein LOC1114314401.5e-10274.13Show/hide
Query:  MHSLSFKVVGDFNAGSRMNLPRRGSFCFPEMEVGAMRCFGGRRAFGRTCRIVACASERDGDGGGGQSQSQSASTSRSRSFLSRSETYALLKQQLEVAAKS
        MHSLSFKV+GDFNA SRMNLP RGSFC PEMEVGAMRCFG  RAFGRTCRIVACAS+R+GDGGGG  QSQS+STS SRSFLSRSETYALLKQQLEVAAKS
Subjt:  MHSLSFKVVGDFNAGSRMNLPRRGSFCFPEMEVGAMRCFGGRRAFGRTCRIVACASERDGDGGGGQSQSQSASTSRSRSFLSRSETYALLKQQLEVAAKS

Query:  EKLGVKKLGEETFTRSLIPSFSLDLWPEWVRDYEEAARIRDLLKLFEEEEPVLRLRRLMKEAISSERFE---------------------------GIRV
        E                              DYEEAARIRD LKLFEEEEPVLRLRRLMKEAISSERFE                           GIRV
Subjt:  EKLGVKKLGEETFTRSLIPSFSLDLWPEWVRDYEEAARIRDLLKLFEEEEPVLRLRRLMKEAISSERFE---------------------------GIRV

Query:  QVRSVYIEGRSQPSKNQYFFAYRIRITNNSNRPVQLLRRHWIITDANGKTENVWGVGVIGEQPVILPRTGFEYSSACPLSTANGRM
        QVRSVYIEGRSQPSKN YFFAYRIRITNNS+RP+QLLRRHWIITDANG+TENVWGVGVIGEQPVILPRTGFEYSSACPLSTANGRM
Subjt:  QVRSVYIEGRSQPSKNQYFFAYRIRITNNSNRPVQLLRRHWIITDANGKTENVWGVGVIGEQPVILPRTGFEYSSACPLSTANGRM

A0A6J1GRJ0 uncharacterized protein LOC1114568163.6e-10475.87Show/hide
Query:  MHSLSFKVVGDFNAGSRMNLPRRGSFCFPEMEVGAMRCFGGRRAFGRTCRIVACASERDGDGGGGQSQSQSASTSRSRSFLSRSETYALLKQQLEVAAKS
        MHSLSFKV GDFNAGSRMNLP RGSFC PEMEVG MRCFGGRRAFGR CRIVACASER+GDGGG  SQSQSASTSRSRSFLSRSETYALLKQQLEVAAKS
Subjt:  MHSLSFKVVGDFNAGSRMNLPRRGSFCFPEMEVGAMRCFGGRRAFGRTCRIVACASERDGDGGGGQSQSQSASTSRSRSFLSRSETYALLKQQLEVAAKS

Query:  EKLGVKKLGEETFTRSLIPSFSLDLWPEWVRDYEEAARIRDLLKLFEEEEPVLRLRRLMKEAISSERFE---------------------------GIRV
        E                              DYEEAARIRD LKLFEEEEPVLRLRRLMKEAISSERFE                           GIRV
Subjt:  EKLGVKKLGEETFTRSLIPSFSLDLWPEWVRDYEEAARIRDLLKLFEEEEPVLRLRRLMKEAISSERFE---------------------------GIRV

Query:  QVRSVYIEGRSQPSKNQYFFAYRIRITNNSNRPVQLLRRHWIITDANGKTENVWGVGVIGEQPVILPRTGFEYSSACPLSTANGRM
        QVRSVYIEGRSQPSKNQYFFAYRIRITNNSNRPVQLLRRHWIITDANGKTENV GVGVIGEQPVILP+TGFEYSSACPL+TANGRM
Subjt:  QVRSVYIEGRSQPSKNQYFFAYRIRITNNSNRPVQLLRRHWIITDANGKTENVWGVGVIGEQPVILPRTGFEYSSACPLSTANGRM

A0A6J1JZR7 uncharacterized protein LOC1114902992.8e-10174.48Show/hide
Query:  MHSLSFKVVGDFNAGSRMNLPRRGSFCFPEMEVGAMRCFGGRRAFGRTCRIVACASERDGDGGGGQSQSQSASTSRSRSFLSRSETYALLKQQLEVAAKS
        MHSLSFKV GDFNAGSRMNL  RGSFC PEMEVG +RCFGGRRAFGRTCRIVACA ER+GDGGG  SQSQSASTSRSRSFLSRSETYALLKQQLEVAAKS
Subjt:  MHSLSFKVVGDFNAGSRMNLPRRGSFCFPEMEVGAMRCFGGRRAFGRTCRIVACASERDGDGGGGQSQSQSASTSRSRSFLSRSETYALLKQQLEVAAKS

Query:  EKLGVKKLGEETFTRSLIPSFSLDLWPEWVRDYEEAARIRDLLKLFEEEEPVLRLRRLMKEAISSERFE---------------------------GIRV
        E                              DYEEAARIRD LKLFEEEEPVLRLRRL+KEAISSERFE                           GIRV
Subjt:  EKLGVKKLGEETFTRSLIPSFSLDLWPEWVRDYEEAARIRDLLKLFEEEEPVLRLRRLMKEAISSERFE---------------------------GIRV

Query:  QVRSVYIEGRSQPSKNQYFFAYRIRITNNSNRPVQLLRRHWIITDANGKTENVWGVGVIGEQPVILPRTGFEYSSACPLSTANGRM
        QVRSVYIEGRSQPSKNQYFFAYRIRITNNSNRPVQLLRRHWIITDANGKTENV GVGVIGEQPVILP+TGFEYSSAC L+TANGRM
Subjt:  QVRSVYIEGRSQPSKNQYFFAYRIRITNNSNRPVQLLRRHWIITDANGKTENVWGVGVIGEQPVILPRTGFEYSSACPLSTANGRM

A0A6J1KKQ3 uncharacterized protein LOC1114950613.9e-10374.48Show/hide
Query:  MHSLSFKVVGDFNAGSRMNLPRRGSFCFPEMEVGAMRCFGGRRAFGRTCRIVACASERDGDGGGGQSQSQSASTSRSRSFLSRSETYALLKQQLEVAAKS
        MHSLSFKV+GDFNA SRMNLP RGSFC PEMEVGAMRCFG  RAFGRTCRIVACASER+GDGGGG+ QSQS+STS SRSFLSRSETYALLKQQLEVAAKS
Subjt:  MHSLSFKVVGDFNAGSRMNLPRRGSFCFPEMEVGAMRCFGGRRAFGRTCRIVACASERDGDGGGGQSQSQSASTSRSRSFLSRSETYALLKQQLEVAAKS

Query:  EKLGVKKLGEETFTRSLIPSFSLDLWPEWVRDYEEAARIRDLLKLFEEEEPVLRLRRLMKEAISSERFE---------------------------GIRV
        E                              DYEEAARIRD LKLFEEEEPVLRLRRLMKEAISSERFE                           GIRV
Subjt:  EKLGVKKLGEETFTRSLIPSFSLDLWPEWVRDYEEAARIRDLLKLFEEEEPVLRLRRLMKEAISSERFE---------------------------GIRV

Query:  QVRSVYIEGRSQPSKNQYFFAYRIRITNNSNRPVQLLRRHWIITDANGKTENVWGVGVIGEQPVILPRTGFEYSSACPLSTANGRM
        QVRSVYIEGRSQPSKN YFFAYRIRITNNS+RP+QLLRRHWIITDANG+TENVWGVGVIGEQPVILPRTGFEYSSACPLSTANGRM
Subjt:  QVRSVYIEGRSQPSKNQYFFAYRIRITNNSNRPVQLLRRHWIITDANGKTENVWGVGVIGEQPVILPRTGFEYSSACPLSTANGRM

SwissProt top hitse value%identityAlignment
A6WVX4 Protein ApaG2.1e-2152.75Show/hide
Query:  GIRVQVRSVYIEGRSQPSKNQYFFAYRIRITNNSNRPVQLLRRHWIITDANGKTENVWGVGVIGEQPVILPRTGFEYSSACPLSTANGRMI
        GI V V   Y+E +S+P +N+Y + YRI I NNS   VQL  R+W ITDANG  E V G GV+GEQP + P   F+YSS CPL+T +G M+
Subjt:  GIRVQVRSVYIEGRSQPSKNQYFFAYRIRITNNSNRPVQLLRRHWIITDANGKTENVWGVGVIGEQPVILPRTGFEYSSACPLSTANGRMI

A9M7Z1 Protein ApaG4.7e-2150.55Show/hide
Query:  GIRVQVRSVYIEGRSQPSKNQYFFAYRIRITNNSNRPVQLLRRHWIITDANGKTENVWGVGVIGEQPVILPRTGFEYSSACPLSTANGRMI
        GI V V   Y+E +S+P +N+Y + YR+ I NNS+  VQL  R+W ITDANG  + V G GV+GEQPV+ P   ++YSS CPL+T++G M+
Subjt:  GIRVQVRSVYIEGRSQPSKNQYFFAYRIRITNNSNRPVQLLRRHWIITDANGKTENVWGVGVIGEQPVILPRTGFEYSSACPLSTANGRMI

B0CJT2 Protein ApaG4.7e-2150.55Show/hide
Query:  GIRVQVRSVYIEGRSQPSKNQYFFAYRIRITNNSNRPVQLLRRHWIITDANGKTENVWGVGVIGEQPVILPRTGFEYSSACPLSTANGRMI
        GI V V   Y+E +S+P +N+Y + YR+ I NNS+  VQL  R+W ITDANG  + V G GV+GEQPV+ P   ++YSS CPL+T++G M+
Subjt:  GIRVQVRSVYIEGRSQPSKNQYFFAYRIRITNNSNRPVQLLRRHWIITDANGKTENVWGVGVIGEQPVILPRTGFEYSSACPLSTANGRMI

Q2VZE7 Protein ApaG1.1e-2047.92Show/hide
Query:  SERFEGIRVQVRSVYIEGRSQPSKNQYFFAYRIRITNNSNRPVQLLRRHWIITDANGKTENVWGVGVIGEQPVILPRTGFEYSSACPLSTANGRMI
        S+    I V V+  Y++ +S P  N + +AYR+RI N  +R VQLLRRHW+ITDA G+ + V G GV+GEQPV+ P   +EY+S  PL T +G M+
Subjt:  SERFEGIRVQVRSVYIEGRSQPSKNQYFFAYRIRITNNSNRPVQLLRRHWIITDANGKTENVWGVGVIGEQPVILPRTGFEYSSACPLSTANGRMI

Q8G2L5 Protein ApaG3.6e-2150.55Show/hide
Query:  GIRVQVRSVYIEGRSQPSKNQYFFAYRIRITNNSNRPVQLLRRHWIITDANGKTENVWGVGVIGEQPVILPRTGFEYSSACPLSTANGRMI
        GI V V   Y+E +S+P +N+Y + YR+ I NNS+  VQL  R+W ITDANG  + V G GV+GEQPV+ P   ++YSS CPL+T++G M+
Subjt:  GIRVQVRSVYIEGRSQPSKNQYFFAYRIRITNNSNRPVQLLRRHWIITDANGKTENVWGVGVIGEQPVILPRTGFEYSSACPLSTANGRMI

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGATGACAATACCATCAAGGACCACATTGCATCTGACACGTGTAAATTTGTCTTCCACTCCCAAATTGTCGCGTACAAGGTTTGGTTATATGTGGAAGAAGAAGA
ACAGAGCGTCTTCACCGCCAACACGGAATTCCAACTATCACTTCATGTGACATTGAAGCCTATTCAGTCGTTTTTGTACAATTTTATCAGATCAGATAGAGTATCTCTTG
ATTTGAACAGTGAGAAATTTAGCAGAATTTTGGAAGAATACGGGAGGATGCATTCGTTGAGCTTCAAGGTTGTAGGTGATTTTAATGCGGGATCGAGGATGAATTTACCT
AGACGAGGAAGCTTTTGCTTTCCGGAAATGGAGGTCGGGGCAATGAGGTGTTTTGGGGGCCGTAGAGCATTTGGAAGAACTTGTAGGATTGTAGCATGTGCTTCGGAGAG
AGATGGTGATGGTGGTGGCGGACAGAGCCAGAGCCAGAGCGCAAGTACGAGTCGGAGTCGTTCGTTTCTCTCCCGTAGTGAAACTTATGCACTACTGAAGCAGCAATTGG
AGGTTGCCGCCAAGTCCGAGAAATTGGGGGTCAAGAAATTGGGGGAAGAGACATTCACTCGTTCCCTCATTCCCTCATTTTCCCTTGATTTATGGCCTGAATGGGTTCGA
GATTATGAAGAAGCTGCAAGGATACGGGACTTGTTAAAATTATTTGAAGAGGAAGAGCCAGTTTTGCGTCTGCGAAGACTGATGAAGGAGGCTATTTCTAGTGAGAGGTT
TGAGGGTATCAGGGTACAAGTTAGGAGCGTTTACATAGAAGGCCGAAGCCAACCTTCGAAGAATCAGTACTTTTTTGCATACCGAATAAGAATAACCAATAATTCAAACC
GTCCAGTTCAACTTCTCAGAAGACATTGGATTATCACCGATGCAAATGGGAAAACAGAAAATGTCTGGGGCGTTGGTGTTATTGGTGAACAACCAGTTATACTTCCTAGG
ACTGGGTTTGAATATTCATCAGCATGCCCATTAAGTACTGCTAACGGCAGAATGATATTCTTTCGTCGGGGATGCAGAAGACCAAAGCTTGCCAACAAATCCATATTATA
TTCTCAAATGAATTTGGAAATGTTAGGAGCTTCTTATCTTTTTACTCCGAAGAGAGTTGTTGTAAAGATCAATGTGCCTTGCTTGGAAATCTTGAGAGTTCGATATGGGA
TTGGAGCTCTCTTTAGAGTCGTCTTGACCAGGGCAAGGCGGCAGGAGTGGATGATCAATGCAGCATTCATTCCTTGCACTGCACTACAATCTCTAGAGGTTGAAGACCCA
AAAAAGGGGGAAGAAAAGCAACATGGAGAACAGCAAGATGGAAGATGGACCTTGGAGTTTGCTAACAGAGTCAATAAATTGGCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAGATGACAATACCATCAAGGACCACATTGCATCTGACACGTGTAAATTTGTCTTCCACTCCCAAATTGTCGCGTACAAGGTTTGGTTATATGTGGAAGAAGAAGA
ACAGAGCGTCTTCACCGCCAACACGGAATTCCAACTATCACTTCATGTGACATTGAAGCCTATTCAGTCGTTTTTGTACAATTTTATCAGATCAGATAGAGTATCTCTTG
ATTTGAACAGTGAGAAATTTAGCAGAATTTTGGAAGAATACGGGAGGATGCATTCGTTGAGCTTCAAGGTTGTAGGTGATTTTAATGCGGGATCGAGGATGAATTTACCT
AGACGAGGAAGCTTTTGCTTTCCGGAAATGGAGGTCGGGGCAATGAGGTGTTTTGGGGGCCGTAGAGCATTTGGAAGAACTTGTAGGATTGTAGCATGTGCTTCGGAGAG
AGATGGTGATGGTGGTGGCGGACAGAGCCAGAGCCAGAGCGCAAGTACGAGTCGGAGTCGTTCGTTTCTCTCCCGTAGTGAAACTTATGCACTACTGAAGCAGCAATTGG
AGGTTGCCGCCAAGTCCGAGAAATTGGGGGTCAAGAAATTGGGGGAAGAGACATTCACTCGTTCCCTCATTCCCTCATTTTCCCTTGATTTATGGCCTGAATGGGTTCGA
GATTATGAAGAAGCTGCAAGGATACGGGACTTGTTAAAATTATTTGAAGAGGAAGAGCCAGTTTTGCGTCTGCGAAGACTGATGAAGGAGGCTATTTCTAGTGAGAGGTT
TGAGGGTATCAGGGTACAAGTTAGGAGCGTTTACATAGAAGGCCGAAGCCAACCTTCGAAGAATCAGTACTTTTTTGCATACCGAATAAGAATAACCAATAATTCAAACC
GTCCAGTTCAACTTCTCAGAAGACATTGGATTATCACCGATGCAAATGGGAAAACAGAAAATGTCTGGGGCGTTGGTGTTATTGGTGAACAACCAGTTATACTTCCTAGG
ACTGGGTTTGAATATTCATCAGCATGCCCATTAAGTACTGCTAACGGCAGAATGATATTCTTTCGTCGGGGATGCAGAAGACCAAAGCTTGCCAACAAATCCATATTATA
TTCTCAAATGAATTTGGAAATGTTAGGAGCTTCTTATCTTTTTACTCCGAAGAGAGTTGTTGTAAAGATCAATGTGCCTTGCTTGGAAATCTTGAGAGTTCGATATGGGA
TTGGAGCTCTCTTTAGAGTCGTCTTGACCAGGGCAAGGCGGCAGGAGTGGATGATCAATGCAGCATTCATTCCTTGCACTGCACTACAATCTCTAGAGGTTGAAGACCCA
AAAAAGGGGGAAGAAAAGCAACATGGAGAACAGCAAGATGGAAGATGGACCTTGGAGTTTGCTAACAGAGTCAATAAATTGGCTTGA
Protein sequenceShow/hide protein sequence
MGDDNTIKDHIASDTCKFVFHSQIVAYKVWLYVEEEEQSVFTANTEFQLSLHVTLKPIQSFLYNFIRSDRVSLDLNSEKFSRILEEYGRMHSLSFKVVGDFNAGSRMNLP
RRGSFCFPEMEVGAMRCFGGRRAFGRTCRIVACASERDGDGGGGQSQSQSASTSRSRSFLSRSETYALLKQQLEVAAKSEKLGVKKLGEETFTRSLIPSFSLDLWPEWVR
DYEEAARIRDLLKLFEEEEPVLRLRRLMKEAISSERFEGIRVQVRSVYIEGRSQPSKNQYFFAYRIRITNNSNRPVQLLRRHWIITDANGKTENVWGVGVIGEQPVILPR
TGFEYSSACPLSTANGRMIFFRRGCRRPKLANKSILYSQMNLEMLGASYLFTPKRVVVKINVPCLEILRVRYGIGALFRVVLTRARRQEWMINAAFIPCTALQSLEVEDP
KKGEEKQHGEQQDGRWTLEFANRVNKLA