; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0010217 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0010217
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionDUF1279 domain-containing protein
Genome locationchr9:45484676..45485990
RNA-Seq ExpressionLag0010217
SyntenyLag0010217
Gene Ontology termsNA
InterPro domainsIPR009688 - Protein FAM210A/B-like domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6570471.1 Family With Sequence Similarity 210 Member B, mitochondrial, partial [Cucurbita argyrosperma subsp. sororia]1.3e-6860.79Show/hide
Query:  MATASSVICFSSSFISSPASFTDGNFRFHLKFSNRPPKFNSKRFRVQSLKEKTGEIERPSP--AASSSSADEVTKKYGLEAGLWKVPFSSFSSYFICREV
        MATA S ICFSSSFISSPASF D NFRF    S RPPKFN KR RVQ+LKEKTGEIERPSP  + SSSSADEVTKKYGLEAGLWK               
Subjt:  MATASSVICFSSSFISSPASFTDGNFRFHLKFSNRPPKFNSKRFRVQSLKEKTGEIERPSP--AASSSSADEVTKKYGLEAGLWKVPFSSFSSYFICREV

Query:  DWNLEELHIPCYGGIGGFKYIDSRHLLFEGSFKKHKFNSVCLIVVAKGCFNCSYFVQNFQVMQIFSSKEEGEEGEGKNKSKGDQAKELLAKYGGAYLATS
                                                                       IFSSK   EEGEGK+KSKGDQAKELLAKYGGAYLATS
Subjt:  DWNLEELHIPCYGGIGGFKYIDSRHLLFEGSFKKHKFNSVCLIVVAKGCFNCSYFVQNFQVMQIFSSKEEGEEGEGKNKSKGDQAKELLAKYGGAYLATS

Query:  ITLSLISFSLCYALISAGVDVQSLLQKVGISTDETGEKVGTFALAYAAHKAASPIRFPPTVALTPIVASWIGKKVEKE
        ITLSLISF+ CYALISAGVDVQ+LLQKVGIS D  GEKVGTFALAYAAHKA SPIRFPPTVALT +VA+WIGKKVE+E
Subjt:  ITLSLISFSLCYALISAGVDVQSLLQKVGISTDETGEKVGTFALAYAAHKAASPIRFPPTVALTPIVASWIGKKVEKE

XP_022152952.1 uncharacterized protein LOC111020570 isoform X1 [Momordica charantia]1.2e-6960.73Show/hide
Query:  SSVICFSSSFISSPASFTDGNFRFHLKFSNRPPKFNSK--RFRVQSLKEKTGEIERPSPAASSSSADEVTKKYGLEAGLWKVPFSSFSSYFICREVDWNL
        +S+ICFSS FISSPASF DGNFRFHLK  NR PKFNS+  RFRV++LKEKTGEIERPSP       DEVTKKYGLEAGLWK                   
Subjt:  SSVICFSSSFISSPASFTDGNFRFHLKFSNRPPKFNSK--RFRVQSLKEKTGEIERPSPAASSSSADEVTKKYGLEAGLWKVPFSSFSSYFICREVDWNL

Query:  EELHIPCYGGIGGFKYIDSRHLLFEGSFKKHKFNSVCLIVVAKGCFNCSYFVQNFQVMQIFSSKEEGEEGEGKNKSKGDQAKELLAKYGGAYLATSITLS
                                                                   IFSSKEEGE   GK KSKG+QAKELLAKYGGAYLATSITLS
Subjt:  EELHIPCYGGIGGFKYIDSRHLLFEGSFKKHKFNSVCLIVVAKGCFNCSYFVQNFQVMQIFSSKEEGEEGEGKNKSKGDQAKELLAKYGGAYLATSITLS

Query:  LISFSLCYALISAGVDVQSLLQKVGISTDETGEKVGTFALAYAAHKAASPIRFPPTVALTPIVASWIGKKVEKEK
        LISFSLCYALISAG+DVQ+LLQKVGIS +ETGEKVGTFALAYAAHKAASPIRFPPTVALTPIVASWIGKKVEK++
Subjt:  LISFSLCYALISAGVDVQSLLQKVGISTDETGEKVGTFALAYAAHKAASPIRFPPTVALTPIVASWIGKKVEKEK

XP_022943302.1 uncharacterized protein LOC111448115 [Cucurbita moschata]7.7e-6960.79Show/hide
Query:  MATASSVICFSSSFISSPASFTDGNFRFHLKFSNRPPKFNSKRFRVQSLKEKTGEIERPSP--AASSSSADEVTKKYGLEAGLWKVPFSSFSSYFICREV
        MATA S ICFSSSFISSPASF D NFRF    S RPPKFN KR RVQ+LKEKTGEIERPSP  + SSSSADEVTKKYGLEAGLWK               
Subjt:  MATASSVICFSSSFISSPASFTDGNFRFHLKFSNRPPKFNSKRFRVQSLKEKTGEIERPSP--AASSSSADEVTKKYGLEAGLWKVPFSSFSSYFICREV

Query:  DWNLEELHIPCYGGIGGFKYIDSRHLLFEGSFKKHKFNSVCLIVVAKGCFNCSYFVQNFQVMQIFSSKEEGEEGEGKNKSKGDQAKELLAKYGGAYLATS
                                                                       IFSSK   EEGEGK+KSKGDQAKELLAKYGGAYLATS
Subjt:  DWNLEELHIPCYGGIGGFKYIDSRHLLFEGSFKKHKFNSVCLIVVAKGCFNCSYFVQNFQVMQIFSSKEEGEEGEGKNKSKGDQAKELLAKYGGAYLATS

Query:  ITLSLISFSLCYALISAGVDVQSLLQKVGISTDETGEKVGTFALAYAAHKAASPIRFPPTVALTPIVASWIGKKVEKE
        ITLSLISF++CYALISAGVDVQ+LLQKVGIS D  GEKVGTFALAYAAHKA SPIRFPPTVALT +VA+WIGKKVE+E
Subjt:  ITLSLISFSLCYALISAGVDVQSLLQKVGISTDETGEKVGTFALAYAAHKAASPIRFPPTVALTPIVASWIGKKVEKE

XP_023512163.1 uncharacterized protein LOC111776967 [Cucurbita pepo subsp. pepo]2.3e-6860.43Show/hide
Query:  MATASSVICFSSSFISSPASFTDGNFRFHLKFSNRPPKFNSKRFRVQSLKEKTGEIERPSP--AASSSSADEVTKKYGLEAGLWKVPFSSFSSYFICREV
        MATA S ICFSSSFISSPASF D NFRF    S +PPKFN KR RVQ+LKEKTGEIERPSP  + SSSSADEVTKKYGLEAGLWK               
Subjt:  MATASSVICFSSSFISSPASFTDGNFRFHLKFSNRPPKFNSKRFRVQSLKEKTGEIERPSP--AASSSSADEVTKKYGLEAGLWKVPFSSFSSYFICREV

Query:  DWNLEELHIPCYGGIGGFKYIDSRHLLFEGSFKKHKFNSVCLIVVAKGCFNCSYFVQNFQVMQIFSSKEEGEEGEGKNKSKGDQAKELLAKYGGAYLATS
                                                                       IFSSK   EEGEGK+KSKGDQAKELLAKYGGAYLATS
Subjt:  DWNLEELHIPCYGGIGGFKYIDSRHLLFEGSFKKHKFNSVCLIVVAKGCFNCSYFVQNFQVMQIFSSKEEGEEGEGKNKSKGDQAKELLAKYGGAYLATS

Query:  ITLSLISFSLCYALISAGVDVQSLLQKVGISTDETGEKVGTFALAYAAHKAASPIRFPPTVALTPIVASWIGKKVEKE
        ITLSLISF++CYALISAGVDVQ+LLQKVGIS D  GEKVGTFALAYAAHKA SPIRFPPTVALT +VA+WIGKKVE+E
Subjt:  ITLSLISFSLCYALISAGVDVQSLLQKVGISTDETGEKVGTFALAYAAHKAASPIRFPPTVALTPIVASWIGKKVEKE

XP_038901803.1 uncharacterized protein LOC120088508 [Benincasa hispida]1.1e-7062.14Show/hide
Query:  MATASSVICFSSSFISSPASFTDGNFRFHLKFSNRPPKFNSKRFRVQSLKEKTGEIERP----SPAASSSSADEVTKKYGLEAGLWKVPFSSFSSYFICR
        MATA S+I FSSS I    SF D NFRF   F N P KFNSKRFRVQ+LK+KTGEIERP    S ++SSSSADEVTKKYGLEAGLWK             
Subjt:  MATASSVICFSSSFISSPASFTDGNFRFHLKFSNRPPKFNSKRFRVQSLKEKTGEIERP----SPAASSSSADEVTKKYGLEAGLWKVPFSSFSSYFICR

Query:  EVDWNLEELHIPCYGGIGGFKYIDSRHLLFEGSFKKHKFNSVCLIVVAKGCFNCSYFVQNFQVMQIFSSKEEGEEGEGKNKSKGDQAKELLAKYGGAYLA
                                                                         IFSSKEE EEGEGKNKSKGDQAKELLAKYGGAYLA
Subjt:  EVDWNLEELHIPCYGGIGGFKYIDSRHLLFEGSFKKHKFNSVCLIVVAKGCFNCSYFVQNFQVMQIFSSKEEGEEGEGKNKSKGDQAKELLAKYGGAYLA

Query:  TSITLSLISFSLCYALISAGVDVQSLLQKVGISTDETGEKVGTFALAYAAHKAASPIRFPPTVALTPIVASWIGKKVEKE
        TSITLSLISFSLCYALISAGVDVQ+LLQKVGIS DETGEKVGTFALAYAAHKAASPIRFPPTVALTPIVASWIGKKVEKE
Subjt:  TSITLSLISFSLCYALISAGVDVQSLLQKVGISTDETGEKVGTFALAYAAHKAASPIRFPPTVALTPIVASWIGKKVEKE

TrEMBL top hitse value%identityAlignment
A0A0A0KEW3 DUF1279 domain-containing protein3.2e-6862.32Show/hide
Query:  MATASSVICFSSSFISSPASFTDGNFRFHLKFSNRPPKFNSKRFRVQSLKEKTGEIERPSPAASSSSADEVTKKYGLEAGLWKVPFSSFSSYFICREVDW
        MA ASS+I FSS   SS  SFT  NFRFH  F N P KFNSK FRVQ+LK+KTGEIERPSP +SSSSADEVTKKYGLEAGLWK                 
Subjt:  MATASSVICFSSSFISSPASFTDGNFRFHLKFSNRPPKFNSKRFRVQSLKEKTGEIERPSPAASSSSADEVTKKYGLEAGLWKVPFSSFSSYFICREVDW

Query:  NLEELHIPCYGGIGGFKYIDSRHLLFEGSFKKHKFNSVCLIVVAKGCFNCSYFVQNFQVMQIFSSKEEGEEGEGKNKSKGDQAKELLAKYGGAYLATSIT
                                                                     IFSSK   EEGEG NKSKGDQAKELLAKYGGAYLATSIT
Subjt:  NLEELHIPCYGGIGGFKYIDSRHLLFEGSFKKHKFNSVCLIVVAKGCFNCSYFVQNFQVMQIFSSKEEGEEGEGKNKSKGDQAKELLAKYGGAYLATSIT

Query:  LSLISFSLCYALISAGVDVQSLLQKVGISTDETGEKVGTFALAYAAHKAASPIRFPPTVALTPIVASWIGKKVEKE
        LSLISFSLCYALISAGVDVQ LLQKVGIS DETGEKVGTFALAYAAHKAASPIRFPPTVALTPIVASWIGKKVEKE
Subjt:  LSLISFSLCYALISAGVDVQSLLQKVGISTDETGEKVGTFALAYAAHKAASPIRFPPTVALTPIVASWIGKKVEKE

A0A5D3B9U1 DUF1279 domain-containing protein4.1e-6861.96Show/hide
Query:  MATASSVICFSSSFISSPASFTDGNFRFHLKFSNRPPKFNSKRFRVQSLKEKTGEIERPSPAASSSSADEVTKKYGLEAGLWKVPFSSFSSYFICREVDW
        MATASS+I FSS    S  SFT  NFRFH  F N P KFNSK FRVQ+LK+KTGEI+RPSP +SSSSADEVTKKYGLEAGLWK                 
Subjt:  MATASSVICFSSSFISSPASFTDGNFRFHLKFSNRPPKFNSKRFRVQSLKEKTGEIERPSPAASSSSADEVTKKYGLEAGLWKVPFSSFSSYFICREVDW

Query:  NLEELHIPCYGGIGGFKYIDSRHLLFEGSFKKHKFNSVCLIVVAKGCFNCSYFVQNFQVMQIFSSKEEGEEGEGKNKSKGDQAKELLAKYGGAYLATSIT
                                                                     IFSSK   EEGEG NKSKGDQAKELLAKYGGAYLATSIT
Subjt:  NLEELHIPCYGGIGGFKYIDSRHLLFEGSFKKHKFNSVCLIVVAKGCFNCSYFVQNFQVMQIFSSKEEGEEGEGKNKSKGDQAKELLAKYGGAYLATSIT

Query:  LSLISFSLCYALISAGVDVQSLLQKVGISTDETGEKVGTFALAYAAHKAASPIRFPPTVALTPIVASWIGKKVEKE
        LSLISFSLCYALISAGVDVQ+LLQKVGISTDETG KVGTFALAYAAHKAASPIRFPPTVALTPIVASWIGKKVEKE
Subjt:  LSLISFSLCYALISAGVDVQSLLQKVGISTDETGEKVGTFALAYAAHKAASPIRFPPTVALTPIVASWIGKKVEKE

A0A6J1DHN1 uncharacterized protein LOC111020570 isoform X15.8e-7060.73Show/hide
Query:  SSVICFSSSFISSPASFTDGNFRFHLKFSNRPPKFNSK--RFRVQSLKEKTGEIERPSPAASSSSADEVTKKYGLEAGLWKVPFSSFSSYFICREVDWNL
        +S+ICFSS FISSPASF DGNFRFHLK  NR PKFNS+  RFRV++LKEKTGEIERPSP       DEVTKKYGLEAGLWK                   
Subjt:  SSVICFSSSFISSPASFTDGNFRFHLKFSNRPPKFNSK--RFRVQSLKEKTGEIERPSPAASSSSADEVTKKYGLEAGLWKVPFSSFSSYFICREVDWNL

Query:  EELHIPCYGGIGGFKYIDSRHLLFEGSFKKHKFNSVCLIVVAKGCFNCSYFVQNFQVMQIFSSKEEGEEGEGKNKSKGDQAKELLAKYGGAYLATSITLS
                                                                   IFSSKEEGE   GK KSKG+QAKELLAKYGGAYLATSITLS
Subjt:  EELHIPCYGGIGGFKYIDSRHLLFEGSFKKHKFNSVCLIVVAKGCFNCSYFVQNFQVMQIFSSKEEGEEGEGKNKSKGDQAKELLAKYGGAYLATSITLS

Query:  LISFSLCYALISAGVDVQSLLQKVGISTDETGEKVGTFALAYAAHKAASPIRFPPTVALTPIVASWIGKKVEKEK
        LISFSLCYALISAG+DVQ+LLQKVGIS +ETGEKVGTFALAYAAHKAASPIRFPPTVALTPIVASWIGKKVEK++
Subjt:  LISFSLCYALISAGVDVQSLLQKVGISTDETGEKVGTFALAYAAHKAASPIRFPPTVALTPIVASWIGKKVEKEK

A0A6J1FSN5 uncharacterized protein LOC1114481153.7e-6960.79Show/hide
Query:  MATASSVICFSSSFISSPASFTDGNFRFHLKFSNRPPKFNSKRFRVQSLKEKTGEIERPSP--AASSSSADEVTKKYGLEAGLWKVPFSSFSSYFICREV
        MATA S ICFSSSFISSPASF D NFRF    S RPPKFN KR RVQ+LKEKTGEIERPSP  + SSSSADEVTKKYGLEAGLWK               
Subjt:  MATASSVICFSSSFISSPASFTDGNFRFHLKFSNRPPKFNSKRFRVQSLKEKTGEIERPSP--AASSSSADEVTKKYGLEAGLWKVPFSSFSSYFICREV

Query:  DWNLEELHIPCYGGIGGFKYIDSRHLLFEGSFKKHKFNSVCLIVVAKGCFNCSYFVQNFQVMQIFSSKEEGEEGEGKNKSKGDQAKELLAKYGGAYLATS
                                                                       IFSSK   EEGEGK+KSKGDQAKELLAKYGGAYLATS
Subjt:  DWNLEELHIPCYGGIGGFKYIDSRHLLFEGSFKKHKFNSVCLIVVAKGCFNCSYFVQNFQVMQIFSSKEEGEEGEGKNKSKGDQAKELLAKYGGAYLATS

Query:  ITLSLISFSLCYALISAGVDVQSLLQKVGISTDETGEKVGTFALAYAAHKAASPIRFPPTVALTPIVASWIGKKVEKE
        ITLSLISF++CYALISAGVDVQ+LLQKVGIS D  GEKVGTFALAYAAHKA SPIRFPPTVALT +VA+WIGKKVE+E
Subjt:  ITLSLISFSLCYALISAGVDVQSLLQKVGISTDETGEKVGTFALAYAAHKAASPIRFPPTVALTPIVASWIGKKVEKE

A0A6J1JG12 uncharacterized protein LOC1114841823.7e-6960.79Show/hide
Query:  MATASSVICFSSSFISSPASFTDGNFRFHLKFSNRPPKFNSKRFRVQSLKEKTGEIERPSP--AASSSSADEVTKKYGLEAGLWKVPFSSFSSYFICREV
        MATA S ICFSSSFISSPASF D NFRF    S RPPKFN KR RVQ+LKEKTGEIERPSP  + SSSSADEVTKKYGLEAGLWK               
Subjt:  MATASSVICFSSSFISSPASFTDGNFRFHLKFSNRPPKFNSKRFRVQSLKEKTGEIERPSP--AASSSSADEVTKKYGLEAGLWKVPFSSFSSYFICREV

Query:  DWNLEELHIPCYGGIGGFKYIDSRHLLFEGSFKKHKFNSVCLIVVAKGCFNCSYFVQNFQVMQIFSSKEEGEEGEGKNKSKGDQAKELLAKYGGAYLATS
                                                                       IFSSK   EEGEGK+KSKGDQAKELLAKYGGAYLATS
Subjt:  DWNLEELHIPCYGGIGGFKYIDSRHLLFEGSFKKHKFNSVCLIVVAKGCFNCSYFVQNFQVMQIFSSKEEGEEGEGKNKSKGDQAKELLAKYGGAYLATS

Query:  ITLSLISFSLCYALISAGVDVQSLLQKVGISTDETGEKVGTFALAYAAHKAASPIRFPPTVALTPIVASWIGKKVEKE
        ITLSLISF++CYALISAGVDVQ+LLQKVGIS D  GEKVGTFALAYAAHKA SPIRFPPTVALT +VA+WIGKKVE+E
Subjt:  ITLSLISFSLCYALISAGVDVQSLLQKVGISTDETGEKVGTFALAYAAHKAASPIRFPPTVALTPIVASWIGKKVEKE

SwissProt top hitse value%identityAlignment
Q96KR6 Protein FAM210B, mitochondrial4.6e-0833.01Show/hide
Query:  EGKNKSKGDQAKELLAKYGGAYLATSITLSLISFSLCYALISAGVDVQSLLQKVGISTDETGEKV----GTFALAYAAHKAASPIRFPPTVALTPIVASW
        E K +SK  Q K++  +YG   ++  I +SLIS  + Y ++S+GVD+ ++L K+G        K+     TF +AYA HK  +P+R   T+   P++  +
Subjt:  EGKNKSKGDQAKELLAKYGGAYLATSITLSLISFSLCYALISAGVDVQSLLQKVGISTDETGEKV----GTFALAYAAHKAASPIRFPPTVALTPIVASW

Query:  IGK
          K
Subjt:  IGK

Q9D8B6 Protein FAM210B, mitochondrial1.1e-0630.69Show/hide
Query:  GEGKNKSKGDQAKELLAKYGGAYLATSITLSLISFSLCYALISAGVDVQSLLQKVGISTDETGEKV----GTFALAYAAHKAASPIRFPPTVALTPIVAS
        G  K  S+  Q K++  +YG   ++  I +SL+S  + Y ++S+G+D+ ++L K+G        K+     TF +AYA HK  +P+R   T+   P V  
Subjt:  GEGKNKSKGDQAKELLAKYGGAYLATSITLSLISFSLCYALISAGVDVQSLLQKVGISTDETGEKV----GTFALAYAAHKAASPIRFPPTVALTPIVAS

Query:  W
        +
Subjt:  W

Arabidopsis top hitse value%identityAlignment
AT2G20940.1 Protein of unknown function (DUF1279)1.9e-0430.77Show/hide
Query:  KELLAKYGGAYLATSITLSLISFSLCYALISAGVDVQSLLQK---------------VGISTDETGEKV-------------GTFALAYAAHKAASPIRF
        KE++ KYG   L    ++S +S S  Y  I   VDV+SLL+K               + +  +E G                G  ALA   +KA  PIR 
Subjt:  KELLAKYGGAYLATSITLSLISFSLCYALISAGVDVQSLLQK---------------VGISTDETGEKV-------------GTFALAYAAHKAASPIRF

Query:  PPTVALTPIVASWIGKK
        P T+ALTP +A ++ ++
Subjt:  PPTVALTPIVASWIGKK

AT2G27290.1 Protein of unknown function (DUF1279)3.6e-4845.66Show/hide
Query:  PASFTDGNFRFHLKFSNRPPKFNSKRFRVQSLKEKTGEIER----PSPAASSSSADEVTKKYGLEAGLWKVPFSSFSSYFICREVDWNLEELHIPCYGGI
        P+S         ++F +     +SK+FR ++++EK  +I++    PS +    SA+EVTKKYGLE GLWK+                             
Subjt:  PASFTDGNFRFHLKFSNRPPKFNSKRFRVQSLKEKTGEIER----PSPAASSSSADEVTKKYGLEAGLWKVPFSSFSSYFICREVDWNLEELHIPCYGGI

Query:  GGFKYIDSRHLLFEGSFKKHKFNSVCLIVVAKGCFNCSYFVQNFQVMQIFSSKEEGEEGEG-KNKSKGDQAKELLAKYGGAYLATSITLSLISFSLCYAL
                                                        + S  +EG +G+  K KSK D+AKELLAKYGGAYLATSITLSLISFSLCY L
Subjt:  GGFKYIDSRHLLFEGSFKKHKFNSVCLIVVAKGCFNCSYFVQNFQVMQIFSSKEEGEEGEG-KNKSKGDQAKELLAKYGGAYLATSITLSLISFSLCYAL

Query:  ISAGVDVQSLLQKVGISTDETGEKVGTFALAYAAHKAASPIRFPPTVALTPIVASWIGKKVEKEK
        +++GVDVQ+LL KVGIST+ETGEKVG FALAYAAHKAASPIRFPPTVALTPIVA+WIGKKV+KEK
Subjt:  ISAGVDVQSLLQKVGISTDETGEKVGTFALAYAAHKAASPIRFPPTVALTPIVASWIGKKVEKEK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGACAGCTTCCTCTGTAATTTGTTTTTCATCTTCTTTCATTTCCTCTCCTGCTTCCTTCACTGATGGCAATTTCAGATTTCACTTGAAATTTTCTAATCGACCCCC
AAAGTTCAATTCGAAACGATTCAGGGTTCAATCCCTTAAAGAGAAAACAGGAGAAATCGAACGCCCATCTCCAGCAGCTTCGTCTTCTTCAGCAGATGAAGTTACCAAGA
AGTACGGACTTGAAGCTGGTCTTTGGAAGGTACCCTTCTCTTCGTTTTCTTCTTATTTTATCTGCCGTGAAGTTGATTGGAACTTGGAGGAGCTTCATATTCCTTGTTAT
GGTGGCATAGGAGGTTTCAAGTACATTGATAGCAGACATTTGCTCTTTGAAGGGAGCTTTAAGAAGCATAAGTTTAATTCTGTCTGTCTGATTGTTGTGGCAAAGGGCTG
TTTCAATTGTTCATATTTTGTTCAGAACTTCCAAGTTATGCAGATATTTAGTTCAAAGGAGGAGGGTGAGGAAGGTGAAGGGAAAAACAAATCAAAGGGTGATCAGGCAA
AAGAGCTGCTTGCAAAATATGGAGGTGCATATTTAGCCACTTCCATTACTCTCTCCCTGATCTCTTTCTCTCTGTGTTACGCGCTCATCAGTGCCGGTGTCGACGTTCAG
TCTCTTCTGCAGAAGGTGGGAATTTCGACTGATGAGACTGGAGAGAAAGTTGGAACGTTTGCTTTGGCATATGCAGCACATAAAGCTGCTTCTCCAATAAGGTTTCCTCC
AACAGTAGCCCTCACTCCAATCGTTGCAAGTTGGATAGGGAAGAAAGTTGAGAAAGAGAAATAG
mRNA sequenceShow/hide mRNA sequence
ATGGCGACAGCTTCCTCTGTAATTTGTTTTTCATCTTCTTTCATTTCCTCTCCTGCTTCCTTCACTGATGGCAATTTCAGATTTCACTTGAAATTTTCTAATCGACCCCC
AAAGTTCAATTCGAAACGATTCAGGGTTCAATCCCTTAAAGAGAAAACAGGAGAAATCGAACGCCCATCTCCAGCAGCTTCGTCTTCTTCAGCAGATGAAGTTACCAAGA
AGTACGGACTTGAAGCTGGTCTTTGGAAGGTACCCTTCTCTTCGTTTTCTTCTTATTTTATCTGCCGTGAAGTTGATTGGAACTTGGAGGAGCTTCATATTCCTTGTTAT
GGTGGCATAGGAGGTTTCAAGTACATTGATAGCAGACATTTGCTCTTTGAAGGGAGCTTTAAGAAGCATAAGTTTAATTCTGTCTGTCTGATTGTTGTGGCAAAGGGCTG
TTTCAATTGTTCATATTTTGTTCAGAACTTCCAAGTTATGCAGATATTTAGTTCAAAGGAGGAGGGTGAGGAAGGTGAAGGGAAAAACAAATCAAAGGGTGATCAGGCAA
AAGAGCTGCTTGCAAAATATGGAGGTGCATATTTAGCCACTTCCATTACTCTCTCCCTGATCTCTTTCTCTCTGTGTTACGCGCTCATCAGTGCCGGTGTCGACGTTCAG
TCTCTTCTGCAGAAGGTGGGAATTTCGACTGATGAGACTGGAGAGAAAGTTGGAACGTTTGCTTTGGCATATGCAGCACATAAAGCTGCTTCTCCAATAAGGTTTCCTCC
AACAGTAGCCCTCACTCCAATCGTTGCAAGTTGGATAGGGAAGAAAGTTGAGAAAGAGAAATAG
Protein sequenceShow/hide protein sequence
MATASSVICFSSSFISSPASFTDGNFRFHLKFSNRPPKFNSKRFRVQSLKEKTGEIERPSPAASSSSADEVTKKYGLEAGLWKVPFSSFSSYFICREVDWNLEELHIPCY
GGIGGFKYIDSRHLLFEGSFKKHKFNSVCLIVVAKGCFNCSYFVQNFQVMQIFSSKEEGEEGEGKNKSKGDQAKELLAKYGGAYLATSITLSLISFSLCYALISAGVDVQ
SLLQKVGISTDETGEKVGTFALAYAAHKAASPIRFPPTVALTPIVASWIGKKVEKEK