CuGenDBv2

HG10016531 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10016531
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Mediator of RNA polymerase II transcription subunit 26, putative
Genome location: Chr03:5716974..5718325
RNA-Seq Expression: HG10016531
Synteny: HG10016531
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
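The genome location field can be split into a chromosome name and a coordinate pair. The following is a minimal sketch, assuming the `chrom:start..end` layout seen in this record and assuming the coordinates are 1-based and inclusive (the record itself does not state the convention). The names `GenomeLocation` and `parse_location` are hypothetical helpers, not part of any CuGenDB tooling.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A parsed 'chrom:start..end' location (hypothetical helper type)."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates.
        return self.end - self.start + 1


# Chromosome name, a colon, then two integers separated by '..'
_LOCATION_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> GenomeLocation:
    """Parse a location string such as 'Chr03:5716974..5718325'."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return GenomeLocation(match["chrom"], start, end)


loc = parse_location("Chr03:5716974..5718325")
print(loc.chrom, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the span for this gene works out to 1352 bp.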


Homology
GenBank top hits (e value, % identity, alignment)
XP_004143470.1 uncharacterized protein LOC101209867 [Cucumis sativus] (e value: 2.3e-101; % identity: 88.21)
Query:  MARFGNFRATKVFLLFLLLVLFASAFLLCVRAASESRNGELFRRGRRILESVEEVEPKKKKSSDALPTKTQ-NKLIKAPTQSSKNQTKLIKNNLSTKNKT
        MAR GNFRATKV LLFL+LVLFASAFLLCV A SESRNGE  RRGRRILESVEE EPKKKKSSDALPTKTQ NKLIKAPTQSSKNQTKLIKNNLSTKNKT
Subjt:  MARFGNFRATKVFLLFLLLVLFASAFLLCVRAASESRNGELFRRGRRILESVEEVEPKKKKSSDALPTKTQ-NKLIKAPTQSSKNQTKLIKNNLSTKNKT

Query:  TLGKATNSTKLTSGGILSKVGLK------KLNSTSKSSNSTKTTSFSAKKSSDPLKLSTPKNKTTTPSSSKQSQTTHLDKPKKEQKSEKKPNEEKPKKQA
         LGK TNSTKLTSGGIL KVGLK      KLNSTSKSSNSTKTT  SAKKSSD LK STPKNKTTTP+SSKQSQTTHLDK  KEQKSEKKPNEEKPKKQ 
Subjt:  TLGKATNSTKLTSGGILSKVGLK------KLNSTSKSSNSTKTTSFSAKKSSDPLKLSTPKNKTTTPSSSKQSQTTHLDKPKKEQKSEKKPNEEKPKKQA

Query:  QAK-KPSWVDDDEEDDLVSEFRDLPTKFQKTLIPDLAKISTTSKAYITKANKQMTMGFKPIVG
        QAK KPSWVDDDE++DLVSEFRDL TKFQKTLIPDLA+ISTTSKAYITKANKQMTMGFKPIVG
Subjt:  QAK-KPSWVDDDEEDDLVSEFRDLPTKFQKTLIPDLAKISTTSKAYITKANKQMTMGFKPIVG

XP_008440602.1 PREDICTED: uncharacterized protein LOC103484976 [Cucumis melo] (e value: 1.3e-104; % identity: 89.73)
Query:  MARFGNFRATKVFLLFLLLVLFASAFLLCVRAASESRNGELFRRGRRILESVEEVEPKKKKSSDALPTKTQ-NKLIKAPTQSSKNQTKLIKNNLSTKNKT
        MAR GNFRATKV LLFL+LVLFASAFLLCVRA SESRNGELFRRGRRILESVEE EPKKKKSSDALPTKTQ NKLIKAPTQSSKNQTKLIKN+LSTKNKT
Subjt:  MARFGNFRATKVFLLFLLLVLFASAFLLCVRAASESRNGELFRRGRRILESVEEVEPKKKKSSDALPTKTQ-NKLIKAPTQSSKNQTKLIKNNLSTKNKT

Query:  TLGKATNSTKLTSGGILSKVGLKKLNS------TSKSSNSTKTTSFSAKKSSDPLKLSTPKNKTTTPSSSKQSQTTHLDKPKKEQKSEKKPNEEKPKKQA
         LGK TNSTKLTS GIL KVGLKKLNS      TSKSSNSTKTTS SAKKSSD LK+STPKNKTTTP+SSKQSQTTHLDK  KEQKSEKKPNEEKPKKQ 
Subjt:  TLGKATNSTKLTSGGILSKVGLKKLNS------TSKSSNSTKTTSFSAKKSSDPLKLSTPKNKTTTPSSSKQSQTTHLDKPKKEQKSEKKPNEEKPKKQA

Query:  QAK-KPSWVDDDEEDDLVSEFRDLPTKFQKTLIPDLAKISTTSKAYITKANKQMTMGFKPIVG
        QAK KPSWVDDDE+DDLVSEFRDLPTKFQKTLIPDLA+ISTTSKAYITKANKQMTMGFKPIVG
Subjt:  QAK-KPSWVDDDEEDDLVSEFRDLPTKFQKTLIPDLAKISTTSKAYITKANKQMTMGFKPIVG

XP_022978464.1 uncharacterized protein LOC111478436 [Cucurbita maxima] (e value: 2.3e-88; % identity: 78.95)
Query:  FRATKVFLLFLLLVLFASAFLLCVRAASESRNGELFR-----RGRRILESVEEVEPKKK--KSSDALPTKTQNKLIKAPTQSSKNQTKLIKNNLSTKNKT
        FRA+KVF+LFLLLVLFA A LLCVRA SES NG LFR      GRR LESVEE  PK K  KS+DALPTKTQNKL+K PTQSSKNQTKLI NN STKNKT
Subjt:  FRATKVFLLFLLLVLFASAFLLCVRAASESRNGELFR-----RGRRILESVEEVEPKKK--KSSDALPTKTQNKLIKAPTQSSKNQTKLIKNNLSTKNKT

Query:  TLGKATNSTKLTSGGILSKVGLK---------KLNSTSKSSNSTKTTSFSAKKSSDPLKLSTPKNKTTTPSSSKQSQTTHLDKPKKEQKSEKKPNEEKPK
         LGKA NSTKL SGG LSKVGLK         KLNSTSKSSNSTKTTSF AKKSSD LKLSTPKNKTTTP+S+KQSQTTHLDKP K+ KSE KP +EKPK
Subjt:  TLGKATNSTKLTSGGILSKVGLK---------KLNSTSKSSNSTKTTSFSAKKSSDPLKLSTPKNKTTTPSSSKQSQTTHLDKPKKEQKSEKKPNEEKPK

Query:  KQAQAK-KPSWVDDDEEDDLVSEFRDLPTKFQKTLIPDLAKISTTSKAYITKANKQMTMGFKPIVG
        KQAQ + KPSWVD+DE+DDLVSEFRDL TKFQKT IPDLA+ISTTSKAYITKANKQMT GFKPIVG
Subjt:  KQAQAK-KPSWVDDDEEDDLVSEFRDLPTKFQKTLIPDLAKISTTSKAYITKANKQMTMGFKPIVG

XP_023544541.1 uncharacterized protein LOC111804090 [Cucurbita pepo subsp. pepo] (e value: 6.2e-86; % identity: 78.33)
Query:  FRATKVFLLFLLLVLFASAFLLCVRAASESRNGELFR--RGRRILESVEEVEP--KKKKSSDALPTKTQNKLIKAPTQSSKNQTKLIKNNLSTKNKTTLG
        FRA+KVF+LFL+LVLFA+A LLCVRA SES NG LFR   GRR+LE+VEE  P  K KKS+DALPTKTQNKL+K+PTQSSKNQTKLI NN STKNKT LG
Subjt:  FRATKVFLLFLLLVLFASAFLLCVRAASESRNGELFR--RGRRILESVEEVEP--KKKKSSDALPTKTQNKLIKAPTQSSKNQTKLIKNNLSTKNKTTLG

Query:  KATNSTKLTSGGILSKVGLK---------KLNSTSKSSNSTKTTSFSAKKSSDPLKLSTPKNKTTTPSSSKQSQTTHLDKPKKEQKSEKKPNEEKPKKQA
        KA NSTKL SGG LSKVGLK         KLNSTSKSSNSTKTTSFSAKKSSD LKLSTPKNKTTTP+S+KQSQTTHLD      KSE K  +EKPKKQA
Subjt:  KATNSTKLTSGGILSKVGLK---------KLNSTSKSSNSTKTTSFSAKKSSDPLKLSTPKNKTTTPSSSKQSQTTHLDKPKKEQKSEKKPNEEKPKKQA

Query:  QAK-KPSWVDDDEEDDLVSEFRDLPTKFQKTLIPDLAKISTTSKAYITKANKQMTMGFKPIVG
        Q K K  WVD+DE+DDLVSEFRDL TKFQKTLIPDLA+ISTTSKAYITKANKQMTMGFKPIVG
Subjt:  QAK-KPSWVDDDEEDDLVSEFRDLPTKFQKTLIPDLAKISTTSKAYITKANKQMTMGFKPIVG

XP_038882955.1 uncharacterized protein LOC120074049 [Benincasa hispida] (e value: 3.5e-105; % identity: 89.53)
Query:  MARFGNFRATKVFLLFLLLVLFASAFLLCVRAASESRNGELFRRGRRILESVEEVEPKKKKSSDALPTKTQNKLIKAPTQSSKNQTKLIKNNLSTKNKTT
        MAR GNFR TKVFLLFL+ VLFA+AFLLCV A SES NGELFRRGRRILESVEE EPKKKKSSDALPTKTQ+KLIK+PTQSSKNQTKLIKNNLSTKNKT 
Subjt:  MARFGNFRATKVFLLFLLLVLFASAFLLCVRAASESRNGELFRRGRRILESVEEVEPKKKKSSDALPTKTQNKLIKAPTQSSKNQTKLIKNNLSTKNKTT

Query:  LGKATNSTKLTSGGILSKVGLKKLNSTSKSSNSTKTTSFSAKKSSDPLKLSTPKNKTTTPSSSKQSQTTHLDKPKKEQKSEKK--PNEEKPKKQAQAK-K
        LGKATNSTKLTSGGILSKVGLKKLNSTSKSSNSTKTTSFSAKKSSD  KLSTPKNK TTP+SSKQSQTTHLDKP KEQKSEKK   NEEKPKKQ +AK K
Subjt:  LGKATNSTKLTSGGILSKVGLKKLNSTSKSSNSTKTTSFSAKKSSDPLKLSTPKNKTTTPSSSKQSQTTHLDKPKKEQKSEKK--PNEEKPKKQAQAK-K

Query:  PSWVDDDEEDDLVSEFRDLPTKFQKTLIPDLAKISTTSKAYITKANKQMTMGFKPIVG
        PSWVDDDE+DDLVSEFRDLPTKFQKTLIPDLA+ISTTSKAY+ KANKQMTMGFKPIVG
Subjt:  PSWVDDDEEDDLVSEFRDLPTKFQKTLIPDLAKISTTSKAYITKANKQMTMGFKPIVG

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KG97 Uncharacterized protein (e value: 1.1e-101; % identity: 88.21)
Query:  MARFGNFRATKVFLLFLLLVLFASAFLLCVRAASESRNGELFRRGRRILESVEEVEPKKKKSSDALPTKTQ-NKLIKAPTQSSKNQTKLIKNNLSTKNKT
        MAR GNFRATKV LLFL+LVLFASAFLLCV A SESRNGE  RRGRRILESVEE EPKKKKSSDALPTKTQ NKLIKAPTQSSKNQTKLIKNNLSTKNKT
Subjt:  MARFGNFRATKVFLLFLLLVLFASAFLLCVRAASESRNGELFRRGRRILESVEEVEPKKKKSSDALPTKTQ-NKLIKAPTQSSKNQTKLIKNNLSTKNKT

Query:  TLGKATNSTKLTSGGILSKVGLK------KLNSTSKSSNSTKTTSFSAKKSSDPLKLSTPKNKTTTPSSSKQSQTTHLDKPKKEQKSEKKPNEEKPKKQA
         LGK TNSTKLTSGGIL KVGLK      KLNSTSKSSNSTKTT  SAKKSSD LK STPKNKTTTP+SSKQSQTTHLDK  KEQKSEKKPNEEKPKKQ 
Subjt:  TLGKATNSTKLTSGGILSKVGLK------KLNSTSKSSNSTKTTSFSAKKSSDPLKLSTPKNKTTTPSSSKQSQTTHLDKPKKEQKSEKKPNEEKPKKQA

Query:  QAK-KPSWVDDDEEDDLVSEFRDLPTKFQKTLIPDLAKISTTSKAYITKANKQMTMGFKPIVG
        QAK KPSWVDDDE++DLVSEFRDL TKFQKTLIPDLA+ISTTSKAYITKANKQMTMGFKPIVG
Subjt:  QAK-KPSWVDDDEEDDLVSEFRDLPTKFQKTLIPDLAKISTTSKAYITKANKQMTMGFKPIVG

A0A1S3B126 uncharacterized protein LOC103484976 (e value: 6.4e-105; % identity: 89.73)
Query:  MARFGNFRATKVFLLFLLLVLFASAFLLCVRAASESRNGELFRRGRRILESVEEVEPKKKKSSDALPTKTQ-NKLIKAPTQSSKNQTKLIKNNLSTKNKT
        MAR GNFRATKV LLFL+LVLFASAFLLCVRA SESRNGELFRRGRRILESVEE EPKKKKSSDALPTKTQ NKLIKAPTQSSKNQTKLIKN+LSTKNKT
Subjt:  MARFGNFRATKVFLLFLLLVLFASAFLLCVRAASESRNGELFRRGRRILESVEEVEPKKKKSSDALPTKTQ-NKLIKAPTQSSKNQTKLIKNNLSTKNKT

Query:  TLGKATNSTKLTSGGILSKVGLKKLNS------TSKSSNSTKTTSFSAKKSSDPLKLSTPKNKTTTPSSSKQSQTTHLDKPKKEQKSEKKPNEEKPKKQA
         LGK TNSTKLTS GIL KVGLKKLNS      TSKSSNSTKTTS SAKKSSD LK+STPKNKTTTP+SSKQSQTTHLDK  KEQKSEKKPNEEKPKKQ 
Subjt:  TLGKATNSTKLTSGGILSKVGLKKLNS------TSKSSNSTKTTSFSAKKSSDPLKLSTPKNKTTTPSSSKQSQTTHLDKPKKEQKSEKKPNEEKPKKQA

Query:  QAK-KPSWVDDDEEDDLVSEFRDLPTKFQKTLIPDLAKISTTSKAYITKANKQMTMGFKPIVG
        QAK KPSWVDDDE+DDLVSEFRDLPTKFQKTLIPDLA+ISTTSKAYITKANKQMTMGFKPIVG
Subjt:  QAK-KPSWVDDDEEDDLVSEFRDLPTKFQKTLIPDLAKISTTSKAYITKANKQMTMGFKPIVG

A0A5A7T3Z1 Putative Mediator of RNA polymerase II transcription subunit 26 (e value: 6.4e-105; % identity: 89.73)
Query:  MARFGNFRATKVFLLFLLLVLFASAFLLCVRAASESRNGELFRRGRRILESVEEVEPKKKKSSDALPTKTQ-NKLIKAPTQSSKNQTKLIKNNLSTKNKT
        MAR GNFRATKV LLFL+LVLFASAFLLCVRA SESRNGELFRRGRRILESVEE EPKKKKSSDALPTKTQ NKLIKAPTQSSKNQTKLIKN+LSTKNKT
Subjt:  MARFGNFRATKVFLLFLLLVLFASAFLLCVRAASESRNGELFRRGRRILESVEEVEPKKKKSSDALPTKTQ-NKLIKAPTQSSKNQTKLIKNNLSTKNKT

Query:  TLGKATNSTKLTSGGILSKVGLKKLNS------TSKSSNSTKTTSFSAKKSSDPLKLSTPKNKTTTPSSSKQSQTTHLDKPKKEQKSEKKPNEEKPKKQA
         LGK TNSTKLTS GIL KVGLKKLNS      TSKSSNSTKTTS SAKKSSD LK+STPKNKTTTP+SSKQSQTTHLDK  KEQKSEKKPNEEKPKKQ 
Subjt:  TLGKATNSTKLTSGGILSKVGLKKLNS------TSKSSNSTKTTSFSAKKSSDPLKLSTPKNKTTTPSSSKQSQTTHLDKPKKEQKSEKKPNEEKPKKQA

Query:  QAK-KPSWVDDDEEDDLVSEFRDLPTKFQKTLIPDLAKISTTSKAYITKANKQMTMGFKPIVG
        QAK KPSWVDDDE+DDLVSEFRDLPTKFQKTLIPDLA+ISTTSKAYITKANKQMTMGFKPIVG
Subjt:  QAK-KPSWVDDDEEDDLVSEFRDLPTKFQKTLIPDLAKISTTSKAYITKANKQMTMGFKPIVG

A0A6J1GFI8 uncharacterized protein LOC111453710 (e value: 7.4e-85; % identity: 77.57)
Query:  FRATKVFLLFLLLVLFASAFLLCVRAASESRNGELFR--RGRRILESVEE--VEPKKKKSSDALPTKTQNKLIKAPTQSSKNQTKLIKNNLSTKNKTTLG
        FRA+KVF+LFL+LVLFA+A  LCVRA SES NG LFR   GRR+LE+VEE   + K KKS+DALPTKTQNKL+K PTQS KNQTKLI NN STKNKT LG
Subjt:  FRATKVFLLFLLLVLFASAFLLCVRAASESRNGELFR--RGRRILESVEE--VEPKKKKSSDALPTKTQNKLIKAPTQSSKNQTKLIKNNLSTKNKTTLG

Query:  KATNSTKLTSGGILSKVGLK---------KLNSTSKSSNSTKTTSFSAKKSSDPLKLSTPKNKTTTPSSSKQSQTTHLDKPKKEQKSEKKPNEEKPKKQA
        KA NST+L SGG LSKVGLK         KLNSTSKSSNSTKTTSFSAKKSSD LKLSTPKNKTTTP+S+KQSQTTHLD      KSE KP +EKPKKQA
Subjt:  KATNSTKLTSGGILSKVGLK---------KLNSTSKSSNSTKTTSFSAKKSSDPLKLSTPKNKTTTPSSSKQSQTTHLDKPKKEQKSEKKPNEEKPKKQA

Query:  QAK-KPSWVDDDEEDDLVSEFRDLPTKFQKTLIPDLAKISTTSKAYITKANKQMTMGFKPIVG
        Q K K SWVD+DE+DDLVSEFRDL TKFQKTLIPDLA+ISTTSKAYITKANKQMTMGFKPIVG
Subjt:  QAK-KPSWVDDDEEDDLVSEFRDLPTKFQKTLIPDLAKISTTSKAYITKANKQMTMGFKPIVG

A0A6J1IU43 uncharacterized protein LOC111478436 (e value: 1.1e-88; % identity: 78.95)
Query:  FRATKVFLLFLLLVLFASAFLLCVRAASESRNGELFR-----RGRRILESVEEVEPKKK--KSSDALPTKTQNKLIKAPTQSSKNQTKLIKNNLSTKNKT
        FRA+KVF+LFLLLVLFA A LLCVRA SES NG LFR      GRR LESVEE  PK K  KS+DALPTKTQNKL+K PTQSSKNQTKLI NN STKNKT
Subjt:  FRATKVFLLFLLLVLFASAFLLCVRAASESRNGELFR-----RGRRILESVEEVEPKKK--KSSDALPTKTQNKLIKAPTQSSKNQTKLIKNNLSTKNKT

Query:  TLGKATNSTKLTSGGILSKVGLK---------KLNSTSKSSNSTKTTSFSAKKSSDPLKLSTPKNKTTTPSSSKQSQTTHLDKPKKEQKSEKKPNEEKPK
         LGKA NSTKL SGG LSKVGLK         KLNSTSKSSNSTKTTSF AKKSSD LKLSTPKNKTTTP+S+KQSQTTHLDKP K+ KSE KP +EKPK
Subjt:  TLGKATNSTKLTSGGILSKVGLK---------KLNSTSKSSNSTKTTSFSAKKSSDPLKLSTPKNKTTTPSSSKQSQTTHLDKPKKEQKSEKKPNEEKPK

Query:  KQAQAK-KPSWVDDDEEDDLVSEFRDLPTKFQKTLIPDLAKISTTSKAYITKANKQMTMGFKPIVG
        KQAQ + KPSWVD+DE+DDLVSEFRDL TKFQKT IPDLA+ISTTSKAYITKANKQMT GFKPIVG
Subjt:  KQAQAK-KPSWVDDDEEDDLVSEFRDLPTKFQKTLIPDLAKISTTSKAYITKANKQMTMGFKPIVG

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT4G39840.1 unknown protein (e value: 4.1e-27; % identity: 42.09)
Query:  FLLFLLLVLFASAFLLCVRA--ASESRNGELFR-----RGRRILESVEEVEPKKKKSSDALPTKTQNKLIKAPTQSS--------KNQTKLIK---NNLS
        F   LLL+L++S FL    +  AS   + EL R     R RR+L  V++++         LP   + K +   + SS        KNQTKL+K   ++ S
Subjt:  FLLFLLLVLFASAFLLCVRA--ASESRNGELFR-----RGRRILESVEEVEPKKKKSSDALPTKTQNKLIKAPTQSS--------KNQTKLIK---NNLS

Query:  TKNKTTLGKAT--------NSTKLTSGGILSKVGLKKLNSTSKSSNSTKTTSFSAKKSSDPLKLSTPKNKTT-TPSSSKQSQTTHLDKPKKEQKSEKKPN
        TKN+T L K T        NSTK +S    +   LKKLNS +KS+NST     S KKS+D  K S+ KNKTT  P SSK      L  P  E+KS+    
Subjt:  TKNKTTLGKAT--------NSTKLTSGGILSKVGLKKLNSTSKSSNSTKTTSFSAKKSSDPLKLSTPKNKTT-TPSSSKQSQTTHLDKPKKEQKSEKKPN

Query:  EEKPKKQAQAK-KPSWVDDDEEDDLVSEFRDLPTKFQKTLIPDLAKISTTSKAYITKANKQMTMGFKPIVGLFLTPKAPDIHPSLPRNLLRYPLLIV
             KQ++ + KP W+DD+E++D VSEFRDLPT+FQ++LIPDL +ISTTSK YI KANKQ+T  FKP  G      AP I   +    +  PLL+V
Subjt:  EEKPKKQAQAK-KPSWVDDDEEDDLVSEFRDLPTKFQKTLIPDLAKISTTSKAYITKANKQMTMGFKPIVGLFLTPKAPDIHPSLPRNLLRYPLLIV


Sequences
CDS sequence
ATGGCTCGATTTGGGAATTTCAGAGCCACAAAGGTATTTCTGCTCTTTCTTCTTTTGGTTCTTTTTGCTTCGGCTTTTCTTTTGTGTGTGCGCGCCGCATCTGAATCTCG
AAATGGAGAATTGTTTCGGAGAGGGAGAAGAATTTTGGAGTCGGTGGAAGAGGTTGAGCCGAAGAAGAAAAAGTCCAGTGATGCCTTACCCACCAAAACTCAGAACAAGC
TCATTAAAGCTCCTACCCAATCCTCGAAAAACCAGACCAAGCTCATTAAGAACAATTTGTCTACGAAGAACAAGACTACGCTTGGTAAGGCTACGAATTCCACCAAACTG
ACATCCGGCGGCATCCTCTCCAAGGTTGGGCTCAAGAAGCTCAATTCCACATCGAAATCCTCCAATTCTACCAAAACCACGTCATTCTCTGCTAAGAAATCTTCAGATCC
ACTCAAATTAAGCACGCCCAAGAACAAAACGACGACACCCAGTTCCTCAAAACAATCCCAAACCACCCATCTAGACAAACCAAAGAAGGAGCAGAAATCAGAGAAGAAAC
CGAATGAGGAAAAACCCAAGAAACAAGCACAGGCGAAAAAGCCCAGTTGGGTTGACGATGATGAAGAAGACGATTTGGTATCGGAGTTCAGAGATCTACCCACGAAATTT
CAGAAAACTTTGATCCCAGACCTGGCGAAAATCTCCACTACTTCCAAGGCATACATTACGAAAGCCAACAAACAAATGACAATGGGATTCAAACCCATCGTGGGCTTATT
TCTCACTCCAAAAGCTCCTGATATTCATCCAAGTCTACCTCGCAATCTACTTCGGTATCCTCTGCTTATCGTCGGTGGTGACTGGGCTGGAGCCCCTCAAATTTTTCTAC
TCCACCTCGCAATCCACCTACATTTGCCTGCAGGTGATGCAAACCCTTGGATACATCTTGTATCTGCTGCTTCTGGTGATGTACCTGGTGCTGGTGTTCTCGACGGATTG
TGGGCTGGGCTCGAGAATGTTGGGCCTGGCCCAAACCTTCGTCGGGTATGCGGTGGGGTTGCATTACTATGTATCGGTGTTCCACAGGATGGTTCTACACCAGCCACCGA
GGACGAACTGGAAAATTCATGGGATTTATGCCACGTGTTTCCTGGTGATTTGCGGGTTCGCCGGGGCGGAGAGGAGGAAGAAAGCTTACTTGGAAGAAGACGGGGCAGAG
GGCAAGAAGAGTTGATTCGATTCAATTCAATTAAGTGTGGAATTTAG
mRNA sequence
ATGGCTCGATTTGGGAATTTCAGAGCCACAAAGGTATTTCTGCTCTTTCTTCTTTTGGTTCTTTTTGCTTCGGCTTTTCTTTTGTGTGTGCGCGCCGCATCTGAATCTCG
AAATGGAGAATTGTTTCGGAGAGGGAGAAGAATTTTGGAGTCGGTGGAAGAGGTTGAGCCGAAGAAGAAAAAGTCCAGTGATGCCTTACCCACCAAAACTCAGAACAAGC
TCATTAAAGCTCCTACCCAATCCTCGAAAAACCAGACCAAGCTCATTAAGAACAATTTGTCTACGAAGAACAAGACTACGCTTGGTAAGGCTACGAATTCCACCAAACTG
ACATCCGGCGGCATCCTCTCCAAGGTTGGGCTCAAGAAGCTCAATTCCACATCGAAATCCTCCAATTCTACCAAAACCACGTCATTCTCTGCTAAGAAATCTTCAGATCC
ACTCAAATTAAGCACGCCCAAGAACAAAACGACGACACCCAGTTCCTCAAAACAATCCCAAACCACCCATCTAGACAAACCAAAGAAGGAGCAGAAATCAGAGAAGAAAC
CGAATGAGGAAAAACCCAAGAAACAAGCACAGGCGAAAAAGCCCAGTTGGGTTGACGATGATGAAGAAGACGATTTGGTATCGGAGTTCAGAGATCTACCCACGAAATTT
CAGAAAACTTTGATCCCAGACCTGGCGAAAATCTCCACTACTTCCAAGGCATACATTACGAAAGCCAACAAACAAATGACAATGGGATTCAAACCCATCGTGGGCTTATT
TCTCACTCCAAAAGCTCCTGATATTCATCCAAGTCTACCTCGCAATCTACTTCGGTATCCTCTGCTTATCGTCGGTGGTGACTGGGCTGGAGCCCCTCAAATTTTTCTAC
TCCACCTCGCAATCCACCTACATTTGCCTGCAGGTGATGCAAACCCTTGGATACATCTTGTATCTGCTGCTTCTGGTGATGTACCTGGTGCTGGTGTTCTCGACGGATTG
TGGGCTGGGCTCGAGAATGTTGGGCCTGGCCCAAACCTTCGTCGGGTATGCGGTGGGGTTGCATTACTATGTATCGGTGTTCCACAGGATGGTTCTACACCAGCCACCGA
GGACGAACTGGAAAATTCATGGGATTTATGCCACGTGTTTCCTGGTGATTTGCGGGTTCGCCGGGGCGGAGAGGAGGAAGAAAGCTTACTTGGAAGAAGACGGGGCAGAG
GGCAAGAAGAGTTGATTCGATTCAATTCAATTAAGTGTGGAATTTAG
Protein sequence
MARFGNFRATKVFLLFLLLVLFASAFLLCVRAASESRNGELFRRGRRILESVEEVEPKKKKSSDALPTKTQNKLIKAPTQSSKNQTKLIKNNLSTKNKTTLGKATNSTKL
TSGGILSKVGLKKLNSTSKSSNSTKTTSFSAKKSSDPLKLSTPKNKTTTPSSSKQSQTTHLDKPKKEQKSEKKPNEEKPKKQAQAKKPSWVDDDEEDDLVSEFRDLPTKF
QKTLIPDLAKISTTSKAYITKANKQMTMGFKPIVGLFLTPKAPDIHPSLPRNLLRYPLLIVGGDWAGAPQIFLLHLAIHLHLPAGDANPWIHLVSAASGDVPGAGVLDGL
WAGLENVGPGPNLRRVCGGVALLCIGVPQDGSTPATEDELENSWDLCHVFPGDLRVRRGGEEEESLLGRRRGRGQEELIRFNSIKCGI
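
As a quick consistency check between the CDS and protein sequences listed above, the CDS can be translated codon by codon. This is a minimal sketch, assuming the standard nuclear genetic code applies to this gene. The function name `translate` and the choice of `X` for unrecognised codons are illustrative, not taken from the database.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # codons starting with T
    "LLLLPPPPHHQQRRRR"  # codons starting with C
    "IIIMTTTTNNKKSSRR"  # codons starting with A
    "VVVVAAAADDEEGGGG"  # codons starting with G
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Whitespace and line breaks are ignored; codons that are not in the
    table (for example, ones containing N) are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First 33 nucleotides of the CDS sequence shown in this record.
cds_prefix = "ATGGCTCGATTTGGGAATTTCAGAGCCACAAAG"
print(translate(cds_prefix))  # MARFGNFRATK
```

The translated prefix matches the first eleven residues of the protein sequence. Passing the full CDS, with its line breaks, to the same function allows the whole protein to be compared.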