; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0013816 (gene) of Snake gourd v1 genome

Gene IDTan0013816
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein of unknown function (DUF1639)
Genome locationLG10:24144769..24147114
RNA-Seq ExpressionTan0013816
SyntenyTan0013816
Gene Ontology termsNA
InterPro domainsIPR012438 - Protein of unknown function DUF1639


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6597703.1 hypothetical protein SDJN03_10883, partial [Cucurbita argyrosperma subsp. sororia]3.5e-10995.02Show/hide
Query:  METEVRNQNQRGCKPLEPEVFLQWGKRKRLRCARIKDPEISERLCGGLRKKIASRSDRCVVSSSEKERTPLQPNRLTRNSEGVTTLRNGAGTSPSPEKED
        METEVRNQNQRGCKPLEPEVFLQW KRKRLRCARIKDPEISERLCGG+RKKIASR+DRCVVS SEKERTPLQPNRLTRNSEGVTTLRNGAGTSPSPEKED
Subjt:  METEVRNQNQRGCKPLEPEVFLQWGKRKRLRCARIKDPEISERLCGGLRKKIASRSDRCVVSSSEKERTPLQPNRLTRNSEGVTTLRNGAGTSPSPEKED

Query:  RYYATRGSA-ATVVDENGRIST--NGNGNGKAEERGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREK
        RYYATRGSA A VVDENGRIST  NGN N KAEERGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLL+SPGAWLSEMSQERYEVREK
Subjt:  RYYATRGSA-ATVVDENGRIST--NGNGNGKAEERGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREK

Query:  KSTKKRPTGLKAMGSMESDSE
        KSTKKRPTGLKAMGSMESDSE
Subjt:  KSTKKRPTGLKAMGSMESDSE

XP_022932617.1 uncharacterized protein LOC111439121 isoform X1 [Cucurbita moschata]3.5e-10995.48Show/hide
Query:  METEVRNQNQRGCKPLEPEVFLQWGKRKRLRCARIKDPEISERLCGGLRKKIASRSDRCVVSSSEKERTPLQPNRLTRNSEGVTTLRNGAGTSPSPEKED
        METEVRNQNQRGCKPLEPEVFLQW KRKRLRCARIKDPEISERLCGGLRKKIASR+DRCVVS SEKERTPLQPNRLTRNSEGVTTLRNGAGTSPSPEKED
Subjt:  METEVRNQNQRGCKPLEPEVFLQWGKRKRLRCARIKDPEISERLCGGLRKKIASRSDRCVVSSSEKERTPLQPNRLTRNSEGVTTLRNGAGTSPSPEKED

Query:  RYYATRGSA-ATVVDENGRIST--NGNGNGKAEERGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREK
        RYYATRGSA A VVDENGRIST  NGN N KAEERGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREK
Subjt:  RYYATRGSA-ATVVDENGRIST--NGNGNGKAEERGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREK

Query:  KSTKKRPTGLKAMGSMESDSE
        KS KKRPTGLKAMGSMESDSE
Subjt:  KSTKKRPTGLKAMGSMESDSE

XP_022932618.1 uncharacterized protein LOC111439121 isoform X2 [Cucurbita moschata]3.5e-10995.48Show/hide
Query:  METEVRNQNQRGCKPLEPEVFLQWGKRKRLRCARIKDPEISERLCGGLRKKIASRSDRCVVSSSEKERTPLQPNRLTRNSEGVTTLRNGAGTSPSPEKED
        METEVRNQNQRGCKPLEPEVFLQW KRKRLRCARIKDPEISERLCGGLRKKIASR+DRCVVS SEKERTPLQPNRLTRNSEGVTTLRNGAGTSPSPEKED
Subjt:  METEVRNQNQRGCKPLEPEVFLQWGKRKRLRCARIKDPEISERLCGGLRKKIASRSDRCVVSSSEKERTPLQPNRLTRNSEGVTTLRNGAGTSPSPEKED

Query:  RYYATRGSA-ATVVDENGRIST--NGNGNGKAEERGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREK
        RYYATRGSA A VVDENGRIST  NGN N KAEERGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREK
Subjt:  RYYATRGSA-ATVVDENGRIST--NGNGNGKAEERGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREK

Query:  KSTKKRPTGLKAMGSMESDSE
        KS KKRPTGLKAMGSMESDSE
Subjt:  KSTKKRPTGLKAMGSMESDSE

XP_022972122.1 uncharacterized protein LOC111470755 isoform X1 [Cucurbita maxima]4.6e-10995.02Show/hide
Query:  METEVRNQNQRGCKPLEPEVFLQWGKRKRLRCARIKDPEISERLCGGLRKKIASRSDRCVVSSSEKERTPLQPNRLTRNSEGVTTLRNGAGTSPSPEKED
        METEVRNQNQRGCKPLEPEVFLQW K+KRLRCARIKDPEISERLCGGLRKKIASR+DRCVVS SEKERTPLQPNRLTRNSEGVTTLRNGAGTSPSPEKED
Subjt:  METEVRNQNQRGCKPLEPEVFLQWGKRKRLRCARIKDPEISERLCGGLRKKIASRSDRCVVSSSEKERTPLQPNRLTRNSEGVTTLRNGAGTSPSPEKED

Query:  RYYATRGSA-ATVVDENGRIST--NGNGNGKAEERGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREK
        RYYATRGSA A VVDENGRIST  NGN N KAEERGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREK
Subjt:  RYYATRGSA-ATVVDENGRIST--NGNGNGKAEERGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREK

Query:  KSTKKRPTGLKAMGSMESDSE
        KSTKKRPTGLKAMGSME+DSE
Subjt:  KSTKKRPTGLKAMGSMESDSE

XP_022972123.1 uncharacterized protein LOC111470755 isoform X2 [Cucurbita maxima]4.6e-10995.02Show/hide
Query:  METEVRNQNQRGCKPLEPEVFLQWGKRKRLRCARIKDPEISERLCGGLRKKIASRSDRCVVSSSEKERTPLQPNRLTRNSEGVTTLRNGAGTSPSPEKED
        METEVRNQNQRGCKPLEPEVFLQW K+KRLRCARIKDPEISERLCGGLRKKIASR+DRCVVS SEKERTPLQPNRLTRNSEGVTTLRNGAGTSPSPEKED
Subjt:  METEVRNQNQRGCKPLEPEVFLQWGKRKRLRCARIKDPEISERLCGGLRKKIASRSDRCVVSSSEKERTPLQPNRLTRNSEGVTTLRNGAGTSPSPEKED

Query:  RYYATRGSA-ATVVDENGRIST--NGNGNGKAEERGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREK
        RYYATRGSA A VVDENGRIST  NGN N KAEERGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREK
Subjt:  RYYATRGSA-ATVVDENGRIST--NGNGNGKAEERGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREK

Query:  KSTKKRPTGLKAMGSMESDSE
        KSTKKRPTGLKAMGSME+DSE
Subjt:  KSTKKRPTGLKAMGSMESDSE

TrEMBL top hitse value%identityAlignment
A0A6J1DQU0 uncharacterized protein LOC1110229595.7e-9787.95Show/hide
Query:  METEVRNQNQRGCKPLEPEVFLQWGKRKRLRCARIKDPEISERLCGGLRKKIASRSDRCVVSSSEKERTPLQPNRLTRNSEGVTTLRNGAGTS-----PS
        METEVR  NQRGCKPLEPEVFLQWGKR+RLRCARIKDPEISERLCGGLRKKIASR+DRCVV++SE+ERTPLQPNRLTR+SEGV TLRNGAGT+     PS
Subjt:  METEVRNQNQRGCKPLEPEVFLQWGKRKRLRCARIKDPEISERLCGGLRKKIASRSDRCVVSSSEKERTPLQPNRLTRNSEGVTTLRNGAGTS-----PS

Query:  PEKEDRYYATRGSAATVVDENGRISTNGNGNGKAEERGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVR
        PEKEDRYYATRGS A  VDENG+ISTNGN   KAE+RGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVR
Subjt:  PEKEDRYYATRGSAATVVDENGRISTNGNGNGKAEERGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVR

Query:  EKKSTKKRPTGLKAM-GSMESDSE
        EKKS KKRP GLKAM GSME+DSE
Subjt:  EKKSTKKRPTGLKAM-GSMESDSE

A0A6J1EX97 uncharacterized protein LOC111439121 isoform X11.7e-10995.48Show/hide
Query:  METEVRNQNQRGCKPLEPEVFLQWGKRKRLRCARIKDPEISERLCGGLRKKIASRSDRCVVSSSEKERTPLQPNRLTRNSEGVTTLRNGAGTSPSPEKED
        METEVRNQNQRGCKPLEPEVFLQW KRKRLRCARIKDPEISERLCGGLRKKIASR+DRCVVS SEKERTPLQPNRLTRNSEGVTTLRNGAGTSPSPEKED
Subjt:  METEVRNQNQRGCKPLEPEVFLQWGKRKRLRCARIKDPEISERLCGGLRKKIASRSDRCVVSSSEKERTPLQPNRLTRNSEGVTTLRNGAGTSPSPEKED

Query:  RYYATRGSA-ATVVDENGRIST--NGNGNGKAEERGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREK
        RYYATRGSA A VVDENGRIST  NGN N KAEERGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREK
Subjt:  RYYATRGSA-ATVVDENGRIST--NGNGNGKAEERGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREK

Query:  KSTKKRPTGLKAMGSMESDSE
        KS KKRPTGLKAMGSMESDSE
Subjt:  KSTKKRPTGLKAMGSMESDSE

A0A6J1F285 uncharacterized protein LOC111439121 isoform X21.7e-10995.48Show/hide
Query:  METEVRNQNQRGCKPLEPEVFLQWGKRKRLRCARIKDPEISERLCGGLRKKIASRSDRCVVSSSEKERTPLQPNRLTRNSEGVTTLRNGAGTSPSPEKED
        METEVRNQNQRGCKPLEPEVFLQW KRKRLRCARIKDPEISERLCGGLRKKIASR+DRCVVS SEKERTPLQPNRLTRNSEGVTTLRNGAGTSPSPEKED
Subjt:  METEVRNQNQRGCKPLEPEVFLQWGKRKRLRCARIKDPEISERLCGGLRKKIASRSDRCVVSSSEKERTPLQPNRLTRNSEGVTTLRNGAGTSPSPEKED

Query:  RYYATRGSA-ATVVDENGRIST--NGNGNGKAEERGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREK
        RYYATRGSA A VVDENGRIST  NGN N KAEERGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREK
Subjt:  RYYATRGSA-ATVVDENGRIST--NGNGNGKAEERGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREK

Query:  KSTKKRPTGLKAMGSMESDSE
        KS KKRPTGLKAMGSMESDSE
Subjt:  KSTKKRPTGLKAMGSMESDSE

A0A6J1I3X6 uncharacterized protein LOC111470755 isoform X12.2e-10995.02Show/hide
Query:  METEVRNQNQRGCKPLEPEVFLQWGKRKRLRCARIKDPEISERLCGGLRKKIASRSDRCVVSSSEKERTPLQPNRLTRNSEGVTTLRNGAGTSPSPEKED
        METEVRNQNQRGCKPLEPEVFLQW K+KRLRCARIKDPEISERLCGGLRKKIASR+DRCVVS SEKERTPLQPNRLTRNSEGVTTLRNGAGTSPSPEKED
Subjt:  METEVRNQNQRGCKPLEPEVFLQWGKRKRLRCARIKDPEISERLCGGLRKKIASRSDRCVVSSSEKERTPLQPNRLTRNSEGVTTLRNGAGTSPSPEKED

Query:  RYYATRGSA-ATVVDENGRIST--NGNGNGKAEERGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREK
        RYYATRGSA A VVDENGRIST  NGN N KAEERGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREK
Subjt:  RYYATRGSA-ATVVDENGRIST--NGNGNGKAEERGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREK

Query:  KSTKKRPTGLKAMGSMESDSE
        KSTKKRPTGLKAMGSME+DSE
Subjt:  KSTKKRPTGLKAMGSMESDSE

A0A6J1I8Z1 uncharacterized protein LOC111470755 isoform X22.2e-10995.02Show/hide
Query:  METEVRNQNQRGCKPLEPEVFLQWGKRKRLRCARIKDPEISERLCGGLRKKIASRSDRCVVSSSEKERTPLQPNRLTRNSEGVTTLRNGAGTSPSPEKED
        METEVRNQNQRGCKPLEPEVFLQW K+KRLRCARIKDPEISERLCGGLRKKIASR+DRCVVS SEKERTPLQPNRLTRNSEGVTTLRNGAGTSPSPEKED
Subjt:  METEVRNQNQRGCKPLEPEVFLQWGKRKRLRCARIKDPEISERLCGGLRKKIASRSDRCVVSSSEKERTPLQPNRLTRNSEGVTTLRNGAGTSPSPEKED

Query:  RYYATRGSA-ATVVDENGRIST--NGNGNGKAEERGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREK
        RYYATRGSA A VVDENGRIST  NGN N KAEERGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREK
Subjt:  RYYATRGSA-ATVVDENGRIST--NGNGNGKAEERGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREK

Query:  KSTKKRPTGLKAMGSMESDSE
        KSTKKRPTGLKAMGSME+DSE
Subjt:  KSTKKRPTGLKAMGSMESDSE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G55340.1 Protein of unknown function (DUF1639)9.8e-4952.07Show/hide
Query:  QNQRGCKPLEPEVFLQWGKRKRLRCARIKDPEISERLCGG------LRKKIASRSDRCVVSSSEKERTPLQPNRLTRNSEGVTTLRNGAGTSPSPEKEDR
        + QRG    E +  LQWG+RKR+RC ++K     + L  G       ++K+ SR+      SSE+       NR  + ++ +  +R       SPEKEDR
Subjt:  QNQRGCKPLEPEVFLQWGKRKRLRCARIKDPEISERLCGG------LRKKIASRSDRCVVSSSEKERTPLQPNRLTRNSEGVTTLRNGAGTSPSPEKEDR

Query:  YYATRGSAATVVDENGRISTNGNGNGKAEERGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREKKSTK
        YY TRGS    +DE+G+I          E +  VWPKL+IALS+KEKEEDF+AMKGCKLPQRPKKRAK++Q++LLLVSPGAWLS++ +ERYEVREKK++K
Subjt:  YYATRGSAATVVDENGRISTNGNGNGKAEERGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREKKSTK

Query:  KRPTGLKAMGSMESDSE
        KRP GLKAMGSMESDSE
Subjt:  KRPTGLKAMGSMESDSE

AT1G55340.2 Protein of unknown function (DUF1639)2.4e-4751.66Show/hide
Query:  QNQRGCKPLEPEVFLQWGKRKRLRCARIKDPEISERLCGGLRKKIASRSDRCVVSSSEKERTPLQPNRLTRNSEGVTTLRNGAGTSPSPEKEDRYYATRG
        + QRG    E +  LQWG+RKR+RC ++K     + L  G  K     + R ++S +           L R ++             SPEKEDRYY TRG
Subjt:  QNQRGCKPLEPEVFLQWGKRKRLRCARIKDPEISERLCGGLRKKIASRSDRCVVSSSEKERTPLQPNRLTRNSEGVTTLRNGAGTSPSPEKEDRYYATRG

Query:  SAATVVDENGRISTNGNGNGKAEERGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREKKSTKKRPTGL
        S    +DE+G+I          E +  VWPKL+IALS+KEKEEDF+AMKGCKLPQRPKKRAK++Q++LLLVSPGAWLS++ +ERYEVREKK++KKRP GL
Subjt:  SAATVVDENGRISTNGNGNGKAEERGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREKKSTKKRPTGL

Query:  KAMGSMESDSE
        KAMGSMESDSE
Subjt:  KAMGSMESDSE

AT3G03880.1 Protein of unknown function (DUF1639)6.5e-4554.33Show/hide
Query:  LEPEVFLQWGKRKRLRCARIKDPEISERLCGGLRKKIASRSDRCVVSSSEKERTPLQPNRLTRNSEGVTTLRNGA--GTSPSPEKEDRYYATRGSAATVV
        LE E+FLQWG +KRLRC R K  +IS       R K +SR          ++   LQ +R +R SEG   LR+G      PSPEKE+RYY TRG    VV
Subjt:  LEPEVFLQWGKRKRLRCARIKDPEISERLCGGLRKKIASRSDRCVVSSSEKERTPLQPNRLTRNSEGVTTLRNGA--GTSPSPEKEDRYYATRGSAATVV

Query:  DENGRISTNGNGNGK--AEERGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREKKSTKK-RPTGLKAM
        D  G+   +GN NG+    +   +WPKLFI LS+KEKEEDFMAMKGCK   RPKKRAK+IQRSLLLVSPG WL+++  +RY+VR KKS+KK R  GLKAM
Subjt:  DENGRISTNGNGNGK--AEERGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREKKSTKK-RPTGLKAM

Query:  GSMESDSE
        G+ME+DS+
Subjt:  GSMESDSE

AT4G20300.1 Protein of unknown function (DUF1639)1.6e-1937.93Show/hide
Query:  RCVVSSSEKERTPLQPNRLTRNSEGVTTLRNGAGTSPSPEKEDRYYATRGSAATVVDENGRISTNGNGNGKAEERGFVWPKLFIALSSKEKEEDFMAMKG
        R +  S   +R+P  P+++ + S      +NG       +++      R  +     +   I      NG+ E+    WP+++IALS KEKEEDF+ MKG
Subjt:  RCVVSSSEKERTPLQPNRLTRNSEGVTTLRNGAGTSPSPEKEDRYYATRGSAATVVDENGRISTNGNGNGKAEERGFVWPKLFIALSSKEKEEDFMAMKG

Query:  CKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREKKSTKK
         KLP RP+KRAK I ++L    PG WLS++++ RYEVREKK+ KK
Subjt:  CKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREKKSTKK

AT4G20300.2 Protein of unknown function (DUF1639)2.0e-2238.65Show/hide
Query:  RCVVSSSEKERTPLQPNRLTRNSEGVTTLRNGAGTSPSPEKEDRYYATRGSAATVVDENGRISTNGNGNGKAEERGFVWPKLFIALSSKEKEEDFMAMKG
        R +  S   +R+P  P+++ + S      +NG       +++      R  +     +   I      NG+ E+    WP+++IALS KEKEEDF+ MKG
Subjt:  RCVVSSSEKERTPLQPNRLTRNSEGVTTLRNGAGTSPSPEKEDRYYATRGSAATVVDENGRISTNGNGNGKAEERGFVWPKLFIALSSKEKEEDFMAMKG

Query:  CKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREKKSTKK--RPTGLKAMGSMESDSE
         KLP RP+KRAK I ++L    PG WLS++++ RYEVREKK+ KK  +  GLK M +M++DSE
Subjt:  CKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREKKSTKK--RPTGLKAMGSMESDSE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGACGGAGGTGAGGAATCAGAATCAGAGAGGTTGTAAGCCTTTAGAACCGGAGGTGTTCTTACAGTGGGGAAAGAGGAAGCGACTGCGATGTGCTAGAATCAAAGA
CCCTGAGATCTCTGAGCGATTGTGCGGCGGCCTACGGAAGAAAATCGCATCTCGGTCCGATCGCTGTGTGGTTTCTTCTTCCGAGAAAGAAAGGACCCCACTCCAACCTA
ATCGCCTAACTAGGAATTCTGAGGGCGTTACTACGCTGCGTAACGGTGCAGGCACATCGCCGTCGCCGGAGAAGGAGGACCGTTACTACGCCACCAGAGGATCCGCCGCG
ACAGTGGTGGATGAGAACGGTAGGATCTCAACCAACGGCAATGGAAACGGGAAGGCGGAGGAAAGAGGGTTTGTTTGGCCAAAGCTGTTTATAGCTCTGTCAAGCAAAGA
AAAGGAGGAAGACTTCATGGCTATGAAGGGCTGCAAGCTCCCACAAAGGCCCAAAAAGAGAGCGAAGATGATTCAGAGAAGCTTACTTCTGGTGAGCCCTGGGGCATGGC
TGAGTGAAATGAGCCAAGAGAGATATGAAGTTAGGGAAAAGAAGAGTACAAAGAAGAGGCCAACAGGGTTGAAAGCCATGGGAAGCATGGAGAGTGATTCAGAATAA
mRNA sequenceShow/hide mRNA sequence
GGCGATGTGCAGCATAAAAACGAGATTGAAGACGAAGAAGAAGAAGAGAGAGAAACAGAAACAGCAACAACAAAGAAGACGACCGCCATAGCCATAGCTCAGACTTCAAT
AAGCTCTGCTGCTCCAAACTCACACAACCATACGCTCTCTCTTGTAATGCCGAAAATTTCCACACCTGAAACCCGATTATTCCAAAAAAAAATTCAGGTCACCGAGGCTC
CAAATTGATTGACGCGATCGGAGCTTCAATTTCCTCTGGAGATCTGCTTTTTTTTTTTCATCAGATTAGCCTTCTTGTATTGTTGATAAGAATCTTCCTGTTCTCATTGT
ATGGAGACGGAGGTGAGGAATCAGAATCAGAGAGGTTGTAAGCCTTTAGAACCGGAGGTGTTCTTACAGTGGGGAAAGAGGAAGCGACTGCGATGTGCTAGAATCAAAGA
CCCTGAGATCTCTGAGCGATTGTGCGGCGGCCTACGGAAGAAAATCGCATCTCGGTCCGATCGCTGTGTGGTTTCTTCTTCCGAGAAAGAAAGGACCCCACTCCAACCTA
ATCGCCTAACTAGGAATTCTGAGGGCGTTACTACGCTGCGTAACGGTGCAGGCACATCGCCGTCGCCGGAGAAGGAGGACCGTTACTACGCCACCAGAGGATCCGCCGCG
ACAGTGGTGGATGAGAACGGTAGGATCTCAACCAACGGCAATGGAAACGGGAAGGCGGAGGAAAGAGGGTTTGTTTGGCCAAAGCTGTTTATAGCTCTGTCAAGCAAAGA
AAAGGAGGAAGACTTCATGGCTATGAAGGGCTGCAAGCTCCCACAAAGGCCCAAAAAGAGAGCGAAGATGATTCAGAGAAGCTTACTTCTGGTGAGCCCTGGGGCATGGC
TGAGTGAAATGAGCCAAGAGAGATATGAAGTTAGGGAAAAGAAGAGTACAAAGAAGAGGCCAACAGGGTTGAAAGCCATGGGAAGCATGGAGAGTGATTCAGAATAAGAT
GGAAAATGGGAAGAAACTGAAACTGACCCTTCAATATTTTTTTTGGAATGCATGCTGAATTGAATGAATGAAGAAGAGAAAGATAAAAGAGAAGAGATAATCTCAATCTC
AATCTCAATCTCAATCTCAATCGGTTCTGCTTTGCCCTTTTGGAATTATGAGTCAGAATTTGGTTTTGGATTTGATGCTAAAGAAGTCCACTTTTGGGGGTTGGTTAGTT
AGGCAGGGCTGCAAAAGCGATGAGTTGTTTACGGAATTAAAGGGAAAAAAAGTTGCTCTTTTTTGTGTATAAAATGTAATGTAAAAAAAGAAATTGTTATTTATAATGGA
AGGTCGATTTTAACTTGGATTCTGTATCAGGGGGGTGATAACCCAGCAAAAATTGAAAAATGGTCAAAAGAGAGTTATAGTGAAAATATATATATAGAGAGAGAGGCTTG
AATCTTGTTTATCATCAAATGAAATGGAAACTTATCGATTCTTAATTGAATTTTTTTACTCAAA
Protein sequenceShow/hide protein sequence
METEVRNQNQRGCKPLEPEVFLQWGKRKRLRCARIKDPEISERLCGGLRKKIASRSDRCVVSSSEKERTPLQPNRLTRNSEGVTTLRNGAGTSPSPEKEDRYYATRGSAA
TVVDENGRISTNGNGNGKAEERGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREKKSTKKRPTGLKAMGSMESDSE