; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10019531 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10019531
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionProtein of unknown function (DUF1639)
Genome locationChr04:22858048..22859796
RNA-Seq ExpressionHG10019531
SyntenyHG10019531
Gene Ontology termsNA
InterPro domainsIPR012438 - Protein of unknown function DUF1639


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008465311.1 PREDICTED: uncharacterized protein LOC103502965 [Cucumis melo]3.5e-10694.39Show/hide
Query:  METEVRNQNQRGCKPLEPEVFLQWGKRKRLRCPRNKDPEISERLCGSLRKKIGSRSDRCVISSSEKERIPLQPNRLTRNFEGVTSLRNGGAGTSPSPEKE
        METEVRNQNQRGCK LEPEVFLQWGKRKRLRCPRNKDPEISERLCGSLRKKIGSRS RCVIS+SEKERIPLQPNRLTRN EGVT+LRNGGAGT+PSPEKE
Subjt:  METEVRNQNQRGCKPLEPEVFLQWGKRKRLRCPRNKDPEISERLCGSLRKKIGSRSDRCVISSSEKERIPLQPNRLTRNFEGVTSLRNGGAGTSPSPEKE

Query:  DRYYATRGSAAAVVDENGHSHGHEERGGSFVLPKLLIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLTEMSQERYEVREKKTTKKRP
        DRYY+TRGS AAVVDEN    GHEERGGSFVLPKLLIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLTEMSQERYEVREKKTTKKRP
Subjt:  DRYYATRGSAAAVVDENGHSHGHEERGGSFVLPKLLIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLTEMSQERYEVREKKTTKKRP

Query:  TGLKAMGSMETDSE
        TGLKAMGSMETDSE
Subjt:  TGLKAMGSMETDSE

XP_011652319.1 uncharacterized protein LOC101204535 [Cucumis sativus]1.3e-10594.42Show/hide
Query:  METEVRNQNQRGCKPLEPEVFLQWGKRKRLRCPRNKDPEISERLCGSLRKKIGSRSDRCVISSSEKERIPLQPNRLTRNFEGVTSLRNGGAGTSPSPEKE
        METEVRNQNQRGCK LEPEVFLQWGKRKRLRCPRNKDPEISERLCGSLRKKIGSRSDRCVIS+SEKERIPLQPNRLTRN EGVT+LRNGGAGT+PSPEKE
Subjt:  METEVRNQNQRGCKPLEPEVFLQWGKRKRLRCPRNKDPEISERLCGSLRKKIGSRSDRCVISSSEKERIPLQPNRLTRNFEGVTSLRNGGAGTSPSPEKE

Query:  DRYYATRGSAAAVVDENGHSHGHEERGGSFVLPKLLIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLTEMSQERYEVREKKTTKKRP
        DRYY+TRGS AAVVDEN    GHEERGGSFVLPKLLIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLTEMSQERYEVREKKTTKKRP
Subjt:  DRYYATRGSAAAVVDENGHSHGHEERGGSFVLPKLLIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLTEMSQERYEVREKKTTKKRP

Query:  TGLKAM-GSMETDSE
        TGLKAM GSMETDSE
Subjt:  TGLKAM-GSMETDSE

XP_022974216.1 uncharacterized protein LOC111472840 [Cucurbita maxima]1.3e-9787.85Show/hide
Query:  METEVRNQNQRGCKPLEPEVFLQWGKRKRLRCPRNKDPEISERLCGSLRKKIGSRSDRCVISSSEKERIPLQPNRLTRNFEGVTSLRNGGAGTSPSPEKE
        METEVRNQNQ GCK LEPEVF+QW KRKRLRC R KDPEISERLCG LRKKIGSR DRCV+S+S+KER PLQPNRLTRN EGV +LRNGGAGT+PSPEKE
Subjt:  METEVRNQNQRGCKPLEPEVFLQWGKRKRLRCPRNKDPEISERLCGSLRKKIGSRSDRCVISSSEKERIPLQPNRLTRNFEGVTSLRNGGAGTSPSPEKE

Query:  DRYYATRGSAAAVVDENGHSHGHEERGGSFVLPKLLIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLTEMSQERYEVREKKTTKKRP
        DRYYATRGS AAVVDEN    GHEERGG +V PKLLIALSSKEKEEDFMAMKGCKLP RPKKRAKMIQRSLLLVSPGAWLTEMSQERYEV EKKTTKKRP
Subjt:  DRYYATRGSAAAVVDENGHSHGHEERGGSFVLPKLLIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLTEMSQERYEVREKKTTKKRP

Query:  TGLKAMGSMETDSE
        TGLKAMGSMETDSE
Subjt:  TGLKAMGSMETDSE

XP_038905228.1 uncharacterized protein LOC120091318 isoform X1 [Benincasa hispida]3.5e-10693.21Show/hide
Query:  METEVRNQNQRGCKPLEPEVFLQWGKRKRLRCPRNKDPEISERLCGSLRKKIGSRSDRCVISSSEKERIPLQPNRLTRNFEGVTSLRNGGAGTSPSPEKE
        METEVRNQNQRGCKPLEPEVFLQWGKRKRLRCPRNKDPEISERLCGSLRKKIGSRSDRCVISSSEKERIPLQPNRLTRN EGVT+LRNGGAGTSPSPEKE
Subjt:  METEVRNQNQRGCKPLEPEVFLQWGKRKRLRCPRNKDPEISERLCGSLRKKIGSRSDRCVISSSEKERIPLQPNRLTRNFEGVTSLRNGGAGTSPSPEKE

Query:  DRYYATRGSAAAVVDENGHSHGHEERGGSFVLPKLLIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLL-------LVSPGAWLTEMSQERYEVREK
        DRYYATRGS AAVVDEN    GHEERGGSFVLPKLLIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLL       LVSPGAWLTEMSQERYEVREK
Subjt:  DRYYATRGSAAAVVDENGHSHGHEERGGSFVLPKLLIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLL-------LVSPGAWLTEMSQERYEVREK

Query:  KTTKKRPTGLKAMGSMETDSE
        KTTKKRPTGLKAMGS ETDSE
Subjt:  KTTKKRPTGLKAMGSMETDSE

XP_038905231.1 uncharacterized protein LOC120091318 isoform X2 [Benincasa hispida]2.8e-10896.26Show/hide
Query:  METEVRNQNQRGCKPLEPEVFLQWGKRKRLRCPRNKDPEISERLCGSLRKKIGSRSDRCVISSSEKERIPLQPNRLTRNFEGVTSLRNGGAGTSPSPEKE
        METEVRNQNQRGCKPLEPEVFLQWGKRKRLRCPRNKDPEISERLCGSLRKKIGSRSDRCVISSSEKERIPLQPNRLTRN EGVT+LRNGGAGTSPSPEKE
Subjt:  METEVRNQNQRGCKPLEPEVFLQWGKRKRLRCPRNKDPEISERLCGSLRKKIGSRSDRCVISSSEKERIPLQPNRLTRNFEGVTSLRNGGAGTSPSPEKE

Query:  DRYYATRGSAAAVVDENGHSHGHEERGGSFVLPKLLIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLTEMSQERYEVREKKTTKKRP
        DRYYATRGS AAVVDEN    GHEERGGSFVLPKLLIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLTEMSQERYEVREKKTTKKRP
Subjt:  DRYYATRGSAAAVVDENGHSHGHEERGGSFVLPKLLIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLTEMSQERYEVREKKTTKKRP

Query:  TGLKAMGSMETDSE
        TGLKAMGS ETDSE
Subjt:  TGLKAMGSMETDSE

TrEMBL top hitse value%identityAlignment
A0A0A0LFM7 Uncharacterized protein6.4e-10694.42Show/hide
Query:  METEVRNQNQRGCKPLEPEVFLQWGKRKRLRCPRNKDPEISERLCGSLRKKIGSRSDRCVISSSEKERIPLQPNRLTRNFEGVTSLRNGGAGTSPSPEKE
        METEVRNQNQRGCK LEPEVFLQWGKRKRLRCPRNKDPEISERLCGSLRKKIGSRSDRCVIS+SEKERIPLQPNRLTRN EGVT+LRNGGAGT+PSPEKE
Subjt:  METEVRNQNQRGCKPLEPEVFLQWGKRKRLRCPRNKDPEISERLCGSLRKKIGSRSDRCVISSSEKERIPLQPNRLTRNFEGVTSLRNGGAGTSPSPEKE

Query:  DRYYATRGSAAAVVDENGHSHGHEERGGSFVLPKLLIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLTEMSQERYEVREKKTTKKRP
        DRYY+TRGS AAVVDEN    GHEERGGSFVLPKLLIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLTEMSQERYEVREKKTTKKRP
Subjt:  DRYYATRGSAAAVVDENGHSHGHEERGGSFVLPKLLIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLTEMSQERYEVREKKTTKKRP

Query:  TGLKAM-GSMETDSE
        TGLKAM GSMETDSE
Subjt:  TGLKAM-GSMETDSE

A0A1S4E569 uncharacterized protein LOC1035029651.7e-10694.39Show/hide
Query:  METEVRNQNQRGCKPLEPEVFLQWGKRKRLRCPRNKDPEISERLCGSLRKKIGSRSDRCVISSSEKERIPLQPNRLTRNFEGVTSLRNGGAGTSPSPEKE
        METEVRNQNQRGCK LEPEVFLQWGKRKRLRCPRNKDPEISERLCGSLRKKIGSRS RCVIS+SEKERIPLQPNRLTRN EGVT+LRNGGAGT+PSPEKE
Subjt:  METEVRNQNQRGCKPLEPEVFLQWGKRKRLRCPRNKDPEISERLCGSLRKKIGSRSDRCVISSSEKERIPLQPNRLTRNFEGVTSLRNGGAGTSPSPEKE

Query:  DRYYATRGSAAAVVDENGHSHGHEERGGSFVLPKLLIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLTEMSQERYEVREKKTTKKRP
        DRYY+TRGS AAVVDEN    GHEERGGSFVLPKLLIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLTEMSQERYEVREKKTTKKRP
Subjt:  DRYYATRGSAAAVVDENGHSHGHEERGGSFVLPKLLIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLTEMSQERYEVREKKTTKKRP

Query:  TGLKAMGSMETDSE
        TGLKAMGSMETDSE
Subjt:  TGLKAMGSMETDSE

A0A5A7UCC4 DUF1639 domain-containing protein1.7e-10694.39Show/hide
Query:  METEVRNQNQRGCKPLEPEVFLQWGKRKRLRCPRNKDPEISERLCGSLRKKIGSRSDRCVISSSEKERIPLQPNRLTRNFEGVTSLRNGGAGTSPSPEKE
        METEVRNQNQRGCK LEPEVFLQWGKRKRLRCPRNKDPEISERLCGSLRKKIGSRS RCVIS+SEKERIPLQPNRLTRN EGVT+LRNGGAGT+PSPEKE
Subjt:  METEVRNQNQRGCKPLEPEVFLQWGKRKRLRCPRNKDPEISERLCGSLRKKIGSRSDRCVISSSEKERIPLQPNRLTRNFEGVTSLRNGGAGTSPSPEKE

Query:  DRYYATRGSAAAVVDENGHSHGHEERGGSFVLPKLLIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLTEMSQERYEVREKKTTKKRP
        DRYY+TRGS AAVVDEN    GHEERGGSFVLPKLLIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLTEMSQERYEVREKKTTKKRP
Subjt:  DRYYATRGSAAAVVDENGHSHGHEERGGSFVLPKLLIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLTEMSQERYEVREKKTTKKRP

Query:  TGLKAMGSMETDSE
        TGLKAMGSMETDSE
Subjt:  TGLKAMGSMETDSE

A0A6J1F339 uncharacterized protein LOC1114392541.9e-9787.38Show/hide
Query:  METEVRNQNQRGCKPLEPEVFLQWGKRKRLRCPRNKDPEISERLCGSLRKKIGSRSDRCVISSSEKERIPLQPNRLTRNFEGVTSLRNGGAGTSPSPEKE
        METEVRNQNQ GCK LEPEVF+QW KRKRLRC R KDPEISERLCG LR KIGSR DRCV+S+S+KER PLQPNRLTRN EGV +LRNGGAGT+PSPEKE
Subjt:  METEVRNQNQRGCKPLEPEVFLQWGKRKRLRCPRNKDPEISERLCGSLRKKIGSRSDRCVISSSEKERIPLQPNRLTRNFEGVTSLRNGGAGTSPSPEKE

Query:  DRYYATRGSAAAVVDENGHSHGHEERGGSFVLPKLLIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLTEMSQERYEVREKKTTKKRP
        DRYYATRGS AAVVDE    HGHEERGG +V PKLLIALSSKEKEEDFMAMKGCKLP RPKKRAKMIQRSLLLVSPGAWLTEMSQERYEV EKKTTKKRP
Subjt:  DRYYATRGSAAAVVDENGHSHGHEERGGSFVLPKLLIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLTEMSQERYEVREKKTTKKRP

Query:  TGLKAMGSMETDSE
        TGLKAMGSMETDSE
Subjt:  TGLKAMGSMETDSE

A0A6J1IAR3 uncharacterized protein LOC1114728406.4e-9887.85Show/hide
Query:  METEVRNQNQRGCKPLEPEVFLQWGKRKRLRCPRNKDPEISERLCGSLRKKIGSRSDRCVISSSEKERIPLQPNRLTRNFEGVTSLRNGGAGTSPSPEKE
        METEVRNQNQ GCK LEPEVF+QW KRKRLRC R KDPEISERLCG LRKKIGSR DRCV+S+S+KER PLQPNRLTRN EGV +LRNGGAGT+PSPEKE
Subjt:  METEVRNQNQRGCKPLEPEVFLQWGKRKRLRCPRNKDPEISERLCGSLRKKIGSRSDRCVISSSEKERIPLQPNRLTRNFEGVTSLRNGGAGTSPSPEKE

Query:  DRYYATRGSAAAVVDENGHSHGHEERGGSFVLPKLLIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLTEMSQERYEVREKKTTKKRP
        DRYYATRGS AAVVDEN    GHEERGG +V PKLLIALSSKEKEEDFMAMKGCKLP RPKKRAKMIQRSLLLVSPGAWLTEMSQERYEV EKKTTKKRP
Subjt:  DRYYATRGSAAAVVDENGHSHGHEERGGSFVLPKLLIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLTEMSQERYEVREKKTTKKRP

Query:  TGLKAMGSMETDSE
        TGLKAMGSMETDSE
Subjt:  TGLKAMGSMETDSE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G55340.1 Protein of unknown function (DUF1639)8.1e-4553.05Show/hide
Query:  QNQRGCKPLEPEVFLQWGKRKRLRCPR-NKDPEISERL---CGSLRKKIGSRSDRCVISS-SEKERIPLQPNRLTRNFEGVTSLRNGGAGTSPSPEKEDR
        + QRG    E +  LQWG+RKR+RC +  KD  ++      C + RK I     R V S      R   +PN++T       SL N       SPEKEDR
Subjt:  QNQRGCKPLEPEVFLQWGKRKRLRCPR-NKDPEISERL---CGSLRKKIGSRSDRCVISS-SEKERIPLQPNRLTRNFEGVTSLRNGGAGTSPSPEKEDR

Query:  YYATRGSAAAVVDENGHSHGHE-ERGGSFVLPKLLIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLTEMSQERYEVREKKTTKKRPT
        YY TRGS    +DE+G       +     V PKL IALS+KEKEEDF+AMKGCKLPQRPKKRAK++Q++LLLVSPGAWL+++ +ERYEVREKKT+KKRP 
Subjt:  YYATRGSAAAVVDENGHSHGHE-ERGGSFVLPKLLIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLTEMSQERYEVREKKTTKKRPT

Query:  GLKAMGSMETDSE
        GLKAMGSME+DSE
Subjt:  GLKAMGSMETDSE

AT1G55340.2 Protein of unknown function (DUF1639)6.9e-4452.36Show/hide
Query:  QNQRGCKPLEPEVFLQWGKRKRLRCPR-NKDPEISERL---CGSLRKKIGSRSDRCVISSSEKERIPLQPNRLTRNFEGVTSLRNGGAGTSPSPEKEDRY
        + QRG    E +  LQWG+RKR+RC +  KD  ++      C + RK I     R V  SSE+       NR  ++F               SPEKEDRY
Subjt:  QNQRGCKPLEPEVFLQWGKRKRLRCPR-NKDPEISERL---CGSLRKKIGSRSDRCVISSSEKERIPLQPNRLTRNFEGVTSLRNGGAGTSPSPEKEDRY

Query:  YATRGSAAAVVDENGHSHGHE-ERGGSFVLPKLLIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLTEMSQERYEVREKKTTKKRPTG
        Y TRGS    +DE+G       +     V PKL IALS+KEKEEDF+AMKGCKLPQRPKKRAK++Q++LLLVSPGAWL+++ +ERYEVREKKT+KKRP G
Subjt:  YATRGSAAAVVDENGHSHGHE-ERGGSFVLPKLLIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLTEMSQERYEVREKKTTKKRPTG

Query:  LKAMGSMETDSE
        LKAMGSME+DSE
Subjt:  LKAMGSMETDSE

AT3G03880.1 Protein of unknown function (DUF1639)4.6e-4050.72Show/hide
Query:  LEPEVFLQWGKRKRLRCPRNKDPEISERLCGSLRKKIGSRSDRCVISSSEKERIPLQPNRLTRNFEGVTSLRNG-GAGTSPSPEKEDRYYATRGSAAAVV
        LE E+FLQWG +KRLRC R K  +IS       R K  SR          ++ + LQ +R +R  EG   LR+G      PSPEKE+RYY TRG    + 
Subjt:  LEPEVFLQWGKRKRLRCPRNKDPEISERLCGSLRKKIGSRSDRCVISSSEKERIPLQPNRLTRNFEGVTSLRNG-GAGTSPSPEKEDRYYATRGSAAAVV

Query:  DE------NGHSHGHEERGGSFVLPKLLIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLTEMSQERYEVREKKTTKK-RPTGLKAMG
         +      NG    ++E     + PKL I LS+KEKEEDFMAMKGCK   RPKKRAK+IQRSLLLVSPG WL ++  +RY+VR KK++KK R  GLKAMG
Subjt:  DE------NGHSHGHEERGGSFVLPKLLIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLTEMSQERYEVREKKTTKK-RPTGLKAMG

Query:  SMETDSE
        +METDS+
Subjt:  SMETDSE

AT4G20300.1 Protein of unknown function (DUF1639)1.9e-1739.57Show/hide
Query:  NFEGVTSLRNGGAGTS-PSPEKEDRYYATRGSAAAVVDENGHSH-----------GHEE-------RGGSFVLPKLLIALSSKEKEEDFMAMKGCKLPQR
        N  G    R+GG+  S PSP++ ++  + R       D     H           GH+E          +   P++ IALS KEKEEDF+ MKG KLP R
Subjt:  NFEGVTSLRNGGAGTS-PSPEKEDRYYATRGSAAAVVDENGHSH-----------GHEE-------RGGSFVLPKLLIALSSKEKEEDFMAMKGCKLPQR

Query:  PKKRAKMIQRSLLLVSPGAWLTEMSQERYEVREKKTTKK
        P+KRAK I ++L    PG WL+++++ RYEVREKK  KK
Subjt:  PKKRAKMIQRSLLLVSPGAWLTEMSQERYEVREKKTTKK

AT4G20300.2 Protein of unknown function (DUF1639)6.2e-2140.76Show/hide
Query:  NFEGVTSLRNGGAGTS-PSPEKEDRYYATRGSAAAVVDENGHSH-----------GHEE-------RGGSFVLPKLLIALSSKEKEEDFMAMKGCKLPQR
        N  G    R+GG+  S PSP++ ++  + R       D     H           GH+E          +   P++ IALS KEKEEDF+ MKG KLP R
Subjt:  NFEGVTSLRNGGAGTS-PSPEKEDRYYATRGSAAAVVDENGHSH-----------GHEE-------RGGSFVLPKLLIALSSKEKEEDFMAMKGCKLPQR

Query:  PKKRAKMIQRSLLLVSPGAWLTEMSQERYEVREKKTTKK--RPTGLKAMGSMETDSE
        P+KRAK I ++L    PG WL+++++ RYEVREKK  KK  +  GLK M +M+TDSE
Subjt:  PKKRAKMIQRSLLLVSPGAWLTEMSQERYEVREKKTTKK--RPTGLKAMGSMETDSE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGCAGCATAAAAACGAGATTGAAGACGAAGACGAAGAAGAAGAAGAGAAGAGAGAGAGAGAAAACCAGAAACAGCAACAACAAAGAAGACGACCGCCATAGCCATAG
CTCAGTCTTCAATAAGCGCTGCAGCTCCAAACTCACTCATCCACGCCTTCTCTCTTATATGGAGACGGAGGTGAGGAATCAGAATCAGAGAGGGTGTAAACCCCTGGAAC
CGGAGGTTTTCTTACAGTGGGGAAAGAGGAAGCGACTGCGATGCCCTAGAAACAAAGACCCAGAGATCTCCGAGCGATTGTGCGGCAGTCTACGGAAGAAAATCGGATCT
CGGAGCGATCGCTGTGTGATTTCTTCTTCGGAGAAAGAAAGGATCCCACTCCAACCAAATCGTCTAACTAGGAATTTTGAGGGCGTTACTTCGCTGCGTAACGGTGGTGC
AGGCACATCGCCGTCGCCAGAGAAGGAAGACCGTTACTACGCCACCAGAGGATCCGCGGCGGCGGTAGTGGACGAGAACGGCCACAGCCACGGCCATGAGGAGAGAGGAG
GCAGCTTTGTTTTACCAAAGCTGTTGATTGCTTTGTCAAGCAAAGAAAAGGAAGAAGACTTCATGGCCATGAAAGGCTGCAAGCTCCCACAAAGGCCCAAAAAGAGAGCC
AAGATGATTCAGAGAAGCTTACTTCTGGTGAGCCCTGGTGCATGGCTGACTGAAATGAGCCAAGAAAGATATGAAGTTAGGGAAAAGAAGACAACAAAGAAGAGGCCAAC
AGGGCTGAAAGCCATGGGAAGTATGGAAACTGATTCAGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGCGCAGCATAAAAACGAGATTGAAGACGAAGACGAAGAAGAAGAAGAGAAGAGAGAGAGAGAAAACCAGAAACAGCAACAACAAAGAAGACGACCGCCATAGCCATAG
CTCAGTCTTCAATAAGCGCTGCAGCTCCAAACTCACTCATCCACGCCTTCTCTCTTATATGGAGACGGAGGTGAGGAATCAGAATCAGAGAGGGTGTAAACCCCTGGAAC
CGGAGGTTTTCTTACAGTGGGGAAAGAGGAAGCGACTGCGATGCCCTAGAAACAAAGACCCAGAGATCTCCGAGCGATTGTGCGGCAGTCTACGGAAGAAAATCGGATCT
CGGAGCGATCGCTGTGTGATTTCTTCTTCGGAGAAAGAAAGGATCCCACTCCAACCAAATCGTCTAACTAGGAATTTTGAGGGCGTTACTTCGCTGCGTAACGGTGGTGC
AGGCACATCGCCGTCGCCAGAGAAGGAAGACCGTTACTACGCCACCAGAGGATCCGCGGCGGCGGTAGTGGACGAGAACGGCCACAGCCACGGCCATGAGGAGAGAGGAG
GCAGCTTTGTTTTACCAAAGCTGTTGATTGCTTTGTCAAGCAAAGAAAAGGAAGAAGACTTCATGGCCATGAAAGGCTGCAAGCTCCCACAAAGGCCCAAAAAGAGAGCC
AAGATGATTCAGAGAAGCTTACTTCTGGTGAGCCCTGGTGCATGGCTGACTGAAATGAGCCAAGAAAGATATGAAGTTAGGGAAAAGAAGACAACAAAGAAGAGGCCAAC
AGGGCTGAAAGCCATGGGAAGTATGGAAACTGATTCAGAATGA
Protein sequenceShow/hide protein sequence
MRSIKTRLKTKTKKKKRREREKTRNSNNKEDDRHSHSSVFNKRCSSKLTHPRLLSYMETEVRNQNQRGCKPLEPEVFLQWGKRKRLRCPRNKDPEISERLCGSLRKKIGS
RSDRCVISSSEKERIPLQPNRLTRNFEGVTSLRNGGAGTSPSPEKEDRYYATRGSAAAVVDENGHSHGHEERGGSFVLPKLLIALSSKEKEEDFMAMKGCKLPQRPKKRA
KMIQRSLLLVSPGAWLTEMSQERYEVREKKTTKKRPTGLKAMGSMETDSE