CuGenDBv2

Sed0002323 (gene) of Chayote v1 genome

Gene ID: Sed0002323
Organism: Sechium edule (Chayote v1)
Description: UPF0114 domain-containing protein
Genome location: LG10:28649579..28651082
RNA-Seq Expression: Sed0002323
Synteny: Sed0002323
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR005134 - Uncharacterised protein family UPF0114
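
The genome location above is given as a sequence name followed by a start..end range. The sketch below parses such a string and computes the span length. It assumes the coordinates are 1-based and inclusive, which the page does not state; the helper names `parse_location` and `span_length` are illustrative and not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split 'NAME:START..END' into (sequence name, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end coordinate precedes start coordinate")
    return end - start + 1


name, start, end = parse_location("LG10:28649579..28651082")
print(name, span_length(start, end))  # LG10 1504
```

Under that assumption the gene spans 1504 bp on LG10. If the database used 0-based half-open coordinates instead, the span would be one base shorter.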


Homology
GenBank top hits
XP_022144307.1 uncharacterized protein LOC111014021 isoform X1 [Momordica charantia] | e-value: 6.5e-103 | % identity: 74.65
Query:  MAAFRLLRPVQTSASVVSSSSPPSTAVTVRCLTKTGLNSNNGERFVTSGDGERRQIAALKAAVAAAPETVETKNRELDLVSLLVNVLRQLKTALVKTKIR
        MAA R  RP++ SASVVSSS+ PS A TVRC+ +T    NNGER +TSGDGERR++  +KAAV AAPETV+TK RELDL SLL N+L +LKTA+ KTK  
Subjt:  MAAFRLLRPVQTSASVVSSSSPPSTAVTVRCLTKTGLNSNNGERFVTSGDGERRQIAALKAAVAAAPETVETKNRELDLVSLLVNVLRQLKTALVKTKIR

Query:  RLQIQKLIEKIIIDCRFFTLFAVAGSLMGSILCFLEGSFIVAESYLQYFNGLSQRSDQSHTMELLIEALDMFLVGTGLVVFGVGLFVMFIGPEKTKEKKG
           IQ  IEK IIDCRFFTLFAVAGSL+GSILC+LEGSFIVAESYLQYF+GLSQ+SDQ+HT+ELLI+A+DMFLVGT L VFGVGLF MF+GPEK KE+  
Subjt:  RLQIQKLIEKIIIDCRFFTLFAVAGSLMGSILCFLEGSFIVAESYLQYFNGLSQRSDQSHTMELLIEALDMFLVGTGLVVFGVGLFVMFIGPEKTKEKKG

Query:  RWICGSNLCGLFYMKKIPRWVAMESVSEAKSKIGHAVVLILQVGVLEKFKSVPLSSAADLACFAAAVLISSASIFFLSRLNGGG
         W  GSNL GLFYMKK+P WV MESVS  KSKIGHAVV+ILQVGVLEKFKS+PL+SAADLACFAAAVLISSASIFFLS+LN GG
Subjt:  RWICGSNLCGLFYMKKIPRWVAMESVSEAKSKIGHAVVLILQVGVLEKFKSVPLSSAADLACFAAAVLISSASIFFLSRLNGGG

XP_022951147.1 uncharacterized protein LOC111454079 [Cucurbita moschata] | e-value: 3.1e-113 | % identity: 80.9
Query:  MAAFRLLRPVQTSASVV--SSSSPPSTAVTVRCLTKTGLNSNNGERFVTSGDGERRQIAALK--AAVAAAPETVETKNRELDLVSLLVNVLRQLKTALVK
        MAA RLLR V+ SA+VV  SSSS PSTA TVRCL KTGLNS NGER VTSGDGERRQI  LK  AA AAAPETVET+ RELDL SLL N+L QLK   VK
Subjt:  MAAFRLLRPVQTSASVV--SSSSPPSTAVTVRCLTKTGLNSNNGERFVTSGDGERRQIAALK--AAVAAAPETVETKNRELDLVSLLVNVLRQLKTALVK

Query:  TKIRRLQIQKLIEKIIIDCRFFTLFAVAGSLMGSILCFLEGSFIVAESYLQYFNGLSQRSDQSHTMELLIEALDMFLVGTGLVVFGVGLFVMFIGPEKTK
        TKIRR QIQK IEKIIIDCRFFTLFAVAGSL+GSILCFLEGSFIVAESYLQYFNG+S+RSD+SH +ELLIE+LDMFLVGT LVVFGVGLF MF+G EK  
Subjt:  TKIRRLQIQKLIEKIIIDCRFFTLFAVAGSLMGSILCFLEGSFIVAESYLQYFNGLSQRSDQSHTMELLIEALDMFLVGTGLVVFGVGLFVMFIGPEKTK

Query:  EKKGRWICGSNLCGLFYMKKIPRWVAMESVSEAKSKIGHAVVLILQVGVLEKFKSVPLSSAADLACFAAAVLISSASIFFLSRLNGGG
        EK  RW+ GSNL GLFYMK IP WV MESVSEAKSKIGHAV++ILQVGVLEKFKS+PLSSA DLACFAAA+LISSASIFFLSRLN GG
Subjt:  EKKGRWICGSNLCGLFYMKKIPRWVAMESVSEAKSKIGHAVVLILQVGVLEKFKSVPLSSAADLACFAAAVLISSASIFFLSRLNGGG

XP_023002007.1 uncharacterized protein LOC111496020 [Cucurbita maxima] | e-value: 2.5e-110 | % identity: 79.17
Query:  MAAFRLLRPVQTSASVV-SSSSPPSTAVTVRCLTKTGLNSNNGERFVTSGDGERRQIAALK---AAVAAAPETVETKNRELDLVSLLVNVLRQLKTALVK
        MAA RLLR V+ SA+VV SSSS PSTA  VRCL KTGLNS NGER +TSGDGERRQI  LK   AA AAAPETVETK RELDL SLL N+L QLK   VK
Subjt:  MAAFRLLRPVQTSASVV-SSSSPPSTAVTVRCLTKTGLNSNNGERFVTSGDGERRQIAALK---AAVAAAPETVETKNRELDLVSLLVNVLRQLKTALVK

Query:  TKIRRLQIQKLIEKIIIDCRFFTLFAVAGSLMGSILCFLEGSFIVAESYLQYFNGLSQRSDQSHTMELLIEALDMFLVGTGLVVFGVGLFVMFIGPEKTK
         KIRR QIQK IEKIII+CRFFTLFAVAGSL+GSILCFLEGSFIVAESYLQYFN +S+RSD+SH +ELLIE+LDMFLVGT LVVFGVGLF MF+G EK  
Subjt:  TKIRRLQIQKLIEKIIIDCRFFTLFAVAGSLMGSILCFLEGSFIVAESYLQYFNGLSQRSDQSHTMELLIEALDMFLVGTGLVVFGVGLFVMFIGPEKTK

Query:  EKKGRWICGSNLCGLFYMKKIPRWVAMESVSEAKSKIGHAVVLILQVGVLEKFKSVPLSSAADLACFAAAVLISSASIFFLSRLNGGG
        EK  RW+ GSNL GLFYMK IP WV MESVSEAKSKIGHAV++ILQVGVLEK KS+PLSSAADLACFAAA+LI SASIFFLSRLN GG
Subjt:  EKKGRWICGSNLCGLFYMKKIPRWVAMESVSEAKSKIGHAVVLILQVGVLEKFKSVPLSSAADLACFAAAVLISSASIFFLSRLNGGG

XP_023538418.1 uncharacterized protein LOC111799204 [Cucurbita pepo subsp. pepo] | e-value: 1.7e-111 | % identity: 79.51
Query:  MAAFRLLRPVQTSASVV--SSSSPPSTAVTVRCLTKTGLNSNNGERFVTSGDGERRQIAALK--AAVAAAPETVETKNRELDLVSLLVNVLRQLKTALVK
        MAA RL R V+ SA+VV  SSSS PSTA TVRCL KTGLNS NGER +TSGDGER+    LK  AA AAAPETVETK RELDL SLL N+L QLK  +VK
Subjt:  MAAFRLLRPVQTSASVV--SSSSPPSTAVTVRCLTKTGLNSNNGERFVTSGDGERRQIAALK--AAVAAAPETVETKNRELDLVSLLVNVLRQLKTALVK

Query:  TKIRRLQIQKLIEKIIIDCRFFTLFAVAGSLMGSILCFLEGSFIVAESYLQYFNGLSQRSDQSHTMELLIEALDMFLVGTGLVVFGVGLFVMFIGPEKTK
        TKIRR QIQK IEKIIIDCRFFTLFAVAGSL+GSILCFLEGSFIVAESYLQYFNG+S+RSD+SH +ELLIE+LDMFLVGT LVVFGVGLF MF+G EK  
Subjt:  TKIRRLQIQKLIEKIIIDCRFFTLFAVAGSLMGSILCFLEGSFIVAESYLQYFNGLSQRSDQSHTMELLIEALDMFLVGTGLVVFGVGLFVMFIGPEKTK

Query:  EKKGRWICGSNLCGLFYMKKIPRWVAMESVSEAKSKIGHAVVLILQVGVLEKFKSVPLSSAADLACFAAAVLISSASIFFLSRLNGGG
        EK  RW+ GSNL GLFYMK IP WV MESVSEAKSKIGHAV++ILQVGVLEKFKS+PLSSAADLACFA A+LISSASIFFLSRLN GG
Subjt:  EKKGRWICGSNLCGLFYMKKIPRWVAMESVSEAKSKIGHAVVLILQVGVLEKFKSVPLSSAADLACFAAAVLISSASIFFLSRLNGGG

XP_038885641.1 uncharacterized protein LOC120075956 [Benincasa hispida] | e-value: 7.9e-101 | % identity: 75.18
Query:  LRPVQTSASVVSSSSPPSTAVTVRCLTKTGLNSNNGERFVTSGDGERRQIAALKAAVAAAPETVETKNRELDLVSLLVNVLRQLKTALVKTKIRRLQIQK
        LRP  ++ S  SSSS PS+A+TVRCL KTGLN NNGER +TSGDGER+QI A+KA  AAAP+TVET+  EL+L SLL N+L QLKT + KTKI+R QIQK
Subjt:  LRPVQTSASVVSSSSPPSTAVTVRCLTKTGLNSNNGERFVTSGDGERRQIAALKAAVAAAPETVETKNRELDLVSLLVNVLRQLKTALVKTKIRRLQIQK

Query:  LIEKIIIDCRFFTLFAVAGSLMGSILCFLEGSFIVAESYLQYFNGLSQRSDQSHTMELLIEALDMFLVGTGLVVFGVGLFVMFIGPEKTKEKKGRWICGS
         IEKIIIDCRFFTL AVAGSL+GSILC++EGSFIVAESYLQYF+GLSQ S+Q+HT+ELLIEALDMFLVGT LVVFGVGLF MFIG  K KEK    I GS
Subjt:  LIEKIIIDCRFFTLFAVAGSLMGSILCFLEGSFIVAESYLQYFNGLSQRSDQSHTMELLIEALDMFLVGTGLVVFGVGLFVMFIGPEKTKEKKGRWICGS

Query:  NLCGLFYMKKIPRWVAMESVSEAKSKIGHAVVLILQVGVLEKFKSVPLSSAADLACFAAAVLISSASIFFLSRLNGGG
        N  GLF MKKIP WV MES+S+AKSKIGHAV++ILQVGVLEKFK++PLSSA DLACFAAAV++SSASIFFLS+LN GG
Subjt:  NLCGLFYMKKIPRWVAMESVSEAKSKIGHAVVLILQVGVLEKFKSVPLSSAADLACFAAAVLISSASIFFLSRLNGGG

TrEMBL top hits
A0A0A0LLC9 Uncharacterized protein | e-value: 4.3e-100 | % identity: 71.78
Query:  MAAFRLLRPVQTSASV--VSSSSPPSTAVTVRCLTKTGLNSNNGERFVTSGDGERRQIAALKAAVA-AAPETVETKNRELDLVSLLVNVLRQLKTALVKT
        MAA R ++ V+ +A+V   SSSS PS+   VR L KTGLN NNGER +TSG  ERRQ+  +KAA A AAP+TVETK  ELDL SL+ N+L QLK  L KT
Subjt:  MAAFRLLRPVQTSASV--VSSSSPPSTAVTVRCLTKTGLNSNNGERFVTSGDGERRQIAALKAAVA-AAPETVETKNRELDLVSLLVNVLRQLKTALVKT

Query:  KIRRLQIQKLIEKIIIDCRFFTLFAVAGSLMGSILCFLEGSFIVAESYLQYFNGLSQRSDQSHTMELLIEALDMFLVGTGLVVFGVGLFVMFIGPEKTKE
        KI++ +IQK IEKIIIDCRFFTL AV+GSLMGSILC++EGSFIV ESYLQYF+GLSQR+DQ+HT+ELLIEALDMFLVGT L+VFG+GLF MF+G EK K+
Subjt:  KIRRLQIQKLIEKIIIDCRFFTLFAVAGSLMGSILCFLEGSFIVAESYLQYFNGLSQRSDQSHTMELLIEALDMFLVGTGLVVFGVGLFVMFIGPEKTKE

Query:  KKGRWICGSNLCGLFYMKKIPRWVAMESVSEAKSKIGHAVVLILQVGVLEKFKSVPLSSAADLACFAAAVLISSASIFFLSRLNGGG
        K  +W   SNL GLFYMKKIP WV MES+S AKSKIGHAV++ILQVGVLEKFK++PLSSA DLACFAAAVLISSASIFFLS+LN GG
Subjt:  KKGRWICGSNLCGLFYMKKIPRWVAMESVSEAKSKIGHAVVLILQVGVLEKFKSVPLSSAADLACFAAAVLISSASIFFLSRLNGGG

A0A5A7VG09 UPF0114 domain-containing protein | e-value: 1.2e-99 | % identity: 70.99
Query:  MAAFRLLRPVQTSASV--VSSSSPPSTAVTVRCLTKTGLNSNNGERFVTSGDGERRQIAALKAAVAAAPETVETKNRELDLVSLLVNVLRQLKTALVKTK
        MAA R ++ V+ +A+V   SSSS PS+   VR L KTGLN NNGER +TSG GE RQ+ A+KAA   AP+TVETK  ELDL SL+ ++L QLKT L KTK
Subjt:  MAAFRLLRPVQTSASV--VSSSSPPSTAVTVRCLTKTGLNSNNGERFVTSGDGERRQIAALKAAVAAAPETVETKNRELDLVSLLVNVLRQLKTALVKTK

Query:  IRRLQIQKLIEKIIIDCRFFTLFAVAGSLMGSILCFLEGSFIVAESYLQYFNGLSQRSDQSHTMELLIEALDMFLVGTGLVVFGVGLFVMFIGPEKTKEK
        I++ +IQK IEKIIIDCRFFTL AV+GSLMGSILC++EGSFIVAESYLQYF+ LSQR++Q+HT+ELLIEALDMFLVGT LVVFG+GLF MF+G EK KEK
Subjt:  IRRLQIQKLIEKIIIDCRFFTLFAVAGSLMGSILCFLEGSFIVAESYLQYFNGLSQRSDQSHTMELLIEALDMFLVGTGLVVFGVGLFVMFIGPEKTKEK

Query:  KGRWICGSNLCGLFYMKKIPRWVAMESVSEAKSKIGHAVVLILQVGVLEKFKSVPLSSAADLACFAAAVLISSASIFFLSRLN----GGGAYK
          +WI  SNL GLFYMKKIP WV MES+S AKSKIGHAV++ILQVGVLEKFK++PLSSA DLACFAAAVLISSASIFFLS+LN    G G +K
Subjt:  KGRWICGSNLCGLFYMKKIPRWVAMESVSEAKSKIGHAVVLILQVGVLEKFKSVPLSSAADLACFAAAVLISSASIFFLSRLN----GGGAYK

A0A6J1CTB3 uncharacterized protein LOC111014021 isoform X1 | e-value: 3.1e-103 | % identity: 74.65
Query:  MAAFRLLRPVQTSASVVSSSSPPSTAVTVRCLTKTGLNSNNGERFVTSGDGERRQIAALKAAVAAAPETVETKNRELDLVSLLVNVLRQLKTALVKTKIR
        MAA R  RP++ SASVVSSS+ PS A TVRC+ +T    NNGER +TSGDGERR++  +KAAV AAPETV+TK RELDL SLL N+L +LKTA+ KTK  
Subjt:  MAAFRLLRPVQTSASVVSSSSPPSTAVTVRCLTKTGLNSNNGERFVTSGDGERRQIAALKAAVAAAPETVETKNRELDLVSLLVNVLRQLKTALVKTKIR

Query:  RLQIQKLIEKIIIDCRFFTLFAVAGSLMGSILCFLEGSFIVAESYLQYFNGLSQRSDQSHTMELLIEALDMFLVGTGLVVFGVGLFVMFIGPEKTKEKKG
           IQ  IEK IIDCRFFTLFAVAGSL+GSILC+LEGSFIVAESYLQYF+GLSQ+SDQ+HT+ELLI+A+DMFLVGT L VFGVGLF MF+GPEK KE+  
Subjt:  RLQIQKLIEKIIIDCRFFTLFAVAGSLMGSILCFLEGSFIVAESYLQYFNGLSQRSDQSHTMELLIEALDMFLVGTGLVVFGVGLFVMFIGPEKTKEKKG

Query:  RWICGSNLCGLFYMKKIPRWVAMESVSEAKSKIGHAVVLILQVGVLEKFKSVPLSSAADLACFAAAVLISSASIFFLSRLNGGG
         W  GSNL GLFYMKK+P WV MESVS  KSKIGHAVV+ILQVGVLEKFKS+PL+SAADLACFAAAVLISSASIFFLS+LN GG
Subjt:  RWICGSNLCGLFYMKKIPRWVAMESVSEAKSKIGHAVVLILQVGVLEKFKSVPLSSAADLACFAAAVLISSASIFFLSRLNGGG

A0A6J1GGV4 uncharacterized protein LOC111454079 | e-value: 1.5e-113 | % identity: 80.9
Query:  MAAFRLLRPVQTSASVV--SSSSPPSTAVTVRCLTKTGLNSNNGERFVTSGDGERRQIAALK--AAVAAAPETVETKNRELDLVSLLVNVLRQLKTALVK
        MAA RLLR V+ SA+VV  SSSS PSTA TVRCL KTGLNS NGER VTSGDGERRQI  LK  AA AAAPETVET+ RELDL SLL N+L QLK   VK
Subjt:  MAAFRLLRPVQTSASVV--SSSSPPSTAVTVRCLTKTGLNSNNGERFVTSGDGERRQIAALK--AAVAAAPETVETKNRELDLVSLLVNVLRQLKTALVK

Query:  TKIRRLQIQKLIEKIIIDCRFFTLFAVAGSLMGSILCFLEGSFIVAESYLQYFNGLSQRSDQSHTMELLIEALDMFLVGTGLVVFGVGLFVMFIGPEKTK
        TKIRR QIQK IEKIIIDCRFFTLFAVAGSL+GSILCFLEGSFIVAESYLQYFNG+S+RSD+SH +ELLIE+LDMFLVGT LVVFGVGLF MF+G EK  
Subjt:  TKIRRLQIQKLIEKIIIDCRFFTLFAVAGSLMGSILCFLEGSFIVAESYLQYFNGLSQRSDQSHTMELLIEALDMFLVGTGLVVFGVGLFVMFIGPEKTK

Query:  EKKGRWICGSNLCGLFYMKKIPRWVAMESVSEAKSKIGHAVVLILQVGVLEKFKSVPLSSAADLACFAAAVLISSASIFFLSRLNGGG
        EK  RW+ GSNL GLFYMK IP WV MESVSEAKSKIGHAV++ILQVGVLEKFKS+PLSSA DLACFAAA+LISSASIFFLSRLN GG
Subjt:  EKKGRWICGSNLCGLFYMKKIPRWVAMESVSEAKSKIGHAVVLILQVGVLEKFKSVPLSSAADLACFAAAVLISSASIFFLSRLNGGG

A0A6J1KI88 uncharacterized protein LOC111496020 | e-value: 1.2e-110 | % identity: 79.17
Query:  MAAFRLLRPVQTSASVV-SSSSPPSTAVTVRCLTKTGLNSNNGERFVTSGDGERRQIAALK---AAVAAAPETVETKNRELDLVSLLVNVLRQLKTALVK
        MAA RLLR V+ SA+VV SSSS PSTA  VRCL KTGLNS NGER +TSGDGERRQI  LK   AA AAAPETVETK RELDL SLL N+L QLK   VK
Subjt:  MAAFRLLRPVQTSASVV-SSSSPPSTAVTVRCLTKTGLNSNNGERFVTSGDGERRQIAALK---AAVAAAPETVETKNRELDLVSLLVNVLRQLKTALVK

Query:  TKIRRLQIQKLIEKIIIDCRFFTLFAVAGSLMGSILCFLEGSFIVAESYLQYFNGLSQRSDQSHTMELLIEALDMFLVGTGLVVFGVGLFVMFIGPEKTK
         KIRR QIQK IEKIII+CRFFTLFAVAGSL+GSILCFLEGSFIVAESYLQYFN +S+RSD+SH +ELLIE+LDMFLVGT LVVFGVGLF MF+G EK  
Subjt:  TKIRRLQIQKLIEKIIIDCRFFTLFAVAGSLMGSILCFLEGSFIVAESYLQYFNGLSQRSDQSHTMELLIEALDMFLVGTGLVVFGVGLFVMFIGPEKTK

Query:  EKKGRWICGSNLCGLFYMKKIPRWVAMESVSEAKSKIGHAVVLILQVGVLEKFKSVPLSSAADLACFAAAVLISSASIFFLSRLNGGG
        EK  RW+ GSNL GLFYMK IP WV MESVSEAKSKIGHAV++ILQVGVLEK KS+PLSSAADLACFAAA+LI SASIFFLSRLN GG
Subjt:  EKKGRWICGSNLCGLFYMKKIPRWVAMESVSEAKSKIGHAVVLILQVGVLEKFKSVPLSSAADLACFAAAVLISSASIFFLSRLNGGG

SwissProt top hits
No hits found

Arabidopsis top hits
AT4G19390.1 Uncharacterised protein family (UPF0114) | e-value: 1.6e-35 | % identity: 41.44
Query:  IQKLIEKIIIDCRFFTLFAVAGSLMGSILCFLEGSFIVAESYLQYFNGLSQRSDQSHTMELLIEALDMFLVGTGLVVFGVGLFVMFIGPEKTKEKKGRWI
        +++ IEK+I  CRF T     GSL+GS+LCF++G   V +S+LQY        ++   + LL+EA+D++L+GT ++VFG+GL+ +FI    T E +   I
Subjt:  IQKLIEKIIIDCRFFTLFAVAGSLMGSILCFLEGSFIVAESYLQYFNGLSQRSDQSHTMELLIEALDMFLVGTGLVVFGVGLFVMFIGPEKTKEKKGRWI

Query:  CG--SNLCGLFYMKKIPRWVAMESVSEAKSKIGHAVVLILQVGVLEKFKSVPLSSAADLACFAAAVLISSASIFFLSRLNG
            S+L G+F +K+ P+W+ ++SVSE K+K+GH +V++L +G+ +K K V ++S  DL C + ++  SSA +F LSRLNG
Subjt:  CG--SNLCGLFYMKKIPRWVAMESVSEAKSKIGHAVVLILQVGVLEKFKSVPLSSAADLACFAAAVLISSASIFFLSRLNG

AT5G13720.1 Uncharacterised protein family (UPF0114) | e-value: 9.2e-31 | % identity: 37.64
Query:  IEKIIIDCRFFTLFAVAGSLMGSILCFLEGSFIVAESYLQYFNGLSQRSDQSHTMELLIEALDMFLVGTGLVVFGVGLFVMFI--GPEKTKEKKGRWICG
        +E+II D RF  L AV GSL GS+LCFL G   + E+Y  Y+   S+       +  L+EA+D++L GT +++F +GL+ +FI   P     +  R +  
Subjt:  IEKIIIDCRFFTLFAVAGSLMGSILCFLEGSFIVAESYLQYFNGLSQRSDQSHTMELLIEALDMFLVGTGLVVFGVGLFVMFI--GPEKTKEKKGRWICG

Query:  SNLCGLFYMKKIPRWVAMESVSEAKSKIGHAVVLILQVGVLEKFKSVPLSSAADLACFAAAVLISSASIFFLSRLNGG
        S+L G+F MK+ P+W+ + S+ E K+K+GH +V+IL V + E+ K V +++  DL  ++  + +SSAS++ L  L+ G
Subjt:  SNLCGLFYMKKIPRWVAMESVSEAKSKIGHAVVLILQVGVLEKFKSVPLSSAADLACFAAAVLISSASIFFLSRLNGG
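
The unlabeled middle row of each alignment block above follows the usual BLAST-style convention: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. As a rough sketch, percent identity can be estimated from those rows by counting letters and dividing by the aligned length. The page does not say how its reported values were calculated, so this is an assumption, and the function name `percent_identity` is hypothetical. The rows are expected with the leading `Query:` label and indentation already stripped.

```python
def percent_identity(query_rows: list[str], midline_rows: list[str]) -> float:
    """Estimate % identity from BLAST-style query rows and their midline rows.

    Letters in a midline row count as identities; '+' and spaces do not.
    Gap columns ('-' in the query row) count towards the aligned length.
    """
    identical = 0
    aligned = 0
    for query, midline in zip(query_rows, midline_rows):
        # Trailing spaces of a midline are often lost in extraction, so pad it
        # back to the width of the query row before counting.
        midline = midline.ljust(len(query))[: len(query)]
        aligned += len(query)
        identical += sum(ch.isalpha() for ch in midline)
    if aligned == 0:
        raise ValueError("no aligned columns supplied")
    return 100.0 * identical / aligned


# First ten columns of the Momordica charantia alignment shown above.
print(percent_identity(["MAAFRLLRPV"], ["MAA R  RP+"]))  # 60.0
```

Passing every query row and midline row of one hit gives the figure for the whole alignment; it may differ slightly from the listed value if the database rounds or counts columns differently.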


Sequences
CDS sequence
ATGGCCGCTTTTAGGTTGCTCCGGCCGGTTCAGACTTCAGCCTCCGTCGTGTCTTCTTCCTCACCTCCGTCGACGGCGGTGACTGTGAGGTGTTTGACCAAAACAGGGTT
AAATTCGAACAATGGGGAAAGGTTCGTAACTTCCGGCGACGGAGAGAGAAGGCAGATCGCAGCCCTGAAGGCGGCGGTGGCGGCGGCTCCCGAGACTGTTGAAACTAAAA
ACAGAGAGCTCGATTTGGTTTCCCTGTTGGTGAATGTGCTCCGTCAATTGAAGACTGCTCTGGTTAAGACGAAGATTCGAAGGCTACAGATTCAGAAACTAATCGAAAAG
ATCATAATCGACTGCCGATTCTTCACATTATTCGCCGTCGCTGGATCTTTAATGGGTTCAATACTCTGTTTCCTTGAGGGGAGCTTTATAGTTGCAGAGTCATATCTGCA
GTATTTCAATGGTCTTTCACAGAGGTCGGACCAAAGTCATACAATGGAGCTTCTAATTGAAGCATTAGATATGTTCCTGGTGGGAACTGGTCTGGTTGTTTTTGGGGTTG
GACTATTCGTAATGTTCATCGGACCGGAGAAAACAAAGGAAAAAAAGGGGCGTTGGATTTGTGGGTCGAACTTGTGTGGTCTGTTCTACATGAAGAAAATTCCGAGGTGG
GTGGCAATGGAGTCGGTGTCGGAGGCGAAATCGAAGATCGGACATGCGGTGGTGTTGATACTGCAAGTGGGTGTGTTGGAGAAGTTCAAGAGCGTTCCGTTGAGCTCTGC
CGCCGATCTCGCGTGTTTCGCCGCTGCCGTTCTGATTTCCTCTGCCTCCATCTTCTTCCTCTCCAGACTCAACGGCGGCGGCGCCTACAAGTGA
mRNA sequence
ATTCCGGCCAAAATATTCTGCGAATCTTCCTTTTCCATTACATGGTATAAGAAATCAAACTAAATTTCTTATACCATAAATCAAAACGACAAGATCAATGGCCGCTTTTA
GGTTGCTCCGGCCGGTTCAGACTTCAGCCTCCGTCGTGTCTTCTTCCTCACCTCCGTCGACGGCGGTGACTGTGAGGTGTTTGACCAAAACAGGGTTAAATTCGAACAAT
GGGGAAAGGTTCGTAACTTCCGGCGACGGAGAGAGAAGGCAGATCGCAGCCCTGAAGGCGGCGGTGGCGGCGGCTCCCGAGACTGTTGAAACTAAAAACAGAGAGCTCGA
TTTGGTTTCCCTGTTGGTGAATGTGCTCCGTCAATTGAAGACTGCTCTGGTTAAGACGAAGATTCGAAGGCTACAGATTCAGAAACTAATCGAAAAGATCATAATCGACT
GCCGATTCTTCACATTATTCGCCGTCGCTGGATCTTTAATGGGTTCAATACTCTGTTTCCTTGAGGGGAGCTTTATAGTTGCAGAGTCATATCTGCAGTATTTCAATGGT
CTTTCACAGAGGTCGGACCAAAGTCATACAATGGAGCTTCTAATTGAAGCATTAGATATGTTCCTGGTGGGAACTGGTCTGGTTGTTTTTGGGGTTGGACTATTCGTAAT
GTTCATCGGACCGGAGAAAACAAAGGAAAAAAAGGGGCGTTGGATTTGTGGGTCGAACTTGTGTGGTCTGTTCTACATGAAGAAAATTCCGAGGTGGGTGGCAATGGAGT
CGGTGTCGGAGGCGAAATCGAAGATCGGACATGCGGTGGTGTTGATACTGCAAGTGGGTGTGTTGGAGAAGTTCAAGAGCGTTCCGTTGAGCTCTGCCGCCGATCTCGCG
TGTTTCGCCGCTGCCGTTCTGATTTCCTCTGCCTCCATCTTCTTCCTCTCCAGACTCAACGGCGGCGGCGCCTACAAGTGAACCGCCCCTGTGGCGGCGCGTTGGTCTTC
GCTGTCTCCCATTAGTATATATGTAATTTTTGGAAAGATTATGCTACGGGGGAATATTACCGATTCTTTCATTAATTAGTTTCCAAAATGCTAACGTAATGTGTATAATT
GTTTACCTCATGTATAGAATAATACTCATTTTTCAATTAAATAAAAT
Protein sequence
MAAFRLLRPVQTSASVVSSSSPPSTAVTVRCLTKTGLNSNNGERFVTSGDGERRQIAALKAAVAAAPETVETKNRELDLVSLLVNVLRQLKTALVKTKIRRLQIQKLIEK
IIIDCRFFTLFAVAGSLMGSILCFLEGSFIVAESYLQYFNGLSQRSDQSHTMELLIEALDMFLVGTGLVVFGVGLFVMFIGPEKTKEKKGRWICGSNLCGLFYMKKIPRW
VAMESVSEAKSKIGHAVVLILQVGVLEKFKSVPLSSAADLACFAAAVLISSASIFFLSRLNGGGAYK
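
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch that assumes the standard nuclear genetic code (NCBI translation table 1); the page does not say which table or tool the database used, and the names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS listed above.
print(translate("ATGGCCGCTTTTAGGTTGCTCCGGCCGGTT"))  # MAAFRLLRPV
```

Joining all CDS lines and passing them to `translate` should reproduce the protein sequence listed above if the standard-code assumption holds.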