CuGenDBv2

HG10004894 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10004894
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: UPF0114 domain-containing protein
Genome location: Chr08:21240170..21241392
RNA-Seq Expression: HG10004894
Synteny: HG10004894
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR005134 - Uncharacterised protein family UPF0114
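
The genome location above follows a `chromosome:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state its convention), the string can be split into its parts and the span length derived. `parse_location` and `span_length` are hypothetical helper names, not part of CuGenDB.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a string such as 'Chr08:21240170..21241392' into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return end - start + 1


chrom, start, end = parse_location("Chr08:21240170..21241392")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene region spans 1,223 bp on Chr08.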


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0065266.1 UPF0114 domain-containing protein [Cucumis melo var. makuwa] (e-value: 1.2e-125; identity: 86.35%)
Query:  MAATRFMQRLRPASAVSSSSSSSSPSSAMTVRCLGKTGLNLNNGERLITSGDGERRQIVAMKAAAAAAPETVETKTEELNLGSLLANLLVQLKTPVGKTK
        MAATRFMQR+RPA+AVSSSSSSSSPSS   VR LGKTGLNLNNGERLITSG GE RQ+VA+K AA  AP+TVETKT EL+LGSL+++LLVQLKT +GKTK
Subjt:  MAATRFMQRLRPASAVSSSSSSSSPSSAMTVRCLGKTGLNLNNGERLITSGDGERRQIVAMKAAAAAAPETVETKTEELNLGSLLANLLVQLKTPVGKTK

Query:  IQKRQIQKFIEKIIIDCRFFTLLAVAGSLLGSVLCFIEGSFIVAESYLQYFHGLSQSSDQTHMVALLIEALDMFLVGTALVVFGVGLFAMFVGSEKMKEK
        I+KR+IQKFIEKIIIDCRFFTLLAV+GSL+GS+LC+IEGSFIVAESYLQYFH LSQ ++QTH V LLIEALDMFLVGTALVVFG+GLFAMFVGSEKMKEK
Subjt:  IQKRQIQKFIEKIIIDCRFFTLLAVAGSLLGSVLCFIEGSFIVAESYLQYFHGLSQSSDQTHMVALLIEALDMFLVGTALVVFGVGLFAMFVGSEKMKEK

Query:  NRPVISGSNLFGLFYMKKIPTWVEMESMSQAKSKIGHAVMMILQVGVLEKFKNIPLSSAVDLACFAAAVLISSASIFFLSKLNVGGGGSVGFK
        NR  IS SNLFGLFYMKKIPTWVEMESMS AKSKIGHAVMMILQVGVLEKFKNIPLSSAVDLACFAAAVLISSASIFFLSKLNVG GGS GFK
Subjt:  NRPVISGSNLFGLFYMKKIPTWVEMESMSQAKSKIGHAVMMILQVGVLEKFKNIPLSSAVDLACFAAAVLISSASIFFLSKLNVGGGGSVGFK

XP_008444667.1 PREDICTED: uncharacterized protein LOC103487936 [Cucumis melo] (e-value: 4.7e-125; identity: 85.67%)
Query:  MAATRFMQRLRPASAVSSSSSSSSPSSAMTVRCLGKTGLNLNNGERLITSGDGERRQIVAMKAAAAAAPETVETKTEELNLGSLLANLLVQLKTPVGKTK
        MAATRFMQR+RPA+AVSSSSSSSSPSS   VR LGKTGLNLNNGERLITSG GE RQ++A+K AA  AP+TVETKT EL+LGSL+++LLVQLKT +GKTK
Subjt:  MAATRFMQRLRPASAVSSSSSSSSPSSAMTVRCLGKTGLNLNNGERLITSGDGERRQIVAMKAAAAAAPETVETKTEELNLGSLLANLLVQLKTPVGKTK

Query:  IQKRQIQKFIEKIIIDCRFFTLLAVAGSLLGSVLCFIEGSFIVAESYLQYFHGLSQSSDQTHMVALLIEALDMFLVGTALVVFGVGLFAMFVGSEKMKEK
        I+KR+IQKFIEKIIIDCRFFTLLAV+GSL+GS+LC+IEGSFIVAESYLQYFH LSQ ++QTH V LLIEALDMFLVGTALVVFG+GLFAMFVGSEKMKEK
Subjt:  IQKRQIQKFIEKIIIDCRFFTLLAVAGSLLGSVLCFIEGSFIVAESYLQYFHGLSQSSDQTHMVALLIEALDMFLVGTALVVFGVGLFAMFVGSEKMKEK

Query:  NRPVISGSNLFGLFYMKKIPTWVEMESMSQAKSKIGHAVMMILQVGVLEKFKNIPLSSAVDLACFAAAVLISSASIFFLSKLNVGGGGSVGFK
        N+  IS SNLFGLFYMKKIPTWVEMESMS AKSKIGHAVMMILQVGVLEKFKNIPLSSAVDLACFAAAVLISSASIFFLSKLNVG GGS GFK
Subjt:  NRPVISGSNLFGLFYMKKIPTWVEMESMSQAKSKIGHAVMMILQVGVLEKFKNIPLSSAVDLACFAAAVLISSASIFFLSKLNVGGGGSVGFK

XP_011649594.1 uncharacterized protein LOC101218655 isoform X1 [Cucumis sativus] (e-value: 2.1e-125; identity: 85.03%)
Query:  MAATRFMQRLRPASAVSSSSSSSSPSSAMTVRCLGKTGLNLNNGERLITSGDGERRQIVAMKAAAA-AAPETVETKTEELNLGSLLANLLVQLKTPVGKT
        MAATRF+QR+RPA+AVS+SSSSSSPSS   VR LGKTGLNLNNGERLITSG  ERRQ+V +KAAAA AAP+TVETKT EL+LGSL+ANLL+QLK  +GKT
Subjt:  MAATRFMQRLRPASAVSSSSSSSSPSSAMTVRCLGKTGLNLNNGERLITSGDGERRQIVAMKAAAA-AAPETVETKTEELNLGSLLANLLVQLKTPVGKT

Query:  KIQKRQIQKFIEKIIIDCRFFTLLAVAGSLLGSVLCFIEGSFIVAESYLQYFHGLSQSSDQTHMVALLIEALDMFLVGTALVVFGVGLFAMFVGSEKMKE
        KI+K +IQKFIEKIIIDCRFFTLLAV+GSL+GS+LC+IEGSFIV ESYLQYFHGLSQ +DQTH V LLIEALDMFLVGTAL+VFG+GLFAMFVGSEKMK+
Subjt:  KIQKRQIQKFIEKIIIDCRFFTLLAVAGSLLGSVLCFIEGSFIVAESYLQYFHGLSQSSDQTHMVALLIEALDMFLVGTALVVFGVGLFAMFVGSEKMKE

Query:  KNRPVISGSNLFGLFYMKKIPTWVEMESMSQAKSKIGHAVMMILQVGVLEKFKNIPLSSAVDLACFAAAVLISSASIFFLSKLNVGGGGSVGFK
        KN+   S SNLFGLFYMKKIPTWVEMESMS AKSKIGHAVMMILQVGVLEKFKNIPLSSAVDLACFAAAVLISSASIFFLSKLNVGGGGS GFK
Subjt:  KNRPVISGSNLFGLFYMKKIPTWVEMESMSQAKSKIGHAVMMILQVGVLEKFKNIPLSSAVDLACFAAAVLISSASIFFLSKLNVGGGGSVGFK

XP_022951147.1 uncharacterized protein LOC111454079 [Cucurbita moschata] (e-value: 9.2e-121; identity: 81.36%)
Query:  MAATRFMQRLRPASAVSSSSSSSSPSSAMTVRCLGKTGLNLNNGERLITSGDGERRQIVAMK--AAAAAAPETVETKTEELNLGSLLANLLVQLKTPVGK
        MAATR ++ +RP++ V SSSSSSSPS+A TVRCLGKTGLN  NGERL+TSGDGERRQIV +K  AAAAAAPETVET+T EL+LGSLLANLLVQLK    K
Subjt:  MAATRFMQRLRPASAVSSSSSSSSPSSAMTVRCLGKTGLNLNNGERLITSGDGERRQIVAMK--AAAAAAPETVETKTEELNLGSLLANLLVQLKTPVGK

Query:  TKIQKRQIQKFIEKIIIDCRFFTLLAVAGSLLGSVLCFIEGSFIVAESYLQYFHGLSQSSDQTHMVALLIEALDMFLVGTALVVFGVGLFAMFVGSEKMK
        TKI++RQIQKFIEKIIIDCRFFTL AVAGSLLGS+LCF+EGSFIVAESYLQYF+G+S+ SD++H V LLIE+LDMFLVGTALVVFGVGLFAMFVGSEKM 
Subjt:  TKIQKRQIQKFIEKIIIDCRFFTLLAVAGSLLGSVLCFIEGSFIVAESYLQYFHGLSQSSDQTHMVALLIEALDMFLVGTALVVFGVGLFAMFVGSEKMK

Query:  EKNRPVISGSNLFGLFYMKKIPTWVEMESMSQAKSKIGHAVMMILQVGVLEKFKNIPLSSAVDLACFAAAVLISSASIFFLSKLNVGGGGSVGFK
        EKN   +SGSNLFGLFYMK IPTWVEMES+S+AKSKIGHAVMMILQVGVLEKFK+IPLSSA DLACFAAA+LISSASIFFLS+LN+GGGG  G+K
Subjt:  EKNRPVISGSNLFGLFYMKKIPTWVEMESMSQAKSKIGHAVMMILQVGVLEKFKNIPLSSAVDLACFAAAVLISSASIFFLSKLNVGGGGSVGFK

XP_038885641.1 uncharacterized protein LOC120075956 [Benincasa hispida] (e-value: 5.4e-137; identity: 92.15%)
Query:  MAATRFMQRLRPASAVSSSSSSSSPSSAMTVRCLGKTGLNLNNGERLITSGDGERRQIVAMKAAAAAAPETVETKTEELNLGSLLANLLVQLKTPVGKTK
        M ATRFMQRLRPASAVSSSSSSSSPSSAMTVRCLGKTGLNLNNGERLITSGDGER+QIVA+K  AAAAP+TVET+TEELNLGSLLANLLVQLKT VGKTK
Subjt:  MAATRFMQRLRPASAVSSSSSSSSPSSAMTVRCLGKTGLNLNNGERLITSGDGERRQIVAMKAAAAAAPETVETKTEELNLGSLLANLLVQLKTPVGKTK

Query:  IQKRQIQKFIEKIIIDCRFFTLLAVAGSLLGSVLCFIEGSFIVAESYLQYFHGLSQSSDQTHMVALLIEALDMFLVGTALVVFGVGLFAMFVGSEKMKEK
        IQ+RQIQKFIEKIIIDCRFFTLLAVAGSLLGS+LC+IEGSFIVAESYLQYFHGLSQSS+Q H V LLIEALDMFLVGTALVVFGVGLFAMF+GS KMKEK
Subjt:  IQKRQIQKFIEKIIIDCRFFTLLAVAGSLLGSVLCFIEGSFIVAESYLQYFHGLSQSSDQTHMVALLIEALDMFLVGTALVVFGVGLFAMFVGSEKMKEK

Query:  NRPVISGSNLFGLFYMKKIPTWVEMESMSQAKSKIGHAVMMILQVGVLEKFKNIPLSSAVDLACFAAAVLISSASIFFLSKLNVGGGGSVGFK
        NRPVISGSN FGLF MKKIPTWVEMESMSQAKSKIGHAVMMILQVGVLEKFKNIPLSSAVDLACFAAAV++SSASIFFLSKLN+GGGGS GFK
Subjt:  NRPVISGSNLFGLFYMKKIPTWVEMESMSQAKSKIGHAVMMILQVGVLEKFKNIPLSSAVDLACFAAAVLISSASIFFLSKLNVGGGGSVGFK

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0LLC9 Uncharacterized protein (e-value: 1.0e-125; identity: 85.03%)
Query:  MAATRFMQRLRPASAVSSSSSSSSPSSAMTVRCLGKTGLNLNNGERLITSGDGERRQIVAMKAAAA-AAPETVETKTEELNLGSLLANLLVQLKTPVGKT
        MAATRF+QR+RPA+AVS+SSSSSSPSS   VR LGKTGLNLNNGERLITSG  ERRQ+V +KAAAA AAP+TVETKT EL+LGSL+ANLL+QLK  +GKT
Subjt:  MAATRFMQRLRPASAVSSSSSSSSPSSAMTVRCLGKTGLNLNNGERLITSGDGERRQIVAMKAAAA-AAPETVETKTEELNLGSLLANLLVQLKTPVGKT

Query:  KIQKRQIQKFIEKIIIDCRFFTLLAVAGSLLGSVLCFIEGSFIVAESYLQYFHGLSQSSDQTHMVALLIEALDMFLVGTALVVFGVGLFAMFVGSEKMKE
        KI+K +IQKFIEKIIIDCRFFTLLAV+GSL+GS+LC+IEGSFIV ESYLQYFHGLSQ +DQTH V LLIEALDMFLVGTAL+VFG+GLFAMFVGSEKMK+
Subjt:  KIQKRQIQKFIEKIIIDCRFFTLLAVAGSLLGSVLCFIEGSFIVAESYLQYFHGLSQSSDQTHMVALLIEALDMFLVGTALVVFGVGLFAMFVGSEKMKE

Query:  KNRPVISGSNLFGLFYMKKIPTWVEMESMSQAKSKIGHAVMMILQVGVLEKFKNIPLSSAVDLACFAAAVLISSASIFFLSKLNVGGGGSVGFK
        KN+   S SNLFGLFYMKKIPTWVEMESMS AKSKIGHAVMMILQVGVLEKFKNIPLSSAVDLACFAAAVLISSASIFFLSKLNVGGGGS GFK
Subjt:  KNRPVISGSNLFGLFYMKKIPTWVEMESMSQAKSKIGHAVMMILQVGVLEKFKNIPLSSAVDLACFAAAVLISSASIFFLSKLNVGGGGSVGFK

A0A1S3BAX1 uncharacterized protein LOC103487936 (e-value: 2.3e-125; identity: 85.67%)
Query:  MAATRFMQRLRPASAVSSSSSSSSPSSAMTVRCLGKTGLNLNNGERLITSGDGERRQIVAMKAAAAAAPETVETKTEELNLGSLLANLLVQLKTPVGKTK
        MAATRFMQR+RPA+AVSSSSSSSSPSS   VR LGKTGLNLNNGERLITSG GE RQ++A+K AA  AP+TVETKT EL+LGSL+++LLVQLKT +GKTK
Subjt:  MAATRFMQRLRPASAVSSSSSSSSPSSAMTVRCLGKTGLNLNNGERLITSGDGERRQIVAMKAAAAAAPETVETKTEELNLGSLLANLLVQLKTPVGKTK

Query:  IQKRQIQKFIEKIIIDCRFFTLLAVAGSLLGSVLCFIEGSFIVAESYLQYFHGLSQSSDQTHMVALLIEALDMFLVGTALVVFGVGLFAMFVGSEKMKEK
        I+KR+IQKFIEKIIIDCRFFTLLAV+GSL+GS+LC+IEGSFIVAESYLQYFH LSQ ++QTH V LLIEALDMFLVGTALVVFG+GLFAMFVGSEKMKEK
Subjt:  IQKRQIQKFIEKIIIDCRFFTLLAVAGSLLGSVLCFIEGSFIVAESYLQYFHGLSQSSDQTHMVALLIEALDMFLVGTALVVFGVGLFAMFVGSEKMKEK

Query:  NRPVISGSNLFGLFYMKKIPTWVEMESMSQAKSKIGHAVMMILQVGVLEKFKNIPLSSAVDLACFAAAVLISSASIFFLSKLNVGGGGSVGFK
        N+  IS SNLFGLFYMKKIPTWVEMESMS AKSKIGHAVMMILQVGVLEKFKNIPLSSAVDLACFAAAVLISSASIFFLSKLNVG GGS GFK
Subjt:  NRPVISGSNLFGLFYMKKIPTWVEMESMSQAKSKIGHAVMMILQVGVLEKFKNIPLSSAVDLACFAAAVLISSASIFFLSKLNVGGGGSVGFK

A0A5A7VG09 UPF0114 domain-containing protein (e-value: 6.0e-126; identity: 86.35%)
Query:  MAATRFMQRLRPASAVSSSSSSSSPSSAMTVRCLGKTGLNLNNGERLITSGDGERRQIVAMKAAAAAAPETVETKTEELNLGSLLANLLVQLKTPVGKTK
        MAATRFMQR+RPA+AVSSSSSSSSPSS   VR LGKTGLNLNNGERLITSG GE RQ+VA+K AA  AP+TVETKT EL+LGSL+++LLVQLKT +GKTK
Subjt:  MAATRFMQRLRPASAVSSSSSSSSPSSAMTVRCLGKTGLNLNNGERLITSGDGERRQIVAMKAAAAAAPETVETKTEELNLGSLLANLLVQLKTPVGKTK

Query:  IQKRQIQKFIEKIIIDCRFFTLLAVAGSLLGSVLCFIEGSFIVAESYLQYFHGLSQSSDQTHMVALLIEALDMFLVGTALVVFGVGLFAMFVGSEKMKEK
        I+KR+IQKFIEKIIIDCRFFTLLAV+GSL+GS+LC+IEGSFIVAESYLQYFH LSQ ++QTH V LLIEALDMFLVGTALVVFG+GLFAMFVGSEKMKEK
Subjt:  IQKRQIQKFIEKIIIDCRFFTLLAVAGSLLGSVLCFIEGSFIVAESYLQYFHGLSQSSDQTHMVALLIEALDMFLVGTALVVFGVGLFAMFVGSEKMKEK

Query:  NRPVISGSNLFGLFYMKKIPTWVEMESMSQAKSKIGHAVMMILQVGVLEKFKNIPLSSAVDLACFAAAVLISSASIFFLSKLNVGGGGSVGFK
        NR  IS SNLFGLFYMKKIPTWVEMESMS AKSKIGHAVMMILQVGVLEKFKNIPLSSAVDLACFAAAVLISSASIFFLSKLNVG GGS GFK
Subjt:  NRPVISGSNLFGLFYMKKIPTWVEMESMSQAKSKIGHAVMMILQVGVLEKFKNIPLSSAVDLACFAAAVLISSASIFFLSKLNVGGGGSVGFK

A0A6J1GGV4 uncharacterized protein LOC111454079 (e-value: 4.5e-121; identity: 81.36%)
Query:  MAATRFMQRLRPASAVSSSSSSSSPSSAMTVRCLGKTGLNLNNGERLITSGDGERRQIVAMK--AAAAAAPETVETKTEELNLGSLLANLLVQLKTPVGK
        MAATR ++ +RP++ V SSSSSSSPS+A TVRCLGKTGLN  NGERL+TSGDGERRQIV +K  AAAAAAPETVET+T EL+LGSLLANLLVQLK    K
Subjt:  MAATRFMQRLRPASAVSSSSSSSSPSSAMTVRCLGKTGLNLNNGERLITSGDGERRQIVAMK--AAAAAAPETVETKTEELNLGSLLANLLVQLKTPVGK

Query:  TKIQKRQIQKFIEKIIIDCRFFTLLAVAGSLLGSVLCFIEGSFIVAESYLQYFHGLSQSSDQTHMVALLIEALDMFLVGTALVVFGVGLFAMFVGSEKMK
        TKI++RQIQKFIEKIIIDCRFFTL AVAGSLLGS+LCF+EGSFIVAESYLQYF+G+S+ SD++H V LLIE+LDMFLVGTALVVFGVGLFAMFVGSEKM 
Subjt:  TKIQKRQIQKFIEKIIIDCRFFTLLAVAGSLLGSVLCFIEGSFIVAESYLQYFHGLSQSSDQTHMVALLIEALDMFLVGTALVVFGVGLFAMFVGSEKMK

Query:  EKNRPVISGSNLFGLFYMKKIPTWVEMESMSQAKSKIGHAVMMILQVGVLEKFKNIPLSSAVDLACFAAAVLISSASIFFLSKLNVGGGGSVGFK
        EKN   +SGSNLFGLFYMK IPTWVEMES+S+AKSKIGHAVMMILQVGVLEKFK+IPLSSA DLACFAAA+LISSASIFFLS+LN+GGGG  G+K
Subjt:  EKNRPVISGSNLFGLFYMKKIPTWVEMESMSQAKSKIGHAVMMILQVGVLEKFKNIPLSSAVDLACFAAAVLISSASIFFLSKLNVGGGGSVGFK

A0A6J1KI88 uncharacterized protein LOC111496020 (e-value: 6.2e-115; identity: 79.39%)
Query:  MAATRFMQRLRPASAVSSSSSSSSPSSAMTVRCLGKTGLNLNNGERLITSGDGERRQIVAMK---AAAAAAPETVETKTEELNLGSLLANLLVQLKTPVG
        MAATR ++ +RP SA   SSSSSSPS+A  VRCL KTGLN  NGERLITSGDGERRQIV +K   AAAAAAPETVETKT EL+LGSLLANLLVQLK    
Subjt:  MAATRFMQRLRPASAVSSSSSSSSPSSAMTVRCLGKTGLNLNNGERLITSGDGERRQIVAMK---AAAAAAPETVETKTEELNLGSLLANLLVQLKTPVG

Query:  KTKIQKRQIQKFIEKIIIDCRFFTLLAVAGSLLGSVLCFIEGSFIVAESYLQYFHGLSQSSDQTHMVALLIEALDMFLVGTALVVFGVGLFAMFVGSEKM
        K KI++RQIQKFIEKIII+CRFFTL AVAGSLLGS+LCF+EGSFIVAESYLQYF+ +S+ SD++H V LLIE+LDMFLVGTALVVFGVGLFAMFVGSEKM
Subjt:  KTKIQKRQIQKFIEKIIIDCRFFTLLAVAGSLLGSVLCFIEGSFIVAESYLQYFHGLSQSSDQTHMVALLIEALDMFLVGTALVVFGVGLFAMFVGSEKM

Query:  KEKNRPVISGSNLFGLFYMKKIPTWVEMESMSQAKSKIGHAVMMILQVGVLEKFKNIPLSSAVDLACFAAAVLISSASIFFLSKLNVGGGGSVGFK
         EKNR  +SGSNLFGLFYMK IPTWVEMES+S+AKSKIGHAVMMILQVGVLEK K+IPLSSA DLACFAAA+LI SASIFFLS+LN+GGG   G+K
Subjt:  KEKNRPVISGSNLFGLFYMKKIPTWVEMESMSQAKSKIGHAVMMILQVGVLEKFKNIPLSSAVDLACFAAAVLISSASIFFLSKLNVGGGGSVGFK

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
AT4G19390.1 Uncharacterised protein family (UPF0114) (e-value: 5.2e-37; identity: 41.11%)
Query:  IQKFIEKIIIDCRFFTLLAVAGSLLGSVLCFIEGSFIVAESYLQYFHGLSQSSDQTHMVALLIEALDMFLVGTALVVFGVGLFAMFVGS-EKMKEKNRPV
        +++ IEK+I  CRF T L   GSLLGSVLCFI+G   V +S+LQY      S ++  ++ LL+EA+D++L+GT ++VFG+GL+ +F+ + +  + +   +
Subjt:  IQKFIEKIIIDCRFFTLLAVAGSLLGSVLCFIEGSFIVAESYLQYFHGLSQSSDQTHMVALLIEALDMFLVGTALVVFGVGLFAMFVGS-EKMKEKNRPV

Query:  ISG-SNLFGLFYMKKIPTWVEMESMSQAKSKIGHAVMMILQVGVLEKFKNIPLSSAVDLACFAAAVLISSASIFFLSKLN
        +S  S+LFG+F +K+ P W+E++S+S+ K+K+GH ++M+L +G+ +K K + ++S  DL C + ++  SSA +F LS+LN
Subjt:  ISG-SNLFGLFYMKKIPTWVEMESMSQAKSKIGHAVMMILQVGVLEKFKNIPLSSAVDLACFAAAVLISSASIFFLSKLN

AT5G13720.1 Uncharacterised protein family (UPF0114) (e-value: 8.5e-32; identity: 35.87%)
Query:  RQIQKFIEKIIIDCRFFTLLAVAGSLLGSVLCFIEGSFIVAESYLQYFHGLSQSSDQTHMVALLIEALDMFLVGTALVVFGVGLFAMFV--GSEKMKEKN
        R  +  +E+II D RF  LLAV GSL GS+LCF+ G   + E+Y  Y+   S+      MV  L+EA+D++L GT +++F +GL+ +F+      +  ++
Subjt:  RQIQKFIEKIIIDCRFFTLLAVAGSLLGSVLCFIEGSFIVAESYLQYFHGLSQSSDQTHMVALLIEALDMFLVGTALVVFGVGLFAMFV--GSEKMKEKN

Query:  RPVISGSNLFGLFYMKKIPTWVEMESMSQAKSKIGHAVMMILQVGVLEKFKNIPLSSAVDLACFAAAVLISSASIFFLSKLNVG
           +  S+LFG+F MK+ P W+++ S+ + K+K+GH ++MIL V + E+ K + +++ +DL  ++  + +SSAS++ L  L+ G
Subjt:  RPVISGSNLFGLFYMKKIPTWVEMESMSQAKSKIGHAVMMILQVGVLEKFKNIPLSSAVDLACFAAAVLISSASIFFLSKLNVG
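
In BLAST-style pairwise output such as the alignments above, the unlabeled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. A rough sketch of how a percent identity can be derived from that midline follows, assuming identity is counted as identical columns over the full alignment length. `percent_identity` is a hypothetical helper, and the midline passed in must have its leading indentation removed but keep every internal and trailing space, so that it is exactly as long as the aligned query segment.

```python
def percent_identity(midline: str) -> float:
    """Percentage of alignment columns whose midline character is a letter."""
    if not midline:
        raise ValueError("empty midline")
    # Letters mark identical residues; '+' and spaces are not identities.
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(midline)


# Toy midline: 4 identical columns (M, A, T, R), 1 conservative (+), 1 mismatch (space).
print(round(percent_identity("MA+T R"), 2))
```

For an alignment printed in several blocks, the midlines would be concatenated before calling the function. The identity values listed with each hit come from the database's own BLAST run, and this sketch is not guaranteed to reproduce them exactly.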


Sequences
CDS sequence
ATGGCGGCGACTAGATTCATGCAGCGACTTCGGCCAGCCTCCGCCGTGTCTTCCTCTTCCTCTTCTTCATCTCCGTCGTCGGCGATGACCGTGAGGTGTTTGGGCAAAAC
AGGGCTAAATTTGAACAACGGCGAACGGTTAATAACTTCTGGAGATGGCGAGAGAAGGCAGATAGTCGCCATGAAGGCGGCGGCGGCGGCGGCTCCCGAGACTGTGGAAA
CTAAAACCGAAGAATTGAATTTGGGTTCTTTGCTGGCGAATCTACTCGTTCAATTGAAGACTCCCGTGGGGAAGACGAAGATTCAGAAGCGACAGATTCAGAAGTTCATC
GAAAAGATCATAATCGACTGTCGATTCTTCACATTGTTAGCCGTCGCCGGATCTTTACTGGGTTCAGTGCTCTGTTTTATTGAGGGGAGTTTCATTGTTGCAGAGTCATA
TCTGCAGTACTTTCATGGTCTTTCACAGAGCTCGGACCAAACTCATATGGTGGCGCTTCTAATTGAAGCCTTAGATATGTTCCTCGTCGGAACCGCTCTGGTTGTTTTTG
GGGTCGGATTGTTTGCAATGTTCGTCGGATCGGAGAAGATGAAGGAAAAAAATCGGCCAGTGATTTCTGGGTCGAATTTGTTTGGTCTGTTCTACATGAAGAAAATTCCG
ACGTGGGTAGAAATGGAATCGATGTCGCAGGCGAAATCGAAGATCGGACATGCGGTGATGATGATACTGCAAGTGGGTGTGTTGGAAAAGTTCAAGAACATACCATTGAG
CTCTGCCGTCGATCTCGCATGTTTCGCCGCCGCCGTTCTGATTTCCTCCGCCTCCATCTTCTTCCTCTCCAAACTCAACGTTGGGGGAGGCGGCAGCGTCGGTTTCAAGT
GA
mRNA sequence
ATGGCGGCGACTAGATTCATGCAGCGACTTCGGCCAGCCTCCGCCGTGTCTTCCTCTTCCTCTTCTTCATCTCCGTCGTCGGCGATGACCGTGAGGTGTTTGGGCAAAAC
AGGGCTAAATTTGAACAACGGCGAACGGTTAATAACTTCTGGAGATGGCGAGAGAAGGCAGATAGTCGCCATGAAGGCGGCGGCGGCGGCGGCTCCCGAGACTGTGGAAA
CTAAAACCGAAGAATTGAATTTGGGTTCTTTGCTGGCGAATCTACTCGTTCAATTGAAGACTCCCGTGGGGAAGACGAAGATTCAGAAGCGACAGATTCAGAAGTTCATC
GAAAAGATCATAATCGACTGTCGATTCTTCACATTGTTAGCCGTCGCCGGATCTTTACTGGGTTCAGTGCTCTGTTTTATTGAGGGGAGTTTCATTGTTGCAGAGTCATA
TCTGCAGTACTTTCATGGTCTTTCACAGAGCTCGGACCAAACTCATATGGTGGCGCTTCTAATTGAAGCCTTAGATATGTTCCTCGTCGGAACCGCTCTGGTTGTTTTTG
GGGTCGGATTGTTTGCAATGTTCGTCGGATCGGAGAAGATGAAGGAAAAAAATCGGCCAGTGATTTCTGGGTCGAATTTGTTTGGTCTGTTCTACATGAAGAAAATTCCG
ACGTGGGTAGAAATGGAATCGATGTCGCAGGCGAAATCGAAGATCGGACATGCGGTGATGATGATACTGCAAGTGGGTGTGTTGGAAAAGTTCAAGAACATACCATTGAG
CTCTGCCGTCGATCTCGCATGTTTCGCCGCCGCCGTTCTGATTTCCTCCGCCTCCATCTTCTTCCTCTCCAAACTCAACGTTGGGGGAGGCGGCAGCGTCGGTTTCAAGT
GA
Protein sequence
MAATRFMQRLRPASAVSSSSSSSSPSSAMTVRCLGKTGLNLNNGERLITSGDGERRQIVAMKAAAAAAPETVETKTEELNLGSLLANLLVQLKTPVGKTKIQKRQIQKFI
EKIIIDCRFFTLLAVAGSLLGSVLCFIEGSFIVAESYLQYFHGLSQSSDQTHMVALLIEALDMFLVGTALVVFGVGLFAMFVGSEKMKEKNRPVISGSNLFGLFYMKKIP
TWVEMESMSQAKSKIGHAVMMILQVGVLEKFKNIPLSSAVDLACFAAAVLISSASIFFLSKLNVGGGGSVGFK
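
The protein sequence corresponds to the CDS read codon by codon. As a minimal sketch, assuming the standard nuclear genetic code applies, the relationship can be checked by translating the CDS and comparing the result with the listed protein. `translate` is a hypothetical helper written for illustration, and the two inputs below are the first 24 and last 12 nucleotides of the CDS above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGCGGCGACTAGATTCATGCAG"))  # start of the CDS
print(translate("GGTTTCAAGTGA"))              # end of the CDS, including the stop codon
```

The two fragments translate to `MAATRFMQ` and `GFK`, matching the start and end of the protein sequence; passing the full CDS (line breaks included) would give the complete protein for comparison.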