CuGenDBv2

Moc05g24320 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc05g24320
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr5:17288161..17290756
RNA-Seq Expression: Moc05g24320
Synteny: Moc05g24320
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
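The genome location above follows a `chromosome:start..end` pattern. As a minimal sketch, the string can be split into its parts and the genomic span computed. The helper names `parse_location` and `span_length` are hypothetical, and the span assumes 1-based inclusive coordinates, which the record itself does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return end - start + 1


chrom, start, end = parse_location("chr5:17288161..17290756")
print(chrom, start, end, span_length(start, end))
```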


Homology
GenBank top hits (e value, % identity, alignment)
XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia] (e value: 4.0e-80; % identity: 45.35)
Query:  SPTPVRFRIEPSSSGVKDQASRISAACLDGCLRRASKFMSDPRSVLQRTIDHVAEAFIASIHSAVMIKAELDEREILAARESANSSAALEVA-TTMKGEL
        S    RFRIEPSSSGV+DQ SRISAA LD CLRRASKF+S P SVL R ID+ AEAF+ASI SA+ +KAELD RE+LAARE    SAALE A +TMK EL
Subjt:  SPTPVRFRIEPSSSGVKDQASRISAACLDGCLRRASKFMSDPRSVLQRTIDHVAEAFIASIHSAVMIKAELDEREILAARESANSSAALEVA-TTMKGEL

Query:  LKARSEMEAFKFEV--------------------------------------------------------------------------------------
        LKA SE+E  K EV                                                                                      
Subjt:  LKARSEMEAFKFEV--------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------------------------------------------EA
                                                                                                          EA
Subjt:  --------------------------------------------------------------------------------------------------EA

Query:  KVLLLKKEEEKHKAHLRAVNAITKGLEKEKFQLLKEKDDLAQVLEEKDTLIGRLTTELKEEKEHLTNGALLEAAFRQHPDFDGFAKDFSDAGFKFLMKGI
        K  LLK+E+E+HKAHLRA +AITKGLEKEKFQLLKEKDD+ Q LE KD  IGRL  ELK EKE LTNGALLEAAFRQHPDFDGFAKDFSDAGFKFLMKGI
Subjt:  KVLLLKKEEEKHKAHLRAVNAITKGLEKEKFQLLKEKDDLAQVLEEKDTLIGRLTTELKEEKEHLTNGALLEAAFRQHPDFDGFAKDFSDAGFKFLMKGI

Query:  AADMPHLQIDLGDLKKRYAEKWAFGPNSTPGPTSLVEKYVRELNSDYSDLEE
        AAD+PHL++DLGDLKKRYAEKWA GPN T GP SLV+KYVR+L+SDYSDL+E
Subjt:  AADMPHLQIDLGDLKKRYAEKWAFGPNSTPGPTSLVEKYVRELNSDYSDLEE

XP_022147182.1 uncharacterized protein LOC111016193 [Momordica charantia] (e value: 2.5e-106; % identity: 78.71)
Query:  VRFRIEPSSSGVKDQASRISAACLDGCLRRASKFMSDPRSVLQRTIDHVAEAFIASIHSAVMIKAELDEREILAARESANSSAALEVATTMKGELLKARS
        +RFR+E SSSGVKDQ SRISA CLD CLRRAS+F+SDP SVLQRTID+ AEAFIASIHSAVM+KAELD RE L A+E  N S  LE ATT+KGELLKA+ 
Subjt:  VRFRIEPSSSGVKDQASRISAACLDGCLRRASKFMSDPRSVLQRTIDHVAEAFIASIHSAVMIKAELDEREILAARESANSSAALEVATTMKGELLKARS

Query:  EMEAFKFEVEAKVLLLKKEEEKHKAHLRAVNAITKGLEKEKFQLLKEKDDLAQVLEEKDTLIGRLTTELKEEKEHLTNGALLEAAFRQHPDFDGFAKDFS
        E++  + EV+AK  LLKKE EKHKAHLRA +AITKGLEKEKFQLLKEKDDLAQVLE+KD  IGRLTTELK+ KE LT+GALLE +FRQHP+FDGFAKDFS
Subjt:  EMEAFKFEVEAKVLLLKKEEEKHKAHLRAVNAITKGLEKEKFQLLKEKDDLAQVLEEKDTLIGRLTTELKEEKEHLTNGALLEAAFRQHPDFDGFAKDFS

Query:  DAGFKFLMKGIAADMPHLQIDLGDLKKRYAEKWAFGPNSTPGPTSLVEKYVRELNSDYSDLEE
        DAGFKFLMKGIAADMPHLQIDL DLKKRY+E WA GPN TPGP SLV+KYVREL+SDYSD+EE
Subjt:  DAGFKFLMKGIAADMPHLQIDLGDLKKRYAEKWAFGPNSTPGPTSLVEKYVRELNSDYSDLEE

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia] (e value: 6.6e-91; % identity: 71.26)
Query:  RIEPSSSGVKDQASRISAACLDGCLRRASKFMSDPRSVLQRTIDHVAEAFIASIHSAVMIKAELDEREILAARESANSSAALEVA-TTMKGELLKARSEM
        RIEPSSSGV+DQ SRISAA LD CLRRASKF+S P SVLQRTID+ AEAF+ASI SA+ +KAELD RE+LAARE    SAALE A +TMK ELLKA SE+
Subjt:  RIEPSSSGVKDQASRISAACLDGCLRRASKFMSDPRSVLQRTIDHVAEAFIASIHSAVMIKAELDEREILAARESANSSAALEVA-TTMKGELLKARSEM

Query:  EAFKFEVEAKVLLLKKEEEKHKAHLRAVNAITKGLEKEKFQLLKEKDDLAQVLEEKDTLIGRLTTELKEEKEHLTNGALLEAAFRQHPDFDGFAKDFSDA
        E  K EVE++  LLKKEE++ +A LRA +AIT+GLE+EKFQLLKEKDD+ Q LE KD  +   T EL+  KE L+NG LLE AFRQHPDFDGFAKDFSDA
Subjt:  EAFKFEVEAKVLLLKKEEEKHKAHLRAVNAITKGLEKEKFQLLKEKDDLAQVLEEKDTLIGRLTTELKEEKEHLTNGALLEAAFRQHPDFDGFAKDFSDA

Query:  GFKFLMKGIAADMPHLQIDLGDLKKRYAEKWAFGPNSTPGPTSLVEKYVRELNSDYSDLEE
        GFKFLMKGIA+DMP LQIDL  LK+RYAEKWA GP  TPGP +LV++YVR+L+SDYSD EE
Subjt:  GFKFLMKGIAADMPHLQIDLGDLKKRYAEKWAFGPNSTPGPTSLVEKYVRELNSDYSDLEE

XP_022152119.1 uncharacterized protein LOC111019909 [Momordica charantia] (e value: 5.0e-107; % identity: 79.77)
Query:  RFRIEPSSSGVKDQASRISAACLDGCLRRASKFMSDPRSVLQRTIDHVAEAFIASIHSAVMIKAELDEREILAARESANSSAALEVATTMKGELLKARSE
        RFR+EPSSSGVKDQ SRISA CLD CL+RASKF+SDP SVLQRTID+ AEAF+ASIHSA+M+KAELD RE LAA+E  NSSAALE ATT+KGELLKA+ E
Subjt:  RFRIEPSSSGVKDQASRISAACLDGCLRRASKFMSDPRSVLQRTIDHVAEAFIASIHSAVMIKAELDEREILAARESANSSAALEVATTMKGELLKARSE

Query:  MEAFKFEVEAKVLLLKKEEEKHKAHLRAVNAITKGLEKEKFQLLKEKDDLAQVLEEKDTLIGRLTTELKEEKEHLTNGALLEAAFRQHPDFDGFAKDFSD
        +   + EV+AK  LLKKE EKHKAHLRA +AITKGLEKEKFQLLKEKDDLAQVLE KDT IGRLT ELK+ KE LTNG+LLE +FRQH DFDGFAKDFSD
Subjt:  MEAFKFEVEAKVLLLKKEEEKHKAHLRAVNAITKGLEKEKFQLLKEKDDLAQVLEEKDTLIGRLTTELKEEKEHLTNGALLEAAFRQHPDFDGFAKDFSD

Query:  AGFKFLMKGIAADMPHLQIDLGDLKKRYAEKWAFGPNSTPGPTSLVEKYVRELNSDYSDLEE
        AGFKFLMKGIAADMPHLQIDL +LKK+Y+EKWA GPN TPGP SLV KYVREL+SDYSD+EE
Subjt:  AGFKFLMKGIAADMPHLQIDLGDLKKRYAEKWAFGPNSTPGPTSLVEKYVRELNSDYSDLEE

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia] (e value: 2.7e-137; % identity: 57.56)
Query:  MCARKGAGGIVKGPTSIKGWVGKWFFVSREWLAKNESGRPFFDVPVRFEN--------------------------------------------------
        MCARKG GGIVKGPTSIKGWVGKWFF S EWLAK+ESGR FFDVP RF N                                                  
Subjt:  MCARKGAGGIVKGPTSIKGWVGKWFFVSREWLAKNESGRPFFDVPVRFEN--------------------------------------------------

Query:  -------------LAMVCGFTSSVKRKSKGRAHALKTIQSTEPATPVVARPAAQDKAGPSVDSPTPV---------------------------------
                     LAMVCGFT SVKRKSKGRAHALKT+  TEP TP V R  AQ  +GPS   PTPV                                 
Subjt:  -------------LAMVCGFTSSVKRKSKGRAHALKTIQSTEPATPVVARPAAQDKAGPSVDSPTPV---------------------------------

Query:  -----------------------------------------------RFRIEPSSSGVKDQASRISAACLDGCLRRASKFMSDPRSVLQRTIDHVAEAFI
                                                       RF +EPSSSGVKDQ SRISA CLD  LRRASKF+SDP SVLQRTID+VAEAFI
Subjt:  -----------------------------------------------RFRIEPSSSGVKDQASRISAACLDGCLRRASKFMSDPRSVLQRTIDHVAEAFI

Query:  ASIHSAVMIKAELDEREILAARESANSSAALEVATTMKGELLKARSEMEAFKFEVEAKVLLLKKEEEKHKAHLRAVNAITKGLEKEKFQLLKEKDDLAQV
        ASIH AVM+KAELD RE LAA+E  NS AALE ATT+KGELLKA+ E++  + EV+AKV LLKKE EKHKAHLRA +AITKGLEKEKFQLLKEKDDLAQV
Subjt:  ASIHSAVMIKAELDEREILAARESANSSAALEVATTMKGELLKARSEMEAFKFEVEAKVLLLKKEEEKHKAHLRAVNAITKGLEKEKFQLLKEKDDLAQV

Query:  LEEKDTLIGRLTTELKEEKEHLTNGALLEAAFRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQIDLGDLKKRYAEKWAFGPNSTPGPTSLVEKYVREL
        LEEKD  IGRLTTELK+ KE LTNG LLE +FRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQIDL  LKK+Y+EKWA GPN TP P SLV+KYVREL
Subjt:  LEEKDTLIGRLTTELKEEKEHLTNGALLEAAFRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQIDLGDLKKRYAEKWAFGPNSTPGPTSLVEKYVREL

Query:  NSDYSDLEE
        +SDYSD+EE
Subjt:  NSDYSDLEE

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CLV1 uncharacterized protein LOC111012467 (e value: 1.9e-80; % identity: 45.35)
Query:  SPTPVRFRIEPSSSGVKDQASRISAACLDGCLRRASKFMSDPRSVLQRTIDHVAEAFIASIHSAVMIKAELDEREILAARESANSSAALEVA-TTMKGEL
        S    RFRIEPSSSGV+DQ SRISAA LD CLRRASKF+S P SVL R ID+ AEAF+ASI SA+ +KAELD RE+LAARE    SAALE A +TMK EL
Subjt:  SPTPVRFRIEPSSSGVKDQASRISAACLDGCLRRASKFMSDPRSVLQRTIDHVAEAFIASIHSAVMIKAELDEREILAARESANSSAALEVA-TTMKGEL

Query:  LKARSEMEAFKFEV--------------------------------------------------------------------------------------
        LKA SE+E  K EV                                                                                      
Subjt:  LKARSEMEAFKFEV--------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------------------------------------------EA
                                                                                                          EA
Subjt:  --------------------------------------------------------------------------------------------------EA

Query:  KVLLLKKEEEKHKAHLRAVNAITKGLEKEKFQLLKEKDDLAQVLEEKDTLIGRLTTELKEEKEHLTNGALLEAAFRQHPDFDGFAKDFSDAGFKFLMKGI
        K  LLK+E+E+HKAHLRA +AITKGLEKEKFQLLKEKDD+ Q LE KD  IGRL  ELK EKE LTNGALLEAAFRQHPDFDGFAKDFSDAGFKFLMKGI
Subjt:  KVLLLKKEEEKHKAHLRAVNAITKGLEKEKFQLLKEKDDLAQVLEEKDTLIGRLTTELKEEKEHLTNGALLEAAFRQHPDFDGFAKDFSDAGFKFLMKGI

Query:  AADMPHLQIDLGDLKKRYAEKWAFGPNSTPGPTSLVEKYVRELNSDYSDLEE
        AAD+PHL++DLGDLKKRYAEKWA GPN T GP SLV+KYVR+L+SDYSDL+E
Subjt:  AADMPHLQIDLGDLKKRYAEKWAFGPNSTPGPTSLVEKYVRELNSDYSDLEE

A0A6J1D1N9 uncharacterized protein LOC111016193 (e value: 1.2e-106; % identity: 78.71)
Query:  VRFRIEPSSSGVKDQASRISAACLDGCLRRASKFMSDPRSVLQRTIDHVAEAFIASIHSAVMIKAELDEREILAARESANSSAALEVATTMKGELLKARS
        +RFR+E SSSGVKDQ SRISA CLD CLRRAS+F+SDP SVLQRTID+ AEAFIASIHSAVM+KAELD RE L A+E  N S  LE ATT+KGELLKA+ 
Subjt:  VRFRIEPSSSGVKDQASRISAACLDGCLRRASKFMSDPRSVLQRTIDHVAEAFIASIHSAVMIKAELDEREILAARESANSSAALEVATTMKGELLKARS

Query:  EMEAFKFEVEAKVLLLKKEEEKHKAHLRAVNAITKGLEKEKFQLLKEKDDLAQVLEEKDTLIGRLTTELKEEKEHLTNGALLEAAFRQHPDFDGFAKDFS
        E++  + EV+AK  LLKKE EKHKAHLRA +AITKGLEKEKFQLLKEKDDLAQVLE+KD  IGRLTTELK+ KE LT+GALLE +FRQHP+FDGFAKDFS
Subjt:  EMEAFKFEVEAKVLLLKKEEEKHKAHLRAVNAITKGLEKEKFQLLKEKDDLAQVLEEKDTLIGRLTTELKEEKEHLTNGALLEAAFRQHPDFDGFAKDFS

Query:  DAGFKFLMKGIAADMPHLQIDLGDLKKRYAEKWAFGPNSTPGPTSLVEKYVRELNSDYSDLEE
        DAGFKFLMKGIAADMPHLQIDL DLKKRY+E WA GPN TPGP SLV+KYVREL+SDYSD+EE
Subjt:  DAGFKFLMKGIAADMPHLQIDLGDLKKRYAEKWAFGPNSTPGPTSLVEKYVRELNSDYSDLEE

A0A6J1D971 uncharacterized protein LOC111018538 (e value: 3.2e-91; % identity: 71.26)
Query:  RIEPSSSGVKDQASRISAACLDGCLRRASKFMSDPRSVLQRTIDHVAEAFIASIHSAVMIKAELDEREILAARESANSSAALEVA-TTMKGELLKARSEM
        RIEPSSSGV+DQ SRISAA LD CLRRASKF+S P SVLQRTID+ AEAF+ASI SA+ +KAELD RE+LAARE    SAALE A +TMK ELLKA SE+
Subjt:  RIEPSSSGVKDQASRISAACLDGCLRRASKFMSDPRSVLQRTIDHVAEAFIASIHSAVMIKAELDEREILAARESANSSAALEVA-TTMKGELLKARSEM

Query:  EAFKFEVEAKVLLLKKEEEKHKAHLRAVNAITKGLEKEKFQLLKEKDDLAQVLEEKDTLIGRLTTELKEEKEHLTNGALLEAAFRQHPDFDGFAKDFSDA
        E  K EVE++  LLKKEE++ +A LRA +AIT+GLE+EKFQLLKEKDD+ Q LE KD  +   T EL+  KE L+NG LLE AFRQHPDFDGFAKDFSDA
Subjt:  EAFKFEVEAKVLLLKKEEEKHKAHLRAVNAITKGLEKEKFQLLKEKDDLAQVLEEKDTLIGRLTTELKEEKEHLTNGALLEAAFRQHPDFDGFAKDFSDA

Query:  GFKFLMKGIAADMPHLQIDLGDLKKRYAEKWAFGPNSTPGPTSLVEKYVRELNSDYSDLEE
        GFKFLMKGIA+DMP LQIDL  LK+RYAEKWA GP  TPGP +LV++YVR+L+SDYSD EE
Subjt:  GFKFLMKGIAADMPHLQIDLGDLKKRYAEKWAFGPNSTPGPTSLVEKYVRELNSDYSDLEE

A0A6J1DF31 uncharacterized protein LOC111019909 (e value: 2.4e-107; % identity: 79.77)
Query:  RFRIEPSSSGVKDQASRISAACLDGCLRRASKFMSDPRSVLQRTIDHVAEAFIASIHSAVMIKAELDEREILAARESANSSAALEVATTMKGELLKARSE
        RFR+EPSSSGVKDQ SRISA CLD CL+RASKF+SDP SVLQRTID+ AEAF+ASIHSA+M+KAELD RE LAA+E  NSSAALE ATT+KGELLKA+ E
Subjt:  RFRIEPSSSGVKDQASRISAACLDGCLRRASKFMSDPRSVLQRTIDHVAEAFIASIHSAVMIKAELDEREILAARESANSSAALEVATTMKGELLKARSE

Query:  MEAFKFEVEAKVLLLKKEEEKHKAHLRAVNAITKGLEKEKFQLLKEKDDLAQVLEEKDTLIGRLTTELKEEKEHLTNGALLEAAFRQHPDFDGFAKDFSD
        +   + EV+AK  LLKKE EKHKAHLRA +AITKGLEKEKFQLLKEKDDLAQVLE KDT IGRLT ELK+ KE LTNG+LLE +FRQH DFDGFAKDFSD
Subjt:  MEAFKFEVEAKVLLLKKEEEKHKAHLRAVNAITKGLEKEKFQLLKEKDDLAQVLEEKDTLIGRLTTELKEEKEHLTNGALLEAAFRQHPDFDGFAKDFSD

Query:  AGFKFLMKGIAADMPHLQIDLGDLKKRYAEKWAFGPNSTPGPTSLVEKYVRELNSDYSDLEE
        AGFKFLMKGIAADMPHLQIDL +LKK+Y+EKWA GPN TPGP SLV KYVREL+SDYSD+EE
Subjt:  AGFKFLMKGIAADMPHLQIDLGDLKKRYAEKWAFGPNSTPGPTSLVEKYVRELNSDYSDLEE

A0A6J1DZB3 uncharacterized protein LOC111025665 (e value: 1.3e-137; % identity: 57.56)
Query:  MCARKGAGGIVKGPTSIKGWVGKWFFVSREWLAKNESGRPFFDVPVRFEN--------------------------------------------------
        MCARKG GGIVKGPTSIKGWVGKWFF S EWLAK+ESGR FFDVP RF N                                                  
Subjt:  MCARKGAGGIVKGPTSIKGWVGKWFFVSREWLAKNESGRPFFDVPVRFEN--------------------------------------------------

Query:  -------------LAMVCGFTSSVKRKSKGRAHALKTIQSTEPATPVVARPAAQDKAGPSVDSPTPV---------------------------------
                     LAMVCGFT SVKRKSKGRAHALKT+  TEP TP V R  AQ  +GPS   PTPV                                 
Subjt:  -------------LAMVCGFTSSVKRKSKGRAHALKTIQSTEPATPVVARPAAQDKAGPSVDSPTPV---------------------------------

Query:  -----------------------------------------------RFRIEPSSSGVKDQASRISAACLDGCLRRASKFMSDPRSVLQRTIDHVAEAFI
                                                       RF +EPSSSGVKDQ SRISA CLD  LRRASKF+SDP SVLQRTID+VAEAFI
Subjt:  -----------------------------------------------RFRIEPSSSGVKDQASRISAACLDGCLRRASKFMSDPRSVLQRTIDHVAEAFI

Query:  ASIHSAVMIKAELDEREILAARESANSSAALEVATTMKGELLKARSEMEAFKFEVEAKVLLLKKEEEKHKAHLRAVNAITKGLEKEKFQLLKEKDDLAQV
        ASIH AVM+KAELD RE LAA+E  NS AALE ATT+KGELLKA+ E++  + EV+AKV LLKKE EKHKAHLRA +AITKGLEKEKFQLLKEKDDLAQV
Subjt:  ASIHSAVMIKAELDEREILAARESANSSAALEVATTMKGELLKARSEMEAFKFEVEAKVLLLKKEEEKHKAHLRAVNAITKGLEKEKFQLLKEKDDLAQV

Query:  LEEKDTLIGRLTTELKEEKEHLTNGALLEAAFRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQIDLGDLKKRYAEKWAFGPNSTPGPTSLVEKYVREL
        LEEKD  IGRLTTELK+ KE LTNG LLE +FRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQIDL  LKK+Y+EKWA GPN TP P SLV+KYVREL
Subjt:  LEEKDTLIGRLTTELKEEKEHLTNGALLEAAFRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQIDLGDLKKRYAEKWAFGPNSTPGPTSLVEKYVREL

Query:  NSDYSDLEE
        +SDYSD+EE
Subjt:  NSDYSDLEE

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGACCGTTGTGGAAGATGTTTCGATCTGCCAGGCTGTCGGAGCACTCGAGTATTCCATCGTTACGAATTTCGAGATGATCCTAGCCACTCGTTCACTACACATGTCAGT
CATTCTTGTTTTACTCTTTATTTCGAATATGTTGGTTTTCATGTCCTCCCCCTCTACTAGTGATGGCTTAGGTAGCGCAGGTCGGACTATAAGCAGTTCACCCCCTCAGC
CAAGTGACTCTGGGGAGGATTTAGCTCATAGGTTAGAGTCCGAATTGGAAGAGGTAGAGAACTTTAGGTTTTCTGATGATGGGGAAGATAGCGACACTTCCACCTCGGTC
CAGGGTTTGGAATACCCTTCGAAAATGCCTGAACACTATCTTGGAACCCTCCGTAGGGGACTTCCCCTTCACCCTTTTTCCCAGGAGTTCTTAAACCGAACTGGACTGAC
TCCTGCTCAAGTGGCCCTCAATGGCTGGGGTGTCATTTTTGCTTTGGCCATCCTCTTTTGGCGACGAGCTCGAGAAGAGGACGAGGCCGATCTGCTAGATATTGAACAGC
TTCTAGGGTGCTTTGAAGCTAAAAGGATAGTTAAGAAGCCAGGTCGGTACTACATGTGCGCAAGGAAGGGCGCGGGTGGTATAGTCAAAGGGCCGACCTCCATCAAAGGA
TGGGTGGGGAAATGGTTCTTTGTCTCTAGAGAATGGCTGGCAAAAAACGAGTCTGGTCGTCCCTTCTTTGACGTTCCTGTTAGGTTTGAGAATTTAGCAATGGTGTGCGG
ATTCACTAGTAGCGTGAAGCGCAAGTCTAAGGGTCGTGCTCACGCCCTCAAGACCATTCAGAGCACTGAGCCAGCAACTCCCGTTGTCGCTCGACCTGCGGCTCAAGACA
AAGCTGGGCCCTCTGTCGACTCCCCAACTCCGGTGCGATTCAGGATAGAACCATCGAGCTCCGGGGTAAAGGACCAGGCGTCCCGCATCTCGGCTGCGTGCTTGGACGGC
TGCCTTAGAAGGGCGTCCAAGTTCATGAGCGACCCTAGGTCTGTACTGCAACGGACCATTGACCACGTCGCTGAGGCATTCATTGCTTCCATTCACTCGGCAGTCATGAT
AAAGGCTGAGTTGGATGAGAGGGAGATCCTGGCAGCTAGGGAGAGTGCGAACTCTTCTGCTGCCTTGGAAGTTGCCACCACAATGAAGGGCGAGCTACTGAAAGCTCGCT
CCGAAATGGAGGCCTTCAAATTCGAGGTGGAGGCCAAGGTTCTGCTGCTGAAAAAAGAAGAAGAAAAGCACAAGGCCCACCTCCGAGCTGTTAATGCCATCACCAAGGGG
TTGGAGAAGGAGAAGTTCCAGCTCCTGAAAGAGAAGGACGACCTGGCTCAAGTCCTTGAGGAGAAGGACACTTTGATAGGGCGTCTTACCACCGAGCTCAAGGAGGAGAA
GGAACACCTTACCAATGGAGCTCTCTTGGAAGCAGCATTCAGGCAACACCCTGACTTTGATGGGTTCGCCAAGGACTTCAGTGACGCGGGCTTCAAATTCCTGATGAAGG
GCATTGCTGCTGACATGCCCCACCTCCAAATCGACCTCGGCGATCTAAAAAAGAGGTATGCTGAGAAATGGGCTTTTGGGCCTAACAGCACTCCAGGCCCTACATCCTTG
GTGGAAAAGTACGTCAGAGAACTGAACTCTGACTACTCCGACCTGGAAGAATAA
mRNA sequence
ATGACCGTTGTGGAAGATGTTTCGATCTGCCAGGCTGTCGGAGCACTCGAGTATTCCATCGTTACGAATTTCGAGATGATCCTAGCCACTCGTTCACTACACATGTCAGT
CATTCTTGTTTTACTCTTTATTTCGAATATGTTGGTTTTCATGTCCTCCCCCTCTACTAGTGATGGCTTAGGTAGCGCAGGTCGGACTATAAGCAGTTCACCCCCTCAGC
CAAGTGACTCTGGGGAGGATTTAGCTCATAGGTTAGAGTCCGAATTGGAAGAGGTAGAGAACTTTAGGTTTTCTGATGATGGGGAAGATAGCGACACTTCCACCTCGGTC
CAGGGTTTGGAATACCCTTCGAAAATGCCTGAACACTATCTTGGAACCCTCCGTAGGGGACTTCCCCTTCACCCTTTTTCCCAGGAGTTCTTAAACCGAACTGGACTGAC
TCCTGCTCAAGTGGCCCTCAATGGCTGGGGTGTCATTTTTGCTTTGGCCATCCTCTTTTGGCGACGAGCTCGAGAAGAGGACGAGGCCGATCTGCTAGATATTGAACAGC
TTCTAGGGTGCTTTGAAGCTAAAAGGATAGTTAAGAAGCCAGGTCGGTACTACATGTGCGCAAGGAAGGGCGCGGGTGGTATAGTCAAAGGGCCGACCTCCATCAAAGGA
TGGGTGGGGAAATGGTTCTTTGTCTCTAGAGAATGGCTGGCAAAAAACGAGTCTGGTCGTCCCTTCTTTGACGTTCCTGTTAGGTTTGAGAATTTAGCAATGGTGTGCGG
ATTCACTAGTAGCGTGAAGCGCAAGTCTAAGGGTCGTGCTCACGCCCTCAAGACCATTCAGAGCACTGAGCCAGCAACTCCCGTTGTCGCTCGACCTGCGGCTCAAGACA
AAGCTGGGCCCTCTGTCGACTCCCCAACTCCGGTGCGATTCAGGATAGAACCATCGAGCTCCGGGGTAAAGGACCAGGCGTCCCGCATCTCGGCTGCGTGCTTGGACGGC
TGCCTTAGAAGGGCGTCCAAGTTCATGAGCGACCCTAGGTCTGTACTGCAACGGACCATTGACCACGTCGCTGAGGCATTCATTGCTTCCATTCACTCGGCAGTCATGAT
AAAGGCTGAGTTGGATGAGAGGGAGATCCTGGCAGCTAGGGAGAGTGCGAACTCTTCTGCTGCCTTGGAAGTTGCCACCACAATGAAGGGCGAGCTACTGAAAGCTCGCT
CCGAAATGGAGGCCTTCAAATTCGAGGTGGAGGCCAAGGTTCTGCTGCTGAAAAAAGAAGAAGAAAAGCACAAGGCCCACCTCCGAGCTGTTAATGCCATCACCAAGGGG
TTGGAGAAGGAGAAGTTCCAGCTCCTGAAAGAGAAGGACGACCTGGCTCAAGTCCTTGAGGAGAAGGACACTTTGATAGGGCGTCTTACCACCGAGCTCAAGGAGGAGAA
GGAACACCTTACCAATGGAGCTCTCTTGGAAGCAGCATTCAGGCAACACCCTGACTTTGATGGGTTCGCCAAGGACTTCAGTGACGCGGGCTTCAAATTCCTGATGAAGG
GCATTGCTGCTGACATGCCCCACCTCCAAATCGACCTCGGCGATCTAAAAAAGAGGTATGCTGAGAAATGGGCTTTTGGGCCTAACAGCACTCCAGGCCCTACATCCTTG
GTGGAAAAGTACGTCAGAGAACTGAACTCTGACTACTCCGACCTGGAAGAATAA
Protein sequence
MTVVEDVSICQAVGALEYSIVTNFEMILATRSLHMSVILVLLFISNMLVFMSSPSTSDGLGSAGRTISSSPPQPSDSGEDLAHRLESELEEVENFRFSDDGEDSDTSTSV
QGLEYPSKMPEHYLGTLRRGLPLHPFSQEFLNRTGLTPAQVALNGWGVIFALAILFWRRAREEDEADLLDIEQLLGCFEAKRIVKKPGRYYMCARKGAGGIVKGPTSIKG
WVGKWFFVSREWLAKNESGRPFFDVPVRFENLAMVCGFTSSVKRKSKGRAHALKTIQSTEPATPVVARPAAQDKAGPSVDSPTPVRFRIEPSSSGVKDQASRISAACLDG
CLRRASKFMSDPRSVLQRTIDHVAEAFIASIHSAVMIKAELDEREILAARESANSSAALEVATTMKGELLKARSEMEAFKFEVEAKVLLLKKEEEKHKAHLRAVNAITKG
LEKEKFQLLKEKDDLAQVLEEKDTLIGRLTTELKEEKEHLTNGALLEAAFRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQIDLGDLKKRYAEKWAFGPNSTPGPTSL
VEKYVRELNSDYSDLEE
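
The protein sequence above corresponds to a codon-by-codon translation of the CDS sequence. A minimal sketch of that translation follows, assuming the standard genetic code (NCBI translation table 1); the function name `translate` is hypothetical and is not part of the database record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order at each position; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS string into protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        # Codons containing anything other than A, C, G, T become "X".
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS sequence listed above.
print(translate("ATGACCGTTGTGGAAGATGTTTCG"))
```

Passing the full CDS sequence (with or without line breaks) to `translate` should give a string that can be compared directly against the protein sequence listed above.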