; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g29460 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g29460
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr6:22185790..22187627
RNA-Seq ExpressionMoc06g29460
SyntenyMoc06g29460
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]1.7e-9769.03Show/hide
Query:  MFEYGLRLPLHPFAKEFLNRTGLAPAQVAPNGWSVIFALAILFWLRAREEDEVDLLDVEQLLGCFEAKRIAKKPCRYYMCARKGAGGTVKGLTSIKGWMG
        MFEYGLRLPLHPF +EFL RTGLAPAQVAPNGW VIFALAILFWLRAR+ +E +LLDV+QLL CFEAKRIAKKP R+YMCARKGAGG VKG TSIKGW+ 
Subjt:  MFEYGLRLPLHPFAKEFLNRTGLAPAQVAPNGWSVIFALAILFWLRAREEDEVDLLDVEQLLGCFEAKRIAKKPCRYYMCARKGAGGTVKGLTSIKGWMG

Query:  KWFFASGEWLAKNEFGRPFFDVPVRFGNLVLIKPIPELTRASFNTLKYYKDHFPRGRKIGTLMTDKLLLESGLLDYNPLLRP------------------
        KWF+ASGEWLAK+E GR FFDVP RFGNLV I+P+PELT+ASF+TLKYYK+ FPRGRK+GTL+TD+LLLESGLLDYNP +RP                  
Subjt:  KWFFASGEWLAKNEFGRPFFDVPVRFGNLVLIKPIPELTRASFNTLKYYKDHFPRGRKIGTLMTDKLLLESGLLDYNPLLRP------------------

Query:  -VKRKSKGRAHALKTIQSTEPATPAVAQPAAQDKVGPSVEAPTPMIELDSVEECSREKHSRNESEALD
         VKRKSKGRAHAL+  QS++P TPAV        VGP+ E P P+IEL+S    SREK  R+++EA+D
Subjt:  -VKRKSKGRAHALKTIQSTEPATPAVAQPAAQDKVGPSVEAPTPMIELDSVEECSREKHSRNESEALD

XP_022158122.1 uncharacterized protein LOC111024680 [Momordica charantia]6.2e-8480.43Show/hide
Query:  MFEYGLRLPLHPFAKEFLNRTGLAPAQVAPNGWSVIFALAILFWLRAREEDEVDLLDVEQLLGCFEAKRIAKKPCRYYMCARKGAGGTVKGLTSIKGWMG
        MFEYGLRLPLHPF +EFL RTGLAPAQVAPNGW VIFALAILFWLRAR+ +E +LLDV+QLL CFEAKRIAKKP R+YMCARKGAGG VKG TSIKGW+ 
Subjt:  MFEYGLRLPLHPFAKEFLNRTGLAPAQVAPNGWSVIFALAILFWLRAREEDEVDLLDVEQLLGCFEAKRIAKKPCRYYMCARKGAGGTVKGLTSIKGWMG

Query:  KWFFASGEWLAKNEFGRPFFDVPVRFGNLVLIKPIPELTRASFNTLKYYKDHFPRGRKIGTLMTDKLLLESGLLDYNPLLRPVK
        KWF+ASGEWLAK+E GR FFDVP RFGNLV I+P+PELT+ASF+TLKYYK+ FPRGRK+GTL+TD+LLLESGLLDYNP +RP++
Subjt:  KWFFASGEWLAKNEFGRPFFDVPVRFGNLVLIKPIPELTRASFNTLKYYKDHFPRGRKIGTLMTDKLLLESGLLDYNPLLRPVK

XP_022158650.1 uncharacterized protein LOC111025108 [Momordica charantia]1.6e-8480.98Show/hide
Query:  MFEYGLRLPLHPFAKEFLNRTGLAPAQVAPNGWSVIFALAILFWLRAREEDEVDLLDVEQLLGCFEAKRIAKKPCRYYMCARKGAGGTVKGLTSIKGWMG
        MFEYGLRLPLHPF +EFL RTGLAPAQVAPNGW VIFALAILFWLRAR+ +E +LLDV+QLL CFEAKRIAKKP R+YMCARKGA G VKG TSIKGW+ 
Subjt:  MFEYGLRLPLHPFAKEFLNRTGLAPAQVAPNGWSVIFALAILFWLRAREEDEVDLLDVEQLLGCFEAKRIAKKPCRYYMCARKGAGGTVKGLTSIKGWMG

Query:  KWFFASGEWLAKNEFGRPFFDVPVRFGNLVLIKPIPELTRASFNTLKYYKDHFPRGRKIGTLMTDKLLLESGLLDYNPLLRPVK
        KWF+ASGEWLAK+E GR FFDVP RFGNLV I+P+PELT+ASF+TLKYYK+HFPRGRK+GTL+TDKLLLESGLLDYNP +RP++
Subjt:  KWFFASGEWLAKNEFGRPFFDVPVRFGNLVLIKPIPELTRASFNTLKYYKDHFPRGRKIGTLMTDKLLLESGLLDYNPLLRPVK

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]4.6e-11969.13Show/hide
Query:  MPEHYLGALHKGFSIPDDIIFRIPEEGERADNPTEGWVTLYLKMFEYGLRLPLHPFAKEFLNRTGLAPAQVAPNGWSVIFALAILFWLRAREEDEVDLLD
        +PEHYLG+L +GF+IP++I+ R+PEEGERADNP EGWVTLY KMFEYGLRLPLHPF +EFL RTGLAPAQVAPNGW VIFALAILFWLRAR+ +E +L D
Subjt:  MPEHYLGALHKGFSIPDDIIFRIPEEGERADNPTEGWVTLYLKMFEYGLRLPLHPFAKEFLNRTGLAPAQVAPNGWSVIFALAILFWLRAREEDEVDLLD

Query:  VEQLLGCFEAKRIAKKPCRYYMCARKGAGGTVKGLTSIKGWMGKWFFASGEWLAKNEFGRPFFDVPVRFGNLVLIKPIPELTRASFNTLKYYKDHFPRGR
        V+QLL CFEAKRIAKKP R+YMCARKGAGG VKG TSIKGW+ KWF+ASGEWLAK+E GR FFDVP RFGNLV I+P+PELT+ASF+TLKYYK+ FPRGR
Subjt:  VEQLLGCFEAKRIAKKPCRYYMCARKGAGGTVKGLTSIKGWMGKWFFASGEWLAKNEFGRPFFDVPVRFGNLVLIKPIPELTRASFNTLKYYKDHFPRGR

Query:  KIGTLMTDKLLLESGLLDYNPLLRP-------------------VKRKSKGRAHALKTIQSTEPATPAVAQPAAQDKVGPSVEAPTPMIELDSVEECSRE
        K+GTL+TD+LLLESGLLDYNP +RP                   VKRKSKGRAHAL+  QS++PATPAV        VGP+ E P  +IEL+S    SRE
Subjt:  KIGTLMTDKLLLESGLLDYNPLLRP-------------------VKRKSKGRAHALKTIQSTEPATPAVAQPAAQDKVGPSVEAPTPMIELDSVEECSRE

Query:  KHSRNESEALD
        K  R+++EA+D
Subjt:  KHSRNESEALD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]1.9e-11259.43Show/hide
Query:  MCARKGAGGTVKGLTSIKGWMGKWFFASGEWLAKNEFGRPFFDVPVRFGNLVLIKPIPELTRASFNTLKYYKDHFPRGRKIGTLMTDKLLLESGLLDYNP
        MCARKG GG VKG TSIKGW+GKWFFASGEWLAK+E GR FFDVP RFGNLV IK IPEL +A+F+TLK+YKDHFPR RKI TL+TDKLLLESGLLDYNP
Subjt:  MCARKGAGGTVKGLTSIKGWMGKWFFASGEWLAKNEFGRPFFDVPVRFGNLVLIKPIPELTRASFNTLKYYKDHFPRGRKIGTLMTDKLLLESGLLDYNP

Query:  LLR-------------------PVKRKSKGRAHALKTIQSTEPATPAVAQPAAQDKVGPSVEAPTPMIELDSVEECSREKHSRNESEALDMSPFSD----
        L+R                    VKRKSKGRAHALKT+  TEP TP V +  AQ   GPS   PTP+IELD     S EK SR ESEALD+SP ++    
Subjt:  LLR-------------------PVKRKSKGRAHALKTIQSTEPATPAVAQPAAQDKVGPSVEAPTPMIELDSVEECSREKHSRNESEALDMSPFSD----

Query:  ---------------------------------------RGTSDVKMRFRMEPSSSGVKDQVSRISAACLDCCLRRASKFVSDHGSVLQQSIDHAAEAFI
                                               RGTS+V+MRF MEPSSSGVKDQVSRISA CLD  LRRASKFVSD GSVLQ++ID+ AEAFI
Subjt:  ---------------------------------------RGTSDVKMRFRMEPSSSGVKDQVSRISAACLDCCLRRASKFVSDHGSVLQQSIDHAAEAFI

Query:  ASIHSAVMKKAELDGREILAARESVNSSAALEAATTMNGELLKARSEVETLKAE---------------------------GLEKEKFQLLKEKDDMVQI
        ASIH AVM KAELDGRE LAA+E  NS AALEAATT+ GELLKA+ EV+ L+AE                           GLEKEKFQLLKEKDD+ Q+
Subjt:  ASIHSAVMKKAELDGREILAARESVNSSAALEAATTMNGELLKARSEVETLKAE---------------------------GLEKEKFQLLKEKDDMVQI

Query:  LEENDALIGRLTTELNEEK
        LEE DA IGRLTTEL + K
Subjt:  LEENDALIGRLTTELNEEK

TrEMBL top hitse value%identityAlignment
A0A6J1CR42 uncharacterized protein LOC1110138268.2e-9869.03Show/hide
Query:  MFEYGLRLPLHPFAKEFLNRTGLAPAQVAPNGWSVIFALAILFWLRAREEDEVDLLDVEQLLGCFEAKRIAKKPCRYYMCARKGAGGTVKGLTSIKGWMG
        MFEYGLRLPLHPF +EFL RTGLAPAQVAPNGW VIFALAILFWLRAR+ +E +LLDV+QLL CFEAKRIAKKP R+YMCARKGAGG VKG TSIKGW+ 
Subjt:  MFEYGLRLPLHPFAKEFLNRTGLAPAQVAPNGWSVIFALAILFWLRAREEDEVDLLDVEQLLGCFEAKRIAKKPCRYYMCARKGAGGTVKGLTSIKGWMG

Query:  KWFFASGEWLAKNEFGRPFFDVPVRFGNLVLIKPIPELTRASFNTLKYYKDHFPRGRKIGTLMTDKLLLESGLLDYNPLLRP------------------
        KWF+ASGEWLAK+E GR FFDVP RFGNLV I+P+PELT+ASF+TLKYYK+ FPRGRK+GTL+TD+LLLESGLLDYNP +RP                  
Subjt:  KWFFASGEWLAKNEFGRPFFDVPVRFGNLVLIKPIPELTRASFNTLKYYKDHFPRGRKIGTLMTDKLLLESGLLDYNPLLRP------------------

Query:  -VKRKSKGRAHALKTIQSTEPATPAVAQPAAQDKVGPSVEAPTPMIELDSVEECSREKHSRNESEALD
         VKRKSKGRAHAL+  QS++P TPAV        VGP+ E P P+IEL+S    SREK  R+++EA+D
Subjt:  -VKRKSKGRAHALKTIQSTEPATPAVAQPAAQDKVGPSVEAPTPMIELDSVEECSREKHSRNESEALD

A0A6J1DWD2 uncharacterized protein LOC1110246803.0e-8480.43Show/hide
Query:  MFEYGLRLPLHPFAKEFLNRTGLAPAQVAPNGWSVIFALAILFWLRAREEDEVDLLDVEQLLGCFEAKRIAKKPCRYYMCARKGAGGTVKGLTSIKGWMG
        MFEYGLRLPLHPF +EFL RTGLAPAQVAPNGW VIFALAILFWLRAR+ +E +LLDV+QLL CFEAKRIAKKP R+YMCARKGAGG VKG TSIKGW+ 
Subjt:  MFEYGLRLPLHPFAKEFLNRTGLAPAQVAPNGWSVIFALAILFWLRAREEDEVDLLDVEQLLGCFEAKRIAKKPCRYYMCARKGAGGTVKGLTSIKGWMG

Query:  KWFFASGEWLAKNEFGRPFFDVPVRFGNLVLIKPIPELTRASFNTLKYYKDHFPRGRKIGTLMTDKLLLESGLLDYNPLLRPVK
        KWF+ASGEWLAK+E GR FFDVP RFGNLV I+P+PELT+ASF+TLKYYK+ FPRGRK+GTL+TD+LLLESGLLDYNP +RP++
Subjt:  KWFFASGEWLAKNEFGRPFFDVPVRFGNLVLIKPIPELTRASFNTLKYYKDHFPRGRKIGTLMTDKLLLESGLLDYNPLLRPVK

A0A6J1DWF1 uncharacterized protein LOC1110251087.9e-8580.98Show/hide
Query:  MFEYGLRLPLHPFAKEFLNRTGLAPAQVAPNGWSVIFALAILFWLRAREEDEVDLLDVEQLLGCFEAKRIAKKPCRYYMCARKGAGGTVKGLTSIKGWMG
        MFEYGLRLPLHPF +EFL RTGLAPAQVAPNGW VIFALAILFWLRAR+ +E +LLDV+QLL CFEAKRIAKKP R+YMCARKGA G VKG TSIKGW+ 
Subjt:  MFEYGLRLPLHPFAKEFLNRTGLAPAQVAPNGWSVIFALAILFWLRAREEDEVDLLDVEQLLGCFEAKRIAKKPCRYYMCARKGAGGTVKGLTSIKGWMG

Query:  KWFFASGEWLAKNEFGRPFFDVPVRFGNLVLIKPIPELTRASFNTLKYYKDHFPRGRKIGTLMTDKLLLESGLLDYNPLLRPVK
        KWF+ASGEWLAK+E GR FFDVP RFGNLV I+P+PELT+ASF+TLKYYK+HFPRGRK+GTL+TDKLLLESGLLDYNP +RP++
Subjt:  KWFFASGEWLAKNEFGRPFFDVPVRFGNLVLIKPIPELTRASFNTLKYYKDHFPRGRKIGTLMTDKLLLESGLLDYNPLLRPVK

A0A6J1DXS5 uncharacterized protein LOC1110255022.2e-11969.13Show/hide
Query:  MPEHYLGALHKGFSIPDDIIFRIPEEGERADNPTEGWVTLYLKMFEYGLRLPLHPFAKEFLNRTGLAPAQVAPNGWSVIFALAILFWLRAREEDEVDLLD
        +PEHYLG+L +GF+IP++I+ R+PEEGERADNP EGWVTLY KMFEYGLRLPLHPF +EFL RTGLAPAQVAPNGW VIFALAILFWLRAR+ +E +L D
Subjt:  MPEHYLGALHKGFSIPDDIIFRIPEEGERADNPTEGWVTLYLKMFEYGLRLPLHPFAKEFLNRTGLAPAQVAPNGWSVIFALAILFWLRAREEDEVDLLD

Query:  VEQLLGCFEAKRIAKKPCRYYMCARKGAGGTVKGLTSIKGWMGKWFFASGEWLAKNEFGRPFFDVPVRFGNLVLIKPIPELTRASFNTLKYYKDHFPRGR
        V+QLL CFEAKRIAKKP R+YMCARKGAGG VKG TSIKGW+ KWF+ASGEWLAK+E GR FFDVP RFGNLV I+P+PELT+ASF+TLKYYK+ FPRGR
Subjt:  VEQLLGCFEAKRIAKKPCRYYMCARKGAGGTVKGLTSIKGWMGKWFFASGEWLAKNEFGRPFFDVPVRFGNLVLIKPIPELTRASFNTLKYYKDHFPRGR

Query:  KIGTLMTDKLLLESGLLDYNPLLRP-------------------VKRKSKGRAHALKTIQSTEPATPAVAQPAAQDKVGPSVEAPTPMIELDSVEECSRE
        K+GTL+TD+LLLESGLLDYNP +RP                   VKRKSKGRAHAL+  QS++PATPAV        VGP+ E P  +IEL+S    SRE
Subjt:  KIGTLMTDKLLLESGLLDYNPLLRP-------------------VKRKSKGRAHALKTIQSTEPATPAVAQPAAQDKVGPSVEAPTPMIELDSVEECSRE

Query:  KHSRNESEALD
        K  R+++EA+D
Subjt:  KHSRNESEALD

A0A6J1DZB3 uncharacterized protein LOC1110256659.0e-11359.43Show/hide
Query:  MCARKGAGGTVKGLTSIKGWMGKWFFASGEWLAKNEFGRPFFDVPVRFGNLVLIKPIPELTRASFNTLKYYKDHFPRGRKIGTLMTDKLLLESGLLDYNP
        MCARKG GG VKG TSIKGW+GKWFFASGEWLAK+E GR FFDVP RFGNLV IK IPEL +A+F+TLK+YKDHFPR RKI TL+TDKLLLESGLLDYNP
Subjt:  MCARKGAGGTVKGLTSIKGWMGKWFFASGEWLAKNEFGRPFFDVPVRFGNLVLIKPIPELTRASFNTLKYYKDHFPRGRKIGTLMTDKLLLESGLLDYNP

Query:  LLR-------------------PVKRKSKGRAHALKTIQSTEPATPAVAQPAAQDKVGPSVEAPTPMIELDSVEECSREKHSRNESEALDMSPFSD----
        L+R                    VKRKSKGRAHALKT+  TEP TP V +  AQ   GPS   PTP+IELD     S EK SR ESEALD+SP ++    
Subjt:  LLR-------------------PVKRKSKGRAHALKTIQSTEPATPAVAQPAAQDKVGPSVEAPTPMIELDSVEECSREKHSRNESEALDMSPFSD----

Query:  ---------------------------------------RGTSDVKMRFRMEPSSSGVKDQVSRISAACLDCCLRRASKFVSDHGSVLQQSIDHAAEAFI
                                               RGTS+V+MRF MEPSSSGVKDQVSRISA CLD  LRRASKFVSD GSVLQ++ID+ AEAFI
Subjt:  ---------------------------------------RGTSDVKMRFRMEPSSSGVKDQVSRISAACLDCCLRRASKFVSDHGSVLQQSIDHAAEAFI

Query:  ASIHSAVMKKAELDGREILAARESVNSSAALEAATTMNGELLKARSEVETLKAE---------------------------GLEKEKFQLLKEKDDMVQI
        ASIH AVM KAELDGRE LAA+E  NS AALEAATT+ GELLKA+ EV+ L+AE                           GLEKEKFQLLKEKDD+ Q+
Subjt:  ASIHSAVMKKAELDGREILAARESVNSSAALEAATTMNGELLKARSEVETLKAE---------------------------GLEKEKFQLLKEKDDMVQI

Query:  LEENDALIGRLTTELNEEK
        LEE DA IGRLTTEL + K
Subjt:  LEENDALIGRLTTELNEEK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTGAGCACTATCTTGGAGCCCTCCATAAGGGGTTTAGTATTCCGGATGACATCATCTTTAGAATTCCGGAGGAAGGGGAAAGAGCTGACAATCCTACAGAGGGATG
GGTCACTCTTTACTTAAAGATGTTTGAGTACGGCCTCAGACTCCCCCTTCACCCTTTTGCCAAAGAGTTCTTAAACCGAACTGGACTGGCTCCTGCTCAAGTGGCCCCCA
ATGGATGGAGTGTCATTTTTGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGAGAAGAGGACGAAGTCGATCTGTTAGATGTTGAACAGCTTCTAGGGTGCTTTGAAGCT
AAAAGGATAGCTAAGAAGCCATGTCGGTACTACATGTGCGCAAGGAAGGGCGCGGGTGGTACAGTCAAAGGGCTGACCTCCATCAAAGGATGGATGGGGAAGTGGTTCTT
TGCCTCTGGAGAGTGGCTGGCAAAGAACGAGTTTGGTCGTCCCTTCTTTGACGTTCCCGTTAGGTTCGGGAATTTAGTGTTGATCAAACCAATTCCCGAGCTCACTCGAG
CCTCCTTCAATACCCTTAAGTATTATAAGGATCACTTTCCAAGGGGTAGGAAGATCGGAACCTTGATGACTGACAAGCTGCTTCTCGAATCCGGGTTGTTAGATTACAAC
CCCTTATTACGCCCTGTGAAGCGCAAGTCTAAGGGTCGTGCTCACGCCCTCAAGACTATTCAGAGCACAGAGCCAGCAACTCCCGCTGTCGCTCAACCTGCGGCTCAAGA
CAAAGTTGGGCCCTCTGTTGAAGCCCCAACTCCGATGATCGAGTTGGACTCTGTCGAGGAGTGTTCCAGAGAAAAGCATTCGAGGAATGAGTCCGAGGCGCTGGACATGT
CTCCTTTCTCGGATAGGGGGACATCCGACGTGAAGATGCGGTTCAGAATGGAACCGTCGAGCTCCGGGGTAAAGGACCAAGTGTCCCGCATCTCGGCTGCATGCTTGGAC
TGCTGCCTTAGAAGGGCGTCCAAGTTCGTAAGTGACCATGGGTCCGTACTGCAACAGTCCATTGACCACGCCGCTGAGGCGTTCATTGCTTCCATTCACTCGGCAGTTAT
GAAGAAGGCCGAGCTGGATGGAAGGGAGATCTTGGCAGCTAGGGAGAGTGTGAATTCCTCTGCTGCCTTGGAGGCTGCCACCACAATGAATGGCGAGCTACTGAAAGCTC
GCTCCGAAGTGGAGACTTTGAAGGCCGAGGGGTTGGAGAAGGAGAAGTTCCAGCTCCTGAAGGAGAAGGATGACATGGTTCAAATCCTTGAGGAGAATGACGCTTTGATA
GGCCGCCTTACCACCGAGCTCAATGAGGAGAAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGCCTGAGCACTATCTTGGAGCCCTCCATAAGGGGTTTAGTATTCCGGATGACATCATCTTTAGAATTCCGGAGGAAGGGGAAAGAGCTGACAATCCTACAGAGGGATG
GGTCACTCTTTACTTAAAGATGTTTGAGTACGGCCTCAGACTCCCCCTTCACCCTTTTGCCAAAGAGTTCTTAAACCGAACTGGACTGGCTCCTGCTCAAGTGGCCCCCA
ATGGATGGAGTGTCATTTTTGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGAGAAGAGGACGAAGTCGATCTGTTAGATGTTGAACAGCTTCTAGGGTGCTTTGAAGCT
AAAAGGATAGCTAAGAAGCCATGTCGGTACTACATGTGCGCAAGGAAGGGCGCGGGTGGTACAGTCAAAGGGCTGACCTCCATCAAAGGATGGATGGGGAAGTGGTTCTT
TGCCTCTGGAGAGTGGCTGGCAAAGAACGAGTTTGGTCGTCCCTTCTTTGACGTTCCCGTTAGGTTCGGGAATTTAGTGTTGATCAAACCAATTCCCGAGCTCACTCGAG
CCTCCTTCAATACCCTTAAGTATTATAAGGATCACTTTCCAAGGGGTAGGAAGATCGGAACCTTGATGACTGACAAGCTGCTTCTCGAATCCGGGTTGTTAGATTACAAC
CCCTTATTACGCCCTGTGAAGCGCAAGTCTAAGGGTCGTGCTCACGCCCTCAAGACTATTCAGAGCACAGAGCCAGCAACTCCCGCTGTCGCTCAACCTGCGGCTCAAGA
CAAAGTTGGGCCCTCTGTTGAAGCCCCAACTCCGATGATCGAGTTGGACTCTGTCGAGGAGTGTTCCAGAGAAAAGCATTCGAGGAATGAGTCCGAGGCGCTGGACATGT
CTCCTTTCTCGGATAGGGGGACATCCGACGTGAAGATGCGGTTCAGAATGGAACCGTCGAGCTCCGGGGTAAAGGACCAAGTGTCCCGCATCTCGGCTGCATGCTTGGAC
TGCTGCCTTAGAAGGGCGTCCAAGTTCGTAAGTGACCATGGGTCCGTACTGCAACAGTCCATTGACCACGCCGCTGAGGCGTTCATTGCTTCCATTCACTCGGCAGTTAT
GAAGAAGGCCGAGCTGGATGGAAGGGAGATCTTGGCAGCTAGGGAGAGTGTGAATTCCTCTGCTGCCTTGGAGGCTGCCACCACAATGAATGGCGAGCTACTGAAAGCTC
GCTCCGAAGTGGAGACTTTGAAGGCCGAGGGGTTGGAGAAGGAGAAGTTCCAGCTCCTGAAGGAGAAGGATGACATGGTTCAAATCCTTGAGGAGAATGACGCTTTGATA
GGCCGCCTTACCACCGAGCTCAATGAGGAGAAGTAA
Protein sequenceShow/hide protein sequence
MPEHYLGALHKGFSIPDDIIFRIPEEGERADNPTEGWVTLYLKMFEYGLRLPLHPFAKEFLNRTGLAPAQVAPNGWSVIFALAILFWLRAREEDEVDLLDVEQLLGCFEA
KRIAKKPCRYYMCARKGAGGTVKGLTSIKGWMGKWFFASGEWLAKNEFGRPFFDVPVRFGNLVLIKPIPELTRASFNTLKYYKDHFPRGRKIGTLMTDKLLLESGLLDYN
PLLRPVKRKSKGRAHALKTIQSTEPATPAVAQPAAQDKVGPSVEAPTPMIELDSVEECSREKHSRNESEALDMSPFSDRGTSDVKMRFRMEPSSSGVKDQVSRISAACLD
CCLRRASKFVSDHGSVLQQSIDHAAEAFIASIHSAVMKKAELDGREILAARESVNSSAALEAATTMNGELLKARSEVETLKAEGLEKEKFQLLKEKDDMVQILEENDALI
GRLTTELNEEK