CuGenDBv2

Moc06g28760 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc06g28760
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Ulp1-like peptidase
Genome location: chr6:21687952..21694836
RNA-Seq Expression: Moc06g28760
Synteny: Moc06g28760
Gene Ontology terms: NA
InterPro domains: NA
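
The genome location above is written as chromosome:start..end. As a minimal sketch of how such a string could be split into its parts, the Python below parses it and computes the length of the interval. The names `Locus` and `parse_locus` are hypothetical, and the 1-based, inclusive coordinate convention is an assumption, since the page does not state it.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A genomic interval as shown in the 'Genome location' field."""
    chrom: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # ASSUMPTION: coordinates are 1-based and inclusive,
        # so the interval length is end - start + 1.
        return self.end - self.start + 1


# Matches strings of the form "<chromosome>:<start>..<end>".
_LOCUS_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_locus(text: str) -> Locus:
    """Parse 'chr:start..end' into a Locus, raising ValueError on bad input."""
    match = _LOCUS_RE.match(text.strip())
    if match is None:
        raise ValueError(f"not a chrom:start..end location: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start exceeds end in location: {text!r}")
    return Locus(match["chrom"], start, end)


locus = parse_locus("chr6:21687952..21694836")
print(locus.chrom, locus.start, locus.end, locus.span)
```

Under the inclusive-coordinate assumption, the interval for Moc06g28760 spans 6885 bp. A genomic span may include introns and untranslated regions, so it need not equal the length of the CDS sequence.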


Homology
GenBank top hits    e value    % identity    Alignment
XP_022154561.1 uncharacterized protein LOC111021802 [Momordica charantia]    4.7e-67    47.22
Query:  MLRKIVFGHLLDLDLVFNKPLIHNILLREIDDSTPDTINFNLFRSKVSFGRREFDIISG-----------------------------LNDFEKIYIIAQ
        M RK  F HLLD+DLVFN  LIHNILLRE+++STP+TI+FNLFR ++SF R +F +ISG                             L+DFEK+Y  A+
Subjt:  MLRKIVFGHLLDLDLVFNKPLIHNILLREIDDSTPDTINFNLFRSKVSFGRREFDIISG-----------------------------LNDFEKIYIIAQ

Query:  FEDDFDAVNISIVYLVELVLLRKERALKYDYTLLGIVDDWETFYNHDWSILSFDKTIYSLKRGPTKRSKNGRFRKSYSLYNLLQPRLGHAVWAYETISSL
        FEDD+D V + IVY+V + LL +ER +K+D+TLLGIVDDWE   N++W+ LSF+KTI SL+RGP K SK+G+ RKSYSLY    P +   VWAY+TISSL
Subjt:  FEDDFDAVNISIVYLVELVLLRKERALKYDYTLLGIVDDWETFYNHDWSILSFDKTIYSLKRGPTKRSKNGRFRKSYSLYNLLQPRLGHAVWAYETISSL

Query:  FGRMVNKV-----------NLDVATYSSVEVQPFNC--MGKTRSIEAMEVEMIFLNRAFKPPEPKDEDEVPHENDAAKPSNARVGPEKDDRGQAANIDEG
          R+ NKV             D +T   V  +   C   G+TR+++  +VE  FLNR+F PP   D+D +    D A PS  R G + DD  + A++ E 
Subjt:  FGRMVNKV-----------NLDVATYSSVEVQPFNC--MGKTRSIEAMEVEMIFLNRAFKPPEPKDEDEVPHENDAAKPSNARVGPEKDDRGQAANIDEG

Query:  VRKDIHVEVEEKEENGSGKKVHMSSVRLRKVEKRL----KRMDDRMKGIEAELKSIQKFL
        V KD  +   E EE     KV +S+ RL++VEK L    KRMD+RM  IEAELKSI+KFL
Subjt:  VRKDIHVEVEEKEENGSGKKVHMSSVRLRKVEKRL----KRMDDRMKGIEAELKSIQKFL

XP_022156465.1 uncharacterized protein LOC111023353 [Momordica charantia]    4.7e-51    45.71
Query:  MLRKIVFGHLLDLDLVFNKPLIHNILLREIDDSTPDTINFNLFRSKVSFGRREFDIISG-----------------------------LNDFEKIYIIAQ
        M RK +FGHLLD+DLVFN PLIHNILLRE++DSTP+TI+FNLF  +VSFGRREFD+ISG                             L+DF K+YI A 
Subjt:  MLRKIVFGHLLDLDLVFNKPLIHNILLREIDDSTPDTINFNLFRSKVSFGRREFDIISG-----------------------------LNDFEKIYIIAQ

Query:  FEDDFDAVNISIVYLVELVLLRKERALKYDYTLLGIVDDWETFYNHDWSILSFDKTIYSLKRGPTKRSKNGRFRKSYSLYNLLQPRLGHAVWAYETISSL
        F+DDFD + +SI+Y+VELVLL +E  +K+D  LLG+VDDWE   NHD + LSFDKTI SL RGPT  +K+   RKSYSLY    P +   VW YE     
Subjt:  FEDDFDAVNISIVYLVELVLLRKERALKYDYTLLGIVDDWETFYNHDWSILSFDKTIYSLKRGPTKRSKNGRFRKSYSLYNLLQPRLGHAVWAYETISSL

Query:  FGRMVNKVNLDVATYSSVEVQPFNCMGKTRSIEAMEVEMIFLNRAFKPPEPKDEDEVPHENDAAKPSNARVGPEKDDRGQ
                                   +TR +EA + E  F+ R F+PPEP+D+D    +   A PS  R G +  D G+
Subjt:  FGRMVNKVNLDVATYSSVEVQPFNCMGKTRSIEAMEVEMIFLNRAFKPPEPKDEDEVPHENDAAKPSNARVGPEKDDRGQ

XP_022157998.1 uncharacterized protein LOC111024595 [Momordica charantia]    4.9e-48    56.86
Query:  MLRKIVFGHLLDLDLVFNKPLIHNILLREIDDSTPDTINFNLFRSKVSFGRREFDIISG-----------------------------LNDFEKIYIIAQ
        M RK VFGHLLD+DLVFN PLIH++LLRE+++STPDTI+FNLF +KVSFGRREFDIISG                             L++ EK+Y    
Subjt:  MLRKIVFGHLLDLDLVFNKPLIHNILLREIDDSTPDTINFNLFRSKVSFGRREFDIISG-----------------------------LNDFEKIYIIAQ

Query:  FEDDFDAVNISIVYLVELVLLRKERALKYDYTLLGIVDDWETFYNHDWSILSFDKTIYSLKRGPTKRSKNGRFRKSYSLYNLLQPRLGHAVWAYETISSL
        FEDDFDAV +SIVY VELVLL           LLGIVDDWE   NHDW++LSF+KTIYSL+RG +K+SK G  RKSYSL+    P +   VWAYETISSL
Subjt:  FEDDFDAVNISIVYLVELVLLRKERALKYDYTLLGIVDDWETFYNHDWSILSFDKTIYSLKRGPTKRSKNGRFRKSYSLYNLLQPRLGHAVWAYETISSL

Query:  FGRM
         GR+
Subjt:  FGRM

XP_022159061.1 uncharacterized protein LOC111025501 [Momordica charantia]    3.7e-40    65.99
Query:  MLRKIVFGHLLDLDLVFNKPLIHNILLREIDDSTPDTINFNLFRSKVSFGRREFDIISG-----------------------------LNDFEKIYIIAQ
        M RK VFGHLLDLDLVFN PLIHNILLREI+ STPDTI+FNLF +K SFGR EFDIISG                             L++FEKIY+ A+
Subjt:  MLRKIVFGHLLDLDLVFNKPLIHNILLREIDDSTPDTINFNLFRSKVSFGRREFDIISG-----------------------------LNDFEKIYIIAQ

Query:  FEDDFDAVNISIVYLVELVLLRKERALKYDYTLLGIVDDWETFYNHD
        FEDDFDAV ISIVYLVELVLL +ER LKYDYTLLGIVDDWET  NHD
Subjt:  FEDDFDAVNISIVYLVELVLLRKERALKYDYTLLGIVDDWETFYNHD

XP_022159253.1 uncharacterized protein LOC111025666 [Momordica charantia]    1.1e-52    66.85
Query:  MLRKIVFGHLLDLDLVFNKPLIHNILLREIDD-STPDTINFNLFRSKVSFGRREFDIISG-----------------------------LNDFEKIYIIA
        M RK VFGHLLDLDLVFN  LIH ILLREI+D STP+TI+FNLF SKV F RREFDIISG                             L+DFEKIYI+ 
Subjt:  MLRKIVFGHLLDLDLVFNKPLIHNILLREIDD-STPDTINFNLFRSKVSFGRREFDIISG-----------------------------LNDFEKIYIIA

Query:  QFEDDFDAVNISIVYLVELVLLRKERALKYDYTLLGIVDDWETFYNHDWSILSFDKTIYSLKRGPTKRSKNGRFRKSYSLY
        +FEDDFDA  ISIVYL+ELVLL +ER LKYDYTLLGIVDD ET  NHDW ++SFDKTIYSLKRGPTKRSK+G FRK YSLY
Subjt:  QFEDDFDAVNISIVYLVELVLLRKERALKYDYTLLGIVDDWETFYNHDWSILSFDKTIYSLKRGPTKRSKNGRFRKSYSLY

TrEMBL top hits    e value    % identity    Alignment
A0A6J1DP34 uncharacterized protein LOC111021802    2.3e-67    47.22
Query:  MLRKIVFGHLLDLDLVFNKPLIHNILLREIDDSTPDTINFNLFRSKVSFGRREFDIISG-----------------------------LNDFEKIYIIAQ
        M RK  F HLLD+DLVFN  LIHNILLRE+++STP+TI+FNLFR ++SF R +F +ISG                             L+DFEK+Y  A+
Subjt:  MLRKIVFGHLLDLDLVFNKPLIHNILLREIDDSTPDTINFNLFRSKVSFGRREFDIISG-----------------------------LNDFEKIYIIAQ

Query:  FEDDFDAVNISIVYLVELVLLRKERALKYDYTLLGIVDDWETFYNHDWSILSFDKTIYSLKRGPTKRSKNGRFRKSYSLYNLLQPRLGHAVWAYETISSL
        FEDD+D V + IVY+V + LL +ER +K+D+TLLGIVDDWE   N++W+ LSF+KTI SL+RGP K SK+G+ RKSYSLY    P +   VWAY+TISSL
Subjt:  FEDDFDAVNISIVYLVELVLLRKERALKYDYTLLGIVDDWETFYNHDWSILSFDKTIYSLKRGPTKRSKNGRFRKSYSLYNLLQPRLGHAVWAYETISSL

Query:  FGRMVNKV-----------NLDVATYSSVEVQPFNC--MGKTRSIEAMEVEMIFLNRAFKPPEPKDEDEVPHENDAAKPSNARVGPEKDDRGQAANIDEG
          R+ NKV             D +T   V  +   C   G+TR+++  +VE  FLNR+F PP   D+D +    D A PS  R G + DD  + A++ E 
Subjt:  FGRMVNKV-----------NLDVATYSSVEVQPFNC--MGKTRSIEAMEVEMIFLNRAFKPPEPKDEDEVPHENDAAKPSNARVGPEKDDRGQAANIDEG

Query:  VRKDIHVEVEEKEENGSGKKVHMSSVRLRKVEKRL----KRMDDRMKGIEAELKSIQKFL
        V KD  +   E EE     KV +S+ RL++VEK L    KRMD+RM  IEAELKSI+KFL
Subjt:  VRKDIHVEVEEKEENGSGKKVHMSSVRLRKVEKRL----KRMDDRMKGIEAELKSIQKFL

A0A6J1DQC8 uncharacterized protein LOC111023353    2.3e-51    45.71
Query:  MLRKIVFGHLLDLDLVFNKPLIHNILLREIDDSTPDTINFNLFRSKVSFGRREFDIISG-----------------------------LNDFEKIYIIAQ
        M RK +FGHLLD+DLVFN PLIHNILLRE++DSTP+TI+FNLF  +VSFGRREFD+ISG                             L+DF K+YI A 
Subjt:  MLRKIVFGHLLDLDLVFNKPLIHNILLREIDDSTPDTINFNLFRSKVSFGRREFDIISG-----------------------------LNDFEKIYIIAQ

Query:  FEDDFDAVNISIVYLVELVLLRKERALKYDYTLLGIVDDWETFYNHDWSILSFDKTIYSLKRGPTKRSKNGRFRKSYSLYNLLQPRLGHAVWAYETISSL
        F+DDFD + +SI+Y+VELVLL +E  +K+D  LLG+VDDWE   NHD + LSFDKTI SL RGPT  +K+   RKSYSLY    P +   VW YE     
Subjt:  FEDDFDAVNISIVYLVELVLLRKERALKYDYTLLGIVDDWETFYNHDWSILSFDKTIYSLKRGPTKRSKNGRFRKSYSLYNLLQPRLGHAVWAYETISSL

Query:  FGRMVNKVNLDVATYSSVEVQPFNCMGKTRSIEAMEVEMIFLNRAFKPPEPKDEDEVPHENDAAKPSNARVGPEKDDRGQ
                                   +TR +EA + E  F+ R F+PPEP+D+D    +   A PS  R G +  D G+
Subjt:  FGRMVNKVNLDVATYSSVEVQPFNCMGKTRSIEAMEVEMIFLNRAFKPPEPKDEDEVPHENDAAKPSNARVGPEKDDRGQ

A0A6J1DUW1 uncharacterized protein LOC111024595    2.4e-48    56.86
Query:  MLRKIVFGHLLDLDLVFNKPLIHNILLREIDDSTPDTINFNLFRSKVSFGRREFDIISG-----------------------------LNDFEKIYIIAQ
        M RK VFGHLLD+DLVFN PLIH++LLRE+++STPDTI+FNLF +KVSFGRREFDIISG                             L++ EK+Y    
Subjt:  MLRKIVFGHLLDLDLVFNKPLIHNILLREIDDSTPDTINFNLFRSKVSFGRREFDIISG-----------------------------LNDFEKIYIIAQ

Query:  FEDDFDAVNISIVYLVELVLLRKERALKYDYTLLGIVDDWETFYNHDWSILSFDKTIYSLKRGPTKRSKNGRFRKSYSLYNLLQPRLGHAVWAYETISSL
        FEDDFDAV +SIVY VELVLL           LLGIVDDWE   NHDW++LSF+KTIYSL+RG +K+SK G  RKSYSL+    P +   VWAYETISSL
Subjt:  FEDDFDAVNISIVYLVELVLLRKERALKYDYTLLGIVDDWETFYNHDWSILSFDKTIYSLKRGPTKRSKNGRFRKSYSLYNLLQPRLGHAVWAYETISSL

Query:  FGRM
         GR+
Subjt:  FGRM

A0A6J1DYB1 uncharacterized protein LOC111025666    5.4e-53    66.85
Query:  MLRKIVFGHLLDLDLVFNKPLIHNILLREIDD-STPDTINFNLFRSKVSFGRREFDIISG-----------------------------LNDFEKIYIIA
        M RK VFGHLLDLDLVFN  LIH ILLREI+D STP+TI+FNLF SKV F RREFDIISG                             L+DFEKIYI+ 
Subjt:  MLRKIVFGHLLDLDLVFNKPLIHNILLREIDD-STPDTINFNLFRSKVSFGRREFDIISG-----------------------------LNDFEKIYIIA

Query:  QFEDDFDAVNISIVYLVELVLLRKERALKYDYTLLGIVDDWETFYNHDWSILSFDKTIYSLKRGPTKRSKNGRFRKSYSLY
        +FEDDFDA  ISIVYL+ELVLL +ER LKYDYTLLGIVDD ET  NHDW ++SFDKTIYSLKRGPTKRSK+G FRK YSLY
Subjt:  QFEDDFDAVNISIVYLVELVLLRKERALKYDYTLLGIVDDWETFYNHDWSILSFDKTIYSLKRGPTKRSKNGRFRKSYSLY

A0A6J1E2S4 uncharacterized protein LOC111025501    1.8e-40    65.99
Query:  MLRKIVFGHLLDLDLVFNKPLIHNILLREIDDSTPDTINFNLFRSKVSFGRREFDIISG-----------------------------LNDFEKIYIIAQ
        M RK VFGHLLDLDLVFN PLIHNILLREI+ STPDTI+FNLF +K SFGR EFDIISG                             L++FEKIY+ A+
Subjt:  MLRKIVFGHLLDLDLVFNKPLIHNILLREIDDSTPDTINFNLFRSKVSFGRREFDIISG-----------------------------LNDFEKIYIIAQ

Query:  FEDDFDAVNISIVYLVELVLLRKERALKYDYTLLGIVDDWETFYNHD
        FEDDFDAV ISIVYLVELVLL +ER LKYDYTLLGIVDDWET  NHD
Subjt:  FEDDFDAVNISIVYLVELVLLRKERALKYDYTLLGIVDDWETFYNHD

SwissProt top hits    e value    % identity    Alignment
No hits found
Arabidopsis top hits    e value    % identity    Alignment
No hits found

Sequences
CDS sequence
ATGCTTAGGAAAATTGTATTCGGCCACCTCCTTGACTTGGACCTCGTATTTAACAAGCCATTGATACACAACATCCTACTTAGGGAGATCGATGATAGTACACCTGACAC
CATTAACTTCAACCTGTTTAGGAGTAAGGTGTCATTTGGGCGGAGGGAGTTTGACATTATTTCTGGTCTTAATGATTTTGAGAAGATTTATATAATCGCACAGTTCGAGG
ATGACTTCGACGCGGTCAACATATCTATTGTGTACTTAGTAGAGTTAGTTCTGTTGAGGAAAGAGAGGGCCCTAAAGTACGACTATACCTTGCTGGGAATAGTCGATGAT
TGGGAAACTTTCTACAACCACGATTGGAGCATACTGTCCTTTGATAAGACTATATATAGTCTGAAGCGTGGCCCGACAAAGAGGTCGAAGAATGGCCGGTTCAGGAAATC
ATACAGTCTCTACAATTTGCTACAACCACGATTGGGGCATGCTGTGTGGGCGTACGAGACTATATCTTCCCTATTTGGGCGTATGGTCAATAAGGTAAATCTGGACGTCG
CCACGTATTCTTCGGTGGAGGTGCAGCCATTCAACTGCATGGGAAAAACTCGATCAATAGAAGCAATGGAGGTTGAGATGATCTTCCTAAATAGGGCGTTTAAACCACCC
GAGCCCAAAGATGAGGACGAAGTCCCGCATGAGAACGATGCTGCTAAACCATCAAATGCACGTGTAGGACCAGAGAAGGATGACAGGGGACAAGCAGCGAACATCGATGA
AGGCGTCAGAAAAGACATTCATGTGGAGGTTGAGGAGAAAGAGGAGAATGGAAGTGGTAAAAAGGTACACATGTCCAGTGTGCGTCTGAGAAAAGTTGAGAAACGCTTGA
AGCGCATGGACGACAGGATGAAAGGTATTGAGGCTGAATTAAAATCCATCCAGAAATTTTTGCAAAGAATCGCTAACGGTTTACCTGCCAACCCGAATGACATGAGAAGA
GGCAGCAACAGTGATGGTACTGGGCCAGGAAATGATCCTAGTGATGCACCCAGTGGCGGACCCAGTGATGGACCTAGTGGCAGACCCAGTGATGGACCCAGTGGCGCCGA
TGGTGGTCGGGGTGATGGTCCTAAGGGTGGCGATGGACAGAAGCACCTCATTCCAGTCAAACTGCTGAAGACCAGGCTGAACAAGACATTCCTGGATTTTAGACACCAGA
ACGCGAGGTATGTGTTCGTAATCCATCCGTTAAAATTCGATCGTAACCATGGTCTGACGAAGGTTCATGTGACCTGTTTGAAGCAGCATGCACCTGCCTTATCGCATAAG
GAAGACATGGGTACAGAGGATGTGCATAAGGAGAGTATGGAAACCGGTCTAAATGCGCGTTGCGAGGCTGTCTCTCTCGAGGAGACTCCTGTTCAGAGCAAACCGATGGA
CCATATTATGATCGATTCATTGCGGTTAGAGTCATCCATGGATGACGAGGTGTTGGCGTACGAGACTATATCTTTCCTATCTGGGCGTATGGTCAATAAGGTAAATCCAG
ACGCCTCCACGTATTTTTCAGTGGAGGGAAGAACTCGATCAATAGAAGCAACAGAGGTTGAGACGATCTTCCTAAATAGGGCGTTTGAACCACCCGGATCTGAAGATGAG
GACGAAGTCCCGCATGAGAATGATGCTGCTGGACCATCAAGTGCACATGTAGGACCAGAGAAAGATGACAGGGGACAAGCAGCGAACGTCGATGAAGGCGTCAGAAAAGA
CATTCATGTAGAAGTTGAGGAGAAAGAGGATAATGGAAGTGGTAAGAAGGTACGTATGTCCAATGTGCGTCTGAGAAAAGTTGAGAAACGTTTGAAGTGCATGGATGACA
GAATCAGAGACATGAGAAGAGGCAGCAGCGGTGATGGTACTGGGCCAGGAGATGGTCCTAGTGATGCACCTAGTGGCGGACCCAGAAACAGACCCAGTGATGGACCTAGT
GGCGTCATGGTGGTCGGGGTGATGGTCCTAAGGGTGGCAATGGTTCTACACCAATGCATGACGACCCTGAACGCAGTACTGCGAGAGTTGCAGCAGAATACGATTGGGCT
AGCAAGTGACGGTGAGATCATCGTATGGGATTCAATGAGGTCGATGACATTGTTGCTAGCTTTAGAGTTCGAGTTGAGGCCGATGGCCGTTGTCCTACCGGCATTGATGC
ATAGGGCCGGTGTTCAGGTATTTCAGGCAATTGCATTTCCACTGCCCGTCCATGTTGCAGTGGAAACATTTTCCTTTGTCTGCAACATTGGCTTTGCATTTCTTGGTAGC
GGCAGCAGTAGAGTCAGGTTTGGGCCCCTTACCAACAGTTTTCTCCTTCTTGAAAGTCTTACTTCCAGAATAAGAGGGCGTAGACTTAGTTCCAGAGGTCGAACCTCGGT
GGAACCTTTTGGAGGTGGAAACTCTTCAGAAGAGATTCCAGAATATAATGAAGCTGACTTGACTCGGCTCGTCTATGACGACCTCGTTCGTCTCTGCCACATTGAAGTGG
ACCATCAGCAAGCACATCAGATATGTTCTCATGCTCGTCAAGATGTAGACCTTGGCCTTGTCGTTGGCCTTGATCCACCAATCAGGAGCTTGAGGACATTCCTCTTGCAC
GACAAACTTAAGATCATTTATTAA
mRNA sequence
ATGCTTAGGAAAATTGTATTCGGCCACCTCCTTGACTTGGACCTCGTATTTAACAAGCCATTGATACACAACATCCTACTTAGGGAGATCGATGATAGTACACCTGACAC
CATTAACTTCAACCTGTTTAGGAGTAAGGTGTCATTTGGGCGGAGGGAGTTTGACATTATTTCTGGTCTTAATGATTTTGAGAAGATTTATATAATCGCACAGTTCGAGG
ATGACTTCGACGCGGTCAACATATCTATTGTGTACTTAGTAGAGTTAGTTCTGTTGAGGAAAGAGAGGGCCCTAAAGTACGACTATACCTTGCTGGGAATAGTCGATGAT
TGGGAAACTTTCTACAACCACGATTGGAGCATACTGTCCTTTGATAAGACTATATATAGTCTGAAGCGTGGCCCGACAAAGAGGTCGAAGAATGGCCGGTTCAGGAAATC
ATACAGTCTCTACAATTTGCTACAACCACGATTGGGGCATGCTGTGTGGGCGTACGAGACTATATCTTCCCTATTTGGGCGTATGGTCAATAAGGTAAATCTGGACGTCG
CCACGTATTCTTCGGTGGAGGTGCAGCCATTCAACTGCATGGGAAAAACTCGATCAATAGAAGCAATGGAGGTTGAGATGATCTTCCTAAATAGGGCGTTTAAACCACCC
GAGCCCAAAGATGAGGACGAAGTCCCGCATGAGAACGATGCTGCTAAACCATCAAATGCACGTGTAGGACCAGAGAAGGATGACAGGGGACAAGCAGCGAACATCGATGA
AGGCGTCAGAAAAGACATTCATGTGGAGGTTGAGGAGAAAGAGGAGAATGGAAGTGGTAAAAAGGTACACATGTCCAGTGTGCGTCTGAGAAAAGTTGAGAAACGCTTGA
AGCGCATGGACGACAGGATGAAAGGTATTGAGGCTGAATTAAAATCCATCCAGAAATTTTTGCAAAGAATCGCTAACGGTTTACCTGCCAACCCGAATGACATGAGAAGA
GGCAGCAACAGTGATGGTACTGGGCCAGGAAATGATCCTAGTGATGCACCCAGTGGCGGACCCAGTGATGGACCTAGTGGCAGACCCAGTGATGGACCCAGTGGCGCCGA
TGGTGGTCGGGGTGATGGTCCTAAGGGTGGCGATGGACAGAAGCACCTCATTCCAGTCAAACTGCTGAAGACCAGGCTGAACAAGACATTCCTGGATTTTAGACACCAGA
ACGCGAGGTATGTGTTCGTAATCCATCCGTTAAAATTCGATCGTAACCATGGTCTGACGAAGGTTCATGTGACCTGTTTGAAGCAGCATGCACCTGCCTTATCGCATAAG
GAAGACATGGGTACAGAGGATGTGCATAAGGAGAGTATGGAAACCGGTCTAAATGCGCGTTGCGAGGCTGTCTCTCTCGAGGAGACTCCTGTTCAGAGCAAACCGATGGA
CCATATTATGATCGATTCATTGCGGTTAGAGTCATCCATGGATGACGAGGTGTTGGCGTACGAGACTATATCTTTCCTATCTGGGCGTATGGTCAATAAGGTAAATCCAG
ACGCCTCCACGTATTTTTCAGTGGAGGGAAGAACTCGATCAATAGAAGCAACAGAGGTTGAGACGATCTTCCTAAATAGGGCGTTTGAACCACCCGGATCTGAAGATGAG
GACGAAGTCCCGCATGAGAATGATGCTGCTGGACCATCAAGTGCACATGTAGGACCAGAGAAAGATGACAGGGGACAAGCAGCGAACGTCGATGAAGGCGTCAGAAAAGA
CATTCATGTAGAAGTTGAGGAGAAAGAGGATAATGGAAGTGGTAAGAAGGTACGTATGTCCAATGTGCGTCTGAGAAAAGTTGAGAAACGTTTGAAGTGCATGGATGACA
GAATCAGAGACATGAGAAGAGGCAGCAGCGGTGATGGTACTGGGCCAGGAGATGGTCCTAGTGATGCACCTAGTGGCGGACCCAGAAACAGACCCAGTGATGGACCTAGT
GGCGTCATGGTGGTCGGGGTGATGGTCCTAAGGGTGGCAATGGTTCTACACCAATGCATGACGACCCTGAACGCAGTACTGCGAGAGTTGCAGCAGAATACGATTGGGCT
AGCAAGTGACGGTGAGATCATCGTATGGGATTCAATGAGGTCGATGACATTGTTGCTAGCTTTAGAGTTCGAGTTGAGGCCGATGGCCGTTGTCCTACCGGCATTGATGC
ATAGGGCCGGTGTTCAGGTATTTCAGGCAATTGCATTTCCACTGCCCGTCCATGTTGCAGTGGAAACATTTTCCTTTGTCTGCAACATTGGCTTTGCATTTCTTGGTAGC
GGCAGCAGTAGAGTCAGGTTTGGGCCCCTTACCAACAGTTTTCTCCTTCTTGAAAGTCTTACTTCCAGAATAAGAGGGCGTAGACTTAGTTCCAGAGGTCGAACCTCGGT
GGAACCTTTTGGAGGTGGAAACTCTTCAGAAGAGATTCCAGAATATAATGAAGCTGACTTGACTCGGCTCGTCTATGACGACCTCGTTCGTCTCTGCCACATTGAAGTGG
ACCATCAGCAAGCACATCAGATATGTTCTCATGCTCGTCAAGATGTAGACCTTGGCCTTGTCGTTGGCCTTGATCCACCAATCAGGAGCTTGAGGACATTCCTCTTGCAC
GACAAACTTAAGATCATTTATTAA
Protein sequence
MLRKIVFGHLLDLDLVFNKPLIHNILLREIDDSTPDTINFNLFRSKVSFGRREFDIISGLNDFEKIYIIAQFEDDFDAVNISIVYLVELVLLRKERALKYDYTLLGIVDD
WETFYNHDWSILSFDKTIYSLKRGPTKRSKNGRFRKSYSLYNLLQPRLGHAVWAYETISSLFGRMVNKVNLDVATYSSVEVQPFNCMGKTRSIEAMEVEMIFLNRAFKPP
EPKDEDEVPHENDAAKPSNARVGPEKDDRGQAANIDEGVRKDIHVEVEEKEENGSGKKVHMSSVRLRKVEKRLKRMDDRMKGIEAELKSIQKFLQRIANGLPANPNDMRR
GSNSDGTGPGNDPSDAPSGGPSDGPSGRPSDGPSGADGGRGDGPKGGDGQKHLIPVKLLKTRLNKTFLDFRHQNARYVFVIHPLKFDRNHGLTKVHVTCLKQHAPALSHK
EDMGTEDVHKESMETGLNARCEAVSLEETPVQSKPMDHIMIDSLRLESSMDDEVLAYETISFLSGRMVNKVNPDASTYFSVEGRTRSIEATEVETIFLNRAFEPPGSEDE
DEVPHENDAAGPSSAHVGPEKDDRGQAANVDEGVRKDIHVEVEEKEDNGSGKKVRMSNVRLRKVEKRLKCMDDRIRDMRRGSSGDGTGPGDGPSDAPSGGPRNRPSDGPS
GVMVVGVMVLRVAMVLHQCMTTLNAVLRELQQNTIGLASDGEIIVWDSMRSMTLLLALEFELRPMAVVLPALMHRAGVQVFQAIAFPLPVHVAVETFSFVCNIGFAFLGS
GSSRVRFGPLTNSFLLLESLTSRIRGRRLSSRGRTSVEPFGGGNSSEEIPEYNEADLTRLVYDDLVRLCHIEVDHQQAHQICSHARQDVDLGLVVGLDPPIRSLRTFLLH
DKLKIIY