; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g29400 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g29400
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUlp1-like peptidase
Genome locationchr6:22154706..22157911
RNA-Seq ExpressionMoc06g29400
SyntenyMoc06g29400
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022146372.1 uncharacterized protein LOC111015600 [Momordica charantia]1.9e-5850.98Show/hide
Query:  MDLRLILDRNDWFPATLTNLAHVDKTTTRIKARLTPTQLYMFRQTRFGPILDIDVLFNGPLIHHLLLREVEEPRQDVISFDLFGKRVSF-----------
        MDLRLILDRNDWFPATLTNLAHVDKTTTRIKARLTPTQL MFRQT FGPILD+ V+FNGPLIHHLLL EVEEPRQDVISFDLF KRVSF           
Subjt:  MDLRLILDRNDWFPATLTNLAHVDKTTTRIKARLTPTQLYMFRQTRFGPILDIDVLFNGPLIHHLLLREVEEPRQDVISFDLFGKRVSF-----------

Query:  ----------------------DSVRVKCSELEKIFLEDVFYVDEDVVK---------------------------------------------------
                              DSVRVKCSELEKIFLED+FY DEDVVK                                                   
Subjt:  ----------------------DSVRVKCSELEKIFLEDVFYVDEDVVK---------------------------------------------------

Query:  ------------------------------------------------SKVKEHLLATDAEEQHMVRVILPPEVCVIPDPPTVPDRAVVPDPPASPERAA
                                                        SKVKEHLLATDAEEQHMVRVILPPEV VIPDPP VPDRAVVPD      RA 
Subjt:  ------------------------------------------------SKVKEHLLATDAEEQHMVRVILPPEVCVIPDPPTVPDRAVVPDPPASPERAA

Query:  VPDPPA
        VPDPPA
Subjt:  VPDPPA

XP_022153201.1 uncharacterized protein LOC111020757 [Momordica charantia]9.3e-13861.57Show/hide
Query:  MDLRLILDRNDWFPATLTNLAHVDKTTTRIKARLTPTQLYMFRQTRFGPILDIDVLFNGPLIHHLLLREVEEPRQDVISFDLFGKRVSF-----------
        MDLRLI+DRNDWFPATLTNLAH+DKT+TRIKARLTPTQL MFRQT FGPILDIDV+FNGPLIHHLLLREVEEPRQDVISFDLFGKRVSF           
Subjt:  MDLRLILDRNDWFPATLTNLAHVDKTTTRIKARLTPTQLYMFRQTRFGPILDIDVLFNGPLIHHLLLREVEEPRQDVISFDLFGKRVSF-----------

Query:  ----------------------DSVRVKCSELEKIFLEDVFYVDEDVVK---------------------------------------------------
                              D VRVKCSELEKIFLEDVFY DEDVVK                                                   
Subjt:  ----------------------DSVRVKCSELEKIFLEDVFYVDEDVVK---------------------------------------------------

Query:  ----------------------------------------------------------------------------SKVKEHLLATDAEEQHMVRVILPP
                                                                                    SKVKEHLLATDA+EQHMVRVILPP
Subjt:  ----------------------------------------------------------------------------SKVKEHLLATDAEEQHMVRVILPP

Query:  EVCVIPDPPTVPDRAVVPDPPASPERAAVPDPPADVEMGPLEDPVVDSHVVDEARPSANDGEGLEKRSKKNKFKKRISRRLKRLDNRVGAIEDTLGDFGV
        EV VIPDPP VPDRAVVPDPPASPERAAVPDPPADVEMGPLEDPVVD+H VDEARPSANDGEGLEKR KKNKFKKRISRRLKRLDN VGAIED LGDFGV
Subjt:  EVCVIPDPPTVPDRAVVPDPPASPERAAVPDPPADVEMGPLEDPVVDSHVVDEARPSANDGEGLEKRSKKNKFKKRISRRLKRLDNRVGAIEDTLGDFGV

Query:  ALKGIQRYLKKLAKGKFPDPSKYFGGGGGPDDDGPSDQRPDESLKPDEGRKSMDEDQRPDEDQRTDENLETEKEPTSGHGPNSI
        ALKGIQ YLKKLAKGKFPD SKYFGGGGGPDDDGPSDQRPDES KPD GRKSMDEDQR DEDQRTDE+LETEKEPTSGHG +++
Subjt:  ALKGIQRYLKKLAKGKFPDPSKYFGGGGGPDDDGPSDQRPDESLKPDEGRKSMDEDQRPDEDQRTDENLETEKEPTSGHGPNSI

XP_022155476.1 uncharacterized protein LOC111022607 [Momordica charantia]1.0e-10776.01Show/hide
Query:  SGHGPNSIDEDPKRRDDDPMITEEDDGMITDGDEDPNQDITIGRPPDGSEVDHADYHGPQVVVIQDLTVGRQEPDAQPDTQPTRRRVRRPYKDWAPDAIV
        SGHGPNS+DEDPKRRD+DPMI EEDDGMITDGDEDPNQDITIGR PDGSEVDH D H PQV VIQDLTVGRQEPDAQPDTQPTRRRVRRPYKDWAPDAIV
Subjt:  SGHGPNSIDEDPKRRDDDPMITEEDDGMITDGDEDPNQDITIGRPPDGSEVDHADYHGPQVVVIQDLTVGRQEPDAQPDTQPTRRRVRRPYKDWAPDAIV

Query:  K-------DETDLQHTPTGRGLRKRHYSWTVSSRHDP--------------------------------DTDGRTRSTAAGLQGKEWYRDLLDPTIQLND
        K       DETDLQH PTGRGLRK HYSW +   + P                                D DGRTRSTAAGLQGKEWYRDLLDPT+QL D
Subjt:  K-------DETDLQHTPTGRGLRKRHYSWTVSSRHDP--------------------------------DTDGRTRSTAAGLQGKEWYRDLLDPTIQLND

Query:  EVVDALVLFTAKKLEKCLHLCRKKFAIGDVLLSTLLNRTDGPYAAMKPGVLSMRIEYPWSQENTIFRFMFG
        EVVDALVLFTAKKLEKC++LCRKKFAIGDVLLSTLLNRTDGPYAAMKPGVLS RIEYP SQENTIFR++FG
Subjt:  EVVDALVLFTAKKLEKCLHLCRKKFAIGDVLLSTLLNRTDGPYAAMKPGVLSMRIEYPWSQENTIFRFMFG

XP_022157016.1 uncharacterized protein LOC111023842 [Momordica charantia]1.4e-4570.75Show/hide
Query:  LRLILDRNDWFPATLTNLAHVDKTTTRIKARLTPTQLYMFRQTRFGPILDIDVLFNGPLIHHLLLREVEEPRQDVISFDLFGKRVSF-------------
        LRLI+DRNDWFP TLTN AHVDKT+TRIKARLTPTQL MFRQT FGPILD+DV+FNGPLIHHLLLREVEEPRQDVISFDLFGKRVSF             
Subjt:  LRLILDRNDWFPATLTNLAHVDKTTTRIKARLTPTQLYMFRQTRFGPILDIDVLFNGPLIHHLLLREVEEPRQDVISFDLFGKRVSF-------------

Query:  --------------------DSVRVKCSELEKIFLEDVFYVDEDVVK
                            DSV+VKCSELEKIFLEDVFY DEDVVK
Subjt:  --------------------DSVRVKCSELEKIFLEDVFYVDEDVVK

XP_022158807.1 uncharacterized protein LOC111025273 [Momordica charantia]3.6e-7356.05Show/hide
Query:  DPDTDGRTRSTAAGLQGKEWYRDLLDPTIQLNDEVVDALVLFTAKKLEKCLHLCRKKFAIGDVLLSTLLNRTDGPYAAMKPGVLSMRIEYPWSQENTIFR
        DP TD  +RST+ G++ K W+  LLDP  QL+DE +D+L++ TA+K+EKC HL R +FAIGDVLLS LL RTDGPYAAMKPGVL  +  Y W QE TIFR
Subjt:  DPDTDGRTRSTAAGLQGKEWYRDLLDPTIQLNDEVVDALVLFTAKKLEKCLHLCRKKFAIGDVLLSTLLNRTDGPYAAMKPGVLSMRIEYPWSQENTIFR

Query:  FMF------------------------------------GDLTVWDSLQSATPLDSLENELKPICMILPAVLHHGGIFASRPDLPVVPWRVHRVRTPQQS
        ++                                     GDLTVWDSLQ+ TPL+ LE  LKP+C I+PA+LH  GI A RP+LP+VPWRV R   PQQ+
Subjt:  FMF------------------------------------GDLTVWDSLQSATPLDSLENELKPICMILPAVLHHGGIFASRPDLPVVPWRVHRVRTPQQS

Query:  NATNCGIFCVRFFEYDVTGSKLDTLTQDNIVFFRRQYAVQMWARRPIF
          T+C IFCVRFFEYDV GSK+DTL Q NI  FRRQYAVQMWARRP F
Subjt:  NATNCGIFCVRFFEYDVTGSKLDTLTQDNIVFFRRQYAVQMWARRPIF

TrEMBL top hitse value%identityAlignment
A0A6J1CZE8 uncharacterized protein LOC1110156009.3e-5950.98Show/hide
Query:  MDLRLILDRNDWFPATLTNLAHVDKTTTRIKARLTPTQLYMFRQTRFGPILDIDVLFNGPLIHHLLLREVEEPRQDVISFDLFGKRVSF-----------
        MDLRLILDRNDWFPATLTNLAHVDKTTTRIKARLTPTQL MFRQT FGPILD+ V+FNGPLIHHLLL EVEEPRQDVISFDLF KRVSF           
Subjt:  MDLRLILDRNDWFPATLTNLAHVDKTTTRIKARLTPTQLYMFRQTRFGPILDIDVLFNGPLIHHLLLREVEEPRQDVISFDLFGKRVSF-----------

Query:  ----------------------DSVRVKCSELEKIFLEDVFYVDEDVVK---------------------------------------------------
                              DSVRVKCSELEKIFLED+FY DEDVVK                                                   
Subjt:  ----------------------DSVRVKCSELEKIFLEDVFYVDEDVVK---------------------------------------------------

Query:  ------------------------------------------------SKVKEHLLATDAEEQHMVRVILPPEVCVIPDPPTVPDRAVVPDPPASPERAA
                                                        SKVKEHLLATDAEEQHMVRVILPPEV VIPDPP VPDRAVVPD      RA 
Subjt:  ------------------------------------------------SKVKEHLLATDAEEQHMVRVILPPEVCVIPDPPTVPDRAVVPDPPASPERAA

Query:  VPDPPA
        VPDPPA
Subjt:  VPDPPA

A0A6J1DJX9 uncharacterized protein LOC1110207574.5e-13861.57Show/hide
Query:  MDLRLILDRNDWFPATLTNLAHVDKTTTRIKARLTPTQLYMFRQTRFGPILDIDVLFNGPLIHHLLLREVEEPRQDVISFDLFGKRVSF-----------
        MDLRLI+DRNDWFPATLTNLAH+DKT+TRIKARLTPTQL MFRQT FGPILDIDV+FNGPLIHHLLLREVEEPRQDVISFDLFGKRVSF           
Subjt:  MDLRLILDRNDWFPATLTNLAHVDKTTTRIKARLTPTQLYMFRQTRFGPILDIDVLFNGPLIHHLLLREVEEPRQDVISFDLFGKRVSF-----------

Query:  ----------------------DSVRVKCSELEKIFLEDVFYVDEDVVK---------------------------------------------------
                              D VRVKCSELEKIFLEDVFY DEDVVK                                                   
Subjt:  ----------------------DSVRVKCSELEKIFLEDVFYVDEDVVK---------------------------------------------------

Query:  ----------------------------------------------------------------------------SKVKEHLLATDAEEQHMVRVILPP
                                                                                    SKVKEHLLATDA+EQHMVRVILPP
Subjt:  ----------------------------------------------------------------------------SKVKEHLLATDAEEQHMVRVILPP

Query:  EVCVIPDPPTVPDRAVVPDPPASPERAAVPDPPADVEMGPLEDPVVDSHVVDEARPSANDGEGLEKRSKKNKFKKRISRRLKRLDNRVGAIEDTLGDFGV
        EV VIPDPP VPDRAVVPDPPASPERAAVPDPPADVEMGPLEDPVVD+H VDEARPSANDGEGLEKR KKNKFKKRISRRLKRLDN VGAIED LGDFGV
Subjt:  EVCVIPDPPTVPDRAVVPDPPASPERAAVPDPPADVEMGPLEDPVVDSHVVDEARPSANDGEGLEKRSKKNKFKKRISRRLKRLDNRVGAIEDTLGDFGV

Query:  ALKGIQRYLKKLAKGKFPDPSKYFGGGGGPDDDGPSDQRPDESLKPDEGRKSMDEDQRPDEDQRTDENLETEKEPTSGHGPNSI
        ALKGIQ YLKKLAKGKFPD SKYFGGGGGPDDDGPSDQRPDES KPD GRKSMDEDQR DEDQRTDE+LETEKEPTSGHG +++
Subjt:  ALKGIQRYLKKLAKGKFPDPSKYFGGGGGPDDDGPSDQRPDESLKPDEGRKSMDEDQRPDEDQRTDENLETEKEPTSGHGPNSI

A0A6J1DRS0 uncharacterized protein LOC1110226074.8e-10876.01Show/hide
Query:  SGHGPNSIDEDPKRRDDDPMITEEDDGMITDGDEDPNQDITIGRPPDGSEVDHADYHGPQVVVIQDLTVGRQEPDAQPDTQPTRRRVRRPYKDWAPDAIV
        SGHGPNS+DEDPKRRD+DPMI EEDDGMITDGDEDPNQDITIGR PDGSEVDH D H PQV VIQDLTVGRQEPDAQPDTQPTRRRVRRPYKDWAPDAIV
Subjt:  SGHGPNSIDEDPKRRDDDPMITEEDDGMITDGDEDPNQDITIGRPPDGSEVDHADYHGPQVVVIQDLTVGRQEPDAQPDTQPTRRRVRRPYKDWAPDAIV

Query:  K-------DETDLQHTPTGRGLRKRHYSWTVSSRHDP--------------------------------DTDGRTRSTAAGLQGKEWYRDLLDPTIQLND
        K       DETDLQH PTGRGLRK HYSW +   + P                                D DGRTRSTAAGLQGKEWYRDLLDPT+QL D
Subjt:  K-------DETDLQHTPTGRGLRKRHYSWTVSSRHDP--------------------------------DTDGRTRSTAAGLQGKEWYRDLLDPTIQLND

Query:  EVVDALVLFTAKKLEKCLHLCRKKFAIGDVLLSTLLNRTDGPYAAMKPGVLSMRIEYPWSQENTIFRFMFG
        EVVDALVLFTAKKLEKC++LCRKKFAIGDVLLSTLLNRTDGPYAAMKPGVLS RIEYP SQENTIFR++FG
Subjt:  EVVDALVLFTAKKLEKCLHLCRKKFAIGDVLLSTLLNRTDGPYAAMKPGVLSMRIEYPWSQENTIFRFMFG

A0A6J1DWQ3 uncharacterized protein LOC1110238426.9e-4670.75Show/hide
Query:  LRLILDRNDWFPATLTNLAHVDKTTTRIKARLTPTQLYMFRQTRFGPILDIDVLFNGPLIHHLLLREVEEPRQDVISFDLFGKRVSF-------------
        LRLI+DRNDWFP TLTN AHVDKT+TRIKARLTPTQL MFRQT FGPILD+DV+FNGPLIHHLLLREVEEPRQDVISFDLFGKRVSF             
Subjt:  LRLILDRNDWFPATLTNLAHVDKTTTRIKARLTPTQLYMFRQTRFGPILDIDVLFNGPLIHHLLLREVEEPRQDVISFDLFGKRVSF-------------

Query:  --------------------DSVRVKCSELEKIFLEDVFYVDEDVVK
                            DSV+VKCSELEKIFLEDVFY DEDVVK
Subjt:  --------------------DSVRVKCSELEKIFLEDVFYVDEDVVK

A0A6J1DY60 uncharacterized protein LOC1110252731.7e-7356.05Show/hide
Query:  DPDTDGRTRSTAAGLQGKEWYRDLLDPTIQLNDEVVDALVLFTAKKLEKCLHLCRKKFAIGDVLLSTLLNRTDGPYAAMKPGVLSMRIEYPWSQENTIFR
        DP TD  +RST+ G++ K W+  LLDP  QL+DE +D+L++ TA+K+EKC HL R +FAIGDVLLS LL RTDGPYAAMKPGVL  +  Y W QE TIFR
Subjt:  DPDTDGRTRSTAAGLQGKEWYRDLLDPTIQLNDEVVDALVLFTAKKLEKCLHLCRKKFAIGDVLLSTLLNRTDGPYAAMKPGVLSMRIEYPWSQENTIFR

Query:  FMF------------------------------------GDLTVWDSLQSATPLDSLENELKPICMILPAVLHHGGIFASRPDLPVVPWRVHRVRTPQQS
        ++                                     GDLTVWDSLQ+ TPL+ LE  LKP+C I+PA+LH  GI A RP+LP+VPWRV R   PQQ+
Subjt:  FMF------------------------------------GDLTVWDSLQSATPLDSLENELKPICMILPAVLHHGGIFASRPDLPVVPWRVHRVRTPQQS

Query:  NATNCGIFCVRFFEYDVTGSKLDTLTQDNIVFFRRQYAVQMWARRPIF
          T+C IFCVRFFEYDV GSK+DTL Q NI  FRRQYAVQMWARRP F
Subjt:  NATNCGIFCVRFFEYDVTGSKLDTLTQDNIVFFRRQYAVQMWARRPIF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGTCGATCCCGGACACGTCTCGATCCTATTCGGATCCGAGATGAGATGTAGATTGAATCCAAGATTTAATCTGCCTCTCATCTCTGACCCGAACAGGACCAGGAC
GTGTCCAGGATCGACCTTCATCCCGGACCGATCGGCCCGGGACAAGTCCGAGATGAGAGGCAGATTTGAGTTGTCCCATTCTGCCTTTCATCTTGGACTTGTCCCGGTCC
GAATGGATTTGAGACTCATCTTAGATCGTAATGACTGGTTTCCGGCCACGTTGACGAACCTTGCCCATGTTGATAAAACCACTACTAGGATTAAGGCCAGGTTAACCCCA
ACCCAGTTATACATGTTTAGGCAAACACGTTTCGGTCCTATTTTGGACATAGACGTTCTTTTCAACGGTCCATTGATCCATCACCTGTTGTTGAGAGAGGTTGAAGAGCC
TAGACAGGACGTCATTAGCTTTGACTTGTTTGGGAAGAGGGTGTCTTTTGACAGTGTTAGGGTTAAGTGTAGTGAGCTGGAGAAGATTTTTTTGGAGGACGTTTTTTACG
TCGACGAGGATGTTGTGAAGTCCAAGGTTAAGGAACACTTGTTGGCGACGGATGCTGAGGAACAACACATGGTTCGTGTCATTCTTCCACCAGAAGTCTGTGTTATACCT
GATCCGCCTACTGTACCTGATCGGGCTGTTGTACCTGATCCACCTGCTTCACCTGAACGGGCAGCTGTACCTGATCCGCCCGCTGATGTGGAAATGGGTCCTCTAGAGGA
TCCAGTAGTAGATTCACATGTGGTAGACGAGGCTAGACCCAGTGCAAACGACGGTGAAGGGTTAGAGAAGAGGTCGAAGAAGAATAAATTCAAGAAGAGGATCAGCAGAC
GGTTGAAGAGGCTGGATAACCGTGTCGGTGCTATCGAGGACACACTGGGTGACTTTGGAGTCGCCCTGAAAGGTATTCAGAGATACCTAAAGAAACTGGCGAAGGGTAAA
TTCCCTGATCCGAGCAAGTATTTCGGAGGTGGGGGTGGGCCCGATGATGACGGTCCATCGGATCAAAGGCCTGATGAGTCCCTGAAGCCAGATGAAGGTCGGAAGAGTAT
GGACGAGGACCAGAGGCCTGATGAGGACCAGAGGACGGATGAAAACCTGGAGACTGAAAAGGAGCCGACGTCGGGACATGGTCCGAATAGTATCGACGAGGATCCGAAGA
GAAGGGACGATGATCCAATGATAACGGAGGAGGATGATGGTATGATAACGGATGGGGACGAGGATCCAAATCAGGACATTACGATCGGGAGACCGCCTGATGGCTCGGAA
GTAGATCATGCAGATTACCATGGACCTCAGGTGGTCGTAATTCAGGATCTCACCGTTGGCAGGCAAGAGCCGGATGCTCAGCCAGATACACAGCCCACGAGACGACGAGT
TAGGCGTCCATATAAGGACTGGGCACCCGACGCGATCGTTAAGGATGAAACTGACCTTCAGCATACCCCAACTGGCCGGGGGCTACGCAAGCGCCATTATTCCTGGACGG
TCAGTTCCAGACATGACCCAGACACCGATGGACGAACTCGATCTACTGCAGCTGGCTTACAAGGGAAGGAATGGTATCGCGATCTACTAGACCCTACTATCCAATTGAAT
GACGAGGTAGTTGATGCTCTCGTCCTATTTACGGCCAAAAAGTTGGAGAAGTGTCTACATCTGTGTCGCAAAAAGTTTGCGATAGGCGACGTGCTACTTTCGACTCTACT
GAATCGAACAGACGGTCCATATGCGGCCATGAAACCAGGGGTCCTGTCCATGAGAATCGAATACCCCTGGAGCCAAGAGAATACAATATTTCGATTTATGTTCGGTGACT
TAACCGTATGGGATTCACTCCAATCGGCCACTCCACTAGATTCACTCGAGAATGAGCTGAAGCCCATTTGTATGATCCTACCTGCAGTACTACATCATGGCGGGATATTT
GCATCACGACCGGACCTACCAGTGGTGCCATGGAGGGTGCATCGGGTTCGCACACCCCAACAGAGTAACGCCACAAATTGCGGGATTTTCTGTGTACGCTTCTTCGAGTA
CGATGTTACCGGGTCAAAGCTGGACACTTTGACCCAAGATAATATTGTATTTTTTAGGCGTCAGTACGCTGTACAGATGTGGGCGCGCCGTCCCATTTTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGGTCGATCCCGGACACGTCTCGATCCTATTCGGATCCGAGATGAGATGTAGATTGAATCCAAGATTTAATCTGCCTCTCATCTCTGACCCGAACAGGACCAGGAC
GTGTCCAGGATCGACCTTCATCCCGGACCGATCGGCCCGGGACAAGTCCGAGATGAGAGGCAGATTTGAGTTGTCCCATTCTGCCTTTCATCTTGGACTTGTCCCGGTCC
GAATGGATTTGAGACTCATCTTAGATCGTAATGACTGGTTTCCGGCCACGTTGACGAACCTTGCCCATGTTGATAAAACCACTACTAGGATTAAGGCCAGGTTAACCCCA
ACCCAGTTATACATGTTTAGGCAAACACGTTTCGGTCCTATTTTGGACATAGACGTTCTTTTCAACGGTCCATTGATCCATCACCTGTTGTTGAGAGAGGTTGAAGAGCC
TAGACAGGACGTCATTAGCTTTGACTTGTTTGGGAAGAGGGTGTCTTTTGACAGTGTTAGGGTTAAGTGTAGTGAGCTGGAGAAGATTTTTTTGGAGGACGTTTTTTACG
TCGACGAGGATGTTGTGAAGTCCAAGGTTAAGGAACACTTGTTGGCGACGGATGCTGAGGAACAACACATGGTTCGTGTCATTCTTCCACCAGAAGTCTGTGTTATACCT
GATCCGCCTACTGTACCTGATCGGGCTGTTGTACCTGATCCACCTGCTTCACCTGAACGGGCAGCTGTACCTGATCCGCCCGCTGATGTGGAAATGGGTCCTCTAGAGGA
TCCAGTAGTAGATTCACATGTGGTAGACGAGGCTAGACCCAGTGCAAACGACGGTGAAGGGTTAGAGAAGAGGTCGAAGAAGAATAAATTCAAGAAGAGGATCAGCAGAC
GGTTGAAGAGGCTGGATAACCGTGTCGGTGCTATCGAGGACACACTGGGTGACTTTGGAGTCGCCCTGAAAGGTATTCAGAGATACCTAAAGAAACTGGCGAAGGGTAAA
TTCCCTGATCCGAGCAAGTATTTCGGAGGTGGGGGTGGGCCCGATGATGACGGTCCATCGGATCAAAGGCCTGATGAGTCCCTGAAGCCAGATGAAGGTCGGAAGAGTAT
GGACGAGGACCAGAGGCCTGATGAGGACCAGAGGACGGATGAAAACCTGGAGACTGAAAAGGAGCCGACGTCGGGACATGGTCCGAATAGTATCGACGAGGATCCGAAGA
GAAGGGACGATGATCCAATGATAACGGAGGAGGATGATGGTATGATAACGGATGGGGACGAGGATCCAAATCAGGACATTACGATCGGGAGACCGCCTGATGGCTCGGAA
GTAGATCATGCAGATTACCATGGACCTCAGGTGGTCGTAATTCAGGATCTCACCGTTGGCAGGCAAGAGCCGGATGCTCAGCCAGATACACAGCCCACGAGACGACGAGT
TAGGCGTCCATATAAGGACTGGGCACCCGACGCGATCGTTAAGGATGAAACTGACCTTCAGCATACCCCAACTGGCCGGGGGCTACGCAAGCGCCATTATTCCTGGACGG
TCAGTTCCAGACATGACCCAGACACCGATGGACGAACTCGATCTACTGCAGCTGGCTTACAAGGGAAGGAATGGTATCGCGATCTACTAGACCCTACTATCCAATTGAAT
GACGAGGTAGTTGATGCTCTCGTCCTATTTACGGCCAAAAAGTTGGAGAAGTGTCTACATCTGTGTCGCAAAAAGTTTGCGATAGGCGACGTGCTACTTTCGACTCTACT
GAATCGAACAGACGGTCCATATGCGGCCATGAAACCAGGGGTCCTGTCCATGAGAATCGAATACCCCTGGAGCCAAGAGAATACAATATTTCGATTTATGTTCGGTGACT
TAACCGTATGGGATTCACTCCAATCGGCCACTCCACTAGATTCACTCGAGAATGAGCTGAAGCCCATTTGTATGATCCTACCTGCAGTACTACATCATGGCGGGATATTT
GCATCACGACCGGACCTACCAGTGGTGCCATGGAGGGTGCATCGGGTTCGCACACCCCAACAGAGTAACGCCACAAATTGCGGGATTTTCTGTGTACGCTTCTTCGAGTA
CGATGTTACCGGGTCAAAGCTGGACACTTTGACCCAAGATAATATTGTATTTTTTAGGCGTCAGTACGCTGTACAGATGTGGGCGCGCCGTCCCATTTTTTGA
Protein sequenceShow/hide protein sequence
MKVDPGHVSILFGSEMRCRLNPRFNLPLISDPNRTRTCPGSTFIPDRSARDKSEMRGRFELSHSAFHLGLVPVRMDLRLILDRNDWFPATLTNLAHVDKTTTRIKARLTP
TQLYMFRQTRFGPILDIDVLFNGPLIHHLLLREVEEPRQDVISFDLFGKRVSFDSVRVKCSELEKIFLEDVFYVDEDVVKSKVKEHLLATDAEEQHMVRVILPPEVCVIP
DPPTVPDRAVVPDPPASPERAAVPDPPADVEMGPLEDPVVDSHVVDEARPSANDGEGLEKRSKKNKFKKRISRRLKRLDNRVGAIEDTLGDFGVALKGIQRYLKKLAKGK
FPDPSKYFGGGGGPDDDGPSDQRPDESLKPDEGRKSMDEDQRPDEDQRTDENLETEKEPTSGHGPNSIDEDPKRRDDDPMITEEDDGMITDGDEDPNQDITIGRPPDGSE
VDHADYHGPQVVVIQDLTVGRQEPDAQPDTQPTRRRVRRPYKDWAPDAIVKDETDLQHTPTGRGLRKRHYSWTVSSRHDPDTDGRTRSTAAGLQGKEWYRDLLDPTIQLN
DEVVDALVLFTAKKLEKCLHLCRKKFAIGDVLLSTLLNRTDGPYAAMKPGVLSMRIEYPWSQENTIFRFMFGDLTVWDSLQSATPLDSLENELKPICMILPAVLHHGGIF
ASRPDLPVVPWRVHRVRTPQQSNATNCGIFCVRFFEYDVTGSKLDTLTQDNIVFFRRQYAVQMWARRPIF