CuGenDBv2

Moc03g04250 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g04250
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Ulp1-like peptidase
Genome location: chr3:3139041..3143069
RNA-Seq Expression: Moc03g04250
Synteny: Moc03g04250
Gene Ontology terms: NA
InterPro domains: NA
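
The genome location field follows a `chromosome:start..end` pattern. The sketch below shows one way to split it into its parts and compute the span length. The function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based inclusive coordinates, which the record itself does not state.

```python
import re

def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end = parse_location("chr3:3139041..3143069")
print(chrom, start, end, span_length(start, end))
# prints: chr3 3139041 3143069 4029
```

Under that assumption the gene spans 4029 bp on chr3.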


Homology
GenBank top hits    e value    % identity    Alignment
XP_022146372.1 uncharacterized protein LOC111015600 [Momordica charantia]    9.5e-51    61.81
Query:  KRFFGGRFYDDEDVVKVGIVYFIELAMMGKKRKQFIDMAMLGVVHWWEALCNYDWSSMVFDRTIWSLKNALKDKLSAYQQKARADPTHVETYSLYGFPYA
        K F    FYDDEDVVKVGIVYFIELAMMGK+RKQFID   +GVV  WEA CN DWSSM+FDRTIWSLKN LKDKLSAYQQKA ADPTHVETYSLYGFPY 
Subjt:  KRFFGGRFYDDEDVVKVGIVYFIELAMMGKKRKQFIDMAMLGVVHWWEALCNYDWSSMVFDRTIWSLKNALKDKLSAYQQKARADPTHVETYSLYGFPYA

Query:  FQVWAYEMISTLSLRVETRLSDEPFHDFSGGRALIRAGFMYWRSNIKEHLVATDAEEQDMVRVILPPEARVIIDPPTVPYWAAVRDPPTDVERGPVEDP
                     +R    L+ E F +              W S +KEHL+ATDAEEQ MVRVILPPE RVI DPP VP  A V D      R  V DP
Subjt:  FQVWAYEMISTLSLRVETRLSDEPFHDFSGGRALIRAGFMYWRSNIKEHLVATDAEEQDMVRVILPPEARVIIDPPTVPYWAAVRDPPTDVERGPVEDP

XP_022153201.1 uncharacterized protein LOC111020757 [Momordica charantia]    1.8e-110    66.1
Query:  KRFFGGRFYDDEDVVKVGIVYFIELAMMGKKRKQFIDMAMLGVVHWWEALCNYDWSSMVFDRTIWSLKNALKDKLSAYQQKARADPTHVETYSLYGFPYA
        K F    FYDDEDVVKV IVYFIELAMMGK+RKQFID A+LGVV  WE  CNYDWSSM+FDRTIWSLKNALKDKLS YQQKA ADP+HVETYSLYGFPYA
Subjt:  KRFFGGRFYDDEDVVKVGIVYFIELAMMGKKRKQFIDMAMLGVVHWWEALCNYDWSSMVFDRTIWSLKNALKDKLSAYQQKARADPTHVETYSLYGFPYA

Query:  FQVWAYEMISTLSLRVETRLSDEPFHDFSGGRALIRAGFMYWRSNIKEHLVATDAEEQDMVRVILPPEARVIIDPPTV------------PYWAAVRDPP
        FQVWAYE ISTLS     RL         G R L    F   RS +KEHL+ATDA+EQ MVRVILPPE RVI DPP V            P  AAV DPP
Subjt:  FQVWAYEMISTLSLRVETRLSDEPFHDFSGGRALIRAGFMYWRSNIKEHLVATDAEEQDMVRVILPPEARVIIDPPTV------------PYWAAVRDPP

Query:  TDVERGPVEDPVVEAPVIDEAGPSANDGEALE-------------QRLKRLDDHVGAIEDTLGDFGVALKGIQ---SVWVQGKFSDPNKYFGGGGGPDDD
         DVE GP+EDPVV+A  +DEA PSANDGE LE             +RLKRLD+ VGAIED LGDFGVALKGIQ       +GKF D +KYFGGGGGPDDD
Subjt:  TDVERGPVEDPVVEAPVIDEAGPSANDGEALE-------------QRLKRLDDHVGAIEDTLGDFGVALKGIQ---SVWVQGKFSDPNKYFGGGGGPDDD

Query:  GPSDQRPNESPKLDGGRKSMEEDQRPDEDPENDEKL------TSGHGPNNM
        GPSDQRP+ESPK DGGRKSM+EDQR DED   DE L      TSGHG + +
Subjt:  GPSDQRPNESPKLDGGRKSMEEDQRPDEDPENDEKL------TSGHGPNNM

XP_022154965.1 uncharacterized protein LOC111022110 [Momordica charantia]    1.5e-40    37.24
Query:  MMGKKRKQFIDMAMLGVVHWWEALCNYDWSSMVFDRTIWSLKNALKDKLSAYQQKARADPTHVETYSLYGFPYAFQVWAYEMISTLSLRVETRLSDEPFH
        MMGK+RKQ +D ++LG+V  WE  C+YD SSM+F+RT+WSLKNALKDK+ AY+QK   D +HVETYSLYGFPYAFQVWAYE ISTLS RV  RL+D+   
Subjt:  MMGKKRKQFIDMAMLGVVHWWEALCNYDWSSMVFDRTIWSLKNALKDKLSAYQQKARADPTHVETYSLYGFPYAFQVWAYEMISTLSLRVETRLSDEPFH

Query:  DF--------SGGRALIRAGFMYWRSNIKEHLVATDAEEQDMVRVILPPEARV-------IIDPPTVPYWAAVRDPPTDVERGPVE--------DPVVEA
                       L R  F   +S +   L ATD E Q M RV+ PP A V       +   P      A + P T      VE         P+V+ 
Subjt:  DF--------SGGRALIRAGFMYWRSNIKEHLVATDAEEQDMVRVILPPEARV-------IIDPPTVPYWAAVRDPPTDVERGPVE--------DPVVEA

Query:  PVIDEAGPSANDGEALEQR----------------LKRLDDHVGAIEDTLGDFGVALKGIQSVW---VQGKFSDPNKYFGGGGGPDDDGPSDQRPNESPK
           D  G      + L Q+                L+RL D V AIE TL      +K I+       +      NKY   GG PD DG S  R      
Subjt:  PVIDEAGPSANDGEALEQR----------------LKRLDDHVGAIEDTLGDFGVALKGIQSVW---VQGKFSDPNKYFGGGGGPDDDGPSDQRPNESPK

Query:  LDGGRKSMEEDQRPDEDPENDEKLTSGHGPNNMDEDPKKKDEDAKKKDDDPMITEDDDGHYDRETTVDHADDHRHRVGVIQDQSDLQHAPID
           GR   EED   DEDP+  ++  +G  P  MDEDPK  +E        P    + D   D   T+         VG  Q+      +P+D
Subjt:  LDGGRKSMEEDQRPDEDPENDEKLTSGHGPNNMDEDPKKKDEDAKKKDDDPMITEDDDGHYDRETTVDHADDHRHRVGVIQDQSDLQHAPID

XP_022155476.1 uncharacterized protein LOC111022607 [Momordica charantia]    1.3e-31    49.45
Query:  SGHGPNNMDEDPKKKDEDAK-KKDDDPMITEDDD--------GHYDRETTVDHADDHRHRVGVI------------------------------------
        SGHGPN++DEDPK++D D    ++DD MIT+ D+        G     + VDH DDH  +V VI                                    
Subjt:  SGHGPNNMDEDPKKKDEDAK-KKDDDPMITEDDD--------GHYDRETTVDHADDHRHRVGVI------------------------------------

Query:  -------QDQSDLQHAPIDRRLRKRHYSWKLKGIYIPTGRRRITVDAYDPACPIPLQLDYQFQTWMDDPDTDGRSQSTATGL
               QD++DLQHAP  R LRK HYSWKLKGIY PTGRRRITVDAYDPACPIP QLD QFQTWMDD D DGR++STA GL
Subjt:  -------QDQSDLQHAPIDRRLRKRHYSWKLKGIYIPTGRRRITVDAYDPACPIPLQLDYQFQTWMDDPDTDGRSQSTATGL

XP_022157020.1 uncharacterized protein LOC111023847 [Momordica charantia]    8.9e-41    55.49
Query:  KRFFGGRFYDDEDVVKVGIVYFIELAMMGKKRKQFIDMAMLGVVHWWEALCNYDWSSMVFDRTIWSLKNALKDKLSAYQQKARADPTHVETYSLYGFPYA
        K F    F +DED VK+ IVYFIELAMMGK+RK  +D ++LG+V  WE  CNYDWSSM+F+RT+WSLKNALKDK+  Y+QK   D +HVETYSLY FPYA
Subjt:  KRFFGGRFYDDEDVVKVGIVYFIELAMMGKKRKQFIDMAMLGVVHWWEALCNYDWSSMVFDRTIWSLKNALKDKLSAYQQKARADPTHVETYSLYGFPYA

Query:  FQVWAYEMISTLSLRVETRLSDEPFHDFSGGRALIRAGFMYWRS-NIKEHLVATDAEEQDMVRV
        FQVWAYE ISTLS RV  RL+D+          L+R    Y R+ N+ E  V  + + + +VR+
Subjt:  FQVWAYEMISTLSLRVETRLSDEPFHDFSGGRALIRAGFMYWRS-NIKEHLVATDAEEQDMVRV

TrEMBL top hits    e value    % identity    Alignment
A0A6J1CZE8 uncharacterized protein LOC111015600    4.6e-51    61.81
Query:  KRFFGGRFYDDEDVVKVGIVYFIELAMMGKKRKQFIDMAMLGVVHWWEALCNYDWSSMVFDRTIWSLKNALKDKLSAYQQKARADPTHVETYSLYGFPYA
        K F    FYDDEDVVKVGIVYFIELAMMGK+RKQFID   +GVV  WEA CN DWSSM+FDRTIWSLKN LKDKLSAYQQKA ADPTHVETYSLYGFPY 
Subjt:  KRFFGGRFYDDEDVVKVGIVYFIELAMMGKKRKQFIDMAMLGVVHWWEALCNYDWSSMVFDRTIWSLKNALKDKLSAYQQKARADPTHVETYSLYGFPYA

Query:  FQVWAYEMISTLSLRVETRLSDEPFHDFSGGRALIRAGFMYWRSNIKEHLVATDAEEQDMVRVILPPEARVIIDPPTVPYWAAVRDPPTDVERGPVEDP
                     +R    L+ E F +              W S +KEHL+ATDAEEQ MVRVILPPE RVI DPP VP  A V D      R  V DP
Subjt:  FQVWAYEMISTLSLRVETRLSDEPFHDFSGGRALIRAGFMYWRSNIKEHLVATDAEEQDMVRVILPPEARVIIDPPTVPYWAAVRDPPTDVERGPVEDP

A0A6J1DJX9 uncharacterized protein LOC111020757    8.8e-111    66.1
Query:  KRFFGGRFYDDEDVVKVGIVYFIELAMMGKKRKQFIDMAMLGVVHWWEALCNYDWSSMVFDRTIWSLKNALKDKLSAYQQKARADPTHVETYSLYGFPYA
        K F    FYDDEDVVKV IVYFIELAMMGK+RKQFID A+LGVV  WE  CNYDWSSM+FDRTIWSLKNALKDKLS YQQKA ADP+HVETYSLYGFPYA
Subjt:  KRFFGGRFYDDEDVVKVGIVYFIELAMMGKKRKQFIDMAMLGVVHWWEALCNYDWSSMVFDRTIWSLKNALKDKLSAYQQKARADPTHVETYSLYGFPYA

Query:  FQVWAYEMISTLSLRVETRLSDEPFHDFSGGRALIRAGFMYWRSNIKEHLVATDAEEQDMVRVILPPEARVIIDPPTV------------PYWAAVRDPP
        FQVWAYE ISTLS     RL         G R L    F   RS +KEHL+ATDA+EQ MVRVILPPE RVI DPP V            P  AAV DPP
Subjt:  FQVWAYEMISTLSLRVETRLSDEPFHDFSGGRALIRAGFMYWRSNIKEHLVATDAEEQDMVRVILPPEARVIIDPPTV------------PYWAAVRDPP

Query:  TDVERGPVEDPVVEAPVIDEAGPSANDGEALE-------------QRLKRLDDHVGAIEDTLGDFGVALKGIQ---SVWVQGKFSDPNKYFGGGGGPDDD
         DVE GP+EDPVV+A  +DEA PSANDGE LE             +RLKRLD+ VGAIED LGDFGVALKGIQ       +GKF D +KYFGGGGGPDDD
Subjt:  TDVERGPVEDPVVEAPVIDEAGPSANDGEALE-------------QRLKRLDDHVGAIEDTLGDFGVALKGIQ---SVWVQGKFSDPNKYFGGGGGPDDD

Query:  GPSDQRPNESPKLDGGRKSMEEDQRPDEDPENDEKL------TSGHGPNNM
        GPSDQRP+ESPK DGGRKSM+EDQR DED   DE L      TSGHG + +
Subjt:  GPSDQRPNESPKLDGGRKSMEEDQRPDEDPENDEKL------TSGHGPNNM

A0A6J1DL40 uncharacterized protein LOC111022110    7.3e-41    37.24
Query:  MMGKKRKQFIDMAMLGVVHWWEALCNYDWSSMVFDRTIWSLKNALKDKLSAYQQKARADPTHVETYSLYGFPYAFQVWAYEMISTLSLRVETRLSDEPFH
        MMGK+RKQ +D ++LG+V  WE  C+YD SSM+F+RT+WSLKNALKDK+ AY+QK   D +HVETYSLYGFPYAFQVWAYE ISTLS RV  RL+D+   
Subjt:  MMGKKRKQFIDMAMLGVVHWWEALCNYDWSSMVFDRTIWSLKNALKDKLSAYQQKARADPTHVETYSLYGFPYAFQVWAYEMISTLSLRVETRLSDEPFH

Query:  DF--------SGGRALIRAGFMYWRSNIKEHLVATDAEEQDMVRVILPPEARV-------IIDPPTVPYWAAVRDPPTDVERGPVE--------DPVVEA
                       L R  F   +S +   L ATD E Q M RV+ PP A V       +   P      A + P T      VE         P+V+ 
Subjt:  DF--------SGGRALIRAGFMYWRSNIKEHLVATDAEEQDMVRVILPPEARV-------IIDPPTVPYWAAVRDPPTDVERGPVE--------DPVVEA

Query:  PVIDEAGPSANDGEALEQR----------------LKRLDDHVGAIEDTLGDFGVALKGIQSVW---VQGKFSDPNKYFGGGGGPDDDGPSDQRPNESPK
           D  G      + L Q+                L+RL D V AIE TL      +K I+       +      NKY   GG PD DG S  R      
Subjt:  PVIDEAGPSANDGEALEQR----------------LKRLDDHVGAIEDTLGDFGVALKGIQSVW---VQGKFSDPNKYFGGGGGPDDDGPSDQRPNESPK

Query:  LDGGRKSMEEDQRPDEDPENDEKLTSGHGPNNMDEDPKKKDEDAKKKDDDPMITEDDDGHYDRETTVDHADDHRHRVGVIQDQSDLQHAPID
           GR   EED   DEDP+  ++  +G  P  MDEDPK  +E        P    + D   D   T+         VG  Q+      +P+D
Subjt:  LDGGRKSMEEDQRPDEDPENDEKLTSGHGPNNMDEDPKKKDEDAKKKDDDPMITEDDDGHYDRETTVDHADDHRHRVGVIQDQSDLQHAPID

A0A6J1DRS0 uncharacterized protein LOC111022607    6.2e-32    49.45
Query:  SGHGPNNMDEDPKKKDEDAK-KKDDDPMITEDDD--------GHYDRETTVDHADDHRHRVGVI------------------------------------
        SGHGPN++DEDPK++D D    ++DD MIT+ D+        G     + VDH DDH  +V VI                                    
Subjt:  SGHGPNNMDEDPKKKDEDAK-KKDDDPMITEDDD--------GHYDRETTVDHADDHRHRVGVI------------------------------------

Query:  -------QDQSDLQHAPIDRRLRKRHYSWKLKGIYIPTGRRRITVDAYDPACPIPLQLDYQFQTWMDDPDTDGRSQSTATGL
               QD++DLQHAP  R LRK HYSWKLKGIY PTGRRRITVDAYDPACPIP QLD QFQTWMDD D DGR++STA GL
Subjt:  -------QDQSDLQHAPIDRRLRKRHYSWKLKGIYIPTGRRRITVDAYDPACPIPLQLDYQFQTWMDDPDTDGRSQSTATGL

A0A6J1DRZ7 uncharacterized protein LOC111023847    4.3e-41    55.49
Query:  KRFFGGRFYDDEDVVKVGIVYFIELAMMGKKRKQFIDMAMLGVVHWWEALCNYDWSSMVFDRTIWSLKNALKDKLSAYQQKARADPTHVETYSLYGFPYA
        K F    F +DED VK+ IVYFIELAMMGK+RK  +D ++LG+V  WE  CNYDWSSM+F+RT+WSLKNALKDK+  Y+QK   D +HVETYSLY FPYA
Subjt:  KRFFGGRFYDDEDVVKVGIVYFIELAMMGKKRKQFIDMAMLGVVHWWEALCNYDWSSMVFDRTIWSLKNALKDKLSAYQQKARADPTHVETYSLYGFPYA

Query:  FQVWAYEMISTLSLRVETRLSDEPFHDFSGGRALIRAGFMYWRS-NIKEHLVATDAEEQDMVRV
        FQVWAYE ISTLS RV  RL+D+          L+R    Y R+ N+ E  V  + + + +VR+
Subjt:  FQVWAYEMISTLSLRVETRLSDEPFHDFSGGRALIRAGFMYWRS-NIKEHLVATDAEEQDMVRV

SwissProt top hits    e value    % identity    Alignment
No hits found
Arabidopsis top hits    e value    % identity    Alignment
No hits found

Sequences
CDS sequence
ATGTTTCCATATAATGGAGCCGATGGAGCCACCGGAGACGAAGGAACAGTGACGATGTTACCGTATGAGATGCGTCGGAGGCCGGCAGCGAAATGTTCTGCAAACAAAGG
AGAAGAAGCTGTCGGAGACAACGAGGACGTCAGAGAGATGCTTCATAGGAGACGAAGCCTGACGATCAGCCTTCGATCCGAGCCTGGTCTGGGAGTGTCTTTCATCTCGG
ACTTGGCCCGGTCCGGTTCGGTCTTGGATGAAGGCCGATCCCGGACACACCCCGATCCTGTTCGGATCCGAGATGAAAGGCAGATTCATTCTGCCTCATTCTGCCTCTCA
TCTCGGATTGAACCGGACCGGGACGTGTCCGGGATCGGCCTTCATCCCGGACCAGGCCAATTCCGGGATGAAAGGCAGAATGGGACAACCCTAATCTGCTTCTCATCTCA
GACTCGTCTCAGTCCAATTGAGTCTGGGATGAAAGGCAGAATGCTCTTTTGCTTAATAGTGTTAAGGTTAAGTGTAGGGAGCTGGAAAAGATTTTTTGGAGGACGTTTTT
ACGACGACGAGGATGTTGTCAAGGTTGGCATAGTTTACTTCATAGAACTTGCTATGATGGGGAAGAAGAGGAAGCAGTTCATAGATATGGCCATGTTAGGTGTTGTACAT
TGGTGGGAGGCGTTATGCAACTATGACTGGAGTTCGATGGTTTTTGATAGGACGATCTGGAGTCTCAAGAACGCCCTGAAGGATAAACTGTCGGCCTACCAACAGAAGGC
AAGAGCCGACCCCACACACGTTGAGACTTATAGTTTGTACGGGTTTCCGTACGCATTTCAGGTATGGGCGTATGAGATGATCTCGACGTTGAGTTTGCGAGTAGAAACGA
GGTTGAGCGATGAGCCATTCCACGACTTCTCAGGTGGTCGTGCACTTATTCGAGCAGGATTCATGTACTGGCGATCCAATATTAAGGAACACTTGGTGGCGACAGATGCT
GAAGAACAAGATATGGTCCGTGTCATTCTTCCACCAGAAGCCCGCGTTATAATTGATCCGCCTACTGTACCTTATTGGGCTGCTGTACGTGATCCGCCTACTGATGTGGA
AAGGGGTCCTGTAGAGGATCCGGTAGTAGAGGCCCCTGTGATAGACGAGGCTGGACCCAGTGCAAATGACGGTGAAGCGTTAGAGCAGAGGTTGAAGAGGCTCGATGACC
ATGTTGGTGCTATCGAGGATACACTGGGTGACTTTGGAGTCGCCTTGAAAGGTATTCAGAGTGTTTGGGTGCAGGGTAAATTCTCTGATCCAAACAAGTATTTCGGAGGT
GGGGGTGGGCCCGATGATGATGGTCCATCGGATCAAAGGCCTAATGAGTCCCCGAAGCTAGATGGAGGTCGAAAGAGTATGGAAGAAGACCAGAGGCCAGATGAGGATCC
GGAGAATGACGAGAAACTGACGTCGGGGCATGGTCCGAATAATATGGACGAAGATCCAAAGAAAAAGGACGAGGATGCGAAAAAAAAGGACGATGATCCAATGATTACAG
AGGATGACGATGGACATTACGATCGAGAGACGACAGTGGATCATGCAGATGACCATAGACATCGGGTGGGCGTAATTCAGGACCAATCTGACCTTCAGCATGCCCCAATT
GACCGGAGGCTACGCAAGCGCCATTATTCGTGGAAACTGAAGGGTATATACATACCAACCGGCCGGCGTAGGATCACCGTGGATGCATACGACCCAGCATGTCCCATTCC
TCTGCAGCTGGACTATCAGTTCCAGACATGGATGGATGACCCGGACACCGATGGACGAAGTCAGTCTACTGCAACTGGCTTATAA
mRNA sequence
ATGTTTCCATATAATGGAGCCGATGGAGCCACCGGAGACGAAGGAACAGTGACGATGTTACCGTATGAGATGCGTCGGAGGCCGGCAGCGAAATGTTCTGCAAACAAAGG
AGAAGAAGCTGTCGGAGACAACGAGGACGTCAGAGAGATGCTTCATAGGAGACGAAGCCTGACGATCAGCCTTCGATCCGAGCCTGGTCTGGGAGTGTCTTTCATCTCGG
ACTTGGCCCGGTCCGGTTCGGTCTTGGATGAAGGCCGATCCCGGACACACCCCGATCCTGTTCGGATCCGAGATGAAAGGCAGATTCATTCTGCCTCATTCTGCCTCTCA
TCTCGGATTGAACCGGACCGGGACGTGTCCGGGATCGGCCTTCATCCCGGACCAGGCCAATTCCGGGATGAAAGGCAGAATGGGACAACCCTAATCTGCTTCTCATCTCA
GACTCGTCTCAGTCCAATTGAGTCTGGGATGAAAGGCAGAATGCTCTTTTGCTTAATAGTGTTAAGGTTAAGTGTAGGGAGCTGGAAAAGATTTTTTGGAGGACGTTTTT
ACGACGACGAGGATGTTGTCAAGGTTGGCATAGTTTACTTCATAGAACTTGCTATGATGGGGAAGAAGAGGAAGCAGTTCATAGATATGGCCATGTTAGGTGTTGTACAT
TGGTGGGAGGCGTTATGCAACTATGACTGGAGTTCGATGGTTTTTGATAGGACGATCTGGAGTCTCAAGAACGCCCTGAAGGATAAACTGTCGGCCTACCAACAGAAGGC
AAGAGCCGACCCCACACACGTTGAGACTTATAGTTTGTACGGGTTTCCGTACGCATTTCAGGTATGGGCGTATGAGATGATCTCGACGTTGAGTTTGCGAGTAGAAACGA
GGTTGAGCGATGAGCCATTCCACGACTTCTCAGGTGGTCGTGCACTTATTCGAGCAGGATTCATGTACTGGCGATCCAATATTAAGGAACACTTGGTGGCGACAGATGCT
GAAGAACAAGATATGGTCCGTGTCATTCTTCCACCAGAAGCCCGCGTTATAATTGATCCGCCTACTGTACCTTATTGGGCTGCTGTACGTGATCCGCCTACTGATGTGGA
AAGGGGTCCTGTAGAGGATCCGGTAGTAGAGGCCCCTGTGATAGACGAGGCTGGACCCAGTGCAAATGACGGTGAAGCGTTAGAGCAGAGGTTGAAGAGGCTCGATGACC
ATGTTGGTGCTATCGAGGATACACTGGGTGACTTTGGAGTCGCCTTGAAAGGTATTCAGAGTGTTTGGGTGCAGGGTAAATTCTCTGATCCAAACAAGTATTTCGGAGGT
GGGGGTGGGCCCGATGATGATGGTCCATCGGATCAAAGGCCTAATGAGTCCCCGAAGCTAGATGGAGGTCGAAAGAGTATGGAAGAAGACCAGAGGCCAGATGAGGATCC
GGAGAATGACGAGAAACTGACGTCGGGGCATGGTCCGAATAATATGGACGAAGATCCAAAGAAAAAGGACGAGGATGCGAAAAAAAAGGACGATGATCCAATGATTACAG
AGGATGACGATGGACATTACGATCGAGAGACGACAGTGGATCATGCAGATGACCATAGACATCGGGTGGGCGTAATTCAGGACCAATCTGACCTTCAGCATGCCCCAATT
GACCGGAGGCTACGCAAGCGCCATTATTCGTGGAAACTGAAGGGTATATACATACCAACCGGCCGGCGTAGGATCACCGTGGATGCATACGACCCAGCATGTCCCATTCC
TCTGCAGCTGGACTATCAGTTCCAGACATGGATGGATGACCCGGACACCGATGGACGAAGTCAGTCTACTGCAACTGGCTTATAA
Protein sequence
MFPYNGADGATGDEGTVTMLPYEMRRRPAAKCSANKGEEAVGDNEDVREMLHRRRSLTISLRSEPGLGVSFISDLARSGSVLDEGRSRTHPDPVRIRDERQIHSASFCLS
SRIEPDRDVSGIGLHPGPGQFRDERQNGTTLICFSSQTRLSPIESGMKGRMLFCLIVLRLSVGSWKRFFGGRFYDDEDVVKVGIVYFIELAMMGKKRKQFIDMAMLGVVH
WWEALCNYDWSSMVFDRTIWSLKNALKDKLSAYQQKARADPTHVETYSLYGFPYAFQVWAYEMISTLSLRVETRLSDEPFHDFSGGRALIRAGFMYWRSNIKEHLVATDA
EEQDMVRVILPPEARVIIDPPTVPYWAAVRDPPTDVERGPVEDPVVEAPVIDEAGPSANDGEALEQRLKRLDDHVGAIEDTLGDFGVALKGIQSVWVQGKFSDPNKYFGG
GGGPDDDGPSDQRPNESPKLDGGRKSMEEDQRPDEDPENDEKLTSGHGPNNMDEDPKKKDEDAKKKDDDPMITEDDDGHYDRETTVDHADDHRHRVGVIQDQSDLQHAPI
DRRLRKRHYSWKLKGIYIPTGRRRITVDAYDPACPIPLQLDYQFQTWMDDPDTDGRSQSTATGL
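
The protein sequence above corresponds to the CDS read codon by codon. As a rough illustration, the sketch below translates short excerpts of the CDS with the standard genetic code. The `translate` helper is hypothetical and is not part of any CuGenDB tooling; it assumes the input starts in frame and contains only the bases A, C, G and T.

```python
# Standard genetic code, laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds, to_stop=True):
    """Translate an in-frame DNA string; stop at the first stop codon by default.

    Raises KeyError on any codon containing a base other than A, C, G, T.
    """
    cds = cds.upper()
    protein = []
    # Ignore a trailing partial codon, if any.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)

# First 21 nt of the CDS listed above.
print(translate("ATGTTTCCATATAATGGAGCC"))  # prints: MFPYNGA
# Last 12 nt of the CDS listed above (ends with the TAA stop codon).
print(translate("ACTGGCTTATAA"))           # prints: TGL
```

Both outputs agree with the start (`MFPYNGA...`) and end (`...TGL`) of the listed protein sequence.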