; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc05g17080 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc05g17080
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUlp1-like peptidase
Genome locationchr5:12773828..12780566
RNA-Seq ExpressionMoc05g17080
SyntenyMoc05g17080
Gene Ontology termsNA
InterPro domainsIPR015410 - Domain of unknown function DUF1985


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022141672.1 uncharacterized protein LOC111011977 [Momordica charantia]7.1e-7261.43Show/hide
Query:  MEMVPKVPPAMYVSSQVSCLLHLAKTAQDIKEKLTPCQLRLFRKTVFGHLIDVDLVFNGPLIHSLLLREVLVENSSPDTISFNLLGSNVSFGRREFDLIS
        MEMVPK+PP+ Y SS ++CL H+  T + IK KLTP QL +FRKT+FGHL+DV+LV NGPL+H+LLLRE  +ENS  D I+ ++LG+ VSFG  EF LI+
Subjt:  MEMVPKVPPAMYVSSQVSCLLHLAKTAQDIKEKLTPCQLRLFRKTVFGHLIDVDLVFNGPLIHSLLLREVLVENSSPDTISFNLLGSNVSFGRREFDLIS

Query:  GLKYSRAQVRKDTFPARLRTLYFDDDVNLLLANLEKKYKMLRFDDDWDAVKVSIVYFVELVLHGRERSLKFDSSLLGIVDVWEVCCNYDWGSISFDKTIK
        GLK SR  +RKDT P RLR LYFDD V++LL+  E +Y +++F+DD DAVK+S++ FVELVL G++R+LK D+SLLG+VD  EVCCNYDW  +SF+KTIK
Subjt:  GLKYSRAQVRKDTFPARLRTLYFDDDVNLLLANLEKKYKMLRFDDDWDAVKVSIVYFVELVLHGRERSLKFDSSLLGIVDVWEVCCNYDWGSISFDKTIK

Query:  SLQRALTSKTDDGRLRKAYSLYG
        SL+R LT K  D RLRK YSL+G
Subjt:  SLQRALTSKTDDGRLRKAYSLYG

XP_022154561.1 uncharacterized protein LOC111021802 [Momordica charantia]5.6e-8559.41Show/hide
Query:  LFRKTVFGHLIDVDLVFNGPLIHSLLLREVLVENSSPDTISFNLLGSNVSFGRREFDLISGLKYSRAQVRKDTFPARLRTLYFDDDVNLLLANLEKKYKM
        +FRKT F HL+DVDLVFNG LIH++LLRE  VE S+P+TISFNL    +SF R +F LISGLKY R  VR++T P RL TLYF+D  +L+L++ EK Y  
Subjt:  LFRKTVFGHLIDVDLVFNGPLIHSLLLREVLVENSSPDTISFNLLGSNVSFGRREFDLISGLKYSRAQVRKDTFPARLRTLYFDDDVNLLLANLEKKYKM

Query:  LRFDDDWDAVKVSIVYFVELVLHGRERSLKFDSSLLGIVDVWEVCCNYDWGSISFDKTIKSLQRALTSKTDDGRLRKAYSLYGFPWVFQVWGYETISSLD
         RF+DD+D VKV IVY V + L GRER +KFD +LLGIVD WEVCCNY+W S+SF+KTI SLQR     + DG+LRK+YSLYGFPWVFQVW Y+TISSL 
Subjt:  LRFDDDWDAVKVSIVYFVELVLHGRERSLKFDSSLLGIVDVWEVCCNYDWGSISFDKTIKSLQRALTSKTDDGRLRKAYSLYGFPWVFQVWGYETISSLD

Query:  GRVAIKVNDDVVPYMLRWRCVRSIGWLMLERDIFQSTTAKAQSLEQTESETIFFNRTFEPPASNDVENDEE
         RVA KV  D VP++ +WR   S  W +L+RDIF ST  + ++L++T+ ET F NR+F+PP S+D +  EE
Subjt:  GRVAIKVNDDVVPYMLRWRCVRSIGWLMLERDIFQSTTAKAQSLEQTESETIFFNRTFEPPASNDVENDEE

XP_022157998.1 uncharacterized protein LOC111024595 [Momordica charantia]4.9e-7368.25Show/hide
Query:  LTPCQLRLFRKTVFGHLIDVDLVFNGPLIHSLLLREVLVENSSPDTISFNLLGSNVSFGRREFDLISGLKYSRAQVRKDTFPARLRTLYFDDDVNLLLAN
        +TP QL +FRKTVFGHL+DVDLVFNGPLIHS+LLRE  VE S+PDTISFNL  + VSFGRREFD+ISGLKY R+ VRK T+P RLRTLYF++  +LLL+ 
Subjt:  LTPCQLRLFRKTVFGHLIDVDLVFNGPLIHSLLLREVLVENSSPDTISFNLLGSNVSFGRREFDLISGLKYSRAQVRKDTFPARLRTLYFDDDVNLLLAN

Query:  LEKKYKMLRFDDDWDAVKVSIVYFVELVLHGRERSLKFDSSLLGIVDVWEVCCNYDWGSISFDKTIKSLQRALTSKTDDGRLRKAYSLYGFPWVFQVWGY
        LEK Y  + F+DD+DAVKVSIVYFVELVL            LLGIVD WE CCN+DW  +SF+KTI SLQR  + K+ +G LRK+YSL+GFPWVFQVW Y
Subjt:  LEKKYKMLRFDDDWDAVKVSIVYFVELVLHGRERSLKFDSSLLGIVDVWEVCCNYDWGSISFDKTIKSLQRALTSKTDDGRLRKAYSLYGFPWVFQVWGY

Query:  ETISSLDGRVA
        ETISSL GRVA
Subjt:  ETISSLDGRVA

XP_022158744.1 uncharacterized protein LOC111025209 [Momordica charantia]2.5e-8559.06Show/hide
Query:  MVPKVPPAMYVSSQVSCLLHLAKTAQDIKEKLTPCQLRLFRKTVFGHLIDVDLVFNGPLIHSLLLREVLVENSSPDTISFNLLGSNVSFGRREFDLISGL
        M+PK+ PA Y S++++CL H+AKT+ DIK KLTP QL +FRKT+F HL+DVDLVFNGP                       LLG+ VSFGRREFD+ISGL
Subjt:  MVPKVPPAMYVSSQVSCLLHLAKTAQDIKEKLTPCQLRLFRKTVFGHLIDVDLVFNGPLIHSLLLREVLVENSSPDTISFNLLGSNVSFGRREFDLISGL

Query:  KYSRAQVRKDTFPARLRTLYFDDDVNLLLANLEKKYKMLRFDDDWDAVKVSIVYFVELVLHGRERSLKFDSSLLGIVDVWEVCCNYDWGSISFDKTIKSL
        KYSR+ VRK T+P R  TLYF++  +LLL+ LEK Y  +RF+DD DAVKV +VYFVELVL GRERS KFD  LLGIVD WE CCN+DW  +SFDKTI SL
Subjt:  KYSRAQVRKDTFPARLRTLYFDDDVNLLLANLEKKYKMLRFDDDWDAVKVSIVYFVELVLHGRERSLKFDSSLLGIVDVWEVCCNYDWGSISFDKTIKSL

Query:  QRALTSKTDDGRLRKAYSLYGFPWVFQVWGYETISSLDGRVAIKVNDDVVPYMLRWRCVRSIGWLMLERDIFQSTT
        QR  ++K+ +G LRK+YSLYGFPW FQVW YE ISSL G +   V+ DVVP +L+WR   S  + ML R+IF+S+T
Subjt:  QRALTSKTDDGRLRKAYSLYGFPWVFQVWGYETISSLDGRVAIKVNDDVVPYMLRWRCVRSIGWLMLERDIFQSTT

XP_022159253.1 uncharacterized protein LOC111025666 [Momordica charantia]1.6e-7162.78Show/hide
Query:  MVPKVPPAMYVSSQVSCLLHLAKTAQDIKEKLTPCQLRLFRKTVFGHLIDVDLVFNGPLIHSLLLREVLVENSSPDTISFNLLGSNVSFGRREFDLISGL
        MV K+PP+   S+ ++ L HLAKT   IK KLTP QL +FRKTVFGHL+D+DLVFN  LIH +LLRE+  ++S+P+TISFNL GS V F RREFD+ISGL
Subjt:  MVPKVPPAMYVSSQVSCLLHLAKTAQDIKEKLTPCQLRLFRKTVFGHLIDVDLVFNGPLIHSLLLREVLVENSSPDTISFNLLGSNVSFGRREFDLISGL

Query:  KYSRAQVRKDTFPARLRTLYFDDDVNLLLANLEKKYKMLRFDDDWDAVKVSIVYFVELVLHGRERSLKFDSSLLGIVDVWEVCCNYDWGSISFDKTIKSL
        KY R+ VRKDT P RLR LYF+D  ++LL++ EK Y + RF+DD+DA K+SIVY +ELVL GRER+LK+D +LLGIVD  E CCN+DWG +SFDKTI SL
Subjt:  KYSRAQVRKDTFPARLRTLYFDDDVNLLLANLEKKYKMLRFDDDWDAVKVSIVYFVELVLHGRERSLKFDSSLLGIVDVWEVCCNYDWGSISFDKTIKSL

Query:  QRALTSKTDDGRLRKAYSLYGFP
        +R  T ++ DG  RK YSLYGFP
Subjt:  QRALTSKTDDGRLRKAYSLYGFP

TrEMBL top hitse value%identityAlignment
A0A6J1CKH8 uncharacterized protein LOC1110119773.4e-7261.43Show/hide
Query:  MEMVPKVPPAMYVSSQVSCLLHLAKTAQDIKEKLTPCQLRLFRKTVFGHLIDVDLVFNGPLIHSLLLREVLVENSSPDTISFNLLGSNVSFGRREFDLIS
        MEMVPK+PP+ Y SS ++CL H+  T + IK KLTP QL +FRKT+FGHL+DV+LV NGPL+H+LLLRE  +ENS  D I+ ++LG+ VSFG  EF LI+
Subjt:  MEMVPKVPPAMYVSSQVSCLLHLAKTAQDIKEKLTPCQLRLFRKTVFGHLIDVDLVFNGPLIHSLLLREVLVENSSPDTISFNLLGSNVSFGRREFDLIS

Query:  GLKYSRAQVRKDTFPARLRTLYFDDDVNLLLANLEKKYKMLRFDDDWDAVKVSIVYFVELVLHGRERSLKFDSSLLGIVDVWEVCCNYDWGSISFDKTIK
        GLK SR  +RKDT P RLR LYFDD V++LL+  E +Y +++F+DD DAVK+S++ FVELVL G++R+LK D+SLLG+VD  EVCCNYDW  +SF+KTIK
Subjt:  GLKYSRAQVRKDTFPARLRTLYFDDDVNLLLANLEKKYKMLRFDDDWDAVKVSIVYFVELVLHGRERSLKFDSSLLGIVDVWEVCCNYDWGSISFDKTIK

Query:  SLQRALTSKTDDGRLRKAYSLYG
        SL+R LT K  D RLRK YSL+G
Subjt:  SLQRALTSKTDDGRLRKAYSLYG

A0A6J1DP34 uncharacterized protein LOC1110218022.7e-8559.41Show/hide
Query:  LFRKTVFGHLIDVDLVFNGPLIHSLLLREVLVENSSPDTISFNLLGSNVSFGRREFDLISGLKYSRAQVRKDTFPARLRTLYFDDDVNLLLANLEKKYKM
        +FRKT F HL+DVDLVFNG LIH++LLRE  VE S+P+TISFNL    +SF R +F LISGLKY R  VR++T P RL TLYF+D  +L+L++ EK Y  
Subjt:  LFRKTVFGHLIDVDLVFNGPLIHSLLLREVLVENSSPDTISFNLLGSNVSFGRREFDLISGLKYSRAQVRKDTFPARLRTLYFDDDVNLLLANLEKKYKM

Query:  LRFDDDWDAVKVSIVYFVELVLHGRERSLKFDSSLLGIVDVWEVCCNYDWGSISFDKTIKSLQRALTSKTDDGRLRKAYSLYGFPWVFQVWGYETISSLD
         RF+DD+D VKV IVY V + L GRER +KFD +LLGIVD WEVCCNY+W S+SF+KTI SLQR     + DG+LRK+YSLYGFPWVFQVW Y+TISSL 
Subjt:  LRFDDDWDAVKVSIVYFVELVLHGRERSLKFDSSLLGIVDVWEVCCNYDWGSISFDKTIKSLQRALTSKTDDGRLRKAYSLYGFPWVFQVWGYETISSLD

Query:  GRVAIKVNDDVVPYMLRWRCVRSIGWLMLERDIFQSTTAKAQSLEQTESETIFFNRTFEPPASNDVENDEE
         RVA KV  D VP++ +WR   S  W +L+RDIF ST  + ++L++T+ ET F NR+F+PP S+D +  EE
Subjt:  GRVAIKVNDDVVPYMLRWRCVRSIGWLMLERDIFQSTTAKAQSLEQTESETIFFNRTFEPPASNDVENDEE

A0A6J1DUW1 uncharacterized protein LOC1110245952.4e-7368.25Show/hide
Query:  LTPCQLRLFRKTVFGHLIDVDLVFNGPLIHSLLLREVLVENSSPDTISFNLLGSNVSFGRREFDLISGLKYSRAQVRKDTFPARLRTLYFDDDVNLLLAN
        +TP QL +FRKTVFGHL+DVDLVFNGPLIHS+LLRE  VE S+PDTISFNL  + VSFGRREFD+ISGLKY R+ VRK T+P RLRTLYF++  +LLL+ 
Subjt:  LTPCQLRLFRKTVFGHLIDVDLVFNGPLIHSLLLREVLVENSSPDTISFNLLGSNVSFGRREFDLISGLKYSRAQVRKDTFPARLRTLYFDDDVNLLLAN

Query:  LEKKYKMLRFDDDWDAVKVSIVYFVELVLHGRERSLKFDSSLLGIVDVWEVCCNYDWGSISFDKTIKSLQRALTSKTDDGRLRKAYSLYGFPWVFQVWGY
        LEK Y  + F+DD+DAVKVSIVYFVELVL            LLGIVD WE CCN+DW  +SF+KTI SLQR  + K+ +G LRK+YSL+GFPWVFQVW Y
Subjt:  LEKKYKMLRFDDDWDAVKVSIVYFVELVLHGRERSLKFDSSLLGIVDVWEVCCNYDWGSISFDKTIKSLQRALTSKTDDGRLRKAYSLYGFPWVFQVWGY

Query:  ETISSLDGRVA
        ETISSL GRVA
Subjt:  ETISSLDGRVA

A0A6J1DYB1 uncharacterized protein LOC1110256667.7e-7262.78Show/hide
Query:  MVPKVPPAMYVSSQVSCLLHLAKTAQDIKEKLTPCQLRLFRKTVFGHLIDVDLVFNGPLIHSLLLREVLVENSSPDTISFNLLGSNVSFGRREFDLISGL
        MV K+PP+   S+ ++ L HLAKT   IK KLTP QL +FRKTVFGHL+D+DLVFN  LIH +LLRE+  ++S+P+TISFNL GS V F RREFD+ISGL
Subjt:  MVPKVPPAMYVSSQVSCLLHLAKTAQDIKEKLTPCQLRLFRKTVFGHLIDVDLVFNGPLIHSLLLREVLVENSSPDTISFNLLGSNVSFGRREFDLISGL

Query:  KYSRAQVRKDTFPARLRTLYFDDDVNLLLANLEKKYKMLRFDDDWDAVKVSIVYFVELVLHGRERSLKFDSSLLGIVDVWEVCCNYDWGSISFDKTIKSL
        KY R+ VRKDT P RLR LYF+D  ++LL++ EK Y + RF+DD+DA K+SIVY +ELVL GRER+LK+D +LLGIVD  E CCN+DWG +SFDKTI SL
Subjt:  KYSRAQVRKDTFPARLRTLYFDDDVNLLLANLEKKYKMLRFDDDWDAVKVSIVYFVELVLHGRERSLKFDSSLLGIVDVWEVCCNYDWGSISFDKTIKSL

Query:  QRALTSKTDDGRLRKAYSLYGFP
        +R  T ++ DG  RK YSLYGFP
Subjt:  QRALTSKTDDGRLRKAYSLYGFP

A0A6J1E0A9 uncharacterized protein LOC1110252091.2e-8559.06Show/hide
Query:  MVPKVPPAMYVSSQVSCLLHLAKTAQDIKEKLTPCQLRLFRKTVFGHLIDVDLVFNGPLIHSLLLREVLVENSSPDTISFNLLGSNVSFGRREFDLISGL
        M+PK+ PA Y S++++CL H+AKT+ DIK KLTP QL +FRKT+F HL+DVDLVFNGP                       LLG+ VSFGRREFD+ISGL
Subjt:  MVPKVPPAMYVSSQVSCLLHLAKTAQDIKEKLTPCQLRLFRKTVFGHLIDVDLVFNGPLIHSLLLREVLVENSSPDTISFNLLGSNVSFGRREFDLISGL

Query:  KYSRAQVRKDTFPARLRTLYFDDDVNLLLANLEKKYKMLRFDDDWDAVKVSIVYFVELVLHGRERSLKFDSSLLGIVDVWEVCCNYDWGSISFDKTIKSL
        KYSR+ VRK T+P R  TLYF++  +LLL+ LEK Y  +RF+DD DAVKV +VYFVELVL GRERS KFD  LLGIVD WE CCN+DW  +SFDKTI SL
Subjt:  KYSRAQVRKDTFPARLRTLYFDDDVNLLLANLEKKYKMLRFDDDWDAVKVSIVYFVELVLHGRERSLKFDSSLLGIVDVWEVCCNYDWGSISFDKTIKSL

Query:  QRALTSKTDDGRLRKAYSLYGFPWVFQVWGYETISSLDGRVAIKVNDDVVPYMLRWRCVRSIGWLMLERDIFQSTT
        QR  ++K+ +G LRK+YSLYGFPW FQVW YE ISSL G +   V+ DVVP +L+WR   S  + ML R+IF+S+T
Subjt:  QRALTSKTDDGRLRKAYSLYGFPWVFQVWGYETISSLDGRVAIKVNDDVVPYMLRWRCVRSIGWLMLERDIFQSTT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTGGGTGTCAGGGCCTCGGGTATAAATGGTCGGGGGCTGATAAGGGCTGCTTACTGAGTACTGTGGTTGTACTCATCCCTCTTTTCCCCCTCCAGTTTGCA
GGTATCGAGCTAGCTCCTGGGATGGTGAAAATTTTAGGTAGTCTTAGTAACGGTCCCTGTTTGGAAATGGTGAAATTCCTGGGGCGTTACAGACCTAATCCCTTA
CCGTATGATGACCAAAGATATAGGGAAATGGATTTGGTACCGAAGGTCCCTCTCGCGCTGTATGTCTGGTCCCAGCTCACCTGTCTATTGCACCTAGCGAAGACA
TCCGAGGCTATTAAGAAGAAACTTACCCCCGATCAGCTTCGTTTATTTAGGAAGACTGCAGTTGGGCATTTGCTTAATGTGGATCTTGTCTTTAACGGGCCACTA
ATTCATTGTCTGCTGCTTAGGGAGGTAAATGAGAGTCGTAAGGACACCATGAGTTTTAACTTGTTTGGTAGTAAGGTGTCCTTTGGACGGAGAGAATTTGATTTA
ATTAGCGGCCTAAAATATTCGACAGCCACCGTTATGAAGGACACCTGTCCTCGTAGGCTTAGGGCCCTATACTTCGATGATAATGTAGTGGCTGAGTTTGAGAAG
AAGTATGAAGTGCTTCGTTTTGAGGATGATTGGGATGCGGTCAAGGTCTCAATTGTATACTTTGTTGAATTTGTATTTTACGGGCGGGAAAGAAGCGTGAAGTAC
GACAGTAGTCTGCTTGGAATAGTAGATGACTGGGAAGTGTGCTGCAACCATGACTGGGGGCAACTATCATTTGAGAAGACTATGAGGAGTCTAGAGCGAGCACTT
GCGAAGAAGACACCTCAGGGGGCGTTGAGGAAATCGTACAGTCTTTATGGTTTCCCGTGGGTGTTCCAGGTGTGGGCGTATGAGATAATATCTTCCATGACGGGA
CGAGTTGCCAGGAAGATAAGTGATAGTTTGGTACCCCGCATGCTCCGGTGGAGGTGCGGTCACTCGACTGGATGGCTCGTGCTGGAGAGGGAGATCTTCCAATCT
ACAACAGCCAGAATTAGGGGCTTAGAACCGACTGAAGCGGAGAGGAGGTTTTTCAATAGGGCAATCGAACCACCAGCATCAGATGAACTAAAAGCTGATGAGGAA
GTTCGTCAACGTCCAGAGACTGCATTCGAGGGTGAGGACATAAACACGGACCGTCCAACCGACAATGTTGGTGCTCCTATTGCTGCCGAGGTCGATCGTAACCAG
GATGGAGAGGCGATACAGAACAAAGAAAAAAAGAAGAAGGAGGTTGATTTGATATATTTGAGGTGGTGGGGCGAGGCTGAGCCCGAGGAAATGGAAATGGTACCG
AAGGTCCCTCCCGCGATGTATGTCTCGTCCCAAGTGTCTTGTCTGTTGCACCTAGCAAAGACAGCCCAGGATATTAAAGAGAAACTTACCCCCTGTCAGCTTCGT
CTTTTTAGGAAGACTGTATTTGGCCATCTCATTGATGTGGATCTAGTATTTAACGGGCCGTTAATACATAGCTTGTTACTCAGGGAGGTATTGGTAGAAAATAGT
AGTCCGGACACCATTAGTTTCAACTTGTTGGGGAGTAACGTGTCATTCGGTCGGAGAGAATTTGACCTCATAAGCGGCCTAAAATATTCGAGAGCACAGGTTAGG
AAAGACACGTTTCCTGCCAGGCTTAGAACATTGTACTTCGATGATGACGTAAACCTGCTCTTGGCCAATTTAGAGAAGAAGTACAAAATGTTACGGTTTGATGAT
GATTGGGACGCGGTGAAGGTCTCAATTGTGTACTTCGTGGAACTAGTACTTCATGGACGGGAGAGAAGTTTGAAGTTTGACAGTAGTTTGCTAGGAATAGTAGAT
GTTTGGGAAGTTTGCTGCAACTATGACTGGGGATCCATATCGTTTGACAAGACAATAAAAAGTCTTCAACGAGCACTGACGAGTAAGACAGATGATGGGAGGTTG
AGGAAAGCGTACAGTCTGTATGGTTTCCCGTGGGTGTTTCAGGTGTGGGGGTATGAGACGATATCGTCCCTGGATGGACGAGTTGCTATAAAAGTTAATGATGAC
GTGGTACCCTACATGCTCCGGTGGAGGTGTGTTCGTTCGATAGGATGGCTGATGCTGGAAAGAGACATATTCCAGTCAACAACAGCCAAAGCACAGAGCTTGGAA
CAAACTGAATCAGAGACGATATTTTTTAATAGGACGTTCGAACCACCAGCATCCAATGATGTGGAAAATGATGAAGAAGGGCTCCAAAGTCCGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGTGGGTGTCAGGGCCTCGGGTATAAATGGTCGGGGGCTGATAAGGGCTGCTTACTGAGTACTGTGGTTGTACTCATCCCTCTTTTCCCCCTCCAGTTTGCA
GGTATCGAGCTAGCTCCTGGGATGGTGAAAATTTTAGGTAGTCTTAGTAACGGTCCCTGTTTGGAAATGGTGAAATTCCTGGGGCGTTACAGACCTAATCCCTTA
CCGTATGATGACCAAAGATATAGGGAAATGGATTTGGTACCGAAGGTCCCTCTCGCGCTGTATGTCTGGTCCCAGCTCACCTGTCTATTGCACCTAGCGAAGACA
TCCGAGGCTATTAAGAAGAAACTTACCCCCGATCAGCTTCGTTTATTTAGGAAGACTGCAGTTGGGCATTTGCTTAATGTGGATCTTGTCTTTAACGGGCCACTA
ATTCATTGTCTGCTGCTTAGGGAGGTAAATGAGAGTCGTAAGGACACCATGAGTTTTAACTTGTTTGGTAGTAAGGTGTCCTTTGGACGGAGAGAATTTGATTTA
ATTAGCGGCCTAAAATATTCGACAGCCACCGTTATGAAGGACACCTGTCCTCGTAGGCTTAGGGCCCTATACTTCGATGATAATGTAGTGGCTGAGTTTGAGAAG
AAGTATGAAGTGCTTCGTTTTGAGGATGATTGGGATGCGGTCAAGGTCTCAATTGTATACTTTGTTGAATTTGTATTTTACGGGCGGGAAAGAAGCGTGAAGTAC
GACAGTAGTCTGCTTGGAATAGTAGATGACTGGGAAGTGTGCTGCAACCATGACTGGGGGCAACTATCATTTGAGAAGACTATGAGGAGTCTAGAGCGAGCACTT
GCGAAGAAGACACCTCAGGGGGCGTTGAGGAAATCGTACAGTCTTTATGGTTTCCCGTGGGTGTTCCAGGTGTGGGCGTATGAGATAATATCTTCCATGACGGGA
CGAGTTGCCAGGAAGATAAGTGATAGTTTGGTACCCCGCATGCTCCGGTGGAGGTGCGGTCACTCGACTGGATGGCTCGTGCTGGAGAGGGAGATCTTCCAATCT
ACAACAGCCAGAATTAGGGGCTTAGAACCGACTGAAGCGGAGAGGAGGTTTTTCAATAGGGCAATCGAACCACCAGCATCAGATGAACTAAAAGCTGATGAGGAA
GTTCGTCAACGTCCAGAGACTGCATTCGAGGGTGAGGACATAAACACGGACCGTCCAACCGACAATGTTGGTGCTCCTATTGCTGCCGAGGTCGATCGTAACCAG
GATGGAGAGGCGATACAGAACAAAGAAAAAAAGAAGAAGGAGGTTGATTTGATATATTTGAGGTGGTGGGGCGAGGCTGAGCCCGAGGAAATGGAAATGGTACCG
AAGGTCCCTCCCGCGATGTATGTCTCGTCCCAAGTGTCTTGTCTGTTGCACCTAGCAAAGACAGCCCAGGATATTAAAGAGAAACTTACCCCCTGTCAGCTTCGT
CTTTTTAGGAAGACTGTATTTGGCCATCTCATTGATGTGGATCTAGTATTTAACGGGCCGTTAATACATAGCTTGTTACTCAGGGAGGTATTGGTAGAAAATAGT
AGTCCGGACACCATTAGTTTCAACTTGTTGGGGAGTAACGTGTCATTCGGTCGGAGAGAATTTGACCTCATAAGCGGCCTAAAATATTCGAGAGCACAGGTTAGG
AAAGACACGTTTCCTGCCAGGCTTAGAACATTGTACTTCGATGATGACGTAAACCTGCTCTTGGCCAATTTAGAGAAGAAGTACAAAATGTTACGGTTTGATGAT
GATTGGGACGCGGTGAAGGTCTCAATTGTGTACTTCGTGGAACTAGTACTTCATGGACGGGAGAGAAGTTTGAAGTTTGACAGTAGTTTGCTAGGAATAGTAGAT
GTTTGGGAAGTTTGCTGCAACTATGACTGGGGATCCATATCGTTTGACAAGACAATAAAAAGTCTTCAACGAGCACTGACGAGTAAGACAGATGATGGGAGGTTG
AGGAAAGCGTACAGTCTGTATGGTTTCCCGTGGGTGTTTCAGGTGTGGGGGTATGAGACGATATCGTCCCTGGATGGACGAGTTGCTATAAAAGTTAATGATGAC
GTGGTACCCTACATGCTCCGGTGGAGGTGTGTTCGTTCGATAGGATGGCTGATGCTGGAAAGAGACATATTCCAGTCAACAACAGCCAAAGCACAGAGCTTGGAA
CAAACTGAATCAGAGACGATATTTTTTAATAGGACGTTCGAACCACCAGCATCCAATGATGTGGAAAATGATGAAGAAGGGCTCCAAAGTCCGTGA
Protein sequenceShow/hide protein sequence
MGGCQGLGYKWSGADKGCLLSTVVVLIPLFPLQFAGIELAPGMVKILGSLSNGPCLEMVKFLGRYRPNPLPYDDQRYREMDLVPKVPLALYVWSQLTCLLHLAKT
SEAIKKKLTPDQLRLFRKTAVGHLLNVDLVFNGPLIHCLLLREVNESRKDTMSFNLFGSKVSFGRREFDLISGLKYSTATVMKDTCPRRLRALYFDDNVVAEFEK
KYEVLRFEDDWDAVKVSIVYFVEFVFYGRERSVKYDSSLLGIVDDWEVCCNHDWGQLSFEKTMRSLERALAKKTPQGALRKSYSLYGFPWVFQVWAYEIISSMTG
RVARKISDSLVPRMLRWRCGHSTGWLVLEREIFQSTTARIRGLEPTEAERRFFNRAIEPPASDELKADEEVRQRPETAFEGEDINTDRPTDNVGAPIAAEVDRNQ
DGEAIQNKEKKKKEVDLIYLRWWGEAEPEEMEMVPKVPPAMYVSSQVSCLLHLAKTAQDIKEKLTPCQLRLFRKTVFGHLIDVDLVFNGPLIHSLLLREVLVENS
SPDTISFNLLGSNVSFGRREFDLISGLKYSRAQVRKDTFPARLRTLYFDDDVNLLLANLEKKYKMLRFDDDWDAVKVSIVYFVELVLHGRERSLKFDSSLLGIVD
VWEVCCNYDWGSISFDKTIKSLQRALTSKTDDGRLRKAYSLYGFPWVFQVWGYETISSLDGRVAIKVNDDVVPYMLRWRCVRSIGWLMLERDIFQSTTAKAQSLE
QTESETIFFNRTFEPPASNDVENDEEGLQSP