CuGenDBv2

Moc01g10760 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc01g10760
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: LOW QUALITY PROTEIN: uncharacterized protein LOC111019515
Genome location: chr1:6629358..6632488
RNA-Seq Expression: Moc01g10760
Synteny: Moc01g10760
Gene Ontology terms: GO:0015074 - DNA integration (biological process)
                     GO:0003676 - nucleic acid binding (molecular function)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
XP_022143608.1 uncharacterized protein LOC111013464 [Momordica charantia], e value 8.7e-42, identity 94.68%
Query:  HGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVQNNSSNLENMMKEYMARTDAVIQ
        HGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPP QQQYNQRT+TP VQNN+SNLENMMKEYMARTD VIQ
Subjt:  HGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVQNNSSNLENMMKEYMARTDAVIQ

XP_022150317.1 uncharacterized protein LOC111018514 [Momordica charantia], e value 5.4e-52, identity 68.86%
Query:  GHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPT---QQYIPPPQQQYNQRTQTP--PVQNNSSNLENMMKEYMARTDAV--
        GHG NR FNPYSNTYNPG RHHPNFSWGGQG SSG  QGQ+QQ KQPYVP T    Q +PPP QQYNQ  +TP  P  NN+++LENM KEYMAR DA+  
Subjt:  GHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPT---QQYIPPPQQQYNQRTQTP--PVQNNSSNLENMMKEYMARTDAV--

Query:  -----IQSQAASMRNFETQLGQLANELKNRPQGSFPGQTELPKREGKEQCKAVTLRSGLAYDGPTMP
             IQ+QAA MRN E Q+GQ AN+LK RPQGSFPG TE+ KR+G EQCKAVTLRSGL+Y+GP MP
Subjt:  -----IQSQAASMRNFETQLGQLANELKNRPQGSFPGQTELPKREGKEQCKAVTLRSGLAYDGPTMP

XP_022158408.1 uncharacterized protein LOC111024897, partial [Momordica charantia], e value 2.8e-149, identity 58.79%
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT
        MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLE QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFN+IQMADNRDVAMREYAAT
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT

Query:  AFHNFDSGIVNPIPAHANFELRPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPG---------------------------------------
        AF NFDSGIVNPIPAH NFEL+PMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPG                                       
Subjt:  AFHNFDSGIVNPIPAHANFELRPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPG---------------------------------------

Query:  ----------------------------------------------HG----------------------NN--------RNF-----------------
                                                      HG                      NN        + F                 
Subjt:  ----------------------------------------------HG----------------------NN--------RNF-----------------

Query:  ---------------------------------------------NPYSNTYNP------------------GWRHHPNFSWGGQGGSSGFNQGQSQQNK
                                                     NP +    P                   WRHHPNFSWGGQGGSSGFNQGQSQQNK
Subjt:  ---------------------------------------------NPYSNTYNP------------------GWRHHPNFSWGGQGGSSGFNQGQSQQNK

Query:  QPYVPPTQQYIPPPQQQYNQRTQTPPVQNNSSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGQTELPKREGKEQCKAVTLRS
        QPYVPPTQQ+IPPPQQQYNQRTQTPP+QNN+SNLENMMKEYMARTDAVIQSQAASMRNF TQLG LANELKNRPQGSFPG TELP+REGKEQCKAVTLRS
Subjt:  QPYVPPTQQYIPPPQQQYNQRTQTPPVQNNSSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGQTELPKREGKEQCKAVTLRS

Query:  GLAYDGPTMPTTDVQIPSTEPTVKIPENSTTPEKENTRKGDFEECS
        GL YDGPTMPTTDVQIPST+PTVKIPEN TTPEKEN RKG+ +  S
Subjt:  GLAYDGPTMPTTDVQIPSTEPTVKIPENSTTPEKENTRKGDFEECS

XP_022158979.1 uncharacterized protein LOC111025424 [Momordica charantia], e value 5.4e-100, identity 66.99%
Query:  LPGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVQNNSSNLENMMKEYMARTDAVIQSQA
        LP  GNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQ YVP TQQY PPPQQ YNQR QTPPVQNN+SNLEN MKEYMARTDAVIQSQA
Subjt:  LPGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVQNNSSNLENMMKEYMARTDAVIQSQA

Query:  ASMRNFETQLGQLANELKNRPQGSFPGQTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPTVKIPENSTTPEKENTRKGDFEECSAITNLNP
        ASMRNFETQLG LAN LKNRPQGSF G TELPK EGKE CKAVTLRSGL Y+ PTMPTTDVQI STEPT                               
Subjt:  ASMRNFETQLGQLANELKNRPQGSFPGQTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPTVKIPENSTTPEKENTRKGDFEECSAITNLNP

Query:  VMFDEFYDLLVTEIEEELDKMAERPEDVTNPIEKIQKEECKSLLPSIVEPPTLEQKPLPSHPKYVYLGDNDTLPVIIASNLSPTNEYSLLQILEKHKKAI
                                                      IVEPPTLEQKPLPSH KY YLGDN+TLPVIIASNLSPTNEYSLLQILEKHKKAI
Subjt:  VMFDEFYDLLVTEIEEELDKMAERPEDVTNPIEKIQKEECKSLLPSIVEPPTLEQKPLPSHPKYVYLGDNDTLPVIIASNLSPTNEYSLLQILEKHKKAI

Query:  GWTIAISEG
        GWTIA   G
Subjt:  GWTIAISEG

XP_022159345.1 uncharacterized protein LOC111025764 [Momordica charantia], e value 1.3e-90, identity 93.01%
Query:  GHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVQNNSSNLENMMKEYMARTDAVIQSQAAS
        GHGNNRNFNPYSNTYNPGWRHHPNFSW GQGGS GFNQGQSQQNKQPYVPPTQQYIPPPQQ+YNQRTQTPPVQNN+SNLENMMKEYMARTDAVIQSQAAS
Subjt:  GHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVQNNSSNLENMMKEYMARTDAVIQSQAAS

Query:  MRNFETQLGQLANELKNRPQGSFPGQTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPTVKIPENSTTPEKENTRKGD
        MRNFETQLGQLANELKNRPQGSFP  TELPKREGKEQCKAVTLRSGLAYD PTMPT DVQIPST PTVKIPEN TTPEK N RKG+
Subjt:  MRNFETQLGQLANELKNRPQGSFPGQTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPTVKIPENSTTPEKENTRKGD

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CR45 uncharacterized protein LOC111013464, e value 4.2e-42, identity 94.68%
Query:  HGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVQNNSSNLENMMKEYMARTDAVIQ
        HGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPP QQQYNQRT+TP VQNN+SNLENMMKEYMARTD VIQ
Subjt:  HGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVQNNSSNLENMMKEYMARTDAVIQ

A0A6J1DAE9 uncharacterized protein LOC111018514, e value 2.6e-52, identity 68.86%
Query:  GHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPT---QQYIPPPQQQYNQRTQTP--PVQNNSSNLENMMKEYMARTDAV--
        GHG NR FNPYSNTYNPG RHHPNFSWGGQG SSG  QGQ+QQ KQPYVP T    Q +PPP QQYNQ  +TP  P  NN+++LENM KEYMAR DA+  
Subjt:  GHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPT---QQYIPPPQQQYNQRTQTP--PVQNNSSNLENMMKEYMARTDAV--

Query:  -----IQSQAASMRNFETQLGQLANELKNRPQGSFPGQTELPKREGKEQCKAVTLRSGLAYDGPTMP
             IQ+QAA MRN E Q+GQ AN+LK RPQGSFPG TE+ KR+G EQCKAVTLRSGL+Y+GP MP
Subjt:  -----IQSQAASMRNFETQLGQLANELKNRPQGSFPGQTELPKREGKEQCKAVTLRSGLAYDGPTMP

A0A6J1DW02 uncharacterized protein LOC111024897, e value 1.4e-149, identity 58.79%
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT
        MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLE QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFN+IQMADNRDVAMREYAAT
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT

Query:  AFHNFDSGIVNPIPAHANFELRPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPG---------------------------------------
        AF NFDSGIVNPIPAH NFEL+PMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPG                                       
Subjt:  AFHNFDSGIVNPIPAHANFELRPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPG---------------------------------------

Query:  ----------------------------------------------HG----------------------NN--------RNF-----------------
                                                      HG                      NN        + F                 
Subjt:  ----------------------------------------------HG----------------------NN--------RNF-----------------

Query:  ---------------------------------------------NPYSNTYNP------------------GWRHHPNFSWGGQGGSSGFNQGQSQQNK
                                                     NP +    P                   WRHHPNFSWGGQGGSSGFNQGQSQQNK
Subjt:  ---------------------------------------------NPYSNTYNP------------------GWRHHPNFSWGGQGGSSGFNQGQSQQNK

Query:  QPYVPPTQQYIPPPQQQYNQRTQTPPVQNNSSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGQTELPKREGKEQCKAVTLRS
        QPYVPPTQQ+IPPPQQQYNQRTQTPP+QNN+SNLENMMKEYMARTDAVIQSQAASMRNF TQLG LANELKNRPQGSFPG TELP+REGKEQCKAVTLRS
Subjt:  QPYVPPTQQYIPPPQQQYNQRTQTPPVQNNSSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGQTELPKREGKEQCKAVTLRS

Query:  GLAYDGPTMPTTDVQIPSTEPTVKIPENSTTPEKENTRKGDFEECS
        GL YDGPTMPTTDVQIPST+PTVKIPEN TTPEKEN RKG+ +  S
Subjt:  GLAYDGPTMPTTDVQIPSTEPTVKIPENSTTPEKENTRKGDFEECS

A0A6J1DYG0 uncharacterized protein LOC111025764, e value 6.4e-91, identity 93.01%
Query:  GHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVQNNSSNLENMMKEYMARTDAVIQSQAAS
        GHGNNRNFNPYSNTYNPGWRHHPNFSW GQGGS GFNQGQSQQNKQPYVPPTQQYIPPPQQ+YNQRTQTPPVQNN+SNLENMMKEYMARTDAVIQSQAAS
Subjt:  GHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVQNNSSNLENMMKEYMARTDAVIQSQAAS

Query:  MRNFETQLGQLANELKNRPQGSFPGQTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPTVKIPENSTTPEKENTRKGD
        MRNFETQLGQLANELKNRPQGSFP  TELPKREGKEQCKAVTLRSGLAYD PTMPT DVQIPST PTVKIPEN TTPEK N RKG+
Subjt:  MRNFETQLGQLANELKNRPQGSFPGQTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPTVKIPENSTTPEKENTRKGD

A0A6J1E110 uncharacterized protein LOC111025424, e value 8.9e-101, identity 67.64%
Query:  LPGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVQNNSSNLENMMKEYMARTDAVIQSQA
        LP  GNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQ YVP TQQY PPPQQ YNQR QTPPVQNN+SNLEN MKEYMARTDAVIQSQA
Subjt:  LPGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVQNNSSNLENMMKEYMARTDAVIQSQA

Query:  ASMRNFETQLGQLANELKNRPQGSFPGQTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPTVKIPENSTTPEKENTRKGDFEECSAITNLNP
        ASMRNFETQLG LAN LKNRPQGSF G TELPK EGKE CKAVTLRSGL YD PTMPTTDVQI STEPT                               
Subjt:  ASMRNFETQLGQLANELKNRPQGSFPGQTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPTVKIPENSTTPEKENTRKGDFEECSAITNLNP

Query:  VMFDEFYDLLVTEIEEELDKMAERPEDVTNPIEKIQKEECKSLLPSIVEPPTLEQKPLPSHPKYVYLGDNDTLPVIIASNLSPTNEYSLLQILEKHKKAI
                                                      IVEPPTLEQKPLPSH KY YLGDNDTLPVIIASNLSPTNEYSLLQILEKHKKAI
Subjt:  VMFDEFYDLLVTEIEEELDKMAERPEDVTNPIEKIQKEECKSLLPSIVEPPTLEQKPLPSHPKYVYLGDNDTLPVIIASNLSPTNEYSLLQILEKHKKAI

Query:  GWTIAISEG
        GWTIA   G
Subjt:  GWTIAISEG

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGCATCAGTTGGAACCACCAAAACGCATTGGAAGACGCGTTTCATTAAGGGCAAAGTTGGAAGAAAACCTGTTTTTCTGCAGGCATCCCAGGCGCCTGGCGCCT
CCCAGGGGTAAGTTAAAGTGCATGAGCACAAGATCTTTTCTTCTACCCCTTGACCCTGAGATTGAGCGGACCCTTCGAAAAACTCGAAAGGAGCAAAGACTTCGA
AAACAACTAGAAAAGCAAAAAGAGAGAGAAGGTGAAATCAGTCCTGAAAGTGAGGTAGAGAGTACGAGCACATCAATGGCAGATATTCCACCTCGTGATCCGGTT
GATCCACCTGCTGTTAACGGTAATATGAGGGATCATGCGAGAAATGATGAATTCAACCATATTCAGATGGCGGACAACAGAGACGTGGCAATGCGAGAATATGCC
GCCACGGCTTTTCATAACTTTGATTCAGGGATAGTCAACCCTATTCCAGCCCACGCAAACTTTGAGCTTAGACCAATGATGTTCCAAATGTTGCAGACAATTGGA
CATTTTGGAGGGCAGGAACATGAGGATCCACATGATCATCTAAAATCATTCATTCAAATTGCAAATGCATTTCGATTACCTGGACATGGGAACAATAGGAACTTT
AACCCATATTCGAACACCTACAACCCAGGTTGGAGGCACCACCCTAATTTCTCATGGGGAGGTCAAGGAGGTTCGAGCGGTTTTAATCAAGGGCAGAGCCAGCAG
AACAAGCAGCCCTATGTTCCACCTACACAGCAATACATCCCACCGCCGCAACAGCAGTACAATCAGAGAACACAGACTCCACCAGTTCAAAATAACAGCTCAAAT
CTTGAGAATATGATGAAGGAGTACATGGCTCGAACCGACGCAGTGATACAATCCCAAGCGGCATCAATGAGGAATTTCGAGACCCAATTGGGACAACTCGCCAAT
GAATTGAAGAATAGACCACAAGGTTCTTTTCCAGGCCAAACTGAATTACCAAAACGAGAAGGGAAAGAACAGTGCAAAGCTGTCACCCTTAGGAGTGGACTGGCA
TATGATGGACCAACAATGCCAACAACAGATGTACAGATTCCATCCACTGAACCAACTGTAAAGATACCAGAAAATTCAACAACACCAGAAAAAGAAAATACTAGA
AAAGGAGATTTTGAAGAATGCTCTGCTATAACTAACTTGAATCCTGTTATGTTTGATGAGTTTTATGACTTGTTAGTTACAGAGATTGAAGAAGAGCTTGATAAG
ATGGCAGAAAGACCAGAAGATGTGACTAATCCTATTGAAAAAATACAAAAAGAAGAATGCAAGTCGTTACTTCCGTCCATAGTGGAACCACCCACTTTGGAGCAG
AAGCCATTGCCGTCGCATCCGAAATATGTGTATCTAGGGGATAACGACACTTTACCAGTTATTATAGCTTCCAATTTATCACCTACTAATGAATATTCTTTATTG
CAGATTTTGGAGAAGCACAAAAAGGCCATTGGATGGACGATAGCGATATCCGAGGGATAA
mRNA sequence
ATGCATCAGTTGGAACCACCAAAACGCATTGGAAGACGCGTTTCATTAAGGGCAAAGTTGGAAGAAAACCTGTTTTTCTGCAGGCATCCCAGGCGCCTGGCGCCT
CCCAGGGGTAAGTTAAAGTGCATGAGCACAAGATCTTTTCTTCTACCCCTTGACCCTGAGATTGAGCGGACCCTTCGAAAAACTCGAAAGGAGCAAAGACTTCGA
AAACAACTAGAAAAGCAAAAAGAGAGAGAAGGTGAAATCAGTCCTGAAAGTGAGGTAGAGAGTACGAGCACATCAATGGCAGATATTCCACCTCGTGATCCGGTT
GATCCACCTGCTGTTAACGGTAATATGAGGGATCATGCGAGAAATGATGAATTCAACCATATTCAGATGGCGGACAACAGAGACGTGGCAATGCGAGAATATGCC
GCCACGGCTTTTCATAACTTTGATTCAGGGATAGTCAACCCTATTCCAGCCCACGCAAACTTTGAGCTTAGACCAATGATGTTCCAAATGTTGCAGACAATTGGA
CATTTTGGAGGGCAGGAACATGAGGATCCACATGATCATCTAAAATCATTCATTCAAATTGCAAATGCATTTCGATTACCTGGACATGGGAACAATAGGAACTTT
AACCCATATTCGAACACCTACAACCCAGGTTGGAGGCACCACCCTAATTTCTCATGGGGAGGTCAAGGAGGTTCGAGCGGTTTTAATCAAGGGCAGAGCCAGCAG
AACAAGCAGCCCTATGTTCCACCTACACAGCAATACATCCCACCGCCGCAACAGCAGTACAATCAGAGAACACAGACTCCACCAGTTCAAAATAACAGCTCAAAT
CTTGAGAATATGATGAAGGAGTACATGGCTCGAACCGACGCAGTGATACAATCCCAAGCGGCATCAATGAGGAATTTCGAGACCCAATTGGGACAACTCGCCAAT
GAATTGAAGAATAGACCACAAGGTTCTTTTCCAGGCCAAACTGAATTACCAAAACGAGAAGGGAAAGAACAGTGCAAAGCTGTCACCCTTAGGAGTGGACTGGCA
TATGATGGACCAACAATGCCAACAACAGATGTACAGATTCCATCCACTGAACCAACTGTAAAGATACCAGAAAATTCAACAACACCAGAAAAAGAAAATACTAGA
AAAGGAGATTTTGAAGAATGCTCTGCTATAACTAACTTGAATCCTGTTATGTTTGATGAGTTTTATGACTTGTTAGTTACAGAGATTGAAGAAGAGCTTGATAAG
ATGGCAGAAAGACCAGAAGATGTGACTAATCCTATTGAAAAAATACAAAAAGAAGAATGCAAGTCGTTACTTCCGTCCATAGTGGAACCACCCACTTTGGAGCAG
AAGCCATTGCCGTCGCATCCGAAATATGTGTATCTAGGGGATAACGACACTTTACCAGTTATTATAGCTTCCAATTTATCACCTACTAATGAATATTCTTTATTG
CAGATTTTGGAGAAGCACAAAAAGGCCATTGGATGGACGATAGCGATATCCGAGGGATAA
Protein sequence
MHQLEPPKRIGRRVSLRAKLEENLFFCRHPRRLAPPRGKLKCMSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPV
DPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAATAFHNFDSGIVNPIPAHANFELRPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGHGNNRNF
NPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVQNNSSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLAN
ELKNRPQGSFPGQTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPTVKIPENSTTPEKENTRKGDFEECSAITNLNPVMFDEFYDLLVTEIEEELDK
MAERPEDVTNPIEKIQKEECKSLLPSIVEPPTLEQKPLPSHPKYVYLGDNDTLPVIIASNLSPTNEYSLLQILEKHKKAIGWTIAISEG