; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g17710 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g17710
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionDNA-directed DNA polymerase
Genome locationchr6:13868218..13871695
RNA-Seq ExpressionMoc06g17710
SyntenyMoc06g17710
Gene Ontology termsNA
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142953.1 uncharacterized protein LOC111012947 [Momordica charantia]6.8e-8559.93Show/hide
Query:  GGTDVATPVPASTFTPQLEEKAETVSLEEKGKKANKGKQVVSCSTPQVGNIKIPPSFPQRLVKKNQDSHFKKFFEILKQLHINIPLVNALEQMPNYTKFL
        G    +T  P    +P    + +T  + +K  +      V     PQV N + PP FPQRLV+KNQD++F+KF +ILKQLHINIP V ALEQMP Y KFL
Subjt:  GGTDVATPVPASTFTPQLEEKAETVSLEEKGKKANKGKQVVSCSTPQVGNIKIPPSFPQRLVKKNQDSHFKKFFEILKQLHINIPLVNALEQMPNYTKFL

Query:  KDIISRHKKLGEHETVALTNCGSDALGNPLTVNCKDPGSFTIPYSIGGKNLRRALCALGASINLMPLYVFKELNIGEAYPTTVTLQLADMSIKKLEGKIE
        KDII+R KKLGE+ETVALT C S+   +      KDPGSFTI   IGGK++ RALC LGA INLMPL +FK+L IG+A PTTVTL LAD SI K EGKIE
Subjt:  KDIISRHKKLGEHETVALTNCGSDALGNPLTVNCKDPGSFTIPYSIGGKNLRRALCALGASINLMPLYVFKELNIGEAYPTTVTLQLADMSIKKLEGKIE

Query:  DVLVKVDKFIFPTDFIILDCEADLKVPIIFGRPFLATGDTVFIVRKGKITMKVNDEQVTFNVLNAMRLPDEVEECSTIGVVK
        DVLVKVDKFIFP DFIILDCEAD  VPII GRPFLATG+T+  V+KG++TM+V+D++VTFN+L+AM+ PD+ EEC  I + K
Subjt:  DVLVKVDKFIFPTDFIILDCEADLKVPIIFGRPFLATGDTVFIVRKGKITMKVNDEQVTFNVLNAMRLPDEVEECSTIGVVK

XP_022155996.1 uncharacterized protein LOC111022971 [Momordica charantia]5.1e-8864.36Show/hide
Query:  DVATPVPASTFTPQLEEKAETVSLEEKGKKANKGKQVVSCSTPQVGNIKIPPSFPQRLVKKNQDSHFKKFFEILKQLHINIPLVNALEQMPNYTKFLKDI
        +V  P  +   +P   EK +T S E +   A         S+ +  +  + P FPQ L KKNQ++ F+KF +ILKQLHINIPL++ALEQMPNY KFLKDI
Subjt:  DVATPVPASTFTPQLEEKAETVSLEEKGKKANKGKQVVSCSTPQVGNIKIPPSFPQRLVKKNQDSHFKKFFEILKQLHINIPLVNALEQMPNYTKFLKDI

Query:  ISRHKKLGEHETVALTNCGSDALGNPLTVNCKDPGSFTIPYSIGGKNLRRALCALGASINLMPLYVFKELNIGEAYPTTVTLQLADMSIKKLEGKIEDVL
        +SR KKLGEHE VA+T C S+A+G+PL + CKDP SFTIP SIGGKNL RALC LGASINLMPL VFKEL IGEA PTTVTLQLAD SIKK EGKI+D  
Subjt:  ISRHKKLGEHETVALTNCGSDALGNPLTVNCKDPGSFTIPYSIGGKNLRRALCALGASINLMPLYVFKELNIGEAYPTTVTLQLADMSIKKLEGKIEDVL

Query:  VKVDKFIFPTDFIILDCEADLKVPIIFGRPFLATGDTVFIVRKGKITMKVNDEQVTFNVLNAMRLPDEVEECSTI
          VDKFIFP DFIILDC+ADL VPII GRPFLATGDT+F VRKG+ITMKVN+E+V FNVL+AM+LP ++EE + +
Subjt:  VKVDKFIFPTDFIILDCEADLKVPIIFGRPFLATGDTVFIVRKGKITMKVNDEQVTFNVLNAMRLPDEVEECSTI

XP_022157244.1 uncharacterized protein LOC111024002 [Momordica charantia]1.1e-7780.73Show/hide
Query:  MPNYTKFLKDIISRHKKLGEHETVALTNCGSDALGNPLTVNCKDPGSFTIPYSIGGKNLRRALCALGASINLMPLYVFKELNIGEAYPTTVTLQLADMSI
        M NYTKFLKDIISRHKKLGEHETV LT C SDAL NPL V CKDPGSFTIP S+G KNL RALC LGA INLM L VFKELNIGEA PTTVTLQLAD SI
Subjt:  MPNYTKFLKDIISRHKKLGEHETVALTNCGSDALGNPLTVNCKDPGSFTIPYSIGGKNLRRALCALGASINLMPLYVFKELNIGEAYPTTVTLQLADMSI

Query:  KKLEGKIEDVLVKVDKFIFPTDFIILDCEADLKVPIIFGRPFLATGDTVFIVRKGKITMKVNDEQVTFNVLNAMRLPDEVEECSTIGVVKRN
        KK EGKIED+LVKVD+ IFP DFIILDCEADL+V +I  R FLAT D VF VRKG+ITMKVNDEQVTFNVL+AM LPDEVEECSTIG + RN
Subjt:  KKLEGKIEDVLVKVDKFIFPTDFIILDCEADLKVPIIFGRPFLATGDTVFIVRKGKITMKVNDEQVTFNVLNAMRLPDEVEECSTIGVVKRN

XP_022157810.1 uncharacterized protein LOC111024426 [Momordica charantia]3.0e-8065.4Show/hide
Query:  PQVGNIKIPPSFPQRLVKKNQDSHFKKFFEILKQLHINIPLVNALEQMPNYTKFLKDIISRHKKLGEHETVALTNCGSDALGNPLTVNCKDPGSFTIPYS
        PQV N + PP FPQRLV+KNQD++F+KF +ILKQLHIN+P V ALEQMP Y KFLKDII+R KKLGE+ETVAL    S+   + +    KDPGSFTIP  
Subjt:  PQVGNIKIPPSFPQRLVKKNQDSHFKKFFEILKQLHINIPLVNALEQMPNYTKFLKDIISRHKKLGEHETVALTNCGSDALGNPLTVNCKDPGSFTIPYS

Query:  IGGKNLRRALCALGASINLMPLYVFKELNIGEAYPTTVTLQLADMSIKKLEGKIEDVLVKVDKFIFPTDFIILDCEADLKVPIIFGRPFLATGDTVFIVR
        IGGK++ RALC LGASINLMPL +FK+L IG+A PTTV LQLAD SI K E KIED+LVKVDKFIFP DFIILD EAD  VPII GRPFLA G+T+  V+
Subjt:  IGGKNLRRALCALGASINLMPLYVFKELNIGEAYPTTVTLQLADMSIKKLEGKIEDVLVKVDKFIFPTDFIILDCEADLKVPIIFGRPFLATGDTVFIVR

Query:  KGKITMKVNDEQVTFNVLNAMRLPDEVEECSTIGVVK
        KG++ M+V+D++VTFN+L+AM+  D++EEC+ I + K
Subjt:  KGKITMKVNDEQVTFNVLNAMRLPDEVEECSTIGVVK

XP_022159235.1 uncharacterized protein LOC111025653 [Momordica charantia]1.2e-8662.36Show/hide
Query:  PQLEEKAETVSLEEKGKKANKGKQVVSCST----PQVGNIKIPPSFPQRLVKKNQDSHFKKFFEILKQLHINIPLVNALEQMPNYTKFLKDIISRHKKLG
        P++ +++      EK  +A   K V    +    PQV N + PP FPQRLV+KNQD++F+KF +ILKQLHINIP V ALEQMP Y KF+KDII+R KKLG
Subjt:  PQLEEKAETVSLEEKGKKANKGKQVVSCST----PQVGNIKIPPSFPQRLVKKNQDSHFKKFFEILKQLHINIPLVNALEQMPNYTKFLKDIISRHKKLG

Query:  EHETVALTNCGSDALGNPLTVNCKDPGSFTIPYSIGGKNLRRALCALGASINLMPLYVFKELNIGEAYPTTVTLQLADMSIKKLEGKIEDVLVKVDKFIF
        E+ETVALT C S+   + +    KDPGSFTIP  IGGK++ RALC LGASINLMPL +FK+  IG+A PTTVTLQLAD SI K EGKIEDVLVKVDKFIF
Subjt:  EHETVALTNCGSDALGNPLTVNCKDPGSFTIPYSIGGKNLRRALCALGASINLMPLYVFKELNIGEAYPTTVTLQLADMSIKKLEGKIEDVLVKVDKFIF

Query:  PTDFIILDCEADLKVPIIFGRPFLATGDTVFIVRKGKITMKVNDEQVTFNVLNAMRLPDEVEECSTIGVVK
        PTDFIILDCEAD  VPII GRPFLATG+T+  V+KG++TM+V+D++VTFN+L+AM+  D++EEC+ I + K
Subjt:  PTDFIILDCEADLKVPIIFGRPFLATGDTVFIVRKGKITMKVNDEQVTFNVLNAMRLPDEVEECSTIGVVK

TrEMBL top hitse value%identityAlignment
A0A6J1CPJ3 uncharacterized protein LOC1110129477.4e-8559.93Show/hide
Query:  GGTDVATPVPASTFTPQLEEKAETVSLEEKGKKANKGKQVVSCSTPQVGNIKIPPSFPQRLVKKNQDSHFKKFFEILKQLHINIPLVNALEQMPNYTKFL
        G    +T  P    +P    + +T  + +K  +      V     PQV N + PP FPQRLV+KNQD++F+KF +ILKQLHINIP V ALEQMP Y KFL
Subjt:  GGTDVATPVPASTFTPQLEEKAETVSLEEKGKKANKGKQVVSCSTPQVGNIKIPPSFPQRLVKKNQDSHFKKFFEILKQLHINIPLVNALEQMPNYTKFL

Query:  KDIISRHKKLGEHETVALTNCGSDALGNPLTVNCKDPGSFTIPYSIGGKNLRRALCALGASINLMPLYVFKELNIGEAYPTTVTLQLADMSIKKLEGKIE
        KDII+R KKLGE+ETVALT C S+   +      KDPGSFTI   IGGK++ RALC LGA INLMPL +FK+L IG+A PTTVTL LAD SI K EGKIE
Subjt:  KDIISRHKKLGEHETVALTNCGSDALGNPLTVNCKDPGSFTIPYSIGGKNLRRALCALGASINLMPLYVFKELNIGEAYPTTVTLQLADMSIKKLEGKIE

Query:  DVLVKVDKFIFPTDFIILDCEADLKVPIIFGRPFLATGDTVFIVRKGKITMKVNDEQVTFNVLNAMRLPDEVEECSTIGVVK
        DVLVKVDKFIFP DFIILDCEAD  VPII GRPFLATG+T+  V+KG++TM+V+D++VTFN+L+AM+ PD+ EEC  I + K
Subjt:  DVLVKVDKFIFPTDFIILDCEADLKVPIIFGRPFLATGDTVFIVRKGKITMKVNDEQVTFNVLNAMRLPDEVEECSTIGVVK

A0A6J1DTH7 uncharacterized protein LOC1110229712.4e-8864.36Show/hide
Query:  DVATPVPASTFTPQLEEKAETVSLEEKGKKANKGKQVVSCSTPQVGNIKIPPSFPQRLVKKNQDSHFKKFFEILKQLHINIPLVNALEQMPNYTKFLKDI
        +V  P  +   +P   EK +T S E +   A         S+ +  +  + P FPQ L KKNQ++ F+KF +ILKQLHINIPL++ALEQMPNY KFLKDI
Subjt:  DVATPVPASTFTPQLEEKAETVSLEEKGKKANKGKQVVSCSTPQVGNIKIPPSFPQRLVKKNQDSHFKKFFEILKQLHINIPLVNALEQMPNYTKFLKDI

Query:  ISRHKKLGEHETVALTNCGSDALGNPLTVNCKDPGSFTIPYSIGGKNLRRALCALGASINLMPLYVFKELNIGEAYPTTVTLQLADMSIKKLEGKIEDVL
        +SR KKLGEHE VA+T C S+A+G+PL + CKDP SFTIP SIGGKNL RALC LGASINLMPL VFKEL IGEA PTTVTLQLAD SIKK EGKI+D  
Subjt:  ISRHKKLGEHETVALTNCGSDALGNPLTVNCKDPGSFTIPYSIGGKNLRRALCALGASINLMPLYVFKELNIGEAYPTTVTLQLADMSIKKLEGKIEDVL

Query:  VKVDKFIFPTDFIILDCEADLKVPIIFGRPFLATGDTVFIVRKGKITMKVNDEQVTFNVLNAMRLPDEVEECSTI
          VDKFIFP DFIILDC+ADL VPII GRPFLATGDT+F VRKG+ITMKVN+E+V FNVL+AM+LP ++EE + +
Subjt:  VKVDKFIFPTDFIILDCEADLKVPIIFGRPFLATGDTVFIVRKGKITMKVNDEQVTFNVLNAMRLPDEVEECSTI

A0A6J1DU40 uncharacterized protein LOC1110244261.4e-8065.4Show/hide
Query:  PQVGNIKIPPSFPQRLVKKNQDSHFKKFFEILKQLHINIPLVNALEQMPNYTKFLKDIISRHKKLGEHETVALTNCGSDALGNPLTVNCKDPGSFTIPYS
        PQV N + PP FPQRLV+KNQD++F+KF +ILKQLHIN+P V ALEQMP Y KFLKDII+R KKLGE+ETVAL    S+   + +    KDPGSFTIP  
Subjt:  PQVGNIKIPPSFPQRLVKKNQDSHFKKFFEILKQLHINIPLVNALEQMPNYTKFLKDIISRHKKLGEHETVALTNCGSDALGNPLTVNCKDPGSFTIPYS

Query:  IGGKNLRRALCALGASINLMPLYVFKELNIGEAYPTTVTLQLADMSIKKLEGKIEDVLVKVDKFIFPTDFIILDCEADLKVPIIFGRPFLATGDTVFIVR
        IGGK++ RALC LGASINLMPL +FK+L IG+A PTTV LQLAD SI K E KIED+LVKVDKFIFP DFIILD EAD  VPII GRPFLA G+T+  V+
Subjt:  IGGKNLRRALCALGASINLMPLYVFKELNIGEAYPTTVTLQLADMSIKKLEGKIEDVLVKVDKFIFPTDFIILDCEADLKVPIIFGRPFLATGDTVFIVR

Query:  KGKITMKVNDEQVTFNVLNAMRLPDEVEECSTIGVVK
        KG++ M+V+D++VTFN+L+AM+  D++EEC+ I + K
Subjt:  KGKITMKVNDEQVTFNVLNAMRLPDEVEECSTIGVVK

A0A6J1DVY2 uncharacterized protein LOC1110240025.1e-7880.73Show/hide
Query:  MPNYTKFLKDIISRHKKLGEHETVALTNCGSDALGNPLTVNCKDPGSFTIPYSIGGKNLRRALCALGASINLMPLYVFKELNIGEAYPTTVTLQLADMSI
        M NYTKFLKDIISRHKKLGEHETV LT C SDAL NPL V CKDPGSFTIP S+G KNL RALC LGA INLM L VFKELNIGEA PTTVTLQLAD SI
Subjt:  MPNYTKFLKDIISRHKKLGEHETVALTNCGSDALGNPLTVNCKDPGSFTIPYSIGGKNLRRALCALGASINLMPLYVFKELNIGEAYPTTVTLQLADMSI

Query:  KKLEGKIEDVLVKVDKFIFPTDFIILDCEADLKVPIIFGRPFLATGDTVFIVRKGKITMKVNDEQVTFNVLNAMRLPDEVEECSTIGVVKRN
        KK EGKIED+LVKVD+ IFP DFIILDCEADL+V +I  R FLAT D VF VRKG+ITMKVNDEQVTFNVL+AM LPDEVEECSTIG + RN
Subjt:  KKLEGKIEDVLVKVDKFIFPTDFIILDCEADLKVPIIFGRPFLATGDTVFIVRKGKITMKVNDEQVTFNVLNAMRLPDEVEECSTIGVVKRN

A0A6J1DY39 uncharacterized protein LOC1110256536.0e-8762.36Show/hide
Query:  PQLEEKAETVSLEEKGKKANKGKQVVSCST----PQVGNIKIPPSFPQRLVKKNQDSHFKKFFEILKQLHINIPLVNALEQMPNYTKFLKDIISRHKKLG
        P++ +++      EK  +A   K V    +    PQV N + PP FPQRLV+KNQD++F+KF +ILKQLHINIP V ALEQMP Y KF+KDII+R KKLG
Subjt:  PQLEEKAETVSLEEKGKKANKGKQVVSCST----PQVGNIKIPPSFPQRLVKKNQDSHFKKFFEILKQLHINIPLVNALEQMPNYTKFLKDIISRHKKLG

Query:  EHETVALTNCGSDALGNPLTVNCKDPGSFTIPYSIGGKNLRRALCALGASINLMPLYVFKELNIGEAYPTTVTLQLADMSIKKLEGKIEDVLVKVDKFIF
        E+ETVALT C S+   + +    KDPGSFTIP  IGGK++ RALC LGASINLMPL +FK+  IG+A PTTVTLQLAD SI K EGKIEDVLVKVDKFIF
Subjt:  EHETVALTNCGSDALGNPLTVNCKDPGSFTIPYSIGGKNLRRALCALGASINLMPLYVFKELNIGEAYPTTVTLQLADMSIKKLEGKIEDVLVKVDKFIF

Query:  PTDFIILDCEADLKVPIIFGRPFLATGDTVFIVRKGKITMKVNDEQVTFNVLNAMRLPDEVEECSTIGVVK
        PTDFIILDCEAD  VPII GRPFLATG+T+  V+KG++TM+V+D++VTFN+L+AM+  D++EEC+ I + K
Subjt:  PTDFIILDCEADLKVPIIFGRPFLATGDTVFIVRKGKITMKVNDEQVTFNVLNAMRLPDEVEECSTIGVVK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGGACCTAGATGGAGGAACCGATGTGGCTACACCTGTTCCTGCATCCACCTTCACTCCACAACTAGAGGAGAAAGCAGAAACCGTAAGTTTAGAAGAGAAA
GGTAAAAAGGCGAACAAAGGTAAGCAAGTAGTGTCTTGCAGTACTCCGCAGGTAGGTAATATTAAGATCCCTCCATCTTTCCCCCAAAGATTAGTTAAGAAGAAT
CAGGATAGTCATTTTAAGAAGTTCTTTGAGATTCTAAAACAGTTGCATATAAATATACCTCTCGTAAATGCCTTAGAGCAAATGCCTAACTACACCAAGTTTTTG
AAAGATATTATTTCTAGGCATAAAAAGTTAGGAGAGCATGAGACGGTAGCCTTAACAAATTGCGGTAGTGATGCTTTAGGGAATCCATTGACTGTTAACTGTAAG
GACCCAGGTAGTTTTACTATCCCTTACTCTATAGGTGGTAAGAATCTAAGAAGAGCATTATGTGCCTTAGGGGCAAGCATTAATCTTATGCCTCTTTATGTCTTT
AAAGAATTGAATATAGGAGAAGCTTATCCTACTACTGTCACTTTACAACTAGCTGATATGTCCATAAAGAAACTAGAAGGGAAAATAGAAGATGTGCTTGTTAAA
GTCGATAAGTTTATTTTTCCCACCGATTTCATAATTTTGGATTGTGAAGCAGATCTTAAGGTGCCAATCATTTTCGGGAGGCCGTTTTTAGCAACTGGAGACACG
GTATTCATTGTTCGAAAAGGAAAGATCACTATGAAGGTCAATGATGAGCAAGTCACCTTCAACGTCCTTAATGCAATGCGGCTCCCAGATGAAGTCGAGGAATGC
TCTACAATAGGGGTTGTCAAGCGGAATTTGGAACAGTGTACGATTCTGTTTAAAGGATTTGTGAGTGGTCACAGGAGATGCAGGGGAATGCCGGGGCGATTAGGG
CTTGATCTGGATTCCGAATCCTGGGAGAGAGTAGTTAAGTTTCTGCCACCAAGAGTAGTTCAAGATTTGCATAACCTTCTGTATGTAAATTTCGTTGGTCTGGCC
AATCATGAAGGACTCCAGCAAATCATCGTGGAAGGATTAGAAGCAGATTTGGAGGCTGCAGAAAAAGAAGCGAAAATTGCGCCTGACACAATTTTGCCACAATAT
GACCATTTTGAGATTTTTCAGCCAACAATAGCTTATTTGAAAGCCTTGCAACCTTCCATCATTGAACCTCCAGAATTGGAGAAGAAACCCCTACCCTTCATTTAA
mRNA sequenceShow/hide mRNA sequence
ATGATGGACCTAGATGGAGGAACCGATGTGGCTACACCTGTTCCTGCATCCACCTTCACTCCACAACTAGAGGAGAAAGCAGAAACCGTAAGTTTAGAAGAGAAA
GGTAAAAAGGCGAACAAAGGTAAGCAAGTAGTGTCTTGCAGTACTCCGCAGGTAGGTAATATTAAGATCCCTCCATCTTTCCCCCAAAGATTAGTTAAGAAGAAT
CAGGATAGTCATTTTAAGAAGTTCTTTGAGATTCTAAAACAGTTGCATATAAATATACCTCTCGTAAATGCCTTAGAGCAAATGCCTAACTACACCAAGTTTTTG
AAAGATATTATTTCTAGGCATAAAAAGTTAGGAGAGCATGAGACGGTAGCCTTAACAAATTGCGGTAGTGATGCTTTAGGGAATCCATTGACTGTTAACTGTAAG
GACCCAGGTAGTTTTACTATCCCTTACTCTATAGGTGGTAAGAATCTAAGAAGAGCATTATGTGCCTTAGGGGCAAGCATTAATCTTATGCCTCTTTATGTCTTT
AAAGAATTGAATATAGGAGAAGCTTATCCTACTACTGTCACTTTACAACTAGCTGATATGTCCATAAAGAAACTAGAAGGGAAAATAGAAGATGTGCTTGTTAAA
GTCGATAAGTTTATTTTTCCCACCGATTTCATAATTTTGGATTGTGAAGCAGATCTTAAGGTGCCAATCATTTTCGGGAGGCCGTTTTTAGCAACTGGAGACACG
GTATTCATTGTTCGAAAAGGAAAGATCACTATGAAGGTCAATGATGAGCAAGTCACCTTCAACGTCCTTAATGCAATGCGGCTCCCAGATGAAGTCGAGGAATGC
TCTACAATAGGGGTTGTCAAGCGGAATTTGGAACAGTGTACGATTCTGTTTAAAGGATTTGTGAGTGGTCACAGGAGATGCAGGGGAATGCCGGGGCGATTAGGG
CTTGATCTGGATTCCGAATCCTGGGAGAGAGTAGTTAAGTTTCTGCCACCAAGAGTAGTTCAAGATTTGCATAACCTTCTGTATGTAAATTTCGTTGGTCTGGCC
AATCATGAAGGACTCCAGCAAATCATCGTGGAAGGATTAGAAGCAGATTTGGAGGCTGCAGAAAAAGAAGCGAAAATTGCGCCTGACACAATTTTGCCACAATAT
GACCATTTTGAGATTTTTCAGCCAACAATAGCTTATTTGAAAGCCTTGCAACCTTCCATCATTGAACCTCCAGAATTGGAGAAGAAACCCCTACCCTTCATTTAA
Protein sequenceShow/hide protein sequence
MMDLDGGTDVATPVPASTFTPQLEEKAETVSLEEKGKKANKGKQVVSCSTPQVGNIKIPPSFPQRLVKKNQDSHFKKFFEILKQLHINIPLVNALEQMPNYTKFL
KDIISRHKKLGEHETVALTNCGSDALGNPLTVNCKDPGSFTIPYSIGGKNLRRALCALGASINLMPLYVFKELNIGEAYPTTVTLQLADMSIKKLEGKIEDVLVK
VDKFIFPTDFIILDCEADLKVPIIFGRPFLATGDTVFIVRKGKITMKVNDEQVTFNVLNAMRLPDEVEECSTIGVVKRNLEQCTILFKGFVSGHRRCRGMPGRLG
LDLDSESWERVVKFLPPRVVQDLHNLLYVNFVGLANHEGLQQIIVEGLEADLEAAEKEAKIAPDTILPQYDHFEIFQPTIAYLKALQPSIIEPPELEKKPLPFI