; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g01860 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g01860
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionDNA-directed DNA polymerase
Genome locationchr1:1245351..1251273
RNA-Seq ExpressionMoc01g01860
SyntenyMoc01g01860
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022141932.1 uncharacterized protein LOC111012188 [Momordica charantia]1.0e-2341.03Show/hide
Query:  INMESNDARVNKEGSSEKKLGGVNKVYLRKNQSLEEKGVVLDEGIARLQERAEMFSKNNKIRDKKNESVYAKIEELNIKWQEFMENSKKVSEEIQLELNS
        + +E + AR+N+   +EKKL   +KVYLRKNQ + + G  LDE I  + ER +  +K  +IRDK+NE + AKI ELN KWQ FMENS+++SEEIQ+EL+ 
Subjt:  INMESNDARVNKEGSSEKKLGGVNKVYLRKNQSLEEKGVVLDEGIARLQERAEMFSKNNKIRDKKNESVYAKIEELNIKWQEFMENSKKVSEEIQLELNS

Query:  MSIRRKEQVSGDSEHDTEPLEHSDSATVEIQCQIAPGAIMDETPLATLQEQERTSGPVDVPSEAMAESFSSFSQGKTPSLSSLNVSDPNFVATTETSDEE
        +       +  +  ++ E  E ++  +             DE   A +Q QE  S P DVP+EA  ES SS S+  T S SSLNV DPNFVA  E S+EE
Subjt:  MSIRRKEQVSGDSEHDTEPLEHSDSATVEIQCQIAPGAIMDETPLATLQEQERTSGPVDVPSEAMAESFSSFSQGKTPSLSSLNVSDPNFVATTETSDEE

Query:  DPRRCSTFGCQKEAEAGPSKKAKRARVQREAEEP
            C T    K+ +     K + A+    A EP
Subjt:  DPRRCSTFGCQKEAEAGPSKKAKRARVQREAEEP

XP_022154847.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111022007 [Momordica charantia]2.3e-10555.7Show/hide
Query:  VLDEGIARLQERAEMFSKNNKIRDKKNESVYAKIEELNIKWQEFMENSKKVSEEIQLELNSMSIRRKEQVSGDSEHDTEPLEHSDSATVEIQCQIAPGAI
        V   G   L+       K N  R KK    Y   E+LN+  ++  E+  ++ +E               V GD   D E  +      V+++  +     
Subjt:  VLDEGIARLQERAEMFSKNNKIRDKKNESVYAKIEELNIKWQEFMENSKKVSEEIQLELNSMSIRRKEQVSGDSEHDTEPLEHSDSATVEIQCQIAPGAI

Query:  MDETPLATLQEQERTSGPVDVPSEAMAESFSSFSQGKTPSLSSLNVSDPNFVATTETSDEEDPRRCSTFGCQKEAEAGPSKKAKRARVQREAEEPLEEAN
        +DE P    +EQE TSGPVDVPSEAM ES SS SQG         VS P     T T+        +    QKEAEAGPSKKAK ARVQR AEEPLEEAN
Subjt:  MDETPLATLQEQERTSGPVDVPSEAMAESFSSFSQGKTPSLSSLNVSDPNFVATTETSDEEDPRRCSTFGCQKEAEAGPSKKAKRARVQREAEEPLEEAN

Query:  EEEPDSTEQTTSRVKRVRLEVRRPNFTKRDILLERG-DEAQEPVPEYVKRRLVEN-----------------------------------GNEILVHPSD
        EEEPDSTEQT SRVKRVRLEVRRP FT RDILLERG DEAQEPVPEYV++R+VEN                                   GNEILVHPSD
Subjt:  EEEPDSTEQTTSRVKRVRLEVRRPNFTKRDILLERG-DEAQEPVPEYVKRRLVEN-----------------------------------GNEILVHPSD

Query:  EQVEEVHRLICRPHKTWTVSTTGKLSLKPLDINEQAKIWMYVVKNRLIPTSHDSSIKHNRATIVYILMKGVEFNFGELIRNEIRSCSKKMVGSLVFPGLI
        EQVEE  RLICRPHKTWT+ST GKLSLKPLDINEQA +WMYVVKNRLIPTS+DSSIK NRA IVYIL+KGVEFNFGELIRNEI+SCS+K+          
Subjt:  EQVEEVHRLICRPHKTWTVSTTGKLSLKPLDINEQAKIWMYVVKNRLIPTSHDSSIKHNRATIVYILMKGVEFNFGELIRNEIRSCSKKMVGSLVFPGLI

Query:  TELCLQVGVAADDANVVMPKKPFTFLRRVWGYSIVREEDSPITVADPETRGVVTRE
               GV A DANVVMPKKPF  LR+V GYSIVREEDSPIT ADPETRGVVTRE
Subjt:  TELCLQVGVAADDANVVMPKKPFTFLRRVWGYSIVREEDSPITVADPETRGVVTRE

XP_022156786.1 uncharacterized protein LOC111023620 [Momordica charantia]4.8e-3454.89Show/hide
Query:  LQEQERTSGPVDVPSEAMAESFSSFSQGKTPSLSSLNVSDPNFVATTETSDEE-----------------------DPRRC-----STFGCQKEAEAGPS
        +QEQE  SG VDVP+EA+ ES SS S+GK+PSLSSLNVSDPNFVA   TS+E+                       + R C     ++   QKEAEAGP 
Subjt:  LQEQERTSGPVDVPSEAMAESFSSFSQGKTPSLSSLNVSDPNFVATTETSDEE-----------------------DPRRC-----STFGCQKEAEAGPS

Query:  KKAKRARVQREAEEPLEEANEEEPDSTEQTTSRVKRVRLEVRRPNFTKRDILLERG-DEAQEPVPEYVKRRLVENGNEILVHPS
        KKAKR +  R +EEPL+E N+EE DS EQT S+ KRVR EV+R NFT R+IL+E+G DEAQEPVP+Y+KRRL+ENG E L  P+
Subjt:  KKAKRARVQREAEEPLEEANEEEPDSTEQTTSRVKRVRLEVRRPNFTKRDILLERG-DEAQEPVPEYVKRRLVENGNEILVHPS

XP_022156935.1 uncharacterized protein LOC111023761 [Momordica charantia]7.6e-2460.48Show/hide
Query:  QVSGDSEHDTEPLEHSDSATVEIQCQIAPGAIMDETPLATL-------------------------------QEQERTSGPVDVPSEAMAESFSSFSQGK
        +VSGDSEHD EPLEHSDSATV+I+CQIAP  IM ETP ATL                               QEQE TSGP+DV SEAM ES SS+SQ K
Subjt:  QVSGDSEHDTEPLEHSDSATVEIQCQIAPGAIMDETPLATL-------------------------------QEQERTSGPVDVPSEAMAESFSSFSQGK

Query:  TPSLSSLNVSDPNFVATTETSDEE
        T SLSSLNVSDPNFVAT E SDEE
Subjt:  TPSLSSLNVSDPNFVATTETSDEE

XP_022159289.1 uncharacterized protein LOC111025702 [Momordica charantia]1.3e-3669.75Show/hide
Query:  IWMYVVKNRLIPTSHDSSIKHNRATIVYILMKGVEFNFGELIRNEIRSCSKKMVGSLVFPGLITELCLQVGVAADDANVVMPKKPFTFLRRVWGYSIVRE
        +W YVVKN LI TS+DSSI+  R  IVYILMKG+EFNF ELIRNEI  C++KMVG L+FP  I ELCL+ GV AD  +VVM KK  T +RRV GY IVRE
Subjt:  IWMYVVKNRLIPTSHDSSIKHNRATIVYILMKGVEFNFGELIRNEIRSCSKKMVGSLVFPGLITELCLQVGVAADDANVVMPKKPFTFLRRVWGYSIVRE

Query:  EDSPITVADPETRGVVTRE
        EDSPIT ADP+TRGVVTRE
Subjt:  EDSPITVADPETRGVVTRE

TrEMBL top hitse value%identityAlignment
A0A6J1CL76 uncharacterized protein LOC1110121884.8e-2441.03Show/hide
Query:  INMESNDARVNKEGSSEKKLGGVNKVYLRKNQSLEEKGVVLDEGIARLQERAEMFSKNNKIRDKKNESVYAKIEELNIKWQEFMENSKKVSEEIQLELNS
        + +E + AR+N+   +EKKL   +KVYLRKNQ + + G  LDE I  + ER +  +K  +IRDK+NE + AKI ELN KWQ FMENS+++SEEIQ+EL+ 
Subjt:  INMESNDARVNKEGSSEKKLGGVNKVYLRKNQSLEEKGVVLDEGIARLQERAEMFSKNNKIRDKKNESVYAKIEELNIKWQEFMENSKKVSEEIQLELNS

Query:  MSIRRKEQVSGDSEHDTEPLEHSDSATVEIQCQIAPGAIMDETPLATLQEQERTSGPVDVPSEAMAESFSSFSQGKTPSLSSLNVSDPNFVATTETSDEE
        +       +  +  ++ E  E ++  +             DE   A +Q QE  S P DVP+EA  ES SS S+  T S SSLNV DPNFVA  E S+EE
Subjt:  MSIRRKEQVSGDSEHDTEPLEHSDSATVEIQCQIAPGAIMDETPLATLQEQERTSGPVDVPSEAMAESFSSFSQGKTPSLSSLNVSDPNFVATTETSDEE

Query:  DPRRCSTFGCQKEAEAGPSKKAKRARVQREAEEP
            C T    K+ +     K + A+    A EP
Subjt:  DPRRCSTFGCQKEAEAGPSKKAKRARVQREAEEP

A0A6J1DMT3 LOW QUALITY PROTEIN: uncharacterized protein LOC1110220071.1e-10555.7Show/hide
Query:  VLDEGIARLQERAEMFSKNNKIRDKKNESVYAKIEELNIKWQEFMENSKKVSEEIQLELNSMSIRRKEQVSGDSEHDTEPLEHSDSATVEIQCQIAPGAI
        V   G   L+       K N  R KK    Y   E+LN+  ++  E+  ++ +E               V GD   D E  +      V+++  +     
Subjt:  VLDEGIARLQERAEMFSKNNKIRDKKNESVYAKIEELNIKWQEFMENSKKVSEEIQLELNSMSIRRKEQVSGDSEHDTEPLEHSDSATVEIQCQIAPGAI

Query:  MDETPLATLQEQERTSGPVDVPSEAMAESFSSFSQGKTPSLSSLNVSDPNFVATTETSDEEDPRRCSTFGCQKEAEAGPSKKAKRARVQREAEEPLEEAN
        +DE P    +EQE TSGPVDVPSEAM ES SS SQG         VS P     T T+        +    QKEAEAGPSKKAK ARVQR AEEPLEEAN
Subjt:  MDETPLATLQEQERTSGPVDVPSEAMAESFSSFSQGKTPSLSSLNVSDPNFVATTETSDEEDPRRCSTFGCQKEAEAGPSKKAKRARVQREAEEPLEEAN

Query:  EEEPDSTEQTTSRVKRVRLEVRRPNFTKRDILLERG-DEAQEPVPEYVKRRLVEN-----------------------------------GNEILVHPSD
        EEEPDSTEQT SRVKRVRLEVRRP FT RDILLERG DEAQEPVPEYV++R+VEN                                   GNEILVHPSD
Subjt:  EEEPDSTEQTTSRVKRVRLEVRRPNFTKRDILLERG-DEAQEPVPEYVKRRLVEN-----------------------------------GNEILVHPSD

Query:  EQVEEVHRLICRPHKTWTVSTTGKLSLKPLDINEQAKIWMYVVKNRLIPTSHDSSIKHNRATIVYILMKGVEFNFGELIRNEIRSCSKKMVGSLVFPGLI
        EQVEE  RLICRPHKTWT+ST GKLSLKPLDINEQA +WMYVVKNRLIPTS+DSSIK NRA IVYIL+KGVEFNFGELIRNEI+SCS+K+          
Subjt:  EQVEEVHRLICRPHKTWTVSTTGKLSLKPLDINEQAKIWMYVVKNRLIPTSHDSSIKHNRATIVYILMKGVEFNFGELIRNEIRSCSKKMVGSLVFPGLI

Query:  TELCLQVGVAADDANVVMPKKPFTFLRRVWGYSIVREEDSPITVADPETRGVVTRE
               GV A DANVVMPKKPF  LR+V GYSIVREEDSPIT ADPETRGVVTRE
Subjt:  TELCLQVGVAADDANVVMPKKPFTFLRRVWGYSIVREEDSPITVADPETRGVVTRE

A0A6J1DRR9 uncharacterized protein LOC1110237613.7e-2460.48Show/hide
Query:  QVSGDSEHDTEPLEHSDSATVEIQCQIAPGAIMDETPLATL-------------------------------QEQERTSGPVDVPSEAMAESFSSFSQGK
        +VSGDSEHD EPLEHSDSATV+I+CQIAP  IM ETP ATL                               QEQE TSGP+DV SEAM ES SS+SQ K
Subjt:  QVSGDSEHDTEPLEHSDSATVEIQCQIAPGAIMDETPLATL-------------------------------QEQERTSGPVDVPSEAMAESFSSFSQGK

Query:  TPSLSSLNVSDPNFVATTETSDEE
        T SLSSLNVSDPNFVAT E SDEE
Subjt:  TPSLSSLNVSDPNFVATTETSDEE

A0A6J1DW11 uncharacterized protein LOC1110236202.3e-3454.89Show/hide
Query:  LQEQERTSGPVDVPSEAMAESFSSFSQGKTPSLSSLNVSDPNFVATTETSDEE-----------------------DPRRC-----STFGCQKEAEAGPS
        +QEQE  SG VDVP+EA+ ES SS S+GK+PSLSSLNVSDPNFVA   TS+E+                       + R C     ++   QKEAEAGP 
Subjt:  LQEQERTSGPVDVPSEAMAESFSSFSQGKTPSLSSLNVSDPNFVATTETSDEE-----------------------DPRRC-----STFGCQKEAEAGPS

Query:  KKAKRARVQREAEEPLEEANEEEPDSTEQTTSRVKRVRLEVRRPNFTKRDILLERG-DEAQEPVPEYVKRRLVENGNEILVHPS
        KKAKR +  R +EEPL+E N+EE DS EQT S+ KRVR EV+R NFT R+IL+E+G DEAQEPVP+Y+KRRL+ENG E L  P+
Subjt:  KKAKRARVQREAEEPLEEANEEEPDSTEQTTSRVKRVRLEVRRPNFTKRDILLERG-DEAQEPVPEYVKRRLVENGNEILVHPS

A0A6J1E204 uncharacterized protein LOC1110257026.5e-3769.75Show/hide
Query:  IWMYVVKNRLIPTSHDSSIKHNRATIVYILMKGVEFNFGELIRNEIRSCSKKMVGSLVFPGLITELCLQVGVAADDANVVMPKKPFTFLRRVWGYSIVRE
        +W YVVKN LI TS+DSSI+  R  IVYILMKG+EFNF ELIRNEI  C++KMVG L+FP  I ELCL+ GV AD  +VVM KK  T +RRV GY IVRE
Subjt:  IWMYVVKNRLIPTSHDSSIKHNRATIVYILMKGVEFNFGELIRNEIRSCSKKMVGSLVFPGLITELCLQVGVAADDANVVMPKKPFTFLRRVWGYSIVRE

Query:  EDSPITVADPETRGVVTRE
        EDSPIT ADP+TRGVVTRE
Subjt:  EDSPITVADPETRGVVTRE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGTCAATGATGAGCAGGTAACCTTCAATGTCCTCGATGCGATGCGTCTCCCGGATGAAGTCGAGGAGTGCTCTACAATAGGAGAAATCATGGAGGAACTA
CAACAAATGATGGTGGAAGACTTAGAAGCAAATTTAGAGGCCACAGAAAAAGAATCCAAAATTGCGCCTGGCGCAATTTTGCCCCAATTTGAGCGTTTTGAGTTT
TTGCAGCGGACAATTGCGGATTTGAAGGCCTTGCAACCTTCAATCATTGAACCTCCAGAATTGGAGAAGAAACCCCTACCTTTTCATTTAAAATATGCTTATTTG
GGTTTAAACGATACTTTGCCCGTTATCATTTCTTCATATTTGTCTAATGAACATGAACCTTTGCTTTTGCAGTTAGTTTTTGATCCTAGGAAACAAAGAAGAAAA
TATGAGGAAGCTATAAGAATGAACCCTAGGAGAAATCTATCCATAGGTGGTACAAATTTTGAAAAAATTAATATGGAATCTAATGATGCTAGGGTTAATAAAGAA
GGTTCTAGTGAAAAGAAATTAGGAGGTGTTAATAAAGTTTATCTTCGAAAAAATCAATCTCTAGAGGAAAAAGGTGTTGTTTTAGATGAAGGAATAGCTAGACTT
CAAGAGAGAGCGGAGATGTTCAGTAAAAATAACAAAATTAGGGATAAAAAGAATGAGAGCGTTTATGCGAAAATTGAGGAATTAAACATAAAATGGCAAGAATTC
ATGGAAAACTCAAAGAAAGTGAGTGAGGAGATTCAACTTGAGTTAAATAGCATGAGTATACGTCGTAAAGAACAGGTTAGTGGAGACTCAGAACACGACACGGAG
CCCTTGGAGCACTCAGATTCGGCCACAGTCGAAATTCAATGCCAAATTGCGCCTGGCGCAATTATGGATGAAACTCCACTGGCCACTCTACAAGAGCAAGAAAGA
ACATCCGGTCCTGTGGATGTCCCTAGTGAGGCCATGGCAGAATCATTCTCCTCTTTTTCACAAGGTAAAACCCCTTCTTTATCAAGTTTGAATGTTTCTGACCCA
AACTTTGTTGCTACTACAGAGACTTCAGATGAGGAGGACCCACGCCGCTGTAGCACGTTTGGCTGCCAAAAAGAAGCCGAGGCTGGTCCATCTAAAAAAGCCAAG
AGGGCTAGGGTGCAAAGAGAGGCAGAAGAGCCACTTGAGGAGGCCAATGAAGAGGAGCCCGATTCTACAGAACAAACAACATCAAGAGTAAAAAGGGTGAGATTG
GAGGTGAGGAGGCCCAACTTCACAAAACGTGATATCCTCCTTGAGAGAGGTGATGAGGCCCAAGAGCCGGTGCCAGAATATGTTAAGAGGAGGCTTGTGGAGAAT
GGTAATGAAATTTTGGTGCATCCATCGGACGAGCAAGTGGAGGAGGTGCATAGACTTATTTGTAGACCACATAAGACATGGACCGTCTCAACCACAGGGAAGCTT
TCCTTAAAGCCCCTTGACATTAATGAGCAAGCAAAGATTTGGATGTATGTGGTGAAGAACCGGTTGATCCCCACTTCTCACGATTCCTCCATTAAGCACAATAGG
GCGACGATAGTGTACATTCTCATGAAGGGCGTTGAGTTCAACTTTGGGGAGCTCATAAGAAACGAGATTCGGAGTTGCTCTAAGAAAATGGTAGGTTCTCTTGTT
TTTCCTGGACTAATAACTGAGTTATGCTTGCAGGTGGGAGTGGCAGCTGATGATGCCAATGTTGTGATGCCCAAGAAGCCGTTCACATTCCTAAGAAGAGTTTGG
GGGTATTCCATTGTTCGAGAGGAAGATTCTCCCATTACCGTCGCGGATCCCGAGACCCGAGGGGTGGTGACAAGGGAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGAAGGTCAATGATGAGCAGGTAACCTTCAATGTCCTCGATGCGATGCGTCTCCCGGATGAAGTCGAGGAGTGCTCTACAATAGGAGAAATCATGGAGGAACTA
CAACAAATGATGGTGGAAGACTTAGAAGCAAATTTAGAGGCCACAGAAAAAGAATCCAAAATTGCGCCTGGCGCAATTTTGCCCCAATTTGAGCGTTTTGAGTTT
TTGCAGCGGACAATTGCGGATTTGAAGGCCTTGCAACCTTCAATCATTGAACCTCCAGAATTGGAGAAGAAACCCCTACCTTTTCATTTAAAATATGCTTATTTG
GGTTTAAACGATACTTTGCCCGTTATCATTTCTTCATATTTGTCTAATGAACATGAACCTTTGCTTTTGCAGTTAGTTTTTGATCCTAGGAAACAAAGAAGAAAA
TATGAGGAAGCTATAAGAATGAACCCTAGGAGAAATCTATCCATAGGTGGTACAAATTTTGAAAAAATTAATATGGAATCTAATGATGCTAGGGTTAATAAAGAA
GGTTCTAGTGAAAAGAAATTAGGAGGTGTTAATAAAGTTTATCTTCGAAAAAATCAATCTCTAGAGGAAAAAGGTGTTGTTTTAGATGAAGGAATAGCTAGACTT
CAAGAGAGAGCGGAGATGTTCAGTAAAAATAACAAAATTAGGGATAAAAAGAATGAGAGCGTTTATGCGAAAATTGAGGAATTAAACATAAAATGGCAAGAATTC
ATGGAAAACTCAAAGAAAGTGAGTGAGGAGATTCAACTTGAGTTAAATAGCATGAGTATACGTCGTAAAGAACAGGTTAGTGGAGACTCAGAACACGACACGGAG
CCCTTGGAGCACTCAGATTCGGCCACAGTCGAAATTCAATGCCAAATTGCGCCTGGCGCAATTATGGATGAAACTCCACTGGCCACTCTACAAGAGCAAGAAAGA
ACATCCGGTCCTGTGGATGTCCCTAGTGAGGCCATGGCAGAATCATTCTCCTCTTTTTCACAAGGTAAAACCCCTTCTTTATCAAGTTTGAATGTTTCTGACCCA
AACTTTGTTGCTACTACAGAGACTTCAGATGAGGAGGACCCACGCCGCTGTAGCACGTTTGGCTGCCAAAAAGAAGCCGAGGCTGGTCCATCTAAAAAAGCCAAG
AGGGCTAGGGTGCAAAGAGAGGCAGAAGAGCCACTTGAGGAGGCCAATGAAGAGGAGCCCGATTCTACAGAACAAACAACATCAAGAGTAAAAAGGGTGAGATTG
GAGGTGAGGAGGCCCAACTTCACAAAACGTGATATCCTCCTTGAGAGAGGTGATGAGGCCCAAGAGCCGGTGCCAGAATATGTTAAGAGGAGGCTTGTGGAGAAT
GGTAATGAAATTTTGGTGCATCCATCGGACGAGCAAGTGGAGGAGGTGCATAGACTTATTTGTAGACCACATAAGACATGGACCGTCTCAACCACAGGGAAGCTT
TCCTTAAAGCCCCTTGACATTAATGAGCAAGCAAAGATTTGGATGTATGTGGTGAAGAACCGGTTGATCCCCACTTCTCACGATTCCTCCATTAAGCACAATAGG
GCGACGATAGTGTACATTCTCATGAAGGGCGTTGAGTTCAACTTTGGGGAGCTCATAAGAAACGAGATTCGGAGTTGCTCTAAGAAAATGGTAGGTTCTCTTGTT
TTTCCTGGACTAATAACTGAGTTATGCTTGCAGGTGGGAGTGGCAGCTGATGATGCCAATGTTGTGATGCCCAAGAAGCCGTTCACATTCCTAAGAAGAGTTTGG
GGGTATTCCATTGTTCGAGAGGAAGATTCTCCCATTACCGTCGCGGATCCCGAGACCCGAGGGGTGGTGACAAGGGAGTAG
Protein sequenceShow/hide protein sequence
MKVNDEQVTFNVLDAMRLPDEVEECSTIGEIMEELQQMMVEDLEANLEATEKESKIAPGAILPQFERFEFLQRTIADLKALQPSIIEPPELEKKPLPFHLKYAYL
GLNDTLPVIISSYLSNEHEPLLLQLVFDPRKQRRKYEEAIRMNPRRNLSIGGTNFEKINMESNDARVNKEGSSEKKLGGVNKVYLRKNQSLEEKGVVLDEGIARL
QERAEMFSKNNKIRDKKNESVYAKIEELNIKWQEFMENSKKVSEEIQLELNSMSIRRKEQVSGDSEHDTEPLEHSDSATVEIQCQIAPGAIMDETPLATLQEQER
TSGPVDVPSEAMAESFSSFSQGKTPSLSSLNVSDPNFVATTETSDEEDPRRCSTFGCQKEAEAGPSKKAKRARVQREAEEPLEEANEEEPDSTEQTTSRVKRVRL
EVRRPNFTKRDILLERGDEAQEPVPEYVKRRLVENGNEILVHPSDEQVEEVHRLICRPHKTWTVSTTGKLSLKPLDINEQAKIWMYVVKNRLIPTSHDSSIKHNR
ATIVYILMKGVEFNFGELIRNEIRSCSKKMVGSLVFPGLITELCLQVGVAADDANVVMPKKPFTFLRRVWGYSIVREEDSPITVADPETRGVVTRE