; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G20690 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G20690
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionCCHC-type domain-containing protein
Genome locationChr3:16658298..16659652
RNA-Seq ExpressionCSPI03G20690
SyntenyCSPI03G20690
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0038206.1 uncharacterized protein E6C27_scaffold270G00430 [Cucumis melo var. makuwa]1.0e-5860.09Show/hide
Query:  MKVDLPTFNGRMDVEKFLDWIKNVEIFFDYANTPEHKKVRLVALKLQGGASAWWDQFKTIDDYSYQQCHQGSRSIMDYTEDFYPLGARNNLFETEHQQIS
        MKVDLPTFNG+MD EKFLDWIKNVE FF YANTPEHKKVRLVALKLQGGASAW DQ             Q +R                           
Subjt:  MKVDLPTFNGRMDVEKFLDWIKNVEIFFDYANTPEHKKVRLVALKLQGGASAWWDQFKTIDDYSYQQCHQGSRSIMDYTEDFYPLGARNNLFETEHQQIS

Query:  RFIHGLRDEIKDIVHLHPLTFLSDAISLASKIEDSEEIKKTKNSQRKNNWDKQQRTNLTNSFRNFQQGSSSTTSQLAKKDENSSKIPATKPGENNAKKKV
                             L DA SLASKIE SEEIKKTK  QRKN+WDKQQRTN TNSFRNFQQGSSST SQ  KKD+   K PATKP EN +KKKV
Subjt:  RFIHGLRDEIKDIVHLHPLTFLSDAISLASKIEDSEEIKKTKNSQRKNNWDKQQRTNLTNSFRNFQQGSSSTTSQLAKKDENSSKIPATKPGENNAKKKV

Query:  DNIYNRPTLGKCFKCGQQGHFSNECPQR
        DN+YNRPTLGKCF+C QQGH SNEC QR
Subjt:  DNIYNRPTLGKCFKCGQQGHFSNECPQR

KAA0058889.1 Transposon Ty3-G Gag-Pol polyprotein [Cucumis melo var. makuwa]5.7e-10570.79Show/hide
Query:  DSDSSDEDNFLNIHQEPKQRLGLRPYFQEDQEIRMKVDLPTFNGRMDVEKFLDWIKNVEIFFDYANTPEHKKVRLVALKLQGGASAWWDQ----------
        +SDSSDEDN LNIHQEP++ L   PY QEDQEIRMKVDLPTFNGRMDVEKFLDWIKNVE FFDYAN  EHKKV+LVALKLQGGASAWWDQ          
Subjt:  DSDSSDEDNFLNIHQEPKQRLGLRPYFQEDQEIRMKVDLPTFNGRMDVEKFLDWIKNVEIFFDYANTPEHKKVRLVALKLQGGASAWWDQ----------

Query:  ---------------------FKTIDDYSYQQCHQGSRSIMDYTEDFYPLGARNNLFETEHQQISRFIHGLRDEIKDIVHLHPLTFLSDAISLASKIEDS
                             ++ +    YQQCHQG RSIMDYTE+FY LGAR+NL ETEHQQISR IHGL++EIKD+V+LH LTFL DAIS+ASKIED+
Subjt:  ---------------------FKTIDDYSYQQCHQGSRSIMDYTEDFYPLGARNNLFETEHQQISRFIHGLRDEIKDIVHLHPLTFLSDAISLASKIEDS

Query:  EEIKKTKNSQRKNNWDKQQRTNLTNSFRNFQQGSSSTTSQLAKKDENSSKIPATKPGENNAKKKVDNIYNRPTLGKCFKCGQQGHFSNECP
        EEIKKTK +QRKNNWDK QR N TNSFRNF QGSSS+TSQ  KKDEN  K P TK GE N KKKVDN+Y RPTLGKCFKCGQQGH SNECP
Subjt:  EEIKKTKNSQRKNNWDKQQRTNLTNSFRNFQQGSSSTTSQLAKKDENSSKIPATKPGENNAKKKVDNIYNRPTLGKCFKCGQQGHFSNECP

TYK30599.1 uncharacterized protein E5676_scaffold84664G00070 [Cucumis melo var. makuwa]2.3e-5860.09Show/hide
Query:  MKVDLPTFNGRMDVEKFLDWIKNVEIFFDYANTPEHKKVRLVALKLQGGASAWWDQFKTIDDYSYQQCHQGSRSIMDYTEDFYPLGARNNLFETEHQQIS
        MKVDLPTFNG+MD EKFLDWIKNVE FF YANTPEHKKVRLVALKLQGGASAW DQ             Q +R                           
Subjt:  MKVDLPTFNGRMDVEKFLDWIKNVEIFFDYANTPEHKKVRLVALKLQGGASAWWDQFKTIDDYSYQQCHQGSRSIMDYTEDFYPLGARNNLFETEHQQIS

Query:  RFIHGLRDEIKDIVHLHPLTFLSDAISLASKIEDSEEIKKTKNSQRKNNWDKQQRTNLTNSFRNFQQGSSSTTSQLAKKDENSSKIPATKPGENNAKKKV
                             L DA SLASKIE SEEIKKTK  QRKN+WDKQQRTN TNSFRNFQQGSSST SQ  KKD+   K PATKP EN +KKKV
Subjt:  RFIHGLRDEIKDIVHLHPLTFLSDAISLASKIEDSEEIKKTKNSQRKNNWDKQQRTNLTNSFRNFQQGSSSTTSQLAKKDENSSKIPATKPGENNAKKKV

Query:  DNIYNRPTLGKCFKCGQQGHFSNECPQR
        DN+YNRPTLGKCF+C QQGH SNEC QR
Subjt:  DNIYNRPTLGKCFKCGQQGHFSNECPQR

XP_022138327.1 uncharacterized protein LOC111009540 isoform X1 [Momordica charantia]1.8e-5040.06Show/hide
Query:  DSDSSD-EDNFLNIH------QEPKQRLGLRPYFQEDQEIRMKVDLPTFNGRMDVEKFLDWIKNVEIFFDYANTPEHKKVRLVALKLQGGASAWWDQ---
        DSD+SD ++ FL  H      +   +R G+R  F      +MK+DLPTFNG+MDVE FLD +KNVE FFDY NTPE KKV+LVA K+Q GASAWWDQ   
Subjt:  DSDSSD-EDNFLNIH------QEPKQRLGLRPYFQEDQEIRMKVDLPTFNGRMDVEKFLDWIKNVEIFFDYANTPEHKKVRLVALKLQGGASAWWDQ---

Query:  ----------------------------FKTIDDYSYQQCHQGSRSIMDYTEDFYPLGARNNLFETEHQQISRFIHGLRDEIKDIVHLHPLTFLSDAISL
                                    F+ +    YQ+C QG ++I DYTE F+ LGA+ N+ ETE  +I+RF+ GLR++I+D + + P+  L+DAI +
Subjt:  ----------------------------FKTIDDYSYQQCHQGSRSIMDYTEDFYPLGARNNLFETEHQQISRFIHGLRDEIKDIVHLHPLTFLSDAISL

Query:  ASKIEDSEEIKKTKNSQRKNNWDKQ--QRTNLTNSFRNFQQGSSSTTSQLAKKDENSSKIPATKPGENNAKKKVDNIYNRPTLGKCFKCGQQGHFSNECP
        A+KIED    K+ +   R+  WDK    +T  T++ +  Q G++S  S     D+ +   P   P  + + K+  N Y RPTLGKCF+CGQ  H SNECP
Subjt:  ASKIEDSEEIKKTKNSQRKNNWDKQ--QRTNLTNSFRNFQQGSSSTTSQLAKKDENSSKIPATKPGENNAKKKVDNIYNRPTLGKCFKCGQQGHFSNECP

Query:  QREFFSPLRLDKYLKGIPSFEQDAPLTEDLGNMTQITFIEED
        QR   + +  D  L+     + D P  +D       T++E D
Subjt:  QREFFSPLRLDKYLKGIPSFEQDAPLTEDLGNMTQITFIEED

XP_022138328.1 uncharacterized protein LOC111009540 isoform X2 [Momordica charantia]1.8e-5040.06Show/hide
Query:  DSDSSD-EDNFLNIH------QEPKQRLGLRPYFQEDQEIRMKVDLPTFNGRMDVEKFLDWIKNVEIFFDYANTPEHKKVRLVALKLQGGASAWWDQ---
        DSD+SD ++ FL  H      +   +R G+R  F      +MK+DLPTFNG+MDVE FLD +KNVE FFDY NTPE KKV+LVA K+Q GASAWWDQ   
Subjt:  DSDSSD-EDNFLNIH------QEPKQRLGLRPYFQEDQEIRMKVDLPTFNGRMDVEKFLDWIKNVEIFFDYANTPEHKKVRLVALKLQGGASAWWDQ---

Query:  ----------------------------FKTIDDYSYQQCHQGSRSIMDYTEDFYPLGARNNLFETEHQQISRFIHGLRDEIKDIVHLHPLTFLSDAISL
                                    F+ +    YQ+C QG ++I DYTE F+ LGA+ N+ ETE  +I+RF+ GLR++I+D + + P+  L+DAI +
Subjt:  ----------------------------FKTIDDYSYQQCHQGSRSIMDYTEDFYPLGARNNLFETEHQQISRFIHGLRDEIKDIVHLHPLTFLSDAISL

Query:  ASKIEDSEEIKKTKNSQRKNNWDKQ--QRTNLTNSFRNFQQGSSSTTSQLAKKDENSSKIPATKPGENNAKKKVDNIYNRPTLGKCFKCGQQGHFSNECP
        A+KIED    K+ +   R+  WDK    +T  T++ +  Q G++S  S     D+ +   P   P  + + K+  N Y RPTLGKCF+CGQ  H SNECP
Subjt:  ASKIEDSEEIKKTKNSQRKNNWDKQ--QRTNLTNSFRNFQQGSSSTTSQLAKKDENSSKIPATKPGENNAKKKVDNIYNRPTLGKCFKCGQQGHFSNECP

Query:  QREFFSPLRLDKYLKGIPSFEQDAPLTEDLGNMTQITFIEED
        QR   + +  D  L+     + D P  +D       T++E D
Subjt:  QREFFSPLRLDKYLKGIPSFEQDAPLTEDLGNMTQITFIEED

TrEMBL top hitse value%identityAlignment
A0A5A7T7L8 CCHC-type domain-containing protein5.1e-5960.09Show/hide
Query:  MKVDLPTFNGRMDVEKFLDWIKNVEIFFDYANTPEHKKVRLVALKLQGGASAWWDQFKTIDDYSYQQCHQGSRSIMDYTEDFYPLGARNNLFETEHQQIS
        MKVDLPTFNG+MD EKFLDWIKNVE FF YANTPEHKKVRLVALKLQGGASAW DQ             Q +R                           
Subjt:  MKVDLPTFNGRMDVEKFLDWIKNVEIFFDYANTPEHKKVRLVALKLQGGASAWWDQFKTIDDYSYQQCHQGSRSIMDYTEDFYPLGARNNLFETEHQQIS

Query:  RFIHGLRDEIKDIVHLHPLTFLSDAISLASKIEDSEEIKKTKNSQRKNNWDKQQRTNLTNSFRNFQQGSSSTTSQLAKKDENSSKIPATKPGENNAKKKV
                             L DA SLASKIE SEEIKKTK  QRKN+WDKQQRTN TNSFRNFQQGSSST SQ  KKD+   K PATKP EN +KKKV
Subjt:  RFIHGLRDEIKDIVHLHPLTFLSDAISLASKIEDSEEIKKTKNSQRKNNWDKQQRTNLTNSFRNFQQGSSSTTSQLAKKDENSSKIPATKPGENNAKKKV

Query:  DNIYNRPTLGKCFKCGQQGHFSNECPQR
        DN+YNRPTLGKCF+C QQGH SNEC QR
Subjt:  DNIYNRPTLGKCFKCGQQGHFSNECPQR

A0A5D3DJC1 Transposon Ty3-G Gag-Pol polyprotein2.7e-10570.79Show/hide
Query:  DSDSSDEDNFLNIHQEPKQRLGLRPYFQEDQEIRMKVDLPTFNGRMDVEKFLDWIKNVEIFFDYANTPEHKKVRLVALKLQGGASAWWDQ----------
        +SDSSDEDN LNIHQEP++ L   PY QEDQEIRMKVDLPTFNGRMDVEKFLDWIKNVE FFDYAN  EHKKV+LVALKLQGGASAWWDQ          
Subjt:  DSDSSDEDNFLNIHQEPKQRLGLRPYFQEDQEIRMKVDLPTFNGRMDVEKFLDWIKNVEIFFDYANTPEHKKVRLVALKLQGGASAWWDQ----------

Query:  ---------------------FKTIDDYSYQQCHQGSRSIMDYTEDFYPLGARNNLFETEHQQISRFIHGLRDEIKDIVHLHPLTFLSDAISLASKIEDS
                             ++ +    YQQCHQG RSIMDYTE+FY LGAR+NL ETEHQQISR IHGL++EIKD+V+LH LTFL DAIS+ASKIED+
Subjt:  ---------------------FKTIDDYSYQQCHQGSRSIMDYTEDFYPLGARNNLFETEHQQISRFIHGLRDEIKDIVHLHPLTFLSDAISLASKIEDS

Query:  EEIKKTKNSQRKNNWDKQQRTNLTNSFRNFQQGSSSTTSQLAKKDENSSKIPATKPGENNAKKKVDNIYNRPTLGKCFKCGQQGHFSNECP
        EEIKKTK +QRKNNWDK QR N TNSFRNF QGSSS+TSQ  KKDEN  K P TK GE N KKKVDN+Y RPTLGKCFKCGQQGH SNECP
Subjt:  EEIKKTKNSQRKNNWDKQQRTNLTNSFRNFQQGSSSTTSQLAKKDENSSKIPATKPGENNAKKKVDNIYNRPTLGKCFKCGQQGHFSNECP

A0A5D3E462 CCHC-type domain-containing protein1.1e-5860.09Show/hide
Query:  MKVDLPTFNGRMDVEKFLDWIKNVEIFFDYANTPEHKKVRLVALKLQGGASAWWDQFKTIDDYSYQQCHQGSRSIMDYTEDFYPLGARNNLFETEHQQIS
        MKVDLPTFNG+MD EKFLDWIKNVE FF YANTPEHKKVRLVALKLQGGASAW DQ             Q +R                           
Subjt:  MKVDLPTFNGRMDVEKFLDWIKNVEIFFDYANTPEHKKVRLVALKLQGGASAWWDQFKTIDDYSYQQCHQGSRSIMDYTEDFYPLGARNNLFETEHQQIS

Query:  RFIHGLRDEIKDIVHLHPLTFLSDAISLASKIEDSEEIKKTKNSQRKNNWDKQQRTNLTNSFRNFQQGSSSTTSQLAKKDENSSKIPATKPGENNAKKKV
                             L DA SLASKIE SEEIKKTK  QRKN+WDKQQRTN TNSFRNFQQGSSST SQ  KKD+   K PATKP EN +KKKV
Subjt:  RFIHGLRDEIKDIVHLHPLTFLSDAISLASKIEDSEEIKKTKNSQRKNNWDKQQRTNLTNSFRNFQQGSSSTTSQLAKKDENSSKIPATKPGENNAKKKV

Query:  DNIYNRPTLGKCFKCGQQGHFSNECPQR
        DN+YNRPTLGKCF+C QQGH SNEC QR
Subjt:  DNIYNRPTLGKCFKCGQQGHFSNECPQR

A0A6J1CAS9 uncharacterized protein LOC111009540 isoform X18.6e-5140.06Show/hide
Query:  DSDSSD-EDNFLNIH------QEPKQRLGLRPYFQEDQEIRMKVDLPTFNGRMDVEKFLDWIKNVEIFFDYANTPEHKKVRLVALKLQGGASAWWDQ---
        DSD+SD ++ FL  H      +   +R G+R  F      +MK+DLPTFNG+MDVE FLD +KNVE FFDY NTPE KKV+LVA K+Q GASAWWDQ   
Subjt:  DSDSSD-EDNFLNIH------QEPKQRLGLRPYFQEDQEIRMKVDLPTFNGRMDVEKFLDWIKNVEIFFDYANTPEHKKVRLVALKLQGGASAWWDQ---

Query:  ----------------------------FKTIDDYSYQQCHQGSRSIMDYTEDFYPLGARNNLFETEHQQISRFIHGLRDEIKDIVHLHPLTFLSDAISL
                                    F+ +    YQ+C QG ++I DYTE F+ LGA+ N+ ETE  +I+RF+ GLR++I+D + + P+  L+DAI +
Subjt:  ----------------------------FKTIDDYSYQQCHQGSRSIMDYTEDFYPLGARNNLFETEHQQISRFIHGLRDEIKDIVHLHPLTFLSDAISL

Query:  ASKIEDSEEIKKTKNSQRKNNWDKQ--QRTNLTNSFRNFQQGSSSTTSQLAKKDENSSKIPATKPGENNAKKKVDNIYNRPTLGKCFKCGQQGHFSNECP
        A+KIED    K+ +   R+  WDK    +T  T++ +  Q G++S  S     D+ +   P   P  + + K+  N Y RPTLGKCF+CGQ  H SNECP
Subjt:  ASKIEDSEEIKKTKNSQRKNNWDKQ--QRTNLTNSFRNFQQGSSSTTSQLAKKDENSSKIPATKPGENNAKKKVDNIYNRPTLGKCFKCGQQGHFSNECP

Query:  QREFFSPLRLDKYLKGIPSFEQDAPLTEDLGNMTQITFIEED
        QR   + +  D  L+     + D P  +D       T++E D
Subjt:  QREFFSPLRLDKYLKGIPSFEQDAPLTEDLGNMTQITFIEED

A0A6J1CCQ8 uncharacterized protein LOC111009540 isoform X28.6e-5140.06Show/hide
Query:  DSDSSD-EDNFLNIH------QEPKQRLGLRPYFQEDQEIRMKVDLPTFNGRMDVEKFLDWIKNVEIFFDYANTPEHKKVRLVALKLQGGASAWWDQ---
        DSD+SD ++ FL  H      +   +R G+R  F      +MK+DLPTFNG+MDVE FLD +KNVE FFDY NTPE KKV+LVA K+Q GASAWWDQ   
Subjt:  DSDSSD-EDNFLNIH------QEPKQRLGLRPYFQEDQEIRMKVDLPTFNGRMDVEKFLDWIKNVEIFFDYANTPEHKKVRLVALKLQGGASAWWDQ---

Query:  ----------------------------FKTIDDYSYQQCHQGSRSIMDYTEDFYPLGARNNLFETEHQQISRFIHGLRDEIKDIVHLHPLTFLSDAISL
                                    F+ +    YQ+C QG ++I DYTE F+ LGA+ N+ ETE  +I+RF+ GLR++I+D + + P+  L+DAI +
Subjt:  ----------------------------FKTIDDYSYQQCHQGSRSIMDYTEDFYPLGARNNLFETEHQQISRFIHGLRDEIKDIVHLHPLTFLSDAISL

Query:  ASKIEDSEEIKKTKNSQRKNNWDKQ--QRTNLTNSFRNFQQGSSSTTSQLAKKDENSSKIPATKPGENNAKKKVDNIYNRPTLGKCFKCGQQGHFSNECP
        A+KIED    K+ +   R+  WDK    +T  T++ +  Q G++S  S     D+ +   P   P  + + K+  N Y RPTLGKCF+CGQ  H SNECP
Subjt:  ASKIEDSEEIKKTKNSQRKNNWDKQ--QRTNLTNSFRNFQQGSSSTTSQLAKKDENSSKIPATKPGENNAKKKVDNIYNRPTLGKCFKCGQQGHFSNECP

Query:  QREFFSPLRLDKYLKGIPSFEQDAPLTEDLGNMTQITFIEED
        QR   + +  D  L+     + D P  +D       T++E D
Subjt:  QREFFSPLRLDKYLKGIPSFEQDAPLTEDLGNMTQITFIEED

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GATTCTGACTCTTCTGATGAAGACAACTTTCTGAACATTCATCAAGAACCTAAACAAAGGCTTGGACTTCGACCATACTTTCAAGAAGATCAAGAAATACGTATGAAAGT
GGATCTCCCTACCTTCAATGGTCGAATGGACGTGGAGAAGTTTCTTGATTGGATCAAGAACGTAGAAATTTTTTTCGACTATGCCAATACACCCGAACACAAGAAGGTCC
GATTAGTTGCTCTCAAACTTCAAGGTGGCGCAAGCGCTTGGTGGGATCAGTTCAAAACAATAGACGATTATTCGTATCAACAATGTCATCAAGGCTCACGAAGCATCATG
GATTATACAGAAGACTTCTATCCACTTGGTGCTCGAAATAATCTTTTCGAAACAGAACACCAACAAATTTCCAGGTTTATTCATGGTCTACGAGATGAGATTAAAGATAT
TGTACACTTACATCCTTTAACTTTTCTTTCAGATGCCATCTCCTTAGCTTCCAAGATTGAGGATAGTGAAGAGATCAAGAAAACCAAGAATTCTCAAAGAAAGAACAATT
GGGACAAACAACAAAGAACTAACCTAACTAATTCATTTAGAAACTTTCAACAAGGAAGTTCATCCACAACTTCACAGCTCGCCAAGAAAGATGAAAATTCATCAAAGATT
CCAGCCACTAAACCAGGTGAGAATAACGCAAAGAAGAAGGTTGACAACATTTATAACCGTCCTACTTTGGGTAAATGTTTCAAGTGTGGACAACAAGGACACTTCTCCAA
CGAGTGCCCTCAAAGAGAATTCTTTTCACCCCTACGGCTGGACAAATACCTCAAAGGAATTCCCTCTTTCGAACAAGATGCACCATTAACGGAAGACCTTGGCAATATGA
CACAGATTACATTCATAGAGGAAGATCTTTATTGA
mRNA sequenceShow/hide mRNA sequence
ACGATTCTGACTCTTCTGATGAAGACAACTTTCTGAACATTCATCAAGAACCTAAACAAAGGCTTGGACTTCGACCATACTTTCAAGAAGATCAAGAAATACGTATGAAA
GTGGATCTCCCTACCTTCAATGGTCGAATGGACGTGGAGAAGTTTCTTGATTGGATCAAGAACGTAGAAATTTTTTTCGACTATGCCAATACACCCGAACACAAGAAGGT
CCGATTAGTTGCTCTCAAACTTCAAGGTGGCGCAAGCGCTTGGTGGGATCAGTTCAAAACAATAGACGATTATTCGTATCAACAATGTCATCAAGGCTCACGAAGCATCA
TGGATTATACAGAAGACTTCTATCCACTTGGTGCTCGAAATAATCTTTTCGAAACAGAACACCAACAAATTTCCAGGTTTATTCATGGTCTACGAGATGAGATTAAAGAT
ATTGTACACTTACATCCTTTAACTTTTCTTTCAGATGCCATCTCCTTAGCTTCCAAGATTGAGGATAGTGAAGAGATCAAGAAAACCAAGAATTCTCAAAGAAAGAACAA
TTGGGACAAACAACAAAGAACTAACCTAACTAATTCATTTAGAAACTTTCAACAAGGAAGTTCATCCACAACTTCACAGCTCGCCAAGAAAGATGAAAATTCATCAAAGA
TTCCAGCCACTAAACCAGGTGAGAATAACGCAAAGAAGAAGGTTGACAACATTTATAACCGTCCTACTTTGGGTAAATGTTTCAAGTGTGGACAACAAGGACACTTCTCC
AACGAGTGCCCTCAAAGAGAATTCTTTTCACCCCTACGGCTGGACAAATACCTCAAAGGAATTCCCTCTTTCGAACAAGATGCACCATTAACGGAAGACCTTGGCAATAT
GACACAGATTACATTCATAGAGGAAGATCTTTATTGA
Protein sequenceShow/hide protein sequence
DSDSSDEDNFLNIHQEPKQRLGLRPYFQEDQEIRMKVDLPTFNGRMDVEKFLDWIKNVEIFFDYANTPEHKKVRLVALKLQGGASAWWDQFKTIDDYSYQQCHQGSRSIM
DYTEDFYPLGARNNLFETEHQQISRFIHGLRDEIKDIVHLHPLTFLSDAISLASKIEDSEEIKKTKNSQRKNNWDKQQRTNLTNSFRNFQQGSSSTTSQLAKKDENSSKI
PATKPGENNAKKKVDNIYNRPTLGKCFKCGQQGHFSNECPQREFFSPLRLDKYLKGIPSFEQDAPLTEDLGNMTQITFIEEDLY