; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg05027 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg05027
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionRRM domain-containing protein
Genome locationCarg_Chr08:1124896..1127838
RNA-Seq ExpressionCarg05027
SyntenyCarg05027
Gene Ontology termsGO:0006396 - RNA processing (biological process)
GO:0032991 - protein-containing complex (cellular component)
GO:0003723 - RNA binding (molecular function)
InterPro domainsIPR000504 - RNA recognition motif domain
IPR012677 - Nucleotide-binding alpha-beta plait domain superfamily
IPR035979 - RNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6592987.1 hypothetical protein SDJN03_12463, partial [Cucurbita argyrosperma subsp. sororia]2.8e-13798.83Show/hide
Query:  LTAIGGDPTTSTFFHQIHLHFSTLRSSVVCSTATGSCPALTVAHFVSPLRTSSLSSHMGSATDEQYAKFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGT
        L +IGGDPTTSTFFHQIHLHFSTLRSSVVCSTATGSCPALTVAHFV PLRTSSLSSHMGSATDEQYAKFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGT
Subjt:  LTAIGGDPTTSTFFHQIHLHFSTLRSSVVCSTATGSCPALTVAHFVSPLRTSSLSSHMGSATDEQYAKFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGT

Query:  VVSVHFIPNFTEPINSSQCALVEMKDSKEANSVISVIAQFPFMMSGMPRPVRAYPAEVEMFDDRPIKPGRKISFVWLGKDDPDFEVAKKMKCLTQRHVAE
        VVSVHFIPNFTEPINSSQCALVEMKDSKEANSVISVIAQFPFMMSGMPRPVRAYPAEVEMFDDRPIKPGRKISFVWLGKDDPDFEVAKKMKCLTQRHVAE
Subjt:  VVSVHFIPNFTEPINSSQCALVEMKDSKEANSVISVIAQFPFMMSGMPRPVRAYPAEVEMFDDRPIKPGRKISFVWLGKDDPDFEVAKKMKCLTQRHVAE

Query:  AAFLLKQQLVEEEKLAKQQQDALKANYKKYEIVEGVMTDGTARRLARRYNMRVADD
        AAFLLKQQLVEEEKLAKQQQDALKANYKKYEIVEGVMTDGTARRLARRYNMRVADD
Subjt:  AAFLLKQQLVEEEKLAKQQQDALKANYKKYEIVEGVMTDGTARRLARRYNMRVADD

KAG7025396.1 hypothetical protein SDJN02_11891, partial [Cucurbita argyrosperma subsp. argyrosperma]8.6e-139100Show/hide
Query:  LTAIGGDPTTSTFFHQIHLHFSTLRSSVVCSTATGSCPALTVAHFVSPLRTSSLSSHMGSATDEQYAKFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGT
        LTAIGGDPTTSTFFHQIHLHFSTLRSSVVCSTATGSCPALTVAHFVSPLRTSSLSSHMGSATDEQYAKFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGT
Subjt:  LTAIGGDPTTSTFFHQIHLHFSTLRSSVVCSTATGSCPALTVAHFVSPLRTSSLSSHMGSATDEQYAKFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGT

Query:  VVSVHFIPNFTEPINSSQCALVEMKDSKEANSVISVIAQFPFMMSGMPRPVRAYPAEVEMFDDRPIKPGRKISFVWLGKDDPDFEVAKKMKCLTQRHVAE
        VVSVHFIPNFTEPINSSQCALVEMKDSKEANSVISVIAQFPFMMSGMPRPVRAYPAEVEMFDDRPIKPGRKISFVWLGKDDPDFEVAKKMKCLTQRHVAE
Subjt:  VVSVHFIPNFTEPINSSQCALVEMKDSKEANSVISVIAQFPFMMSGMPRPVRAYPAEVEMFDDRPIKPGRKISFVWLGKDDPDFEVAKKMKCLTQRHVAE

Query:  AAFLLKQQLVEEEKLAKQQQDALKANYKKYEIVEGVMTDGTARRLARRYNMRVADD
        AAFLLKQQLVEEEKLAKQQQDALKANYKKYEIVEGVMTDGTARRLARRYNMRVADD
Subjt:  AAFLLKQQLVEEEKLAKQQQDALKANYKKYEIVEGVMTDGTARRLARRYNMRVADD

XP_022959585.1 uncharacterized protein LOC111460615 [Cucurbita moschata]1.8e-10498.99Show/hide
Query:  MGSATDEQYAKFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNFTEPINSSQCALVEMKDSKEANSVISVIAQFPFMMSGMPRPVRAYPAE
        MGSATDEQY KFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNFTEPINSSQCALVEMKDSKEANSVIS+IAQFPFMMSGMPRPVRAYPAE
Subjt:  MGSATDEQYAKFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNFTEPINSSQCALVEMKDSKEANSVISVIAQFPFMMSGMPRPVRAYPAE

Query:  VEMFDDRPIKPGRKISFVWLGKDDPDFEVAKKMKCLTQRHVAEAAFLLKQQLVEEEKLAKQQQDALKANYKKYEIVEGVMTDGTARRLARRYNMRVADD
        VEMFDDRPIKPGRKISFVWLGKDDPDFEVAKKMKCLTQRHVAEAAFLLKQQLVEEEKLAKQQQDALKANYKKYEIVEGVMTDGTARRLARRYNMRVADD
Subjt:  VEMFDDRPIKPGRKISFVWLGKDDPDFEVAKKMKCLTQRHVAEAAFLLKQQLVEEEKLAKQQQDALKANYKKYEIVEGVMTDGTARRLARRYNMRVADD

XP_023004840.1 uncharacterized protein LOC111498019 [Cucurbita maxima]1.2e-10397.99Show/hide
Query:  MGSATDEQYAKFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNFTEPINSSQCALVEMKDSKEANSVISVIAQFPFMMSGMPRPVRAYPAE
        MGSATDEQYAKFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNFTEPINSSQCALVEMKDSKEANSVISVI+QFPFMMSGMPRPVRAYPAE
Subjt:  MGSATDEQYAKFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNFTEPINSSQCALVEMKDSKEANSVISVIAQFPFMMSGMPRPVRAYPAE

Query:  VEMFDDRPIKPGRKISFVWLGKDDPDFEVAKKMKCLTQRHVAEAAFLLKQQLVEEEKLAKQQQDALKANYKKYEIVEGVMTDGTARRLARRYNMRVADD
        VEMFDDRPIKPGRKISFVWLGK+DPDFEVAKKMKCLTQRHVAEAAFLLKQQLVEEEKLAKQQQDALK NYKKYEI+EGVMTDGTARRLARRYNMRVADD
Subjt:  VEMFDDRPIKPGRKISFVWLGKDDPDFEVAKKMKCLTQRHVAEAAFLLKQQLVEEEKLAKQQQDALKANYKKYEIVEGVMTDGTARRLARRYNMRVADD

XP_023514768.1 uncharacterized protein LOC111778981 [Cucurbita pepo subsp. pepo]2.8e-105100Show/hide
Query:  MGSATDEQYAKFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNFTEPINSSQCALVEMKDSKEANSVISVIAQFPFMMSGMPRPVRAYPAE
        MGSATDEQYAKFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNFTEPINSSQCALVEMKDSKEANSVISVIAQFPFMMSGMPRPVRAYPAE
Subjt:  MGSATDEQYAKFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNFTEPINSSQCALVEMKDSKEANSVISVIAQFPFMMSGMPRPVRAYPAE

Query:  VEMFDDRPIKPGRKISFVWLGKDDPDFEVAKKMKCLTQRHVAEAAFLLKQQLVEEEKLAKQQQDALKANYKKYEIVEGVMTDGTARRLARRYNMRVADD
        VEMFDDRPIKPGRKISFVWLGKDDPDFEVAKKMKCLTQRHVAEAAFLLKQQLVEEEKLAKQQQDALKANYKKYEIVEGVMTDGTARRLARRYNMRVADD
Subjt:  VEMFDDRPIKPGRKISFVWLGKDDPDFEVAKKMKCLTQRHVAEAAFLLKQQLVEEEKLAKQQQDALKANYKKYEIVEGVMTDGTARRLARRYNMRVADD

TrEMBL top hitse value%identityAlignment
A0A0A0KAE7 RRM domain-containing protein3.7e-8782.91Show/hide
Query:  MGSATDEQYAKFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNFTEPINSSQCALVEMKDSKEANSVISVIAQFPFMMSGMPRPVRAYPAE
        MGS  DEQ+A FEEKVKRTVYVDNLS QVTEPVLRTALDQFGTVVSVHFIPN+TEPINSSQCALVEMKDSKEA SVI+VIAQFPFMMSGMPRPVRA PAE
Subjt:  MGSATDEQYAKFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNFTEPINSSQCALVEMKDSKEANSVISVIAQFPFMMSGMPRPVRAYPAE

Query:  VEMFDDRPIKPGRKISFVWLGKDDPDFEVAKKMKCLTQRHVAEAAFLLKQQLVEEEKLAKQQQDALKANYKKYEIVEGVMTDGTARRLARRYNMRVADD
        VEMFDDRP+KPGRKISF WL  DDPDFEVA+++K L+++HVAEAAFL+KQ + EEEKLAKQQQ+ LK NYKKYEIV+ VM DGTARRLA+ YNMR++DD
Subjt:  VEMFDDRPIKPGRKISFVWLGKDDPDFEVAKKMKCLTQRHVAEAAFLLKQQLVEEEKLAKQQQDALKANYKKYEIVEGVMTDGTARRLARRYNMRVADD

A0A1S3B3J3 uncharacterized protein LOC1034855921.5e-8883.92Show/hide
Query:  MGSATDEQYAKFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNFTEPINSSQCALVEMKDSKEANSVISVIAQFPFMMSGMPRPVRAYPAE
        MGSA DEQ++ FEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVV+VHFIPN+TEPINSSQCALVEMKDSKEA SVI+VIAQFPFMMSGMPRPVRA PAE
Subjt:  MGSATDEQYAKFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNFTEPINSSQCALVEMKDSKEANSVISVIAQFPFMMSGMPRPVRAYPAE

Query:  VEMFDDRPIKPGRKISFVWLGKDDPDFEVAKKMKCLTQRHVAEAAFLLKQQLVEEEKLAKQQQDALKANYKKYEIVEGVMTDGTARRLARRYNMRVADD
         EMFDDRP+KPGRKISF WL  DDPDFEVA+++K LT++HVAEAAFLLKQ L EEEKLAKQQQ+ L+ NYKKYEIV+ VM DGTARRLA+ YNMRV+DD
Subjt:  VEMFDDRPIKPGRKISFVWLGKDDPDFEVAKKMKCLTQRHVAEAAFLLKQQLVEEEKLAKQQQDALKANYKKYEIVEGVMTDGTARRLARRYNMRVADD

A0A6J1DB90 uncharacterized protein LOC1110185373.4e-8581.91Show/hide
Query:  MGSATDEQYAKFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNFTEPINSSQCALVEMKDSKEANSVISVIAQFPFMMSGMPRPVRAYPAE
        MGSAT+EQ+A FEEKVKRTVYVDNLSPQVTEPV+RTALDQFGTVV V FIPN+ EP+NSSQCALVEMKD KEA SVISVIA+FPFM+SGMPRPVRA PAE
Subjt:  MGSATDEQYAKFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNFTEPINSSQCALVEMKDSKEANSVISVIAQFPFMMSGMPRPVRAYPAE

Query:  VEMFDDRPIKPGRKISFVWLGKDDPDFEVAKKMKCLTQRHVAEAAFLLKQQLVEEEKLAKQQQDALKANYKKYEIVEGVMTDGTARRLARRYNMRVADD
        VEMFDDRPIKPG KISF WL ++DPDF+VAKKMK  T++H AE AFLLKQQL+EEEKLAKQQQ+ LK NYKK+EIVE VMTDGTARRLA+ Y+MRV DD
Subjt:  VEMFDDRPIKPGRKISFVWLGKDDPDFEVAKKMKCLTQRHVAEAAFLLKQQLVEEEKLAKQQQDALKANYKKYEIVEGVMTDGTARRLARRYNMRVADD

A0A6J1H6P8 uncharacterized protein LOC1114606158.7e-10598.99Show/hide
Query:  MGSATDEQYAKFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNFTEPINSSQCALVEMKDSKEANSVISVIAQFPFMMSGMPRPVRAYPAE
        MGSATDEQY KFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNFTEPINSSQCALVEMKDSKEANSVIS+IAQFPFMMSGMPRPVRAYPAE
Subjt:  MGSATDEQYAKFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNFTEPINSSQCALVEMKDSKEANSVISVIAQFPFMMSGMPRPVRAYPAE

Query:  VEMFDDRPIKPGRKISFVWLGKDDPDFEVAKKMKCLTQRHVAEAAFLLKQQLVEEEKLAKQQQDALKANYKKYEIVEGVMTDGTARRLARRYNMRVADD
        VEMFDDRPIKPGRKISFVWLGKDDPDFEVAKKMKCLTQRHVAEAAFLLKQQLVEEEKLAKQQQDALKANYKKYEIVEGVMTDGTARRLARRYNMRVADD
Subjt:  VEMFDDRPIKPGRKISFVWLGKDDPDFEVAKKMKCLTQRHVAEAAFLLKQQLVEEEKLAKQQQDALKANYKKYEIVEGVMTDGTARRLARRYNMRVADD

A0A6J1KVQ1 uncharacterized protein LOC1114980195.7e-10497.99Show/hide
Query:  MGSATDEQYAKFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNFTEPINSSQCALVEMKDSKEANSVISVIAQFPFMMSGMPRPVRAYPAE
        MGSATDEQYAKFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNFTEPINSSQCALVEMKDSKEANSVISVI+QFPFMMSGMPRPVRAYPAE
Subjt:  MGSATDEQYAKFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNFTEPINSSQCALVEMKDSKEANSVISVIAQFPFMMSGMPRPVRAYPAE

Query:  VEMFDDRPIKPGRKISFVWLGKDDPDFEVAKKMKCLTQRHVAEAAFLLKQQLVEEEKLAKQQQDALKANYKKYEIVEGVMTDGTARRLARRYNMRVADD
        VEMFDDRPIKPGRKISFVWLGK+DPDFEVAKKMKCLTQRHVAEAAFLLKQQLVEEEKLAKQQQDALK NYKKYEI+EGVMTDGTARRLARRYNMRVADD
Subjt:  VEMFDDRPIKPGRKISFVWLGKDDPDFEVAKKMKCLTQRHVAEAAFLLKQQLVEEEKLAKQQQDALKANYKKYEIVEGVMTDGTARRLARRYNMRVADD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G05970.1 RNA-binding (RRM/RBD/RNP motifs) family protein1.2e-4546.67Show/hide
Query:  MGSATDEQYAKFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNFTEPINSSQCALVEMKDSKEANSVISVIAQFPFMMSGMPRPVRAYPAE
        M +  +  Y KF E+V+RTVYVD L+P  T PV+ +A +QFGTV  V FIPN+  P       LVEM++ +   +VIS ++Q PFM++GMPRPVRA  AE
Subjt:  MGSATDEQYAKFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNFTEPINSSQCALVEMKDSKEANSVISVIAQFPFMMSGMPRPVRAYPAE

Query:  VEMFDDRPIKPGRKISFVWLGKDDPDFEVAKKMKCLTQRHVAEAAFLLKQQLVEEEKLAKQQQDALKANYKKYEIVEGVMTDGTARRLARRYNMR
          MF D+P KPGR + F W+  +DPDF+ A+++K L ++H AE +F+LK  L E EKL+KQQ +    ++KK+E+++ ++ DG A++LA RY+++
Subjt:  VEMFDDRPIKPGRKISFVWLGKDDPDFEVAKKMKCLTQRHVAEAAFLLKQQLVEEEKLAKQQQDALKANYKKYEIVEGVMTDGTARRLARRYNMR

AT1G05970.2 RNA-binding (RRM/RBD/RNP motifs) family protein9.7e-4847.18Show/hide
Query:  MGSATDEQYAKFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNFTEPINSSQCALVEMKDSKEANSVISVIAQFPFMMSGMPRPVRAYPAE
        M +  +  Y KF E+V+RTVYVD L+P  T PV+ +A +QFGTV  V FIPN+  P       LVEM++ +   +VIS ++Q PFM++GMPRPVRA  AE
Subjt:  MGSATDEQYAKFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNFTEPINSSQCALVEMKDSKEANSVISVIAQFPFMMSGMPRPVRAYPAE

Query:  VEMFDDRPIKPGRKISFVWLGKDDPDFEVAKKMKCLTQRHVAEAAFLLKQQLVEEEKLAKQQQDALKANYKKYEIVEGVMTDGTARRLARRYNMR
          MF D+P KPGR + F W+  +DPDF+ A+++K L ++H AE +F+LK+QL E EKL+KQQ +    ++KK+E+++ ++ DG A++LA RY+++
Subjt:  VEMFDDRPIKPGRKISFVWLGKDDPDFEVAKKMKCLTQRHVAEAAFLLKQQLVEEEKLAKQQQDALKANYKKYEIVEGVMTDGTARRLARRYNMR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
CTGACAGCCATTGGAGGAGACCCCACCACCTCCACCTTTTTTCATCAAATTCATCTCCATTTCTCCACGCTCCGCTCCTCCGTTGTTTGTTCTACTGCTACAGGCTCATG
CCCCGCGCTCACCGTTGCTCATTTCGTTTCTCCGCTTCGGACTAGTTCTTTATCATCCCACATGGGATCAGCTACGGATGAACAATATGCCAAGTTTGAGGAAAAGGTAA
AACGGACTGTCTATGTTGATAATCTCTCCCCCCAAGTGACTGAGCCTGTCTTGAGAACCGCTTTAGATCAGTTTGGGACTGTTGTCAGTGTCCATTTTATTCCAAATTTC
ACGGAGCCAATTAATAGCTCTCAATGTGCTTTAGTCGAGATGAAGGACTCGAAGGAGGCAAATTCTGTCATCTCTGTGATAGCTCAGTTCCCTTTCATGATGTCTGGAAT
GCCGAGACCGGTGAGGGCGTACCCTGCCGAAGTGGAAATGTTCGATGATCGCCCTATAAAGCCCGGTAGGAAGATTAGCTTTGTCTGGTTGGGAAAGGATGATCCTGACT
TTGAAGTGGCAAAGAAAATGAAGTGTCTTACTCAGAGGCATGTCGCTGAAGCTGCATTCTTGCTGAAGCAACAGTTGGTGGAAGAGGAGAAGCTTGCAAAGCAGCAGCAA
GATGCACTGAAAGCAAACTACAAGAAATATGAGATCGTTGAAGGTGTAATGACTGATGGAACTGCCCGCAGGTTAGCAAGGCGTTATAATATGCGAGTTGCAGATGATTA
A
mRNA sequenceShow/hide mRNA sequence
CTGACAGCCATTGGAGGAGACCCCACCACCTCCACCTTTTTTCATCAAATTCATCTCCATTTCTCCACGCTCCGCTCCTCCGTTGTTTGTTCTACTGCTACAGGCTCATG
CCCCGCGCTCACCGTTGCTCATTTCGTTTCTCCGCTTCGGACTAGTTCTTTATCATCCCACATGGGATCAGCTACGGATGAACAATATGCCAAGTTTGAGGAAAAGGTAA
AACGGACTGTCTATGTTGATAATCTCTCCCCCCAAGTGACTGAGCCTGTCTTGAGAACCGCTTTAGATCAGTTTGGGACTGTTGTCAGTGTCCATTTTATTCCAAATTTC
ACGGAGCCAATTAATAGCTCTCAATGTGCTTTAGTCGAGATGAAGGACTCGAAGGAGGCAAATTCTGTCATCTCTGTGATAGCTCAGTTCCCTTTCATGATGTCTGGAAT
GCCGAGACCGGTGAGGGCGTACCCTGCCGAAGTGGAAATGTTCGATGATCGCCCTATAAAGCCCGGTAGGAAGATTAGCTTTGTCTGGTTGGGAAAGGATGATCCTGACT
TTGAAGTGGCAAAGAAAATGAAGTGTCTTACTCAGAGGCATGTCGCTGAAGCTGCATTCTTGCTGAAGCAACAGTTGGTGGAAGAGGAGAAGCTTGCAAAGCAGCAGCAA
GATGCACTGAAAGCAAACTACAAGAAATATGAGATCGTTGAAGGTGTAATGACTGATGGAACTGCCCGCAGGTTAGCAAGGCGTTATAATATGCGAGTTGCAGATGATTA
ATTGCATGCTTTGAAGCTTTTTTTGCACTCCTAATTGCCTGATCACGTAAATTTTGTACAAATTATCACTTAGATGGCCTTATAGAGATTAATTAGAGAATTCTGTTGTG
ATTTAGAGGGAGAAGGATAGTTAATTAGTGTAATACTCCTATTATTACTCTTTTGGATTGGTCTCTTGTTTGGGAAATGACTTCGGTTTAAAATCATCTAAAGACATTCG
TTCCTATAAGAGCTTGAGAACGGACAAAGGCATAGGTCGAACACCTCAAAGAAACAACTCTAAACTTTTAAGGACCCTATCCCATCGAAAACAGATATTGTCTTCTTTAG
GCTTTCCTTTCTGGGTTTCCCCTTAAGATTTTTACACTATCCTTCCAACCATTGTGTGATCTCA
Protein sequenceShow/hide protein sequence
LTAIGGDPTTSTFFHQIHLHFSTLRSSVVCSTATGSCPALTVAHFVSPLRTSSLSSHMGSATDEQYAKFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNF
TEPINSSQCALVEMKDSKEANSVISVIAQFPFMMSGMPRPVRAYPAEVEMFDDRPIKPGRKISFVWLGKDDPDFEVAKKMKCLTQRHVAEAAFLLKQQLVEEEKLAKQQQ
DALKANYKKYEIVEGVMTDGTARRLARRYNMRVADD