; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc01G14340 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc01G14340
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionRPA-interacting protein
Genome locationClcChr01:27170077..27174131
RNA-Seq ExpressionClc01G14340
SyntenyClc01G14340
Gene Ontology termsGO:0006606 - protein import into nucleus (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR028155 - RPA-interacting protein, central domain
IPR028156 - RPA-interacting protein
IPR028159 - RPA-interacting protein, C-terminal domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYJ99094.1 uncharacterized protein E5676_scaffold248G002980 [Cucumis melo var. makuwa]3.9e-10176.03Show/hide
Query:  EDFIKSAFQDIFSDELKKIKDQSGNDYDENLPSVPEAADVLWEYEGIQDAYQGEGEDILLEMQRIFYEDLNVDLRQK-----------------------
        +DFIKSAFQDIFSDELKKIKD+S ND++ENLPSVPEAADVLWEYEGI DAY+G+GE+ILLEMQRIFYEDLNVD+RQK                       
Subjt:  EDFIKSAFQDIFSDELKKIKDQSGNDYDENLPSVPEAADVLWEYEGIQDAYQGEGEDILLEMQRIFYEDLNVDLRQK-----------------------

Query:  -----AVKITILSRMFTQLFAESEGPIVTWEDEEDEFLARAVYEHMQLNSEKVLQKLWCPICKQGELQENNHFIHCTHCGLRLTKGNEVTLDILCCRLAD
             A K  ILSRMFTQ FAESEGPIVTWEDEEDEFLARAVYEHMQL++EK+L+K WCP+CKQGELQEN+HFIHCT CGLRL KGNEVTLD+L CRLAD
Subjt:  -----AVKITILSRMFTQLFAESEGPIVTWEDEEDEFLARAVYEHMQLNSEKVLQKLWCPICKQGELQENNHFIHCTHCGLRLTKGNEVTLDILCCRLAD

Query:  VHAEHLDRGCRLKPKFCVERRFNITALYISCEGCNTFETQKE
        VHAEHLDRGCRLKPKFCVE RFNITALYISCEGCNTFE + +
Subjt:  VHAEHLDRGCRLKPKFCVERRFNITALYISCEGCNTFETQKE

XP_008437393.1 PREDICTED: uncharacterized protein LOC103482820 [Cucumis melo]3.8e-9681.9Show/hide
Query:  EDFIKSAFQDIFSDELKKIKDQSGNDYDENLPSVPEAADVLWEYEGIQDAYQGEGEDILLEMQRIFYEDLNVDLRQKAVKITILSRMFTQLFAESEGPIV
        +DFIKSAFQDIFSDELKKIKD+S ND++ENLPSVPEAADVLWEYEGI DAY+G+GE+ILLEMQRIFYEDLNVD+RQK                ESEGPIV
Subjt:  EDFIKSAFQDIFSDELKKIKDQSGNDYDENLPSVPEAADVLWEYEGIQDAYQGEGEDILLEMQRIFYEDLNVDLRQKAVKITILSRMFTQLFAESEGPIV

Query:  TWEDEEDEFLARAVYEHMQLNSEKVLQKLWCPICKQGELQENNHFIHCTHCGLRLTKGNEVTLDILCCRLADVHAEHLDRGCRLKPKFCVERRFNITALY
        TWEDEEDEFLARAVYEHMQL++EK+L+K WCP+CKQGELQEN+HFIHCT CGLRL KGNEVTLD+L CRLADVHAEHLDRGCRLKPKFCVE RFNITALY
Subjt:  TWEDEEDEFLARAVYEHMQLNSEKVLQKLWCPICKQGELQENNHFIHCTHCGLRLTKGNEVTLDILCCRLADVHAEHLDRGCRLKPKFCVERRFNITALY

Query:  ISCEGCNTFE
        ISCEGCNTFE
Subjt:  ISCEGCNTFE

XP_011651131.1 uncharacterized protein LOC101216926 isoform X1 [Cucumis sativus]1.7e-9682.38Show/hide
Query:  EDFIKSAFQDIFSDELKKIKDQSGNDYDENLPSVPEAADVLWEYEGIQDAYQGEGEDILLEMQRIFYEDLNVDLRQKAVKITILSRMFTQLFAESEGPIV
        +D IKSAFQDIF+DELKKIKD+S NDY+ENLPSVPEAADVLWEYEGI DAY+G+GE+ILLEMQRIFYEDLNVDLRQK                ESE PIV
Subjt:  EDFIKSAFQDIFSDELKKIKDQSGNDYDENLPSVPEAADVLWEYEGIQDAYQGEGEDILLEMQRIFYEDLNVDLRQKAVKITILSRMFTQLFAESEGPIV

Query:  TWEDEEDEFLARAVYEHMQLNSEKVLQKLWCPICKQGELQENNHFIHCTHCGLRLTKGNEVTLDILCCRLADVHAEHLDRGCRLKPKFCVERRFNITALY
        TWEDEEDEFLARAVYEHMQL++EK+L+K WCP+CKQGELQENNHFIHCTHCGLRL KGNEVTLD+L CRLADVHAEHLDRGCRLKPKFCVE RFNITALY
Subjt:  TWEDEEDEFLARAVYEHMQLNSEKVLQKLWCPICKQGELQENNHFIHCTHCGLRLTKGNEVTLDILCCRLADVHAEHLDRGCRLKPKFCVERRFNITALY

Query:  ISCEGCNTFE
        ISCEGCNTFE
Subjt:  ISCEGCNTFE

XP_031737337.1 uncharacterized protein LOC101216926 isoform X4 [Cucumis sativus]1.7e-9682.38Show/hide
Query:  EDFIKSAFQDIFSDELKKIKDQSGNDYDENLPSVPEAADVLWEYEGIQDAYQGEGEDILLEMQRIFYEDLNVDLRQKAVKITILSRMFTQLFAESEGPIV
        +D IKSAFQDIF+DELKKIKD+S NDY+ENLPSVPEAADVLWEYEGI DAY+G+GE+ILLEMQRIFYEDLNVDLRQK                ESE PIV
Subjt:  EDFIKSAFQDIFSDELKKIKDQSGNDYDENLPSVPEAADVLWEYEGIQDAYQGEGEDILLEMQRIFYEDLNVDLRQKAVKITILSRMFTQLFAESEGPIV

Query:  TWEDEEDEFLARAVYEHMQLNSEKVLQKLWCPICKQGELQENNHFIHCTHCGLRLTKGNEVTLDILCCRLADVHAEHLDRGCRLKPKFCVERRFNITALY
        TWEDEEDEFLARAVYEHMQL++EK+L+K WCP+CKQGELQENNHFIHCTHCGLRL KGNEVTLD+L CRLADVHAEHLDRGCRLKPKFCVE RFNITALY
Subjt:  TWEDEEDEFLARAVYEHMQLNSEKVLQKLWCPICKQGELQENNHFIHCTHCGLRLTKGNEVTLDILCCRLADVHAEHLDRGCRLKPKFCVERRFNITALY

Query:  ISCEGCNTFE
        ISCEGCNTFE
Subjt:  ISCEGCNTFE

XP_038893884.1 uncharacterized protein LOC120082685 isoform X1 [Benincasa hispida]6.2e-9984.29Show/hide
Query:  EDFIKSAFQDIFSDELKKIKDQSGNDYDENLPSVPEAADVLWEYEGIQDAYQGEGEDILLEMQRIFYEDLNVDLRQKAVKITILSRMFTQLFAESEGPIV
        +DFIKSAFQDIFSDELKKIKDQS NDY++NLPSVPEAADVLWEYEG+QDAY+GEGE+ILLEMQRIFYEDLN+DLR K                ESEGPIV
Subjt:  EDFIKSAFQDIFSDELKKIKDQSGNDYDENLPSVPEAADVLWEYEGIQDAYQGEGEDILLEMQRIFYEDLNVDLRQKAVKITILSRMFTQLFAESEGPIV

Query:  TWEDEEDEFLARAVYEHMQLNSEKVLQKLWCPICKQGELQENNHFIHCTHCGLRLTKGNEVTLDILCCRLADVHAEHLDRGCRLKPKFCVERRFNITALY
        TWEDEEDEFLARAVYEHM+LNSEKVL+K WCPICKQGELQENNHFIHCTHCGLRLTKGNEVTLD+L CRLADVHAEHLDRGCRLKP FCVE +FNITALY
Subjt:  TWEDEEDEFLARAVYEHMQLNSEKVLQKLWCPICKQGELQENNHFIHCTHCGLRLTKGNEVTLDILCCRLADVHAEHLDRGCRLKPKFCVERRFNITALY

Query:  ISCEGCNTFE
        ISCEGCNTFE
Subjt:  ISCEGCNTFE

TrEMBL top hitse value%identityAlignment
A0A0A0LTY4 Uncharacterized protein8.2e-9782.38Show/hide
Query:  EDFIKSAFQDIFSDELKKIKDQSGNDYDENLPSVPEAADVLWEYEGIQDAYQGEGEDILLEMQRIFYEDLNVDLRQKAVKITILSRMFTQLFAESEGPIV
        +D IKSAFQDIF+DELKKIKD+S NDY+ENLPSVPEAADVLWEYEGI DAY+G+GE+ILLEMQRIFYEDLNVDLRQK                ESE PIV
Subjt:  EDFIKSAFQDIFSDELKKIKDQSGNDYDENLPSVPEAADVLWEYEGIQDAYQGEGEDILLEMQRIFYEDLNVDLRQKAVKITILSRMFTQLFAESEGPIV

Query:  TWEDEEDEFLARAVYEHMQLNSEKVLQKLWCPICKQGELQENNHFIHCTHCGLRLTKGNEVTLDILCCRLADVHAEHLDRGCRLKPKFCVERRFNITALY
        TWEDEEDEFLARAVYEHMQL++EK+L+K WCP+CKQGELQENNHFIHCTHCGLRL KGNEVTLD+L CRLADVHAEHLDRGCRLKPKFCVE RFNITALY
Subjt:  TWEDEEDEFLARAVYEHMQLNSEKVLQKLWCPICKQGELQENNHFIHCTHCGLRLTKGNEVTLDILCCRLADVHAEHLDRGCRLKPKFCVERRFNITALY

Query:  ISCEGCNTFE
        ISCEGCNTFE
Subjt:  ISCEGCNTFE

A0A1S3AU18 uncharacterized protein LOC1034828201.8e-9681.9Show/hide
Query:  EDFIKSAFQDIFSDELKKIKDQSGNDYDENLPSVPEAADVLWEYEGIQDAYQGEGEDILLEMQRIFYEDLNVDLRQKAVKITILSRMFTQLFAESEGPIV
        +DFIKSAFQDIFSDELKKIKD+S ND++ENLPSVPEAADVLWEYEGI DAY+G+GE+ILLEMQRIFYEDLNVD+RQK                ESEGPIV
Subjt:  EDFIKSAFQDIFSDELKKIKDQSGNDYDENLPSVPEAADVLWEYEGIQDAYQGEGEDILLEMQRIFYEDLNVDLRQKAVKITILSRMFTQLFAESEGPIV

Query:  TWEDEEDEFLARAVYEHMQLNSEKVLQKLWCPICKQGELQENNHFIHCTHCGLRLTKGNEVTLDILCCRLADVHAEHLDRGCRLKPKFCVERRFNITALY
        TWEDEEDEFLARAVYEHMQL++EK+L+K WCP+CKQGELQEN+HFIHCT CGLRL KGNEVTLD+L CRLADVHAEHLDRGCRLKPKFCVE RFNITALY
Subjt:  TWEDEEDEFLARAVYEHMQLNSEKVLQKLWCPICKQGELQENNHFIHCTHCGLRLTKGNEVTLDILCCRLADVHAEHLDRGCRLKPKFCVERRFNITALY

Query:  ISCEGCNTFE
        ISCEGCNTFE
Subjt:  ISCEGCNTFE

A0A5D3BJ40 Uncharacterized protein1.9e-10176.03Show/hide
Query:  EDFIKSAFQDIFSDELKKIKDQSGNDYDENLPSVPEAADVLWEYEGIQDAYQGEGEDILLEMQRIFYEDLNVDLRQK-----------------------
        +DFIKSAFQDIFSDELKKIKD+S ND++ENLPSVPEAADVLWEYEGI DAY+G+GE+ILLEMQRIFYEDLNVD+RQK                       
Subjt:  EDFIKSAFQDIFSDELKKIKDQSGNDYDENLPSVPEAADVLWEYEGIQDAYQGEGEDILLEMQRIFYEDLNVDLRQK-----------------------

Query:  -----AVKITILSRMFTQLFAESEGPIVTWEDEEDEFLARAVYEHMQLNSEKVLQKLWCPICKQGELQENNHFIHCTHCGLRLTKGNEVTLDILCCRLAD
             A K  ILSRMFTQ FAESEGPIVTWEDEEDEFLARAVYEHMQL++EK+L+K WCP+CKQGELQEN+HFIHCT CGLRL KGNEVTLD+L CRLAD
Subjt:  -----AVKITILSRMFTQLFAESEGPIVTWEDEEDEFLARAVYEHMQLNSEKVLQKLWCPICKQGELQENNHFIHCTHCGLRLTKGNEVTLDILCCRLAD

Query:  VHAEHLDRGCRLKPKFCVERRFNITALYISCEGCNTFETQKE
        VHAEHLDRGCRLKPKFCVE RFNITALYISCEGCNTFE + +
Subjt:  VHAEHLDRGCRLKPKFCVERRFNITALYISCEGCNTFETQKE

A0A6J1C5H3 uncharacterized protein LOC111008573 isoform X37.4e-9078.1Show/hide
Query:  EDFIKSAFQDIFSDELKKIKDQSGNDYDENLPSVPEAADVLWEYEGIQDAYQGEGEDILLEMQRIFYEDLNVDLRQKAVKITILSRMFTQLFAESEGPIV
        +DF+KSAFQDIFSDELKKIKDQS NDYD  LPS PE  DVLWEYEGIQDAY+GEGE+ILLEMQRIFYEDL+VD+RQK                 SEGPIV
Subjt:  EDFIKSAFQDIFSDELKKIKDQSGNDYDENLPSVPEAADVLWEYEGIQDAYQGEGEDILLEMQRIFYEDLNVDLRQKAVKITILSRMFTQLFAESEGPIV

Query:  TWEDEEDEFLARAVYEHMQLNSEKVLQKLWCPICKQGELQENNHFIHCTHCGLRLTKGNEVTLDILCCRLADVHAEHLDRGCRLKPKFCVERRFNITALY
        TWEDEEDEFLARAVYEHMQLNSEKV +K WCPICKQGEL ENN  IHCTHCGL+L+K NEVT+D+L  RLADVHAEHLDRGCRLKP FCVE RF+ITALY
Subjt:  TWEDEEDEFLARAVYEHMQLNSEKVLQKLWCPICKQGELQENNHFIHCTHCGLRLTKGNEVTLDILCCRLADVHAEHLDRGCRLKPKFCVERRFNITALY

Query:  ISCEGCNTFE
        ISCEGC TFE
Subjt:  ISCEGCNTFE

A0A6J1C935 uncharacterized protein LOC111008573 isoform X17.4e-9078.1Show/hide
Query:  EDFIKSAFQDIFSDELKKIKDQSGNDYDENLPSVPEAADVLWEYEGIQDAYQGEGEDILLEMQRIFYEDLNVDLRQKAVKITILSRMFTQLFAESEGPIV
        +DF+KSAFQDIFSDELKKIKDQS NDYD  LPS PE  DVLWEYEGIQDAY+GEGE+ILLEMQRIFYEDL+VD+RQK                 SEGPIV
Subjt:  EDFIKSAFQDIFSDELKKIKDQSGNDYDENLPSVPEAADVLWEYEGIQDAYQGEGEDILLEMQRIFYEDLNVDLRQKAVKITILSRMFTQLFAESEGPIV

Query:  TWEDEEDEFLARAVYEHMQLNSEKVLQKLWCPICKQGELQENNHFIHCTHCGLRLTKGNEVTLDILCCRLADVHAEHLDRGCRLKPKFCVERRFNITALY
        TWEDEEDEFLARAVYEHMQLNSEKV +K WCPICKQGEL ENN  IHCTHCGL+L+K NEVT+D+L  RLADVHAEHLDRGCRLKP FCVE RF+ITALY
Subjt:  TWEDEEDEFLARAVYEHMQLNSEKVLQKLWCPICKQGELQENNHFIHCTHCGLRLTKGNEVTLDILCCRLADVHAEHLDRGCRLKPKFCVERRFNITALY

Query:  ISCEGCNTFE
        ISCEGC TFE
Subjt:  ISCEGCNTFE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G12760.1 unknown protein6.1e-5249.3Show/hide
Query:  EDEDFIKSAFQDIFSDELKKIKDQSGNDYDENLPSVPEAADVLWEY-EGIQDAYQGEGEDILLEMQRIFYEDLNVDLRQKAVKITILSRMFTQLFAESEG
        + ++ I  AFQDI SDELKKI+D S N    N     +  D+LWEY EG++  Y+G+ E+ILLEMQ+IFY+DL        +  T ++  F Q       
Subjt:  EDEDFIKSAFQDIFSDELKKIKDQSGNDYDENLPSVPEAADVLWEY-EGIQDAYQGEGEDILLEMQRIFYEDLNVDLRQKAVKITILSRMFTQLFAESEG

Query:  PIVTWEDEEDEFLARAVYEHMQLNSEKVLQKLWCPICKQGELQENNHFIHCTHCGLRLTKGNEVTLDILCCRLADVHAEHLDRGCRLKPKFCVERRFNIT
         + TWEDEED++LA  V ++M LNSE+   ++WCPICK+GEL EN+  I C  C ++L KG EV L+IL  RLA+ H EHL RGCRLKP+F V+  +N+ 
Subjt:  PIVTWEDEEDEFLARAVYEHMQLNSEKVLQKLWCPICKQGELQENNHFIHCTHCGLRLTKGNEVTLDILCCRLADVHAEHLDRGCRLKPKFCVERRFNIT

Query:  ALYISCEGCNTFE
        ALYI+CE C TFE
Subjt:  ALYISCEGCNTFE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCACACCGTCGACATGAAAATGACCAATCACAATCACAAGTCGAGGGCAAATTTTGGCACCGCAATCGTAAAATAAAACCCTCAAAATGGAAGACGATGATCCAAAT
TCCGTCATTGAAACCCACCGTTCTTCGATTAAGATCCATCCTCATTACAACAATCAACAGTCATGGAAGCAGAAGGTCAATTTTCTCTTTTCTCGATGTATTGTTCTGTG
TTTTTGGCGGGTTCTTGTTTTTATTCGAAACTGAAATTTTACTATCGATAGCTGAGGGAAAACTGTTGCAAGAGAGTCAGAGAAGGAAGAAACCGCTTGCTCTGGAAGAT
GAGGATTTTATCAAATCTGCTTTTCAAGACATCTTTTCTGATGAGCTGAAAAAAATTAAAGACCAATCTGGGAATGATTATGATGAGAATTTACCTTCTGTCCCTGAGGC
TGCTGATGTTCTTTGGGAATATGAGGGGATTCAGGATGCTTATCAAGGTGAAGGTGAAGACATATTGTTGGAAATGCAAAGGATTTTTTATGAGGATCTGAATGTTGATC
TGAGACAGAAAGCTGTTAAAATCACTATTTTGAGTAGGATGTTTACTCAGCTCTTTGCAGAATCTGAAGGCCCTATTGTAACATGGGAAGATGAAGAAGACGAGTTCTTA
GCCCGTGCTGTTTACGAACACATGCAACTTAATAGTGAGAAGGTTCTTCAGAAGTTATGGTGTCCTATATGTAAACAAGGAGAGCTGCAAGAGAACAACCACTTCATACA
TTGTACTCATTGTGGACTTCGGCTTACCAAAGGCAATGAGGTTACTCTGGACATCCTCTGTTGTCGGTTGGCCGATGTGCATGCTGAACATCTCGATCGGGGCTGTAGAT
TGAAGCCTAAGTTTTGTGTTGAGAGGAGATTTAACATAACTGCATTGTACATCTCTTGTGAAGGTTGCAACACATTTGAGACACAAAAAGAACATTGTTGCAGTCCAAAC
TCTAAGGAACATGATAATGGGTTCAAGCCTGATGGCGACCACCTTGATCCTTCTCTGCGCCGGGCTCGCCGCGGTCCTCAACAGCGCGTACAGCATCAAAAAGCCGGCGA
CCAGTACCGTGCAGATACAGCATCAAGAAGATTGTGCTATTTTGGAGTGATGATTTTTGTGGTTTATAATCTTGACTTTGTGTGTAAGAAGAGGAGTAATGGGAAGAGGA
TGAATAATCATGAGGTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCACACCGTCGACATGAAAATGACCAATCACAATCACAAGTCGAGGGCAAATTTTGGCACCGCAATCGTAAAATAAAACCCTCAAAATGGAAGACGATGATCCAAAT
TCCGTCATTGAAACCCACCGTTCTTCGATTAAGATCCATCCTCATTACAACAATCAACAGTCATGGAAGCAGAAGGTCAATTTTCTCTTTTCTCGATGTATTGTTCTGTG
TTTTTGGCGGGTTCTTGTTTTTATTCGAAACTGAAATTTTACTATCGATAGCTGAGGGAAAACTGTTGCAAGAGAGTCAGAGAAGGAAGAAACCGCTTGCTCTGGAAGAT
GAGGATTTTATCAAATCTGCTTTTCAAGACATCTTTTCTGATGAGCTGAAAAAAATTAAAGACCAATCTGGGAATGATTATGATGAGAATTTACCTTCTGTCCCTGAGGC
TGCTGATGTTCTTTGGGAATATGAGGGGATTCAGGATGCTTATCAAGGTGAAGGTGAAGACATATTGTTGGAAATGCAAAGGATTTTTTATGAGGATCTGAATGTTGATC
TGAGACAGAAAGCTGTTAAAATCACTATTTTGAGTAGGATGTTTACTCAGCTCTTTGCAGAATCTGAAGGCCCTATTGTAACATGGGAAGATGAAGAAGACGAGTTCTTA
GCCCGTGCTGTTTACGAACACATGCAACTTAATAGTGAGAAGGTTCTTCAGAAGTTATGGTGTCCTATATGTAAACAAGGAGAGCTGCAAGAGAACAACCACTTCATACA
TTGTACTCATTGTGGACTTCGGCTTACCAAAGGCAATGAGGTTACTCTGGACATCCTCTGTTGTCGGTTGGCCGATGTGCATGCTGAACATCTCGATCGGGGCTGTAGAT
TGAAGCCTAAGTTTTGTGTTGAGAGGAGATTTAACATAACTGCATTGTACATCTCTTGTGAAGGTTGCAACACATTTGAGACACAAAAAGAACATTGTTGCAGTCCAAAC
TCTAAGGAACATGATAATGGGTTCAAGCCTGATGGCGACCACCTTGATCCTTCTCTGCGCCGGGCTCGCCGCGGTCCTCAACAGCGCGTACAGCATCAAAAAGCCGGCGA
CCAGTACCGTGCAGATACAGCATCAAGAAGATTGTGCTATTTTGGAGTGATGATTTTTGTGGTTTATAATCTTGACTTTGTGTGTAAGAAGAGGAGTAATGGGAAGAGGA
TGAATAATCATGAGGTTTGA
Protein sequenceShow/hide protein sequence
MSHRRHENDQSQSQVEGKFWHRNRKIKPSKWKTMIQIPSLKPTVLRLRSILITTINSHGSRRSIFSFLDVLFCVFGGFLFLFETEILLSIAEGKLLQESQRRKKPLALED
EDFIKSAFQDIFSDELKKIKDQSGNDYDENLPSVPEAADVLWEYEGIQDAYQGEGEDILLEMQRIFYEDLNVDLRQKAVKITILSRMFTQLFAESEGPIVTWEDEEDEFL
ARAVYEHMQLNSEKVLQKLWCPICKQGELQENNHFIHCTHCGLRLTKGNEVTLDILCCRLADVHAEHLDRGCRLKPKFCVERRFNITALYISCEGCNTFETQKEHCCSPN
SKEHDNGFKPDGDHLDPSLRRARRGPQQRVQHQKAGDQYRADTASRRLCYFGVMIFVVYNLDFVCKKRSNGKRMNNHEV