CuGenDBv2

Clc01G14660 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc01G14660
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: DNA-binding protein RHL1-like
Genome location: ClcChr01:27472512..27482859
RNA-Seq Expression: Clc01G14660
Synteny: Clc01G14660
Gene Ontology terms:
GO:0042023 - DNA endoreduplication (biological process)
GO:0003677 - DNA binding (molecular function)
InterPro domains: IPR038859 - DNA-binding protein RHL1


Homology
GenBank top hits (e value, %identity, alignment)
TYJ99067.1 DNA-binding protein RHL1 isoform X2 [Cucumis melo var. makuwa] | e value: 6.9e-173 | %identity: 72.55
Query:  MARGSSSSKRDEAKGEIDPEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGKDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPI
        MARGSSSSK+DEAKGEI+PEI  RKRLKKLAFSN+ILSETQAKPQAYLSPSATVLKHHGKDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDL TKNP+
Subjt:  MARGSSSSKRDEAKGEIDPEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGKDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPI

Query:  LYLDFPQGRMKLFGTIMYPKNKYLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEACLDFPKELTTGKCGEYDFNSGAGVASTSGGAGVTSSS
        LYLDFPQGRMKLFGTIMYPKN+YLTLQFS+GGKNV CED FDNMIVFSDAWWIGTKDENPEEACLDFPKELT G+CGEYDFN         GGAGVTS+S
Subjt:  LYLDFPQGRMKLFGTIMYPKNKYLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEACLDFPKELTTGKCGEYDFNSGAGVASTSGGAGVTSSS

Query:  KQSVKNKGINPAVENSFKGEDGDDLLDLEENVTNSIKTTPVRHSERSAGKLVLGLRRWPLFQREHSVSLWCLLFSESFRKKKCPLKFFTFLEKVPPVCTI
        KQSV+ KGINPA ENSFKGE GDDL+ LE +VTNS+KT PVRHSERSA K+                                                 
Subjt:  KQSVKNKGINPAVENSFKGEDGDDLLDLEENVTNSIKTTPVRHSERSAGKLVLGLRRWPLFQREHSVSLWCLLFSESFRKKKCPLKFFTFLEKVPPVCTI

Query:  TTNSLSTISDQIFAEASSEDESAGTYDDLSEGEEKNILIHEPSIGDHAKDLSVESIDEDAVEIRPPVLEGNQTSISKGEKSSRATGNAQSDARGLVQPTL
                    FAEASSEDES GT  DLSEGEEKNI+IHEPSIGDHAKD+SVESIDEDAVEI+P  LEGNQTSISK +K+SRA G+AQSD RGLVQPTL
Subjt:  TTNSLSTISDQIFAEASSEDESAGTYDDLSEGEEKNILIHEPSIGDHAKDLSVESIDEDAVEIRPPVLEGNQTSISKGEKSSRATGNAQSDARGLVQPTL

Query:  LSLFKKVEEKVISEDLYLICRIRLLRTPRSSKRSSTPKVSAQKMQLSGSKRKIDQDEGSKKRRAVRGQDD
        LSLFKKVEEK               RTPRSSKRSS PKVS QKMQLSGSK+KIDQDEGSKKRRAVRGQ D
Subjt:  LSLFKKVEEKVISEDLYLICRIRLLRTPRSSKRSSTPKVSAQKMQLSGSKRKIDQDEGSKKRRAVRGQDD

XP_004137530.1 DNA-binding protein RHL1 isoform X2 [Cucumis sativus] | e value: 1.5e-183 | %identity: 73.9
Query:  MARGSSSSKRDEAKGEIDPEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGKDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPI
        MARG SSSK+DEAKGEI+PEIA RKRLKKLAFSNHILSETQA+PQAYLSPSATVLKHHGKDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDL TKNPI
Subjt:  MARGSSSSKRDEAKGEIDPEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGKDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPI

Query:  LYLDFPQGRMKLFGTIMYPKNKYLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEACLDFPKELTTGKCGEYDFNSGAGVASTSGGAGVTSSS
        LYLDFPQGRMKLFGTIMYPKN+YLTLQFSRGGKNV CED FDNMIVFSDAWWIGTKDENPEEACLDFPK+LT G+CGEYDFN GAGV STSG AGVTS+S
Subjt:  LYLDFPQGRMKLFGTIMYPKNKYLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEACLDFPKELTTGKCGEYDFNSGAGVASTSGGAGVTSSS

Query:  KQSVKNKGINPAVENSFKGEDGDDLLDLEENVTNSIKTTPVRHSERSAGKLVLGLRRWPLFQREHSVSLWCLLFSESFRKKKCPLKFFTFLEKVPPVCTI
        KQSV+ KGINPA ENSFKGE GDDL+ LE +VTNSIKTTPVRHSERSA K+                                                 
Subjt:  KQSVKNKGINPAVENSFKGEDGDDLLDLEENVTNSIKTTPVRHSERSAGKLVLGLRRWPLFQREHSVSLWCLLFSESFRKKKCPLKFFTFLEKVPPVCTI

Query:  TTNSLSTISDQIFAEASSEDESAGTYDDLSEGEEKNILIHEPSIGDHA----KDLSVESIDEDAVEIRPPVLEGNQTSISKGEKSSRATGNAQSDARGLV
                    FAEASSEDESAGT  DLSEGEEKNI+IHEPSIGDHA    +D+SVESIDEDAV+I+PP LEGNQTSISK +KS RA G+AQSD RGLV
Subjt:  TTNSLSTISDQIFAEASSEDESAGTYDDLSEGEEKNILIHEPSIGDHA----KDLSVESIDEDAVEIRPPVLEGNQTSISKGEKSSRATGNAQSDARGLV

Query:  QPTLLSLFKKVEEKVISEDLYLICRIRLLRTPRSSKRSSTPKVSAQKMQLSGSKRKIDQDEGSKKRRAVRGQDDGEVQKKDTEYEVEDEIEESSSSQE
        QPTLLSLFKKVEEK               RTPRSSKRSS PKVS QKMQLSGSK+KIDQDEGSKKRR VRGQ  G+ QKKDTEYEVEDEIE+ SSSQE
Subjt:  QPTLLSLFKKVEEKVISEDLYLICRIRLLRTPRSSKRSSTPKVSAQKMQLSGSKRKIDQDEGSKKRRAVRGQDDGEVQKKDTEYEVEDEIEESSSSQE

XP_008467323.1 PREDICTED: DNA-binding protein RHL1 isoform X2 [Cucumis melo] | e value: 6.4e-179 | %identity: 72.09
Query:  MARGSSSSKRDEAKGEIDPEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGKDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPI
        MARGSSSSK+DEAKGEI+PEI  RKRLKKLAFSN+ILSETQAKPQAYLSPSATVLKHHGKDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDL TKNP+
Subjt:  MARGSSSSKRDEAKGEIDPEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGKDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPI

Query:  LYLDFPQGRMKLFGTIMYPKNKYLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEACLDFPKELTTGKCGEYDFNSGAGVASTSGGAGVTSSS
        LYLDFPQGRMKLFGTIMYPKN+YLTLQFS+GGKNV CED FDNMIVFSDAWWIGTKDENPEEACLDFPKELT G+CGEYDFN         GGAGVTS+S
Subjt:  LYLDFPQGRMKLFGTIMYPKNKYLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEACLDFPKELTTGKCGEYDFNSGAGVASTSGGAGVTSSS

Query:  KQSVKNKGINPAVENSFKGEDGDDLLDLEENVTNSIKTTPVRHSERSAGKLVLGLRRWPLFQREHSVSLWCLLFSESFRKKKCPLKFFTFLEKVPPVCTI
        KQSV+ KGINPA ENSFKGE GDDL+ LE +VTNS+KT PVRHSERSA K+                                                 
Subjt:  KQSVKNKGINPAVENSFKGEDGDDLLDLEENVTNSIKTTPVRHSERSAGKLVLGLRRWPLFQREHSVSLWCLLFSESFRKKKCPLKFFTFLEKVPPVCTI

Query:  TTNSLSTISDQIFAEASSEDESAGTYDDLSEGEEKNILIHEPSIGDHA----KDLSVESIDEDAVEIRPPVLEGNQTSISKGEKSSRATGNAQSDARGLV
                    FAEASSEDES GT  DLSEGEEKNI+IHEPSIGDHA    +D+SVESIDEDAVEI+P  LEGNQTSISK +K+SRA G+AQSD RGLV
Subjt:  TTNSLSTISDQIFAEASSEDESAGTYDDLSEGEEKNILIHEPSIGDHA----KDLSVESIDEDAVEIRPPVLEGNQTSISKGEKSSRATGNAQSDARGLV

Query:  QPTLLSLFKKVEEKVISEDLYLICRIRLLRTPRSSKRSSTPKVSAQKMQLSGSKRKIDQDEGSKKRRAVRGQDDGEVQKKDTEYEVEDEIEESSSSQE
        QPTLLSLFKKVEEK               RTPRSSKRSS PKVS QKMQLSGSK+KIDQDEGSKKRRAVRGQ  G+ Q+KDTEYEVEDEIEE SSSQE
Subjt:  QPTLLSLFKKVEEKVISEDLYLICRIRLLRTPRSSKRSSTPKVSAQKMQLSGSKRKIDQDEGSKKRRAVRGQDDGEVQKKDTEYEVEDEIEESSSSQE

XP_031744387.1 DNA-binding protein RHL1 isoform X1 [Cucumis sativus] | e value: 1.1e-181 | %identity: 73.16
Query:  MARGSSSSKRDEAKGEIDPEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGKDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPI
        MARG SSSK+DEAKGEI+PEIA RKRLKKLAFSNHILSETQA+PQAYLSPSATVLKHHGKDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDL TKNPI
Subjt:  MARGSSSSKRDEAKGEIDPEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGKDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPI

Query:  LYLDFPQGRMKLFGTIMYPKNKYLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEACLDFPKELTTGKCGEYDFNSGAGVASTSGGAGVTSSS
        LYLDFPQGRMKLFGTIMYPKN+YLTLQFSRGGKNV CED FDNMIVFSDAWWIGTKDENPEEACLDFPK+LT G+CGEYDFN GAGV STSG AGVTS+S
Subjt:  LYLDFPQGRMKLFGTIMYPKNKYLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEACLDFPKELTTGKCGEYDFNSGAGVASTSGGAGVTSSS

Query:  KQSVKNKGINPAVENSFKGEDGDDLLDLEENVTNSIKTTPVRHSERSAGKLVLGLRRWPLFQREHSVSLWCLLFSESFRKKKCPLKFFTFLEKVPPVCTI
        KQSV+ KGINPA ENSFKGE GDDL+ LE +VTNSIKTTPVRHSERSA K+                                                 
Subjt:  KQSVKNKGINPAVENSFKGEDGDDLLDLEENVTNSIKTTPVRHSERSAGKLVLGLRRWPLFQREHSVSLWCLLFSESFRKKKCPLKFFTFLEKVPPVCTI

Query:  TTNSLSTISDQIFAEASSEDESAGTYDDLSEGEEKNILIHEPSIGDHA----KDLSVESIDEDAVEIRPPVLEGNQTSISKGEKSSRATGNAQSDARGLV
                    FAEASSEDESAGT  DLSEGEEKNI+IHEPSIGDHA    +D+SVESIDEDAV+I+PP LEGNQTSISK +KS RA G+AQSD RGLV
Subjt:  TTNSLSTISDQIFAEASSEDESAGTYDDLSEGEEKNILIHEPSIGDHA----KDLSVESIDEDAVEIRPPVLEGNQTSISKGEKSSRATGNAQSDARGLV

Query:  QPTLLSLFKKVEEKVISEDLYLICRIRLLRTPRSSKRSSTPKVSAQKMQLSGSKRKIDQDEGSKKRRAVRGQDDGEVQKKDTEY-----EVEDEIEESSS
        QPTLLSLFKKVEEK               RTPRSSKRSS PKVS QKMQLSGSK+KIDQDEGSKKRR VRGQ  G+ QKKDTEY     EVEDEIE+ SS
Subjt:  QPTLLSLFKKVEEKVISEDLYLICRIRLLRTPRSSKRSSTPKVSAQKMQLSGSKRKIDQDEGSKKRRAVRGQDDGEVQKKDTEY-----EVEDEIEESSS

Query:  SQE
        SQE
Subjt:  SQE

XP_038893754.1 DNA-binding protein RHL1 isoform X1 [Benincasa hispida] | e value: 2.0e-180 | %identity: 73.15
Query:  MARGSSSSKRDEAKGEIDPEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGKDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPI
        MARGSSSSKRDEAKGEIDP IAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGKDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDL TKNPI
Subjt:  MARGSSSSKRDEAKGEIDPEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGKDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPI

Query:  LYLDFPQGRMKLFGTIMYPKNKYLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEACLDFPKELTTGKCGEYDFNSGAGVASTSGGAGVTSSS
        LYLDFPQGRMKLFGTIMYPKN+YLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEA LDFP ELTTG+CGE DFN         GGAGVT  S
Subjt:  LYLDFPQGRMKLFGTIMYPKNKYLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEACLDFPKELTTGKCGEYDFNSGAGVASTSGGAGVTSSS

Query:  KQSVKNKGINPAVENSFKGEDGDDLLDLEENVTNSIKTTPVRHSERSAGKLVLGLRRWPLFQREHSVSLWCLLFSESFRKKKCPLKFFTFLEKVPPVCTI
        KQSV+ KGINPAVENS KGE GDDL+DL++NVTNSIKTTPVRHSERSA K+                                                 
Subjt:  KQSVKNKGINPAVENSFKGEDGDDLLDLEENVTNSIKTTPVRHSERSAGKLVLGLRRWPLFQREHSVSLWCLLFSESFRKKKCPLKFFTFLEKVPPVCTI

Query:  TTNSLSTISDQIFAEASSEDESAGTYDDLSEGEEKNILIHEPSIGDHAK----DLSVESIDEDAVEIRPPVLEGNQTSISKGEKSSRATGNAQSDARGLV
                    FAE SSEDES  TY DLSEGEEKNI+IHEPSIGDHA+    DLSV+S+DEDA EIRPP LEGNQTSIS  +KSS A G+AQSD RGLV
Subjt:  TTNSLSTISDQIFAEASSEDESAGTYDDLSEGEEKNILIHEPSIGDHAK----DLSVESIDEDAVEIRPPVLEGNQTSISKGEKSSRATGNAQSDARGLV

Query:  QPTLLSLFKKVEEKVISEDLYLICRIRLLRTPRSSKRSSTPKVSAQKMQLSGSKRKIDQDEGSKKRRAVRGQDD-GEVQKKDTEYEVEDEIEESSSSQE
        QPTLLSLFKKVEEK               RT RSSKRSSTPKVS QKMQLSGSKRKIDQDEG +KRRAVRGQDD G++QKKDTEYEV+D+IEE SSSQE
Subjt:  QPTLLSLFKKVEEKVISEDLYLICRIRLLRTPRSSKRSSTPKVSAQKMQLSGSKRKIDQDEGSKKRRAVRGQDD-GEVQKKDTEYEVEDEIEESSSSQE

TrEMBL top hits (e value, %identity, alignment)
A0A0A0LVZ6 Uncharacterized protein | e value: 7.2e-184 | %identity: 73.9
Query:  MARGSSSSKRDEAKGEIDPEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGKDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPI
        MARG SSSK+DEAKGEI+PEIA RKRLKKLAFSNHILSETQA+PQAYLSPSATVLKHHGKDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDL TKNPI
Subjt:  MARGSSSSKRDEAKGEIDPEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGKDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPI

Query:  LYLDFPQGRMKLFGTIMYPKNKYLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEACLDFPKELTTGKCGEYDFNSGAGVASTSGGAGVTSSS
        LYLDFPQGRMKLFGTIMYPKN+YLTLQFSRGGKNV CED FDNMIVFSDAWWIGTKDENPEEACLDFPK+LT G+CGEYDFN GAGV STSG AGVTS+S
Subjt:  LYLDFPQGRMKLFGTIMYPKNKYLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEACLDFPKELTTGKCGEYDFNSGAGVASTSGGAGVTSSS

Query:  KQSVKNKGINPAVENSFKGEDGDDLLDLEENVTNSIKTTPVRHSERSAGKLVLGLRRWPLFQREHSVSLWCLLFSESFRKKKCPLKFFTFLEKVPPVCTI
        KQSV+ KGINPA ENSFKGE GDDL+ LE +VTNSIKTTPVRHSERSA K+                                                 
Subjt:  KQSVKNKGINPAVENSFKGEDGDDLLDLEENVTNSIKTTPVRHSERSAGKLVLGLRRWPLFQREHSVSLWCLLFSESFRKKKCPLKFFTFLEKVPPVCTI

Query:  TTNSLSTISDQIFAEASSEDESAGTYDDLSEGEEKNILIHEPSIGDHA----KDLSVESIDEDAVEIRPPVLEGNQTSISKGEKSSRATGNAQSDARGLV
                    FAEASSEDESAGT  DLSEGEEKNI+IHEPSIGDHA    +D+SVESIDEDAV+I+PP LEGNQTSISK +KS RA G+AQSD RGLV
Subjt:  TTNSLSTISDQIFAEASSEDESAGTYDDLSEGEEKNILIHEPSIGDHA----KDLSVESIDEDAVEIRPPVLEGNQTSISKGEKSSRATGNAQSDARGLV

Query:  QPTLLSLFKKVEEKVISEDLYLICRIRLLRTPRSSKRSSTPKVSAQKMQLSGSKRKIDQDEGSKKRRAVRGQDDGEVQKKDTEYEVEDEIEESSSSQE
        QPTLLSLFKKVEEK               RTPRSSKRSS PKVS QKMQLSGSK+KIDQDEGSKKRR VRGQ  G+ QKKDTEYEVEDEIE+ SSSQE
Subjt:  QPTLLSLFKKVEEKVISEDLYLICRIRLLRTPRSSKRSSTPKVSAQKMQLSGSKRKIDQDEGSKKRRAVRGQDDGEVQKKDTEYEVEDEIEESSSSQE

A0A1S3CTA5 DNA-binding protein RHL1 isoform X2 | e value: 3.1e-179 | %identity: 72.09
Query:  MARGSSSSKRDEAKGEIDPEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGKDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPI
        MARGSSSSK+DEAKGEI+PEI  RKRLKKLAFSN+ILSETQAKPQAYLSPSATVLKHHGKDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDL TKNP+
Subjt:  MARGSSSSKRDEAKGEIDPEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGKDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPI

Query:  LYLDFPQGRMKLFGTIMYPKNKYLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEACLDFPKELTTGKCGEYDFNSGAGVASTSGGAGVTSSS
        LYLDFPQGRMKLFGTIMYPKN+YLTLQFS+GGKNV CED FDNMIVFSDAWWIGTKDENPEEACLDFPKELT G+CGEYDFN         GGAGVTS+S
Subjt:  LYLDFPQGRMKLFGTIMYPKNKYLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEACLDFPKELTTGKCGEYDFNSGAGVASTSGGAGVTSSS

Query:  KQSVKNKGINPAVENSFKGEDGDDLLDLEENVTNSIKTTPVRHSERSAGKLVLGLRRWPLFQREHSVSLWCLLFSESFRKKKCPLKFFTFLEKVPPVCTI
        KQSV+ KGINPA ENSFKGE GDDL+ LE +VTNS+KT PVRHSERSA K+                                                 
Subjt:  KQSVKNKGINPAVENSFKGEDGDDLLDLEENVTNSIKTTPVRHSERSAGKLVLGLRRWPLFQREHSVSLWCLLFSESFRKKKCPLKFFTFLEKVPPVCTI

Query:  TTNSLSTISDQIFAEASSEDESAGTYDDLSEGEEKNILIHEPSIGDHA----KDLSVESIDEDAVEIRPPVLEGNQTSISKGEKSSRATGNAQSDARGLV
                    FAEASSEDES GT  DLSEGEEKNI+IHEPSIGDHA    +D+SVESIDEDAVEI+P  LEGNQTSISK +K+SRA G+AQSD RGLV
Subjt:  TTNSLSTISDQIFAEASSEDESAGTYDDLSEGEEKNILIHEPSIGDHA----KDLSVESIDEDAVEIRPPVLEGNQTSISKGEKSSRATGNAQSDARGLV

Query:  QPTLLSLFKKVEEKVISEDLYLICRIRLLRTPRSSKRSSTPKVSAQKMQLSGSKRKIDQDEGSKKRRAVRGQDDGEVQKKDTEYEVEDEIEESSSSQE
        QPTLLSLFKKVEEK               RTPRSSKRSS PKVS QKMQLSGSK+KIDQDEGSKKRRAVRGQ  G+ Q+KDTEYEVEDEIEE SSSQE
Subjt:  QPTLLSLFKKVEEKVISEDLYLICRIRLLRTPRSSKRSSTPKVSAQKMQLSGSKRKIDQDEGSKKRRAVRGQDDGEVQKKDTEYEVEDEIEESSSSQE

A0A5D3BIX4 DNA-binding protein RHL1 isoform X2 | e value: 3.3e-173 | %identity: 72.55
Query:  MARGSSSSKRDEAKGEIDPEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGKDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPI
        MARGSSSSK+DEAKGEI+PEI  RKRLKKLAFSN+ILSETQAKPQAYLSPSATVLKHHGKDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDL TKNP+
Subjt:  MARGSSSSKRDEAKGEIDPEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGKDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPI

Query:  LYLDFPQGRMKLFGTIMYPKNKYLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEACLDFPKELTTGKCGEYDFNSGAGVASTSGGAGVTSSS
        LYLDFPQGRMKLFGTIMYPKN+YLTLQFS+GGKNV CED FDNMIVFSDAWWIGTKDENPEEACLDFPKELT G+CGEYDFN         GGAGVTS+S
Subjt:  LYLDFPQGRMKLFGTIMYPKNKYLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEACLDFPKELTTGKCGEYDFNSGAGVASTSGGAGVTSSS

Query:  KQSVKNKGINPAVENSFKGEDGDDLLDLEENVTNSIKTTPVRHSERSAGKLVLGLRRWPLFQREHSVSLWCLLFSESFRKKKCPLKFFTFLEKVPPVCTI
        KQSV+ KGINPA ENSFKGE GDDL+ LE +VTNS+KT PVRHSERSA K+                                                 
Subjt:  KQSVKNKGINPAVENSFKGEDGDDLLDLEENVTNSIKTTPVRHSERSAGKLVLGLRRWPLFQREHSVSLWCLLFSESFRKKKCPLKFFTFLEKVPPVCTI

Query:  TTNSLSTISDQIFAEASSEDESAGTYDDLSEGEEKNILIHEPSIGDHAKDLSVESIDEDAVEIRPPVLEGNQTSISKGEKSSRATGNAQSDARGLVQPTL
                    FAEASSEDES GT  DLSEGEEKNI+IHEPSIGDHAKD+SVESIDEDAVEI+P  LEGNQTSISK +K+SRA G+AQSD RGLVQPTL
Subjt:  TTNSLSTISDQIFAEASSEDESAGTYDDLSEGEEKNILIHEPSIGDHAKDLSVESIDEDAVEIRPPVLEGNQTSISKGEKSSRATGNAQSDARGLVQPTL

Query:  LSLFKKVEEKVISEDLYLICRIRLLRTPRSSKRSSTPKVSAQKMQLSGSKRKIDQDEGSKKRRAVRGQDD
        LSLFKKVEEK               RTPRSSKRSS PKVS QKMQLSGSK+KIDQDEGSKKRRAVRGQ D
Subjt:  LSLFKKVEEKVISEDLYLICRIRLLRTPRSSKRSSTPKVSAQKMQLSGSKRKIDQDEGSKKRRAVRGQDD

A0A6J1C822 DNA-binding protein RHL1 | e value: 2.1e-167 | %identity: 68.8
Query:  MARGSSSSKRDEAKGEIDPEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGKDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPI
        MARGSSSSKR+   GE+DPE A RKRLKKLAFSN++LS+TQAKPQAYLSPSATVLKHHGKDIVKKSQRKNRFLFSFSGLL PVSGGKIGELKDLGTKNPI
Subjt:  MARGSSSSKRDEAKGEIDPEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGKDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPI

Query:  LYLDFPQGRMKLFGTIMYPKNKYLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEACLDFPKELTTGKCGEYDFNSGAGVASTSGGAGVTSSS
        LYLDFPQGRMKLFGTIMYPKN+YLTLQFSRGGKNV CEDYFD+M+VFSDAWWIGTKDENPEEA LD PKELTTGKCGEYDFN         GGAGVTS+S
Subjt:  LYLDFPQGRMKLFGTIMYPKNKYLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEACLDFPKELTTGKCGEYDFNSGAGVASTSGGAGVTSSS

Query:  KQSVKNKGINPAVENSFKGEDGDDLLDLEENVTNSIKTTPVRHSERSAGKLVLGLRRWPLFQREHSVSLWCLLFSESFRKKKCPLKFFTFLEKVPPVCTI
        K SV+ KGIN A E+S KGE GD L DLE+N+ NSI TTPVRHSERSAGK+                          F+                     
Subjt:  KQSVKNKGINPAVENSFKGEDGDDLLDLEENVTNSIKTTPVRHSERSAGKLVLGLRRWPLFQREHSVSLWCLLFSESFRKKKCPLKFFTFLEKVPPVCTI

Query:  TTNSLSTISDQIFAEASSEDESAGTYDDLSEGEEKNILIHEPSIGDHA----KDLSVESIDEDAVEIRPPVLEGNQTSISKGEKSSRATGNAQSDARGL-
                    FAEASSEDES G+YDDLSEGEEKNI+IHEPSIGDHA    +DLSV++ DED +  RPP LEGNQ  ISK +K SRA GN +S+ RGL 
Subjt:  TTNSLSTISDQIFAEASSEDESAGTYDDLSEGEEKNILIHEPSIGDHA----KDLSVESIDEDAVEIRPPVLEGNQTSISKGEKSSRATGNAQSDARGL-

Query:  VQPTLLSLFKKVEEKVISEDLYLICRIRLLRTPRSSKRSSTPKVSAQKMQLSGSKRKIDQDEGSKKRRAVRGQDD-GEVQKKDTEYEVEDEIEESSSSQE
        VQ TL SLFKKVEEK               RTPRSSKRSS PKVSA+KMQLSGSKRKI+QDEGSKKRRAVRGQDD G+V +KD EYEVED+IEE SSSQE
Subjt:  VQPTLLSLFKKVEEKVISEDLYLICRIRLLRTPRSSKRSSTPKVSAQKMQLSGSKRKIDQDEGSKKRRAVRGQDD-GEVQKKDTEYEVEDEIEESSSSQE

A0A6J1E8G5 DNA-binding protein RHL1-like | e value: 2.5e-168 | %identity: 69.2
Query:  MARGSSSSKRDEAKGEIDPEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGKDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPI
        MARGSSSSKRDEAKGE++PEIAARKRLKKLAF+N+ILSETQAKPQAY SPSATVLKHHGKDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPI
Subjt:  MARGSSSSKRDEAKGEIDPEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGKDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPI

Query:  LYLDFPQGRMKLFGTIMYPKNKYLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEACLDFPKELTTGKCGEYDFNSGAGVASTSGGAGVTSSS
        LYLDFPQGR+KLFGTI+YPKN+YLTLQFSRGGKNVMCED FDNMIVFSDAWWIGTKDENPEE  LDFPKE+T GKCGEYDFN GAGVAST         S
Subjt:  LYLDFPQGRMKLFGTIMYPKNKYLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEACLDFPKELTTGKCGEYDFNSGAGVASTSGGAGVTSSS

Query:  KQSVKNKGINPAVENSFKGEDGDDLLDLEENVTNSIKTTPVRHSERSAGKLVLGLRRWPLFQREHSVSLWCLLFSESFRKKKCPLKFFTFLEKVPPVCTI
        KQSV+ KGIN A E S KGE GDDL+DLE+N+TNS+KTTPVRHSERSAGK+                                                 
Subjt:  KQSVKNKGINPAVENSFKGEDGDDLLDLEENVTNSIKTTPVRHSERSAGKLVLGLRRWPLFQREHSVSLWCLLFSESFRKKKCPLKFFTFLEKVPPVCTI

Query:  TTNSLSTISDQIFAEASSEDESAGTYDDLSEGEEKNILIHEPSIGDHAKD----LSVESIDEDAVEIRPPVLEGNQTSISKGEKSSRATGNAQSDARG-L
                    FA+A S++ESAGTY D SEGEEKNI+IHEPSIGDHA +    +SV+S D+DAVE RP  LEGN+T ISK +  SRA GNAQS  RG L
Subjt:  TTNSLSTISDQIFAEASSEDESAGTYDDLSEGEEKNILIHEPSIGDHAKD----LSVESIDEDAVEIRPPVLEGNQTSISKGEKSSRATGNAQSDARG-L

Query:  VQPTLLSLFKKVEEKVISEDLYLICRIRLLRTPRSSKRSSTPKVSAQKMQLSGSKRKIDQDEGSKKRRAVRGQDD-GEVQKKDTEYEVEDEIEESSSSQE
        VQPTL SLFKKVEEK               RTPRSSKRSSTPKVSAQKMQLSGSK+KIDQDEG KKRR V+GQDD G+ ++KDTEYE ED+IEE SSSQE
Subjt:  VQPTLLSLFKKVEEKVISEDLYLICRIRLLRTPRSSKRSSTPKVSAQKMQLSGSKRKIDQDEGSKKRRAVRGQDD-GEVQKKDTEYEVEDEIEESSSSQE

SwissProt top hits (e value, %identity, alignment)
O81242 DNA-binding protein RHL1 | e value: 6.0e-71 | %identity: 56.81
Query:  SSSSKRDEAKG--EIDPEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGKDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPILY
        +SSSK+  +KG  + D E   RKRLK LA  N +LS++ AK  + L PS  VLKHHG DI++KSQRKNRFLFSF GLLAP+S   IG+L  L TKNP+LY
Subjt:  SSSSKRDEAKG--EIDPEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGKDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPILY

Query:  LDFPQGRMKLFGTIMYPKNKYLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEACLDFPKELTTGKCGEYDFNSGAG-------VASTSGGAG
        L+FPQGRMKLFGTI+YPKN+YLTLQFSRGGKNV+C+DYFDNMIVFS++WWIGTK+ENPEEA LDFPKEL   +  E+DF  GAG       +AS   G+ 
Subjt:  LDFPQGRMKLFGTIMYPKNKYLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEACLDFPKELTTGKCGEYDFNSGAG-------VASTSGGAG

Query:  VTSSSKQSVKNKGINPAVENSFKGEDGDDLLDLEENV--TNSIKTTPVRHSERSAGK
         T +    V N+ +      S  GE  DD + +   V  T  ++ TPVR S+R++GK
Subjt:  VTSSSKQSVKNKGINPAVENSFKGEDGDDLLDLEENV--TNSIKTTPVRHSERSAGK

Arabidopsis top hits (e value, %identity, alignment)
AT1G48380.1 root hair initiation protein root hairless 1 (RHL1) | e value: 4.3e-72 | %identity: 56.81
Query:  SSSSKRDEAKG--EIDPEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGKDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPILY
        +SSSK+  +KG  + D E   RKRLK LA  N +LS++ AK  + L PS  VLKHHG DI++KSQRKNRFLFSF GLLAP+S   IG+L  L TKNP+LY
Subjt:  SSSSKRDEAKG--EIDPEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGKDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPILY

Query:  LDFPQGRMKLFGTIMYPKNKYLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEACLDFPKELTTGKCGEYDFNSGAG-------VASTSGGAG
        L+FPQGRMKLFGTI+YPKN+YLTLQFSRGGKNV+C+DYFDNMIVFS++WWIGTK+ENPEEA LDFPKEL   +  E+DF  GAG       +AS   G+ 
Subjt:  LDFPQGRMKLFGTIMYPKNKYLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEACLDFPKELTTGKCGEYDFNSGAG-------VASTSGGAG

Query:  VTSSSKQSVKNKGINPAVENSFKGEDGDDLLDLEENV--TNSIKTTPVRHSERSAGK
         T +    V N+ +      S  GE  DD + +   V  T  ++ TPVR S+R++GK
Subjt:  VTSSSKQSVKNKGINPAVENSFKGEDGDDLLDLEENV--TNSIKTTPVRHSERSAGK

AT1G48380.2 root hair initiation protein root hairless 1 (RHL1) | e value: 3.2e-67 | %identity: 50.69
Query:  SSSSKRDEAKG--EIDPEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGKDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPILY
        +SSSK+  +KG  + D E   RKRLK LA  N +LS++ AK  + L PS  VLKHHG DI++KSQRKNRFLFSF GLLAP+S   IG+L  L TKNP+LY
Subjt:  SSSSKRDEAKG--EIDPEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGKDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPILY

Query:  LDFPQGRMKLFGTIMYPKNKYLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEACLDFPKELT------------------------------
        L+FPQGRMKLFGTI+YPKN+YLTLQFSRGGKNV+C+DYFDNMIVFS++WWIGTK+ENPEEA LDFPKEL                               
Subjt:  LDFPQGRMKLFGTIMYPKNKYLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEACLDFPKELT------------------------------

Query:  -TGKCGEYDFNSGAG-------VASTSGGAGVTSSSKQSVKNKGINPAVENSFKGEDGDDLLDLEENV--TNSIKTTPVRHSERSAGK
           +  E+DF  GAG       +AS   G+  T +    V N+ +      S  GE  DD + +   V  T  ++ TPVR S+R++GK
Subjt:  -TGKCGEYDFNSGAG-------VASTSGGAGVTSSSKQSVKNKGINPAVENSFKGEDGDDLLDLEENV--TNSIKTTPVRHSERSAGK


Sequences
CDS sequence
ATGGCGCGAGGATCATCGTCTTCAAAGAGGGACGAAGCAAAAGGAGAAATCGATCCGGAGATTGCAGCACGAAAGCGGCTTAAGAAGCTCGCATTCTCCAATCACATACT
TTCAGAGACCCAGGCAAAGCCTCAGGCGTATCTGAGCCCTTCAGCGACGGTTCTGAAGCACCATGGCAAAGACATTGTCAAGAAATCTCAGCGAAAGAACAGGTTCCTCT
TCTCCTTTTCAGGCTTGCTCGCTCCCGTCAGTGGAGGCAAGATTGGCGAGCTCAAAGATTTGGGAACCAAGAATCCTATTCTCTATCTCGATTTTCCTCAGGGGCGTATG
AAGTTGTTTGGAACTATTATGTATCCGAAGAACAAATATTTGACTCTGCAGTTCTCTAGAGGTGGAAAGAATGTGATGTGTGAAGATTATTTTGATAATATGATTGTCTT
TTCTGATGCATGGTGGATTGGAACTAAAGATGAAAATCCAGAGGAGGCTTGCCTTGATTTTCCTAAAGAATTGACTACGGGAAAATGTGGAGAATATGACTTTAACAGTG
GTGCTGGTGTTGCTAGTACGAGTGGTGGTGCTGGTGTTACCAGTTCAAGTAAGCAGAGTGTTAAAAATAAGGGAATCAATCCTGCTGTAGAAAATTCCTTTAAAGGAGAA
GATGGAGATGATTTACTGGACCTTGAAGAAAATGTGACAAATTCAATAAAGACTACGCCAGTTAGACATTCTGAAAGATCTGCCGGAAAACTTGTATTGGGTCTTCGTAG
GTGGCCTTTGTTTCAAAGAGAGCATTCTGTGAGTTTGTGGTGTTTACTGTTTTCTGAGAGTTTCCGTAAAAAGAAATGTCCTCTTAAGTTCTTCACATTCTTGGAGAAAG
TGCCACCAGTTTGTACCATTACGACAAATAGTCTTTCAACAATTTCAGATCAAATTTTTGCAGAGGCTTCTTCTGAGGATGAGTCTGCTGGCACCTACGATGATTTGTCT
GAAGGAGAAGAAAAGAATATTCTCATACACGAACCTTCAATTGGAGATCATGCTAAAGATCTCTCTGTTGAGTCTATAGATGAAGATGCTGTGGAAATTAGACCTCCTGT
TCTTGAAGGAAATCAGACATCAATTTCTAAGGGGGAAAAAAGTTCTCGGGCTACAGGAAATGCTCAGAGTGATGCTCGTGGACTTGTCCAGCCTACTCTACTTAGTTTGT
TCAAGAAAGTGGAGGAGAAGGTAATATCTGAAGACCTTTACTTAATATGTAGAATCAGGTTGCTGAGGACACCAAGAAGTTCAAAGAGGTCTTCAACACCCAAAGTTTCT
GCCCAAAAGATGCAGCTGTCTGGTTCAAAGCGAAAGATTGACCAGGATGAAGGATCAAAAAAGAGGAGGGCTGTCCGGGGACAAGATGATGGAGAAGTCCAGAAGAAGGA
TACAGAATATGAGGTTGAAGATGAGATTGAAGAATCGTCAAGTTCTCAAGAGGTGAGTGTTCACTGCCTATTTTATAAACCTAAATACTCCCATGATCTTTTAGAATATT
GGACTGGAAGTTGGTGGGCCCTAGTCCCTACCCTTTATAAACTTCTCACGTCGTGCGTGCCTCACACCATGAGCTTGCACATTCTTGCTCATGATTTTGTGTCATCTAGG
ACACTGATGAAGATTGGACAAGTTGAGGTTATTACATTCTACCAATGCTAA
mRNA sequence
ATTTTCTGCCAAATATTATGAAATGAGAAATTTAATTAAAGGAGAAGAAAAGAGTCAAGGATCCCCAAGAGTGAACGGCAGTCCGTAGCTGACTACCGGCATCAGAAATG
GCGCGAGGATCATCGTCTTCAAAGAGGGACGAAGCAAAAGGAGAAATCGATCCGGAGATTGCAGCACGAAAGCGGCTTAAGAAGCTCGCATTCTCCAATCACATACTTTC
AGAGACCCAGGCAAAGCCTCAGGCGTATCTGAGCCCTTCAGCGACGGTTCTGAAGCACCATGGCAAAGACATTGTCAAGAAATCTCAGCGAAAGAACAGGTTCCTCTTCT
CCTTTTCAGGCTTGCTCGCTCCCGTCAGTGGAGGCAAGATTGGCGAGCTCAAAGATTTGGGAACCAAGAATCCTATTCTCTATCTCGATTTTCCTCAGGGGCGTATGAAG
TTGTTTGGAACTATTATGTATCCGAAGAACAAATATTTGACTCTGCAGTTCTCTAGAGGTGGAAAGAATGTGATGTGTGAAGATTATTTTGATAATATGATTGTCTTTTC
TGATGCATGGTGGATTGGAACTAAAGATGAAAATCCAGAGGAGGCTTGCCTTGATTTTCCTAAAGAATTGACTACGGGAAAATGTGGAGAATATGACTTTAACAGTGGTG
CTGGTGTTGCTAGTACGAGTGGTGGTGCTGGTGTTACCAGTTCAAGTAAGCAGAGTGTTAAAAATAAGGGAATCAATCCTGCTGTAGAAAATTCCTTTAAAGGAGAAGAT
GGAGATGATTTACTGGACCTTGAAGAAAATGTGACAAATTCAATAAAGACTACGCCAGTTAGACATTCTGAAAGATCTGCCGGAAAACTTGTATTGGGTCTTCGTAGGTG
GCCTTTGTTTCAAAGAGAGCATTCTGTGAGTTTGTGGTGTTTACTGTTTTCTGAGAGTTTCCGTAAAAAGAAATGTCCTCTTAAGTTCTTCACATTCTTGGAGAAAGTGC
CACCAGTTTGTACCATTACGACAAATAGTCTTTCAACAATTTCAGATCAAATTTTTGCAGAGGCTTCTTCTGAGGATGAGTCTGCTGGCACCTACGATGATTTGTCTGAA
GGAGAAGAAAAGAATATTCTCATACACGAACCTTCAATTGGAGATCATGCTAAAGATCTCTCTGTTGAGTCTATAGATGAAGATGCTGTGGAAATTAGACCTCCTGTTCT
TGAAGGAAATCAGACATCAATTTCTAAGGGGGAAAAAAGTTCTCGGGCTACAGGAAATGCTCAGAGTGATGCTCGTGGACTTGTCCAGCCTACTCTACTTAGTTTGTTCA
AGAAAGTGGAGGAGAAGGTAATATCTGAAGACCTTTACTTAATATGTAGAATCAGGTTGCTGAGGACACCAAGAAGTTCAAAGAGGTCTTCAACACCCAAAGTTTCTGCC
CAAAAGATGCAGCTGTCTGGTTCAAAGCGAAAGATTGACCAGGATGAAGGATCAAAAAAGAGGAGGGCTGTCCGGGGACAAGATGATGGAGAAGTCCAGAAGAAGGATAC
AGAATATGAGGTTGAAGATGAGATTGAAGAATCGTCAAGTTCTCAAGAGGTGAGTGTTCACTGCCTATTTTATAAACCTAAATACTCCCATGATCTTTTAGAATATTGGA
CTGGAAGTTGGTGGGCCCTAGTCCCTACCCTTTATAAACTTCTCACGTCGTGCGTGCCTCACACCATGAGCTTGCACATTCTTGCTCATGATTTTGTGTCATCTAGGACA
CTGATGAAGATTGGACAAGTTGAGGTTATTACATTCTACCAATGCTAA
Protein sequence
MARGSSSSKRDEAKGEIDPEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGKDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPILYLDFPQGRM
KLFGTIMYPKNKYLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEACLDFPKELTTGKCGEYDFNSGAGVASTSGGAGVTSSSKQSVKNKGINPAVENSFKGE
DGDDLLDLEENVTNSIKTTPVRHSERSAGKLVLGLRRWPLFQREHSVSLWCLLFSESFRKKKCPLKFFTFLEKVPPVCTITTNSLSTISDQIFAEASSEDESAGTYDDLS
EGEEKNILIHEPSIGDHAKDLSVESIDEDAVEIRPPVLEGNQTSISKGEKSSRATGNAQSDARGLVQPTLLSLFKKVEEKVISEDLYLICRIRLLRTPRSSKRSSTPKVS
AQKMQLSGSKRKIDQDEGSKKRRAVRGQDDGEVQKKDTEYEVEDEIEESSSSQEVSVHCLFYKPKYSHDLLEYWTGSWWALVPTLYKLLTSCVPHTMSLHILAHDFVSSR
TLMKIGQVEVITFYQC