; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0004680 (gene) of Snake gourd v1 genome

Gene IDTan0004680
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionBED-type domain-containing protein
Genome locationLG11:8306335..8311609
RNA-Seq ExpressionTan0004680
SyntenyTan0004680
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR003656 - Zinc finger, BED-type
IPR007021 - Domain of unknown function DUF659
IPR008906 - HAT, C-terminal dimerisation domain
IPR012337 - Ribonuclease H-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022990258.1 uncharacterized protein LOC111487192 isoform X1 [Cucurbita maxima]0.0e+0089.36Show/hide
Query:  MVESEELLTVTDMKFKEKRGKVPPRASDPGWAHGIMVNGGRQKIKCKYCHKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIQQLLGFKVLEKLKR
        M ESEELL  TDMKFKEKRG VPPRASDPGWAHGIMVNGGRQKIKCKYC+KVMLGGGISRLKQHLAGERGNV PCEEVPEEVKVQI+QLLGFKVL KLKR
Subjt:  MVESEELLTVTDMKFKEKRGKVPPRASDPGWAHGIMVNGGRQKIKCKYCHKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIQQLLGFKVLEKLKR

Query:  QKKGSKHAVSCLPSREEIDDGVHRVQNSRRRPFRRNGKDVLEGVTKEAKRKKKHLFPTSSMTQSVNQNTATMESIEQVDMAVARFIYQAGIPISAISSQY
         KKGSK+A SC+ SREEIDDGVHRVQN+RRR F+R GK+VLE VTK+AKRKKK+ FPTS +TQSVNQNT+T+ESIEQ DMAVARFIYQAGIPISA+S+Q+
Subjt:  QKKGSKHAVSCLPSREEIDDGVHRVQNSRRRPFRRNGKDVLEGVTKEAKRKKKHLFPTSSMTQSVNQNTATMESIEQVDMAVARFIYQAGIPISAISSQY

Query:  FQQMADAIAAIGPGYKMPTYHSLMGKLLDRSVQDAGEYVEELRKSWEVTGCSVLVDRWMDRTGSVVINFFVYCPKGTMFLKSVDLSEISESPEGLLNLFD
        FQQMADAIAA+GPGYKMPT HSLMGKLLDRSV+D G YVEELRKSWEVTGCSVLVDRWMDRTGSVVINFFVYCP+GTMFLKSVDLSEISESPEGLLNLFD
Subjt:  FQQMADAIAAIGPGYKMPTYHSLMGKLLDRSVQDAGEYVEELRKSWEVTGCSVLVDRWMDRTGSVVINFFVYCPKGTMFLKSVDLSEISESPEGLLNLFD

Query:  SIVEEVGLKNIVHFVTDTSPLFKAAGILLVEKYKTFFSSVCAAHCVELILEEIEKMEEVKEVVGKAKRIVQFIYNHVWVLNQIKKRSGGREIIQLASARY
        SIV+EVGLKNIV+FVTDTSPL KAAGILLVEKYKTFFSSVCAAHCVELILEE EKMEEVKE+VGKAKRIVQFIYN VWVLNQIKKRSGGREIIQLAS+RY
Subjt:  SIVEEVGLKNIVHFVTDTSPLFKAAGILLVEKYKTFFSSVCAAHCVELILEEIEKMEEVKEVVGKAKRIVQFIYNHVWVLNQIKKRSGGREIIQLASARY

Query:  FSIFLTLQNILSLKDYLHQTFTSGAWMQSNLSKSGAGLEVAKITADPLFWSKCDHISMGTKPLLSVLQFLESEEKPSAGFIYDAFEKAKNSVMLAFNQKE
        FSIFLTLQNI SLK+++ Q FTSGAWMQSN SK+GAGLEVAKITADP+FWSKC+HI+MGTKPLLSV+QFLESEEKPSAGFIYDAFEKAKNSVMLAFNQKE
Subjt:  FSIFLTLQNILSLKDYLHQTFTSGAWMQSNLSKSGAGLEVAKITADPLFWSKCDHISMGTKPLLSVLQFLESEEKPSAGFIYDAFEKAKNSVMLAFNQKE

Query:  SVYLPYLKAIDHVLLKEFQSPLHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPATWWSL
        SVYLPYLKAIDHVLLKEFQS LH+AAYYLNPSIFYSPTF+ SKVIQKGLLDCIEALEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPATWWSL
Subjt:  SVYLPYLKAIDHVLLKEFQSPLHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPATWWSL

Query:  YGTDYPDLQHLSVRILSQTCSIMQCRKSWSMFKYIYLKKKNRLEKQKMNDLAFAHYNLQLQERRLETCKARCSIDVLDPVCLEAIDANMEDWMEDVEALE
        YGTDYPDLQ L+VRILSQ+C+I+QCRKS S+FKYIY KKKNRLEKQKMNDLAFAHYNLQLQERRLETCKARCSID LDPV LE I ANMEDW+EDVEALE
Subjt:  YGTDYPDLQHLSVRILSQTCSIMQCRKSWSMFKYIYLKKKNRLEKQKMNDLAFAHYNLQLQERRLETCKARCSIDVLDPVCLEAIDANMEDWMEDVEALE

Query:  DEHKRWVDVKVTNQ
        DEH+RWVD+K T+Q
Subjt:  DEHKRWVDVKVTNQ

XP_038884678.1 uncharacterized protein LOC120075395 isoform X1 [Benincasa hispida]0.0e+0089.6Show/hide
Query:  MKFKEKRGKVPPRASDPGWAHGIMVNGGRQKIKCKYCHKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIQQLLGFKVLEKLKRQKKGSKHAVSCL
        MK  EK+G VPPRASDPGWAHGIMVNGGRQKIKCKYC+KVMLGGGISRLKQHLAGERGNVAPCE VPEEVKVQIQQLLGFKVLEKLKRQKKGSK+AVSC 
Subjt:  MKFKEKRGKVPPRASDPGWAHGIMVNGGRQKIKCKYCHKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIQQLLGFKVLEKLKRQKKGSKHAVSCL

Query:  PSREEIDDGVHRVQNSRRRPFRRNGKDVLEGVTKEAKRKKKHLFPTSSMTQSVNQNTATMESIEQVDMAVARFIYQAGIPISAISSQYFQQMADAIAAIG
        PSREEIDDG+HRVQNSRR P RR  K+VLEGVTKEAKRKKKHL PTS + QS+NQNT  +ESIEQ DMAVA+F+YQAGIPISA+SSQYFQQMADAIAA+G
Subjt:  PSREEIDDGVHRVQNSRRRPFRRNGKDVLEGVTKEAKRKKKHLFPTSSMTQSVNQNTATMESIEQVDMAVARFIYQAGIPISAISSQYFQQMADAIAAIG

Query:  PGYKMPTYHSLMGKLLDRSVQDAGEYVEELRKSWEVTGCSVLVDRWMDRTGSVVINFFVYCPKGTMFLKSVDLSEISESPEGLLNLFDSIVEEVGLKNIV
        PGYKMPTYHSLMG LLDRSVQDAGEYVEELRKSWEVTGCS+LVDRWMDRT SVVINFFVYC KGTMFLKSVDLSEISESPEGLLNLFD+IV+EVG KNIV
Subjt:  PGYKMPTYHSLMGKLLDRSVQDAGEYVEELRKSWEVTGCSVLVDRWMDRTGSVVINFFVYCPKGTMFLKSVDLSEISESPEGLLNLFDSIVEEVGLKNIV

Query:  HFVTDTSPLFKAAGILLVEKYKTFFSSVCAAHCVELILEEIEKMEEVKEVVGKAKRIVQFIYNHVWVLNQIKKRSGGREIIQLASARYFSIFLTLQNILS
        +FVTDTSPLFKAAG LLVEKYKTFFSSVCAAHCVELILEEIE+MEEVKEVVGKAKRIVQFIYN+ WVLNQIKKRSGGREIIQLAS RYFS FLTL+NILS
Subjt:  HFVTDTSPLFKAAGILLVEKYKTFFSSVCAAHCVELILEEIEKMEEVKEVVGKAKRIVQFIYNHVWVLNQIKKRSGGREIIQLASARYFSIFLTLQNILS

Query:  LKDYLHQTFTSGAWMQSNLSKSGAGLEVAKITADPLFWSKCDHISMGTKPLLSVLQFLESEEKPSAGFIYDAFEKAKNSVMLAFNQKESVYLPYLKAIDH
        LK++LHQTFTSGAWMQSNLSK GAGLEV KI ADPLFWSKCDHI+MGTKPLLSVLQFLESEEKP+AGFIYDAFEKAKNSVMLAFNQKES+YLPYLKAIDH
Subjt:  LKDYLHQTFTSGAWMQSNLSKSGAGLEVAKITADPLFWSKCDHISMGTKPLLSVLQFLESEEKPSAGFIYDAFEKAKNSVMLAFNQKESVYLPYLKAIDH

Query:  VLLKEFQSPLHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPATWWSLYGTDYPDLQHLS
        VL KEFQS LHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPATWWSLYGTDYPDLQ L+
Subjt:  VLLKEFQSPLHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPATWWSLYGTDYPDLQHLS

Query:  VRILSQTCSIMQCRKSWSMFKYIYLKKKNRLEKQKMNDLAFAHYNLQLQERRLETCKARCSIDVLDPVCLEAIDANMEDWMEDVEALEDEHKRWVDVKVT
        VRILSQTCSI +CRKS SMFKY+YLKKK  LEKQKMNDLAFAHYNLQLQERRLETCKARCSID +DPV LEAID NM+DW      +EDEHK WVDVKVT
Subjt:  VRILSQTCSIMQCRKSWSMFKYIYLKKKNRLEKQKMNDLAFAHYNLQLQERRLETCKARCSIDVLDPVCLEAIDANMEDWMEDVEALEDEHKRWVDVKVT

Query:  NQETLVEHKLSNMDSCIDSTD
        NQET VEHKLSNMDSCID TD
Subjt:  NQETLVEHKLSNMDSCIDSTD

XP_038884679.1 uncharacterized protein LOC120075395 isoform X2 [Benincasa hispida]0.0e+0089.6Show/hide
Query:  MKFKEKRGKVPPRASDPGWAHGIMVNGGRQKIKCKYCHKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIQQLLGFKVLEKLKRQKKGSKHAVSCL
        MK  EK+G VPPRASDPGWAHGIMVNGGRQKIKCKYC+KVMLGGGISRLKQHLAGERGNVAPCE VPEEVKVQIQQLLGFKVLEKLKRQKKGSK+AVSC 
Subjt:  MKFKEKRGKVPPRASDPGWAHGIMVNGGRQKIKCKYCHKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIQQLLGFKVLEKLKRQKKGSKHAVSCL

Query:  PSREEIDDGVHRVQNSRRRPFRRNGKDVLEGVTKEAKRKKKHLFPTSSMTQSVNQNTATMESIEQVDMAVARFIYQAGIPISAISSQYFQQMADAIAAIG
        PSREEIDDG+HRVQNSRR P RR  K+VLEGVTKEAKRKKKHL PTS + QS+NQNT  +ESIEQ DMAVA+F+YQAGIPISA+SSQYFQQMADAIAA+G
Subjt:  PSREEIDDGVHRVQNSRRRPFRRNGKDVLEGVTKEAKRKKKHLFPTSSMTQSVNQNTATMESIEQVDMAVARFIYQAGIPISAISSQYFQQMADAIAAIG

Query:  PGYKMPTYHSLMGKLLDRSVQDAGEYVEELRKSWEVTGCSVLVDRWMDRTGSVVINFFVYCPKGTMFLKSVDLSEISESPEGLLNLFDSIVEEVGLKNIV
        PGYKMPTYHSLMG LLDRSVQDAGEYVEELRKSWEVTGCS+LVDRWMDRT SVVINFFVYC KGTMFLKSVDLSEISESPEGLLNLFD+IV+EVG KNIV
Subjt:  PGYKMPTYHSLMGKLLDRSVQDAGEYVEELRKSWEVTGCSVLVDRWMDRTGSVVINFFVYCPKGTMFLKSVDLSEISESPEGLLNLFDSIVEEVGLKNIV

Query:  HFVTDTSPLFKAAGILLVEKYKTFFSSVCAAHCVELILEEIEKMEEVKEVVGKAKRIVQFIYNHVWVLNQIKKRSGGREIIQLASARYFSIFLTLQNILS
        +FVTDTSPLFKAAG LLVEKYKTFFSSVCAAHCVELILEEIE+MEEVKEVVGKAKRIVQFIYN+ WVLNQIKKRSGGREIIQLAS RYFS FLTL+NILS
Subjt:  HFVTDTSPLFKAAGILLVEKYKTFFSSVCAAHCVELILEEIEKMEEVKEVVGKAKRIVQFIYNHVWVLNQIKKRSGGREIIQLASARYFSIFLTLQNILS

Query:  LKDYLHQTFTSGAWMQSNLSKSGAGLEVAKITADPLFWSKCDHISMGTKPLLSVLQFLESEEKPSAGFIYDAFEKAKNSVMLAFNQKESVYLPYLKAIDH
        LK++LHQTFTSGAWMQSNLSK GAGLEV KI ADPLFWSKCDHI+MGTKPLLSVLQFLESEEKP+AGFIYDAFEKAKNSVMLAFNQKES+YLPYLKAIDH
Subjt:  LKDYLHQTFTSGAWMQSNLSKSGAGLEVAKITADPLFWSKCDHISMGTKPLLSVLQFLESEEKPSAGFIYDAFEKAKNSVMLAFNQKESVYLPYLKAIDH

Query:  VLLKEFQSPLHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPATWWSLYGTDYPDLQHLS
        VL KEFQS LHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPATWWSLYGTDYPDLQ L+
Subjt:  VLLKEFQSPLHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPATWWSLYGTDYPDLQHLS

Query:  VRILSQTCSIMQCRKSWSMFKYIYLKKKNRLEKQKMNDLAFAHYNLQLQERRLETCKARCSIDVLDPVCLEAIDANMEDWMEDVEALEDEHKRWVDVKVT
        VRILSQTCSI +CRKS SMFKY+YLKKK  LEKQKMNDLAFAHYNLQLQERRLETCKARCSID +DPV LEAID NM+DW      +EDEHK WVDVKVT
Subjt:  VRILSQTCSIMQCRKSWSMFKYIYLKKKNRLEKQKMNDLAFAHYNLQLQERRLETCKARCSIDVLDPVCLEAIDANMEDWMEDVEALEDEHKRWVDVKVT

Query:  NQETLVEHKLSNMDSCIDSTD
        NQET VEHKLSNMDSCID TD
Subjt:  NQETLVEHKLSNMDSCIDSTD

XP_038884685.1 uncharacterized protein LOC120075395 isoform X4 [Benincasa hispida]0.0e+0089.6Show/hide
Query:  MKFKEKRGKVPPRASDPGWAHGIMVNGGRQKIKCKYCHKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIQQLLGFKVLEKLKRQKKGSKHAVSCL
        MK  EK+G VPPRASDPGWAHGIMVNGGRQKIKCKYC+KVMLGGGISRLKQHLAGERGNVAPCE VPEEVKVQIQQLLGFKVLEKLKRQKKGSK+AVSC 
Subjt:  MKFKEKRGKVPPRASDPGWAHGIMVNGGRQKIKCKYCHKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIQQLLGFKVLEKLKRQKKGSKHAVSCL

Query:  PSREEIDDGVHRVQNSRRRPFRRNGKDVLEGVTKEAKRKKKHLFPTSSMTQSVNQNTATMESIEQVDMAVARFIYQAGIPISAISSQYFQQMADAIAAIG
        PSREEIDDG+HRVQNSRR P RR  K+VLEGVTKEAKRKKKHL PTS + QS+NQNT  +ESIEQ DMAVA+F+YQAGIPISA+SSQYFQQMADAIAA+G
Subjt:  PSREEIDDGVHRVQNSRRRPFRRNGKDVLEGVTKEAKRKKKHLFPTSSMTQSVNQNTATMESIEQVDMAVARFIYQAGIPISAISSQYFQQMADAIAAIG

Query:  PGYKMPTYHSLMGKLLDRSVQDAGEYVEELRKSWEVTGCSVLVDRWMDRTGSVVINFFVYCPKGTMFLKSVDLSEISESPEGLLNLFDSIVEEVGLKNIV
        PGYKMPTYHSLMG LLDRSVQDAGEYVEELRKSWEVTGCS+LVDRWMDRT SVVINFFVYC KGTMFLKSVDLSEISESPEGLLNLFD+IV+EVG KNIV
Subjt:  PGYKMPTYHSLMGKLLDRSVQDAGEYVEELRKSWEVTGCSVLVDRWMDRTGSVVINFFVYCPKGTMFLKSVDLSEISESPEGLLNLFDSIVEEVGLKNIV

Query:  HFVTDTSPLFKAAGILLVEKYKTFFSSVCAAHCVELILEEIEKMEEVKEVVGKAKRIVQFIYNHVWVLNQIKKRSGGREIIQLASARYFSIFLTLQNILS
        +FVTDTSPLFKAAG LLVEKYKTFFSSVCAAHCVELILEEIE+MEEVKEVVGKAKRIVQFIYN+ WVLNQIKKRSGGREIIQLAS RYFS FLTL+NILS
Subjt:  HFVTDTSPLFKAAGILLVEKYKTFFSSVCAAHCVELILEEIEKMEEVKEVVGKAKRIVQFIYNHVWVLNQIKKRSGGREIIQLASARYFSIFLTLQNILS

Query:  LKDYLHQTFTSGAWMQSNLSKSGAGLEVAKITADPLFWSKCDHISMGTKPLLSVLQFLESEEKPSAGFIYDAFEKAKNSVMLAFNQKESVYLPYLKAIDH
        LK++LHQTFTSGAWMQSNLSK GAGLEV KI ADPLFWSKCDHI+MGTKPLLSVLQFLESEEKP+AGFIYDAFEKAKNSVMLAFNQKES+YLPYLKAIDH
Subjt:  LKDYLHQTFTSGAWMQSNLSKSGAGLEVAKITADPLFWSKCDHISMGTKPLLSVLQFLESEEKPSAGFIYDAFEKAKNSVMLAFNQKESVYLPYLKAIDH

Query:  VLLKEFQSPLHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPATWWSLYGTDYPDLQHLS
        VL KEFQS LHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPATWWSLYGTDYPDLQ L+
Subjt:  VLLKEFQSPLHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPATWWSLYGTDYPDLQHLS

Query:  VRILSQTCSIMQCRKSWSMFKYIYLKKKNRLEKQKMNDLAFAHYNLQLQERRLETCKARCSIDVLDPVCLEAIDANMEDWMEDVEALEDEHKRWVDVKVT
        VRILSQTCSI +CRKS SMFKY+YLKKK  LEKQKMNDLAFAHYNLQLQERRLETCKARCSID +DPV LEAID NM+DW      +EDEHK WVDVKVT
Subjt:  VRILSQTCSIMQCRKSWSMFKYIYLKKKNRLEKQKMNDLAFAHYNLQLQERRLETCKARCSIDVLDPVCLEAIDANMEDWMEDVEALEDEHKRWVDVKVT

Query:  NQETLVEHKLSNMDSCIDSTD
        NQET VEHKLSNMDSCID TD
Subjt:  NQETLVEHKLSNMDSCIDSTD

XP_038884686.1 uncharacterized protein LOC120075395 isoform X5 [Benincasa hispida]0.0e+0089.47Show/hide
Query:  MKFKEKRGKVPPRASDPGWAHGIMVNGGRQKIKCKYCHKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIQQLLGFKVLEKLKRQKKGSKHAVSCL
        MK  EK+G VPPRASDPGWAHGIMVNGGRQKIKCKYC+KVMLGGGISRLKQHLAGERGNVAPCE VPEEVKVQIQQLLGFKVLEKLKRQKKGSK+AVSC 
Subjt:  MKFKEKRGKVPPRASDPGWAHGIMVNGGRQKIKCKYCHKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIQQLLGFKVLEKLKRQKKGSKHAVSCL

Query:  PSREEIDDGVHRVQNSRRRPFRRNGKDVLEGVTKEAKRKKKHLFPTSSMTQSVNQNTATMESIEQVDMAVARFIYQAGIPISAISSQYFQQMADAIAAIG
        PSREEIDDG+HRVQNSRR P RR  K+VLEGVTKEAKRKKKHL PTS + QS+NQNT  +ESIEQ DMAVA+F+YQAGIPISA+SSQYFQQMADAIAA+G
Subjt:  PSREEIDDGVHRVQNSRRRPFRRNGKDVLEGVTKEAKRKKKHLFPTSSMTQSVNQNTATMESIEQVDMAVARFIYQAGIPISAISSQYFQQMADAIAAIG

Query:  PGYKMPTYHSLMGKLLDRSVQDAGEYVEELRKSWEVTGCSVLVDRWMDRTGSVVINFFVYCPKGTMFLKSVDLSEISESPEGLLNLFDSIVEEVGLKNIV
        PGYKMPTYHSLMG LLDRSVQDAGEYVEELRKSWEVTGCS+LVDRWMDRT SVVINFFVYC KGTMFLKSVDLSEISESPEGLLNLFD+IV+EVG KNIV
Subjt:  PGYKMPTYHSLMGKLLDRSVQDAGEYVEELRKSWEVTGCSVLVDRWMDRTGSVVINFFVYCPKGTMFLKSVDLSEISESPEGLLNLFDSIVEEVGLKNIV

Query:  HFVTDTSPLFKAAGILLVEKYKTFFSSVCAAHCVELILEEIEKMEEVKEVVGKAKRIVQFIYNHVWVLNQIKKRSGGREIIQLASARYFSIFLTLQNILS
        +FVTDTSPLFKAAG LLVEKYKTFFSSVCAAHCVELILEEIE+MEEVKEVVGKAKRIVQFIYN+ WVLNQIKKRSGGREIIQLAS RYFS FLTL+NILS
Subjt:  HFVTDTSPLFKAAGILLVEKYKTFFSSVCAAHCVELILEEIEKMEEVKEVVGKAKRIVQFIYNHVWVLNQIKKRSGGREIIQLASARYFSIFLTLQNILS

Query:  LKDYLHQTFTSGAWMQSNLSKSGAGLEVAKITADPLFWSKCDHISMGTKPLLSVLQFLESEEKPSAGFIYDAFEKAKNSVMLAFNQKESVYLPYLKAIDH
        LK++LHQTFTSGAWMQSNLSK GAGLEV KI ADPLFWSKCDHI+MGTKPLLSVLQFLESEEKP+AGFIYDAFEKAKNSVMLAFNQKES+YLPYLKAIDH
Subjt:  LKDYLHQTFTSGAWMQSNLSKSGAGLEVAKITADPLFWSKCDHISMGTKPLLSVLQFLESEEKPSAGFIYDAFEKAKNSVMLAFNQKESVYLPYLKAIDH

Query:  VLLKEFQSPLHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPATWWSLYGTDYPDLQHLS
        VL KEFQS LHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPATWWSLYGTDYPDLQ L+
Subjt:  VLLKEFQSPLHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPATWWSLYGTDYPDLQHLS

Query:  VRILSQTCSIMQCRKSWSMFKYIYLKKKNRLEKQKMNDLAFAHYNLQLQERRLETCKARCSIDVLDPVCLEAIDANMEDWMEDVEALEDEHKRWVDVKVT
        VRILSQTCSI +CRKS SMFKY+YLKKK  LEKQKMNDLAFAHYNLQLQERRLETCKARCSID +DPV LEAID NM+DW      +EDEHK WVDVKVT
Subjt:  VRILSQTCSIMQCRKSWSMFKYIYLKKKNRLEKQKMNDLAFAHYNLQLQERRLETCKARCSIDVLDPVCLEAIDANMEDWMEDVEALEDEHKRWVDVKVT

Query:  NQETLVEHKLSNMDSCIDSTDE
        NQET VEHKLSNMDSCID T E
Subjt:  NQETLVEHKLSNMDSCIDSTDE

TrEMBL top hitse value%identityAlignment
A0A6J1H0E4 uncharacterized protein LOC111459278 isoform X10.0e+0089.51Show/hide
Query:  MVESEELLTVTDMKFKEKRGKVPPRASDPGWAHGIMVNGGRQKIKCKYCHKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIQQLLGFKVLEKLKR
        M ESEELL  TDMKFKEKRG  PPRASDPGWAHGIMVNGGRQKIKCKYC+KVMLGGGISRLKQHLAGERGNV PCEEVPEEVKVQI+QLLGFKVL KLKR
Subjt:  MVESEELLTVTDMKFKEKRGKVPPRASDPGWAHGIMVNGGRQKIKCKYCHKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIQQLLGFKVLEKLKR

Query:  QKKGSKHAVSCLPSREEIDDGVHRVQNSRRRPFRRNGKDVLEGVTKEAKRKKKHLFPTSSMTQSVNQNTATMESIEQVDMAVARFIYQAGIPISAISSQY
          KGSK+A SC PSREEIDDGVHRVQN+RRR F+R GK+VLE VTK+AKRKKKH FPTS + QSVNQNT+T+ESIEQ D AVARFIYQAGIPISA+S+Q+
Subjt:  QKKGSKHAVSCLPSREEIDDGVHRVQNSRRRPFRRNGKDVLEGVTKEAKRKKKHLFPTSSMTQSVNQNTATMESIEQVDMAVARFIYQAGIPISAISSQY

Query:  FQQMADAIAAIGPGYKMPTYHSLMGKLLDRSVQDAGEYVEELRKSWEVTGCSVLVDRWMDRTGSVVINFFVYCPKGTMFLKSVDLSEISESPEGLLNLFD
        FQQMADAIAA+GPGYKMPT HSLMGKLLDRSVQDAGEYVEELRKSWEVTGCSVLVDRWMDRTGSVVINFFVYCP+GTMFLKSVDLSEISESPEGLLNLFD
Subjt:  FQQMADAIAAIGPGYKMPTYHSLMGKLLDRSVQDAGEYVEELRKSWEVTGCSVLVDRWMDRTGSVVINFFVYCPKGTMFLKSVDLSEISESPEGLLNLFD

Query:  SIVEEVGLKNIVHFVTDTSPLFKAAGILLVEKYKTFFSSVCAAHCVELILEEIEKMEEVKEVVGKAKRIVQFIYNHVWVLNQIKKRSGGREIIQLASARY
        SIV+EVGLKNIV+FVTDTSPLFKAAG LLVEKYKTFFSSVCAAHCVELILEEIEKMEEVKEVVGKAKRIVQFIYN VWVLNQIKKRSGGREIIQLAS+RY
Subjt:  SIVEEVGLKNIVHFVTDTSPLFKAAGILLVEKYKTFFSSVCAAHCVELILEEIEKMEEVKEVVGKAKRIVQFIYNHVWVLNQIKKRSGGREIIQLASARY

Query:  FSIFLTLQNILSLKDYLHQTFTSGAWMQSNLSKSGAGLEVAKITADPLFWSKCDHISMGTKPLLSVLQFLESEEKPSAGFIYDAFEKAKNSVMLAFNQKE
        FSIFLTLQNI SLK+++ Q FTSGAWMQSNLSKSGAGLEVAKITADP+FWSKCDHI+MGTKPLLSVLQFLESEE+PSAGFIYDAFEKAK++VMLAFNQKE
Subjt:  FSIFLTLQNILSLKDYLHQTFTSGAWMQSNLSKSGAGLEVAKITADPLFWSKCDHISMGTKPLLSVLQFLESEEKPSAGFIYDAFEKAKNSVMLAFNQKE

Query:  SVYLPYLKAIDHVLLKEFQSPLHVAAYYLNPSIFYSP-TFLSSKVIQKGLLDCIEALEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPATWWS
        SVYLPYLKAIDHVLLKEFQS LH+AAYYLNPSIFYSP TF+ SKVIQKGLLDCIEALEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPATWWS
Subjt:  SVYLPYLKAIDHVLLKEFQSPLHVAAYYLNPSIFYSP-TFLSSKVIQKGLLDCIEALEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPATWWS

Query:  LYGTDYPDLQHLSVRILSQTCSIMQCRKSWSMFKYIYLKKKNRLEKQKMNDLAFAHYNLQLQERRLETCKARCSIDVLDPVCLEAIDANMEDWMEDVEAL
        LYGTDYPDLQ L+VRILSQ+C+I+QCRKS S+FKYIY KKKNRLEKQKMNDLAFAHYNLQLQERRLETCK RCSID LDPV LE I ANMEDW+EDVEAL
Subjt:  LYGTDYPDLQHLSVRILSQTCSIMQCRKSWSMFKYIYLKKKNRLEKQKMNDLAFAHYNLQLQERRLETCKARCSIDVLDPVCLEAIDANMEDWMEDVEAL

Query:  EDEHKRWVDVKVTNQ
        EDE +RWVD+K T+Q
Subjt:  EDEHKRWVDVKVTNQ

A0A6J1H1X4 uncharacterized protein LOC111459278 isoform X20.0e+0089.53Show/hide
Query:  RGKVPPRASDPGWAHGIMVNGGRQKIKCKYCHKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIQQLLGFKVLEKLKRQKKGSKHAVSCLPSREEI
        +G  PPRASDPGWAHGIMVNGGRQKIKCKYC+KVMLGGGISRLKQHLAGERGNV PCEEVPEEVKVQI+QLLGFKVL KLKR  KGSK+A SC PSREEI
Subjt:  RGKVPPRASDPGWAHGIMVNGGRQKIKCKYCHKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIQQLLGFKVLEKLKRQKKGSKHAVSCLPSREEI

Query:  DDGVHRVQNSRRRPFRRNGKDVLEGVTKEAKRKKKHLFPTSSMTQSVNQNTATMESIEQVDMAVARFIYQAGIPISAISSQYFQQMADAIAAIGPGYKMP
        DDGVHRVQN+RRR F+R GK+VLE VTK+AKRKKKH FPTS + QSVNQNT+T+ESIEQ D AVARFIYQAGIPISA+S+Q+FQQMADAIAA+GPGYKMP
Subjt:  DDGVHRVQNSRRRPFRRNGKDVLEGVTKEAKRKKKHLFPTSSMTQSVNQNTATMESIEQVDMAVARFIYQAGIPISAISSQYFQQMADAIAAIGPGYKMP

Query:  TYHSLMGKLLDRSVQDAGEYVEELRKSWEVTGCSVLVDRWMDRTGSVVINFFVYCPKGTMFLKSVDLSEISESPEGLLNLFDSIVEEVGLKNIVHFVTDT
        T HSLMGKLLDRSVQDAGEYVEELRKSWEVTGCSVLVDRWMDRTGSVVINFFVYCP+GTMFLKSVDLSEISESPEGLLNLFDSIV+EVGLKNIV+FVTDT
Subjt:  TYHSLMGKLLDRSVQDAGEYVEELRKSWEVTGCSVLVDRWMDRTGSVVINFFVYCPKGTMFLKSVDLSEISESPEGLLNLFDSIVEEVGLKNIVHFVTDT

Query:  SPLFKAAGILLVEKYKTFFSSVCAAHCVELILEEIEKMEEVKEVVGKAKRIVQFIYNHVWVLNQIKKRSGGREIIQLASARYFSIFLTLQNILSLKDYLH
        SPLFKAAG LLVEKYKTFFSSVCAAHCVELILEEIEKMEEVKEVVGKAKRIVQFIYN VWVLNQIKKRSGGREIIQLAS+RYFSIFLTLQNI SLK+++ 
Subjt:  SPLFKAAGILLVEKYKTFFSSVCAAHCVELILEEIEKMEEVKEVVGKAKRIVQFIYNHVWVLNQIKKRSGGREIIQLASARYFSIFLTLQNILSLKDYLH

Query:  QTFTSGAWMQSNLSKSGAGLEVAKITADPLFWSKCDHISMGTKPLLSVLQFLESEEKPSAGFIYDAFEKAKNSVMLAFNQKESVYLPYLKAIDHVLLKEF
        Q FTSGAWMQSNLSKSGAGLEVAKITADP+FWSKCDHI+MGTKPLLSVLQFLESEE+PSAGFIYDAFEKAK++VMLAFNQKESVYLPYLKAIDHVLLKEF
Subjt:  QTFTSGAWMQSNLSKSGAGLEVAKITADPLFWSKCDHISMGTKPLLSVLQFLESEEKPSAGFIYDAFEKAKNSVMLAFNQKESVYLPYLKAIDHVLLKEF

Query:  QSPLHVAAYYLNPSIFYSP-TFLSSKVIQKGLLDCIEALEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPATWWSLYGTDYPDLQHLSVRILS
        QS LH+AAYYLNPSIFYSP TF+ SKVIQKGLLDCIEALEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPATWWSLYGTDYPDLQ L+VRILS
Subjt:  QSPLHVAAYYLNPSIFYSP-TFLSSKVIQKGLLDCIEALEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPATWWSLYGTDYPDLQHLSVRILS

Query:  QTCSIMQCRKSWSMFKYIYLKKKNRLEKQKMNDLAFAHYNLQLQERRLETCKARCSIDVLDPVCLEAIDANMEDWMEDVEALEDEHKRWVDVKVTNQ
        Q+C+I+QCRKS S+FKYIY KKKNRLEKQKMNDLAFAHYNLQLQERRLETCK RCSID LDPV LE I ANMEDW+EDVEALEDE +RWVD+K T+Q
Subjt:  QTCSIMQCRKSWSMFKYIYLKKKNRLEKQKMNDLAFAHYNLQLQERRLETCKARCSIDVLDPVCLEAIDANMEDWMEDVEALEDEHKRWVDVKVTNQ

A0A6J1H3E3 uncharacterized protein LOC111459278 isoform X30.0e+0089.9Show/hide
Query:  PPRASDPGWAHGIMVNGGRQKIKCKYCHKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIQQLLGFKVLEKLKRQKKGSKHAVSCLPSREEIDDGV
        PPRASDPGWAHGIMVNGGRQKIKCKYC+KVMLGGGISRLKQHLAGERGNV PCEEVPEEVKVQI+QLLGFKVL KLKR  KGSK+A SC PSREEIDDGV
Subjt:  PPRASDPGWAHGIMVNGGRQKIKCKYCHKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIQQLLGFKVLEKLKRQKKGSKHAVSCLPSREEIDDGV

Query:  HRVQNSRRRPFRRNGKDVLEGVTKEAKRKKKHLFPTSSMTQSVNQNTATMESIEQVDMAVARFIYQAGIPISAISSQYFQQMADAIAAIGPGYKMPTYHS
        HRVQN+RRR F+R GK+VLE VTK+AKRKKKH FPTS + QSVNQNT+T+ESIEQ D AVARFIYQAGIPISA+S+Q+FQQMADAIAA+GPGYKMPT HS
Subjt:  HRVQNSRRRPFRRNGKDVLEGVTKEAKRKKKHLFPTSSMTQSVNQNTATMESIEQVDMAVARFIYQAGIPISAISSQYFQQMADAIAAIGPGYKMPTYHS

Query:  LMGKLLDRSVQDAGEYVEELRKSWEVTGCSVLVDRWMDRTGSVVINFFVYCPKGTMFLKSVDLSEISESPEGLLNLFDSIVEEVGLKNIVHFVTDTSPLF
        LMGKLLDRSVQDAGEYVEELRKSWEVTGCSVLVDRWMDRTGSVVINFFVYCP+GTMFLKSVDLSEISESPEGLLNLFDSIV+EVGLKNIV+FVTDTSPLF
Subjt:  LMGKLLDRSVQDAGEYVEELRKSWEVTGCSVLVDRWMDRTGSVVINFFVYCPKGTMFLKSVDLSEISESPEGLLNLFDSIVEEVGLKNIVHFVTDTSPLF

Query:  KAAGILLVEKYKTFFSSVCAAHCVELILEEIEKMEEVKEVVGKAKRIVQFIYNHVWVLNQIKKRSGGREIIQLASARYFSIFLTLQNILSLKDYLHQTFT
        KAAG LLVEKYKTFFSSVCAAHCVELILEEIEKMEEVKEVVGKAKRIVQFIYN VWVLNQIKKRSGGREIIQLAS+RYFSIFLTLQNI SLK+++ Q FT
Subjt:  KAAGILLVEKYKTFFSSVCAAHCVELILEEIEKMEEVKEVVGKAKRIVQFIYNHVWVLNQIKKRSGGREIIQLASARYFSIFLTLQNILSLKDYLHQTFT

Query:  SGAWMQSNLSKSGAGLEVAKITADPLFWSKCDHISMGTKPLLSVLQFLESEEKPSAGFIYDAFEKAKNSVMLAFNQKESVYLPYLKAIDHVLLKEFQSPL
        SGAWMQSNLSKSGAGLEVAKITADP+FWSKCDHI+MGTKPLLSVLQFLESEE+PSAGFIYDAFEKAK++VMLAFNQKESVYLPYLKAIDHVLLKEFQS L
Subjt:  SGAWMQSNLSKSGAGLEVAKITADPLFWSKCDHISMGTKPLLSVLQFLESEEKPSAGFIYDAFEKAKNSVMLAFNQKESVYLPYLKAIDHVLLKEFQSPL

Query:  HVAAYYLNPSIFYSP-TFLSSKVIQKGLLDCIEALEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPATWWSLYGTDYPDLQHLSVRILSQTCS
        H+AAYYLNPSIFYSP TF+ SKVIQKGLLDCIEALEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPATWWSLYGTDYPDLQ L+VRILSQ+C+
Subjt:  HVAAYYLNPSIFYSP-TFLSSKVIQKGLLDCIEALEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPATWWSLYGTDYPDLQHLSVRILSQTCS

Query:  IMQCRKSWSMFKYIYLKKKNRLEKQKMNDLAFAHYNLQLQERRLETCKARCSIDVLDPVCLEAIDANMEDWMEDVEALEDEHKRWVDVKVTNQ
        I+QCRKS S+FKYIY KKKNRLEKQKMNDLAFAHYNLQLQERRLETCK RCSID LDPV LE I ANMEDW+EDVEALEDE +RWVD+K T+Q
Subjt:  IMQCRKSWSMFKYIYLKKKNRLEKQKMNDLAFAHYNLQLQERRLETCKARCSIDVLDPVCLEAIDANMEDWMEDVEALEDEHKRWVDVKVTNQ

A0A6J1JMG6 uncharacterized protein LOC111487192 isoform X20.0e+0089.37Show/hide
Query:  RGKVPPRASDPGWAHGIMVNGGRQKIKCKYCHKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIQQLLGFKVLEKLKRQKKGSKHAVSCLPSREEI
        +G VPPRASDPGWAHGIMVNGGRQKIKCKYC+KVMLGGGISRLKQHLAGERGNV PCEEVPEEVKVQI+QLLGFKVL KLKR KKGSK+A SC+ SREEI
Subjt:  RGKVPPRASDPGWAHGIMVNGGRQKIKCKYCHKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIQQLLGFKVLEKLKRQKKGSKHAVSCLPSREEI

Query:  DDGVHRVQNSRRRPFRRNGKDVLEGVTKEAKRKKKHLFPTSSMTQSVNQNTATMESIEQVDMAVARFIYQAGIPISAISSQYFQQMADAIAAIGPGYKMP
        DDGVHRVQN+RRR F+R GK+VLE VTK+AKRKKK+ FPTS +TQSVNQNT+T+ESIEQ DMAVARFIYQAGIPISA+S+Q+FQQMADAIAA+GPGYKMP
Subjt:  DDGVHRVQNSRRRPFRRNGKDVLEGVTKEAKRKKKHLFPTSSMTQSVNQNTATMESIEQVDMAVARFIYQAGIPISAISSQYFQQMADAIAAIGPGYKMP

Query:  TYHSLMGKLLDRSVQDAGEYVEELRKSWEVTGCSVLVDRWMDRTGSVVINFFVYCPKGTMFLKSVDLSEISESPEGLLNLFDSIVEEVGLKNIVHFVTDT
        T HSLMGKLLDRSV+D G YVEELRKSWEVTGCSVLVDRWMDRTGSVVINFFVYCP+GTMFLKSVDLSEISESPEGLLNLFDSIV+EVGLKNIV+FVTDT
Subjt:  TYHSLMGKLLDRSVQDAGEYVEELRKSWEVTGCSVLVDRWMDRTGSVVINFFVYCPKGTMFLKSVDLSEISESPEGLLNLFDSIVEEVGLKNIVHFVTDT

Query:  SPLFKAAGILLVEKYKTFFSSVCAAHCVELILEEIEKMEEVKEVVGKAKRIVQFIYNHVWVLNQIKKRSGGREIIQLASARYFSIFLTLQNILSLKDYLH
        SPL KAAGILLVEKYKTFFSSVCAAHCVELILEE EKMEEVKE+VGKAKRIVQFIYN VWVLNQIKKRSGGREIIQLAS+RYFSIFLTLQNI SLK+++ 
Subjt:  SPLFKAAGILLVEKYKTFFSSVCAAHCVELILEEIEKMEEVKEVVGKAKRIVQFIYNHVWVLNQIKKRSGGREIIQLASARYFSIFLTLQNILSLKDYLH

Query:  QTFTSGAWMQSNLSKSGAGLEVAKITADPLFWSKCDHISMGTKPLLSVLQFLESEEKPSAGFIYDAFEKAKNSVMLAFNQKESVYLPYLKAIDHVLLKEF
        Q FTSGAWMQSN SK+GAGLEVAKITADP+FWSKC+HI+MGTKPLLSV+QFLESEEKPSAGFIYDAFEKAKNSVMLAFNQKESVYLPYLKAIDHVLLKEF
Subjt:  QTFTSGAWMQSNLSKSGAGLEVAKITADPLFWSKCDHISMGTKPLLSVLQFLESEEKPSAGFIYDAFEKAKNSVMLAFNQKESVYLPYLKAIDHVLLKEF

Query:  QSPLHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPATWWSLYGTDYPDLQHLSVRILSQ
        QS LH+AAYYLNPSIFYSPTF+ SKVIQKGLLDCIEALEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPATWWSLYGTDYPDLQ L+VRILSQ
Subjt:  QSPLHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPATWWSLYGTDYPDLQHLSVRILSQ

Query:  TCSIMQCRKSWSMFKYIYLKKKNRLEKQKMNDLAFAHYNLQLQERRLETCKARCSIDVLDPVCLEAIDANMEDWMEDVEALEDEHKRWVDVKVTNQ
        +C+I+QCRKS S+FKYIY KKKNRLEKQKMNDLAFAHYNLQLQERRLETCKARCSID LDPV LE I ANMEDW+EDVEALEDEH+RWVD+K T+Q
Subjt:  TCSIMQCRKSWSMFKYIYLKKKNRLEKQKMNDLAFAHYNLQLQERRLETCKARCSIDVLDPVCLEAIDANMEDWMEDVEALEDEHKRWVDVKVTNQ

A0A6J1JSR3 uncharacterized protein LOC111487192 isoform X10.0e+0089.36Show/hide
Query:  MVESEELLTVTDMKFKEKRGKVPPRASDPGWAHGIMVNGGRQKIKCKYCHKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIQQLLGFKVLEKLKR
        M ESEELL  TDMKFKEKRG VPPRASDPGWAHGIMVNGGRQKIKCKYC+KVMLGGGISRLKQHLAGERGNV PCEEVPEEVKVQI+QLLGFKVL KLKR
Subjt:  MVESEELLTVTDMKFKEKRGKVPPRASDPGWAHGIMVNGGRQKIKCKYCHKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIQQLLGFKVLEKLKR

Query:  QKKGSKHAVSCLPSREEIDDGVHRVQNSRRRPFRRNGKDVLEGVTKEAKRKKKHLFPTSSMTQSVNQNTATMESIEQVDMAVARFIYQAGIPISAISSQY
         KKGSK+A SC+ SREEIDDGVHRVQN+RRR F+R GK+VLE VTK+AKRKKK+ FPTS +TQSVNQNT+T+ESIEQ DMAVARFIYQAGIPISA+S+Q+
Subjt:  QKKGSKHAVSCLPSREEIDDGVHRVQNSRRRPFRRNGKDVLEGVTKEAKRKKKHLFPTSSMTQSVNQNTATMESIEQVDMAVARFIYQAGIPISAISSQY

Query:  FQQMADAIAAIGPGYKMPTYHSLMGKLLDRSVQDAGEYVEELRKSWEVTGCSVLVDRWMDRTGSVVINFFVYCPKGTMFLKSVDLSEISESPEGLLNLFD
        FQQMADAIAA+GPGYKMPT HSLMGKLLDRSV+D G YVEELRKSWEVTGCSVLVDRWMDRTGSVVINFFVYCP+GTMFLKSVDLSEISESPEGLLNLFD
Subjt:  FQQMADAIAAIGPGYKMPTYHSLMGKLLDRSVQDAGEYVEELRKSWEVTGCSVLVDRWMDRTGSVVINFFVYCPKGTMFLKSVDLSEISESPEGLLNLFD

Query:  SIVEEVGLKNIVHFVTDTSPLFKAAGILLVEKYKTFFSSVCAAHCVELILEEIEKMEEVKEVVGKAKRIVQFIYNHVWVLNQIKKRSGGREIIQLASARY
        SIV+EVGLKNIV+FVTDTSPL KAAGILLVEKYKTFFSSVCAAHCVELILEE EKMEEVKE+VGKAKRIVQFIYN VWVLNQIKKRSGGREIIQLAS+RY
Subjt:  SIVEEVGLKNIVHFVTDTSPLFKAAGILLVEKYKTFFSSVCAAHCVELILEEIEKMEEVKEVVGKAKRIVQFIYNHVWVLNQIKKRSGGREIIQLASARY

Query:  FSIFLTLQNILSLKDYLHQTFTSGAWMQSNLSKSGAGLEVAKITADPLFWSKCDHISMGTKPLLSVLQFLESEEKPSAGFIYDAFEKAKNSVMLAFNQKE
        FSIFLTLQNI SLK+++ Q FTSGAWMQSN SK+GAGLEVAKITADP+FWSKC+HI+MGTKPLLSV+QFLESEEKPSAGFIYDAFEKAKNSVMLAFNQKE
Subjt:  FSIFLTLQNILSLKDYLHQTFTSGAWMQSNLSKSGAGLEVAKITADPLFWSKCDHISMGTKPLLSVLQFLESEEKPSAGFIYDAFEKAKNSVMLAFNQKE

Query:  SVYLPYLKAIDHVLLKEFQSPLHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPATWWSL
        SVYLPYLKAIDHVLLKEFQS LH+AAYYLNPSIFYSPTF+ SKVIQKGLLDCIEALEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPATWWSL
Subjt:  SVYLPYLKAIDHVLLKEFQSPLHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPATWWSL

Query:  YGTDYPDLQHLSVRILSQTCSIMQCRKSWSMFKYIYLKKKNRLEKQKMNDLAFAHYNLQLQERRLETCKARCSIDVLDPVCLEAIDANMEDWMEDVEALE
        YGTDYPDLQ L+VRILSQ+C+I+QCRKS S+FKYIY KKKNRLEKQKMNDLAFAHYNLQLQERRLETCKARCSID LDPV LE I ANMEDW+EDVEALE
Subjt:  YGTDYPDLQHLSVRILSQTCSIMQCRKSWSMFKYIYLKKKNRLEKQKMNDLAFAHYNLQLQERRLETCKARCSIDVLDPVCLEAIDANMEDWMEDVEALE

Query:  DEHKRWVDVKVTNQ
        DEH+RWVD+K T+Q
Subjt:  DEHKRWVDVKVTNQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G17450.1 hAT dimerisation domain-containing protein5.3e-11033.14Show/hide
Query:  DPGWAHGIMVNGGRQKIKCKYCHKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIQQLLGFKVLEKLKRQKKGSKHA-----VSCLPSREEIDDGV
        DPGW HGI  +  ++K+KC YC+K+ + GGI+R KQHLA   G VAPC+  PEEV V+I++ + +    K + +      A     VS  P +EE  +  
Subjt:  DPGWAHGIMVNGGRQKIKCKYCHKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIQQLLGFKVLEKLKRQKKGSKHA-----VSCLPSREEIDDGV

Query:  HRVQNSRRRPFRRNG---KDVLEGV-------TKEAKRKKKHLFPTSSMTQSVNQN-----TATMESIEQVDMAVARFIYQAGIPISAISSQYFQQMADA
             S+ R    NG   KD  +           EAK K+  + P  S + S  +      +  + S + V  ++++F++  G+P  A +S YFQ+M + 
Subjt:  HRVQNSRRRPFRRNG---KDVLEGV-------TKEAKRKKKHLFPTSSMTQSVNQN-----TATMESIEQVDMAVARFIYQAGIPISAISSQYFQQMADA

Query:  IAAIGPGYKMPTYHSLMGKLLDRSVQDAGEYVEELRKSWEVTGCSVLVDRWMDRTGSVVINFFVYCPKGTMFLKSVDLSEISESPEGLLNLFDSIVEEVG
        I   G G+ +P+     G+LL   +     Y+ E R SW VTGCS++ D W +  G  +I+F V CP+G  F  S+D ++I E    L    D +V+++G
Subjt:  IAAIGPGYKMPTYHSLMGKLLDRSVQDAGEYVEELRKSWEVTGCSVLVDRWMDRTGSVVINFFVYCPKGTMFLKSVDLSEISESPEGLLNLFDSIVEEVG

Query:  LKNIVHFVTDTSPLFKAAGILLVEKYKTFFSSVCAAHCVELILEEIEKMEEVKEVVGKAKRIVQFIYNHVWVLNQIKKR-SGGREIIQLASARYFSIFLT
         +N+V  +T  + +F++AG LL EK K  + + CA HC EL+LE+  K+E V E + KA+RI +FIYN  W+LN +K   + G ++++ A  R+ S F T
Subjt:  LKNIVHFVTDTSPLFKAAGILLVEKYKTFFSSVCAAHCVELILEEIEKMEEVKEVVGKAKRIVQFIYNHVWVLNQIKKR-SGGREIIQLASARYFSIFLT

Query:  LQNILSLKDYLHQTFTSGAWMQS-NLSKSGAGLEVAKITADPLFWSKCDHISMGTKPLLSVLQFL-ESEEKPSAGFIYDAFEKAKNSVMLAFNQKESVYL
        LQ+++  K  L   F S  W+ S   +KS  G EV K+    +FW K  ++     P++ V+  + +  ++ S  + Y     AK ++    +     Y 
Subjt:  LQNILSLKDYLHQTFTSGAWMQS-NLSKSGAGLEVAKITADPLFWSKCDHISMGTKPLLSVLQFL-ESEEKPSAGFIYDAFEKAKNSVMLAFNQKESVYL

Query:  PYLKAIDHVLLKEFQSPLHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPATWWSLYGTD
        P+ + I++     F  PL+VAAY+ NP+  Y P F++   + +G+ +CI  LEPD T ++     I  Y  A  DFG  +A+  R  L P+ WW  +G  
Subjt:  PYLKAIDHVLLKEFQSPLHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPATWWSLYGTD

Query:  YPDLQHLSVRILSQTCSIMQCRKSWSMFKYIYLKKKNRLEKQKMNDLAFAHYNLQLQERRLETCKARCSIDVLDPVCLE--AIDANMEDWMEDVEALEDE
          +LQ ++VRILS TCS + C   WS++  +  + +++  K+   DL + HYNL+L+E++L   K R   +   P  L    +D  + DW+   E  E+E
Subjt:  YPDLQHLSVRILSQTCSIMQCRKSWSMFKYIYLKKKNRLEKQKMNDLAFAHYNLQLQERRLETCKARCSIDVLDPVCLE--AIDANMEDWMEDVEALEDE

AT3G22220.1 hAT transposon superfamily2.9e-10832.95Show/hide
Query:  PRASDPGWAH-GIMVNGGRQKIKCKYCHKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIQQLLGFKVLEKLKRQKKGSK----------HAVSCL
        P+  D  W H  +   G R +++C YC K+  GGGI+R+K+HLAG++G    C++VP+EV++ +QQ +   V  + KR+K   +             + +
Subjt:  PRASDPGWAH-GIMVNGGRQKIKCKYCHKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIQQLLGFKVLEKLKRQKKGSK----------HAVSCL

Query:  PSREEIDDGV----------HRVQNSRRRPFRRNGKDVLEG---VTKEAKRKKKHLFPTSSMTQSVNQNTATMESIEQVDMAVARFIYQAGIPISAISSQ
         +  ++++G                +++R +R    +  E       E  R   +L P +  +     +  + E  + V MA+ RF++  G    A +S 
Subjt:  PSREEIDDGV----------HRVQNSRRRPFRRNGKDVLEG---VTKEAKRKKKHLFPTSSMTQSVNQNTATMESIEQVDMAVARFIYQAGIPISAISSQ

Query:  YFQQMADAIAAIGPGYKMPTYHSLMGKLLDRSVQDAGEYVEELRKSWEVTGCSVLVDRWMDRTGSVVINFFVYCPKGTMFLKSVDLSEISESPEGLLNLF
          Q   DAI + G G  +PT+  L G +L   V++  + ++E +  W+ TGCSVLV       G +++ F VYCP+  +FLKSVD SEI +S + L  L 
Subjt:  YFQQMADAIAAIGPGYKMPTYHSLMGKLLDRSVQDAGEYVEELRKSWEVTGCSVLVDRWMDRTGSVVINFFVYCPKGTMFLKSVDLSEISESPEGLLNLF

Query:  DSIVEEVGLKNIVHFVTDTSPLFKAAGILLVEKYKTFFSSVCAAHCVELILEEIEKMEEVKEVVGKAKRIVQFIYNHVWVLNQIKKRSGGREIIQLASAR
          +VEE+G  N+V  +T     + AAG  L++ Y + +   CAAHC++ +LEE  KM+ ++E++ +A+ + + IYNH  VLN ++K + G +I+Q     
Subjt:  DSIVEEVGLKNIVHFVTDTSPLFKAAGILLVEKYKTFFSSVCAAHCVELILEEIEKMEEVKEVVGKAKRIVQFIYNHVWVLNQIKKRSGGREIIQLASAR

Query:  YFSIFLTLQNILSLKDYLHQTFTSGAWMQSNLSKSGAGLEVAKITADPLFWSKCDHISMGTKPLLSVLQFLESEEKPSAGFIYDAFEKAKNSVMLAFNQK
          + F T+  I  LK YL    TS  W   + SK   GL + +   D  FW      +  T P+L VL+ + SE KP+ G++Y A  +AK ++      +
Subjt:  YFSIFLTLQNILSLKDYLHQTFTSGAWMQSNLSKSGAGLEVAKITADPLFWSKCDHISMGTKPLLSVLQFLESEEKPSAGFIYDAFEKAKNSVMLAFNQK

Query:  ESVYLPYLKAIDHVLLKEFQSPLHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPATWWS
        E  Y+ Y K ID   L   Q PL+ A +YLNP  FYS        I   ++DCIE L PD+  Q ++  +IN Y+ AVG FGR +A+  RD++ PA WWS
Subjt:  ESVYLPYLKAIDHVLLKEFQSPLHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPATWWS

Query:  LYGTDYPDLQHLSVRILSQTC-SIMQCRKSWSMFKYIYLKKKNRLEKQKMNDLAFAHYNLQLQERRLETCKARCSIDVLDPVCLEAIDANMEDWM
         YG    +L   ++RILSQTC S +   ++ +    IY + KN +E+Q++NDL F  YN++L+    E+       D +DP+    ++  +EDW+
Subjt:  LYGTDYPDLQHLSVRILSQTC-SIMQCRKSWSMFKYIYLKKKNRLEKQKMNDLAFAHYNLQLQERRLETCKARCSIDVLDPVCLEAIDANMEDWM

AT3G22220.2 hAT transposon superfamily2.9e-10832.95Show/hide
Query:  PRASDPGWAH-GIMVNGGRQKIKCKYCHKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIQQLLGFKVLEKLKRQKKGSK----------HAVSCL
        P+  D  W H  +   G R +++C YC K+  GGGI+R+K+HLAG++G    C++VP+EV++ +QQ +   V  + KR+K   +             + +
Subjt:  PRASDPGWAH-GIMVNGGRQKIKCKYCHKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIQQLLGFKVLEKLKRQKKGSK----------HAVSCL

Query:  PSREEIDDGV----------HRVQNSRRRPFRRNGKDVLEG---VTKEAKRKKKHLFPTSSMTQSVNQNTATMESIEQVDMAVARFIYQAGIPISAISSQ
         +  ++++G                +++R +R    +  E       E  R   +L P +  +     +  + E  + V MA+ RF++  G    A +S 
Subjt:  PSREEIDDGV----------HRVQNSRRRPFRRNGKDVLEG---VTKEAKRKKKHLFPTSSMTQSVNQNTATMESIEQVDMAVARFIYQAGIPISAISSQ

Query:  YFQQMADAIAAIGPGYKMPTYHSLMGKLLDRSVQDAGEYVEELRKSWEVTGCSVLVDRWMDRTGSVVINFFVYCPKGTMFLKSVDLSEISESPEGLLNLF
          Q   DAI + G G  +PT+  L G +L   V++  + ++E +  W+ TGCSVLV       G +++ F VYCP+  +FLKSVD SEI +S + L  L 
Subjt:  YFQQMADAIAAIGPGYKMPTYHSLMGKLLDRSVQDAGEYVEELRKSWEVTGCSVLVDRWMDRTGSVVINFFVYCPKGTMFLKSVDLSEISESPEGLLNLF

Query:  DSIVEEVGLKNIVHFVTDTSPLFKAAGILLVEKYKTFFSSVCAAHCVELILEEIEKMEEVKEVVGKAKRIVQFIYNHVWVLNQIKKRSGGREIIQLASAR
          +VEE+G  N+V  +T     + AAG  L++ Y + +   CAAHC++ +LEE  KM+ ++E++ +A+ + + IYNH  VLN ++K + G +I+Q     
Subjt:  DSIVEEVGLKNIVHFVTDTSPLFKAAGILLVEKYKTFFSSVCAAHCVELILEEIEKMEEVKEVVGKAKRIVQFIYNHVWVLNQIKKRSGGREIIQLASAR

Query:  YFSIFLTLQNILSLKDYLHQTFTSGAWMQSNLSKSGAGLEVAKITADPLFWSKCDHISMGTKPLLSVLQFLESEEKPSAGFIYDAFEKAKNSVMLAFNQK
          + F T+  I  LK YL    TS  W   + SK   GL + +   D  FW      +  T P+L VL+ + SE KP+ G++Y A  +AK ++      +
Subjt:  YFSIFLTLQNILSLKDYLHQTFTSGAWMQSNLSKSGAGLEVAKITADPLFWSKCDHISMGTKPLLSVLQFLESEEKPSAGFIYDAFEKAKNSVMLAFNQK

Query:  ESVYLPYLKAIDHVLLKEFQSPLHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPATWWS
        E  Y+ Y K ID   L   Q PL+ A +YLNP  FYS        I   ++DCIE L PD+  Q ++  +IN Y+ AVG FGR +A+  RD++ PA WWS
Subjt:  ESVYLPYLKAIDHVLLKEFQSPLHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPATWWS

Query:  LYGTDYPDLQHLSVRILSQTC-SIMQCRKSWSMFKYIYLKKKNRLEKQKMNDLAFAHYNLQLQERRLETCKARCSIDVLDPVCLEAIDANMEDWM
         YG    +L   ++RILSQTC S +   ++ +    IY + KN +E+Q++NDL F  YN++L+    E+       D +DP+    ++  +EDW+
Subjt:  LYGTDYPDLQHLSVRILSQTC-SIMQCRKSWSMFKYIYLKKKNRLEKQKMNDLAFAHYNLQLQERRLETCKARCSIDVLDPVCLEAIDANMEDWM

AT4G15020.1 hAT transposon superfamily1.8e-10533.06Show/hide
Query:  PRASDPGWAH-GIMVNGGRQKIKCKYCHKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIQQLLGFKVLEKLKRQKKGSKH-AVSCLPSREEIDDG
        P+  D  W H  I   G R +++C YC K+  GGGI+R+K+HLAG++G    C++VPE+V++ +QQ +   V  + KR K  S+  +V+ LP    I+  
Subjt:  PRASDPGWAH-GIMVNGGRQKIKCKYCHKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIQQLLGFKVLEKLKRQKKGSKH-AVSCLPSREEIDDG

Query:  VHRVQNSRRRPFRRNG--------KDVLEGVTKE-AKRKKKHLFPTSSMTQSVNQNTATMESI--------------------EQVDMAVARFIYQAGIP
        +  VQ      F+  G        + +L G TK+   R KK+ F   S + +V+     M+++                      + MA+ RF++  G  
Subjt:  VHRVQNSRRRPFRRNG--------KDVLEGVTKE-AKRKKKHLFPTSSMTQSVNQNTATMESI--------------------EQVDMAVARFIYQAGIP

Query:  ISAISSQYFQQMADAIAAIGPGYKMPTYHSLMGKLLDRSVQDAGEYVEELRKSWEVTGCSVLVDRWMDRTGSVVINFFVYCPKGTMFLKSVDLSEISESP
          A++S  FQ M DAIA+ G G   PT+  L G +L   V++  + ++E +  W+ TGCS+LV+      G  V+NF VYCP+  +FLKSVD SE+  S 
Subjt:  ISAISSQYFQQMADAIAAIGPGYKMPTYHSLMGKLLDRSVQDAGEYVEELRKSWEVTGCSVLVDRWMDRTGSVVINFFVYCPKGTMFLKSVDLSEISESP

Query:  EGLLNLFDSIVEEVGLKNIVHFVTDTSPLFKAAGILLVEKYKTFFSSVCAAHCVELILEEIEKMEEVKEVVGKAKRIVQFIYNHVWVLNQIKKRSGGREI
        + L  L   +VEEVG  N+V  +T     +  AG  L+  Y + +   CAAHC++ +LEE  K+  + E + +A+ I +F+YNH  VLN + K + G +I
Subjt:  EGLLNLFDSIVEEVGLKNIVHFVTDTSPLFKAAGILLVEKYKTFFSSVCAAHCVELILEEIEKMEEVKEVVGKAKRIVQFIYNHVWVLNQIKKRSGGREI

Query:  IQLASARYFSIFLTLQNILSLKDYLHQTFTSGAWMQSNLSKSGAGLEVAKITADPLFWSKCDHISMGTKPLLSVLQFLESEEKPSAGFIYDAFEKAKNSV
        +  A +   + F TL  I  LK  L    TS  W + + S+  +GL +  +T D  FW     ++  T PLL  L+ + SE++P+ G++Y A  +AK+++
Subjt:  IQLASARYFSIFLTLQNILSLKDYLHQTFTSGAWMQSNLSKSGAGLEVAKITADPLFWSKCDHISMGTKPLLSVLQFLESEEKPSAGFIYDAFEKAKNSV

Query:  MLAFNQKESVYLPYLKAIDHVLLKEFQSPLHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSL
              +E  Y+ Y K ID    ++   PL  A ++LNP +FY+        +   +LDCIE L PD   Q  I   +  Y+ A G FGR +A+  RD++
Subjt:  MLAFNQKESVYLPYLKAIDHVLLKEFQSPLHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSL

Query:  APATWWSLYGTDYPDLQHLSVRILSQTC-SIMQCRKSWSMFKYIYLKKKNRLEKQKMNDLAFAHYNLQLQERRLETCKARCSIDVLDPVCLEAIDANMED
         PA WWS YG    +L   ++RILSQTC S + CR++    ++IY + KN +E+++++DL F  YN+     RL         D LDP+    ID  +++
Subjt:  APATWWSLYGTDYPDLQHLSVRILSQTC-SIMQCRKSWSMFKYIYLKKKNRLEKQKMNDLAFAHYNLQLQERRLETCKARCSIDVLDPVCLEAIDANMED

Query:  WME------------DVEALEDEHKRWVDVKVTNQETL
        W+             D ++LE  H+  V   + + E L
Subjt:  WME------------DVEALEDEHKRWVDVKVTNQETL

AT4G15020.2 hAT transposon superfamily1.8e-10533.06Show/hide
Query:  PRASDPGWAH-GIMVNGGRQKIKCKYCHKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIQQLLGFKVLEKLKRQKKGSKH-AVSCLPSREEIDDG
        P+  D  W H  I   G R +++C YC K+  GGGI+R+K+HLAG++G    C++VPE+V++ +QQ +   V  + KR K  S+  +V+ LP    I+  
Subjt:  PRASDPGWAH-GIMVNGGRQKIKCKYCHKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIQQLLGFKVLEKLKRQKKGSKH-AVSCLPSREEIDDG

Query:  VHRVQNSRRRPFRRNG--------KDVLEGVTKE-AKRKKKHLFPTSSMTQSVNQNTATMESI--------------------EQVDMAVARFIYQAGIP
        +  VQ      F+  G        + +L G TK+   R KK+ F   S + +V+     M+++                      + MA+ RF++  G  
Subjt:  VHRVQNSRRRPFRRNG--------KDVLEGVTKE-AKRKKKHLFPTSSMTQSVNQNTATMESI--------------------EQVDMAVARFIYQAGIP

Query:  ISAISSQYFQQMADAIAAIGPGYKMPTYHSLMGKLLDRSVQDAGEYVEELRKSWEVTGCSVLVDRWMDRTGSVVINFFVYCPKGTMFLKSVDLSEISESP
          A++S  FQ M DAIA+ G G   PT+  L G +L   V++  + ++E +  W+ TGCS+LV+      G  V+NF VYCP+  +FLKSVD SE+  S 
Subjt:  ISAISSQYFQQMADAIAAIGPGYKMPTYHSLMGKLLDRSVQDAGEYVEELRKSWEVTGCSVLVDRWMDRTGSVVINFFVYCPKGTMFLKSVDLSEISESP

Query:  EGLLNLFDSIVEEVGLKNIVHFVTDTSPLFKAAGILLVEKYKTFFSSVCAAHCVELILEEIEKMEEVKEVVGKAKRIVQFIYNHVWVLNQIKKRSGGREI
        + L  L   +VEEVG  N+V  +T     +  AG  L+  Y + +   CAAHC++ +LEE  K+  + E + +A+ I +F+YNH  VLN + K + G +I
Subjt:  EGLLNLFDSIVEEVGLKNIVHFVTDTSPLFKAAGILLVEKYKTFFSSVCAAHCVELILEEIEKMEEVKEVVGKAKRIVQFIYNHVWVLNQIKKRSGGREI

Query:  IQLASARYFSIFLTLQNILSLKDYLHQTFTSGAWMQSNLSKSGAGLEVAKITADPLFWSKCDHISMGTKPLLSVLQFLESEEKPSAGFIYDAFEKAKNSV
        +  A +   + F TL  I  LK  L    TS  W + + S+  +GL +  +T D  FW     ++  T PLL  L+ + SE++P+ G++Y A  +AK+++
Subjt:  IQLASARYFSIFLTLQNILSLKDYLHQTFTSGAWMQSNLSKSGAGLEVAKITADPLFWSKCDHISMGTKPLLSVLQFLESEEKPSAGFIYDAFEKAKNSV

Query:  MLAFNQKESVYLPYLKAIDHVLLKEFQSPLHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSL
              +E  Y+ Y K ID    ++   PL  A ++LNP +FY+        +   +LDCIE L PD   Q  I   +  Y+ A G FGR +A+  RD++
Subjt:  MLAFNQKESVYLPYLKAIDHVLLKEFQSPLHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSL

Query:  APATWWSLYGTDYPDLQHLSVRILSQTC-SIMQCRKSWSMFKYIYLKKKNRLEKQKMNDLAFAHYNLQLQERRLETCKARCSIDVLDPVCLEAIDANMED
         PA WWS YG    +L   ++RILSQTC S + CR++    ++IY + KN +E+++++DL F  YN+     RL         D LDP+    ID  +++
Subjt:  APATWWSLYGTDYPDLQHLSVRILSQTC-SIMQCRKSWSMFKYIYLKKKNRLEKQKMNDLAFAHYNLQLQERRLETCKARCSIDVLDPVCLEAIDANMED

Query:  WME------------DVEALEDEHKRWVDVKVTNQETL
        W+             D ++LE  H+  V   + + E L
Subjt:  WME------------DVEALEDEHKRWVDVKVTNQETL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAGGTTATGAACATCCAATGGTGGAGTCTGAGGAGCTTCTCACGGTCACGGATATGAAATTCAAGGAAAAACGAGGCAAGGTACCTCCACGAGCTTCTGATCCTGG
TTGGGCTCATGGAATTATGGTCAATGGGGGTCGCCAGAAGATTAAATGCAAATACTGTCATAAAGTTATGCTTGGGGGTGGTATATCCAGACTAAAGCAACATCTAGCCG
GGGAAAGAGGAAATGTAGCTCCATGTGAGGAAGTTCCAGAAGAAGTTAAGGTGCAGATTCAACAACTTTTAGGCTTTAAAGTGTTGGAGAAGCTTAAACGGCAGAAAAAG
GGTAGCAAACATGCAGTGTCATGCCTCCCAAGTAGGGAAGAAATAGACGATGGGGTACACCGGGTTCAGAATAGTCGACGACGCCCTTTCCGGAGAAATGGGAAGGATGT
ATTGGAAGGAGTTACTAAGGAAGCAAAGAGGAAAAAGAAACACCTTTTTCCAACATCTTCCATGACTCAATCCGTTAACCAGAATACTGCTACAATGGAAAGCATAGAAC
AAGTTGATATGGCTGTTGCCAGATTTATATACCAAGCTGGTATACCAATTAGTGCCATAAGCTCACAATATTTCCAACAAATGGCTGATGCCATTGCTGCTATAGGCCCT
GGTTATAAGATGCCTACCTATCATTCTTTGATGGGTAAATTGCTTGACAGAAGTGTCCAGGATGCTGGGGAGTATGTCGAAGAGTTGAGAAAGTCTTGGGAGGTTACAGG
GTGCTCAGTCTTGGTTGATAGGTGGATGGATAGAACTGGTTCTGTTGTCATAAACTTTTTTGTCTATTGTCCCAAGGGTACCATGTTCCTTAAGTCTGTTGATTTATCGG
AAATATCAGAATCACCTGAGGGGCTTCTAAATTTATTTGACAGCATTGTTGAAGAAGTTGGACTGAAGAATATAGTCCATTTTGTGACAGATACTTCTCCCTTGTTTAAA
GCTGCAGGTATACTCTTGGTCGAAAAATACAAGACATTTTTCTCAAGTGTTTGTGCTGCACATTGTGTGGAGTTAATCCTCGAGGAAATTGAGAAAATGGAAGAAGTAAA
AGAGGTTGTTGGGAAAGCGAAGAGGATAGTTCAATTCATATACAACCATGTCTGGGTCCTAAATCAAATAAAGAAGAGAAGTGGTGGAAGAGAGATTATTCAACTTGCAT
CTGCAAGATATTTCTCCATCTTCTTGACTCTGCAAAACATTTTATCTTTGAAAGACTATCTTCATCAGACGTTCACCAGTGGTGCTTGGATGCAGTCAAATTTGTCGAAG
TCTGGGGCTGGACTTGAGGTGGCAAAGATCACTGCTGATCCACTCTTCTGGTCCAAGTGTGATCATATTTCAATGGGAACGAAACCTTTACTTTCTGTGTTGCAATTTCT
TGAATCAGAGGAGAAGCCATCTGCCGGGTTTATATATGATGCATTTGAAAAAGCAAAGAACAGTGTCATGCTGGCTTTCAACCAAAAGGAATCTGTCTACTTGCCATATT
TAAAAGCCATTGACCATGTTTTGCTGAAGGAATTTCAGAGCCCTCTGCATGTGGCTGCATACTACCTAAATCCATCGATATTCTATAGTCCTACGTTCTTATCCAGCAAA
GTTATTCAAAAGGGTTTACTTGATTGCATCGAAGCCTTAGAGCCAGATATAACATCCCAGGTTATGATAACAAACAACATAAATTTCTATGAGGAAGCTGTTGGAGATTT
TGGGCGGCCAGTGGCATTACATGGTCGAGATTCATTGGCCCCAGCTACTTGGTGGTCATTGTATGGAACCGATTATCCAGATTTACAACACTTGTCCGTTAGAATATTGA
GTCAGACTTGCAGCATTATGCAATGTCGTAAAAGCTGGAGCATGTTCAAATATATCTATTTAAAGAAGAAGAACCGGCTGGAAAAGCAGAAGATGAATGACCTTGCATTT
GCTCATTATAACTTGCAACTCCAGGAGAGGAGATTGGAGACTTGTAAAGCAAGATGCTCAATAGATGTACTTGATCCTGTTTGTTTGGAGGCCATTGATGCGAACATGGA
AGATTGGATGGAGGATGTTGAGGCATTGGAGGATGAGCACAAGAGGTGGGTGGATGTGAAGGTCACTAATCAGGAGACCTTGGTGGAACATAAATTATCCAATATGGATA
GTTGTATTGACAGCACAGATGAGAGAGTCAGTGAGGACACTAGAGGTATAGATCGTAATGATTTGTAG
mRNA sequenceShow/hide mRNA sequence
CTCTGATAACCTGTGTTTGGGTAGCTGAAAAATGAAGAACAAAGAAACTTTCCAATTCGAGAAATTTTCTTTTACGAATGTACTTTTGATCTCAGCAACAAAATCGTGCT
GGCGTACTCCTTGTTGCATGTCATTCAGTCTACTTGAACTTCTTATGCAGATGAACTTTCCTTCTCTACTTTTGTTTGTCATTCATTGGGTTACTAATTATGAACTTTGT
TTATTTGAGCTTCTTTTCTCAACTTCTGCCTCTTATCATTGTTAATTTACAAGCAGGGCGATGCACAGTGCTTTATCTTCTTTCAAAATTCAGCTTATCTGAATAATAAA
CTCGTATTCACTGAACATTATTGTCCGACTTCAGATCAAGTTGAACTAGTTTAATTGTTTGTTTGTTTGTTTTTTTTTTTTTGAAATGGATGCATTTGAAACGGGTAACC
TGGGGAGGATTTATTATTATTATTTGTTGTGGTGGAGAACTTGATCATATAGTTAGTTTCATTTCAAGTTTTTTCAGACTAAGCTTACGATTTCTGTTATTTTGAGTCTG
CATTCCCTCCCCTCGTTGAAACCAGGAATTTACAGATTCACTCAAGTGTTCAGTGAAATGGTAATGGTTTGGTCAAAAAGTTTAATTTATTCGTGTGGTGAAATCTTACT
CAACTTCTTTGATGGATTGAAGCGATTCCATAAGGTGAAGTGAAATCATAGTCCAACACAAGAAAAAAAAACTTATATGAGAAACAATTGCCAGAGGAGCCACCAAATCT
TGCACGTAAACTGCCCAAAAAGAACAAGCAAGCTAAATTTTGTTCAAAACATCTCGGGCATTGGAGCCATGTTTCATAATATACATTTACAACCTTCAGATATAAATTTA
GTCTAGAAATTGGATGAACAGGGTATAATAATTGATTAAACCATGCAACAATCCGGTGCTAAGGTATTATAAATATGAGCCGAATGGAGTTTGAGACACGTTGACTGCAA
ATTATCATAGTTGTACGATTTGATTTATCATTTAGTTATTTATTTTTAAAACTATTGTCACGTTTGATTCAACAAGATATGTCAGGTTATGAACATCCAATGGTGGAGTC
TGAGGAGCTTCTCACGGTCACGGATATGAAATTCAAGGAAAAACGAGGCAAGGTACCTCCACGAGCTTCTGATCCTGGTTGGGCTCATGGAATTATGGTCAATGGGGGTC
GCCAGAAGATTAAATGCAAATACTGTCATAAAGTTATGCTTGGGGGTGGTATATCCAGACTAAAGCAACATCTAGCCGGGGAAAGAGGAAATGTAGCTCCATGTGAGGAA
GTTCCAGAAGAAGTTAAGGTGCAGATTCAACAACTTTTAGGCTTTAAAGTGTTGGAGAAGCTTAAACGGCAGAAAAAGGGTAGCAAACATGCAGTGTCATGCCTCCCAAG
TAGGGAAGAAATAGACGATGGGGTACACCGGGTTCAGAATAGTCGACGACGCCCTTTCCGGAGAAATGGGAAGGATGTATTGGAAGGAGTTACTAAGGAAGCAAAGAGGA
AAAAGAAACACCTTTTTCCAACATCTTCCATGACTCAATCCGTTAACCAGAATACTGCTACAATGGAAAGCATAGAACAAGTTGATATGGCTGTTGCCAGATTTATATAC
CAAGCTGGTATACCAATTAGTGCCATAAGCTCACAATATTTCCAACAAATGGCTGATGCCATTGCTGCTATAGGCCCTGGTTATAAGATGCCTACCTATCATTCTTTGAT
GGGTAAATTGCTTGACAGAAGTGTCCAGGATGCTGGGGAGTATGTCGAAGAGTTGAGAAAGTCTTGGGAGGTTACAGGGTGCTCAGTCTTGGTTGATAGGTGGATGGATA
GAACTGGTTCTGTTGTCATAAACTTTTTTGTCTATTGTCCCAAGGGTACCATGTTCCTTAAGTCTGTTGATTTATCGGAAATATCAGAATCACCTGAGGGGCTTCTAAAT
TTATTTGACAGCATTGTTGAAGAAGTTGGACTGAAGAATATAGTCCATTTTGTGACAGATACTTCTCCCTTGTTTAAAGCTGCAGGTATACTCTTGGTCGAAAAATACAA
GACATTTTTCTCAAGTGTTTGTGCTGCACATTGTGTGGAGTTAATCCTCGAGGAAATTGAGAAAATGGAAGAAGTAAAAGAGGTTGTTGGGAAAGCGAAGAGGATAGTTC
AATTCATATACAACCATGTCTGGGTCCTAAATCAAATAAAGAAGAGAAGTGGTGGAAGAGAGATTATTCAACTTGCATCTGCAAGATATTTCTCCATCTTCTTGACTCTG
CAAAACATTTTATCTTTGAAAGACTATCTTCATCAGACGTTCACCAGTGGTGCTTGGATGCAGTCAAATTTGTCGAAGTCTGGGGCTGGACTTGAGGTGGCAAAGATCAC
TGCTGATCCACTCTTCTGGTCCAAGTGTGATCATATTTCAATGGGAACGAAACCTTTACTTTCTGTGTTGCAATTTCTTGAATCAGAGGAGAAGCCATCTGCCGGGTTTA
TATATGATGCATTTGAAAAAGCAAAGAACAGTGTCATGCTGGCTTTCAACCAAAAGGAATCTGTCTACTTGCCATATTTAAAAGCCATTGACCATGTTTTGCTGAAGGAA
TTTCAGAGCCCTCTGCATGTGGCTGCATACTACCTAAATCCATCGATATTCTATAGTCCTACGTTCTTATCCAGCAAAGTTATTCAAAAGGGTTTACTTGATTGCATCGA
AGCCTTAGAGCCAGATATAACATCCCAGGTTATGATAACAAACAACATAAATTTCTATGAGGAAGCTGTTGGAGATTTTGGGCGGCCAGTGGCATTACATGGTCGAGATT
CATTGGCCCCAGCTACTTGGTGGTCATTGTATGGAACCGATTATCCAGATTTACAACACTTGTCCGTTAGAATATTGAGTCAGACTTGCAGCATTATGCAATGTCGTAAA
AGCTGGAGCATGTTCAAATATATCTATTTAAAGAAGAAGAACCGGCTGGAAAAGCAGAAGATGAATGACCTTGCATTTGCTCATTATAACTTGCAACTCCAGGAGAGGAG
ATTGGAGACTTGTAAAGCAAGATGCTCAATAGATGTACTTGATCCTGTTTGTTTGGAGGCCATTGATGCGAACATGGAAGATTGGATGGAGGATGTTGAGGCATTGGAGG
ATGAGCACAAGAGGTGGGTGGATGTGAAGGTCACTAATCAGGAGACCTTGGTGGAACATAAATTATCCAATATGGATAGTTGTATTGACAGCACAGATGAGAGAGTCAGT
GAGGACACTAGAGGTATAGATCGTAATGATTTGTAGCCTCTGTTTTGTTGTTCAAAAGCAGCCGATTACCTTATCTATAAATAATTAGTTTATGAATGCTCTAATTCTGT
TGTATAATCTGCTAAATGTGCGTTGTATCTAATTTCCATCTGTTTCGGAAAGTTGATATTCATTATATTGTATAGAGGAATTTGATGAGCATAAGTTCTTGTTTTTTGTT
TTTTTCCCCCTCCTTCTCTGGCAAATTTTACGCTGTGATGACAATATCTGGCTATTAAAGATCAGATA
Protein sequenceShow/hide protein sequence
MSGYEHPMVESEELLTVTDMKFKEKRGKVPPRASDPGWAHGIMVNGGRQKIKCKYCHKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIQQLLGFKVLEKLKRQKK
GSKHAVSCLPSREEIDDGVHRVQNSRRRPFRRNGKDVLEGVTKEAKRKKKHLFPTSSMTQSVNQNTATMESIEQVDMAVARFIYQAGIPISAISSQYFQQMADAIAAIGP
GYKMPTYHSLMGKLLDRSVQDAGEYVEELRKSWEVTGCSVLVDRWMDRTGSVVINFFVYCPKGTMFLKSVDLSEISESPEGLLNLFDSIVEEVGLKNIVHFVTDTSPLFK
AAGILLVEKYKTFFSSVCAAHCVELILEEIEKMEEVKEVVGKAKRIVQFIYNHVWVLNQIKKRSGGREIIQLASARYFSIFLTLQNILSLKDYLHQTFTSGAWMQSNLSK
SGAGLEVAKITADPLFWSKCDHISMGTKPLLSVLQFLESEEKPSAGFIYDAFEKAKNSVMLAFNQKESVYLPYLKAIDHVLLKEFQSPLHVAAYYLNPSIFYSPTFLSSK
VIQKGLLDCIEALEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPATWWSLYGTDYPDLQHLSVRILSQTCSIMQCRKSWSMFKYIYLKKKNRLEKQKMNDLAF
AHYNLQLQERRLETCKARCSIDVLDPVCLEAIDANMEDWMEDVEALEDEHKRWVDVKVTNQETLVEHKLSNMDSCIDSTDERVSEDTRGIDRNDL