CuGenDBv2

ClCG01G008230 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG01G008230
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Transcription factor TFIIIB component B''-like isoform X3
Genome location: CG_Chr01:9532621..9543488
RNA-Seq Expression: ClCG01G008230
Synteny: ClCG01G008230
Gene Ontology terms:
  GO:0070898 - RNA polymerase III preinitiation complex assembly (biological process)
  GO:0000126 - transcription factor TFIIIB complex (cellular component)
  GO:0001156 - TFIIIC-class transcription factor complex binding (molecular function)
InterPro domains:
  IPR001005 - SANT/Myb domain
  IPR009057 - Homeobox-like domain superfamily
  IPR017884 - SANT domain
  IPR039467 - Transcription factor TFIIIB component B'', Myb domain
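
The genome location field above packs a sequence name and two coordinates into one string. As a minimal sketch, assuming the `name:start..end` layout seen in this record and assuming the coordinates are 1-based and inclusive (the page does not state this), the locus span can be derived as follows. The names `parse_location` and `locus_length` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re

# Assumed layout: "<sequence name>:<start>..<end>", as in the Genome location field.
LOCATION_RE = re.compile(r"^(?P<seq>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split a location string into (sequence name, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return match.group("seq"), int(match.group("start")), int(match.group("end"))


def locus_length(start, end):
    """Length of the locus, assuming 1-based inclusive coordinates."""
    return end - start + 1


seq, start, end = parse_location("CG_Chr01:9532621..9543488")
print(seq, start, end, locus_length(start, end))
# prints: CG_Chr01 9532621 9543488 10868
```

Under that assumption the gene spans 10,868 bp on CG_Chr01, which is longer than the CDS because the genomic interval also covers introns and untranslated regions.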


Homology
GenBank top hits (e value, % identity, alignment)
XP_011655158.1 uncharacterized protein LOC101216268 [Cucumis sativus] | e value: 0.0e+00 | % identity: 78.95
Query:  VDPFDEFFSDPGVTSRAGGRFQPKIKPRPKKQTLAPK-STLSQDKKGTIPDTKSCHDGSGSSKSIKLSSQLPVMEEKGESEDDLLLATARSDLIGCSHPT
        +DPFD+ FS+  VT+RAG RFQPK KPRPKKQTLAP+ S  SQD KGTI D KSC D  G++KSIK SSQLPV EEK ESED LL  TARSD IGCS PT
Subjt:  VDPFDEFFSDPGVTSRAGGRFQPKIKPRPKKQTLAPK-STLSQDKKGTIPDTKSCHDGSGSSKSIKLSSQLPVMEEKGESEDDLLLATARSDLIGCSHPT

Query:  SVESAKMVNSTQFDLDSYGSTLCSGSTIDGGVTDAIDLTTFSSVPVG-ENLTDDTKTSGILNNSHPSVSSAHEATVLDQSGLGSIQSEGGHFNDGKIAGE
        SVES K+V+STQFDLD  GS L SGSTI+ GVTDAID TT  S PVG + LTDD K S +L  SHPS SSAHEA  +DQ G+GSIQSE  H  DGKIAG+
Subjt:  SVESAKMVNSTQFDLDSYGSTLCSGSTIDGGVTDAIDLTTFSSVPVG-ENLTDDTKTSGILNNSHPSVSSAHEATVLDQSGLGSIQSEGGHFNDGKIAGE

Query:  NIDLFYELECLDDFHNQPKNEADPSSLKQASISNEDGDLDKQRLETEVKVLFCYQEKLFLLKIFSSLVHIWSRFQEYGAGAIITMDTISSGTTTPSERPA
        NIDLFYELECLDDFHNQP+NE DPSSLKQA+ISNE GDLDKQRLE E                            E GA A +TMDT+SS TTTPSER A
Subjt:  NIDLFYELECLDDFHNQPKNEADPSSLKQASISNEDGDLDKQRLETEVKVLFCYQEKLFLLKIFSSLVHIWSRFQEYGAGAIITMDTISSGTTTPSERPA

Query:  CKYIPKPKMRTAEDACTQISQPEISNMLPLSPQVNSCDTRCMHEASIGTHSDGILNDSSINFDGYTPDNQHTETPVNVESLAYDSYGDILVDDFNSDDQD
        CKYIPKPKMRTA DACTQISQPEISNMLP SPQV SCDT  M+EASIGTHSDG+LNDSSINFDGY P NQ TETPVNVES  +DSYGDILVDDFNSDDQD
Subjt:  CKYIPKPKMRTAEDACTQISQPEISNMLPLSPQVNSCDTRCMHEASIGTHSDGILNDSSINFDGYTPDNQHTETPVNVESLAYDSYGDILVDDFNSDDQD

Query:  EMLREESGKNDEEEPSTESNISQQQKMFSPVGEEIEHSKTSRKLRKKVSHQLDEPEDGVDENRNSPNEPSSNCDVHGDSYNKNEIPKGVRGKKTSTKSSK
         MLREE+GKNDEEEPS +SN+SQQQK+   VGEEIEHSKTSRKLRKKVSHQLDEPEDGVD NR  PNEPSSN  +HG+ YNKNE PKG +G+KTSTKSSK
Subjt:  EMLREESGKNDEEEPSTESNISQQQKMFSPVGEEIEHSKTSRKLRKKVSHQLDEPEDGVDENRNSPNEPSSNCDVHGDSYNKNEIPKGVRGKKTSTKSSK

Query:  PSSDNEKPTRKRKDANKAVPDLQAEKRPKKFSHSTRRNRRQVNKVLLETPEDEIDFQKISFRDLIIYHEHKEKLEKKVASTRKSATNQRTDTSGEEIYND
        PSS+NEKPTRKRK+ANKAVPDLQAEKRPKKFSHSTRRNRRQVNKVLLETPED+IDFQKISFRDLIIYHEHKEKLEKKVASTRKS TNQRTDTS EEIYND
Subjt:  PSSDNEKPTRKRKDANKAVPDLQAEKRPKKFSHSTRRNRRQVNKVLLETPEDEIDFQKISFRDLIIYHEHKEKLEKKVASTRKSATNQRTDTSGEEIYND

Query:  GEESLASEQGRGTDDDETPDVVDMTSAYFNYQSFMDKTPRTKWSKQDTERFYEAVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAITNRS
        GEE+LASEQG+GTDDDE PDVVDMTSAYFNYQSFMDKTPRTKWSK DTERFYEAVRQFGTDFCMIQQLFPG+TRRQIKLKFKSEERHHPFRLSDAITNR+
Subjt:  GEESLASEQGRGTDDDETPDVVDMTSAYFNYQSFMDKTPRTKWSKQDTERFYEAVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAITNRS

Query:  KVKYFADHSQFLSLIGQLQEAANKAKHESNQDELTENTGDEEQPELSPETNEEEVAKPVGVEETEKEEFVGGEVHSPLKA-----DDSDDDDPNRWDDYK
        K     DHSQFLSLI QL+EAA KAKHESNQDELTENTGDEEQPELSP+TNEEEV +P GVEETEK+EFVGGE+HSPLK      DD DDDDPNRWD+YK
Subjt:  KVKYFADHSQFLSLIGQLQEAANKAKHESNQDELTENTGDEEQPELSPETNEEEVAKPVGVEETEKEEFVGGEVHSPLKA-----DDSDDDDPNRWDDYK

Query:  FDY
        FDY
Subjt:  FDY

XP_022993130.1 uncharacterized protein LOC111489243 isoform X1 [Cucurbita maxima] | e value: 0.0e+00 | % identity: 78.7
Query:  VDPFDEFFSDPGVTSRAGGRFQPKIKPRPKKQTLAPK-STLSQDKKGTIPDTKSCHDGSGSSKSIKLSSQLPVMEEKGESEDDLLLATARSDLIGCSHPT
        +DPFDE  SDPG TSR GGRFQPKIKPRPKKQTLAP+ ST+SQDKKGT  D KSCHD  G++KSIK SSQL V+EE  ESEDDLLLAT RSD IGCSH T
Subjt:  VDPFDEFFSDPGVTSRAGGRFQPKIKPRPKKQTLAPK-STLSQDKKGTIPDTKSCHDGSGSSKSIKLSSQLPVMEEKGESEDDLLLATARSDLIGCSHPT

Query:  SVESAKMVNSTQFDLDSYGSTLCSGSTIDGGVTDAIDLTTFSSVPVG-ENLTDDTKTSGILNNSHPSVSSAHEATVLDQSGLGSIQSEGGHFNDGKIAGE
        SVESA MV+STQ DLDS G  L SGSTIDG              PVG ENLTDD+K SGILN SH S S AHEATVL QSGLGSIQ E GH NDGKIAG+
Subjt:  SVESAKMVNSTQFDLDSYGSTLCSGSTIDGGVTDAIDLTTFSSVPVG-ENLTDDTKTSGILNNSHPSVSSAHEATVLDQSGLGSIQSEGGHFNDGKIAGE

Query:  NIDLFYELECLDDFHNQPKNEADPSSLKQASISNEDGDLDKQRLETEVKVLFCYQEKLFLLKIFSSLVHIWSRFQEYGAGAIITMDTISSGTTTPSERPA
        N D+FY+LE LDDFHNQPKNEADPSSLKQA+ISNEDGDLDKQRLE E                            E GAGA IT DTISSGTTT  E+PA
Subjt:  NIDLFYELECLDDFHNQPKNEADPSSLKQASISNEDGDLDKQRLETEVKVLFCYQEKLFLLKIFSSLVHIWSRFQEYGAGAIITMDTISSGTTTPSERPA

Query:  CKYIPKPKMRTAEDACTQISQPEISNMLPLSPQVNSCDTRCMHEASIGTHSDGILNDSSINFDGYTPDNQHTETPVNVESLAYDSYGDILVDDFNSDDQD
        CKYIPKPKMR A D+CTQISQPEISN LP SPQV SCDT+CMHEASIGTHSDG+LNDSSINFD Y+P NQH E PVNVESLAYDSYGDILVDDFNSDDQD
Subjt:  CKYIPKPKMRTAEDACTQISQPEISNMLPLSPQVNSCDTRCMHEASIGTHSDGILNDSSINFDGYTPDNQHTETPVNVESLAYDSYGDILVDDFNSDDQD

Query:  EMLREESGKNDEEEPSTESNISQQQKMFSPVGEEIEHSKTSRKLRKKVSHQLDEPEDGVDENRNSPNEPSSNCDVHGDSYNKNEIPKGVRGKKTSTKSSK
        EMLREE GKN EE+PS+ SN+SQQQ+MF PVGEEIEHSKTSRKLR++VSHQL +PEDGVD+    P+E  SNCD+HGD Y KNE  KG RG KT TKS K
Subjt:  EMLREESGKNDEEEPSTESNISQQQKMFSPVGEEIEHSKTSRKLRKKVSHQLDEPEDGVDENRNSPNEPSSNCDVHGDSYNKNEIPKGVRGKKTSTKSSK

Query:  PSSDNEKPTRKRKDANKAVPDLQAEKRPKKFSHSTRRNRRQVNKVLLETPEDEIDFQKISFRDLIIYHEHKEKLEKKVASTRKSATNQRTDTSGEEIYND
        PSSDNEKPTRKRK+ANKAVPDLQAEKRPKKFSHSTRRNRRQVNKVLLETPEDEIDFQKISFRDLIIYHEHKEKLEKK A+TR+SATNQRTDT GEEIYND
Subjt:  PSSDNEKPTRKRKDANKAVPDLQAEKRPKKFSHSTRRNRRQVNKVLLETPEDEIDFQKISFRDLIIYHEHKEKLEKKVASTRKSATNQRTDTSGEEIYND

Query:  GEESLASEQGRGTDDDETPDVVDMTSAYFNYQSFMDKTPRTKWSKQDTERFYEAVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAITNRS
        GEESLA EQGRGTDDDETPDVVDMTSAYFNY SFMDKT RTKWSK DTERFYEAVRQFGTDFCMIQQLFPGRTR QIKLKFK+EERHHPFRLSDAITNR+
Subjt:  GEESLASEQGRGTDDDETPDVVDMTSAYFNYQSFMDKTPRTKWSKQDTERFYEAVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAITNRS

Query:  KVKYFADHSQFLSLIGQLQEAANKAKHESNQDELTENTGDEEQPELSPETNEEEVAKPVGVEETEKEEFVGGEVHSPLKADDSDDDDPNRWDDYKFDY
        K     DHSQFLSLIGQLQEAANKAKHESN+DELTENTG+EE  ELSPE NEEEVAKP  VE+T+ EEFV GE+HSPLKAD+SDDDDP+RWD+YKFDY
Subjt:  KVKYFADHSQFLSLIGQLQEAANKAKHESNQDELTENTGDEEQPELSPETNEEEVAKPVGVEETEKEEFVGGEVHSPLKADDSDDDDPNRWDDYKFDY

XP_038875902.1 uncharacterized protein LOC120068262 isoform X1 [Benincasa hispida] | e value: 0.0e+00 | % identity: 85.46
Query:  VDPFDEFFSDPGVTSRAGGRFQPKIKPRPKKQTLAPKSTLSQDKKGTIPDTKSCHDGSGSSKSIKLSSQLPVMEEKGESEDDLLLATARSDLIGCSHPTS
        +DPFDE FSDPGVTSRAGGRFQPKIKPRPKKQTL PKSTLS DKKGTI DTKSC DGSG+SKSI  SSQLPV EEK ESEDDLLLATARSD IG SHPTS
Subjt:  VDPFDEFFSDPGVTSRAGGRFQPKIKPRPKKQTLAPKSTLSQDKKGTIPDTKSCHDGSGSSKSIKLSSQLPVMEEKGESEDDLLLATARSDLIGCSHPTS

Query:  VESAKMVNSTQFDLDSYGSTLCSGSTIDGGVTDAIDLTTFSSVPVG-ENLTDDTKTSGILNNSHPSVSSAHEATVLDQSGLGSIQSEGGHFNDGKIAGEN
        VESAKMV+S QFDLDSYG  L SGSTI+ GVTDAIDLTT S  PVG +NL DDTK   +LN SHPS SSAHEATVLDQ GLGSIQSE  HFNDGKIAG+N
Subjt:  VESAKMVNSTQFDLDSYGSTLCSGSTIDGGVTDAIDLTTFSSVPVG-ENLTDDTKTSGILNNSHPSVSSAHEATVLDQSGLGSIQSEGGHFNDGKIAGEN

Query:  IDLFYELECLDDFHNQPKNEADPSSLKQASISNEDGDLDKQRLETEVKVLFCYQEKLFLLKIFSSLVHIWSRFQEYGAGAIITMDTISSGTTTPSERPAC
        IDLFYELECLDDFHNQPKNEADPSSLK A+ISNEDGDLDKQRLE E+                         FQE GAGA ITMDTISS TTTPSE+PAC
Subjt:  IDLFYELECLDDFHNQPKNEADPSSLKQASISNEDGDLDKQRLETEVKVLFCYQEKLFLLKIFSSLVHIWSRFQEYGAGAIITMDTISSGTTTPSERPAC

Query:  KYIPKPKMRTAEDACTQISQPEISNMLPLSPQVNSCDTRCMHEASIGTHSDGILNDSSINFDGYTPDNQHTETPVNVESLAYDSYGDILVDDFNSDDQDE
        KYIPKPK+RTA DACTQISQPEISNMLPLSPQV SCDT  MHEASIGTH DG LNDSSI+FDGY P NQHTETPVNVESLAYDSYGDIL+DDFNSDD+DE
Subjt:  KYIPKPKMRTAEDACTQISQPEISNMLPLSPQVNSCDTRCMHEASIGTHSDGILNDSSINFDGYTPDNQHTETPVNVESLAYDSYGDILVDDFNSDDQDE

Query:  MLREESGKNDEEEPSTESNISQQQKMFSPVGEEIEHSKTSRKLRKKVSHQLDEPEDGVDENRNSPNEPSSNCDVHGDSYNKNEIPKGVRGKKTSTKSSKP
        MLREE GKN EEEPSTESNISQQQKMF PVGEEIEHSKTSRKLRKKVSHQLDEPEDGVDENRN PNEPSSN D+HGD YNKNE PKG RGKKTSTKSSKP
Subjt:  MLREESGKNDEEEPSTESNISQQQKMFSPVGEEIEHSKTSRKLRKKVSHQLDEPEDGVDENRNSPNEPSSNCDVHGDSYNKNEIPKGVRGKKTSTKSSKP

Query:  SSDNEKPTRKRKDANKAVPDLQAEKRPKKFSHSTRRNRRQVNKVLLETPEDEIDFQKISFRDLIIYHEHKEKLEKKVASTRKSATNQRTDTSGEEIYNDG
        S+DNEKPTRKRK+ANKAVPDLQAEKRPKKFSHSTRRNRRQVNKVLLETPEDEIDFQKISFRDLIIYHEHKEKLEKKVASTRKSATNQRTDT GEE+YNDG
Subjt:  SSDNEKPTRKRKDANKAVPDLQAEKRPKKFSHSTRRNRRQVNKVLLETPEDEIDFQKISFRDLIIYHEHKEKLEKKVASTRKSATNQRTDTSGEEIYNDG

Query:  EESLASEQGRGTDDDETPDVVDMTSAYFNYQSFMDKTPRTKWSKQDTERFYEAVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAITNRSK
        EE+LASEQGRGTDDDETPDVVDMTSAYFNYQSFMDKTPRTKWSKQDTERFYEAVRQFGTDFCMIQQLFPGRTR QIKLKFKSEERHHPFRLSDAI NR+K
Subjt:  EESLASEQGRGTDDDETPDVVDMTSAYFNYQSFMDKTPRTKWSKQDTERFYEAVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAITNRSK

Query:  VKYFADHSQFLSLIGQLQEAANKAKHESNQDELTENTGDEEQPELSPETNEEEVAKPVGVEETEKEEFVGGEVHSPLKADDS-DDDDPNRWDDYKFDY
             DHSQFL LI QL+EAANKAKHESNQDELTEN+GDEEQ ELSPETNEEEVAKP G+EET KEE VGGE+HSPLKAD+S DDDDPNRWD+YKFDY
Subjt:  VKYFADHSQFLSLIGQLQEAANKAKHESNQDELTENTGDEEQPELSPETNEEEVAKPVGVEETEKEEFVGGEVHSPLKADDS-DDDDPNRWDDYKFDY

XP_038875904.1 uncharacterized protein LOC120068262 isoform X2 [Benincasa hispida] | e value: 0.0e+00 | % identity: 85.34
Query:  VDPFDEFFSDPGVTSRAGGRFQPKIKPRPKKQTLAPKSTLSQDKKGTIPDTKSCHDGSGSSKSIKLSSQLPVMEEKGESEDDLLLATARSDLIGCSHPTS
        +DPFDE FSDPGVTSRAGGRFQPKIKPRPKKQTL PKSTLS DKKGTI DTKSC DGSG+SKSI  SSQLPV EEK ESEDDLLLATARSD IG SHPTS
Subjt:  VDPFDEFFSDPGVTSRAGGRFQPKIKPRPKKQTLAPKSTLSQDKKGTIPDTKSCHDGSGSSKSIKLSSQLPVMEEKGESEDDLLLATARSDLIGCSHPTS

Query:  VESAKMVNSTQFDLDSYGSTLCSGSTIDGGVTDAIDLTTFSSVPVG-ENLTDDTKTSGILNNSHPSVSSAHEATVLDQSGLGSIQSEGGHFNDGKIAGEN
        VESAKMV+S QFDLDSYG  L SGSTI+ GVTDAIDLTT S  PVG +NL DDTK   +LN SHPS SSAHEATVLDQ GLGSIQSE  HFNDGKIAG+N
Subjt:  VESAKMVNSTQFDLDSYGSTLCSGSTIDGGVTDAIDLTTFSSVPVG-ENLTDDTKTSGILNNSHPSVSSAHEATVLDQSGLGSIQSEGGHFNDGKIAGEN

Query:  IDLFYELECLDDFHNQPKNEADPSSLKQASISNEDGDLDKQRLETEVKVLFCYQEKLFLLKIFSSLVHIWSRFQEYGAGAIITMDTISSGTTTPSERPAC
        IDLFYELECLDDFHNQPKNEADPSSLK A+ISNEDGDLDKQRLE E+                         FQE GAGA ITMDTISS TTTPSE+PAC
Subjt:  IDLFYELECLDDFHNQPKNEADPSSLKQASISNEDGDLDKQRLETEVKVLFCYQEKLFLLKIFSSLVHIWSRFQEYGAGAIITMDTISSGTTTPSERPAC

Query:  KYIPKPKMRTAEDACTQISQPEISNMLPLSPQVNSCDTRCMHEASIGTHSDGILNDSSINFDGYTPDNQHTETPVNVESLAYDSYGDILVDDFNSDDQDE
        KYIPKPK+RTA DACTQISQPEISNMLPLSPQV SCDT  MHEASIGTH DG LNDSSI+FDGY P NQHTETPVNVESLAYDSYGDIL+DDFNSDD+DE
Subjt:  KYIPKPKMRTAEDACTQISQPEISNMLPLSPQVNSCDTRCMHEASIGTHSDGILNDSSINFDGYTPDNQHTETPVNVESLAYDSYGDILVDDFNSDDQDE

Query:  MLREESGKNDEEEPSTESNISQQQKMFSPVGEEIEHSKTSRKLRKKVSHQLDEPEDGVDENRNSPNEPSSNCDVHGDSYNKNEIPKGVRGKKTSTKSSKP
        MLREE GKN EEEPSTESNISQQQKMF PVGEEIEHSKTSRKLRKKVSHQLDEPEDGVDENRN PNEPSSN D+HGD YNKNE PKG RGKKTSTKSSKP
Subjt:  MLREESGKNDEEEPSTESNISQQQKMFSPVGEEIEHSKTSRKLRKKVSHQLDEPEDGVDENRNSPNEPSSNCDVHGDSYNKNEIPKGVRGKKTSTKSSKP

Query:  SSDNEKPTRKRKDANKAVPDLQAEKRPKKFSHSTRRNRRQVNKVLLETPEDEIDFQKISFRDLIIYHEHKEKLEKKVASTRKSATNQRTDTSGEEIYNDG
        S+DNEKPTRKRK+ANKAVPDLQAEKRPKKFSHSTRRNRRQVNKVLLETPEDEIDFQKISFRDLIIYHEHKEKLE KVASTRKSATNQRTDT GEE+YNDG
Subjt:  SSDNEKPTRKRKDANKAVPDLQAEKRPKKFSHSTRRNRRQVNKVLLETPEDEIDFQKISFRDLIIYHEHKEKLEKKVASTRKSATNQRTDTSGEEIYNDG

Query:  EESLASEQGRGTDDDETPDVVDMTSAYFNYQSFMDKTPRTKWSKQDTERFYEAVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAITNRSK
        EE+LASEQGRGTDDDETPDVVDMTSAYFNYQSFMDKTPRTKWSKQDTERFYEAVRQFGTDFCMIQQLFPGRTR QIKLKFKSEERHHPFRLSDAI NR+K
Subjt:  EESLASEQGRGTDDDETPDVVDMTSAYFNYQSFMDKTPRTKWSKQDTERFYEAVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAITNRSK

Query:  VKYFADHSQFLSLIGQLQEAANKAKHESNQDELTENTGDEEQPELSPETNEEEVAKPVGVEETEKEEFVGGEVHSPLKADDS-DDDDPNRWDDYKFDY
             DHSQFL LI QL+EAANKAKHESNQDELTEN+GDEEQ ELSPETNEEEVAKP G+EET KEE VGGE+HSPLKAD+S DDDDPNRWD+YKFDY
Subjt:  VKYFADHSQFLSLIGQLQEAANKAKHESNQDELTENTGDEEQPELSPETNEEEVAKPVGVEETEKEEFVGGEVHSPLKADDS-DDDDPNRWDDYKFDY

XP_038875905.1 uncharacterized protein LOC120068262 isoform X3 [Benincasa hispida] | e value: 0.0e+00 | % identity: 85.21
Query:  VDPFDEFFSDPGVTSRAGGRFQPKIKPRPKKQTLAPKSTLSQDKKGTIPDTKSCHDGSGSSKSIKLSSQLPVMEEKGESEDDLLLATARSDLIGCSHPTS
        +DPFDE FSDPGVTSRAGGRFQPKIKPRPKKQTL PKSTLS DKKGTI DTKSC DGSG+SKSI  SSQLPV EEK ESEDDLLLATARSD IG SHPTS
Subjt:  VDPFDEFFSDPGVTSRAGGRFQPKIKPRPKKQTLAPKSTLSQDKKGTIPDTKSCHDGSGSSKSIKLSSQLPVMEEKGESEDDLLLATARSDLIGCSHPTS

Query:  VESAKMVNSTQFDLDSYGSTLCSGSTIDGGVTDAIDLTTFSSVPVG-ENLTDDTKTSGILNNSHPSVSSAHEATVLDQSGLGSIQSEGGHFNDGKIAGEN
        VESAKMV+S QFDLDSYG  L SGSTI+ GVTDAIDLTT S  PVG +NL DDTK   +LN SHPS SSAHEATVLDQ GLGSIQSE  HFNDGKIAG+N
Subjt:  VESAKMVNSTQFDLDSYGSTLCSGSTIDGGVTDAIDLTTFSSVPVG-ENLTDDTKTSGILNNSHPSVSSAHEATVLDQSGLGSIQSEGGHFNDGKIAGEN

Query:  IDLFYELECLDDFHNQPKNEADPSSLKQASISNEDGDLDKQRLETEVKVLFCYQEKLFLLKIFSSLVHIWSRFQEYGAGAIITMDTISSGTTTPSERPAC
        IDLFYELECLDDFHNQPKNEADPSSLK A+ISNEDGDLDKQRLE E                            E GAGA ITMDTISS TTTPSE+PAC
Subjt:  IDLFYELECLDDFHNQPKNEADPSSLKQASISNEDGDLDKQRLETEVKVLFCYQEKLFLLKIFSSLVHIWSRFQEYGAGAIITMDTISSGTTTPSERPAC

Query:  KYIPKPKMRTAEDACTQISQPEISNMLPLSPQVNSCDTRCMHEASIGTHSDGILNDSSINFDGYTPDNQHTETPVNVESLAYDSYGDILVDDFNSDDQDE
        KYIPKPK+RTA DACTQISQPEISNMLPLSPQV SCDT  MHEASIGTH DG LNDSSI+FDGY P NQHTETPVNVESLAYDSYGDIL+DDFNSDD+DE
Subjt:  KYIPKPKMRTAEDACTQISQPEISNMLPLSPQVNSCDTRCMHEASIGTHSDGILNDSSINFDGYTPDNQHTETPVNVESLAYDSYGDILVDDFNSDDQDE

Query:  MLREESGKNDEEEPSTESNISQQQKMFSPVGEEIEHSKTSRKLRKKVSHQLDEPEDGVDENRNSPNEPSSNCDVHGDSYNKNEIPKGVRGKKTSTKSSKP
        MLREE GKN EEEPSTESNISQQQKMF PVGEEIEHSKTSRKLRKKVSHQLDEPEDGVDENRN PNEPSSN D+HGD YNKNE PKG RGKKTSTKSSKP
Subjt:  MLREESGKNDEEEPSTESNISQQQKMFSPVGEEIEHSKTSRKLRKKVSHQLDEPEDGVDENRNSPNEPSSNCDVHGDSYNKNEIPKGVRGKKTSTKSSKP

Query:  SSDNEKPTRKRKDANKAVPDLQAEKRPKKFSHSTRRNRRQVNKVLLETPEDEIDFQKISFRDLIIYHEHKEKLEKKVASTRKSATNQRTDTSGEEIYNDG
        S+DNEKPTRKRK+ANKAVPDLQAEKRPKKFSHSTRRNRRQVNKVLLETPEDEIDFQKISFRDLIIYHEHKEKLEKKVASTRKSATNQRTDT GEE+YNDG
Subjt:  SSDNEKPTRKRKDANKAVPDLQAEKRPKKFSHSTRRNRRQVNKVLLETPEDEIDFQKISFRDLIIYHEHKEKLEKKVASTRKSATNQRTDTSGEEIYNDG

Query:  EESLASEQGRGTDDDETPDVVDMTSAYFNYQSFMDKTPRTKWSKQDTERFYEAVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAITNRSK
        EE+LASEQGRGTDDDETPDVVDMTSAYFNYQSFMDKTPRTKWSKQDTERFYEAVRQFGTDFCMIQQLFPGRTR QIKLKFKSEERHHPFRLSDAI NR+K
Subjt:  EESLASEQGRGTDDDETPDVVDMTSAYFNYQSFMDKTPRTKWSKQDTERFYEAVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAITNRSK

Query:  VKYFADHSQFLSLIGQLQEAANKAKHESNQDELTENTGDEEQPELSPETNEEEVAKPVGVEETEKEEFVGGEVHSPLKADDS-DDDDPNRWDDYKFDY
             DHSQFL LI QL+EAANKAKHESNQDELTEN+GDEEQ ELSPETNEEEVAKP G+EET KEE VGGE+HSPLKAD+S DDDDPNRWD+YKFDY
Subjt:  VKYFADHSQFLSLIGQLQEAANKAKHESNQDELTENTGDEEQPELSPETNEEEVAKPVGVEETEKEEFVGGEVHSPLKADDS-DDDDPNRWDDYKFDY
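
The alignments above follow a BLAST-style layout in which the unlabeled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. As a minimal sketch, assuming that convention applies here, the counts behind a percent-identity figure can be tallied from the middle line alone. The function name `midline_stats` is hypothetical, and the sample is only the first 20 columns of the XP_011655158.1 alignment, so it is not expected to reproduce the whole-alignment value reported for that hit.

```python
def midline_stats(midline):
    """Count identical and positive columns in a BLAST-style midline.

    Assumes letters mark identities, '+' marks conservative substitutions,
    and spaces mark mismatches or gaps.
    """
    length = len(midline)
    identical = sum(ch.isalpha() for ch in midline)
    positive = identical + midline.count("+")
    return identical, positive, length


# First 20 alignment columns of the XP_011655158.1 hit (query shown for context).
query = "VDPFDEFFSDPGVTSRAGGR"
midline = "+DPFD+ FS+  VT+RAG R"

identical, positive, length = midline_stats(midline)
print(f"identities {identical}/{length} = {100 * identical / length:.1f}%")
print(f"positives  {positive}/{length} = {100 * positive / length:.1f}%")
# prints:
# identities 12/20 = 60.0%
# positives  16/20 = 80.0%
```

Trailing spaces in a midline are significant, so the string should be kept at the same width as the query line it annotates rather than stripped.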

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KPZ2 SANT domain-containing protein | e value: 0.0e+00 | % identity: 78.95
Query:  VDPFDEFFSDPGVTSRAGGRFQPKIKPRPKKQTLAPK-STLSQDKKGTIPDTKSCHDGSGSSKSIKLSSQLPVMEEKGESEDDLLLATARSDLIGCSHPT
        +DPFD+ FS+  VT+RAG RFQPK KPRPKKQTLAP+ S  SQD KGTI D KSC D  G++KSIK SSQLPV EEK ESED LL  TARSD IGCS PT
Subjt:  VDPFDEFFSDPGVTSRAGGRFQPKIKPRPKKQTLAPK-STLSQDKKGTIPDTKSCHDGSGSSKSIKLSSQLPVMEEKGESEDDLLLATARSDLIGCSHPT

Query:  SVESAKMVNSTQFDLDSYGSTLCSGSTIDGGVTDAIDLTTFSSVPVG-ENLTDDTKTSGILNNSHPSVSSAHEATVLDQSGLGSIQSEGGHFNDGKIAGE
        SVES K+V+STQFDLD  GS L SGSTI+ GVTDAID TT  S PVG + LTDD K S +L  SHPS SSAHEA  +DQ G+GSIQSE  H  DGKIAG+
Subjt:  SVESAKMVNSTQFDLDSYGSTLCSGSTIDGGVTDAIDLTTFSSVPVG-ENLTDDTKTSGILNNSHPSVSSAHEATVLDQSGLGSIQSEGGHFNDGKIAGE

Query:  NIDLFYELECLDDFHNQPKNEADPSSLKQASISNEDGDLDKQRLETEVKVLFCYQEKLFLLKIFSSLVHIWSRFQEYGAGAIITMDTISSGTTTPSERPA
        NIDLFYELECLDDFHNQP+NE DPSSLKQA+ISNE GDLDKQRLE E                            E GA A +TMDT+SS TTTPSER A
Subjt:  NIDLFYELECLDDFHNQPKNEADPSSLKQASISNEDGDLDKQRLETEVKVLFCYQEKLFLLKIFSSLVHIWSRFQEYGAGAIITMDTISSGTTTPSERPA

Query:  CKYIPKPKMRTAEDACTQISQPEISNMLPLSPQVNSCDTRCMHEASIGTHSDGILNDSSINFDGYTPDNQHTETPVNVESLAYDSYGDILVDDFNSDDQD
        CKYIPKPKMRTA DACTQISQPEISNMLP SPQV SCDT  M+EASIGTHSDG+LNDSSINFDGY P NQ TETPVNVES  +DSYGDILVDDFNSDDQD
Subjt:  CKYIPKPKMRTAEDACTQISQPEISNMLPLSPQVNSCDTRCMHEASIGTHSDGILNDSSINFDGYTPDNQHTETPVNVESLAYDSYGDILVDDFNSDDQD

Query:  EMLREESGKNDEEEPSTESNISQQQKMFSPVGEEIEHSKTSRKLRKKVSHQLDEPEDGVDENRNSPNEPSSNCDVHGDSYNKNEIPKGVRGKKTSTKSSK
         MLREE+GKNDEEEPS +SN+SQQQK+   VGEEIEHSKTSRKLRKKVSHQLDEPEDGVD NR  PNEPSSN  +HG+ YNKNE PKG +G+KTSTKSSK
Subjt:  EMLREESGKNDEEEPSTESNISQQQKMFSPVGEEIEHSKTSRKLRKKVSHQLDEPEDGVDENRNSPNEPSSNCDVHGDSYNKNEIPKGVRGKKTSTKSSK

Query:  PSSDNEKPTRKRKDANKAVPDLQAEKRPKKFSHSTRRNRRQVNKVLLETPEDEIDFQKISFRDLIIYHEHKEKLEKKVASTRKSATNQRTDTSGEEIYND
        PSS+NEKPTRKRK+ANKAVPDLQAEKRPKKFSHSTRRNRRQVNKVLLETPED+IDFQKISFRDLIIYHEHKEKLEKKVASTRKS TNQRTDTS EEIYND
Subjt:  PSSDNEKPTRKRKDANKAVPDLQAEKRPKKFSHSTRRNRRQVNKVLLETPEDEIDFQKISFRDLIIYHEHKEKLEKKVASTRKSATNQRTDTSGEEIYND

Query:  GEESLASEQGRGTDDDETPDVVDMTSAYFNYQSFMDKTPRTKWSKQDTERFYEAVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAITNRS
        GEE+LASEQG+GTDDDE PDVVDMTSAYFNYQSFMDKTPRTKWSK DTERFYEAVRQFGTDFCMIQQLFPG+TRRQIKLKFKSEERHHPFRLSDAITNR+
Subjt:  GEESLASEQGRGTDDDETPDVVDMTSAYFNYQSFMDKTPRTKWSKQDTERFYEAVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAITNRS

Query:  KVKYFADHSQFLSLIGQLQEAANKAKHESNQDELTENTGDEEQPELSPETNEEEVAKPVGVEETEKEEFVGGEVHSPLKA-----DDSDDDDPNRWDDYK
        K     DHSQFLSLI QL+EAA KAKHESNQDELTENTGDEEQPELSP+TNEEEV +P GVEETEK+EFVGGE+HSPLK      DD DDDDPNRWD+YK
Subjt:  KVKYFADHSQFLSLIGQLQEAANKAKHESNQDELTENTGDEEQPELSPETNEEEVAKPVGVEETEKEEFVGGEVHSPLKA-----DDSDDDDPNRWDDYK

Query:  FDY
        FDY
Subjt:  FDY

A0A6J1FD64 uncharacterized protein LOC111444681 isoform X2 | e value: 0.0e+00 | % identity: 77.94
Query:  VDPFDEFFSDPGVTSRAGGRFQPKIKPRPKKQTLAPK-STLSQDKKGTIPDTKSCHDGSGSSKSIKLSSQLPVMEEKGESEDDLLLATARSDLIGCSHPT
        +DPFDE  SDPG TSR GGRFQPKIKPRPKKQTLAP+ ST+SQDKKGT  D KSCHDG G++KSIK SSQLPV+EE  ESEDDLLLAT RSD IGCSH T
Subjt:  VDPFDEFFSDPGVTSRAGGRFQPKIKPRPKKQTLAPK-STLSQDKKGTIPDTKSCHDGSGSSKSIKLSSQLPVMEEKGESEDDLLLATARSDLIGCSHPT

Query:  SVESAKMVNSTQFDLDSYGSTLCSGSTIDGGVTDAIDLTTFSSVPVG-ENLTDDTKTSGILNNSHPSVSSAHEATVLDQSGLGSIQSEGGHFNDGKIAGE
        SVESA MV+STQ DLDS G  L SGSTIDG              PVG EN TDD+K SGILN SH S S AHEATVL QSGLGSIQ E GH NDGKIAG+
Subjt:  SVESAKMVNSTQFDLDSYGSTLCSGSTIDGGVTDAIDLTTFSSVPVG-ENLTDDTKTSGILNNSHPSVSSAHEATVLDQSGLGSIQSEGGHFNDGKIAGE

Query:  NIDLFYELECLDDFHNQPKNEADPSSLKQASISNEDGDLDKQRLETEVKVLFCYQEKLFLLKIFSSLVHIWSRFQEYGAGAIITMDTISSGTTTPSERPA
        N D+F ELE LDDFHNQPKNEADPSSLKQA+ISNEDGDLD QRLE E                            E GAGA IT D ISSGTTT  E+PA
Subjt:  NIDLFYELECLDDFHNQPKNEADPSSLKQASISNEDGDLDKQRLETEVKVLFCYQEKLFLLKIFSSLVHIWSRFQEYGAGAIITMDTISSGTTTPSERPA

Query:  CKYIPKPKMRTAEDACTQISQPEISNMLPLSPQVNSCDTRCMHEASIGTHSDGILNDSSINFDGYTPDNQHTETPVNVESLAYDSYGDILVDDFNSDDQD
        CKYIPKPKMR A D+CTQISQPEISN LP SPQV SCDTRCMHEASIGTHSDG+LNDSSINFD Y+P NQH E PVNVESLAYDSYGDILVDDFNSDDQD
Subjt:  CKYIPKPKMRTAEDACTQISQPEISNMLPLSPQVNSCDTRCMHEASIGTHSDGILNDSSINFDGYTPDNQHTETPVNVESLAYDSYGDILVDDFNSDDQD

Query:  EMLREESGKNDEEEPSTESNISQQQKMFSPVGEEIEHSKTSRKLRKKVSHQLDEPEDGVDENRNSPNEPSSNCDVHGDSYNKNEIPKGVRGKKTSTKSSK
        EMLREE GKN EE+P ++SN+SQQQ+MF PVGEEI+HSKTSRKLR++VSHQLD+PEDGVD+    P+E  SN D+HGD Y KN    G RG KT TKS K
Subjt:  EMLREESGKNDEEEPSTESNISQQQKMFSPVGEEIEHSKTSRKLRKKVSHQLDEPEDGVDENRNSPNEPSSNCDVHGDSYNKNEIPKGVRGKKTSTKSSK

Query:  PSSDNEKPTRKRKDANKAVPDLQAEKRPKKFSHSTRRNRRQVNKVLLETPEDEIDFQKISFRDLIIYHEHKEKLEKKVASTRKSATNQRTDTSGEEIYND
        PSSDNEKP RKRK+ANKAVPDLQAEKRPKKFSHSTRRNRRQVNKVLLETPEDEIDFQKISFRDLIIYHEHKEKLEKK A+TR+ ATNQRTDT GEEIYND
Subjt:  PSSDNEKPTRKRKDANKAVPDLQAEKRPKKFSHSTRRNRRQVNKVLLETPEDEIDFQKISFRDLIIYHEHKEKLEKKVASTRKSATNQRTDTSGEEIYND

Query:  GEESLASEQGRGTDDDETPDVVDMTSAYFNYQSFMDKTPRTKWSKQDTERFYEAVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAITNRS
        GEESLA EQGRGTDDDETPDVVDMTSAYFNY SFMDKT RTKWSK DTERFYEAVRQFGTDFCMIQQLFPGRTR QIKLKFK+EERHHPFRLSDAITNR+
Subjt:  GEESLASEQGRGTDDDETPDVVDMTSAYFNYQSFMDKTPRTKWSKQDTERFYEAVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAITNRS

Query:  KVKYFADHSQFLSLIGQLQEAANKAKHESNQDELTENTGDEEQPELSPETNEEEVAKPVGVEETEKEEFVGGEVHSPLKADDSDDDDPNRWDDYKFDY
        K     DHSQFLSLIGQLQEAANKAKHESN+DELTEN+GDEE  EL+PETNEEEVAKP  VE+T+ EEFV GE+HSPLKAD SDDDDP+RWD+YKFDY
Subjt:  KVKYFADHSQFLSLIGQLQEAANKAKHESNQDELTENTGDEEQPELSPETNEEEVAKPVGVEETEKEEFVGGEVHSPLKADDSDDDDPNRWDDYKFDY

A0A6J1FJU1 uncharacterized protein LOC111444681 isoform X1 | e value: 0.0e+00 | % identity: 78.2
Query:  VDPFDEFFSDPGVTSRAGGRFQPKIKPRPKKQTLAPK-STLSQDKKGTIPDTKSCHDGSGSSKSIKLSSQLPVMEEKGESEDDLLLATARSDLIGCSHPT
        +DPFDE  SDPG TSR GGRFQPKIKPRPKKQTLAP+ ST+SQDKKGT  D KSCHDG G++KSIK SSQLPV+EE  ESEDDLLLAT RSD IGCSH T
Subjt:  VDPFDEFFSDPGVTSRAGGRFQPKIKPRPKKQTLAPK-STLSQDKKGTIPDTKSCHDGSGSSKSIKLSSQLPVMEEKGESEDDLLLATARSDLIGCSHPT

Query:  SVESAKMVNSTQFDLDSYGSTLCSGSTIDGGVTDAIDLTTFSSVPVG-ENLTDDTKTSGILNNSHPSVSSAHEATVLDQSGLGSIQSEGGHFNDGKIAGE
        SVESA MV+STQ DLDS G  L SGSTIDG              PVG EN TDD+K SGILN SH S S AHEATVL QSGLGSIQ E GH NDGKIAG+
Subjt:  SVESAKMVNSTQFDLDSYGSTLCSGSTIDGGVTDAIDLTTFSSVPVG-ENLTDDTKTSGILNNSHPSVSSAHEATVLDQSGLGSIQSEGGHFNDGKIAGE

Query:  NIDLFYELECLDDFHNQPKNEADPSSLKQASISNEDGDLDKQRLETEVKVLFCYQEKLFLLKIFSSLVHIWSRFQEYGAGAIITMDTISSGTTTPSERPA
        N D+F ELE LDDFHNQPKNEADPSSLKQA+ISNEDGDLD QRLE E                            E GAGA IT D ISSGTTTPSE+PA
Subjt:  NIDLFYELECLDDFHNQPKNEADPSSLKQASISNEDGDLDKQRLETEVKVLFCYQEKLFLLKIFSSLVHIWSRFQEYGAGAIITMDTISSGTTTPSERPA

Query:  CKYIPKPKMRTAEDACTQISQPEISNMLPLSPQVNSCDTRCMHEASIGTHSDGILNDSSINFDGYTPDNQHTETPVNVESLAYDSYGDILVDDFNSDDQD
        CKYIPKPKMR A D+CTQISQPEISN LP SPQV SCDTRCMHEASIGTHSDG+LNDSSINFD Y+P NQH E PVNVESLAYDSYGDILVDDFNSDDQD
Subjt:  CKYIPKPKMRTAEDACTQISQPEISNMLPLSPQVNSCDTRCMHEASIGTHSDGILNDSSINFDGYTPDNQHTETPVNVESLAYDSYGDILVDDFNSDDQD

Query:  EMLREESGKNDEEEPSTESNISQQQKMFSPVGEEIEHSKTSRKLRKKVSHQLDEPEDGVDENRNSPNEPSSNCDVHGDSYNKNEIPKGVRGKKTSTKSSK
        EMLREE GKN EE+P ++SN+SQQQ+MF PVGEEI+HSKTSRKLR++VSHQLD+PEDGVD+    P+E  SN D+HGD Y KN    G RG KT TKS K
Subjt:  EMLREESGKNDEEEPSTESNISQQQKMFSPVGEEIEHSKTSRKLRKKVSHQLDEPEDGVDENRNSPNEPSSNCDVHGDSYNKNEIPKGVRGKKTSTKSSK

Query:  PSSDNEKPTRKRKDANKAVPDLQAEKRPKKFSHSTRRNRRQVNKVLLETPEDEIDFQKISFRDLIIYHEHKEKLEKKVASTRKSATNQRTDTSGEEIYND
        PSSDNEKP RKRK+ANKAVPDLQAEKRPKKFSHSTRRNRRQVNKVLLETPEDEIDFQKISFRDLIIYHEHKEKLEKK A+TR+ ATNQRTDT GEEIYND
Subjt:  PSSDNEKPTRKRKDANKAVPDLQAEKRPKKFSHSTRRNRRQVNKVLLETPEDEIDFQKISFRDLIIYHEHKEKLEKKVASTRKSATNQRTDTSGEEIYND

Query:  GEESLASEQGRGTDDDETPDVVDMTSAYFNYQSFMDKTPRTKWSKQDTERFYEAVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAITNRS
        GEESLA EQGRGTDDDETPDVVDMTSAYFNY SFMDKT RTKWSK DTERFYEAVRQFGTDFCMIQQLFPGRTR QIKLKFK+EERHHPFRLSDAITNR+
Subjt:  GEESLASEQGRGTDDDETPDVVDMTSAYFNYQSFMDKTPRTKWSKQDTERFYEAVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAITNRS

Query:  KVKYFADHSQFLSLIGQLQEAANKAKHESNQDELTENTGDEEQPELSPETNEEEVAKPVGVEETEKEEFVGGEVHSPLKADDSDDDDPNRWDDYKFDY
        K     DHSQFLSLIGQLQEAANKAKHESN+DELTEN+GDEE  EL+PETNEEEVAKP  VE+T+ EEFV GE+HSPLKAD SDDDDP+RWD+YKFDY
Subjt:  KVKYFADHSQFLSLIGQLQEAANKAKHESNQDELTENTGDEEQPELSPETNEEEVAKPVGVEETEKEEFVGGEVHSPLKADDSDDDDPNRWDDYKFDY

A0A6J1JRX5 uncharacterized protein LOC111489243 isoform X2 | e value: 0.0e+00 | % identity: 76.82
Query:  VDPFDEFFSDPGVTSRAGGRFQPKIKPRPKKQTLAPK-STLSQDKKGTIPDTKSCHDGSGSSKSIKLSSQLPVMEEKGESEDDLLLATARSDLIGCSHPT
        +DPFDE  SDPG TSR GGRFQPKIKPRPKKQTLAP+ ST+SQDKKGT  D KSCHD  G++KSIK SSQL V+EE  ESEDDLLLAT            
Subjt:  VDPFDEFFSDPGVTSRAGGRFQPKIKPRPKKQTLAPK-STLSQDKKGTIPDTKSCHDGSGSSKSIKLSSQLPVMEEKGESEDDLLLATARSDLIGCSHPT

Query:  SVESAKMVNSTQFDLDSYGSTLCSGSTIDGGVTDAIDLTTFSSVPVG-ENLTDDTKTSGILNNSHPSVSSAHEATVLDQSGLGSIQSEGGHFNDGKIAGE
               V+STQ DLDS G  L SGSTIDG              PVG ENLTDD+K SGILN SH S S AHEATVL QSGLGSIQ E GH NDGKIAG+
Subjt:  SVESAKMVNSTQFDLDSYGSTLCSGSTIDGGVTDAIDLTTFSSVPVG-ENLTDDTKTSGILNNSHPSVSSAHEATVLDQSGLGSIQSEGGHFNDGKIAGE

Query:  NIDLFYELECLDDFHNQPKNEADPSSLKQASISNEDGDLDKQRLETEVKVLFCYQEKLFLLKIFSSLVHIWSRFQEYGAGAIITMDTISSGTTTPSERPA
        N D+FY+LE LDDFHNQPKNEADPSSLKQA+ISNEDGDLDKQRLE E                            E GAGA IT DTISSGTTT  E+PA
Subjt:  NIDLFYELECLDDFHNQPKNEADPSSLKQASISNEDGDLDKQRLETEVKVLFCYQEKLFLLKIFSSLVHIWSRFQEYGAGAIITMDTISSGTTTPSERPA

Query:  CKYIPKPKMRTAEDACTQISQPEISNMLPLSPQVNSCDTRCMHEASIGTHSDGILNDSSINFDGYTPDNQHTETPVNVESLAYDSYGDILVDDFNSDDQD
        CKYIPKPKMR A D+CTQISQPEISN LP SPQV SCDT+CMHEASIGTHSDG+LNDSSINFD Y+P NQH E PVNVESLAYDSYGDILVDDFNSDDQD
Subjt:  CKYIPKPKMRTAEDACTQISQPEISNMLPLSPQVNSCDTRCMHEASIGTHSDGILNDSSINFDGYTPDNQHTETPVNVESLAYDSYGDILVDDFNSDDQD

Query:  EMLREESGKNDEEEPSTESNISQQQKMFSPVGEEIEHSKTSRKLRKKVSHQLDEPEDGVDENRNSPNEPSSNCDVHGDSYNKNEIPKGVRGKKTSTKSSK
        EMLREE GKN EE+PS+ SN+SQQQ+MF PVGEEIEHSKTSRKLR++VSHQL +PEDGVD+    P+E  SNCD+HGD Y KNE  KG RG KT TKS K
Subjt:  EMLREESGKNDEEEPSTESNISQQQKMFSPVGEEIEHSKTSRKLRKKVSHQLDEPEDGVDENRNSPNEPSSNCDVHGDSYNKNEIPKGVRGKKTSTKSSK

Query:  PSSDNEKPTRKRKDANKAVPDLQAEKRPKKFSHSTRRNRRQVNKVLLETPEDEIDFQKISFRDLIIYHEHKEKLEKKVASTRKSATNQRTDTSGEEIYND
        PSSDNEKPTRKRK+ANKAVPDLQAEKRPKKFSHSTRRNRRQVNKVLLETPEDEIDFQKISFRDLIIYHEHKEKLEKK A+TR+SATNQRTDT GEEIYND
Subjt:  PSSDNEKPTRKRKDANKAVPDLQAEKRPKKFSHSTRRNRRQVNKVLLETPEDEIDFQKISFRDLIIYHEHKEKLEKKVASTRKSATNQRTDTSGEEIYND

Query:  GEESLASEQGRGTDDDETPDVVDMTSAYFNYQSFMDKTPRTKWSKQDTERFYEAVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAITNRS
        GEESLA EQGRGTDDDETPDVVDMTSAYFNY SFMDKT RTKWSK DTERFYEAVRQFGTDFCMIQQLFPGRTR QIKLKFK+EERHHPFRLSDAITNR+
Subjt:  GEESLASEQGRGTDDDETPDVVDMTSAYFNYQSFMDKTPRTKWSKQDTERFYEAVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAITNRS

Query:  KVKYFADHSQFLSLIGQLQEAANKAKHESNQDELTENTGDEEQPELSPETNEEEVAKPVGVEETEKEEFVGGEVHSPLKADDSDDDDPNRWDDYKFDY
        K     DHSQFLSLIGQLQEAANKAKHESN+DELTENTG+EE  ELSPE NEEEVAKP  VE+T+ EEFV GE+HSPLKAD+SDDDDP+RWD+YKFDY
Subjt:  KVKYFADHSQFLSLIGQLQEAANKAKHESNQDELTENTGDEEQPELSPETNEEEVAKPVGVEETEKEEFVGGEVHSPLKADDSDDDDPNRWDDYKFDY

A0A6J1JVG9 uncharacterized protein LOC111489243 isoform X1 | e value: 0.0e+00 | % identity: 78.7
Query:  VDPFDEFFSDPGVTSRAGGRFQPKIKPRPKKQTLAPK-STLSQDKKGTIPDTKSCHDGSGSSKSIKLSSQLPVMEEKGESEDDLLLATARSDLIGCSHPT
        +DPFDE  SDPG TSR GGRFQPKIKPRPKKQTLAP+ ST+SQDKKGT  D KSCHD  G++KSIK SSQL V+EE  ESEDDLLLAT RSD IGCSH T
Subjt:  VDPFDEFFSDPGVTSRAGGRFQPKIKPRPKKQTLAPK-STLSQDKKGTIPDTKSCHDGSGSSKSIKLSSQLPVMEEKGESEDDLLLATARSDLIGCSHPT

Query:  SVESAKMVNSTQFDLDSYGSTLCSGSTIDGGVTDAIDLTTFSSVPVG-ENLTDDTKTSGILNNSHPSVSSAHEATVLDQSGLGSIQSEGGHFNDGKIAGE
        SVESA MV+STQ DLDS G  L SGSTIDG              PVG ENLTDD+K SGILN SH S S AHEATVL QSGLGSIQ E GH NDGKIAG+
Subjt:  SVESAKMVNSTQFDLDSYGSTLCSGSTIDGGVTDAIDLTTFSSVPVG-ENLTDDTKTSGILNNSHPSVSSAHEATVLDQSGLGSIQSEGGHFNDGKIAGE

Query:  NIDLFYELECLDDFHNQPKNEADPSSLKQASISNEDGDLDKQRLETEVKVLFCYQEKLFLLKIFSSLVHIWSRFQEYGAGAIITMDTISSGTTTPSERPA
        N D+FY+LE LDDFHNQPKNEADPSSLKQA+ISNEDGDLDKQRLE E                            E GAGA IT DTISSGTTT  E+PA
Subjt:  NIDLFYELECLDDFHNQPKNEADPSSLKQASISNEDGDLDKQRLETEVKVLFCYQEKLFLLKIFSSLVHIWSRFQEYGAGAIITMDTISSGTTTPSERPA

Query:  CKYIPKPKMRTAEDACTQISQPEISNMLPLSPQVNSCDTRCMHEASIGTHSDGILNDSSINFDGYTPDNQHTETPVNVESLAYDSYGDILVDDFNSDDQD
        CKYIPKPKMR A D+CTQISQPEISN LP SPQV SCDT+CMHEASIGTHSDG+LNDSSINFD Y+P NQH E PVNVESLAYDSYGDILVDDFNSDDQD
Subjt:  CKYIPKPKMRTAEDACTQISQPEISNMLPLSPQVNSCDTRCMHEASIGTHSDGILNDSSINFDGYTPDNQHTETPVNVESLAYDSYGDILVDDFNSDDQD

Query:  EMLREESGKNDEEEPSTESNISQQQKMFSPVGEEIEHSKTSRKLRKKVSHQLDEPEDGVDENRNSPNEPSSNCDVHGDSYNKNEIPKGVRGKKTSTKSSK
        EMLREE GKN EE+PS+ SN+SQQQ+MF PVGEEIEHSKTSRKLR++VSHQL +PEDGVD+    P+E  SNCD+HGD Y KNE  KG RG KT TKS K
Subjt:  EMLREESGKNDEEEPSTESNISQQQKMFSPVGEEIEHSKTSRKLRKKVSHQLDEPEDGVDENRNSPNEPSSNCDVHGDSYNKNEIPKGVRGKKTSTKSSK

Query:  PSSDNEKPTRKRKDANKAVPDLQAEKRPKKFSHSTRRNRRQVNKVLLETPEDEIDFQKISFRDLIIYHEHKEKLEKKVASTRKSATNQRTDTSGEEIYND
        PSSDNEKPTRKRK+ANKAVPDLQAEKRPKKFSHSTRRNRRQVNKVLLETPEDEIDFQKISFRDLIIYHEHKEKLEKK A+TR+SATNQRTDT GEEIYND
Subjt:  PSSDNEKPTRKRKDANKAVPDLQAEKRPKKFSHSTRRNRRQVNKVLLETPEDEIDFQKISFRDLIIYHEHKEKLEKKVASTRKSATNQRTDTSGEEIYND

Query:  GEESLASEQGRGTDDDETPDVVDMTSAYFNYQSFMDKTPRTKWSKQDTERFYEAVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAITNRS
        GEESLA EQGRGTDDDETPDVVDMTSAYFNY SFMDKT RTKWSK DTERFYEAVRQFGTDFCMIQQLFPGRTR QIKLKFK+EERHHPFRLSDAITNR+
Subjt:  GEESLASEQGRGTDDDETPDVVDMTSAYFNYQSFMDKTPRTKWSKQDTERFYEAVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAITNRS

Query:  KVKYFADHSQFLSLIGQLQEAANKAKHESNQDELTENTGDEEQPELSPETNEEEVAKPVGVEETEKEEFVGGEVHSPLKADDSDDDDPNRWDDYKFDY
        K     DHSQFLSLIGQLQEAANKAKHESN+DELTENTG+EE  ELSPE NEEEVAKP  VE+T+ EEFV GE+HSPLKAD+SDDDDP+RWD+YKFDY
Subjt:  KVKYFADHSQFLSLIGQLQEAANKAKHESNQDELTENTGDEEQPELSPETNEEEVAKPVGVEETEKEEFVGGEVHSPLKADDSDDDDPNRWDDYKFDY

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT4G39160.1 Homeodomain-like superfamily protein | e value: 2.0e-35 | % identity: 35.52
Query:  RKKVSHQLDEPEDGVDENRNSPNEPSSNCDVHGDSYNKNEIPKGVRGKKTSTKSSKPSSDNEKPTRKRKDANKAVPDLQAEK-RPKKFSHSTRRNRRQVN
        R+ V     +   G  E       P  N  V G+  N      G   ++ S + SK        +RKRK  ++  P+  +EK   KKF HS+RR +R + 
Subjt:  RKKVSHQLDEPEDGVDENRNSPNEPSSNCDVHGDSYNKNEIPKGVRGKKTSTKSSKPSSDNEKPTRKRKDANKAVPDLQAEK-RPKKFSHSTRRNRRQVN

Query:  KVLLETPEDEIDFQKISFRD---LIIYHEHKEKLEKKVASTRKSATNQRTDTSGEEIYNDG--EESLASEQGRGTDDDETPDVVDMTSAYFNYQSFMDKT
        K LLETP+ EI  + +  RD   L+ Y E  +K E K A  + S  +   + SG + ++ G  EE    + G  + + +  +VV   S   NYQ++M+KT
Subjt:  KVLLETPEDEIDFQKISFRD---LIIYHEHKEKLEKKVASTRKSATNQRTDTSGEEIYNDG--EESLASEQGRGTDDDETPDVVDMTSAYFNYQSFMDKT

Query:  PRTKWSKQDTERFYEAVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAITNRSKVKYFADHSQFLSLIGQLQEAANKAKHESNQDEL----
         RT+WSK+DTE FYE +++FG++  MIQQLFP RTR Q+KLKFK EER +P +L+DA+++RSK       + F ++I +LQ+ A  AK    ++E     
Subjt:  PRTKWSKQDTERFYEAVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAITNRSKVKYFADHSQFLSLIGQLQEAANKAKHESNQDEL----

Query:  -TENTGDEEQPELSPETNEEEVAKPVGVEETEKEEFVGGEVHSPLKADDSD--DDDPNRWDDYKFD
         T +  + E+PE S ET         GV+E++     GG+V + +++D  D  DDD + W+ YK D
Subjt:  -TENTGDEEQPELSPETNEEEVAKPVGVEETEKEEFVGGEVHSPLKADDSD--DDDPNRWDDYKFD

AT4G39160.2 Homeodomain-like superfamily protein | e value: 2.0e-35 | % identity: 35.52
Query:  RKKVSHQLDEPEDGVDENRNSPNEPSSNCDVHGDSYNKNEIPKGVRGKKTSTKSSKPSSDNEKPTRKRKDANKAVPDLQAEK-RPKKFSHSTRRNRRQVN
        R+ V     +   G  E       P  N  V G+  N      G   ++ S + SK        +RKRK  ++  P+  +EK   KKF HS+RR +R + 
Subjt:  RKKVSHQLDEPEDGVDENRNSPNEPSSNCDVHGDSYNKNEIPKGVRGKKTSTKSSKPSSDNEKPTRKRKDANKAVPDLQAEK-RPKKFSHSTRRNRRQVN

Query:  KVLLETPEDEIDFQKISFRD---LIIYHEHKEKLEKKVASTRKSATNQRTDTSGEEIYNDG--EESLASEQGRGTDDDETPDVVDMTSAYFNYQSFMDKT
        K LLETP+ EI  + +  RD   L+ Y E  +K E K A  + S  +   + SG + ++ G  EE    + G  + + +  +VV   S   NYQ++M+KT
Subjt:  KVLLETPEDEIDFQKISFRD---LIIYHEHKEKLEKKVASTRKSATNQRTDTSGEEIYNDG--EESLASEQGRGTDDDETPDVVDMTSAYFNYQSFMDKT

Query:  PRTKWSKQDTERFYEAVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAITNRSKVKYFADHSQFLSLIGQLQEAANKAKHESNQDEL----
         RT+WSK+DTE FYE +++FG++  MIQQLFP RTR Q+KLKFK EER +P +L+DA+++RSK       + F ++I +LQ+ A  AK    ++E     
Subjt:  PRTKWSKQDTERFYEAVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAITNRSKVKYFADHSQFLSLIGQLQEAANKAKHESNQDEL----

Query:  -TENTGDEEQPELSPETNEEEVAKPVGVEETEKEEFVGGEVHSPLKADDSD--DDDPNRWDDYKFD
         T +  + E+PE S ET         GV+E++     GG+V + +++D  D  DDD + W+ YK D
Subjt:  -TENTGDEEQPELSPETNEEEVAKPVGVEETEKEEFVGGEVHSPLKADDSD--DDDPNRWDDYKFD


Sequences
CDS sequence
ATGGACGTGGACCCTTTTGATGAATTTTTTTCTGATCCTGGCGTCACATCTCGAGCTGGGGGTAGATTTCAACCAAAGATCAAACCACGTCCTAAAAAACAAACTTTGGC
ACCGAAGTCCACACTATCTCAAGATAAGAAGGGAACAATACCGGATACTAAATCTTGTCATGATGGTAGTGGAAGCTCAAAATCAATCAAATTGTCATCCCAACTTCCTG
TGATGGAGGAGAAAGGGGAATCTGAAGATGATTTGCTTTTGGCTACCGCAAGGTCCGATCTCATTGGTTGTTCACATCCCACCTCTGTGGAAAGTGCTAAAATGGTAAAC
TCTACGCAGTTTGATTTGGATTCTTATGGTAGTACTCTTTGTTCAGGTTCTACTATCGATGGTGGAGTGACTGATGCAATTGATCTCACTACTTTCTCTTCGGTTCCAGT
TGGAGAAAATCTGACTGATGATACCAAAACTTCAGGAATATTAAATAATTCCCATCCAAGTGTCTCTTCAGCTCATGAAGCTACGGTTCTGGATCAAAGTGGGCTGGGAT
CAATCCAATCTGAGGGCGGGCATTTCAATGATGGTAAAATAGCAGGAGAGAATATAGATTTATTTTATGAATTGGAATGTCTAGATGATTTTCATAACCAACCAAAGAAT
GAAGCAGATCCTTCAAGCCTTAAGCAAGCATCAATCTCCAATGAGGATGGAGATTTGGATAAACAAAGGTTGGAAACAGAGGTGAAAGTCTTGTTTTGTTATCAGGAAAA
GTTATTTCTTTTAAAAATATTTTCTTCACTTGTCCATATATGGTCAAGGTTTCAGGAATATGGGGCAGGGGCTATTATCACCATGGATACGATAAGTTCTGGGACCACGA
CTCCCTCTGAACGGCCTGCTTGCAAGTATATACCAAAGCCCAAAATGAGAACTGCAGAAGATGCTTGCACACAAATCTCTCAGCCAGAAATCTCTAATATGCTTCCACTG
TCTCCACAAGTTAATTCTTGTGATACTAGATGCATGCATGAAGCCTCAATTGGGACGCATTCAGATGGGATTCTTAATGATTCATCGATTAACTTTGATGGTTACACCCC
TGACAACCAGCACACTGAAACACCTGTTAATGTAGAATCATTAGCATATGACTCTTATGGTGACATACTGGTGGATGATTTTAATTCAGATGATCAGGATGAGATGCTAA
GAGAAGAGAGTGGTAAGAACGATGAAGAAGAACCTTCAACGGAATCAAATATTTCTCAGCAGCAAAAGATGTTCTCCCCAGTTGGTGAAGAAATTGAGCATAGCAAAACT
TCAAGAAAGTTGAGAAAGAAGGTTTCTCATCAACTTGATGAGCCAGAAGATGGTGTTGATGAGAATAGAAACTCCCCGAATGAACCTTCTAGTAATTGTGATGTGCATGG
AGATAGCTATAACAAAAATGAAATCCCAAAAGGAGTTCGAGGAAAGAAAACTTCAACAAAGTCTTCGAAACCTTCTAGTGATAATGAAAAACCAACTCGGAAGCGCAAGG
ATGCTAATAAAGCAGTTCCAGATTTGCAAGCTGAAAAGCGCCCTAAGAAGTTCTCCCATTCAACTCGTCGAAATAGAAGGCAAGTAAACAAGGTTTTGCTTGAAACTCCG
GAGGATGAAATTGACTTTCAAAAGATAAGTTTTCGGGATCTCATTATTTATCATGAGCACAAGGAGAAGTTAGAGAAGAAAGTGGCAAGCACAAGAAAATCAGCAACCAA
TCAAAGAACCGATACTTCCGGTGAGGAGATTTATAATGATGGAGAGGAAAGCCTTGCTTCTGAACAAGGTAGAGGTACTGATGATGATGAAACGCCTGATGTAGTTGACA
TGACTTCTGCTTACTTTAATTATCAATCATTCATGGACAAAACACCACGTACAAAGTGGTCAAAGCAGGACACAGAGCGTTTTTATGAGGCTGTACGACAATTCGGGACA
GATTTTTGTATGATACAACAATTGTTTCCTGGTCGAACACGCCGTCAAATTAAACTAAAATTCAAAAGTGAAGAACGTCATCATCCATTTCGTCTCTCTGATGCTATAAC
TAATCGTTCCAAAGTGAAGTATTTTGCAGACCATTCCCAGTTTCTATCGTTGATTGGGCAGCTGCAAGAAGCTGCTAATAAGGCAAAACATGAATCAAATCAAGATGAAT
TGACTGAAAATACTGGGGATGAGGAGCAGCCAGAGTTGTCTCCTGAAACTAATGAGGAAGAAGTGGCAAAACCGGTAGGCGTGGAGGAGACAGAAAAGGAAGAATTTGTT
GGTGGTGAAGTTCACAGTCCATTGAAGGCTGATGATAGTGATGATGATGATCCTAATAGATGGGATGATTATAAATTTGATTATTAA
mRNA sequence
TCCACCACTTTTCTTCCGCACCCTCCCTTCAATCTTCCCGCCGGCAACCTCCACGTGCAGCAGTCGGCGACCACAGAGCCCGCGACTGGTCTCTACGCACCCTACCCACT
CGTTGACGACGACCCACGCATTTGAGGTACGTTTCTCCTTATTTGGCGACTCTGCGAACCCACACCCACGCGAACACCTTCTCTTTAGTCGACGATGGTGTACATTCACC
CACGAATCTGACTTGTTCCGTGACCACGACTTCTACTAGGCTTTCGCAACATCATTCATCTCCGTTCGAGACTCCAGCGTGAGCGTCGACGATTGGGTGTGTATTGCAAC
AACGTTTTTGACTTGGGGACAAACTTGACGTTTCATATATGGACGTGGACCCTTTTGATGAATTTTTTTCTGATCCTGGCGTCACATCTCGAGCTGGGGGTAGATTTCAA
CCAAAGATCAAACCACGTCCTAAAAAACAAACTTTGGCACCGAAGTCCACACTATCTCAAGATAAGAAGGGAACAATACCGGATACTAAATCTTGTCATGATGGTAGTGG
AAGCTCAAAATCAATCAAATTGTCATCCCAACTTCCTGTGATGGAGGAGAAAGGGGAATCTGAAGATGATTTGCTTTTGGCTACCGCAAGGTCCGATCTCATTGGTTGTT
CACATCCCACCTCTGTGGAAAGTGCTAAAATGGTAAACTCTACGCAGTTTGATTTGGATTCTTATGGTAGTACTCTTTGTTCAGGTTCTACTATCGATGGTGGAGTGACT
GATGCAATTGATCTCACTACTTTCTCTTCGGTTCCAGTTGGAGAAAATCTGACTGATGATACCAAAACTTCAGGAATATTAAATAATTCCCATCCAAGTGTCTCTTCAGC
TCATGAAGCTACGGTTCTGGATCAAAGTGGGCTGGGATCAATCCAATCTGAGGGCGGGCATTTCAATGATGGTAAAATAGCAGGAGAGAATATAGATTTATTTTATGAAT
TGGAATGTCTAGATGATTTTCATAACCAACCAAAGAATGAAGCAGATCCTTCAAGCCTTAAGCAAGCATCAATCTCCAATGAGGATGGAGATTTGGATAAACAAAGGTTG
GAAACAGAGGTGAAAGTCTTGTTTTGTTATCAGGAAAAGTTATTTCTTTTAAAAATATTTTCTTCACTTGTCCATATATGGTCAAGGTTTCAGGAATATGGGGCAGGGGC
TATTATCACCATGGATACGATAAGTTCTGGGACCACGACTCCCTCTGAACGGCCTGCTTGCAAGTATATACCAAAGCCCAAAATGAGAACTGCAGAAGATGCTTGCACAC
AAATCTCTCAGCCAGAAATCTCTAATATGCTTCCACTGTCTCCACAAGTTAATTCTTGTGATACTAGATGCATGCATGAAGCCTCAATTGGGACGCATTCAGATGGGATT
CTTAATGATTCATCGATTAACTTTGATGGTTACACCCCTGACAACCAGCACACTGAAACACCTGTTAATGTAGAATCATTAGCATATGACTCTTATGGTGACATACTGGT
GGATGATTTTAATTCAGATGATCAGGATGAGATGCTAAGAGAAGAGAGTGGTAAGAACGATGAAGAAGAACCTTCAACGGAATCAAATATTTCTCAGCAGCAAAAGATGT
TCTCCCCAGTTGGTGAAGAAATTGAGCATAGCAAAACTTCAAGAAAGTTGAGAAAGAAGGTTTCTCATCAACTTGATGAGCCAGAAGATGGTGTTGATGAGAATAGAAAC
TCCCCGAATGAACCTTCTAGTAATTGTGATGTGCATGGAGATAGCTATAACAAAAATGAAATCCCAAAAGGAGTTCGAGGAAAGAAAACTTCAACAAAGTCTTCGAAACC
TTCTAGTGATAATGAAAAACCAACTCGGAAGCGCAAGGATGCTAATAAAGCAGTTCCAGATTTGCAAGCTGAAAAGCGCCCTAAGAAGTTCTCCCATTCAACTCGTCGAA
ATAGAAGGCAAGTAAACAAGGTTTTGCTTGAAACTCCGGAGGATGAAATTGACTTTCAAAAGATAAGTTTTCGGGATCTCATTATTTATCATGAGCACAAGGAGAAGTTA
GAGAAGAAAGTGGCAAGCACAAGAAAATCAGCAACCAATCAAAGAACCGATACTTCCGGTGAGGAGATTTATAATGATGGAGAGGAAAGCCTTGCTTCTGAACAAGGTAG
AGGTACTGATGATGATGAAACGCCTGATGTAGTTGACATGACTTCTGCTTACTTTAATTATCAATCATTCATGGACAAAACACCACGTACAAAGTGGTCAAAGCAGGACA
CAGAGCGTTTTTATGAGGCTGTACGACAATTCGGGACAGATTTTTGTATGATACAACAATTGTTTCCTGGTCGAACACGCCGTCAAATTAAACTAAAATTCAAAAGTGAA
GAACGTCATCATCCATTTCGTCTCTCTGATGCTATAACTAATCGTTCCAAAGTGAAGTATTTTGCAGACCATTCCCAGTTTCTATCGTTGATTGGGCAGCTGCAAGAAGC
TGCTAATAAGGCAAAACATGAATCAAATCAAGATGAATTGACTGAAAATACTGGGGATGAGGAGCAGCCAGAGTTGTCTCCTGAAACTAATGAGGAAGAAGTGGCAAAAC
CGGTAGGCGTGGAGGAGACAGAAAAGGAAGAATTTGTTGGTGGTGAAGTTCACAGTCCATTGAAGGCTGATGATAGTGATGATGATGATCCTAATAGATGGGATGATTAT
AAATTTGATTATTAATGGATGTCCCACTCTTTGTTCTCACATTTTT
Protein sequence
MDVDPFDEFFSDPGVTSRAGGRFQPKIKPRPKKQTLAPKSTLSQDKKGTIPDTKSCHDGSGSSKSIKLSSQLPVMEEKGESEDDLLLATARSDLIGCSHPTSVESAKMVN
STQFDLDSYGSTLCSGSTIDGGVTDAIDLTTFSSVPVGENLTDDTKTSGILNNSHPSVSSAHEATVLDQSGLGSIQSEGGHFNDGKIAGENIDLFYELECLDDFHNQPKN
EADPSSLKQASISNEDGDLDKQRLETEVKVLFCYQEKLFLLKIFSSLVHIWSRFQEYGAGAIITMDTISSGTTTPSERPACKYIPKPKMRTAEDACTQISQPEISNMLPL
SPQVNSCDTRCMHEASIGTHSDGILNDSSINFDGYTPDNQHTETPVNVESLAYDSYGDILVDDFNSDDQDEMLREESGKNDEEEPSTESNISQQQKMFSPVGEEIEHSKT
SRKLRKKVSHQLDEPEDGVDENRNSPNEPSSNCDVHGDSYNKNEIPKGVRGKKTSTKSSKPSSDNEKPTRKRKDANKAVPDLQAEKRPKKFSHSTRRNRRQVNKVLLETP
EDEIDFQKISFRDLIIYHEHKEKLEKKVASTRKSATNQRTDTSGEEIYNDGEESLASEQGRGTDDDETPDVVDMTSAYFNYQSFMDKTPRTKWSKQDTERFYEAVRQFGT
DFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAITNRSKVKYFADHSQFLSLIGQLQEAANKAKHESNQDELTENTGDEEQPELSPETNEEEVAKPVGVEETEKEEFV
GGEVHSPLKADDSDDDDPNRWDDYKFDY