; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036811 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036811
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionFHA domain-containing protein
Genome locationchr2:1310924..1317725
RNA-Seq ExpressionLag0036811
SyntenyLag0036811
Gene Ontology termsGO:0031011 - Ino80 complex (cellular component)
GO:0071339 - MLL1 complex (cellular component)
GO:0002151 - G-quadruplex RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR000253 - Forkhead-associated (FHA) domain
IPR008984 - SMAD/FHA domain superfamily
IPR025999 - Microspherule protein, N-terminal domain
IPR037912 - Microspherule protein 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6571705.1 Microspherule protein 1, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0076.75Show/hide
Query:  MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSDDASMSMIDFERSSSILPSKFNKFGNPKETKCIGGKRKS
        MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVS+DASMSMIDFERSSSILPSKFNKFGNPKETK IGGKRKS
Subjt:  MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSDDASMSMIDFERSSSILPSKFNKFGNPKETKCIGGKRKS

Query:  GTVRRCYYALRKRICNEPF-NPMDLSFLVG--DSNYDVGEPMSGNGIPPISDDFGLQSSELGVLPCNFAQNMMNNDDAEHTFHSGCQQTVEKHFPQNLDN
        G+VR CYYALRKRICNEPF NPMDL+FLVG  +SNY V EPMSGN IPPISDDFGLQSSE+G+LPC+F+QN+MN DD EHTF SGCQ TVEKHFP+NLDN
Subjt:  GTVRRCYYALRKRICNEPF-NPMDLSFLVG--DSNYDVGEPMSGNGIPPISDDFGLQSSELGVLPCNFAQNMMNNDDAEHTFHSGCQQTVEKHFPQNLDN

Query:  GHEGISHIMGENLPLSANESHVKELAPSASFPVHSLFENDLEVRP-TFGQLSNDQRVMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNASAP
        G EGISH M E+LP SA +SHV+ELAPS  FPVHSLFENDLE RP TFGQLSNDQR MGSE EDN+VFNSPVS+SGASFHNVEYSSPLPGMPIWRNASAP
Subjt:  GHEGISHIMGENLPLSANESHVKELAPSASFPVHSLFENDLEVRP-TFGQLSNDQRVMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNASAP

Query:  ALPIDVGFEDKDIPTADSFELPDDDGNSNIQNARLAGYDAHSNLKLKIEVQDVHLKSPNATAEVYLAELSNSLLNLSNEDELLFMDVDGKDAIDKSYYD-
        ALPIDVGF DKDIPT++SFELPDDDGN NIQNAR+AGYDA+S+LKLKIEV+  HLKSPNATAEVYLAELSNSL+N+SNEDELLFMDVDGKDA+DKSYYD 
Subjt:  ALPIDVGFEDKDIPTADSFELPDDDGNSNIQNARLAGYDAHSNLKLKIEVQDVHLKSPNATAEVYLAELSNSLLNLSNEDELLFMDVDGKDAIDKSYYD-

Query:  -----------------------------------------------------------EAHPSPSASLNSQCLGKGDEPLFCTLNTEDPEIPSNDDVFL
                                                                   EAH SPSASLNSQC  KGDEPLFCTLNTEDP+IPSNDDVFL
Subjt:  -----------------------------------------------------------EAHPSPSASLNSQCLGKGDEPLFCTLNTEDPEIPSNDDVFL

Query:  PPLSTTATMGYHFHDCMDPTFSSIKDFSCKEKSGEMTQNLVQRERKNHGQPHVSSLSIGLHGLPERGEKHLVGGAAVNLKLSHSNSIHVPSVNNTSSRNV
        PPLST ATMGY+F DC++ TFSS KDF+  EKSGE TQNL  RERKNHG    + ++  LHG  ERGEKH VGGA VN + SHSN+ H+PSV+N  S N 
Subjt:  PPLSTTATMGYHFHDCMDPTFSSIKDFSCKEKSGEMTQNLVQRERKNHGQPHVSSLSIGLHGLPERGEKHLVGGAAVNLKLSHSNSIHVPSVNNTSSRNV

Query:  NNDAILPVMLKEENNEISRVNHLGQNFLNTHVEKPGFESDNVRRYPPSAACGIKQEPDTLTTVKDHRLSQEGGVRGVFGVDQDGISSTSDQEELSIDSED
        N+DA LP +LKEENNEISRVNHLG+NFLN H EKPGF+SDNVR YPPSAAC IKQEPD L ++KDHRLSQEGG RG FGV+Q G+SSTSDQEELSIDSED
Subjt:  NNDAILPVMLKEENNEISRVNHLGQNFLNTHVEKPGFESDNVRRYPPSAACGIKQEPDTLTTVKDHRLSQEGGVRGVFGVDQDGISSTSDQEELSIDSED

Query:  DVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQHMDTKKRIIRLEQGANSYMQRSIASHGALAVLYGRYSKHYIKKSEVLLGRQTEDVIVDIDLGREGS
        DVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQH+DTKKRIIRLEQGAN+YMQRS ASHGALAVLYGRYSKHYIKKSEVLLGR TEDVIVDIDLGREGS
Subjt:  DVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQHMDTKKRIIRLEQGANSYMQRSIASHGALAVLYGRYSKHYIKKSEVLLGRQTEDVIVDIDLGREGS

Query:  RNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPVRMKQYVDNMNKTSHKQEYQSYAVLSCLMGNNSNTFV
         NKISRRQAIIK+DQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNP RMKQYVDN+ K SHKQE   YAVLSCL G      +
Subjt:  RNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPVRMKQYVDNMNKTSHKQEYQSYAVLSCLMGNNSNTFV

Query:  EIISSSNISEELTTTKLQRYIQV----GFGGPTGFTMMFPSLHFGAHQLKLLSFW
        +I+     S     TK++RY  +    GFG P+GFT++FPSLHF A QLKLL  W
Subjt:  EIISSSNISEELTTTKLQRYIQV----GFGGPTGFTMMFPSLHFGAHQLKLLSFW

XP_022135664.1 uncharacterized protein LOC111007538 isoform X1 [Momordica charantia]0.0e+0082.11Show/hide
Query:  MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSDDASMSMIDFERSSSILPSKFNKFGNPKETKCIGGKRKS
        MGALAPVAPWTPEDDILLKNA+EAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVS++ASMSMIDFERSSSILPSKFNKFGNPKETKCIGGKRK 
Subjt:  MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSDDASMSMIDFERSSSILPSKFNKFGNPKETKCIGGKRKS

Query:  GTVRRCYYALRKRICNEPFNPMDLSFLVG--DSNYDVGEPMSGNGIPPISDDFGLQSSELGVLPCNFAQNMMNNDDAEHTFHSGCQQTVEKHFPQNLDNG
        G+VRRCYYALRKRICNEPFNPMDLSFLVG  DSNY V EPMSG+ IPPIS DFGLQ SELG+LP NFA NMMNNDD E TFHS CQ TVEKHFP NLDN 
Subjt:  GTVRRCYYALRKRICNEPFNPMDLSFLVG--DSNYDVGEPMSGNGIPPISDDFGLQSSELGVLPCNFAQNMMNNDDAEHTFHSGCQQTVEKHFPQNLDNG

Query:  HEGISHIMGENLPLSANESHVKELAPSASFPVHSLFENDLEVRP-TFGQLSNDQRVMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNASAPA
        HEGI HIM ENLPLS NES V+ELAPSASFPVHSLFENDLEVRP TFGQ S DQR MGSE EDN+VFNSPVSDSGASFHNVEYSSPLPGMPIWRNASAPA
Subjt:  HEGISHIMGENLPLSANESHVKELAPSASFPVHSLFENDLEVRP-TFGQLSNDQRVMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNASAPA

Query:  LPIDVGFEDKDIPTADSFELPDDDGNSNIQNARLAGYDAHSNLKLKIEVQDVHLKSPNATAEVYLAELSNSLLNLSNEDELLFMDVDGKDAIDKSYYD--
        LPIDVGF DKD+PT DSFELPDDDGN+NIQNAR+A YDA S+ KLKIEVQ  HLKSPNATAEVYLAELSNSLLNL+NEDELLFMD DGKD IDKSYYD  
Subjt:  LPIDVGFEDKDIPTADSFELPDDDGNSNIQNARLAGYDAHSNLKLKIEVQDVHLKSPNATAEVYLAELSNSLLNLSNEDELLFMDVDGKDAIDKSYYD--

Query:  ----------------------------------------------------------EAHPSPSASLNSQCLGKGDEPLFCTLNTEDPEIPSNDDVFLP
                                                                  E HPSPSASLNSQCLGKGDEPLFCTLNTEDPEIPSNDDVFLP
Subjt:  ----------------------------------------------------------EAHPSPSASLNSQCLGKGDEPLFCTLNTEDPEIPSNDDVFLP

Query:  PLSTTATMGYHFHDCMDPTFSSIKDFSCKEKSGEMTQNLVQRERKNHGQPHVSSLSIGLHGLPERGEKHLVGGAAVNLKLSHSNSIHVPSVNN------T
        PLST ++MGYHF D +D TFSSIKDFSC EKSGEMTQNLVQRERKNHGQPHVSSLSIGLHGLPERGEKHLVGGAAVNLKL HSNSIHVPS NN      +
Subjt:  PLSTTATMGYHFHDCMDPTFSSIKDFSCKEKSGEMTQNLVQRERKNHGQPHVSSLSIGLHGLPERGEKHLVGGAAVNLKLSHSNSIHVPSVNN------T

Query:  SSRNVNNDAILPVMLKEENNEISRVNHLGQNFLNTHVEKPGFESDNVRRYPPSAACGIKQEPDTLTTVKDHRLSQEGGVRGVFGVDQDGISSTSDQEELS
        SS N N DAILPV LKEE+ EISRVNHLGQNFLNTHVEKPGF+S+N R+YPPS A GIKQEPD LT VKDHRLSQE G RGVFGV+QDGISSTSDQEELS
Subjt:  SSRNVNNDAILPVMLKEENNEISRVNHLGQNFLNTHVEKPGFESDNVRRYPPSAACGIKQEPDTLTTVKDHRLSQEGGVRGVFGVDQDGISSTSDQEELS

Query:  IDSEDDVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQHMDTKKRIIRLEQGANSYMQRSIASHGALAVLYGRYSKHYIKKSEVLLGRQTEDVIVDIDL
        IDSEDDVPHFSDIEAMILDMDLDPEDQDLY+SEEVL+YQHMDTKKRI+RLEQGA++ M+RS+ASHGALAVLYGRYSKHYIKKSEVLLGR TEDVIVDIDL
Subjt:  IDSEDDVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQHMDTKKRIIRLEQGANSYMQRSIASHGALAVLYGRYSKHYIKKSEVLLGRQTEDVIVDIDL

Query:  GREGSRNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPVRMKQYVDNMNKTSHKQEYQS
        GREGS NKISRRQAIIKIDQDGFFSLKNLGKCSISINNK+VAPGHCLRLNSGCLIEIRGM FIFES+PV MKQY+DN+ KTSHKQEYQS
Subjt:  GREGSRNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPVRMKQYVDNMNKTSHKQEYQS

XP_022963643.1 uncharacterized protein LOC111463912 isoform X1 [Cucurbita moschata]0.0e+0079.19Show/hide
Query:  MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSDDASMSMIDFERSSSILPSKFNKFGNPKETKCIGGKRKS
        MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVS+DASMSMIDFERSSSILPSKFN+FGNPKETK IGGKRKS
Subjt:  MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSDDASMSMIDFERSSSILPSKFNKFGNPKETKCIGGKRKS

Query:  GTVRRCYYALRKRICNEPF-NPMDLSFLVG--DSNYDVGEPMSGNGIPPISDDFGLQSSELGVLPCNFAQNMMNNDDAEHTFHSGCQQTVEKHFPQNLDN
        G+VR CYYALRKRICNEPF NPMDL+FLVG  +SNY V EPMSGN IPPISDDFGLQSSE+G+LPC+F+QN+MN DD EHTF SGCQ TVEKHFP+NLDN
Subjt:  GTVRRCYYALRKRICNEPF-NPMDLSFLVG--DSNYDVGEPMSGNGIPPISDDFGLQSSELGVLPCNFAQNMMNNDDAEHTFHSGCQQTVEKHFPQNLDN

Query:  GHEGISHIMGENLPLSANESHVKELAPSASFPVHSLFENDLEVRP-TFGQLSNDQRVMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNASAP
        G EGISH M E+LP SA +SHV+ELAPS  FPVHSLFENDLE RP TFGQLSNDQR MGSE EDN+VFNSPVS+SGASFHNVEYSSPLPGMPIWRNASAP
Subjt:  GHEGISHIMGENLPLSANESHVKELAPSASFPVHSLFENDLEVRP-TFGQLSNDQRVMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNASAP

Query:  ALPIDVGFEDKDIPTADSFELPDDDGNSNIQNARLAGYDAHSNLKLKIEVQDVHLKSPNATAEVYLAELSNSLLNLSNEDELLFMDVDGKDAIDKSYYD-
        ALPIDVGF DKDIPT++SFELPDDDGN NIQNAR+AGYDA+S+LKLKIEV+  HLKSPNATAEVYLAELSNSL+N+SNEDELLFMDVDGKDA+DKSYYD 
Subjt:  ALPIDVGFEDKDIPTADSFELPDDDGNSNIQNARLAGYDAHSNLKLKIEVQDVHLKSPNATAEVYLAELSNSLLNLSNEDELLFMDVDGKDAIDKSYYD-

Query:  -----------------------------------------------------------EAHPSPSASLNSQCLGKGDEPLFCTLNTEDPEIPSNDDVFL
                                                                   EAH SPSASLN+QC  KGDEPLFCTLNTEDP+IPSNDDVFL
Subjt:  -----------------------------------------------------------EAHPSPSASLNSQCLGKGDEPLFCTLNTEDPEIPSNDDVFL

Query:  PPLSTTATMGYHFHDCMDPTFSSIKDFSCKEKSGEMTQNLVQRERKNHGQPHVSSLSIGLHGLPERGEKHLVGGAAVNLKLSHSNSIHVPSVNNTSSRNV
        PPLST ATMGY+F DC++ TFSS KDF+  EKSGE TQNL  RERKNHG    + ++  LHG  ERGEKH VGGA VN + SHSN+ H+PSV+N  S N 
Subjt:  PPLSTTATMGYHFHDCMDPTFSSIKDFSCKEKSGEMTQNLVQRERKNHGQPHVSSLSIGLHGLPERGEKHLVGGAAVNLKLSHSNSIHVPSVNNTSSRNV

Query:  NNDAILPVMLKEENNEISRVNHLGQNFLNTHVEKPGFESDNVRRYPPSAACGIKQEPDTLTTVKDHRLSQEGGVRGVFGVDQDGISSTSDQEELSIDSED
        N+DA LP +LKEENNEISRVNHLG+NFLN H EKPGF+SDNVR YPPSAAC IKQEPD L ++KDHRLSQEGG RG FGV+Q G+SSTSDQEELSIDSED
Subjt:  NNDAILPVMLKEENNEISRVNHLGQNFLNTHVEKPGFESDNVRRYPPSAACGIKQEPDTLTTVKDHRLSQEGGVRGVFGVDQDGISSTSDQEELSIDSED

Query:  DVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQHMDTKKRIIRLEQGANSYMQRSIASHGALAVLYGRYSKHYIKKSEVLLGRQTEDVIVDIDLGREGS
        DVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQH+DTKKRIIRLEQGAN+YMQRS ASHGALAVLYGRYSKHYIKKSEVLLGR TEDVIVDIDLGREGS
Subjt:  DVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQHMDTKKRIIRLEQGANSYMQRSIASHGALAVLYGRYSKHYIKKSEVLLGRQTEDVIVDIDLGREGS

Query:  RNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPVRMKQYVDNMNKTSHKQEYQS
         NKISRRQAIIK+DQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNP RMKQYVDN+ K SHKQEYQS
Subjt:  RNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPVRMKQYVDNMNKTSHKQEYQS

XP_022967567.1 uncharacterized protein LOC111467030 isoform X1 [Cucurbita maxima]0.0e+0078.77Show/hide
Query:  GLVRVFCFTEMGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSDDASMSMIDFERSSSILPSKFNKFGNPKE
        GLVR +CF EMGALAPV PWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVS+DASMSMIDFERSSSILPSKFNKFGNPKE
Subjt:  GLVRVFCFTEMGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSDDASMSMIDFERSSSILPSKFNKFGNPKE

Query:  TKCIGGKRKSGTVRRCYYALRKRICNEPF-NPMDLSFLVG--DSNYDVGEPMSGNGIPPISDDFGLQSSELGVLPCNFAQNMMNNDDAEHTFHSGCQQTV
        TK IGGKRKSG+VR CYYALRKRICNEPF NPM+LSFLVG  +SNY V EPMSGN IPPISDDFGLQSSELG+LPC+F+QN+MN DD +HTF SGCQ TV
Subjt:  TKCIGGKRKSGTVRRCYYALRKRICNEPF-NPMDLSFLVG--DSNYDVGEPMSGNGIPPISDDFGLQSSELGVLPCNFAQNMMNNDDAEHTFHSGCQQTV

Query:  EKHFPQNLDNGHEGISHIMGENLPLSANESHVKELAPSASFPVHSLFENDLEVRP-TFGQLSNDQRVMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPG
        EKHFP+NLDNG EGISH M E+LP SA +SHV+ELAPS  FPVHSLFENDLE RP TFGQLSNDQR MGSE EDN+VFNSPVS+SGASFHNVEYSSPLPG
Subjt:  EKHFPQNLDNGHEGISHIMGENLPLSANESHVKELAPSASFPVHSLFENDLEVRP-TFGQLSNDQRVMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPG

Query:  MPIWRNASAPALPIDVGFEDKDIPTADSFELPDDDGNSNIQNARLAGYDAHSNLKLKIEVQDVHLKSPNATAEVYLAELSNSLLNLSNEDELLFMDVDGK
        MPIWRNASAPALPIDVGF DKDIPT++SFELPDDDGN NIQNA +AGYDA+++LKLK EV+  HLKSPNATAEVYLAELSNSL+N+SNEDELLFMDVDGK
Subjt:  MPIWRNASAPALPIDVGFEDKDIPTADSFELPDDDGNSNIQNARLAGYDAHSNLKLKIEVQDVHLKSPNATAEVYLAELSNSLLNLSNEDELLFMDVDGK

Query:  DAIDKSYYD------------------------------------------------------------EAHPSPSASLNSQCLGKGDEPLFCTLNTEDP
        DA+DKSYYD                                                            EAH SPSASLNS C  K DEPLFCTLNTEDP
Subjt:  DAIDKSYYD------------------------------------------------------------EAHPSPSASLNSQCLGKGDEPLFCTLNTEDP

Query:  EIPSNDDVFLPPLSTTATMGYHFHDCMDPTFSSIKDFSCKEKSGEMTQNLVQRERKNHGQPHVSSLSIGLHGLPERGEKHLVGGAAVNLKLSHSNSIHVP
        +IPSNDDVFLPPLST ATMGY+F DC+  TFSS KDF+  EKSGE TQNL  RERKNHGQ  V S   GL+G  ERGEKH VGGA VN + SHSN+ H+P
Subjt:  EIPSNDDVFLPPLSTTATMGYHFHDCMDPTFSSIKDFSCKEKSGEMTQNLVQRERKNHGQPHVSSLSIGLHGLPERGEKHLVGGAAVNLKLSHSNSIHVP

Query:  SVNNTSSRNVNNDAILPVMLKEENNEISRVNHLGQNFLNTHVEKPGFESDNVRRYPPSAACGIKQEPDTLTTVKDHRLSQEGGVRGVFGVDQDGISSTSD
        SV+N  S N N+DA LP +LKEENNEISRVNHLG+NFLN H EKPGF+SDNVR YPPSAAC IKQEP+ L ++KDHRLSQEGG RG FGV+Q G+SSTSD
Subjt:  SVNNTSSRNVNNDAILPVMLKEENNEISRVNHLGQNFLNTHVEKPGFESDNVRRYPPSAACGIKQEPDTLTTVKDHRLSQEGGVRGVFGVDQDGISSTSD

Query:  QEELSIDSEDDVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQHMDTKKRIIRLEQGANSYMQRSIASHGALAVLYGRYSKHYIKKSEVLLGRQTEDVI
        QEELSIDSEDDVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQH+DTKKRIIRLEQGAN+YMQRS ASHGALAVLYGRYSKHYIKKSEVLLGR TEDVI
Subjt:  QEELSIDSEDDVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQHMDTKKRIIRLEQGANSYMQRSIASHGALAVLYGRYSKHYIKKSEVLLGRQTEDVI

Query:  VDIDLGREGSRNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPVRMKQYVDNM-NKTSHKQEYQS
        VDIDLGREGS NKISRRQAIIK+DQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNP RMKQYVDN+  K SHKQEYQS
Subjt:  VDIDLGREGSRNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPVRMKQYVDNM-NKTSHKQEYQS

XP_023554018.1 uncharacterized protein LOC111811415 isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0079.52Show/hide
Query:  MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSDDASMSMIDFERSSSILPSKFNKFGNPKETKCIGGKRKS
        MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVS+DASMSMIDFERSSSILPSKFNKFGNPKETK IGGKRKS
Subjt:  MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSDDASMSMIDFERSSSILPSKFNKFGNPKETKCIGGKRKS

Query:  GTVRRCYYALRKRICNEPF-NPMDLSFLVG--DSNYDVGEPMSGNGIPPISDDFGLQSSELGVLPCNFAQNMMNNDDAEHTFHSGCQQTVEKHFPQNLDN
        G+VR CYYALRKR+CNEPF NPMDL+FLVG  +SNY V EPMSGN IPPISDDFGLQSSE+G+LPC+F+QN+MN DD EHTF SGCQ TVEKHFP+NLDN
Subjt:  GTVRRCYYALRKRICNEPF-NPMDLSFLVG--DSNYDVGEPMSGNGIPPISDDFGLQSSELGVLPCNFAQNMMNNDDAEHTFHSGCQQTVEKHFPQNLDN

Query:  GHEGISHIMGENLPLSANESHVKELAPSASFPVHSLFENDLEVRP-TFGQLSNDQRVMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNASAP
        G EGISH M E+LP SA +SHV+ LAPS  FPVHS+FENDLE RP TFGQLSNDQRVMGSE EDN+VFNSPVS+SGASFHNVEYSSPLPGMPIWRNAS P
Subjt:  GHEGISHIMGENLPLSANESHVKELAPSASFPVHSLFENDLEVRP-TFGQLSNDQRVMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNASAP

Query:  ALPIDVGFEDKDIPTADSFELPDDDGNSNIQNARLAGYDAHSNLKLKIEVQDVHLKSPNATAEVYLAELSNSLLNLSNEDELLFMDVDGKDAIDKSYYD-
        ALPIDVGF DKDIPT++SFELPDDDGN NIQNAR+AGYDA+S+LKLKIEV+  HLKSPNATAEVYLAELSNSL+N+SNEDELLFMDVDGKDA+DKSYYD 
Subjt:  ALPIDVGFEDKDIPTADSFELPDDDGNSNIQNARLAGYDAHSNLKLKIEVQDVHLKSPNATAEVYLAELSNSLLNLSNEDELLFMDVDGKDAIDKSYYD-

Query:  -----------------------------------------------------------EAHPSPSASLNSQCLGKGDEPLFCTLNTEDPEIPSNDDVFL
                                                                   EAH SPSASLNSQC  KGDEPLFCTLNTEDP+IPSNDDVFL
Subjt:  -----------------------------------------------------------EAHPSPSASLNSQCLGKGDEPLFCTLNTEDPEIPSNDDVFL

Query:  PPLSTTATMGYHFHDCMDPTFSSIKDFSCKEKSGEMTQNLVQRERKNHGQPHVSSLSIGLHGLPERGEKHLVGGAAVNLKLSHSNSIHVPSVNNTSSRNV
        PPLST ATMGY+F DC++ TFSS KDF+  EKSGE TQNL  RERKNHGQP V S   GLHG  ERGEKH VGGA VN + SHSN+ H+PSV+N  S N 
Subjt:  PPLSTTATMGYHFHDCMDPTFSSIKDFSCKEKSGEMTQNLVQRERKNHGQPHVSSLSIGLHGLPERGEKHLVGGAAVNLKLSHSNSIHVPSVNNTSSRNV

Query:  NNDAILPVMLKEENNEISRVNHLGQNFLNTHVEKPGFESDNVRRYPPSAACGIKQEPDTLTTVKDHRLSQEGGVRGVFGVDQDGISSTSDQEELSIDSED
        N+DA LP +LKEENNEISRVNHLG+NFLN H +KPGF+SDNVR YPPSAAC IKQEPD L ++KDHRLSQEGG RG FGV+Q G+SSTSDQEELSIDSED
Subjt:  NNDAILPVMLKEENNEISRVNHLGQNFLNTHVEKPGFESDNVRRYPPSAACGIKQEPDTLTTVKDHRLSQEGGVRGVFGVDQDGISSTSDQEELSIDSED

Query:  DVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQHMDTKKRIIRLEQGANSYMQRSIASHGALAVLYGRYSKHYIKKSEVLLGRQTEDVIVDIDLGREGS
        DVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQH+DTKKRIIRLEQGAN+YMQRS ASHGALAVLYGRYSKHYIKKSEVLLGR TEDVIVDIDLGREGS
Subjt:  DVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQHMDTKKRIIRLEQGANSYMQRSIASHGALAVLYGRYSKHYIKKSEVLLGRQTEDVIVDIDLGREGS

Query:  RNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPVRMKQYVDNMNKTSHKQEYQS
         NKISRRQAIIK+DQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNP RMKQYVDN+ K SHKQEYQS
Subjt:  RNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPVRMKQYVDNMNKTSHKQEYQS

TrEMBL top hitse value%identityAlignment
A0A6J1C2Q8 uncharacterized protein LOC111007538 isoform X20.0e+0078.74Show/hide
Query:  MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSDDASMSMIDFERSSSILPSKFNKFGNPKETKCIGGKRKS
        MGALAPVAPWTPEDDILLKNA+EAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVS++ASMSMIDFERSSSILPSKFNKFGNPKETKCIGGKRK 
Subjt:  MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSDDASMSMIDFERSSSILPSKFNKFGNPKETKCIGGKRKS

Query:  GTVRRCYYALRKRICNEPFNPMDLSFLVG--DSNYDVGEPMSGNGIPPISDDFGLQSSELGVLPCNFAQNMMNNDDAEHTFHSGCQQTVEKHFPQNLDNG
        G+VRRCYYALRKRICNEPFNPMDLSFLVG  DSNY V EPMSG+ IPPIS DFGLQ SELG+LP NFA NMMNNDD E TFHS CQ TVEKHFP NLDN 
Subjt:  GTVRRCYYALRKRICNEPFNPMDLSFLVG--DSNYDVGEPMSGNGIPPISDDFGLQSSELGVLPCNFAQNMMNNDDAEHTFHSGCQQTVEKHFPQNLDNG

Query:  HEGISHIMGENLPLSANESHVKELAPSASFPVHSLFENDLEVRP-TFGQLSNDQRVMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNASAPA
        HEGI HIM ENLPLS NES V+ELAPSASFPVHSLFENDLEVRP TFGQ S DQR MGSE EDN+VFNSPVSDSGASFHNVEYSSPLPGMPIWRNASAPA
Subjt:  HEGISHIMGENLPLSANESHVKELAPSASFPVHSLFENDLEVRP-TFGQLSNDQRVMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNASAPA

Query:  LPIDVGFEDKDIPTADSFELPDDDGNSNIQNARLAGYDAHSNLKLKIEVQDVHLKSPNATAEVYLAELSNSLLNLSNEDELLFMDVDGKDAIDKSYYD--
        LPIDVGF DKD+PT DSFELPDDDGN+NIQNAR+A YDA S+ KLKIEVQ  HLKSPNATAEVYLAELSNSLLNL+NEDELLFMD DGKD IDKSYYD  
Subjt:  LPIDVGFEDKDIPTADSFELPDDDGNSNIQNARLAGYDAHSNLKLKIEVQDVHLKSPNATAEVYLAELSNSLLNLSNEDELLFMDVDGKDAIDKSYYD--

Query:  ----------------------------------------------------------EAHPSPSASLNSQCLGKGDEPLFCTLNTEDPEIPSNDDVFLP
                                                                  E HPSPSASLNSQCLGKGDEPLFCTLNTEDPEIPSNDDVFLP
Subjt:  ----------------------------------------------------------EAHPSPSASLNSQCLGKGDEPLFCTLNTEDPEIPSNDDVFLP

Query:  PLSTTATMGYHFHDCMDPTFSSIKDFSCKEKSGEMTQNLVQRERKNHGQPHVSSLSIGLHGLPERGEKHLVGGAAVNLKLSHSNSIHVPSVNN------T
        PLST ++MGYHF D +D TFSSIKDFSC EKSGEMTQNLVQRERKNHGQPHVSSLSIGLHGLPERGEKHLVGGAAVNLKL HSNSIHVPS NN      +
Subjt:  PLSTTATMGYHFHDCMDPTFSSIKDFSCKEKSGEMTQNLVQRERKNHGQPHVSSLSIGLHGLPERGEKHLVGGAAVNLKLSHSNSIHVPSVNN------T

Query:  SSRNVNNDAILPVMLKEENNEISRVNHLGQNFLNTHVEKPGFESDNVRRYPPSAACGIKQEPDTLTTVKDHRLSQEGGVRGVFGVDQDGISSTSDQEELS
        SS N N DAILPV LKEE+ EISR                                    EPD LT VKDHRLSQE G RGVFGV+QDGISSTSDQEELS
Subjt:  SSRNVNNDAILPVMLKEENNEISRVNHLGQNFLNTHVEKPGFESDNVRRYPPSAACGIKQEPDTLTTVKDHRLSQEGGVRGVFGVDQDGISSTSDQEELS

Query:  IDSEDDVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQHMDTKKRIIRLEQGANSYMQRSIASHGALAVLYGRYSKHYIKKSEVLLGRQTEDVIVDIDL
        IDSEDDVPHFSDIEAMILDMDLDPEDQDLY+SEEVL+YQHMDTKKRI+RLEQGA++ M+RS+ASHGALAVLYGRYSKHYIKKSEVLLGR TEDVIVDIDL
Subjt:  IDSEDDVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQHMDTKKRIIRLEQGANSYMQRSIASHGALAVLYGRYSKHYIKKSEVLLGRQTEDVIVDIDL

Query:  GREGSRNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPVRMKQYVDNMNKTSHKQEYQS
        GREGS NKISRRQAIIKIDQDGFFSLKNLGKCSISINNK+VAPGHCLRLNSGCLIEIRGM FIFES+PV MKQY+DN+ KTSHKQEYQS
Subjt:  GREGSRNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPVRMKQYVDNMNKTSHKQEYQS

A0A6J1C2X8 uncharacterized protein LOC111007538 isoform X30.0e+0077.28Show/hide
Query:  MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSDDASMSMIDFERSSSILPSKFNKFGNPKETKCIGGKRKS
        MGALAPVAPWTPEDDILLKNA+EAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVS++ASMSMIDFERSSSILPSKFNKFGNPKETKCIGGKRK 
Subjt:  MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSDDASMSMIDFERSSSILPSKFNKFGNPKETKCIGGKRKS

Query:  GTVRRCYYALRKRICNEPFNPMDLSFLVG--DSNYDVGEPMSGNGIPPISDDFGLQSSELGVLPCNFAQNMMNNDDAEHTFHSGCQQTVEKHFPQNLDNG
        G+VRRCYYALRKRICNEPFNPMDLSFLVG  DSNY V EPMSG+ IPPIS DFGLQ SELG+LP NFA NMMNNDD E TFHS CQ TVEKHFP NLDN 
Subjt:  GTVRRCYYALRKRICNEPFNPMDLSFLVG--DSNYDVGEPMSGNGIPPISDDFGLQSSELGVLPCNFAQNMMNNDDAEHTFHSGCQQTVEKHFPQNLDNG

Query:  HEGISHIMGENLPLSANESHVKELAPSASFPVHSLFENDLEVRP-TFGQLSNDQRVMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNASAPA
        HEGI HIM ENLPLS NES V+ELAPSASFPVHSLFENDLEVRP TFGQ S DQR MGSE EDN+VFNSPVSDSGASFHNVEYSSPLPGMPIWRNASAPA
Subjt:  HEGISHIMGENLPLSANESHVKELAPSASFPVHSLFENDLEVRP-TFGQLSNDQRVMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNASAPA

Query:  LPIDVGFEDKDIPTADSFELPDDDGNSNIQNARLAGYDAHSNLKLKIEVQDVHLKSPNATAEVYLAELSNSLLNLSNEDELLFMDVDGKDAIDKSYYD--
        LPIDVGF DKD+PT DSFELPDDDGN+NIQNAR+A YDA S+ KLKIEVQ  HLKSPNATAEVYLAELSNSLLNL+NEDELLFMD DGKD IDKSYYD  
Subjt:  LPIDVGFEDKDIPTADSFELPDDDGNSNIQNARLAGYDAHSNLKLKIEVQDVHLKSPNATAEVYLAELSNSLLNLSNEDELLFMDVDGKDAIDKSYYD--

Query:  ----------------------------------------------------------EAHPSPSASLNSQCLGKGDEPLFCTLNTEDPEIPSNDDVFLP
                                                                  E HPSPSASLNSQCLGKGDEPLFCTLNTEDPEIPSNDDVFLP
Subjt:  ----------------------------------------------------------EAHPSPSASLNSQCLGKGDEPLFCTLNTEDPEIPSNDDVFLP

Query:  PLSTTATMGYHFHDCMDPTFSSIKDFSCKEKSGEMTQNLVQRERKNHGQPHVSSLSIGLHGLPERGEKHLVGGAAVNLKLSHSNSIHVPSVNN------T
        PLST ++MGYHF D +D TFSSIKDFSC EKSGEMTQNLVQRERKNHGQPHVSSLSIGLHGLPERGEKHLVGGAAVNLKL HSNSIHVPS NN      +
Subjt:  PLSTTATMGYHFHDCMDPTFSSIKDFSCKEKSGEMTQNLVQRERKNHGQPHVSSLSIGLHGLPERGEKHLVGGAAVNLKLSHSNSIHVPSVNN------T

Query:  SSRNVNNDAILPVMLKEENNEISRVNHLGQNFLNTHVEKPGFESDNVRRYPPSAACGIKQEPDTLTTVKDHRLSQEGGVRGVFGVDQDGISSTSDQEELS
        SS N N DAILPV LKEE+ EISR                                                   E G RGVFGV+QDGISSTSDQEELS
Subjt:  SSRNVNNDAILPVMLKEENNEISRVNHLGQNFLNTHVEKPGFESDNVRRYPPSAACGIKQEPDTLTTVKDHRLSQEGGVRGVFGVDQDGISSTSDQEELS

Query:  IDSEDDVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQHMDTKKRIIRLEQGANSYMQRSIASHGALAVLYGRYSKHYIKKSEVLLGRQTEDVIVDIDL
        IDSEDDVPHFSDIEAMILDMDLDPEDQDLY+SEEVL+YQHMDTKKRI+RLEQGA++ M+RS+ASHGALAVLYGRYSKHYIKKSEVLLGR TEDVIVDIDL
Subjt:  IDSEDDVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQHMDTKKRIIRLEQGANSYMQRSIASHGALAVLYGRYSKHYIKKSEVLLGRQTEDVIVDIDL

Query:  GREGSRNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPVRMKQYVDNMNKTSHKQEYQS
        GREGS NKISRRQAIIKIDQDGFFSLKNLGKCSISINNK+VAPGHCLRLNSGCLIEIRGM FIFES+PV MKQY+DN+ KTSHKQEYQS
Subjt:  GREGSRNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPVRMKQYVDNMNKTSHKQEYQS

A0A6J1C5Q1 uncharacterized protein LOC111007538 isoform X10.0e+0082.11Show/hide
Query:  MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSDDASMSMIDFERSSSILPSKFNKFGNPKETKCIGGKRKS
        MGALAPVAPWTPEDDILLKNA+EAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVS++ASMSMIDFERSSSILPSKFNKFGNPKETKCIGGKRK 
Subjt:  MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSDDASMSMIDFERSSSILPSKFNKFGNPKETKCIGGKRKS

Query:  GTVRRCYYALRKRICNEPFNPMDLSFLVG--DSNYDVGEPMSGNGIPPISDDFGLQSSELGVLPCNFAQNMMNNDDAEHTFHSGCQQTVEKHFPQNLDNG
        G+VRRCYYALRKRICNEPFNPMDLSFLVG  DSNY V EPMSG+ IPPIS DFGLQ SELG+LP NFA NMMNNDD E TFHS CQ TVEKHFP NLDN 
Subjt:  GTVRRCYYALRKRICNEPFNPMDLSFLVG--DSNYDVGEPMSGNGIPPISDDFGLQSSELGVLPCNFAQNMMNNDDAEHTFHSGCQQTVEKHFPQNLDNG

Query:  HEGISHIMGENLPLSANESHVKELAPSASFPVHSLFENDLEVRP-TFGQLSNDQRVMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNASAPA
        HEGI HIM ENLPLS NES V+ELAPSASFPVHSLFENDLEVRP TFGQ S DQR MGSE EDN+VFNSPVSDSGASFHNVEYSSPLPGMPIWRNASAPA
Subjt:  HEGISHIMGENLPLSANESHVKELAPSASFPVHSLFENDLEVRP-TFGQLSNDQRVMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNASAPA

Query:  LPIDVGFEDKDIPTADSFELPDDDGNSNIQNARLAGYDAHSNLKLKIEVQDVHLKSPNATAEVYLAELSNSLLNLSNEDELLFMDVDGKDAIDKSYYD--
        LPIDVGF DKD+PT DSFELPDDDGN+NIQNAR+A YDA S+ KLKIEVQ  HLKSPNATAEVYLAELSNSLLNL+NEDELLFMD DGKD IDKSYYD  
Subjt:  LPIDVGFEDKDIPTADSFELPDDDGNSNIQNARLAGYDAHSNLKLKIEVQDVHLKSPNATAEVYLAELSNSLLNLSNEDELLFMDVDGKDAIDKSYYD--

Query:  ----------------------------------------------------------EAHPSPSASLNSQCLGKGDEPLFCTLNTEDPEIPSNDDVFLP
                                                                  E HPSPSASLNSQCLGKGDEPLFCTLNTEDPEIPSNDDVFLP
Subjt:  ----------------------------------------------------------EAHPSPSASLNSQCLGKGDEPLFCTLNTEDPEIPSNDDVFLP

Query:  PLSTTATMGYHFHDCMDPTFSSIKDFSCKEKSGEMTQNLVQRERKNHGQPHVSSLSIGLHGLPERGEKHLVGGAAVNLKLSHSNSIHVPSVNN------T
        PLST ++MGYHF D +D TFSSIKDFSC EKSGEMTQNLVQRERKNHGQPHVSSLSIGLHGLPERGEKHLVGGAAVNLKL HSNSIHVPS NN      +
Subjt:  PLSTTATMGYHFHDCMDPTFSSIKDFSCKEKSGEMTQNLVQRERKNHGQPHVSSLSIGLHGLPERGEKHLVGGAAVNLKLSHSNSIHVPSVNN------T

Query:  SSRNVNNDAILPVMLKEENNEISRVNHLGQNFLNTHVEKPGFESDNVRRYPPSAACGIKQEPDTLTTVKDHRLSQEGGVRGVFGVDQDGISSTSDQEELS
        SS N N DAILPV LKEE+ EISRVNHLGQNFLNTHVEKPGF+S+N R+YPPS A GIKQEPD LT VKDHRLSQE G RGVFGV+QDGISSTSDQEELS
Subjt:  SSRNVNNDAILPVMLKEENNEISRVNHLGQNFLNTHVEKPGFESDNVRRYPPSAACGIKQEPDTLTTVKDHRLSQEGGVRGVFGVDQDGISSTSDQEELS

Query:  IDSEDDVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQHMDTKKRIIRLEQGANSYMQRSIASHGALAVLYGRYSKHYIKKSEVLLGRQTEDVIVDIDL
        IDSEDDVPHFSDIEAMILDMDLDPEDQDLY+SEEVL+YQHMDTKKRI+RLEQGA++ M+RS+ASHGALAVLYGRYSKHYIKKSEVLLGR TEDVIVDIDL
Subjt:  IDSEDDVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQHMDTKKRIIRLEQGANSYMQRSIASHGALAVLYGRYSKHYIKKSEVLLGRQTEDVIVDIDL

Query:  GREGSRNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPVRMKQYVDNMNKTSHKQEYQS
        GREGS NKISRRQAIIKIDQDGFFSLKNLGKCSISINNK+VAPGHCLRLNSGCLIEIRGM FIFES+PV MKQY+DN+ KTSHKQEYQS
Subjt:  GREGSRNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPVRMKQYVDNMNKTSHKQEYQS

A0A6J1HIH5 uncharacterized protein LOC111463912 isoform X10.0e+0079.19Show/hide
Query:  MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSDDASMSMIDFERSSSILPSKFNKFGNPKETKCIGGKRKS
        MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVS+DASMSMIDFERSSSILPSKFN+FGNPKETK IGGKRKS
Subjt:  MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSDDASMSMIDFERSSSILPSKFNKFGNPKETKCIGGKRKS

Query:  GTVRRCYYALRKRICNEPF-NPMDLSFLVG--DSNYDVGEPMSGNGIPPISDDFGLQSSELGVLPCNFAQNMMNNDDAEHTFHSGCQQTVEKHFPQNLDN
        G+VR CYYALRKRICNEPF NPMDL+FLVG  +SNY V EPMSGN IPPISDDFGLQSSE+G+LPC+F+QN+MN DD EHTF SGCQ TVEKHFP+NLDN
Subjt:  GTVRRCYYALRKRICNEPF-NPMDLSFLVG--DSNYDVGEPMSGNGIPPISDDFGLQSSELGVLPCNFAQNMMNNDDAEHTFHSGCQQTVEKHFPQNLDN

Query:  GHEGISHIMGENLPLSANESHVKELAPSASFPVHSLFENDLEVRP-TFGQLSNDQRVMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNASAP
        G EGISH M E+LP SA +SHV+ELAPS  FPVHSLFENDLE RP TFGQLSNDQR MGSE EDN+VFNSPVS+SGASFHNVEYSSPLPGMPIWRNASAP
Subjt:  GHEGISHIMGENLPLSANESHVKELAPSASFPVHSLFENDLEVRP-TFGQLSNDQRVMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNASAP

Query:  ALPIDVGFEDKDIPTADSFELPDDDGNSNIQNARLAGYDAHSNLKLKIEVQDVHLKSPNATAEVYLAELSNSLLNLSNEDELLFMDVDGKDAIDKSYYD-
        ALPIDVGF DKDIPT++SFELPDDDGN NIQNAR+AGYDA+S+LKLKIEV+  HLKSPNATAEVYLAELSNSL+N+SNEDELLFMDVDGKDA+DKSYYD 
Subjt:  ALPIDVGFEDKDIPTADSFELPDDDGNSNIQNARLAGYDAHSNLKLKIEVQDVHLKSPNATAEVYLAELSNSLLNLSNEDELLFMDVDGKDAIDKSYYD-

Query:  -----------------------------------------------------------EAHPSPSASLNSQCLGKGDEPLFCTLNTEDPEIPSNDDVFL
                                                                   EAH SPSASLN+QC  KGDEPLFCTLNTEDP+IPSNDDVFL
Subjt:  -----------------------------------------------------------EAHPSPSASLNSQCLGKGDEPLFCTLNTEDPEIPSNDDVFL

Query:  PPLSTTATMGYHFHDCMDPTFSSIKDFSCKEKSGEMTQNLVQRERKNHGQPHVSSLSIGLHGLPERGEKHLVGGAAVNLKLSHSNSIHVPSVNNTSSRNV
        PPLST ATMGY+F DC++ TFSS KDF+  EKSGE TQNL  RERKNHG    + ++  LHG  ERGEKH VGGA VN + SHSN+ H+PSV+N  S N 
Subjt:  PPLSTTATMGYHFHDCMDPTFSSIKDFSCKEKSGEMTQNLVQRERKNHGQPHVSSLSIGLHGLPERGEKHLVGGAAVNLKLSHSNSIHVPSVNNTSSRNV

Query:  NNDAILPVMLKEENNEISRVNHLGQNFLNTHVEKPGFESDNVRRYPPSAACGIKQEPDTLTTVKDHRLSQEGGVRGVFGVDQDGISSTSDQEELSIDSED
        N+DA LP +LKEENNEISRVNHLG+NFLN H EKPGF+SDNVR YPPSAAC IKQEPD L ++KDHRLSQEGG RG FGV+Q G+SSTSDQEELSIDSED
Subjt:  NNDAILPVMLKEENNEISRVNHLGQNFLNTHVEKPGFESDNVRRYPPSAACGIKQEPDTLTTVKDHRLSQEGGVRGVFGVDQDGISSTSDQEELSIDSED

Query:  DVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQHMDTKKRIIRLEQGANSYMQRSIASHGALAVLYGRYSKHYIKKSEVLLGRQTEDVIVDIDLGREGS
        DVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQH+DTKKRIIRLEQGAN+YMQRS ASHGALAVLYGRYSKHYIKKSEVLLGR TEDVIVDIDLGREGS
Subjt:  DVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQHMDTKKRIIRLEQGANSYMQRSIASHGALAVLYGRYSKHYIKKSEVLLGRQTEDVIVDIDLGREGS

Query:  RNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPVRMKQYVDNMNKTSHKQEYQS
         NKISRRQAIIK+DQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNP RMKQYVDN+ K SHKQEYQS
Subjt:  RNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPVRMKQYVDNMNKTSHKQEYQS

A0A6J1HR64 uncharacterized protein LOC111467030 isoform X10.0e+0078.77Show/hide
Query:  GLVRVFCFTEMGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSDDASMSMIDFERSSSILPSKFNKFGNPKE
        GLVR +CF EMGALAPV PWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVS+DASMSMIDFERSSSILPSKFNKFGNPKE
Subjt:  GLVRVFCFTEMGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSDDASMSMIDFERSSSILPSKFNKFGNPKE

Query:  TKCIGGKRKSGTVRRCYYALRKRICNEPF-NPMDLSFLVG--DSNYDVGEPMSGNGIPPISDDFGLQSSELGVLPCNFAQNMMNNDDAEHTFHSGCQQTV
        TK IGGKRKSG+VR CYYALRKRICNEPF NPM+LSFLVG  +SNY V EPMSGN IPPISDDFGLQSSELG+LPC+F+QN+MN DD +HTF SGCQ TV
Subjt:  TKCIGGKRKSGTVRRCYYALRKRICNEPF-NPMDLSFLVG--DSNYDVGEPMSGNGIPPISDDFGLQSSELGVLPCNFAQNMMNNDDAEHTFHSGCQQTV

Query:  EKHFPQNLDNGHEGISHIMGENLPLSANESHVKELAPSASFPVHSLFENDLEVRP-TFGQLSNDQRVMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPG
        EKHFP+NLDNG EGISH M E+LP SA +SHV+ELAPS  FPVHSLFENDLE RP TFGQLSNDQR MGSE EDN+VFNSPVS+SGASFHNVEYSSPLPG
Subjt:  EKHFPQNLDNGHEGISHIMGENLPLSANESHVKELAPSASFPVHSLFENDLEVRP-TFGQLSNDQRVMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPG

Query:  MPIWRNASAPALPIDVGFEDKDIPTADSFELPDDDGNSNIQNARLAGYDAHSNLKLKIEVQDVHLKSPNATAEVYLAELSNSLLNLSNEDELLFMDVDGK
        MPIWRNASAPALPIDVGF DKDIPT++SFELPDDDGN NIQNA +AGYDA+++LKLK EV+  HLKSPNATAEVYLAELSNSL+N+SNEDELLFMDVDGK
Subjt:  MPIWRNASAPALPIDVGFEDKDIPTADSFELPDDDGNSNIQNARLAGYDAHSNLKLKIEVQDVHLKSPNATAEVYLAELSNSLLNLSNEDELLFMDVDGK

Query:  DAIDKSYYD------------------------------------------------------------EAHPSPSASLNSQCLGKGDEPLFCTLNTEDP
        DA+DKSYYD                                                            EAH SPSASLNS C  K DEPLFCTLNTEDP
Subjt:  DAIDKSYYD------------------------------------------------------------EAHPSPSASLNSQCLGKGDEPLFCTLNTEDP

Query:  EIPSNDDVFLPPLSTTATMGYHFHDCMDPTFSSIKDFSCKEKSGEMTQNLVQRERKNHGQPHVSSLSIGLHGLPERGEKHLVGGAAVNLKLSHSNSIHVP
        +IPSNDDVFLPPLST ATMGY+F DC+  TFSS KDF+  EKSGE TQNL  RERKNHGQ  V S   GL+G  ERGEKH VGGA VN + SHSN+ H+P
Subjt:  EIPSNDDVFLPPLSTTATMGYHFHDCMDPTFSSIKDFSCKEKSGEMTQNLVQRERKNHGQPHVSSLSIGLHGLPERGEKHLVGGAAVNLKLSHSNSIHVP

Query:  SVNNTSSRNVNNDAILPVMLKEENNEISRVNHLGQNFLNTHVEKPGFESDNVRRYPPSAACGIKQEPDTLTTVKDHRLSQEGGVRGVFGVDQDGISSTSD
        SV+N  S N N+DA LP +LKEENNEISRVNHLG+NFLN H EKPGF+SDNVR YPPSAAC IKQEP+ L ++KDHRLSQEGG RG FGV+Q G+SSTSD
Subjt:  SVNNTSSRNVNNDAILPVMLKEENNEISRVNHLGQNFLNTHVEKPGFESDNVRRYPPSAACGIKQEPDTLTTVKDHRLSQEGGVRGVFGVDQDGISSTSD

Query:  QEELSIDSEDDVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQHMDTKKRIIRLEQGANSYMQRSIASHGALAVLYGRYSKHYIKKSEVLLGRQTEDVI
        QEELSIDSEDDVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQH+DTKKRIIRLEQGAN+YMQRS ASHGALAVLYGRYSKHYIKKSEVLLGR TEDVI
Subjt:  QEELSIDSEDDVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQHMDTKKRIIRLEQGANSYMQRSIASHGALAVLYGRYSKHYIKKSEVLLGRQTEDVI

Query:  VDIDLGREGSRNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPVRMKQYVDNM-NKTSHKQEYQS
        VDIDLGREGS NKISRRQAIIK+DQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNP RMKQYVDN+  K SHKQEYQS
Subjt:  VDIDLGREGSRNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPVRMKQYVDNM-NKTSHKQEYQS

SwissProt top hitse value%identityAlignment
Q96EZ8 Microspherule protein 12.8e-1333.33Show/hide
Query:  DQEELSIDSEDDVPHFSDIEAMILDMDL-DPEDQDLYSSEEVLKYQHMDTKKRIIRLEQGANSY---------MQRSIASHGALAVLYGRYSKHYIKKSE
        DQ    +   D V +FSD E +I D  L D  D+ L   E  L       K+ I +LEQ  + +         M      +  LAVL GR  ++ ++  E
Subjt:  DQEELSIDSEDDVPHFSDIEAMILDMDL-DPEDQDLYSSEEVLKYQHMDTKKRIIRLEQGANSY---------MQRSIASHGALAVLYGRYSKHYIKKSE

Query:  VLLGRQTEDVIVDIDLGREGSRNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESN
        + LGR T+D  +D+DL  EG   KISR+Q +IK+  +G F + N G+  I I+ + V  G   RL++  ++EI  + F+F  N
Subjt:  VLLGRQTEDVIVDIDLGREGSRNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESN

Q96EZ8 Microspherule protein 14.6e-0852.54Show/hide
Query:  WTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSDDASMSM
        W P DD+LL NAV     L S+  G V+FS R+T+RE+QERW++LLYDP++S  A  +M
Subjt:  WTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSDDASMSM

Q99L90 Microspherule protein 11.2e-1334.43Show/hide
Query:  DQEELSIDSEDDVPHFSDIEAMILDMDL-DPEDQDLYSSEEVLKYQHMDTKKRIIRLEQGANSY--MQRSIASHGA-------LAVLYGRYSKHYIKKSE
        DQ    +   D V +FSD E +I D  L D  D+ L   E  L       K+ I +LEQ  + +  +  SI   G+       LAVL GR  ++ ++  E
Subjt:  DQEELSIDSEDDVPHFSDIEAMILDMDL-DPEDQDLYSSEEVLKYQHMDTKKRIIRLEQGANSY--MQRSIASHGA-------LAVLYGRYSKHYIKKSE

Query:  VLLGRQTEDVIVDIDLGREGSRNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESN
        + LGR T+D  +D+DL  EG   KISR+Q +IK+  +G F + N G+  I I+ + V  G   RL++  ++EI  + F+F  N
Subjt:  VLLGRQTEDVIVDIDLGREGSRNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESN

Q99L90 Microspherule protein 14.6e-0852.54Show/hide
Query:  WTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSDDASMSM
        W P DD+LL NAV     L S+  G V+FS R+T+RE+QERW++LLYDP++S  A  +M
Subjt:  WTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSDDASMSM

Arabidopsis top hitse value%identityAlignment
AT1G60700.1 SMAD/FHA domain-containing protein6.9e-2841.33Show/hide
Query:  STSDQEELSIDSEDDVPHFSDIEAMILDMDLDPEDQD-LYSSEEVLKYQHMDTKKRIIRLEQGANSYMQRSIASHGALAVLYGRYSKHYIKKSEVLLGRQ
        ST  QEE  +D E+++    DI+AMI  ++L P+D D  ++ EE    +H   +  +I LEQ   + MQR+I  HGA+AVL+   SKH+++K EV++GR 
Subjt:  STSDQEELSIDSEDDVPHFSDIEAMILDMDLDPEDQD-LYSSEEVLKYQHMDTKKRIIRLEQGANSYMQRSIASHGALAVLYGRYSKHYIKKSEVLLGRQ

Query:  TEDVIVDIDLGREGSRNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPVRMKQYVDNMNKTSHKQE
        +  + VDIDLG+    +KISRRQA++K++  G FSLKNLGK  I +N   +  G  + L S   I IRG+ F+F+ N   + Q++   N T  K E
Subjt:  TEDVIVDIDLGREGSRNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPVRMKQYVDNMNKTSHKQE

AT1G75530.1 Forkhead-associated (FHA) domain-containing protein8.7e-4736.89Show/hide
Query:  PHVSSLSIGLHGLPERGEKHLVGGAAVNLKLSHSNSIHVPSVNNTSSRNVNNDAILPV---MLKEENNEISRVNHLGQNFLNTHVEKPGFESDNVRRYPP
        PH   ++  ++ +  R    +     +NL+L +  + +  SVN +S RN     + PV    L+ ++ +I   +  G   + T        SD+      
Subjt:  PHVSSLSIGLHGLPERGEKHLVGGAAVNLKLSHSNSIHVPSVNNTSSRNVNNDAILPV---MLKEENNEISRVNHLGQNFLNTHVEKPGFESDNVRRYPP

Query:  SAACGIKQEPDTLTTVKDHRLSQEGGVRGVFGVDQDGISSTSDQEELSIDSEDDVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQHMDTKKRIIRLEQ
        S          TL    ++ + ++         + D   +  ++  + I+S++++P FSD+EAMILDMDL+P  QD Y   +  KY++ +  ++I+RLEQ
Subjt:  SAACGIKQEPDTLTTVKDHRLSQEGGVRGVFGVDQDGISSTSDQEELSIDSEDDVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQHMDTKKRIIRLEQ

Query:  GANSYMQRSIASHGALAVLYGRYSKHYIKKSEVLLGRQTEDVIVDIDLGREGSRNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSG
         A SYM R IA+HGA A+LYG  SKHYI K EVLLGR T +  VDIDLGR GS  + SRRQA+IK+ QDG F +KNLGK SI +N++++  G  + L + 
Subjt:  GANSYMQRSIASHGALAVLYGRYSKHYIKKSEVLLGRQTEDVIVDIDLGREGSRNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSG

Query:  CLIEIRGMPFIFESNPVRMKQYVDNMNK
        CLI+IR   FIFE N   +K+Y+D ++K
Subjt:  CLIEIRGMPFIFESNPVRMKQYVDNMNK

AT1G75530.1 Forkhead-associated (FHA) domain-containing protein3.3e-1441.35Show/hide
Query:  WTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSDDASMSMIDFERSSSILPSKFNKFGNPKETKCIGGKRKSGTVRRCYYA
        W PEDD LL+ ++E G SLE+LAKGAV+FSR++T+ EL ERWH LLY+P V+  +S    + +  +  +P                    S  VR  YY 
Subjt:  WTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSDDASMSMIDFERSSSILPSKFNKFGNPKETKCIGGKRKSGTVRRCYYA

Query:  LRKR
         RKR
Subjt:  LRKR

AT3G54350.1 Forkhead-associated (FHA) domain-containing protein5.4e-10535.12Show/hide
Query:  MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSDDASMSMIDFERSSSILPSKFNKFGNPKETKCIGGKRKS
        MGALA V PW PEDD+LLKNAVEAGASLESLAKGAVQFSRR+++RELQ+RWH+LLYDP+VS +A+  M + ER++   P+KF + G  KE K    KR +
Subjt:  MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSDDASMSMIDFERSSSILPSKFNKFGNPKETKCIGGKRKS

Query:  GTVRRCYYALRKRICNEPFNPMDLSFLV--GDSNY-DVGEPMSGNGIPPISDDFGLQSSELGVLPCNFAQNMMNNDDAEHTFHSGCQQTVEKHFPQNLDN
          +R  Y++LRK+   EPFN +DL FLV   DS++ D G+          +   GL+ S + ++         +N   E     GC  T     P+  DN
Subjt:  GTVRRCYYALRKRICNEPFNPMDLSFLV--GDSNY-DVGEPMSGNGIPPISDDFGLQSSELGVLPCNFAQNMMNNDDAEHTFHSGCQQTVEKHFPQNLDN

Query:  GHEGISHIMGENLPLSANESHVKELAPSASFPVHSLFENDLEVRPTFGQLSNDQRVMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNASAPA
            I ++ GENL  + +          A   V  +   D E +     L N      +     D F + +S S                 ++     P 
Subjt:  GHEGISHIMGENLPLSANESHVKELAPSASFPVHSLFENDLEVRPTFGQLSNDQRVMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNASAPA

Query:  LPIDVGFEDKDIPTADSFELPDDDGNSNIQNARLAGYDAHSNLKLKIEVQDVHLKSPNATAEVYLAELSNSLLNLSNEDELLFMDVDGKDAIDKSYYDEA
        + +D    DK                          YD  S+L +           PN T +     ++ +    +  D+ + +++DG  A+D       
Subjt:  LPIDVGFEDKDIPTADSFELPDDDGNSNIQNARLAGYDAHSNLKLKIEVQDVHLKSPNATAEVYLAELSNSLLNLSNEDELLFMDVDGKDAIDKSYYDEA

Query:  HPSPSASLNSQCLGKGDEPLFCTLNTEDPEIPSNDDVFLP----PLSTTATMGYHFHDCMDPTFSSIKDFSCKEKSGEMTQNLVQRERKNHGQPHVSSLS
        HP     +           + C LN EDP+IP NDD+FL     P+S ++    +F D   P  + ++D S  ++  E     +Q ++K  G+   S+  
Subjt:  HPSPSASLNSQCLGKGDEPLFCTLNTEDPEIPSNDDVFLP----PLSTTATMGYHFHDCMDPTFSSIKDFSCKEKSGEMTQNLVQRERKNHGQPHVSSLS

Query:  IGLHGLPERGEKHLVGGAAVNLKLSHSNSIHVPSVNNTSSRNVNNDAILPVMLKEENNEISRVNHL--GQNFLNTHVEKPGFESDNVRRYPPSAACGIKQ
            G P +G K     A+ + +L ++    V    ++S++  +N  +      ++  + +    L  G +    H EK           PP     + +
Subjt:  IGLHGLPERGEKHLVGGAAVNLKLSHSNSIHVPSVNNTSSRNVNNDAILPVMLKEENNEISRVNHL--GQNFLNTHVEKPGFESDNVRRYPPSAACGIKQ

Query:  EPDTLTTVKDHRLSQEGGVR-GVFGVDQDGISSTSDQEELSIDSEDDVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQHMDTKKRIIRLEQGANSYMQ
         P    T        +G +   V  ++     + ++ E    +S++D+P++SDIEAMILDMDL+P+DQD +   EV KYQ  D K+ IIRLEQ A+SYMQ
Subjt:  EPDTLTTVKDHRLSQEGGVR-GVFGVDQDGISSTSDQEELSIDSEDDVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQHMDTKKRIIRLEQGANSYMQ

Query:  RSIASHGALAVLYGRYSKHYIKKSEVLLGRQTEDVIVDIDLGREGSRNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRG
        R+IAS GA AVLYGRYSKHYIKK EVL+GR TED+ VDIDLGRE   +KISRRQAII++  DG F +KNLGK SIS+N K+V PG  L L S CL+EIRG
Subjt:  RSIASHGALAVLYGRYSKHYIKKSEVLLGRQTEDVIVDIDLGREGSRNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRG

Query:  MPFIFESNPVRMKQYVDNMNKTS
        MPFIFE+N   M++Y+    K +
Subjt:  MPFIFESNPVRMKQYVDNMNKTS

AT3G54350.2 Forkhead-associated (FHA) domain-containing protein5.4e-10535.12Show/hide
Query:  MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSDDASMSMIDFERSSSILPSKFNKFGNPKETKCIGGKRKS
        MGALA V PW PEDD+LLKNAVEAGASLESLAKGAVQFSRR+++RELQ+RWH+LLYDP+VS +A+  M + ER++   P+KF + G  KE K    KR +
Subjt:  MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSDDASMSMIDFERSSSILPSKFNKFGNPKETKCIGGKRKS

Query:  GTVRRCYYALRKRICNEPFNPMDLSFLV--GDSNY-DVGEPMSGNGIPPISDDFGLQSSELGVLPCNFAQNMMNNDDAEHTFHSGCQQTVEKHFPQNLDN
          +R  Y++LRK+   EPFN +DL FLV   DS++ D G+          +   GL+ S + ++         +N   E     GC  T     P+  DN
Subjt:  GTVRRCYYALRKRICNEPFNPMDLSFLV--GDSNY-DVGEPMSGNGIPPISDDFGLQSSELGVLPCNFAQNMMNNDDAEHTFHSGCQQTVEKHFPQNLDN

Query:  GHEGISHIMGENLPLSANESHVKELAPSASFPVHSLFENDLEVRPTFGQLSNDQRVMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNASAPA
            I ++ GENL  + +          A   V  +   D E +     L N      +     D F + +S S                 ++     P 
Subjt:  GHEGISHIMGENLPLSANESHVKELAPSASFPVHSLFENDLEVRPTFGQLSNDQRVMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNASAPA

Query:  LPIDVGFEDKDIPTADSFELPDDDGNSNIQNARLAGYDAHSNLKLKIEVQDVHLKSPNATAEVYLAELSNSLLNLSNEDELLFMDVDGKDAIDKSYYDEA
        + +D    DK                          YD  S+L +           PN T +     ++ +    +  D+ + +++DG  A+D       
Subjt:  LPIDVGFEDKDIPTADSFELPDDDGNSNIQNARLAGYDAHSNLKLKIEVQDVHLKSPNATAEVYLAELSNSLLNLSNEDELLFMDVDGKDAIDKSYYDEA

Query:  HPSPSASLNSQCLGKGDEPLFCTLNTEDPEIPSNDDVFLP----PLSTTATMGYHFHDCMDPTFSSIKDFSCKEKSGEMTQNLVQRERKNHGQPHVSSLS
        HP     +           + C LN EDP+IP NDD+FL     P+S ++    +F D   P  + ++D S  ++  E     +Q ++K  G+   S+  
Subjt:  HPSPSASLNSQCLGKGDEPLFCTLNTEDPEIPSNDDVFLP----PLSTTATMGYHFHDCMDPTFSSIKDFSCKEKSGEMTQNLVQRERKNHGQPHVSSLS

Query:  IGLHGLPERGEKHLVGGAAVNLKLSHSNSIHVPSVNNTSSRNVNNDAILPVMLKEENNEISRVNHL--GQNFLNTHVEKPGFESDNVRRYPPSAACGIKQ
            G P +G K     A+ + +L ++    V    ++S++  +N  +      ++  + +    L  G +    H EK           PP     + +
Subjt:  IGLHGLPERGEKHLVGGAAVNLKLSHSNSIHVPSVNNTSSRNVNNDAILPVMLKEENNEISRVNHL--GQNFLNTHVEKPGFESDNVRRYPPSAACGIKQ

Query:  EPDTLTTVKDHRLSQEGGVR-GVFGVDQDGISSTSDQEELSIDSEDDVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQHMDTKKRIIRLEQGANSYMQ
         P    T        +G +   V  ++     + ++ E    +S++D+P++SDIEAMILDMDL+P+DQD +   EV KYQ  D K+ IIRLEQ A+SYMQ
Subjt:  EPDTLTTVKDHRLSQEGGVR-GVFGVDQDGISSTSDQEELSIDSEDDVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQHMDTKKRIIRLEQGANSYMQ

Query:  RSIASHGALAVLYGRYSKHYIKKSEVLLGRQTEDVIVDIDLGREGSRNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRG
        R+IAS GA AVLYGRYSKHYIKK EVL+GR TED+ VDIDLGRE   +KISRRQAII++  DG F +KNLGK SIS+N K+V PG  L L S CL+EIRG
Subjt:  RSIASHGALAVLYGRYSKHYIKKSEVLLGRQTEDVIVDIDLGREGSRNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRG

Query:  MPFIFESNPVRMKQYVDNMNKTS
        MPFIFE+N   M++Y+    K +
Subjt:  MPFIFESNPVRMKQYVDNMNKTS

AT3G54350.3 Forkhead-associated (FHA) domain-containing protein5.4e-10535.12Show/hide
Query:  MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSDDASMSMIDFERSSSILPSKFNKFGNPKETKCIGGKRKS
        MGALA V PW PEDD+LLKNAVEAGASLESLAKGAVQFSRR+++RELQ+RWH+LLYDP+VS +A+  M + ER++   P+KF + G  KE K    KR +
Subjt:  MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSDDASMSMIDFERSSSILPSKFNKFGNPKETKCIGGKRKS

Query:  GTVRRCYYALRKRICNEPFNPMDLSFLV--GDSNY-DVGEPMSGNGIPPISDDFGLQSSELGVLPCNFAQNMMNNDDAEHTFHSGCQQTVEKHFPQNLDN
          +R  Y++LRK+   EPFN +DL FLV   DS++ D G+          +   GL+ S + ++         +N   E     GC  T     P+  DN
Subjt:  GTVRRCYYALRKRICNEPFNPMDLSFLV--GDSNY-DVGEPMSGNGIPPISDDFGLQSSELGVLPCNFAQNMMNNDDAEHTFHSGCQQTVEKHFPQNLDN

Query:  GHEGISHIMGENLPLSANESHVKELAPSASFPVHSLFENDLEVRPTFGQLSNDQRVMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNASAPA
            I ++ GENL  + +          A   V  +   D E +     L N      +     D F + +S S                 ++     P 
Subjt:  GHEGISHIMGENLPLSANESHVKELAPSASFPVHSLFENDLEVRPTFGQLSNDQRVMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNASAPA

Query:  LPIDVGFEDKDIPTADSFELPDDDGNSNIQNARLAGYDAHSNLKLKIEVQDVHLKSPNATAEVYLAELSNSLLNLSNEDELLFMDVDGKDAIDKSYYDEA
        + +D    DK                          YD  S+L +           PN T +     ++ +    +  D+ + +++DG  A+D       
Subjt:  LPIDVGFEDKDIPTADSFELPDDDGNSNIQNARLAGYDAHSNLKLKIEVQDVHLKSPNATAEVYLAELSNSLLNLSNEDELLFMDVDGKDAIDKSYYDEA

Query:  HPSPSASLNSQCLGKGDEPLFCTLNTEDPEIPSNDDVFLP----PLSTTATMGYHFHDCMDPTFSSIKDFSCKEKSGEMTQNLVQRERKNHGQPHVSSLS
        HP     +           + C LN EDP+IP NDD+FL     P+S ++    +F D   P  + ++D S  ++  E     +Q ++K  G+   S+  
Subjt:  HPSPSASLNSQCLGKGDEPLFCTLNTEDPEIPSNDDVFLP----PLSTTATMGYHFHDCMDPTFSSIKDFSCKEKSGEMTQNLVQRERKNHGQPHVSSLS

Query:  IGLHGLPERGEKHLVGGAAVNLKLSHSNSIHVPSVNNTSSRNVNNDAILPVMLKEENNEISRVNHL--GQNFLNTHVEKPGFESDNVRRYPPSAACGIKQ
            G P +G K     A+ + +L ++    V    ++S++  +N  +      ++  + +    L  G +    H EK           PP     + +
Subjt:  IGLHGLPERGEKHLVGGAAVNLKLSHSNSIHVPSVNNTSSRNVNNDAILPVMLKEENNEISRVNHL--GQNFLNTHVEKPGFESDNVRRYPPSAACGIKQ

Query:  EPDTLTTVKDHRLSQEGGVR-GVFGVDQDGISSTSDQEELSIDSEDDVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQHMDTKKRIIRLEQGANSYMQ
         P    T        +G +   V  ++     + ++ E    +S++D+P++SDIEAMILDMDL+P+DQD +   EV KYQ  D K+ IIRLEQ A+SYMQ
Subjt:  EPDTLTTVKDHRLSQEGGVR-GVFGVDQDGISSTSDQEELSIDSEDDVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQHMDTKKRIIRLEQGANSYMQ

Query:  RSIASHGALAVLYGRYSKHYIKKSEVLLGRQTEDVIVDIDLGREGSRNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRG
        R+IAS GA AVLYGRYSKHYIKK EVL+GR TED+ VDIDLGRE   +KISRRQAII++  DG F +KNLGK SIS+N K+V PG  L L S CL+EIRG
Subjt:  RSIASHGALAVLYGRYSKHYIKKSEVLLGRQTEDVIVDIDLGREGSRNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRG

Query:  MPFIFESNPVRMKQYVDNMNKTS
        MPFIFE+N   M++Y+    K +
Subjt:  MPFIFESNPVRMKQYVDNMNKTS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGCAGGAGATCGGACGACGGGCAAACAAAGTTAAGAAACCTCAAGAATCCAGCCGGTGACCGGTCACAGAAAGGAAGCAAGGTCGCACGTGTCGTGCTTCCATCGGT
TTCTCTAAAACCCCCCGACGCCATATCCTTCGTCTTCCTCCCCTCAACCGTTCTCCCGCTTCCTTCGCTGCCACGGTTCGTTCCAACATTACACTCTTTCCAAGAGGGAC
TGGTAAGAGTTTTTTGCTTTACGGAGATGGGAGCTCTTGCCCCTGTCGCGCCTTGGACTCCCGAAGATGATATTCTGCTCAAGAACGCTGTTGAGGCCGGTGCTTCATTG
GAGTCCCTTGCCAAAGGTGCTGTGCAGTTTTCTCGAAGATACACAGTAAGAGAATTGCAAGAACGATGGCATTCTTTACTTTATGATCCAATTGTATCTGATGATGCATC
TATGTCCATGATTGACTTTGAGCGTTCTTCTTCTATTCTTCCATCAAAGTTCAACAAGTTTGGGAATCCAAAAGAAACTAAATGTATTGGTGGGAAGAGGAAATCTGGGA
CTGTACGCCGTTGCTATTATGCTTTGCGTAAAAGAATTTGCAATGAACCATTTAACCCTATGGACCTGAGTTTTCTTGTCGGCGATAGTAACTATGACGTTGGAGAGCCC
ATGTCAGGAAATGGTATCCCTCCAATATCAGATGATTTTGGACTTCAGAGCTCAGAGCTGGGGGTCTTGCCCTGTAATTTTGCCCAAAATATGATGAATAATGATGATGC
TGAGCACACTTTTCATTCTGGATGTCAACAAACAGTTGAAAAGCATTTTCCTCAGAACCTGGATAATGGACACGAGGGAATTTCTCACATCATGGGAGAGAATCTGCCCC
TCTCTGCAAATGAATCTCATGTTAAGGAATTGGCTCCGTCAGCTAGCTTTCCAGTCCATAGTCTTTTTGAAAATGATTTGGAGGTGAGACCTACTTTTGGGCAACTGAGC
AATGATCAGAGAGTGATGGGCTCTGAACCAGAAGACAATGATGTCTTTAATTCTCCTGTTTCTGATTCTGGTGCATCATTTCACAATGTTGAGTACTCATCTCCTCTTCC
TGGTATGCCAATATGGAGAAATGCCTCGGCGCCAGCCTTGCCGATTGATGTTGGCTTTGAAGATAAGGATATACCTACAGCAGACTCTTTTGAACTCCCTGATGATGATG
GGAACAGCAACATTCAAAATGCAAGATTAGCAGGCTATGATGCTCACTCCAACTTAAAGTTGAAGATTGAAGTACAGGATGTCCATTTGAAAAGTCCAAATGCCACTGCT
GAAGTTTATCTAGCAGAATTGTCCAATTCTCTTTTGAACTTGAGCAATGAGGATGAGCTACTTTTCATGGATGTTGATGGAAAAGATGCGATTGATAAGTCATACTATGA
TGAAGCTCATCCATCGCCATCTGCATCTTTGAACAGTCAATGTCTTGGAAAAGGTGACGAACCTCTTTTCTGCACCTTGAACACAGAAGACCCGGAAATCCCAAGCAATG
ATGATGTTTTTCTACCTCCATTGTCAACAACCGCTACTATGGGATACCATTTTCATGATTGCATGGATCCTACCTTTTCATCTATTAAGGATTTCTCTTGTAAAGAAAAA
TCTGGTGAAATGACTCAAAACCTTGTGCAAAGGGAGAGGAAAAATCATGGACAACCTCATGTTTCCTCTCTATCAATAGGATTGCATGGTTTACCTGAAAGAGGTGAAAA
ACATCTGGTTGGTGGAGCTGCTGTTAATTTAAAATTATCCCACAGCAACTCCATACACGTGCCATCTGTGAATAATACTAGTTCCAGAAATGTAAATAACGATGCTATCC
TACCTGTCATGCTCAAGGAAGAGAACAATGAAATTTCCCGGGTAAATCATCTTGGTCAGAATTTTTTAAATACTCATGTAGAGAAGCCAGGCTTTGAATCTGACAATGTT
AGAAGATATCCACCAAGTGCTGCTTGTGGCATTAAACAGGAACCAGATACATTGACCACAGTGAAAGATCATCGATTATCACAGGAAGGCGGTGTTCGAGGTGTTTTTGG
TGTAGATCAAGATGGAATATCTTCGACATCTGATCAAGAAGAGTTATCTATTGATAGTGAAGATGATGTACCTCATTTTTCAGATATTGAGGCAATGATACTTGATATGG
ACTTGGATCCAGAAGATCAGGATCTCTATTCAAGTGAAGAAGTCTTAAAATATCAACATATGGACACCAAGAAGAGAATCATAAGACTGGAGCAAGGGGCTAATTCTTAC
ATGCAAAGATCTATTGCCTCCCATGGGGCATTAGCAGTTCTGTATGGCCGATATTCGAAGCATTACATTAAGAAATCAGAGGTTCTATTAGGTAGACAAACTGAGGATGT
CATTGTGGACATTGACTTAGGAAGGGAGGGAAGCCGTAACAAAATATCTCGACGGCAGGCAATTATAAAAATAGATCAAGATGGATTTTTCTCCCTGAAGAATCTTGGCA
AGTGTTCAATCTCCATAAACAACAAGGATGTGGCCCCTGGTCACTGCCTGCGACTTAATTCTGGTTGCTTGATTGAGATAAGGGGAATGCCATTTATATTTGAGTCAAAC
CCAGTTCGTATGAAGCAGTATGTGGATAACATGAACAAGACATCTCACAAACAGGAGTATCAATCGTATGCTGTGTTAAGTTGTTTGATGGGAAACAATAGCAACACATT
TGTGGAGATTATTTCGAGCAGTAATATCAGCGAGGAGCTGACCACAACCAAATTGCAAAGATACATCCAAGTGGGTTTCGGCGGTCCCACAGGTTTCACCATGATGTTCC
CATCGTTGCACTTTGGAGCCCACCAACTAAAGTTGCTATCTTTTTGGGATATAGCTTCAGGTCAAAATGAAGGTAATGGTCCCGAAATCGAATCTGGGCTCCTTGCTCTG
CCCACCCCTAGAATCCAGCTTGGCCAATTCCATACCTGTCTAGAAAGCGCAACAACTTGGCTGAAGTCGAAATGGATAGCTTCAAAGGAGGTTCCTTCTGCTACGCTATC
ATTAACTCTTTACATTATTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCGCAGGAGATCGGACGACGGGCAAACAAAGTTAAGAAACCTCAAGAATCCAGCCGGTGACCGGTCACAGAAAGGAAGCAAGGTCGCACGTGTCGTGCTTCCATCGGT
TTCTCTAAAACCCCCCGACGCCATATCCTTCGTCTTCCTCCCCTCAACCGTTCTCCCGCTTCCTTCGCTGCCACGGTTCGTTCCAACATTACACTCTTTCCAAGAGGGAC
TGGTAAGAGTTTTTTGCTTTACGGAGATGGGAGCTCTTGCCCCTGTCGCGCCTTGGACTCCCGAAGATGATATTCTGCTCAAGAACGCTGTTGAGGCCGGTGCTTCATTG
GAGTCCCTTGCCAAAGGTGCTGTGCAGTTTTCTCGAAGATACACAGTAAGAGAATTGCAAGAACGATGGCATTCTTTACTTTATGATCCAATTGTATCTGATGATGCATC
TATGTCCATGATTGACTTTGAGCGTTCTTCTTCTATTCTTCCATCAAAGTTCAACAAGTTTGGGAATCCAAAAGAAACTAAATGTATTGGTGGGAAGAGGAAATCTGGGA
CTGTACGCCGTTGCTATTATGCTTTGCGTAAAAGAATTTGCAATGAACCATTTAACCCTATGGACCTGAGTTTTCTTGTCGGCGATAGTAACTATGACGTTGGAGAGCCC
ATGTCAGGAAATGGTATCCCTCCAATATCAGATGATTTTGGACTTCAGAGCTCAGAGCTGGGGGTCTTGCCCTGTAATTTTGCCCAAAATATGATGAATAATGATGATGC
TGAGCACACTTTTCATTCTGGATGTCAACAAACAGTTGAAAAGCATTTTCCTCAGAACCTGGATAATGGACACGAGGGAATTTCTCACATCATGGGAGAGAATCTGCCCC
TCTCTGCAAATGAATCTCATGTTAAGGAATTGGCTCCGTCAGCTAGCTTTCCAGTCCATAGTCTTTTTGAAAATGATTTGGAGGTGAGACCTACTTTTGGGCAACTGAGC
AATGATCAGAGAGTGATGGGCTCTGAACCAGAAGACAATGATGTCTTTAATTCTCCTGTTTCTGATTCTGGTGCATCATTTCACAATGTTGAGTACTCATCTCCTCTTCC
TGGTATGCCAATATGGAGAAATGCCTCGGCGCCAGCCTTGCCGATTGATGTTGGCTTTGAAGATAAGGATATACCTACAGCAGACTCTTTTGAACTCCCTGATGATGATG
GGAACAGCAACATTCAAAATGCAAGATTAGCAGGCTATGATGCTCACTCCAACTTAAAGTTGAAGATTGAAGTACAGGATGTCCATTTGAAAAGTCCAAATGCCACTGCT
GAAGTTTATCTAGCAGAATTGTCCAATTCTCTTTTGAACTTGAGCAATGAGGATGAGCTACTTTTCATGGATGTTGATGGAAAAGATGCGATTGATAAGTCATACTATGA
TGAAGCTCATCCATCGCCATCTGCATCTTTGAACAGTCAATGTCTTGGAAAAGGTGACGAACCTCTTTTCTGCACCTTGAACACAGAAGACCCGGAAATCCCAAGCAATG
ATGATGTTTTTCTACCTCCATTGTCAACAACCGCTACTATGGGATACCATTTTCATGATTGCATGGATCCTACCTTTTCATCTATTAAGGATTTCTCTTGTAAAGAAAAA
TCTGGTGAAATGACTCAAAACCTTGTGCAAAGGGAGAGGAAAAATCATGGACAACCTCATGTTTCCTCTCTATCAATAGGATTGCATGGTTTACCTGAAAGAGGTGAAAA
ACATCTGGTTGGTGGAGCTGCTGTTAATTTAAAATTATCCCACAGCAACTCCATACACGTGCCATCTGTGAATAATACTAGTTCCAGAAATGTAAATAACGATGCTATCC
TACCTGTCATGCTCAAGGAAGAGAACAATGAAATTTCCCGGGTAAATCATCTTGGTCAGAATTTTTTAAATACTCATGTAGAGAAGCCAGGCTTTGAATCTGACAATGTT
AGAAGATATCCACCAAGTGCTGCTTGTGGCATTAAACAGGAACCAGATACATTGACCACAGTGAAAGATCATCGATTATCACAGGAAGGCGGTGTTCGAGGTGTTTTTGG
TGTAGATCAAGATGGAATATCTTCGACATCTGATCAAGAAGAGTTATCTATTGATAGTGAAGATGATGTACCTCATTTTTCAGATATTGAGGCAATGATACTTGATATGG
ACTTGGATCCAGAAGATCAGGATCTCTATTCAAGTGAAGAAGTCTTAAAATATCAACATATGGACACCAAGAAGAGAATCATAAGACTGGAGCAAGGGGCTAATTCTTAC
ATGCAAAGATCTATTGCCTCCCATGGGGCATTAGCAGTTCTGTATGGCCGATATTCGAAGCATTACATTAAGAAATCAGAGGTTCTATTAGGTAGACAAACTGAGGATGT
CATTGTGGACATTGACTTAGGAAGGGAGGGAAGCCGTAACAAAATATCTCGACGGCAGGCAATTATAAAAATAGATCAAGATGGATTTTTCTCCCTGAAGAATCTTGGCA
AGTGTTCAATCTCCATAAACAACAAGGATGTGGCCCCTGGTCACTGCCTGCGACTTAATTCTGGTTGCTTGATTGAGATAAGGGGAATGCCATTTATATTTGAGTCAAAC
CCAGTTCGTATGAAGCAGTATGTGGATAACATGAACAAGACATCTCACAAACAGGAGTATCAATCGTATGCTGTGTTAAGTTGTTTGATGGGAAACAATAGCAACACATT
TGTGGAGATTATTTCGAGCAGTAATATCAGCGAGGAGCTGACCACAACCAAATTGCAAAGATACATCCAAGTGGGTTTCGGCGGTCCCACAGGTTTCACCATGATGTTCC
CATCGTTGCACTTTGGAGCCCACCAACTAAAGTTGCTATCTTTTTGGGATATAGCTTCAGGTCAAAATGAAGGTAATGGTCCCGAAATCGAATCTGGGCTCCTTGCTCTG
CCCACCCCTAGAATCCAGCTTGGCCAATTCCATACCTGTCTAGAAAGCGCAACAACTTGGCTGAAGTCGAAATGGATAGCTTCAAAGGAGGTTCCTTCTGCTACGCTATC
ATTAACTCTTTACATTATTTAA
Protein sequenceShow/hide protein sequence
MRRRSDDGQTKLRNLKNPAGDRSQKGSKVARVVLPSVSLKPPDAISFVFLPSTVLPLPSLPRFVPTLHSFQEGLVRVFCFTEMGALAPVAPWTPEDDILLKNAVEAGASL
ESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSDDASMSMIDFERSSSILPSKFNKFGNPKETKCIGGKRKSGTVRRCYYALRKRICNEPFNPMDLSFLVGDSNYDVGEP
MSGNGIPPISDDFGLQSSELGVLPCNFAQNMMNNDDAEHTFHSGCQQTVEKHFPQNLDNGHEGISHIMGENLPLSANESHVKELAPSASFPVHSLFENDLEVRPTFGQLS
NDQRVMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNASAPALPIDVGFEDKDIPTADSFELPDDDGNSNIQNARLAGYDAHSNLKLKIEVQDVHLKSPNATA
EVYLAELSNSLLNLSNEDELLFMDVDGKDAIDKSYYDEAHPSPSASLNSQCLGKGDEPLFCTLNTEDPEIPSNDDVFLPPLSTTATMGYHFHDCMDPTFSSIKDFSCKEK
SGEMTQNLVQRERKNHGQPHVSSLSIGLHGLPERGEKHLVGGAAVNLKLSHSNSIHVPSVNNTSSRNVNNDAILPVMLKEENNEISRVNHLGQNFLNTHVEKPGFESDNV
RRYPPSAACGIKQEPDTLTTVKDHRLSQEGGVRGVFGVDQDGISSTSDQEELSIDSEDDVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQHMDTKKRIIRLEQGANSY
MQRSIASHGALAVLYGRYSKHYIKKSEVLLGRQTEDVIVDIDLGREGSRNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESN
PVRMKQYVDNMNKTSHKQEYQSYAVLSCLMGNNSNTFVEIISSSNISEELTTTKLQRYIQVGFGGPTGFTMMFPSLHFGAHQLKLLSFWDIASGQNEGNGPEIESGLLAL
PTPRIQLGQFHTCLESATTWLKSKWIASKEVPSATLSLTLYII