; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0021288 (gene) of Snake gourd v1 genome

Gene IDTan0021288
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionFHA domain-containing protein
Genome locationLG10:63635870..63645708
RNA-Seq ExpressionTan0021288
SyntenyTan0021288
Gene Ontology termsGO:0031011 - Ino80 complex (cellular component)
GO:0071339 - MLL1 complex (cellular component)
GO:0002151 - G-quadruplex RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR000253 - Forkhead-associated (FHA) domain
IPR008984 - SMAD/FHA domain superfamily
IPR025999 - Microspherule protein, N-terminal domain
IPR037912 - Microspherule protein 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7037887.1 Microspherule protein 1 [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0083.65Show/hide
Query:  MGALAPVAPWNPEDDILLKNSVEAGASLEALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDVERSSSILPSKFNKFGNPKETKCIGGKRKS
        MGALAP+APW+PEDDILLKN+VEAGASLEALAKGAVQFSRR+TVRELQERWHSLLYDPIVSEDASMSMIDVERSSSILPSKFNKFGN KE KCIGGKRKS
Subjt:  MGALAPVAPWNPEDDILLKNSVEAGASLEALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDVERSSSILPSKFNKFGNPKETKCIGGKRKS

Query:  GSVRRCYYALRKRICNEPFNPIDLSYLVGDSDYVGEEPMSGNCIPPISDDFGLQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNGHE
        GSVR CYYALRKRICNEPFNP+  SYLVGDSDYV EEPMSGNCIPP SDDFGLQSSELGI+PCNFSQN MNND TEHTFHSGCQHTVE HFPQNLDNGHE
Subjt:  GSVRRCYYALRKRICNEPFNPIDLSYLVGDSDYVGEEPMSGNCIPPISDDFGLQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNGHE

Query:  GISRIMVDNLPFSANESHAKELAPSTSFPVHSLFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNASPTLPI
        GIS IMVDNLPF ANESHAKELAPS SFPVH+LFENDLE  PSTFGQLS DQR MGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNA   LPI
Subjt:  GISRIMVDNLPFSANESHAKELAPSTSFPVHSLFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNASPTLPI

Query:  DVGFPEKDIPTGDSFELPDDDGNNNIQNARLAGYDGHSNSKLKIEVQHDHLKSPNATAEGCYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYYDGLSL
        DVGF +KD+PTGDSFELPDDDGNNNIQNARLAGY+ HSNSKLKIEVQHDHLKSPNATAEGCYL E SNSLLN SNEDEL FMDV           DGLSL
Subjt:  DVGFPEKDIPTGDSFELPDDDGNNNIQNARLAGYDGHSNSKLKIEVQHDHLKSPNATAEGCYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYYDGLSL

Query:  LLSSPNEVNHDQTANAINAETVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCSSEAHPSSSASLNSQCPGKGDEPLFCTLNTEDPEIPSNDDVFLPPLS
        LLSSPNEVNHDQT NAIN+E VLPTDTMLDPPSACSGELYEKGSHCSDGHLDCSSEAHPS S SLN QCPGK DEPLFCTLNTEDPEIPSNDDVFLPPLS
Subjt:  LLSSPNEVNHDQTANAINAETVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCSSEAHPSSSASLNSQCPGKGDEPLFCTLNTEDPEIPSNDDVFLPPLS

Query:  TTATMGYHFQDCMDPTFSSIKDFSCKEKSGETTQNLVQREKKNH-GQPRVSSLSIGLRGLPERGEKHLVGGAAVNLKLSHSNIIHVPSANKTSSRNVNND
        T ATMGYHF DCM PTFSSI DFSCKE SGE TQNLVQRE+KNH GQPRVS++S+GL  L ERGEKHL+ GA VNLKLSHSN IHVPSANKTSS NVN+D
Subjt:  TTATMGYHFQDCMDPTFSSIKDFSCKEKSGETTQNLVQREKKNH-GQPRVSSLSIGLRGLPERGEKHLVGGAAVNLKLSHSNIIHVPSANKTSSRNVNND

Query:  AILPAALKEENNEISRVNHLGQNFLNTHVEKPDFDSGNVRRYPPSTTCGIKQESDTLTTVKDHRLLQEVGARAVFGVEQDGISSTSDQEDLSIDSEDDLP
         ILP  L EENNEIS                                     E +T T+VKDHRL +EVG + VFGVEQDG+ STSD E+L IDSEDDLP
Subjt:  AILPAALKEENNEISRVNHLGQNFLNTHVEKPDFDSGNVRRYPPSTTCGIKQESDTLTTVKDHRLLQEVGARAVFGVEQDGISSTSDQEDLSIDSEDDLP

Query:  HFSDIEAMILDMDLDPEDQDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSMASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVDIDLGREGSGNK
        HFSDIEAMILDMDLDPEDQDL+ SEEVLKYQH+DT+KRIIRLEQG NA MQRS+ASHGALAVL GRHSKHYIKKSEVLLGRATAEFIVDIDLG EGSGNK
Subjt:  HFSDIEAMILDMDLDPEDQDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSMASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVDIDLGREGSGNK

Query:  ISRRQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNSFRMKQYVDNIGKTSHKQEYPS
        ISRRQAIIKID+DGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGM FIFESN  RMKQYVDN+GKTSHKQEY S
Subjt:  ISRRQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNSFRMKQYVDNIGKTSHKQEYPS

XP_022135664.1 uncharacterized protein LOC111007538 isoform X1 [Momordica charantia]0.0e+0085.17Show/hide
Query:  MGALAPVAPWNPEDDILLKNSVEAGASLEALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDVERSSSILPSKFNKFGNPKETKCIGGKRKS
        MGALAPVAPW PEDDILLKN++EAGASLE+LAKGAVQFSRRYTVRELQERWHSLLYDPIVSE+ASMSMID ERSSSILPSKFNKFGNPKETKCIGGKRK 
Subjt:  MGALAPVAPWNPEDDILLKNSVEAGASLEALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDVERSSSILPSKFNKFGNPKETKCIGGKRKS

Query:  GSVRRCYYALRKRICNEPFNPIDLSYLVG--DSDYVGEEPMSGNCIPPISDDFGLQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNG
        GSVRRCYYALRKRICNEPFNP+DLS+LVG  DS+YV EEPMSG+CIPPIS DFGLQ SELGI+P NF+ N+MNND TE TFHS CQHTVEKHFP NLDN 
Subjt:  GSVRRCYYALRKRICNEPFNPIDLSYLVG--DSDYVGEEPMSGNCIPPISDDFGLQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNG

Query:  HEGISRIMVDNLPFSANESHAKELAPSTSFPVHSLFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNAS-PT
        HEGI  IM +NLP S NES  +ELAPS SFPVHSLFENDLEV+PSTFGQ S DQRAMGSE EDN+VFNSPVSDSGASFHNVEYSSPLPGMPIWRNAS P 
Subjt:  HEGISRIMVDNLPFSANESHAKELAPSTSFPVHSLFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNAS-PT

Query:  LPIDVGFPEKDIPTGDSFELPDDDGNNNIQNARLAGYDGHSNSKLKIEVQHDHLKSPNATAEGCYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYYDG
        LPIDVGF +KD+PTGDSFELPDDDGNNNIQNAR+A YD  S+SKLKIEVQHDHLKSPNATAE  YLAELSNSLLNL+NEDELLFMD DG+DVIDKSYYDG
Subjt:  LPIDVGFPEKDIPTGDSFELPDDDGNNNIQNARLAGYDGHSNSKLKIEVQHDHLKSPNATAEGCYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYYDG

Query:  L-SLLLSSPNEVNHDQTANAINAETVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCSSEAHPSSSASLNSQCPGKGDEPLFCTLNTEDPEIPSNDDVFL
        L SLLL+SPNEVNHDQTA+A+N ET+LPTD+M+DPP+ACSGELYEKGSHCSDGHLDCS E HPS SASLNSQC GKGDEPLFCTLNTEDPEIPSNDDVFL
Subjt:  L-SLLLSSPNEVNHDQTANAINAETVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCSSEAHPSSSASLNSQCPGKGDEPLFCTLNTEDPEIPSNDDVFL

Query:  PPLSTTATMGYHFQDCMDPTFSSIKDFSCKEKSGETTQNLVQREKKNHGQPRVSSLSIGLRGLPERGEKHLVGGAAVNLKLSHSNIIHVPSANK------
        PPLST ++MGYHFQD +D TFSSIKDFSC EKSGE TQNLVQRE+KNHGQP VSSLSIGL GLPERGEKHLVGGAAVNLKL HSN IHVPSAN       
Subjt:  PPLSTTATMGYHFQDCMDPTFSSIKDFSCKEKSGETTQNLVQREKKNHGQPRVSSLSIGLRGLPERGEKHLVGGAAVNLKLSHSNIIHVPSANK------

Query:  TSSRNVNNDAILPAALKEENNEISRVNHLGQNFLNTHVEKPDFDSGNVRRYPPSTTCGIKQESDTLTTVKDHRLLQEVGARAVFGVEQDGISSTSDQEDL
        +SS N N DAILP  LKEE+ EISRVNHLGQNFLNTHVEKP FDS N R+YPPST  GIKQE D LT VKDHRL QE G+R VFGVEQDGISSTSDQE+L
Subjt:  TSSRNVNNDAILPAALKEENNEISRVNHLGQNFLNTHVEKPDFDSGNVRRYPPSTTCGIKQESDTLTTVKDHRLLQEVGARAVFGVEQDGISSTSDQEDL

Query:  SIDSEDDLPHFSDIEAMILDMDLDPEDQDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSMASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVDID
        SIDSEDD+PHFSDIEAMILDMDLDPEDQDL++SEEVL+YQHMDTKKRI+RLEQGA+A M+RSMASHGALAVLYGR+SKHYIKKSEVLLGRAT + IVDID
Subjt:  SIDSEDDLPHFSDIEAMILDMDLDPEDQDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSMASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVDID

Query:  LGREGSGNKISRRQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNSFRMKQYVDNIGKTSHKQEYPS
        LGREGSGNKISRRQAIIKID+DGFFSLKNLGKCSISINNK+VAPGHCLRLNSGCLIEIRGM FIFES+   MKQY+DNIGKTSHKQEY S
Subjt:  LGREGSGNKISRRQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNSFRMKQYVDNIGKTSHKQEYPS

XP_022940026.1 uncharacterized protein LOC111445785 [Cucurbita moschata]0.0e+0083.43Show/hide
Query:  MGALAPVAPWNPEDDILLKNSVEAGASLEALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDVERSSSILPSKFNKFGNPKETKCIGGKRKS
        MGALAP+APW+PEDDILLKN+VEAGASLEALAKGAVQFSRR+TVRELQERWHSLLYDPIVSEDASMSMIDVERSSSILPSKFNKFGN KE KCIGGKRKS
Subjt:  MGALAPVAPWNPEDDILLKNSVEAGASLEALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDVERSSSILPSKFNKFGNPKETKCIGGKRKS

Query:  GSVRRCYYALRKRICNEPFNPIDLSYLVGDSDYVGEEPMSGNCIPPISDDFGLQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNGHE
        GSVR CYYALRKRICNEPFNP+  SYLVGDSDYV EEPMSGNCIPP SDDFGLQSSELGI+PCNFSQN MNND TEHTFHSGCQHTVE HFPQNLDNGHE
Subjt:  GSVRRCYYALRKRICNEPFNPIDLSYLVGDSDYVGEEPMSGNCIPPISDDFGLQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNGHE

Query:  GISRIMVDNLPFSANESHAKELAPSTSFPVHSLFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNASPTLPI
        GIS IMVDNLPF ANESHAKELAPS SFPVH+LFENDLE  PSTFGQLS DQR MGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNA   LPI
Subjt:  GISRIMVDNLPFSANESHAKELAPSTSFPVHSLFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNASPTLPI

Query:  DVGFPEKDIPTGDSFELPDDDGNNNIQNARLAGYDGHSNSKLKIEVQHDHLKSPNATAEGCYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYYDGLSL
        DVGF +KD+PTGDSFELPDDDGNNNIQNARLAGY+ HSNSKLKIEVQHDHLKSPNATAEGCYL E SNSLLN SNEDEL FMDV           DGLSL
Subjt:  DVGFPEKDIPTGDSFELPDDDGNNNIQNARLAGYDGHSNSKLKIEVQHDHLKSPNATAEGCYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYYDGLSL

Query:  LLSSPNEVNHDQTANAINAETVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCSSEAHPSSSASLNSQCPGKGDEPLFCTLNTEDPEIPSNDDVFLPPLS
        LLSSPNEVNHDQT NAIN+E VLPTDTMLDPPSACSGELYEKGSHCSDGHLDCSSEAHPS S SLN QCPGK DEPLFCTLNTEDPEIPSNDDVFLPPLS
Subjt:  LLSSPNEVNHDQTANAINAETVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCSSEAHPSSSASLNSQCPGKGDEPLFCTLNTEDPEIPSNDDVFLPPLS

Query:  TTATMGYHFQDCMDPTFSSIKDFSCKEKSGETTQNLVQREKKNH-GQPRVSSLSIGLRGLPERGEKHLVGGAAVNLKLSHSNIIHVPSANKTSSRNVNND
        T ATMGYHF DCM PTFSSI DFSCKE SGE TQNLVQRE+KNH GQPRVS++S+GL  L ERGEKHL+ GA VNLKLSHSN IHVPSANKTSS NVN+D
Subjt:  TTATMGYHFQDCMDPTFSSIKDFSCKEKSGETTQNLVQREKKNH-GQPRVSSLSIGLRGLPERGEKHLVGGAAVNLKLSHSNIIHVPSANKTSSRNVNND

Query:  AILPAALKEENNEISRVNHLGQNFLNTHVEKPDFDSGNVRRYPPSTTCGIKQESDTLTTVKDHRLLQEVGARAVFGVEQDGISSTSDQEDLSIDSEDDLP
         ILP  L EENNEIS                                     E +T T+VKDHRL +EVG + VFGVEQDG+ STSD E+L IDSEDDLP
Subjt:  AILPAALKEENNEISRVNHLGQNFLNTHVEKPDFDSGNVRRYPPSTTCGIKQESDTLTTVKDHRLLQEVGARAVFGVEQDGISSTSDQEDLSIDSEDDLP

Query:  HFSDIEAMILDMDLDPEDQDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSMASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVDIDLGREGSGNK
        HFSDIEAMILDMDLDPEDQDL+ SEEVLKYQH+DT+KRIIRLEQG NA MQRS+ASHGALAVL G+HSKHYIKKSEVLLGRATAEFIVDIDLG EGSGNK
Subjt:  HFSDIEAMILDMDLDPEDQDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSMASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVDIDLGREGSGNK

Query:  ISRRQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNSFRMKQYVDNIGKTSHKQEYPS
        ISRRQAIIKID+DG FSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGM FIFESN  RMKQYVDN+GKTSHKQEY S
Subjt:  ISRRQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNSFRMKQYVDNIGKTSHKQEYPS

XP_022981262.1 uncharacterized protein LOC111480451 [Cucurbita maxima]0.0e+0083.09Show/hide
Query:  MGALAPVAPWNPEDDILLKNSVEAGASLEALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDVERSSSILPSKFNKFGNPKETKCIGGKRKS
        MGALAP+APW+PEDDILLKN+VEA ASLEALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDVERSSSILPSKFNKFGN KE KCIGGKRKS
Subjt:  MGALAPVAPWNPEDDILLKNSVEAGASLEALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDVERSSSILPSKFNKFGNPKETKCIGGKRKS

Query:  GSVRRCYYALRKRICNEPFNPIDLSYLVGDSDYVGEEPMSGNCIPPISDDFGLQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNGHE
        GSVR CYYALRKRICNEPFNP+  SYLVGDSDYV EEPMSGNCIPP SDDFGLQSSELG +PCNFSQN MNND TEHTFHSGCQHTVE HFPQNLDNGHE
Subjt:  GSVRRCYYALRKRICNEPFNPIDLSYLVGDSDYVGEEPMSGNCIPPISDDFGLQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNGHE

Query:  GISRIMVDNLPFSANESHAKELAPSTSFPVHSLFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNASPTLPI
        GIS +MVDNLPF ANESHAKELAPS SFPVH+LFENDLE  PSTFGQLS DQR MGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWR+AS  LPI
Subjt:  GISRIMVDNLPFSANESHAKELAPSTSFPVHSLFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNASPTLPI

Query:  DVGFPEKDIPTGDSFELPDDDGNNNIQNARLAGYDGHSNSKLKIEVQHDHLKSPNATAEGCYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYYDGLSL
        DVGF +KD+PTGDSFELPDDDGNNNIQNARLAGY+ HSNSKLKIEVQHDHLKSPNATAEGCYL ELSN++LN SNEDEL FMDV           DGLSL
Subjt:  DVGFPEKDIPTGDSFELPDDDGNNNIQNARLAGYDGHSNSKLKIEVQHDHLKSPNATAEGCYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYYDGLSL

Query:  LLSSPNEVNHDQTANAINAETVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCSSEAHPSSSASLNSQCPGKGDEPLFCTLNTEDPEIPSNDDVFLPPLS
        LLSSPNEVNHDQT NAI +ETVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCSSEAHPS S SLNSQCPGK DEPLFCTLNTEDPEIPSNDDVFLPPLS
Subjt:  LLSSPNEVNHDQTANAINAETVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCSSEAHPSSSASLNSQCPGKGDEPLFCTLNTEDPEIPSNDDVFLPPLS

Query:  TTATMGYHFQDCMDPTFSSIKDFSCKEKSGETTQNLVQREKKNH-GQPRVSSLSIGLRGLPERGEKHLVGGAAVNLKLSHSNIIHVPSANKTSSRNVNND
        T ATMGYHF DCM PTFSS  DFSCKE SGE TQNLVQRE+KNH GQP VS++S+GL  LPERGEKHL+ G  VNLKLSHSN IHVPSANKTSS NVN+D
Subjt:  TTATMGYHFQDCMDPTFSSIKDFSCKEKSGETTQNLVQREKKNH-GQPRVSSLSIGLRGLPERGEKHLVGGAAVNLKLSHSNIIHVPSANKTSSRNVNND

Query:  AILPAALKEENNEISRVNHLGQNFLNTHVEKPDFDSGNVRRYPPSTTCGIKQESDTLTTVKDHRLLQEVGARAVFGVEQDGISSTSDQEDLSIDSEDDLP
         ILP  L EENNEIS                                     E +T T+VKDHRL +EVG + VFGVEQDG+ STSD E+L IDSEDDLP
Subjt:  AILPAALKEENNEISRVNHLGQNFLNTHVEKPDFDSGNVRRYPPSTTCGIKQESDTLTTVKDHRLLQEVGARAVFGVEQDGISSTSDQEDLSIDSEDDLP

Query:  HFSDIEAMILDMDLDPEDQDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSMASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVDIDLGREGSGNK
        HFSDIEAMILDMDLDPEDQDL+ +EEVLKYQH+DT+KRIIRLEQG NA MQRS+ASHGALAVL GRHSKHYIKKSEVLLGRATAEFIVDIDLG EGSGNK
Subjt:  HFSDIEAMILDMDLDPEDQDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSMASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVDIDLGREGSGNK

Query:  ISRRQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNSFRMKQYVDNIGKTSHKQEYPS
        ISRRQAIIKID+DGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGM FIFESN  RMKQYVDN+GKTSHKQEY S
Subjt:  ISRRQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNSFRMKQYVDNIGKTSHKQEYPS

XP_023524465.1 uncharacterized protein LOC111788378 [Cucurbita pepo subsp. pepo]0.0e+0083.54Show/hide
Query:  MGALAPVAPWNPEDDILLKNSVEAGASLEALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDVERSSSILPSKFNKFGNPKETKCIGGKRKS
        MGALAP+APW+PEDD LLKN+VEAGASLEALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDVERSSSILPSKFNKFGN KE KCIGGKRKS
Subjt:  MGALAPVAPWNPEDDILLKNSVEAGASLEALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDVERSSSILPSKFNKFGNPKETKCIGGKRKS

Query:  GSVRRCYYALRKRICNEPFNPIDLSYLVGDSDYVGEEPMSGNCIPPISDDFGLQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNGHE
        GSVR CYYALRKRICNEPFNP+  SYLVGDSD+V EEPMSGNCIPP SDDFGLQSSELGI+PCNFSQN MNND TEHTFHSGCQHTVE HFPQNLDNGHE
Subjt:  GSVRRCYYALRKRICNEPFNPIDLSYLVGDSDYVGEEPMSGNCIPPISDDFGLQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNGHE

Query:  GISRIMVDNLPFSANESHAKELAPSTSFPVHSLFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNASPTLPI
        GIS IMVDNLPF ANESHAKELAPS SFPVH+LFENDLE  PSTF QLS DQR MGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNA   LPI
Subjt:  GISRIMVDNLPFSANESHAKELAPSTSFPVHSLFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNASPTLPI

Query:  DVGFPEKDIPTGDSFELPDDDGNNNIQNARLAGYDGHSNSKLKIEVQHDHLKSPNATAEGCYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYYDGLSL
        DVGF +KD+PTGDSFELPDDDG NNIQNARLAGY+ HSNSKLKIEVQHDHLKSPNATAEGCYL E SNSLLN SNEDEL FMDV           DGLSL
Subjt:  DVGFPEKDIPTGDSFELPDDDGNNNIQNARLAGYDGHSNSKLKIEVQHDHLKSPNATAEGCYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYYDGLSL

Query:  LLSSPNEVNHDQTANAINAETVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCSSEAHPSSSASLNSQCPGKGDEPLFCTLNTEDPEIPSNDDVFLPPLS
        LLSSPNEVNHDQT NAIN+ETVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCSSEAHPS S SLNSQCPGK DEPLFCTLNTEDPEIPSNDDVFLPPLS
Subjt:  LLSSPNEVNHDQTANAINAETVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCSSEAHPSSSASLNSQCPGKGDEPLFCTLNTEDPEIPSNDDVFLPPLS

Query:  TTATMGYHFQDCMDPTFSSIKDFSCKEKSGETTQNLVQREKKNH-GQPRVSSLSIGLRGLPERGEKHLVGGAAVNLKLSHSNIIHVPSANKTSSRNVNND
        T ATMGYHF DCM PTFSSI DFSCKE SGE TQNLVQRE+KNH G+PRVS++S+GL  LPERGEKHL+ GA VNLKLSHSN IHVPSANKTSS NVN+D
Subjt:  TTATMGYHFQDCMDPTFSSIKDFSCKEKSGETTQNLVQREKKNH-GQPRVSSLSIGLRGLPERGEKHLVGGAAVNLKLSHSNIIHVPSANKTSSRNVNND

Query:  AILPAALKEENNEISRVNHLGQNFLNTHVEKPDFDSGNVRRYPPSTTCGIKQESDTLTTVKDHRLLQEVGARAVFGVEQDGISSTSDQEDLSIDSEDDLP
         ILP  L EENNEIS                                     E +T T+VKDHRL +EVG + VFGVEQDG+ STSD E+L IDSEDDLP
Subjt:  AILPAALKEENNEISRVNHLGQNFLNTHVEKPDFDSGNVRRYPPSTTCGIKQESDTLTTVKDHRLLQEVGARAVFGVEQDGISSTSDQEDLSIDSEDDLP

Query:  HFSDIEAMILDMDLDPEDQDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSMASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVDIDLGREGSGNK
        HFSDIEAMILDMDLDPEDQDL+ SEEVLKYQH+DT+KRIIRLEQG NA MQRS+ASHGALAVL GRHSKHYIKKSEVLLGRATAEFIVDIDLG EGSGNK
Subjt:  HFSDIEAMILDMDLDPEDQDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSMASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVDIDLGREGSGNK

Query:  ISRRQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNSFRMKQYVDNIGKTSHKQEYPS
        ISRRQAIIKID+DGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGM FIFESN  RMKQYVDN+GKTSHKQEY S
Subjt:  ISRRQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNSFRMKQYVDNIGKTSHKQEYPS

TrEMBL top hitse value%identityAlignment
A0A6J1C2Q8 uncharacterized protein LOC111007538 isoform X20.0e+0081.8Show/hide
Query:  MGALAPVAPWNPEDDILLKNSVEAGASLEALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDVERSSSILPSKFNKFGNPKETKCIGGKRKS
        MGALAPVAPW PEDDILLKN++EAGASLE+LAKGAVQFSRRYTVRELQERWHSLLYDPIVSE+ASMSMID ERSSSILPSKFNKFGNPKETKCIGGKRK 
Subjt:  MGALAPVAPWNPEDDILLKNSVEAGASLEALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDVERSSSILPSKFNKFGNPKETKCIGGKRKS

Query:  GSVRRCYYALRKRICNEPFNPIDLSYLVG--DSDYVGEEPMSGNCIPPISDDFGLQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNG
        GSVRRCYYALRKRICNEPFNP+DLS+LVG  DS+YV EEPMSG+CIPPIS DFGLQ SELGI+P NF+ N+MNND TE TFHS CQHTVEKHFP NLDN 
Subjt:  GSVRRCYYALRKRICNEPFNPIDLSYLVG--DSDYVGEEPMSGNCIPPISDDFGLQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNG

Query:  HEGISRIMVDNLPFSANESHAKELAPSTSFPVHSLFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNAS-PT
        HEGI  IM +NLP S NES  +ELAPS SFPVHSLFENDLEV+PSTFGQ S DQRAMGSE EDN+VFNSPVSDSGASFHNVEYSSPLPGMPIWRNAS P 
Subjt:  HEGISRIMVDNLPFSANESHAKELAPSTSFPVHSLFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNAS-PT

Query:  LPIDVGFPEKDIPTGDSFELPDDDGNNNIQNARLAGYDGHSNSKLKIEVQHDHLKSPNATAEGCYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYYDG
        LPIDVGF +KD+PTGDSFELPDDDGNNNIQNAR+A YD  S+SKLKIEVQHDHLKSPNATAE  YLAELSNSLLNL+NEDELLFMD DG+DVIDKSYYDG
Subjt:  LPIDVGFPEKDIPTGDSFELPDDDGNNNIQNARLAGYDGHSNSKLKIEVQHDHLKSPNATAEGCYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYYDG

Query:  L-SLLLSSPNEVNHDQTANAINAETVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCSSEAHPSSSASLNSQCPGKGDEPLFCTLNTEDPEIPSNDDVFL
        L SLLL+SPNEVNHDQTA+A+N ET+LPTD+M+DPP+ACSGELYEKGSHCSDGHLDCS E HPS SASLNSQC GKGDEPLFCTLNTEDPEIPSNDDVFL
Subjt:  L-SLLLSSPNEVNHDQTANAINAETVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCSSEAHPSSSASLNSQCPGKGDEPLFCTLNTEDPEIPSNDDVFL

Query:  PPLSTTATMGYHFQDCMDPTFSSIKDFSCKEKSGETTQNLVQREKKNHGQPRVSSLSIGLRGLPERGEKHLVGGAAVNLKLSHSNIIHVPSANK------
        PPLST ++MGYHFQD +D TFSSIKDFSC EKSGE TQNLVQRE+KNHGQP VSSLSIGL GLPERGEKHLVGGAAVNLKL HSN IHVPSAN       
Subjt:  PPLSTTATMGYHFQDCMDPTFSSIKDFSCKEKSGETTQNLVQREKKNHGQPRVSSLSIGLRGLPERGEKHLVGGAAVNLKLSHSNIIHVPSANK------

Query:  TSSRNVNNDAILPAALKEENNEISRVNHLGQNFLNTHVEKPDFDSGNVRRYPPSTTCGIKQESDTLTTVKDHRLLQEVGARAVFGVEQDGISSTSDQEDL
        +SS N N DAILP  LKEE+ EISR                                    E D LT VKDHRL QE G+R VFGVEQDGISSTSDQE+L
Subjt:  TSSRNVNNDAILPAALKEENNEISRVNHLGQNFLNTHVEKPDFDSGNVRRYPPSTTCGIKQESDTLTTVKDHRLLQEVGARAVFGVEQDGISSTSDQEDL

Query:  SIDSEDDLPHFSDIEAMILDMDLDPEDQDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSMASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVDID
        SIDSEDD+PHFSDIEAMILDMDLDPEDQDL++SEEVL+YQHMDTKKRI+RLEQGA+A M+RSMASHGALAVLYGR+SKHYIKKSEVLLGRAT + IVDID
Subjt:  SIDSEDDLPHFSDIEAMILDMDLDPEDQDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSMASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVDID

Query:  LGREGSGNKISRRQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNSFRMKQYVDNIGKTSHKQEYPS
        LGREGSGNKISRRQAIIKID+DGFFSLKNLGKCSISINNK+VAPGHCLRLNSGCLIEIRGM FIFES+   MKQY+DNIGKTSHKQEY S
Subjt:  LGREGSGNKISRRQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNSFRMKQYVDNIGKTSHKQEYPS

A0A6J1C5Q1 uncharacterized protein LOC111007538 isoform X10.0e+0085.17Show/hide
Query:  MGALAPVAPWNPEDDILLKNSVEAGASLEALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDVERSSSILPSKFNKFGNPKETKCIGGKRKS
        MGALAPVAPW PEDDILLKN++EAGASLE+LAKGAVQFSRRYTVRELQERWHSLLYDPIVSE+ASMSMID ERSSSILPSKFNKFGNPKETKCIGGKRK 
Subjt:  MGALAPVAPWNPEDDILLKNSVEAGASLEALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDVERSSSILPSKFNKFGNPKETKCIGGKRKS

Query:  GSVRRCYYALRKRICNEPFNPIDLSYLVG--DSDYVGEEPMSGNCIPPISDDFGLQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNG
        GSVRRCYYALRKRICNEPFNP+DLS+LVG  DS+YV EEPMSG+CIPPIS DFGLQ SELGI+P NF+ N+MNND TE TFHS CQHTVEKHFP NLDN 
Subjt:  GSVRRCYYALRKRICNEPFNPIDLSYLVG--DSDYVGEEPMSGNCIPPISDDFGLQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNG

Query:  HEGISRIMVDNLPFSANESHAKELAPSTSFPVHSLFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNAS-PT
        HEGI  IM +NLP S NES  +ELAPS SFPVHSLFENDLEV+PSTFGQ S DQRAMGSE EDN+VFNSPVSDSGASFHNVEYSSPLPGMPIWRNAS P 
Subjt:  HEGISRIMVDNLPFSANESHAKELAPSTSFPVHSLFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNAS-PT

Query:  LPIDVGFPEKDIPTGDSFELPDDDGNNNIQNARLAGYDGHSNSKLKIEVQHDHLKSPNATAEGCYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYYDG
        LPIDVGF +KD+PTGDSFELPDDDGNNNIQNAR+A YD  S+SKLKIEVQHDHLKSPNATAE  YLAELSNSLLNL+NEDELLFMD DG+DVIDKSYYDG
Subjt:  LPIDVGFPEKDIPTGDSFELPDDDGNNNIQNARLAGYDGHSNSKLKIEVQHDHLKSPNATAEGCYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYYDG

Query:  L-SLLLSSPNEVNHDQTANAINAETVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCSSEAHPSSSASLNSQCPGKGDEPLFCTLNTEDPEIPSNDDVFL
        L SLLL+SPNEVNHDQTA+A+N ET+LPTD+M+DPP+ACSGELYEKGSHCSDGHLDCS E HPS SASLNSQC GKGDEPLFCTLNTEDPEIPSNDDVFL
Subjt:  L-SLLLSSPNEVNHDQTANAINAETVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCSSEAHPSSSASLNSQCPGKGDEPLFCTLNTEDPEIPSNDDVFL

Query:  PPLSTTATMGYHFQDCMDPTFSSIKDFSCKEKSGETTQNLVQREKKNHGQPRVSSLSIGLRGLPERGEKHLVGGAAVNLKLSHSNIIHVPSANK------
        PPLST ++MGYHFQD +D TFSSIKDFSC EKSGE TQNLVQRE+KNHGQP VSSLSIGL GLPERGEKHLVGGAAVNLKL HSN IHVPSAN       
Subjt:  PPLSTTATMGYHFQDCMDPTFSSIKDFSCKEKSGETTQNLVQREKKNHGQPRVSSLSIGLRGLPERGEKHLVGGAAVNLKLSHSNIIHVPSANK------

Query:  TSSRNVNNDAILPAALKEENNEISRVNHLGQNFLNTHVEKPDFDSGNVRRYPPSTTCGIKQESDTLTTVKDHRLLQEVGARAVFGVEQDGISSTSDQEDL
        +SS N N DAILP  LKEE+ EISRVNHLGQNFLNTHVEKP FDS N R+YPPST  GIKQE D LT VKDHRL QE G+R VFGVEQDGISSTSDQE+L
Subjt:  TSSRNVNNDAILPAALKEENNEISRVNHLGQNFLNTHVEKPDFDSGNVRRYPPSTTCGIKQESDTLTTVKDHRLLQEVGARAVFGVEQDGISSTSDQEDL

Query:  SIDSEDDLPHFSDIEAMILDMDLDPEDQDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSMASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVDID
        SIDSEDD+PHFSDIEAMILDMDLDPEDQDL++SEEVL+YQHMDTKKRI+RLEQGA+A M+RSMASHGALAVLYGR+SKHYIKKSEVLLGRAT + IVDID
Subjt:  SIDSEDDLPHFSDIEAMILDMDLDPEDQDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSMASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVDID

Query:  LGREGSGNKISRRQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNSFRMKQYVDNIGKTSHKQEYPS
        LGREGSGNKISRRQAIIKID+DGFFSLKNLGKCSISINNK+VAPGHCLRLNSGCLIEIRGM FIFES+   MKQY+DNIGKTSHKQEY S
Subjt:  LGREGSGNKISRRQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNSFRMKQYVDNIGKTSHKQEYPS

A0A6J1FHD3 uncharacterized protein LOC1114457850.0e+0083.43Show/hide
Query:  MGALAPVAPWNPEDDILLKNSVEAGASLEALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDVERSSSILPSKFNKFGNPKETKCIGGKRKS
        MGALAP+APW+PEDDILLKN+VEAGASLEALAKGAVQFSRR+TVRELQERWHSLLYDPIVSEDASMSMIDVERSSSILPSKFNKFGN KE KCIGGKRKS
Subjt:  MGALAPVAPWNPEDDILLKNSVEAGASLEALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDVERSSSILPSKFNKFGNPKETKCIGGKRKS

Query:  GSVRRCYYALRKRICNEPFNPIDLSYLVGDSDYVGEEPMSGNCIPPISDDFGLQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNGHE
        GSVR CYYALRKRICNEPFNP+  SYLVGDSDYV EEPMSGNCIPP SDDFGLQSSELGI+PCNFSQN MNND TEHTFHSGCQHTVE HFPQNLDNGHE
Subjt:  GSVRRCYYALRKRICNEPFNPIDLSYLVGDSDYVGEEPMSGNCIPPISDDFGLQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNGHE

Query:  GISRIMVDNLPFSANESHAKELAPSTSFPVHSLFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNASPTLPI
        GIS IMVDNLPF ANESHAKELAPS SFPVH+LFENDLE  PSTFGQLS DQR MGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNA   LPI
Subjt:  GISRIMVDNLPFSANESHAKELAPSTSFPVHSLFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNASPTLPI

Query:  DVGFPEKDIPTGDSFELPDDDGNNNIQNARLAGYDGHSNSKLKIEVQHDHLKSPNATAEGCYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYYDGLSL
        DVGF +KD+PTGDSFELPDDDGNNNIQNARLAGY+ HSNSKLKIEVQHDHLKSPNATAEGCYL E SNSLLN SNEDEL FMDV           DGLSL
Subjt:  DVGFPEKDIPTGDSFELPDDDGNNNIQNARLAGYDGHSNSKLKIEVQHDHLKSPNATAEGCYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYYDGLSL

Query:  LLSSPNEVNHDQTANAINAETVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCSSEAHPSSSASLNSQCPGKGDEPLFCTLNTEDPEIPSNDDVFLPPLS
        LLSSPNEVNHDQT NAIN+E VLPTDTMLDPPSACSGELYEKGSHCSDGHLDCSSEAHPS S SLN QCPGK DEPLFCTLNTEDPEIPSNDDVFLPPLS
Subjt:  LLSSPNEVNHDQTANAINAETVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCSSEAHPSSSASLNSQCPGKGDEPLFCTLNTEDPEIPSNDDVFLPPLS

Query:  TTATMGYHFQDCMDPTFSSIKDFSCKEKSGETTQNLVQREKKNH-GQPRVSSLSIGLRGLPERGEKHLVGGAAVNLKLSHSNIIHVPSANKTSSRNVNND
        T ATMGYHF DCM PTFSSI DFSCKE SGE TQNLVQRE+KNH GQPRVS++S+GL  L ERGEKHL+ GA VNLKLSHSN IHVPSANKTSS NVN+D
Subjt:  TTATMGYHFQDCMDPTFSSIKDFSCKEKSGETTQNLVQREKKNH-GQPRVSSLSIGLRGLPERGEKHLVGGAAVNLKLSHSNIIHVPSANKTSSRNVNND

Query:  AILPAALKEENNEISRVNHLGQNFLNTHVEKPDFDSGNVRRYPPSTTCGIKQESDTLTTVKDHRLLQEVGARAVFGVEQDGISSTSDQEDLSIDSEDDLP
         ILP  L EENNEIS                                     E +T T+VKDHRL +EVG + VFGVEQDG+ STSD E+L IDSEDDLP
Subjt:  AILPAALKEENNEISRVNHLGQNFLNTHVEKPDFDSGNVRRYPPSTTCGIKQESDTLTTVKDHRLLQEVGARAVFGVEQDGISSTSDQEDLSIDSEDDLP

Query:  HFSDIEAMILDMDLDPEDQDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSMASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVDIDLGREGSGNK
        HFSDIEAMILDMDLDPEDQDL+ SEEVLKYQH+DT+KRIIRLEQG NA MQRS+ASHGALAVL G+HSKHYIKKSEVLLGRATAEFIVDIDLG EGSGNK
Subjt:  HFSDIEAMILDMDLDPEDQDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSMASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVDIDLGREGSGNK

Query:  ISRRQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNSFRMKQYVDNIGKTSHKQEYPS
        ISRRQAIIKID+DG FSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGM FIFESN  RMKQYVDN+GKTSHKQEY S
Subjt:  ISRRQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNSFRMKQYVDNIGKTSHKQEYPS

A0A6J1HIH5 uncharacterized protein LOC111463912 isoform X10.0e+0082.15Show/hide
Query:  MGALAPVAPWNPEDDILLKNSVEAGASLEALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDVERSSSILPSKFNKFGNPKETKCIGGKRKS
        MGALAPVAPW PEDDILLKN+VEAGASLE+LAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMID ERSSSILPSKFN+FGNPKETK IGGKRKS
Subjt:  MGALAPVAPWNPEDDILLKNSVEAGASLEALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDVERSSSILPSKFNKFGNPKETKCIGGKRKS

Query:  GSVRRCYYALRKRICNEPF-NPIDLSYLVG--DSDYVGEEPMSGNCIPPISDDFGLQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDN
        GSVR CYYALRKRICNEPF NP+DL++LVG  +S+YV EEPMSGNCIPPISDDFGLQSSE+GI+PC+FSQNVMN D  EHTF SGCQ TVEKHFP+NLDN
Subjt:  GSVRRCYYALRKRICNEPF-NPIDLSYLVG--DSDYVGEEPMSGNCIPPISDDFGLQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDN

Query:  GHEGISRIMVDNLPFSANESHAKELAPSTSFPVHSLFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNAS-P
        G EGIS  M ++LP SA +SH +ELAPST FPVHSLFENDLE +PSTFGQLSNDQRAMGSE EDN+VFNSPVS+SGASFHNVEYSSPLPGMPIWRNAS P
Subjt:  GHEGISRIMVDNLPFSANESHAKELAPSTSFPVHSLFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNAS-P

Query:  TLPIDVGFPEKDIPTGDSFELPDDDGNNNIQNARLAGYDGHSNSKLKIEVQHDHLKSPNATAEGCYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYYD
         LPIDVGF +KDIPT +SFELPDDDGN NIQNAR+AGYD +S+ KLKIEV+ DHLKSPNATAE  YLAELSNSL+N+SNEDELLFMDVDG+D +DKSYYD
Subjt:  TLPIDVGFPEKDIPTGDSFELPDDDGNNNIQNARLAGYDGHSNSKLKIEVQHDHLKSPNATAEGCYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYYD

Query:  GL-SLLLSSPNEVNHDQTANAINAETVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCSSEAHPSSSASLNSQCPGKGDEPLFCTLNTEDPEIPSNDDVF
        GL SLLL+SPNE+NHDQTANAINAETVLPTDTM+DPP+ACSG LYEKGSHC  GHLDC+SEAH S SASLN+QCP KGDEPLFCTLNTEDP+IPSNDDVF
Subjt:  GL-SLLLSSPNEVNHDQTANAINAETVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCSSEAHPSSSASLNSQCPGKGDEPLFCTLNTEDPEIPSNDDVF

Query:  LPPLSTTATMGYHFQDCMDPTFSSIKDFSCKEKSGETTQNLVQREKKNHGQPRVSSLSIGLRGLPERGEKHLVGGAAVNLKLSHSNIIHVPSANKTSSRN
        LPPLST ATMGY+FQDC++ TFSS KDF+  EKSGE TQNL  RE+KNHG    + ++  L G  ERGEKH VGGA VN + SHSN  H+PS +   S N
Subjt:  LPPLSTTATMGYHFQDCMDPTFSSIKDFSCKEKSGETTQNLVQREKKNHGQPRVSSLSIGLRGLPERGEKHLVGGAAVNLKLSHSNIIHVPSANKTSSRN

Query:  VNNDAILPAALKEENNEISRVNHLGQNFLNTHVEKPDFDSGNVRRYPPSTTCGIKQESDTLTTVKDHRLLQEVGARAVFGVEQDGISSTSDQEDLSIDSE
         N+DA LPA LKEENNEISRVNHLG+NFLN H EKP FDS NVR YPPS  C IKQE D L ++KDHRL QE G R  FGVEQ G+SSTSDQE+LSIDSE
Subjt:  VNNDAILPAALKEENNEISRVNHLGQNFLNTHVEKPDFDSGNVRRYPPSTTCGIKQESDTLTTVKDHRLLQEVGARAVFGVEQDGISSTSDQEDLSIDSE

Query:  DDLPHFSDIEAMILDMDLDPEDQDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSMASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVDIDLGREG
        DD+PHFSDIEAMILDMDLDPEDQDL+SSEEVLKYQH+DTKKRIIRLEQGANAYMQRS ASHGALAVLYGR+SKHYIKKSEVLLGRAT + IVDIDLGREG
Subjt:  DDLPHFSDIEAMILDMDLDPEDQDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSMASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVDIDLGREG

Query:  SGNKISRRQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNSFRMKQYVDNIGKTSHKQEYPS
        SGNKISRRQAIIK+D+DGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESN  RMKQYVDN+GK SHKQEY S
Subjt:  SGNKISRRQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNSFRMKQYVDNIGKTSHKQEYPS

A0A6J1ITH7 uncharacterized protein LOC1114804510.0e+0083.09Show/hide
Query:  MGALAPVAPWNPEDDILLKNSVEAGASLEALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDVERSSSILPSKFNKFGNPKETKCIGGKRKS
        MGALAP+APW+PEDDILLKN+VEA ASLEALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDVERSSSILPSKFNKFGN KE KCIGGKRKS
Subjt:  MGALAPVAPWNPEDDILLKNSVEAGASLEALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDVERSSSILPSKFNKFGNPKETKCIGGKRKS

Query:  GSVRRCYYALRKRICNEPFNPIDLSYLVGDSDYVGEEPMSGNCIPPISDDFGLQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNGHE
        GSVR CYYALRKRICNEPFNP+  SYLVGDSDYV EEPMSGNCIPP SDDFGLQSSELG +PCNFSQN MNND TEHTFHSGCQHTVE HFPQNLDNGHE
Subjt:  GSVRRCYYALRKRICNEPFNPIDLSYLVGDSDYVGEEPMSGNCIPPISDDFGLQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNGHE

Query:  GISRIMVDNLPFSANESHAKELAPSTSFPVHSLFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNASPTLPI
        GIS +MVDNLPF ANESHAKELAPS SFPVH+LFENDLE  PSTFGQLS DQR MGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWR+AS  LPI
Subjt:  GISRIMVDNLPFSANESHAKELAPSTSFPVHSLFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNASPTLPI

Query:  DVGFPEKDIPTGDSFELPDDDGNNNIQNARLAGYDGHSNSKLKIEVQHDHLKSPNATAEGCYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYYDGLSL
        DVGF +KD+PTGDSFELPDDDGNNNIQNARLAGY+ HSNSKLKIEVQHDHLKSPNATAEGCYL ELSN++LN SNEDEL FMDV           DGLSL
Subjt:  DVGFPEKDIPTGDSFELPDDDGNNNIQNARLAGYDGHSNSKLKIEVQHDHLKSPNATAEGCYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYYDGLSL

Query:  LLSSPNEVNHDQTANAINAETVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCSSEAHPSSSASLNSQCPGKGDEPLFCTLNTEDPEIPSNDDVFLPPLS
        LLSSPNEVNHDQT NAI +ETVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCSSEAHPS S SLNSQCPGK DEPLFCTLNTEDPEIPSNDDVFLPPLS
Subjt:  LLSSPNEVNHDQTANAINAETVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCSSEAHPSSSASLNSQCPGKGDEPLFCTLNTEDPEIPSNDDVFLPPLS

Query:  TTATMGYHFQDCMDPTFSSIKDFSCKEKSGETTQNLVQREKKNH-GQPRVSSLSIGLRGLPERGEKHLVGGAAVNLKLSHSNIIHVPSANKTSSRNVNND
        T ATMGYHF DCM PTFSS  DFSCKE SGE TQNLVQRE+KNH GQP VS++S+GL  LPERGEKHL+ G  VNLKLSHSN IHVPSANKTSS NVN+D
Subjt:  TTATMGYHFQDCMDPTFSSIKDFSCKEKSGETTQNLVQREKKNH-GQPRVSSLSIGLRGLPERGEKHLVGGAAVNLKLSHSNIIHVPSANKTSSRNVNND

Query:  AILPAALKEENNEISRVNHLGQNFLNTHVEKPDFDSGNVRRYPPSTTCGIKQESDTLTTVKDHRLLQEVGARAVFGVEQDGISSTSDQEDLSIDSEDDLP
         ILP  L EENNEIS                                     E +T T+VKDHRL +EVG + VFGVEQDG+ STSD E+L IDSEDDLP
Subjt:  AILPAALKEENNEISRVNHLGQNFLNTHVEKPDFDSGNVRRYPPSTTCGIKQESDTLTTVKDHRLLQEVGARAVFGVEQDGISSTSDQEDLSIDSEDDLP

Query:  HFSDIEAMILDMDLDPEDQDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSMASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVDIDLGREGSGNK
        HFSDIEAMILDMDLDPEDQDL+ +EEVLKYQH+DT+KRIIRLEQG NA MQRS+ASHGALAVL GRHSKHYIKKSEVLLGRATAEFIVDIDLG EGSGNK
Subjt:  HFSDIEAMILDMDLDPEDQDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSMASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVDIDLGREGSGNK

Query:  ISRRQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNSFRMKQYVDNIGKTSHKQEYPS
        ISRRQAIIKID+DGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGM FIFESN  RMKQYVDN+GKTSHKQEY S
Subjt:  ISRRQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNSFRMKQYVDNIGKTSHKQEYPS

SwissProt top hitse value%identityAlignment
Q96EZ8 Microspherule protein 11.2e-1232.79Show/hide
Query:  DQEDLSIDSEDDLPHFSDIEAMILDMDL-DPEDQDLFSSEEVLKYQHMDTKKRIIRLEQGANAY---------MQRSMASHGALAVLYGRHSKHYIKKSE
        DQ    +   D + +FSD E +I D  L D  D+ L   E  L       K+ I +LEQ  + +         M      +  LAVL GR  ++ ++  E
Subjt:  DQEDLSIDSEDDLPHFSDIEAMILDMDL-DPEDQDLFSSEEVLKYQHMDTKKRIIRLEQGANAY---------MQRSMASHGALAVLYGRHSKHYIKKSE

Query:  VLLGRATAEFIVDIDLGREGSGNKISRRQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESN
        + LGRAT +  +D+DL  EG   KISR+Q +IK+  +G F + N G+  I I+ + V  G   RL++  ++EI  + F+F  N
Subjt:  VLLGRATAEFIVDIDLGREGSGNKISRRQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESN

Q96EZ8 Microspherule protein 18.6e-0849.15Show/hide
Query:  WNPEDDILLKNSVEAGASLEALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSM
        W P DD+LL N+V     L ++  G V+FS R+T+RE+QERW++LLYDP++S+ A  +M
Subjt:  WNPEDDILLKNSVEAGASLEALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSM

Q99L90 Microspherule protein 11.2e-1233.33Show/hide
Query:  DQEDLSIDSEDDLPHFSDIEAMILDMDL-DPEDQDLFSSEEVLKYQHMDTKKRIIRLEQGANAY--MQRSMASHGA-------LAVLYGRHSKHYIKKSE
        DQ    +   D + +FSD E +I D  L D  D+ L   E  L       K+ I +LEQ  + +  +  S+   G+       LAVL GR  ++ ++  E
Subjt:  DQEDLSIDSEDDLPHFSDIEAMILDMDL-DPEDQDLFSSEEVLKYQHMDTKKRIIRLEQGANAY--MQRSMASHGA-------LAVLYGRHSKHYIKKSE

Query:  VLLGRATAEFIVDIDLGREGSGNKISRRQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESN
        + LGRAT +  +D+DL  EG   KISR+Q +IK+  +G F + N G+  I I+ + V  G   RL++  ++EI  + F+F  N
Subjt:  VLLGRATAEFIVDIDLGREGSGNKISRRQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESN

Q99L90 Microspherule protein 18.6e-0849.15Show/hide
Query:  WNPEDDILLKNSVEAGASLEALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSM
        W P DD+LL N+V     L ++  G V+FS R+T+RE+QERW++LLYDP++S+ A  +M
Subjt:  WNPEDDILLKNSVEAGASLEALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSM

Arabidopsis top hitse value%identityAlignment
AT1G60700.1 SMAD/FHA domain-containing protein5.3e-2936.84Show/hide
Query:  RYPPSTTCGI--------KQESDTLTTVKDHRLLQEVGARAVFGVEQDGISSTSDQEDLSIDSEDDLPHFSDIEAMILDMDLDPEDQD-LFSSEEVLKYQ
        R P S  C I         ++S    T  D +L            E     ST  QE+  +D E+++    DI+AMI  ++L P+D D  F+ EE    +
Subjt:  RYPPSTTCGI--------KQESDTLTTVKDHRLLQEVGARAVFGVEQDGISSTSDQEDLSIDSEDDLPHFSDIEAMILDMDLDPEDQD-LFSSEEVLKYQ

Query:  HMDTKKRIIRLEQGANAYMQRSMASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVDIDLGREGSGNKISRRQAIIKIDEDGFFSLKNLGKCSISINNK
        H   +  +I LEQ     MQR++  HGA+AVL+   SKH+++K EV++GR++    VDIDLG+   G+KISRRQA++K++  G FSLKNLGK  I +N  
Subjt:  HMDTKKRIIRLEQGANAYMQRSMASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVDIDLGREGSGNKISRRQAIIKIDEDGFFSLKNLGKCSISINNK

Query:  DVAPGHCLRLNSGCLIEIRGMPFIFESNSFRMKQYVDNIGKTSHKQE
         +  G  + L S   I IRG+ F+F+ N   + Q++ N   T  K E
Subjt:  DVAPGHCLRLNSGCLIEIRGMPFIFESNSFRMKQYVDNIGKTSHKQE

AT1G75530.1 Forkhead-associated (FHA) domain-containing protein4.4e-4739.93Show/hide
Query:  SANKTSSRNVNNDAILP---AALKEENNEISRVNHLGQNFLNTHVEKPDFDSGNVRRYPPSTTCGIKQESDTLTTVKDHRLLQEVGARAVFGVEQDGISS
        S N +S RN     + P   ++L+ ++ +I   +  G   + T        S +      ST       S TL    ++ + ++         E D   +
Subjt:  SANKTSSRNVNNDAILP---AALKEENNEISRVNHLGQNFLNTHVEKPDFDSGNVRRYPPSTTCGIKQESDTLTTVKDHRLLQEVGARAVFGVEQDGISS

Query:  TSDQEDLSIDSEDDLPHFSDIEAMILDMDLDPEDQDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSMASHGALAVLYGRHSKHYIKKSEVLLGRATA
          ++ ++ I+S+++LP FSD+EAMILDMDL+P  QD +   +  KY++ +  ++I+RLEQ A +YM R +A+HGA A+LYG  SKHYI K EVLLGRAT 
Subjt:  TSDQEDLSIDSEDDLPHFSDIEAMILDMDLDPEDQDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSMASHGALAVLYGRHSKHYIKKSEVLLGRATA

Query:  EFIVDIDLGREGSGNKISRRQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNSFRMKQYVDNIGK
        E+ VDIDLGR GS  + SRRQA+IK+ +DG F +KNLGK SI +N++++  G  + L + CLI+IR   FIFE N   +K+Y+D I K
Subjt:  EFIVDIDLGREGSGNKISRRQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNSFRMKQYVDNIGK

AT1G75530.1 Forkhead-associated (FHA) domain-containing protein2.2e-1442.31Show/hide
Query:  WNPEDDILLKNSVEAGASLEALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDVERSSSILPSKFNKFGNPKETKCIGGKRKSGSVRRCYYA
        W PEDD LL+ S+E G SLE LAKGAV+FSR++T+ EL ERWH LLY+P V+  +S    +++  +  +P                    S  VR  YY 
Subjt:  WNPEDDILLKNSVEAGASLEALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDVERSSSILPSKFNKFGNPKETKCIGGKRKSGSVRRCYYA

Query:  LRKR
         RKR
Subjt:  LRKR

AT3G54350.1 Forkhead-associated (FHA) domain-containing protein2.3e-10935.1Show/hide
Query:  MGALAPVAPWNPEDDILLKNSVEAGASLEALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDVERSSSILPSKFNKFGNPKETKCIGGKRKS
        MGALA V PW PEDD+LLKN+VEAGASLE+LAKGAVQFSRR+++RELQ+RWH+LLYDP+VS +A+  M ++ER++   P+KF + G  KE K    KR +
Subjt:  MGALAPVAPWNPEDDILLKNSVEAGASLEALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDVERSSSILPSKFNKFGNPKETKCIGGKRKS

Query:  GSVRRCYYALRKRICNEPFNPIDLSYLVGDSDYVGEEPMSGNCIPPISDDFGLQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNGHE
          +R  Y++LRK+   EPFN +DL +LV                PP                         ND                HF  N D  H 
Subjt:  GSVRRCYYALRKRICNEPFNPIDLSYLVGDSDYVGEEPMSGNCIPPISDDFGLQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNGHE

Query:  GISRIMVDNLPFSANESHAKELAPSTSFPVHSLFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNASPTLPI
        G+    +D +    + +  + LA       H L                         PEDN                      L G             
Subjt:  GISRIMVDNLPFSANESHAKELAPSTSFPVHSLFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNASPTLPI

Query:  DVGFPEKDIPTGDSFELPDDDGNNNIQNARLAGYD-GHSNSKLKIEVQHDHLKSPNATAEGCYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYYDGL-
               DIP  +   L         ++A L+  D  H +S+ K+E      K+  A+ +  +LA+LS SL     ED   FM+VDG++V DKSYYDGL 
Subjt:  DVGFPEKDIPTGDSFELPDDDGNNNIQNARLAGYD-GHSNSKLKIEVQHDHLKSPNATAEGCYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYYDGL-

Query:  SLLLSSPNEVNHDQTANAINAE-TVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCSSEAHPSSSASLNSQCPGKGDEPLFCTLNTEDPEIPSNDDVFLP
        SLL++S N+ N +   N    E ++ PT                   H  +  LD         + +L+   P      + C LN EDP+IP NDD+FL 
Subjt:  SLLLSSPNEVNHDQTANAINAE-TVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCSSEAHPSSSASLNSQCPGKGDEPLFCTLNTEDPEIPSNDDVFLP

Query:  ----PLSTTATMGYHFQDCMDPTFSSIKDFSCKEKSGETTQNLVQREKKNHGQPRVSSLSIGLRGLPERGEKHLVGGAAVNLKLSHSNIIHVPSANKTSS
            P+S ++    +F+D   P  + ++D S  ++  E     +Q +KK  G+ + S+      G P +G K     A+ + +L ++    V     +S+
Subjt:  ----PLSTTATMGYHFQDCMDPTFSSIKDFSCKEKSGETTQNLVQREKKNHGQPRVSSLSIGLRGLPERGEKHLVGGAAVNLKLSHSNIIHVPSANKTSS

Query:  RNVNNDAILPAALKEENNEISRVNHLGQNFLNT--HVEKPDFDSGNVRRY----PPSTTCGIKQESDTLTTVKDHRLLQEVGARAVFGVEQDGISSTSDQ
        +  +N  +      ++  + +     G  F+ +  H   P+ DS N +      P + +   K   D L  +             V  +E     + ++ 
Subjt:  RNVNNDAILPAALKEENNEISRVNHLGQNFLNT--HVEKPDFDSGNVRRY----PPSTTCGIKQESDTLTTVKDHRLLQEVGARAVFGVEQDGISSTSDQ

Query:  EDLSIDSEDDLPHFSDIEAMILDMDLDPEDQDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSMASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIV
        E    +S++DLP++SDIEAMILDMDL+P+DQD F   EV KYQ  D K+ IIRLEQ A++YMQR++AS GA AVLYGR+SKHYIKK EVL+GR+T +  V
Subjt:  EDLSIDSEDDLPHFSDIEAMILDMDLDPEDQDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSMASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIV

Query:  DIDLGREGSGNKISRRQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNSFRMKQYVDNIGKTS
        DIDLGRE  G+KISRRQAII++ +DG F +KNLGK SIS+N K+V PG  L L S CL+EIRGMPFIFE+N   M++Y+   GK +
Subjt:  DIDLGREGSGNKISRRQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNSFRMKQYVDNIGKTS

AT3G54350.2 Forkhead-associated (FHA) domain-containing protein2.3e-10935.1Show/hide
Query:  MGALAPVAPWNPEDDILLKNSVEAGASLEALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDVERSSSILPSKFNKFGNPKETKCIGGKRKS
        MGALA V PW PEDD+LLKN+VEAGASLE+LAKGAVQFSRR+++RELQ+RWH+LLYDP+VS +A+  M ++ER++   P+KF + G  KE K    KR +
Subjt:  MGALAPVAPWNPEDDILLKNSVEAGASLEALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDVERSSSILPSKFNKFGNPKETKCIGGKRKS

Query:  GSVRRCYYALRKRICNEPFNPIDLSYLVGDSDYVGEEPMSGNCIPPISDDFGLQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNGHE
          +R  Y++LRK+   EPFN +DL +LV                PP                         ND                HF  N D  H 
Subjt:  GSVRRCYYALRKRICNEPFNPIDLSYLVGDSDYVGEEPMSGNCIPPISDDFGLQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNGHE

Query:  GISRIMVDNLPFSANESHAKELAPSTSFPVHSLFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNASPTLPI
        G+    +D +    + +  + LA       H L                         PEDN                      L G             
Subjt:  GISRIMVDNLPFSANESHAKELAPSTSFPVHSLFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNASPTLPI

Query:  DVGFPEKDIPTGDSFELPDDDGNNNIQNARLAGYD-GHSNSKLKIEVQHDHLKSPNATAEGCYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYYDGL-
               DIP  +   L         ++A L+  D  H +S+ K+E      K+  A+ +  +LA+LS SL     ED   FM+VDG++V DKSYYDGL 
Subjt:  DVGFPEKDIPTGDSFELPDDDGNNNIQNARLAGYD-GHSNSKLKIEVQHDHLKSPNATAEGCYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYYDGL-

Query:  SLLLSSPNEVNHDQTANAINAE-TVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCSSEAHPSSSASLNSQCPGKGDEPLFCTLNTEDPEIPSNDDVFLP
        SLL++S N+ N +   N    E ++ PT                   H  +  LD         + +L+   P      + C LN EDP+IP NDD+FL 
Subjt:  SLLLSSPNEVNHDQTANAINAE-TVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCSSEAHPSSSASLNSQCPGKGDEPLFCTLNTEDPEIPSNDDVFLP

Query:  ----PLSTTATMGYHFQDCMDPTFSSIKDFSCKEKSGETTQNLVQREKKNHGQPRVSSLSIGLRGLPERGEKHLVGGAAVNLKLSHSNIIHVPSANKTSS
            P+S ++    +F+D   P  + ++D S  ++  E     +Q +KK  G+ + S+      G P +G K     A+ + +L ++    V     +S+
Subjt:  ----PLSTTATMGYHFQDCMDPTFSSIKDFSCKEKSGETTQNLVQREKKNHGQPRVSSLSIGLRGLPERGEKHLVGGAAVNLKLSHSNIIHVPSANKTSS

Query:  RNVNNDAILPAALKEENNEISRVNHLGQNFLNT--HVEKPDFDSGNVRRY----PPSTTCGIKQESDTLTTVKDHRLLQEVGARAVFGVEQDGISSTSDQ
        +  +N  +      ++  + +     G  F+ +  H   P+ DS N +      P + +   K   D L  +             V  +E     + ++ 
Subjt:  RNVNNDAILPAALKEENNEISRVNHLGQNFLNT--HVEKPDFDSGNVRRY----PPSTTCGIKQESDTLTTVKDHRLLQEVGARAVFGVEQDGISSTSDQ

Query:  EDLSIDSEDDLPHFSDIEAMILDMDLDPEDQDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSMASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIV
        E    +S++DLP++SDIEAMILDMDL+P+DQD F   EV KYQ  D K+ IIRLEQ A++YMQR++AS GA AVLYGR+SKHYIKK EVL+GR+T +  V
Subjt:  EDLSIDSEDDLPHFSDIEAMILDMDLDPEDQDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSMASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIV

Query:  DIDLGREGSGNKISRRQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNSFRMKQYVDNIGKTS
        DIDLGRE  G+KISRRQAII++ +DG F +KNLGK SIS+N K+V PG  L L S CL+EIRGMPFIFE+N   M++Y+   GK +
Subjt:  DIDLGREGSGNKISRRQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNSFRMKQYVDNIGKTS

AT3G54350.3 Forkhead-associated (FHA) domain-containing protein2.3e-10935.1Show/hide
Query:  MGALAPVAPWNPEDDILLKNSVEAGASLEALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDVERSSSILPSKFNKFGNPKETKCIGGKRKS
        MGALA V PW PEDD+LLKN+VEAGASLE+LAKGAVQFSRR+++RELQ+RWH+LLYDP+VS +A+  M ++ER++   P+KF + G  KE K    KR +
Subjt:  MGALAPVAPWNPEDDILLKNSVEAGASLEALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDVERSSSILPSKFNKFGNPKETKCIGGKRKS

Query:  GSVRRCYYALRKRICNEPFNPIDLSYLVGDSDYVGEEPMSGNCIPPISDDFGLQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNGHE
          +R  Y++LRK+   EPFN +DL +LV                PP                         ND                HF  N D  H 
Subjt:  GSVRRCYYALRKRICNEPFNPIDLSYLVGDSDYVGEEPMSGNCIPPISDDFGLQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNGHE

Query:  GISRIMVDNLPFSANESHAKELAPSTSFPVHSLFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNASPTLPI
        G+    +D +    + +  + LA       H L                         PEDN                      L G             
Subjt:  GISRIMVDNLPFSANESHAKELAPSTSFPVHSLFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNASPTLPI

Query:  DVGFPEKDIPTGDSFELPDDDGNNNIQNARLAGYD-GHSNSKLKIEVQHDHLKSPNATAEGCYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYYDGL-
               DIP  +   L         ++A L+  D  H +S+ K+E      K+  A+ +  +LA+LS SL     ED   FM+VDG++V DKSYYDGL 
Subjt:  DVGFPEKDIPTGDSFELPDDDGNNNIQNARLAGYD-GHSNSKLKIEVQHDHLKSPNATAEGCYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYYDGL-

Query:  SLLLSSPNEVNHDQTANAINAE-TVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCSSEAHPSSSASLNSQCPGKGDEPLFCTLNTEDPEIPSNDDVFLP
        SLL++S N+ N +   N    E ++ PT                   H  +  LD         + +L+   P      + C LN EDP+IP NDD+FL 
Subjt:  SLLLSSPNEVNHDQTANAINAE-TVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCSSEAHPSSSASLNSQCPGKGDEPLFCTLNTEDPEIPSNDDVFLP

Query:  ----PLSTTATMGYHFQDCMDPTFSSIKDFSCKEKSGETTQNLVQREKKNHGQPRVSSLSIGLRGLPERGEKHLVGGAAVNLKLSHSNIIHVPSANKTSS
            P+S ++    +F+D   P  + ++D S  ++  E     +Q +KK  G+ + S+      G P +G K     A+ + +L ++    V     +S+
Subjt:  ----PLSTTATMGYHFQDCMDPTFSSIKDFSCKEKSGETTQNLVQREKKNHGQPRVSSLSIGLRGLPERGEKHLVGGAAVNLKLSHSNIIHVPSANKTSS

Query:  RNVNNDAILPAALKEENNEISRVNHLGQNFLNT--HVEKPDFDSGNVRRY----PPSTTCGIKQESDTLTTVKDHRLLQEVGARAVFGVEQDGISSTSDQ
        +  +N  +      ++  + +     G  F+ +  H   P+ DS N +      P + +   K   D L  +             V  +E     + ++ 
Subjt:  RNVNNDAILPAALKEENNEISRVNHLGQNFLNT--HVEKPDFDSGNVRRY----PPSTTCGIKQESDTLTTVKDHRLLQEVGARAVFGVEQDGISSTSDQ

Query:  EDLSIDSEDDLPHFSDIEAMILDMDLDPEDQDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSMASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIV
        E    +S++DLP++SDIEAMILDMDL+P+DQD F   EV KYQ  D K+ IIRLEQ A++YMQR++AS GA AVLYGR+SKHYIKK EVL+GR+T +  V
Subjt:  EDLSIDSEDDLPHFSDIEAMILDMDLDPEDQDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSMASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIV

Query:  DIDLGREGSGNKISRRQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNSFRMKQYVDNIGKTS
        DIDLGRE  G+KISRRQAII++ +DG F +KNLGK SIS+N K+V PG  L L S CL+EIRGMPFIFE+N   M++Y+   GK +
Subjt:  DIDLGREGSGNKISRRQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNSFRMKQYVDNIGKTS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGCTCTTGCCCCTGTCGCGCCTTGGAATCCTGAAGACGATATTCTGCTCAAGAATTCCGTTGAGGCAGGTGCTTCCTTGGAGGCCCTTGCCAAAGGTGCTGTGCA
GTTTTCTCGAAGATACACAGTGAGAGAATTGCAAGAACGATGGCATTCTTTACTTTATGATCCAATTGTATCTGAAGATGCATCTATGTCCATGATTGACGTTGAGCGAT
CTTCTTCCATTCTTCCATCAAAGTTCAACAAGTTTGGGAATCCAAAAGAAACTAAATGTATTGGTGGGAAGAGGAAATCTGGGAGTGTACGCCGTTGCTATTATGCTTTG
CGTAAAAGAATTTGCAACGAACCATTTAATCCTATAGACCTGAGTTATCTTGTTGGTGATAGTGACTATGTCGGTGAAGAGCCCATGTCAGGAAATTGTATCCCTCCAAT
ATCAGATGATTTTGGACTTCAGAGCTCAGAGCTGGGGATCATGCCCTGTAATTTTTCCCAAAATGTGATGAATAATGATGGTACTGAGCACACTTTTCATTCTGGATGTC
AACATACAGTTGAAAAGCATTTTCCTCAGAACCTGGATAATGGACATGAGGGAATTTCTCGTATTATGGTAGATAATCTGCCTTTCTCTGCAAATGAATCTCATGCAAAA
GAATTGGCTCCGTCAACTAGCTTTCCAGTCCATAGTCTCTTTGAAAATGATTTGGAGGTAAAACCTTCCACTTTTGGGCAACTGAGCAATGACCAGAGAGCAATGGGCTC
TGAACCAGAAGACAACGATGTCTTTAATTCTCCCGTTTCTGATTCTGGCGCATCATTTCATAATGTTGAGTACTCATCTCCTCTTCCTGGTATGCCAATATGGAGAAATG
CCTCGCCAACCTTGCCAATTGATGTTGGCTTTCCAGAGAAGGATATACCTACAGGAGACTCTTTTGAACTCCCTGATGATGATGGGAACAACAACATTCAAAATGCAAGA
TTAGCGGGCTATGATGGTCACTCTAACTCAAAGTTGAAGATTGAAGTACAGCATGATCATTTGAAAAGTCCAAATGCCACTGCTGAAGGTTGTTATCTAGCGGAATTGTC
CAATTCTCTTTTGAACTTGAGCAATGAGGATGAGCTACTTTTCATGGATGTTGATGGAAGGGATGTAATTGATAAGTCATACTATGATGGGTTGTCGCTTTTGTTGAGTT
CACCAAATGAAGTTAATCATGATCAAACAGCCAATGCAATTAATGCAGAAACAGTGTTACCAACGGATACAATGTTAGATCCCCCCTCTGCATGTTCTGGAGAGTTATAT
GAAAAGGGATCCCACTGTAGTGATGGACATCTGGATTGTAGTTCAGAAGCTCATCCATCGTCATCTGCATCTTTGAACAGTCAATGTCCTGGAAAAGGTGATGAACCTCT
TTTCTGCACTTTGAACACAGAAGACCCAGAAATCCCGAGCAACGATGATGTTTTTCTACCTCCATTGTCAACAACTGCTACTATGGGATACCATTTTCAAGATTGCATGG
ATCCTACCTTTTCATCTATTAAGGATTTCTCTTGTAAAGAAAAATCTGGTGAAACGACTCAAAACCTTGTGCAAAGGGAGAAGAAAAATCATGGACAACCTCGTGTTTCA
TCTCTATCAATTGGATTGCGTGGTTTGCCTGAAAGAGGTGAAAAACATCTGGTTGGTGGAGCTGCTGTTAATTTAAAATTATCCCATAGCAACATCATACACGTGCCATC
TGCAAATAAAACCAGCTCCAGAAATGTAAATAACGATGCTATCCTACCTGCCGCACTCAAGGAAGAGAACAATGAAATTTCCCGGGTAAATCATCTTGGCCAGAATTTTT
TAAATACTCATGTAGAGAAGCCAGACTTTGATTCTGGCAATGTTAGAAGATATCCACCAAGTACTACTTGTGGCATTAAACAGGAATCAGATACATTGACTACAGTGAAA
GATCATCGATTGTTGCAAGAAGTGGGTGCTCGAGCTGTTTTTGGTGTAGAACAAGATGGAATATCTTCGACATCTGATCAAGAAGATTTATCTATTGATAGTGAAGATGA
TTTACCTCATTTTTCAGATATTGAGGCAATGATTCTTGATATGGACTTGGATCCAGAAGATCAGGATTTGTTTTCAAGTGAAGAAGTCTTAAAATATCAACATATGGACA
CCAAGAAGAGAATCATCAGACTGGAGCAAGGGGCTAACGCTTACATGCAAAGATCTATGGCCTCTCATGGGGCATTAGCAGTTCTGTATGGCCGACATTCAAAGCATTAC
ATTAAGAAATCAGAGGTTCTATTGGGTAGAGCAACTGCAGAATTCATTGTGGACATTGACTTAGGAAGGGAGGGAAGCGGTAACAAAATATCTCGACGGCAGGCAATTAT
AAAAATAGATGAGGATGGATTTTTCTCCCTGAAGAATCTTGGCAAGTGCTCAATCTCTATAAATAACAAGGATGTGGCCCCTGGTCACTGCCTGCGACTTAATTCTGGTT
GCTTGATTGAGATAAGGGGAATGCCATTTATATTTGAGTCAAACTCATTTCGCATGAAGCAGTATGTGGATAATATAGGCAAGACATCTCACAAACAGGAGTATCCATCA
TAA
mRNA sequenceShow/hide mRNA sequence
GTCTCTTGGTAAATTAAAAAGAGAAAGCCCTAGATCCCTCATCTTCCCTCCTATTCTCGTGAAAGTCACACACCCAACCCTGCTCGACTTCTTCTCCTTCTTCACCGTCG
ACAACTTCTTCTTCTCCCTCTTCCAACCTCACGTCGGCCCCTCCCCGTTTCTGCAACTCCAGCGGCAGCACCCACACGAGCCTTTCCCCCTTCATCGTCTCCTTCTTCTT
CCTCACGCCGACAACCACATTCATTTTCGATTTCTTCTTCTTCTCTTCACCTCGACTTTCTCCACCGCCGGCAGCAGCAAACACGCGCAGTGCCTCTCCTCCTCCGACGA
GGGTGTCCGAACGACAACAAATGGTCGTTTTTCTTTGTGTATCGGGCAACACAACCCACAAGATAGCATCCTCCCTTCGTCGAGTCGCCATCTCTCTTCACCCTCTCCTT
TCGTTATAGACGCCGCTTGCGCTCGCCATCTCCCAGCTTCGCCAAGTCGCAGACGCTGCTCGGCGCCTGCCCTCTCTCTCTCCGCCCTCTCTTTCGACAAGTTGCCGACC
ACTGTTCGGCAAGGGCTCTCCTTTCATTTTATTCTCCCAAATATTAAGGGTTTCTCTTAAACCCCCCGACGCCATATCCTTCGTCCCTTCTCTTAACCCTTCTCTCGCTT
CCCCGCTGCATTACACTCTTTCTAAAGACGGGCTGGTAAGGAATTGTTGCTTTACGGAGATGGGAGCTCTTGCCCCTGTCGCGCCTTGGAATCCTGAAGACGATATTCTG
CTCAAGAATTCCGTTGAGGCAGGTGCTTCCTTGGAGGCCCTTGCCAAAGGTGCTGTGCAGTTTTCTCGAAGATACACAGTGAGAGAATTGCAAGAACGATGGCATTCTTT
ACTTTATGATCCAATTGTATCTGAAGATGCATCTATGTCCATGATTGACGTTGAGCGATCTTCTTCCATTCTTCCATCAAAGTTCAACAAGTTTGGGAATCCAAAAGAAA
CTAAATGTATTGGTGGGAAGAGGAAATCTGGGAGTGTACGCCGTTGCTATTATGCTTTGCGTAAAAGAATTTGCAACGAACCATTTAATCCTATAGACCTGAGTTATCTT
GTTGGTGATAGTGACTATGTCGGTGAAGAGCCCATGTCAGGAAATTGTATCCCTCCAATATCAGATGATTTTGGACTTCAGAGCTCAGAGCTGGGGATCATGCCCTGTAA
TTTTTCCCAAAATGTGATGAATAATGATGGTACTGAGCACACTTTTCATTCTGGATGTCAACATACAGTTGAAAAGCATTTTCCTCAGAACCTGGATAATGGACATGAGG
GAATTTCTCGTATTATGGTAGATAATCTGCCTTTCTCTGCAAATGAATCTCATGCAAAAGAATTGGCTCCGTCAACTAGCTTTCCAGTCCATAGTCTCTTTGAAAATGAT
TTGGAGGTAAAACCTTCCACTTTTGGGCAACTGAGCAATGACCAGAGAGCAATGGGCTCTGAACCAGAAGACAACGATGTCTTTAATTCTCCCGTTTCTGATTCTGGCGC
ATCATTTCATAATGTTGAGTACTCATCTCCTCTTCCTGGTATGCCAATATGGAGAAATGCCTCGCCAACCTTGCCAATTGATGTTGGCTTTCCAGAGAAGGATATACCTA
CAGGAGACTCTTTTGAACTCCCTGATGATGATGGGAACAACAACATTCAAAATGCAAGATTAGCGGGCTATGATGGTCACTCTAACTCAAAGTTGAAGATTGAAGTACAG
CATGATCATTTGAAAAGTCCAAATGCCACTGCTGAAGGTTGTTATCTAGCGGAATTGTCCAATTCTCTTTTGAACTTGAGCAATGAGGATGAGCTACTTTTCATGGATGT
TGATGGAAGGGATGTAATTGATAAGTCATACTATGATGGGTTGTCGCTTTTGTTGAGTTCACCAAATGAAGTTAATCATGATCAAACAGCCAATGCAATTAATGCAGAAA
CAGTGTTACCAACGGATACAATGTTAGATCCCCCCTCTGCATGTTCTGGAGAGTTATATGAAAAGGGATCCCACTGTAGTGATGGACATCTGGATTGTAGTTCAGAAGCT
CATCCATCGTCATCTGCATCTTTGAACAGTCAATGTCCTGGAAAAGGTGATGAACCTCTTTTCTGCACTTTGAACACAGAAGACCCAGAAATCCCGAGCAACGATGATGT
TTTTCTACCTCCATTGTCAACAACTGCTACTATGGGATACCATTTTCAAGATTGCATGGATCCTACCTTTTCATCTATTAAGGATTTCTCTTGTAAAGAAAAATCTGGTG
AAACGACTCAAAACCTTGTGCAAAGGGAGAAGAAAAATCATGGACAACCTCGTGTTTCATCTCTATCAATTGGATTGCGTGGTTTGCCTGAAAGAGGTGAAAAACATCTG
GTTGGTGGAGCTGCTGTTAATTTAAAATTATCCCATAGCAACATCATACACGTGCCATCTGCAAATAAAACCAGCTCCAGAAATGTAAATAACGATGCTATCCTACCTGC
CGCACTCAAGGAAGAGAACAATGAAATTTCCCGGGTAAATCATCTTGGCCAGAATTTTTTAAATACTCATGTAGAGAAGCCAGACTTTGATTCTGGCAATGTTAGAAGAT
ATCCACCAAGTACTACTTGTGGCATTAAACAGGAATCAGATACATTGACTACAGTGAAAGATCATCGATTGTTGCAAGAAGTGGGTGCTCGAGCTGTTTTTGGTGTAGAA
CAAGATGGAATATCTTCGACATCTGATCAAGAAGATTTATCTATTGATAGTGAAGATGATTTACCTCATTTTTCAGATATTGAGGCAATGATTCTTGATATGGACTTGGA
TCCAGAAGATCAGGATTTGTTTTCAAGTGAAGAAGTCTTAAAATATCAACATATGGACACCAAGAAGAGAATCATCAGACTGGAGCAAGGGGCTAACGCTTACATGCAAA
GATCTATGGCCTCTCATGGGGCATTAGCAGTTCTGTATGGCCGACATTCAAAGCATTACATTAAGAAATCAGAGGTTCTATTGGGTAGAGCAACTGCAGAATTCATTGTG
GACATTGACTTAGGAAGGGAGGGAAGCGGTAACAAAATATCTCGACGGCAGGCAATTATAAAAATAGATGAGGATGGATTTTTCTCCCTGAAGAATCTTGGCAAGTGCTC
AATCTCTATAAATAACAAGGATGTGGCCCCTGGTCACTGCCTGCGACTTAATTCTGGTTGCTTGATTGAGATAAGGGGAATGCCATTTATATTTGAGTCAAACTCATTTC
GCATGAAGCAGTATGTGGATAATATAGGCAAGACATCTCACAAACAGGAGTATCCATCATAATGATAGATGACAACCTAGCTGACATGGACGGTTGTACAGTCGGAAGGT
GCTCATTCATTTTTCTGCACTTTCTTCTGGGTTCTTTAGGTATGCTGTGTCAAGTTGTTATCTCGTCTGCGATTAGAAATATAAAATTCTATTCATTTTGGTAATTCCAG
TAGTTATTTTGTATATTGCTTTGTGATTCAACGTCTTTCTTCTCAATGAGGTTTGATTGGATTGCTAAAGATGGGAAACAAAACAATGGCAACACAGTTGTGGAGATTAT
TTGGAACAGTAACATCAGCAAGGAGCTGTCCAACCAAATTACGAAGATGTAGGTAAATTTACACTCGTTATCTACCCAACCCAGCCACGATACCTGACTCAAACCTGCAT
TCTGGTTACTCCTAAGTTTCACCATATGATGTTCGTTCCCATCGTTGCACTAAACTAAAGTTGATATCTTTTTGGGATATAGCTTCAGGTCGAAAATGAAAATGAAAATG
AAGGTGTAATGGTCCCTGAAATCGAATCTGGCAGCTTGGCCAATTCCATACCTGTCTAGAAGGCGAGGTTCCTTCTGCTACGCTAGCATTAACTCTTGCATTATTTAACA
ATATGATGTATGGAATATACAGTGTTCTGTTGTACAATCTTGAATCTTCAATGTAATAATATATTTGCGAATATTATCTTGTAATAAAAGGTCTTGCCTTCTCAAAATGA
GTCGTAATGGGAAGTGAAAAGGAAGTCTGACAAATTCAATACGTAAGAAACTGCAACTCAGGAGGACAGGTAAAAGCTAAGAGGGGAGAATAGCTCAAGCTCAGCACATA
GAGACAGGCAGTGGCTGATGTGTTGGATTGGAGGTACTCAATGCTCTGTCCAAACTGATTAAAGCCTCTTATCCATCTGATTCTGAACTGAAATTATCAATTTATACAAA
ACTTTATTGCTGTTGAGAAAGGGTATGAATCTTCTTAATAGCCTCATGTGGTCCTTAGGATGGCTCCACCTGGAGTTTGTAGCCAGACCTAAACAATCTTGTTTAAGACA
AATAGATTAATAGCCACGTCATAATGGTTGCTGTCTGTCTACATGAAATTATTCACTCCAACTAAGAATTACCTTCATAATTCTATCTATAAGTTCTGCCCGACTTGCAA
GTTGGAAACTTAATTCAAC
Protein sequenceShow/hide protein sequence
MGALAPVAPWNPEDDILLKNSVEAGASLEALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDVERSSSILPSKFNKFGNPKETKCIGGKRKSGSVRRCYYAL
RKRICNEPFNPIDLSYLVGDSDYVGEEPMSGNCIPPISDDFGLQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNGHEGISRIMVDNLPFSANESHAK
ELAPSTSFPVHSLFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMPIWRNASPTLPIDVGFPEKDIPTGDSFELPDDDGNNNIQNAR
LAGYDGHSNSKLKIEVQHDHLKSPNATAEGCYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYYDGLSLLLSSPNEVNHDQTANAINAETVLPTDTMLDPPSACSGELY
EKGSHCSDGHLDCSSEAHPSSSASLNSQCPGKGDEPLFCTLNTEDPEIPSNDDVFLPPLSTTATMGYHFQDCMDPTFSSIKDFSCKEKSGETTQNLVQREKKNHGQPRVS
SLSIGLRGLPERGEKHLVGGAAVNLKLSHSNIIHVPSANKTSSRNVNNDAILPAALKEENNEISRVNHLGQNFLNTHVEKPDFDSGNVRRYPPSTTCGIKQESDTLTTVK
DHRLLQEVGARAVFGVEQDGISSTSDQEDLSIDSEDDLPHFSDIEAMILDMDLDPEDQDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSMASHGALAVLYGRHSKHY
IKKSEVLLGRATAEFIVDIDLGREGSGNKISRRQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNSFRMKQYVDNIGKTSHKQEYPS