; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0020301 (gene) of Chayote v1 genome

Gene IDSed0020301
OrganismSechium edule (Chayote v1)
DescriptionFHA domain-containing protein
Genome locationLG01:18827497..18836546
RNA-Seq ExpressionSed0020301
SyntenySed0020301
Gene Ontology termsGO:0031011 - Ino80 complex (cellular component)
GO:0071339 - MLL1 complex (cellular component)
GO:0002151 - G-quadruplex RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR000253 - Forkhead-associated (FHA) domain
IPR008984 - SMAD/FHA domain superfamily
IPR025999 - Microspherule protein, N-terminal domain
IPR037912 - Microspherule protein 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6571705.1 Microspherule protein 1, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0078.05Show/hide
Query:  MGALAPVAPWTPEDDILLKNAVEAGASLEALAKGAVQFSRRYTVRELQERWYSLLYDPIVSEEASMSMIDIECSSSILPSKFNKFGNPKETKYIGGKRKS
        MGALAPVAPWTPEDDILLKNAVEAGASLE+LAKGAVQFSRRYTVRELQERW+SLLYDPIVSE+ASMSMID E SSSILPSKFNKFGNPKETKYIGGKRKS
Subjt:  MGALAPVAPWTPEDDILLKNAVEAGASLEALAKGAVQFSRRYTVRELQERWYSLLYDPIVSEEASMSMIDIECSSSILPSKFNKFGNPKETKYIGGKRKS

Query:  GSVRRCYYALCKRICNEPF-NPMDLNYLVG--DSDYVGEEPMQENCVPPISDDFGLPSSELEILPCNFAQNVMNEDDTERTFLSGCQHTVEKHFPQNLDN
        GSVR CYYAL KRICNEPF NPMDLN+LVG  +S+YV EEPM  NC+PPISDDFGL SSE+ ILPC+F+QNVMN DD E TF SGCQ TVEKHFP+NLDN
Subjt:  GSVRRCYYALCKRICNEPF-NPMDLNYLVG--DSDYVGEEPMQENCVPPISDDFGLPSSELEILPCNFAQNVMNEDDTERTFLSGCQHTVEKHFPQNLDN

Query:  GHEGISHIIVDNLPFSANESHIEELAPSTSFPVHSIFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMQLWRD-SSP
        G EGISH + ++LP SA +SH+EELAPST FPVHS+FENDLE +PSTFGQLSNDQRAMGSE EDN+VFNSPVS+SGASFHNVEYSSPLPGM +WR+ S+P
Subjt:  GHEGISHIIVDNLPFSANESHIEELAPSTSFPVHSIFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMQLWRD-SSP

Query:  ELPIDVDFANTNIPTGDSFELPDDDDGNNNIQNARLAGYDAHSNSKLKIEVQHDHLKSSNASAEGSYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYY
         LPIDV FA+ +IPT +SFELP DDDGN NIQNAR+AGYDA+S+ KLKIEV+ DHLKS NA+AE  YLAELSNSL+N+SNEDELLFMDVDG+D +DKSYY
Subjt:  ELPIDVDFANTNIPTGDSFELPDDDDGNNNIQNARLAGYDAHSNSKLKIEVQHDHLKSSNASAEGSYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYY

Query:  EGL-SLLLSSPNEVHPDQTTEAINGETVLPMDTMLDPPTACSGELYEKGSHCSNRNLDCSSETHPSPSASLNSQCSGRGDEPLFCSLNTEDPEIPSNDDV
        +GL SLLL+SPNE++ DQT  AIN ETVLP DTM+DPPTACSG LYEKGSHC   +LDC+SE H SPSASLNSQC  +GDEPLFC+LNTEDP+IPSNDDV
Subjt:  EGL-SLLLSSPNEVHPDQTTEAINGETVLPMDTMLDPPTACSGELYEKGSHCSNRNLDCSSETHPSPSASLNSQCSGRGDEPLFCSLNTEDPEIPSNDDV

Query:  FLPPLTTISSMGYHFQDCVDPTFSSIKDFFCKEKSGEKTQNLVQRERKNHGQPRVSSLSIGLHGLPERGERHLVGGAAVSLKLSHSNSIHGSSANKTRSI
        FLPPL+T+++MGY+FQDC++ TFSS KDF   EKSGE TQNL  RERKNHG    + ++  LHG  ERGE+H VGGA V+ + SHSN+ H  S +   SI
Subjt:  FLPPLTTISSMGYHFQDCVDPTFSSIKDFFCKEKSGEKTQNLVQRERKNHGQPRVSSLSIGLHGLPERGERHLVGGAAVSLKLSHSNSIHGSSANKTRSI

Query:  NANNDAILPVSLKEESNEISRVNHLGQNFLNTHVEKPGFES--VSRYPPSATCGIKQEPDTLTTVKDHQLSQELGARGVFNVEQDGVSLKSDQEELSIES
        N N+DA LP  LKEE+NEISRVNHLG+NFLN H EKPGF+S  V  YPPSA C IKQEPD L ++KDH+LSQE G RG F VEQ G+S  SDQEELSI+S
Subjt:  NANNDAILPVSLKEESNEISRVNHLGQNFLNTHVEKPGFES--VSRYPPSATCGIKQEPDTLTTVKDHQLSQELGARGVFNVEQDGVSLKSDQEELSIES

Query:  DDDVPHFSDIEAMILDMDLDPDDQDLYSIEEVLKYQHMETKKKIIRLEQGANAFMQRSIASHGALAILYGRHAEYYIKKSEVLLGRATADCIVDIDLGRE
        +DDVPHFSDIEAMILDMDLDP+DQDLYS EEVLKYQH++TKK+IIRLEQGANA+MQRS ASHGALA+LYGR++++YIKKSEVLLGRAT D IVDIDLGRE
Subjt:  DDDVPHFSDIEAMILDMDLDPDDQDLYSIEEVLKYQHMETKKKIIRLEQGANAFMQRSIASHGALAILYGRHAEYYIKKSEVLLGRATADCIVDIDLGRE

Query:  GSGNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAWGHCLRLNSGCLIEIRGIPFIFESNPVRMKQYVENIDKTSHKQEY
        GSGNKISRRQAIIK+DQDGFFSLKNLGKCSISINNKDVA GHCLRLNSGCLIEIRG+PFIFESNP RMKQYV+N+ K SHKQEY
Subjt:  GSGNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAWGHCLRLNSGCLIEIRGIPFIFESNPVRMKQYVENIDKTSHKQEY

XP_022135664.1 uncharacterized protein LOC111007538 isoform X1 [Momordica charantia]0.0e+0081.71Show/hide
Query:  MGALAPVAPWTPEDDILLKNAVEAGASLEALAKGAVQFSRRYTVRELQERWYSLLYDPIVSEEASMSMIDIECSSSILPSKFNKFGNPKETKYIGGKRKS
        MGALAPVAPWTPEDDILLKNA+EAGASLE+LAKGAVQFSRRYTVRELQERW+SLLYDPIVSEEASMSMID E SSSILPSKFNKFGNPKETK IGGKRK 
Subjt:  MGALAPVAPWTPEDDILLKNAVEAGASLEALAKGAVQFSRRYTVRELQERWYSLLYDPIVSEEASMSMIDIECSSSILPSKFNKFGNPKETKYIGGKRKS

Query:  GSVRRCYYALCKRICNEPFNPMDLNYLVG--DSDYVGEEPMQENCVPPISDDFGLPSSELEILPCNFAQNVMNEDDTERTFLSGCQHTVEKHFPQNLDNG
        GSVRRCYYAL KRICNEPFNPMDL++LVG  DS+YV EEPM  +C+PPIS DFGL  SEL ILP NFA N+MN DDTE TF S CQHTVEKHFP NLDN 
Subjt:  GSVRRCYYALCKRICNEPFNPMDLNYLVG--DSDYVGEEPMQENCVPPISDDFGLPSSELEILPCNFAQNVMNEDDTERTFLSGCQHTVEKHFPQNLDNG

Query:  HEGISHIIVDNLPFSANESHIEELAPSTSFPVHSIFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMQLWRD-SSPE
        HEGI HI+ +NLP S NES +EELAPS SFPVHS+FENDLEV+PSTFGQ S DQRAMGSE EDN+VFNSPVSDSGASFHNVEYSSPLPGM +WR+ S+P 
Subjt:  HEGISHIIVDNLPFSANESHIEELAPSTSFPVHSIFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMQLWRD-SSPE

Query:  LPIDVDFANTNIPTGDSFELPDDDDGNNNIQNARLAGYDAHSNSKLKIEVQHDHLKSSNASAEGSYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYYE
        LPIDV F++ ++PTGDSFELP DDDGNNNIQNAR+A YDA S+SKLKIEVQHDHLKS NA+AE  YLAELSNSLLNL+NEDELLFMD DG+DVIDKSYY+
Subjt:  LPIDVDFANTNIPTGDSFELPDDDDGNNNIQNARLAGYDAHSNSKLKIEVQHDHLKSSNASAEGSYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYYE

Query:  GL-SLLLSSPNEVHPDQTTEAINGETVLPMDTMLDPPTACSGELYEKGSHCSNRNLDCSSETHPSPSASLNSQCSGRGDEPLFCSLNTEDPEIPSNDDVF
        GL SLLL+SPNEV+ DQT +A+N ET+LP D+M+DPPTACSGELYEKGSHCS+ +LDCS E HPSPSASLNSQC G+GDEPLFC+LNTEDPEIPSNDDVF
Subjt:  GL-SLLLSSPNEVHPDQTTEAINGETVLPMDTMLDPPTACSGELYEKGSHCSNRNLDCSSETHPSPSASLNSQCSGRGDEPLFCSLNTEDPEIPSNDDVF

Query:  LPPLTTISSMGYHFQDCVDPTFSSIKDFFCKEKSGEKTQNLVQRERKNHGQPRVSSLSIGLHGLPERGERHLVGGAAVSLKLSHSNSIHGSSANK-----
        LPPL+TISSMGYHFQD +D TFSSIKDF C EKSGE TQNLVQRERKNHGQP VSSLSIGLHGLPERGE+HLVGGAAV+LKL HSNSIH  SAN      
Subjt:  LPPLTTISSMGYHFQDCVDPTFSSIKDFFCKEKSGEKTQNLVQRERKNHGQPRVSSLSIGLHGLPERGERHLVGGAAVSLKLSHSNSIHGSSANK-----

Query:  -TRSINANNDAILPVSLKEESNEISRVNHLGQNFLNTHVEKPGFESVS--RYPPSATCGIKQEPDTLTTVKDHQLSQELGARGVFNVEQDGVSLKSDQEE
         + SINAN DAILPV+LKEES EISRVNHLGQNFLNTHVEKPGF+S +  +YPPS   GIKQEPD LT VKDH+LSQE G+RGVF VEQDG+S  SDQEE
Subjt:  -TRSINANNDAILPVSLKEESNEISRVNHLGQNFLNTHVEKPGFESVS--RYPPSATCGIKQEPDTLTTVKDHQLSQELGARGVFNVEQDGVSLKSDQEE

Query:  LSIESDDDVPHFSDIEAMILDMDLDPDDQDLYSIEEVLKYQHMETKKKIIRLEQGANAFMQRSIASHGALAILYGRHAEYYIKKSEVLLGRATADCIVDI
        LSI+S+DDVPHFSDIEAMILDMDLDP+DQDLY+ EEVL+YQHM+TKK+I+RLEQGA+A M+RS+ASHGALA+LYGR++++YIKKSEVLLGRAT D IVDI
Subjt:  LSIESDDDVPHFSDIEAMILDMDLDPDDQDLYSIEEVLKYQHMETKKKIIRLEQGANAFMQRSIASHGALAILYGRHAEYYIKKSEVLLGRATADCIVDI

Query:  DLGREGSGNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAWGHCLRLNSGCLIEIRGIPFIFESNPVRMKQYVENIDKTSHKQEYQS
        DLGREGSGNKISRRQAIIKIDQDGFFSLKNLGKCSISINNK+VA GHCLRLNSGCLIEIRG+ FIFES+PV MKQY++NI KTSHKQEYQS
Subjt:  DLGREGSGNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAWGHCLRLNSGCLIEIRGIPFIFESNPVRMKQYVENIDKTSHKQEYQS

XP_022135884.1 uncharacterized protein LOC111007538 isoform X2 [Momordica charantia]0.0e+0078.85Show/hide
Query:  MGALAPVAPWTPEDDILLKNAVEAGASLEALAKGAVQFSRRYTVRELQERWYSLLYDPIVSEEASMSMIDIECSSSILPSKFNKFGNPKETKYIGGKRKS
        MGALAPVAPWTPEDDILLKNA+EAGASLE+LAKGAVQFSRRYTVRELQERW+SLLYDPIVSEEASMSMID E SSSILPSKFNKFGNPKETK IGGKRK 
Subjt:  MGALAPVAPWTPEDDILLKNAVEAGASLEALAKGAVQFSRRYTVRELQERWYSLLYDPIVSEEASMSMIDIECSSSILPSKFNKFGNPKETKYIGGKRKS

Query:  GSVRRCYYALCKRICNEPFNPMDLNYLVG--DSDYVGEEPMQENCVPPISDDFGLPSSELEILPCNFAQNVMNEDDTERTFLSGCQHTVEKHFPQNLDNG
        GSVRRCYYAL KRICNEPFNPMDL++LVG  DS+YV EEPM  +C+PPIS DFGL  SEL ILP NFA N+MN DDTE TF S CQHTVEKHFP NLDN 
Subjt:  GSVRRCYYALCKRICNEPFNPMDLNYLVG--DSDYVGEEPMQENCVPPISDDFGLPSSELEILPCNFAQNVMNEDDTERTFLSGCQHTVEKHFPQNLDNG

Query:  HEGISHIIVDNLPFSANESHIEELAPSTSFPVHSIFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMQLWRD-SSPE
        HEGI HI+ +NLP S NES +EELAPS SFPVHS+FENDLEV+PSTFGQ S DQRAMGSE EDN+VFNSPVSDSGASFHNVEYSSPLPGM +WR+ S+P 
Subjt:  HEGISHIIVDNLPFSANESHIEELAPSTSFPVHSIFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMQLWRD-SSPE

Query:  LPIDVDFANTNIPTGDSFELPDDDDGNNNIQNARLAGYDAHSNSKLKIEVQHDHLKSSNASAEGSYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYYE
        LPIDV F++ ++PTGDSFELP DDDGNNNIQNAR+A YDA S+SKLKIEVQHDHLKS NA+AE  YLAELSNSLLNL+NEDELLFMD DG+DVIDKSYY+
Subjt:  LPIDVDFANTNIPTGDSFELPDDDDGNNNIQNARLAGYDAHSNSKLKIEVQHDHLKSSNASAEGSYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYYE

Query:  GL-SLLLSSPNEVHPDQTTEAINGETVLPMDTMLDPPTACSGELYEKGSHCSNRNLDCSSETHPSPSASLNSQCSGRGDEPLFCSLNTEDPEIPSNDDVF
        GL SLLL+SPNEV+ DQT +A+N ET+LP D+M+DPPTACSGELYEKGSHCS+ +LDCS E HPSPSASLNSQC G+GDEPLFC+LNTEDPEIPSNDDVF
Subjt:  GL-SLLLSSPNEVHPDQTTEAINGETVLPMDTMLDPPTACSGELYEKGSHCSNRNLDCSSETHPSPSASLNSQCSGRGDEPLFCSLNTEDPEIPSNDDVF

Query:  LPPLTTISSMGYHFQDCVDPTFSSIKDFFCKEKSGEKTQNLVQRERKNHGQPRVSSLSIGLHGLPERGERHLVGGAAVSLKLSHSNSIHGSSANK-----
        LPPL+TISSMGYHFQD +D TFSSIKDF C EKSGE TQNLVQRERKNHGQP VSSLSIGLHGLPERGE+HLVGGAAV+LKL HSNSIH  SAN      
Subjt:  LPPLTTISSMGYHFQDCVDPTFSSIKDFFCKEKSGEKTQNLVQRERKNHGQPRVSSLSIGLHGLPERGERHLVGGAAVSLKLSHSNSIHGSSANK-----

Query:  -TRSINANNDAILPVSLKEESNEISRVNHLGQNFLNTHVEKPGFESVSRYPPSATCGIKQEPDTLTTVKDHQLSQELGARGVFNVEQDGVSLKSDQEELS
         + SINAN DAILPV+LKEES EISR                                  EPD LT VKDH+LSQE G+RGVF VEQDG+S  SDQEELS
Subjt:  -TRSINANNDAILPVSLKEESNEISRVNHLGQNFLNTHVEKPGFESVSRYPPSATCGIKQEPDTLTTVKDHQLSQELGARGVFNVEQDGVSLKSDQEELS

Query:  IESDDDVPHFSDIEAMILDMDLDPDDQDLYSIEEVLKYQHMETKKKIIRLEQGANAFMQRSIASHGALAILYGRHAEYYIKKSEVLLGRATADCIVDIDL
        I+S+DDVPHFSDIEAMILDMDLDP+DQDLY+ EEVL+YQHM+TKK+I+RLEQGA+A M+RS+ASHGALA+LYGR++++YIKKSEVLLGRAT D IVDIDL
Subjt:  IESDDDVPHFSDIEAMILDMDLDPDDQDLYSIEEVLKYQHMETKKKIIRLEQGANAFMQRSIASHGALAILYGRHAEYYIKKSEVLLGRATADCIVDIDL

Query:  GREGSGNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAWGHCLRLNSGCLIEIRGIPFIFESNPVRMKQYVENIDKTSHKQEYQS
        GREGSGNKISRRQAIIKIDQDGFFSLKNLGKCSISINNK+VA GHCLRLNSGCLIEIRG+ FIFES+PV MKQY++NI KTSHKQEYQS
Subjt:  GREGSGNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAWGHCLRLNSGCLIEIRGIPFIFESNPVRMKQYVENIDKTSHKQEYQS

XP_022963643.1 uncharacterized protein LOC111463912 isoform X1 [Cucurbita moschata]0.0e+0077.88Show/hide
Query:  MGALAPVAPWTPEDDILLKNAVEAGASLEALAKGAVQFSRRYTVRELQERWYSLLYDPIVSEEASMSMIDIECSSSILPSKFNKFGNPKETKYIGGKRKS
        MGALAPVAPWTPEDDILLKNAVEAGASLE+LAKGAVQFSRRYTVRELQERW+SLLYDPIVSE+ASMSMID E SSSILPSKFN+FGNPKETKYIGGKRKS
Subjt:  MGALAPVAPWTPEDDILLKNAVEAGASLEALAKGAVQFSRRYTVRELQERWYSLLYDPIVSEEASMSMIDIECSSSILPSKFNKFGNPKETKYIGGKRKS

Query:  GSVRRCYYALCKRICNEPF-NPMDLNYLVG--DSDYVGEEPMQENCVPPISDDFGLPSSELEILPCNFAQNVMNEDDTERTFLSGCQHTVEKHFPQNLDN
        GSVR CYYAL KRICNEPF NPMDLN+LVG  +S+YV EEPM  NC+PPISDDFGL SSE+ ILPC+F+QNVMN DD E TF SGCQ TVEKHFP+NLDN
Subjt:  GSVRRCYYALCKRICNEPF-NPMDLNYLVG--DSDYVGEEPMQENCVPPISDDFGLPSSELEILPCNFAQNVMNEDDTERTFLSGCQHTVEKHFPQNLDN

Query:  GHEGISHIIVDNLPFSANESHIEELAPSTSFPVHSIFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMQLWRD-SSP
        G EGISH + ++LP SA +SH+EELAPST FPVHS+FENDLE +PSTFGQLSNDQRAMGSE EDN+VFNSPVS+SGASFHNVEYSSPLPGM +WR+ S+P
Subjt:  GHEGISHIIVDNLPFSANESHIEELAPSTSFPVHSIFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMQLWRD-SSP

Query:  ELPIDVDFANTNIPTGDSFELPDDDDGNNNIQNARLAGYDAHSNSKLKIEVQHDHLKSSNASAEGSYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYY
         LPIDV FA+ +IPT +SFELP DDDGN NIQNAR+AGYDA+S+ KLKIEV+ DHLKS NA+AE  YLAELSNSL+N+SNEDELLFMDVDG+D +DKSYY
Subjt:  ELPIDVDFANTNIPTGDSFELPDDDDGNNNIQNARLAGYDAHSNSKLKIEVQHDHLKSSNASAEGSYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYY

Query:  EGL-SLLLSSPNEVHPDQTTEAINGETVLPMDTMLDPPTACSGELYEKGSHCSNRNLDCSSETHPSPSASLNSQCSGRGDEPLFCSLNTEDPEIPSNDDV
        +GL SLLL+SPNE++ DQT  AIN ETVLP DTM+DPPTACSG LYEKGSHC   +LDC+SE H SPSASLN+QC  +GDEPLFC+LNTEDP+IPSNDDV
Subjt:  EGL-SLLLSSPNEVHPDQTTEAINGETVLPMDTMLDPPTACSGELYEKGSHCSNRNLDCSSETHPSPSASLNSQCSGRGDEPLFCSLNTEDPEIPSNDDV

Query:  FLPPLTTISSMGYHFQDCVDPTFSSIKDFFCKEKSGEKTQNLVQRERKNHGQPRVSSLSIGLHGLPERGERHLVGGAAVSLKLSHSNSIHGSSANKTRSI
        FLPPL+T+++MGY+FQDC++ TFSS KDF   EKSGE TQNL  RERKNHG    + ++  LHG  ERGE+H VGGA V+ + SHSN+ H  S +   SI
Subjt:  FLPPLTTISSMGYHFQDCVDPTFSSIKDFFCKEKSGEKTQNLVQRERKNHGQPRVSSLSIGLHGLPERGERHLVGGAAVSLKLSHSNSIHGSSANKTRSI

Query:  NANNDAILPVSLKEESNEISRVNHLGQNFLNTHVEKPGFES--VSRYPPSATCGIKQEPDTLTTVKDHQLSQELGARGVFNVEQDGVSLKSDQEELSIES
        N N+DA LP  LKEE+NEISRVNHLG+NFLN H EKPGF+S  V  YPPSA C IKQEPD L ++KDH+LSQE G RG F VEQ G+S  SDQEELSI+S
Subjt:  NANNDAILPVSLKEESNEISRVNHLGQNFLNTHVEKPGFES--VSRYPPSATCGIKQEPDTLTTVKDHQLSQELGARGVFNVEQDGVSLKSDQEELSIES

Query:  DDDVPHFSDIEAMILDMDLDPDDQDLYSIEEVLKYQHMETKKKIIRLEQGANAFMQRSIASHGALAILYGRHAEYYIKKSEVLLGRATADCIVDIDLGRE
        +DDVPHFSDIEAMILDMDLDP+DQDLYS EEVLKYQH++TKK+IIRLEQGANA+MQRS ASHGALA+LYGR++++YIKKSEVLLGRAT D IVDIDLGRE
Subjt:  DDDVPHFSDIEAMILDMDLDPDDQDLYSIEEVLKYQHMETKKKIIRLEQGANAFMQRSIASHGALAILYGRHAEYYIKKSEVLLGRATADCIVDIDLGRE

Query:  GSGNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAWGHCLRLNSGCLIEIRGIPFIFESNPVRMKQYVENIDKTSHKQEYQS
        GSGNKISRRQAIIK+DQDGFFSLKNLGKCSISINNKDVA GHCLRLNSGCLIEIRG+PFIFESNP RMKQYV+N+ K SHKQEYQS
Subjt:  GSGNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAWGHCLRLNSGCLIEIRGIPFIFESNPVRMKQYVENIDKTSHKQEYQS

XP_023554018.1 uncharacterized protein LOC111811415 isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0078.44Show/hide
Query:  MGALAPVAPWTPEDDILLKNAVEAGASLEALAKGAVQFSRRYTVRELQERWYSLLYDPIVSEEASMSMIDIECSSSILPSKFNKFGNPKETKYIGGKRKS
        MGALAPVAPWTPEDDILLKNAVEAGASLE+LAKGAVQFSRRYTVRELQERW+SLLYDPIVSE+ASMSMID E SSSILPSKFNKFGNPKETKYIGGKRKS
Subjt:  MGALAPVAPWTPEDDILLKNAVEAGASLEALAKGAVQFSRRYTVRELQERWYSLLYDPIVSEEASMSMIDIECSSSILPSKFNKFGNPKETKYIGGKRKS

Query:  GSVRRCYYALCKRICNEPF-NPMDLNYLVG--DSDYVGEEPMQENCVPPISDDFGLPSSELEILPCNFAQNVMNEDDTERTFLSGCQHTVEKHFPQNLDN
        GSVR CYYAL KR+CNEPF NPMDLN+LVG  +S+YV EEPM  NC+PPISDDFGL SSE+ ILPC+F+QNVMN DD E TF SGCQ TVEKHFP+NLDN
Subjt:  GSVRRCYYALCKRICNEPF-NPMDLNYLVG--DSDYVGEEPMQENCVPPISDDFGLPSSELEILPCNFAQNVMNEDDTERTFLSGCQHTVEKHFPQNLDN

Query:  GHEGISHIIVDNLPFSANESHIEELAPSTSFPVHSIFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMQLWRDSS-P
        G EGISH + ++LP SA +SH+E LAPST FPVHSIFENDLE +PSTFGQLSNDQR MGSE EDN+VFNSPVS+SGASFHNVEYSSPLPGM +WR++S P
Subjt:  GHEGISHIIVDNLPFSANESHIEELAPSTSFPVHSIFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMQLWRDSS-P

Query:  ELPIDVDFANTNIPTGDSFELPDDDDGNNNIQNARLAGYDAHSNSKLKIEVQHDHLKSSNASAEGSYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYY
         LPIDV FA+ +IPT +SFELP DDDGN NIQNAR+AGYDA+S+ KLKIEV+ DHLKS NA+AE  YLAELSNSL+N+SNEDELLFMDVDG+D +DKSYY
Subjt:  ELPIDVDFANTNIPTGDSFELPDDDDGNNNIQNARLAGYDAHSNSKLKIEVQHDHLKSSNASAEGSYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYY

Query:  EGL-SLLLSSPNEVHPDQTTEAINGETVLPMDTMLDPPTACSGELYEKGSHCSNRNLDCSSETHPSPSASLNSQCSGRGDEPLFCSLNTEDPEIPSNDDV
        +GL SLLL+SPNE++ DQTT AIN ETVLP DTM+DPPTACSG LYEKGS C   +LDC+SE H SPSASLNSQC  +GDEPLFC+LNTEDP+IPSNDDV
Subjt:  EGL-SLLLSSPNEVHPDQTTEAINGETVLPMDTMLDPPTACSGELYEKGSHCSNRNLDCSSETHPSPSASLNSQCSGRGDEPLFCSLNTEDPEIPSNDDV

Query:  FLPPLTTISSMGYHFQDCVDPTFSSIKDFFCKEKSGEKTQNLVQRERKNHGQPRVSSLSIGLHGLPERGERHLVGGAAVSLKLSHSNSIHGSSANKTRSI
        FLPPL+T+++MGY+FQDC++ TFSS KDF   EKSGE TQNL  RERKNHGQPRV S   GLHG  ERGE+H VGGA V+ + SHSN+ H  S +   SI
Subjt:  FLPPLTTISSMGYHFQDCVDPTFSSIKDFFCKEKSGEKTQNLVQRERKNHGQPRVSSLSIGLHGLPERGERHLVGGAAVSLKLSHSNSIHGSSANKTRSI

Query:  NANNDAILPVSLKEESNEISRVNHLGQNFLNTHVEKPGFES--VSRYPPSATCGIKQEPDTLTTVKDHQLSQELGARGVFNVEQDGVSLKSDQEELSIES
        N N+DA LP  LKEE+NEISRVNHLG+NFLN H +KPGF+S  V  YPPSA C IKQEPD L ++KDH+LSQE G RG F VEQ G+S  SDQEELSI+S
Subjt:  NANNDAILPVSLKEESNEISRVNHLGQNFLNTHVEKPGFES--VSRYPPSATCGIKQEPDTLTTVKDHQLSQELGARGVFNVEQDGVSLKSDQEELSIES

Query:  DDDVPHFSDIEAMILDMDLDPDDQDLYSIEEVLKYQHMETKKKIIRLEQGANAFMQRSIASHGALAILYGRHAEYYIKKSEVLLGRATADCIVDIDLGRE
        +DDVPHFSDIEAMILDMDLDP+DQDLYS EEVLKYQH++TKK+IIRLEQGANA+MQRS ASHGALA+LYGR++++YIKKSEVLLGRAT D IVDIDLGRE
Subjt:  DDDVPHFSDIEAMILDMDLDPDDQDLYSIEEVLKYQHMETKKKIIRLEQGANAFMQRSIASHGALAILYGRHAEYYIKKSEVLLGRATADCIVDIDLGRE

Query:  GSGNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAWGHCLRLNSGCLIEIRGIPFIFESNPVRMKQYVENIDKTSHKQEYQS
        GSGNKISRRQAIIK+DQDGFFSLKNLGKCSISINNKDVA GHCLRLNSGCLIEIRG+PFIFESNP RMKQYV+N+ K SHKQEYQS
Subjt:  GSGNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAWGHCLRLNSGCLIEIRGIPFIFESNPVRMKQYVENIDKTSHKQEYQS

TrEMBL top hitse value%identityAlignment
A0A6J1C2Q8 uncharacterized protein LOC111007538 isoform X20.0e+0078.85Show/hide
Query:  MGALAPVAPWTPEDDILLKNAVEAGASLEALAKGAVQFSRRYTVRELQERWYSLLYDPIVSEEASMSMIDIECSSSILPSKFNKFGNPKETKYIGGKRKS
        MGALAPVAPWTPEDDILLKNA+EAGASLE+LAKGAVQFSRRYTVRELQERW+SLLYDPIVSEEASMSMID E SSSILPSKFNKFGNPKETK IGGKRK 
Subjt:  MGALAPVAPWTPEDDILLKNAVEAGASLEALAKGAVQFSRRYTVRELQERWYSLLYDPIVSEEASMSMIDIECSSSILPSKFNKFGNPKETKYIGGKRKS

Query:  GSVRRCYYALCKRICNEPFNPMDLNYLVG--DSDYVGEEPMQENCVPPISDDFGLPSSELEILPCNFAQNVMNEDDTERTFLSGCQHTVEKHFPQNLDNG
        GSVRRCYYAL KRICNEPFNPMDL++LVG  DS+YV EEPM  +C+PPIS DFGL  SEL ILP NFA N+MN DDTE TF S CQHTVEKHFP NLDN 
Subjt:  GSVRRCYYALCKRICNEPFNPMDLNYLVG--DSDYVGEEPMQENCVPPISDDFGLPSSELEILPCNFAQNVMNEDDTERTFLSGCQHTVEKHFPQNLDNG

Query:  HEGISHIIVDNLPFSANESHIEELAPSTSFPVHSIFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMQLWRD-SSPE
        HEGI HI+ +NLP S NES +EELAPS SFPVHS+FENDLEV+PSTFGQ S DQRAMGSE EDN+VFNSPVSDSGASFHNVEYSSPLPGM +WR+ S+P 
Subjt:  HEGISHIIVDNLPFSANESHIEELAPSTSFPVHSIFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMQLWRD-SSPE

Query:  LPIDVDFANTNIPTGDSFELPDDDDGNNNIQNARLAGYDAHSNSKLKIEVQHDHLKSSNASAEGSYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYYE
        LPIDV F++ ++PTGDSFELP DDDGNNNIQNAR+A YDA S+SKLKIEVQHDHLKS NA+AE  YLAELSNSLLNL+NEDELLFMD DG+DVIDKSYY+
Subjt:  LPIDVDFANTNIPTGDSFELPDDDDGNNNIQNARLAGYDAHSNSKLKIEVQHDHLKSSNASAEGSYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYYE

Query:  GL-SLLLSSPNEVHPDQTTEAINGETVLPMDTMLDPPTACSGELYEKGSHCSNRNLDCSSETHPSPSASLNSQCSGRGDEPLFCSLNTEDPEIPSNDDVF
        GL SLLL+SPNEV+ DQT +A+N ET+LP D+M+DPPTACSGELYEKGSHCS+ +LDCS E HPSPSASLNSQC G+GDEPLFC+LNTEDPEIPSNDDVF
Subjt:  GL-SLLLSSPNEVHPDQTTEAINGETVLPMDTMLDPPTACSGELYEKGSHCSNRNLDCSSETHPSPSASLNSQCSGRGDEPLFCSLNTEDPEIPSNDDVF

Query:  LPPLTTISSMGYHFQDCVDPTFSSIKDFFCKEKSGEKTQNLVQRERKNHGQPRVSSLSIGLHGLPERGERHLVGGAAVSLKLSHSNSIHGSSANK-----
        LPPL+TISSMGYHFQD +D TFSSIKDF C EKSGE TQNLVQRERKNHGQP VSSLSIGLHGLPERGE+HLVGGAAV+LKL HSNSIH  SAN      
Subjt:  LPPLTTISSMGYHFQDCVDPTFSSIKDFFCKEKSGEKTQNLVQRERKNHGQPRVSSLSIGLHGLPERGERHLVGGAAVSLKLSHSNSIHGSSANK-----

Query:  -TRSINANNDAILPVSLKEESNEISRVNHLGQNFLNTHVEKPGFESVSRYPPSATCGIKQEPDTLTTVKDHQLSQELGARGVFNVEQDGVSLKSDQEELS
         + SINAN DAILPV+LKEES EISR                                  EPD LT VKDH+LSQE G+RGVF VEQDG+S  SDQEELS
Subjt:  -TRSINANNDAILPVSLKEESNEISRVNHLGQNFLNTHVEKPGFESVSRYPPSATCGIKQEPDTLTTVKDHQLSQELGARGVFNVEQDGVSLKSDQEELS

Query:  IESDDDVPHFSDIEAMILDMDLDPDDQDLYSIEEVLKYQHMETKKKIIRLEQGANAFMQRSIASHGALAILYGRHAEYYIKKSEVLLGRATADCIVDIDL
        I+S+DDVPHFSDIEAMILDMDLDP+DQDLY+ EEVL+YQHM+TKK+I+RLEQGA+A M+RS+ASHGALA+LYGR++++YIKKSEVLLGRAT D IVDIDL
Subjt:  IESDDDVPHFSDIEAMILDMDLDPDDQDLYSIEEVLKYQHMETKKKIIRLEQGANAFMQRSIASHGALAILYGRHAEYYIKKSEVLLGRATADCIVDIDL

Query:  GREGSGNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAWGHCLRLNSGCLIEIRGIPFIFESNPVRMKQYVENIDKTSHKQEYQS
        GREGSGNKISRRQAIIKIDQDGFFSLKNLGKCSISINNK+VA GHCLRLNSGCLIEIRG+ FIFES+PV MKQY++NI KTSHKQEYQS
Subjt:  GREGSGNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAWGHCLRLNSGCLIEIRGIPFIFESNPVRMKQYVENIDKTSHKQEYQS

A0A6J1C5Q1 uncharacterized protein LOC111007538 isoform X10.0e+0081.71Show/hide
Query:  MGALAPVAPWTPEDDILLKNAVEAGASLEALAKGAVQFSRRYTVRELQERWYSLLYDPIVSEEASMSMIDIECSSSILPSKFNKFGNPKETKYIGGKRKS
        MGALAPVAPWTPEDDILLKNA+EAGASLE+LAKGAVQFSRRYTVRELQERW+SLLYDPIVSEEASMSMID E SSSILPSKFNKFGNPKETK IGGKRK 
Subjt:  MGALAPVAPWTPEDDILLKNAVEAGASLEALAKGAVQFSRRYTVRELQERWYSLLYDPIVSEEASMSMIDIECSSSILPSKFNKFGNPKETKYIGGKRKS

Query:  GSVRRCYYALCKRICNEPFNPMDLNYLVG--DSDYVGEEPMQENCVPPISDDFGLPSSELEILPCNFAQNVMNEDDTERTFLSGCQHTVEKHFPQNLDNG
        GSVRRCYYAL KRICNEPFNPMDL++LVG  DS+YV EEPM  +C+PPIS DFGL  SEL ILP NFA N+MN DDTE TF S CQHTVEKHFP NLDN 
Subjt:  GSVRRCYYALCKRICNEPFNPMDLNYLVG--DSDYVGEEPMQENCVPPISDDFGLPSSELEILPCNFAQNVMNEDDTERTFLSGCQHTVEKHFPQNLDNG

Query:  HEGISHIIVDNLPFSANESHIEELAPSTSFPVHSIFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMQLWRD-SSPE
        HEGI HI+ +NLP S NES +EELAPS SFPVHS+FENDLEV+PSTFGQ S DQRAMGSE EDN+VFNSPVSDSGASFHNVEYSSPLPGM +WR+ S+P 
Subjt:  HEGISHIIVDNLPFSANESHIEELAPSTSFPVHSIFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMQLWRD-SSPE

Query:  LPIDVDFANTNIPTGDSFELPDDDDGNNNIQNARLAGYDAHSNSKLKIEVQHDHLKSSNASAEGSYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYYE
        LPIDV F++ ++PTGDSFELP DDDGNNNIQNAR+A YDA S+SKLKIEVQHDHLKS NA+AE  YLAELSNSLLNL+NEDELLFMD DG+DVIDKSYY+
Subjt:  LPIDVDFANTNIPTGDSFELPDDDDGNNNIQNARLAGYDAHSNSKLKIEVQHDHLKSSNASAEGSYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYYE

Query:  GL-SLLLSSPNEVHPDQTTEAINGETVLPMDTMLDPPTACSGELYEKGSHCSNRNLDCSSETHPSPSASLNSQCSGRGDEPLFCSLNTEDPEIPSNDDVF
        GL SLLL+SPNEV+ DQT +A+N ET+LP D+M+DPPTACSGELYEKGSHCS+ +LDCS E HPSPSASLNSQC G+GDEPLFC+LNTEDPEIPSNDDVF
Subjt:  GL-SLLLSSPNEVHPDQTTEAINGETVLPMDTMLDPPTACSGELYEKGSHCSNRNLDCSSETHPSPSASLNSQCSGRGDEPLFCSLNTEDPEIPSNDDVF

Query:  LPPLTTISSMGYHFQDCVDPTFSSIKDFFCKEKSGEKTQNLVQRERKNHGQPRVSSLSIGLHGLPERGERHLVGGAAVSLKLSHSNSIHGSSANK-----
        LPPL+TISSMGYHFQD +D TFSSIKDF C EKSGE TQNLVQRERKNHGQP VSSLSIGLHGLPERGE+HLVGGAAV+LKL HSNSIH  SAN      
Subjt:  LPPLTTISSMGYHFQDCVDPTFSSIKDFFCKEKSGEKTQNLVQRERKNHGQPRVSSLSIGLHGLPERGERHLVGGAAVSLKLSHSNSIHGSSANK-----

Query:  -TRSINANNDAILPVSLKEESNEISRVNHLGQNFLNTHVEKPGFESVS--RYPPSATCGIKQEPDTLTTVKDHQLSQELGARGVFNVEQDGVSLKSDQEE
         + SINAN DAILPV+LKEES EISRVNHLGQNFLNTHVEKPGF+S +  +YPPS   GIKQEPD LT VKDH+LSQE G+RGVF VEQDG+S  SDQEE
Subjt:  -TRSINANNDAILPVSLKEESNEISRVNHLGQNFLNTHVEKPGFESVS--RYPPSATCGIKQEPDTLTTVKDHQLSQELGARGVFNVEQDGVSLKSDQEE

Query:  LSIESDDDVPHFSDIEAMILDMDLDPDDQDLYSIEEVLKYQHMETKKKIIRLEQGANAFMQRSIASHGALAILYGRHAEYYIKKSEVLLGRATADCIVDI
        LSI+S+DDVPHFSDIEAMILDMDLDP+DQDLY+ EEVL+YQHM+TKK+I+RLEQGA+A M+RS+ASHGALA+LYGR++++YIKKSEVLLGRAT D IVDI
Subjt:  LSIESDDDVPHFSDIEAMILDMDLDPDDQDLYSIEEVLKYQHMETKKKIIRLEQGANAFMQRSIASHGALAILYGRHAEYYIKKSEVLLGRATADCIVDI

Query:  DLGREGSGNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAWGHCLRLNSGCLIEIRGIPFIFESNPVRMKQYVENIDKTSHKQEYQS
        DLGREGSGNKISRRQAIIKIDQDGFFSLKNLGKCSISINNK+VA GHCLRLNSGCLIEIRG+ FIFES+PV MKQY++NI KTSHKQEYQS
Subjt:  DLGREGSGNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAWGHCLRLNSGCLIEIRGIPFIFESNPVRMKQYVENIDKTSHKQEYQS

A0A6J1HIH5 uncharacterized protein LOC111463912 isoform X10.0e+0077.88Show/hide
Query:  MGALAPVAPWTPEDDILLKNAVEAGASLEALAKGAVQFSRRYTVRELQERWYSLLYDPIVSEEASMSMIDIECSSSILPSKFNKFGNPKETKYIGGKRKS
        MGALAPVAPWTPEDDILLKNAVEAGASLE+LAKGAVQFSRRYTVRELQERW+SLLYDPIVSE+ASMSMID E SSSILPSKFN+FGNPKETKYIGGKRKS
Subjt:  MGALAPVAPWTPEDDILLKNAVEAGASLEALAKGAVQFSRRYTVRELQERWYSLLYDPIVSEEASMSMIDIECSSSILPSKFNKFGNPKETKYIGGKRKS

Query:  GSVRRCYYALCKRICNEPF-NPMDLNYLVG--DSDYVGEEPMQENCVPPISDDFGLPSSELEILPCNFAQNVMNEDDTERTFLSGCQHTVEKHFPQNLDN
        GSVR CYYAL KRICNEPF NPMDLN+LVG  +S+YV EEPM  NC+PPISDDFGL SSE+ ILPC+F+QNVMN DD E TF SGCQ TVEKHFP+NLDN
Subjt:  GSVRRCYYALCKRICNEPF-NPMDLNYLVG--DSDYVGEEPMQENCVPPISDDFGLPSSELEILPCNFAQNVMNEDDTERTFLSGCQHTVEKHFPQNLDN

Query:  GHEGISHIIVDNLPFSANESHIEELAPSTSFPVHSIFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMQLWRD-SSP
        G EGISH + ++LP SA +SH+EELAPST FPVHS+FENDLE +PSTFGQLSNDQRAMGSE EDN+VFNSPVS+SGASFHNVEYSSPLPGM +WR+ S+P
Subjt:  GHEGISHIIVDNLPFSANESHIEELAPSTSFPVHSIFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMQLWRD-SSP

Query:  ELPIDVDFANTNIPTGDSFELPDDDDGNNNIQNARLAGYDAHSNSKLKIEVQHDHLKSSNASAEGSYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYY
         LPIDV FA+ +IPT +SFELP DDDGN NIQNAR+AGYDA+S+ KLKIEV+ DHLKS NA+AE  YLAELSNSL+N+SNEDELLFMDVDG+D +DKSYY
Subjt:  ELPIDVDFANTNIPTGDSFELPDDDDGNNNIQNARLAGYDAHSNSKLKIEVQHDHLKSSNASAEGSYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYY

Query:  EGL-SLLLSSPNEVHPDQTTEAINGETVLPMDTMLDPPTACSGELYEKGSHCSNRNLDCSSETHPSPSASLNSQCSGRGDEPLFCSLNTEDPEIPSNDDV
        +GL SLLL+SPNE++ DQT  AIN ETVLP DTM+DPPTACSG LYEKGSHC   +LDC+SE H SPSASLN+QC  +GDEPLFC+LNTEDP+IPSNDDV
Subjt:  EGL-SLLLSSPNEVHPDQTTEAINGETVLPMDTMLDPPTACSGELYEKGSHCSNRNLDCSSETHPSPSASLNSQCSGRGDEPLFCSLNTEDPEIPSNDDV

Query:  FLPPLTTISSMGYHFQDCVDPTFSSIKDFFCKEKSGEKTQNLVQRERKNHGQPRVSSLSIGLHGLPERGERHLVGGAAVSLKLSHSNSIHGSSANKTRSI
        FLPPL+T+++MGY+FQDC++ TFSS KDF   EKSGE TQNL  RERKNHG    + ++  LHG  ERGE+H VGGA V+ + SHSN+ H  S +   SI
Subjt:  FLPPLTTISSMGYHFQDCVDPTFSSIKDFFCKEKSGEKTQNLVQRERKNHGQPRVSSLSIGLHGLPERGERHLVGGAAVSLKLSHSNSIHGSSANKTRSI

Query:  NANNDAILPVSLKEESNEISRVNHLGQNFLNTHVEKPGFES--VSRYPPSATCGIKQEPDTLTTVKDHQLSQELGARGVFNVEQDGVSLKSDQEELSIES
        N N+DA LP  LKEE+NEISRVNHLG+NFLN H EKPGF+S  V  YPPSA C IKQEPD L ++KDH+LSQE G RG F VEQ G+S  SDQEELSI+S
Subjt:  NANNDAILPVSLKEESNEISRVNHLGQNFLNTHVEKPGFES--VSRYPPSATCGIKQEPDTLTTVKDHQLSQELGARGVFNVEQDGVSLKSDQEELSIES

Query:  DDDVPHFSDIEAMILDMDLDPDDQDLYSIEEVLKYQHMETKKKIIRLEQGANAFMQRSIASHGALAILYGRHAEYYIKKSEVLLGRATADCIVDIDLGRE
        +DDVPHFSDIEAMILDMDLDP+DQDLYS EEVLKYQH++TKK+IIRLEQGANA+MQRS ASHGALA+LYGR++++YIKKSEVLLGRAT D IVDIDLGRE
Subjt:  DDDVPHFSDIEAMILDMDLDPDDQDLYSIEEVLKYQHMETKKKIIRLEQGANAFMQRSIASHGALAILYGRHAEYYIKKSEVLLGRATADCIVDIDLGRE

Query:  GSGNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAWGHCLRLNSGCLIEIRGIPFIFESNPVRMKQYVENIDKTSHKQEYQS
        GSGNKISRRQAIIK+DQDGFFSLKNLGKCSISINNKDVA GHCLRLNSGCLIEIRG+PFIFESNP RMKQYV+N+ K SHKQEYQS
Subjt:  GSGNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAWGHCLRLNSGCLIEIRGIPFIFESNPVRMKQYVENIDKTSHKQEYQS

A0A6J1HR64 uncharacterized protein LOC111467030 isoform X10.0e+0077.45Show/hide
Query:  MGALAPVAPWTPEDDILLKNAVEAGASLEALAKGAVQFSRRYTVRELQERWYSLLYDPIVSEEASMSMIDIECSSSILPSKFNKFGNPKETKYIGGKRKS
        MGALAPV PWTPEDDILLKNAVEAGASLE+LAKGAVQFSRRYTVRELQERW+SLLYDPIVSE+ASMSMID E SSSILPSKFNKFGNPKETKYIGGKRKS
Subjt:  MGALAPVAPWTPEDDILLKNAVEAGASLEALAKGAVQFSRRYTVRELQERWYSLLYDPIVSEEASMSMIDIECSSSILPSKFNKFGNPKETKYIGGKRKS

Query:  GSVRRCYYALCKRICNEPF-NPMDLNYLVG--DSDYVGEEPMQENCVPPISDDFGLPSSELEILPCNFAQNVMNEDDTERTFLSGCQHTVEKHFPQNLDN
        GSVR CYYAL KRICNEPF NPM+L++LVG  +S+YV EEPM  NC+PPISDDFGL SSEL ILPC+F+QNVMN DD + TF SGCQ TVEKHFP+NLDN
Subjt:  GSVRRCYYALCKRICNEPF-NPMDLNYLVG--DSDYVGEEPMQENCVPPISDDFGLPSSELEILPCNFAQNVMNEDDTERTFLSGCQHTVEKHFPQNLDN

Query:  GHEGISHIIVDNLPFSANESHIEELAPSTSFPVHSIFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMQLWRD-SSP
        G EGISH + ++LP SA +SH+EELAPST FPVHS+FENDLE +PSTFGQLSNDQRAMGSE EDN+VFNSPVS+SGASFHNVEYSSPLPGM +WR+ S+P
Subjt:  GHEGISHIIVDNLPFSANESHIEELAPSTSFPVHSIFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMQLWRD-SSP

Query:  ELPIDVDFANTNIPTGDSFELPDDDDGNNNIQNARLAGYDAHSNSKLKIEVQHDHLKSSNASAEGSYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYY
         LPIDV FA+ +IPT +SFELP DDDGN NIQNA +AGYDA+++ KLK EV+ DHLKS NA+AE  YLAELSNSL+N+SNEDELLFMDVDG+D +DKSYY
Subjt:  ELPIDVDFANTNIPTGDSFELPDDDDGNNNIQNARLAGYDAHSNSKLKIEVQHDHLKSSNASAEGSYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYY

Query:  EGL-SLLLSSPNEVHPDQTTEAINGETVLPMDTMLDPPTACSGELYEKGSHCSNRNLDCSSETHPSPSASLNSQCSGRGDEPLFCSLNTEDPEIPSNDDV
        +GL SLLL+SPNE++ DQTT AIN ET+LP DTM+DPPTACS  LYEKGSHC   +LDC+SE H SPSASLNS C  + DEPLFC+LNTEDP+IPSNDDV
Subjt:  EGL-SLLLSSPNEVHPDQTTEAINGETVLPMDTMLDPPTACSGELYEKGSHCSNRNLDCSSETHPSPSASLNSQCSGRGDEPLFCSLNTEDPEIPSNDDV

Query:  FLPPLTTISSMGYHFQDCVDPTFSSIKDFFCKEKSGEKTQNLVQRERKNHGQPRVSSLSIGLHGLPERGERHLVGGAAVSLKLSHSNSIHGSSANKTRSI
        FLPPL+T+++MGY+FQDC+  TFSS KDF   EKSGE TQNL  RERKNHGQ RV S   GL+G  ERGE+H VGGA V+ + SHSN+ H  S +   SI
Subjt:  FLPPLTTISSMGYHFQDCVDPTFSSIKDFFCKEKSGEKTQNLVQRERKNHGQPRVSSLSIGLHGLPERGERHLVGGAAVSLKLSHSNSIHGSSANKTRSI

Query:  NANNDAILPVSLKEESNEISRVNHLGQNFLNTHVEKPGFES--VSRYPPSATCGIKQEPDTLTTVKDHQLSQELGARGVFNVEQDGVSLKSDQEELSIES
        N N+DA LP  LKEE+NEISRVNHLG+NFLN H EKPGF+S  V  YPPSA C IKQEP+ L ++KDH+LSQE G RG F VEQ G+S  SDQEELSI+S
Subjt:  NANNDAILPVSLKEESNEISRVNHLGQNFLNTHVEKPGFES--VSRYPPSATCGIKQEPDTLTTVKDHQLSQELGARGVFNVEQDGVSLKSDQEELSIES

Query:  DDDVPHFSDIEAMILDMDLDPDDQDLYSIEEVLKYQHMETKKKIIRLEQGANAFMQRSIASHGALAILYGRHAEYYIKKSEVLLGRATADCIVDIDLGRE
        +DDVPHFSDIEAMILDMDLDP+DQDLYS EEVLKYQH++TKK+IIRLEQGANA+MQRS ASHGALA+LYGR++++YIKKSEVLLGRAT D IVDIDLGRE
Subjt:  DDDVPHFSDIEAMILDMDLDPDDQDLYSIEEVLKYQHMETKKKIIRLEQGANAFMQRSIASHGALAILYGRHAEYYIKKSEVLLGRATADCIVDIDLGRE

Query:  GSGNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAWGHCLRLNSGCLIEIRGIPFIFESNPVRMKQYVENI-DKTSHKQEYQS
        GSGNKISRRQAIIK+DQDGFFSLKNLGKCSISINNKDVA GHCLRLNSGCLIEIRG+PFIFESNP RMKQYV+NI  K SHKQEYQS
Subjt:  GSGNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAWGHCLRLNSGCLIEIRGIPFIFESNPVRMKQYVENI-DKTSHKQEYQS

A0A6J1ITH7 uncharacterized protein LOC1114804510.0e+0077.61Show/hide
Query:  MGALAPVAPWTPEDDILLKNAVEAGASLEALAKGAVQFSRRYTVRELQERWYSLLYDPIVSEEASMSMIDIECSSSILPSKFNKFGNPKETKYIGGKRKS
        MGALAP+APW+PEDDILLKNAVEA ASLEALAKGAVQFSRRYTVRELQERW+SLLYDPIVSE+ASMSMID+E SSSILPSKFNKFGN KE K IGGKRKS
Subjt:  MGALAPVAPWTPEDDILLKNAVEAGASLEALAKGAVQFSRRYTVRELQERWYSLLYDPIVSEEASMSMIDIECSSSILPSKFNKFGNPKETKYIGGKRKS

Query:  GSVRRCYYALCKRICNEPFNPMDLNYLVGDSDYVGEEPMQENCVPPISDDFGLPSSELEILPCNFAQNVMNEDDTERTFLSGCQHTVEKHFPQNLDNGHE
        GSVR CYYAL KRICNEPFNPM  +YLVGDSDYV EEPM  NC+PP SDDFGL SSEL  LPCNF+QN MN DDTE TF SGCQHTVE HFPQNLDNGHE
Subjt:  GSVRRCYYALCKRICNEPFNPMDLNYLVGDSDYVGEEPMQENCVPPISDDFGLPSSELEILPCNFAQNVMNEDDTERTFLSGCQHTVEKHFPQNLDNGHE

Query:  GISHIIVDNLPFSANESHIEELAPSTSFPVHSIFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMQLWRDSSPELPI
        GISH++VDNLPF ANESH +ELAPS SFPVH++FENDLE  PSTFGQLS DQR MGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGM +WRD+S  LPI
Subjt:  GISHIIVDNLPFSANESHIEELAPSTSFPVHSIFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMQLWRDSSPELPI

Query:  DVDFANTNIPTGDSFELPDDDDGNNNIQNARLAGYDAHSNSKLKIEVQHDHLKSSNASAEGSYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYYEGLS
        DV FA+ ++PTGDSFELP DDDGNNNIQNARLAGY+AHSNSKLKIEVQHDHLKS NA+AEG YL ELSN++LN SNEDEL FMDVD           GLS
Subjt:  DVDFANTNIPTGDSFELPDDDDGNNNIQNARLAGYDAHSNSKLKIEVQHDHLKSSNASAEGSYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYYEGLS

Query:  LLLSSPNEVHPDQTTEAINGETVLPMDTMLDPPTACSGELYEKGSHCSNRNLDCSSETHPSPSASLNSQCSGRGDEPLFCSLNTEDPEIPSNDDVFLPPL
        LLLSSPNEV+ DQT  AI  ETVLP DTMLDPP+ACSGELYEKGSHCS+ +LDCSSE HPSPS SLNSQC G+ DEPLFC+LNTEDPEIPSNDDVFLPPL
Subjt:  LLLSSPNEVHPDQTTEAINGETVLPMDTMLDPPTACSGELYEKGSHCSNRNLDCSSETHPSPSASLNSQCSGRGDEPLFCSLNTEDPEIPSNDDVFLPPL

Query:  TTISSMGYHFQDCVDPTFSSIKDFFCKEKSGEKTQNLVQRERKNH-GQPRVSSLSIGLHGLPERGERHLVGGAAVSLKLSHSNSIHGSSANKTRSINANN
        +TI++MGYHF DC+ PTFSS  DF CKE SGE TQNLVQRERKNH GQP VS++S+GLH LPERGE+HL+ G  V+LKLSHSNSIH  SANKT SIN N+
Subjt:  TTISSMGYHFQDCVDPTFSSIKDFFCKEKSGEKTQNLVQRERKNH-GQPRVSSLSIGLHGLPERGERHLVGGAAVSLKLSHSNSIHGSSANKTRSINANN

Query:  DAILPVSLKEESNEISRVNHLGQNFLNTHVEKPGFESVSRYPPSATCGIKQEPDTLTTVKDHQLSQELGARGVFNVEQDGVSLKSDQEELSIESDDDVPH
        D ILPV+L EE+NEIS                                   EP+T T+VKDH+LS+E+G +GVF VEQDG+   SD EEL I+S+DD+PH
Subjt:  DAILPVSLKEESNEISRVNHLGQNFLNTHVEKPGFESVSRYPPSATCGIKQEPDTLTTVKDHQLSQELGARGVFNVEQDGVSLKSDQEELSIESDDDVPH

Query:  FSDIEAMILDMDLDPDDQDLYSIEEVLKYQHMETKKKIIRLEQGANAFMQRSIASHGALAILYGRHAEYYIKKSEVLLGRATADCIVDIDLGREGSGNKI
        FSDIEAMILDMDLDP+DQDLY  EEVLKYQH++T+K+IIRLEQG NA MQRSIASHGALA+L GRH+++YIKKSEVLLGRATA+ IVDIDLG EGSGNKI
Subjt:  FSDIEAMILDMDLDPDDQDLYSIEEVLKYQHMETKKKIIRLEQGANAFMQRSIASHGALAILYGRHAEYYIKKSEVLLGRATADCIVDIDLGREGSGNKI

Query:  SRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAWGHCLRLNSGCLIEIRGIPFIFESNPVRMKQYVENIDKTSHKQEYQS
        SRRQAIIKIDQDGFFSLKNLGKCSISINNKDVA GHCLRLNSGCLIEIRG+ FIFESNPVRMKQYV+N+ KTSHKQEYQS
Subjt:  SRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAWGHCLRLNSGCLIEIRGIPFIFESNPVRMKQYVENIDKTSHKQEYQS

SwissProt top hitse value%identityAlignment
Q96EZ8 Microspherule protein 17.3e-1532.43Show/hide
Query:  LKSDQEELSIESDDDVPHFSDIEAMILDMDLDPDDQDLYSIEEVLKYQHMETKKKIIRLEQGANAF---------MQRSIASHGALAILYGRHAEYYIKK
        L  DQ    +   D V +FSD E +I D  L     ++  +E  L       K++I +LEQ  + +         M      +  LA+L GR   Y ++ 
Subjt:  LKSDQEELSIESDDDVPHFSDIEAMILDMDLDPDDQDLYSIEEVLKYQHMETKKKIIRLEQGANAF---------MQRSIASHGALAILYGRHAEYYIKK

Query:  SEVLLGRATADCIVDIDLGREGSGNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAWGHCLRLNSGCLIEIRGIPFIFESN
         E+ LGRAT D  +D+DL  EG   KISR+Q +IK+  +G F + N G+  I I+ + V  G   RL++  ++EI  + F+F  N
Subjt:  SEVLLGRATADCIVDIDLGREGSGNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAWGHCLRLNSGCLIEIRGIPFIFESN

Q96EZ8 Microspherule protein 11.7e-0852.54Show/hide
Query:  WTPEDDILLKNAVEAGASLEALAKGAVQFSRRYTVRELQERWYSLLYDPIVSEEASMSM
        W P DD+LL NAV     L ++  G V+FS R+T+RE+QERWY+LLYDP++S+ A  +M
Subjt:  WTPEDDILLKNAVEAGASLEALAKGAVQFSRRYTVRELQERWYSLLYDPIVSEEASMSM

Q99L90 Microspherule protein 13.3e-1533.51Show/hide
Query:  LKSDQEELSIESDDDVPHFSDIEAMILDMDLDPDDQDLYSIEEVLKYQHMETKKKIIRLEQGANAF--MQRSIASHGA-------LAILYGRHAEYYIKK
        L  DQ    +   D V +FSD E +I D  L     ++  +E  L       K++I +LEQ  + +  +  SI   G+       LA+L GR   Y ++ 
Subjt:  LKSDQEELSIESDDDVPHFSDIEAMILDMDLDPDDQDLYSIEEVLKYQHMETKKKIIRLEQGANAF--MQRSIASHGA-------LAILYGRHAEYYIKK

Query:  SEVLLGRATADCIVDIDLGREGSGNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAWGHCLRLNSGCLIEIRGIPFIFESN
         E+ LGRAT D  +D+DL  EG   KISR+Q +IK+  +G F + N G+  I I+ + V  G   RL++  ++EI  + F+F  N
Subjt:  SEVLLGRATADCIVDIDLGREGSGNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAWGHCLRLNSGCLIEIRGIPFIFESN

Q99L90 Microspherule protein 11.7e-0852.54Show/hide
Query:  WTPEDDILLKNAVEAGASLEALAKGAVQFSRRYTVRELQERWYSLLYDPIVSEEASMSM
        W P DD+LL NAV     L ++  G V+FS R+T+RE+QERWY+LLYDP++S+ A  +M
Subjt:  WTPEDDILLKNAVEAGASLEALAKGAVQFSRRYTVRELQERWYSLLYDPIVSEEASMSM

Arabidopsis top hitse value%identityAlignment
AT1G60700.1 SMAD/FHA domain-containing protein3.8e-2739.58Show/hide
Query:  QEELSIESDDDVPHFSDIEAMILDMDLDPDDQD-LYSIEEVLKYQHMETKKKIIRLEQGANAFMQRSIASHGALAILYGRHAEYYIKKSEVLLGRATADC
        QEE  ++ ++++    DI+AMI  ++L PDD D  ++ EE    +H   +  +I LEQ     MQR+I  HGA+A+L+   ++++++K EV++GR++   
Subjt:  QEELSIESDDDVPHFSDIEAMILDMDLDPDDQD-LYSIEEVLKYQHMETKKKIIRLEQGANAFMQRSIASHGALAILYGRHAEYYIKKSEVLLGRATADC

Query:  IVDIDLGREGSGNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAWGHCLRLNSGCLIEIRGIPFIFESNPVRMKQYVENIDKTSHKQE
         VDIDLG+   G+KISRRQA++K++  G FSLKNLGK  I +N   +  G  + L S   I IRGI F+F+ N   + Q+++N   T  K E
Subjt:  IVDIDLGREGSGNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAWGHCLRLNSGCLIEIRGIPFIFESNPVRMKQYVENIDKTSHKQE

AT1G75530.1 Forkhead-associated (FHA) domain-containing protein5.7e-4752.15Show/hide
Query:  DQEELSIESDDDVPHFSDIEAMILDMDLDPDDQDLYSIEEVLKYQHMETKKKIIRLEQGANAFMQRSIASHGALAILYGRHAEYYIKKSEVLLGRATADC
        ++  + IESD+++P FSD+EAMILDMDL+P  QD Y + +  KY++ E  +KI+RLEQ A ++M R IA+HGA A+LYG  +++YI K EVLLGRAT + 
Subjt:  DQEELSIESDDDVPHFSDIEAMILDMDLDPDDQDLYSIEEVLKYQHMETKKKIIRLEQGANAFMQRSIASHGALAILYGRHAEYYIKKSEVLLGRATADC

Query:  IVDIDLGREGSGNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAWGHCLRLNSGCLIEIRGIPFIFESNPVRMKQYVENIDK
         VDIDLGR GS  + SRRQA+IK+ QDG F +KNLGK SI +N++++  G  + L + CLI+IR   FIFE N   +K+Y++ I K
Subjt:  IVDIDLGREGSGNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAWGHCLRLNSGCLIEIRGIPFIFESNPVRMKQYVENIDK

AT1G75530.1 Forkhead-associated (FHA) domain-containing protein2.4e-1348.57Show/hide
Query:  WTPEDDILLKNAVEAGASLEALAKGAVQFSRRYTVRELQERWYSLLYDPIVSEEASMSMIDIECSSSILP
        W PEDD LL+ ++E G SLE LAKGAV+FSR++T+ EL ERW+ LLY+P V+  +S    +++  +  +P
Subjt:  WTPEDDILLKNAVEAGASLEALAKGAVQFSRRYTVRELQERWYSLLYDPIVSEEASMSMIDIECSSSILP

AT3G54350.1 Forkhead-associated (FHA) domain-containing protein2.0e-10033.6Show/hide
Query:  MGALAPVAPWTPEDDILLKNAVEAGASLEALAKGAVQFSRRYTVRELQERWYSLLYDPIVSEEASMSMIDIECSSSILPSKFNKFGNPKETKYIGGKRKS
        MGALA V PW PEDD+LLKNAVEAGASLE+LAKGAVQFSRR+++RELQ+RW++LLYDP+VS EA+  M ++E ++   P+KF + G  KE K    KR +
Subjt:  MGALAPVAPWTPEDDILLKNAVEAGASLEALAKGAVQFSRRYTVRELQERWYSLLYDPIVSEEASMSMIDIECSSSILPSKFNKFGNPKETKYIGGKRKS

Query:  GSVRRCYYALCKRICNEPFNPMDLNYLVGDSDYVGEEPMQENCVPPISDDFGLPSSELEILPCNFAQNVMNEDDTERTFLSGCQHTVEKHFPQNLDNGHE
          +R  Y++L K+   EPFN +DL +LV  +D                                                         HF  N D  H 
Subjt:  GSVRRCYYALCKRICNEPFNPMDLNYLVGDSDYVGEEPMQENCVPPISDDFGLPSSELEILPCNFAQNVMNEDDTERTFLSGCQHTVEKHFPQNLDNGHE

Query:  GISHIIVDNLPFSANESHIEELAPSTSFPVHSIFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMQLWRDSSPELPI
        G+    +D +    + +  E LA       H +                         PEDN                                   L  
Subjt:  GISHIIVDNLPFSANESHIEELAPSTSFPVHSIFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMQLWRDSSPELPI

Query:  DVDFANTNIPTGDSFELPDDDDGNNNIQNARLAGYD-AHSNSKLKIEVQHDHLKSSNASAEGSYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYYEGL
        D+ +               + +     ++A L+  D  H +S+ K+E      K++ AS +  +LA+LS SL     ED   FM+VDG++V DKSYY+GL
Subjt:  DVDFANTNIPTGDSFELPDDDDGNNNIQNARLAGYD-AHSNSKLKIEVQHDHLKSSNASAEGSYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYYEGL

Query:  -SLLLSSPN----EVHPDQTTEAINGETVLPMDTMLDPPTACSGELYEKGSHCSNRNLDCSSETHPSPSASLNSQCSGRGDEPLFCSLNTEDPEIPSNDD
         SLL++S N    E  P+ T +  +     P +  LD                    LD +    P P       C         C LN EDP+IP NDD
Subjt:  -SLLLSSPN----EVHPDQTTEAINGETVLPMDTMLDPPTACSGELYEKGSHCSNRNLDCSSETHPSPSASLNSQCSGRGDEPLFCSLNTEDPEIPSNDD

Query:  VFLP----PLTTISSMGYHFQDCVDPTFSSIKDFFCKEKSGEKTQNLVQRERKNHGQPRVSSLSIGLHGLPERGERHLVGGAAVSLKLSHSNSIHGSSAN
        +FL     P++  S    +F+D   P  + ++D    ++  E     +Q ++K  G+ + S+      G P +G +     A+ S +L ++ +  GSS  
Subjt:  VFLP----PLTTISSMGYHFQDCVDPTFSSIKDFFCKEKSGEKTQNLVQRERKNHGQPRVSSLSIGLHGLPERGERHLVGGAAVSLKLSHSNSIHGSSAN

Query:  KTRSINANNDAILPVSLKEESNEISRVNHLGQNFLNT--HVEKPGFESVSRYPPSATCGIKQEPDTLTTVKDHQLSQELGARGVFNVEQDGVSLKSDQEE
           S  A ++ +L  S    + +  +    G  F+ +  H   P  +S +    +    + + P    T         L    V  +E      +++ E 
Subjt:  KTRSINANNDAILPVSLKEESNEISRVNHLGQNFLNT--HVEKPGFESVSRYPPSATCGIKQEPDTLTTVKDHQLSQELGARGVFNVEQDGVSLKSDQEE

Query:  LSIESDDDVPHFSDIEAMILDMDLDPDDQDLYSIEEVLKYQHMETKKKIIRLEQGANAFMQRSIASHGALAILYGRHAEYYIKKSEVLLGRATADCIVDI
           ESD+D+P++SDIEAMILDMDL+PDDQD + + EV KYQ  + K+ IIRLEQ A+++MQR+IAS GA A+LYGR++++YIKK EVL+GR+T D  VDI
Subjt:  LSIESDDDVPHFSDIEAMILDMDLDPDDQDLYSIEEVLKYQHMETKKKIIRLEQGANAFMQRSIASHGALAILYGRHAEYYIKKSEVLLGRATADCIVDI

Query:  DLGREGSGNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAWGHCLRLNSGCLIEIRGIPFIFESNPVRMKQYVENIDKTS
        DLGRE  G+KISRRQAII++  DG F +KNLGK SIS+N K+V  G  L L S CL+EIRG+PFIFE+N   M++Y++   K +
Subjt:  DLGREGSGNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAWGHCLRLNSGCLIEIRGIPFIFESNPVRMKQYVENIDKTS

AT3G54350.2 Forkhead-associated (FHA) domain-containing protein2.0e-10033.6Show/hide
Query:  MGALAPVAPWTPEDDILLKNAVEAGASLEALAKGAVQFSRRYTVRELQERWYSLLYDPIVSEEASMSMIDIECSSSILPSKFNKFGNPKETKYIGGKRKS
        MGALA V PW PEDD+LLKNAVEAGASLE+LAKGAVQFSRR+++RELQ+RW++LLYDP+VS EA+  M ++E ++   P+KF + G  KE K    KR +
Subjt:  MGALAPVAPWTPEDDILLKNAVEAGASLEALAKGAVQFSRRYTVRELQERWYSLLYDPIVSEEASMSMIDIECSSSILPSKFNKFGNPKETKYIGGKRKS

Query:  GSVRRCYYALCKRICNEPFNPMDLNYLVGDSDYVGEEPMQENCVPPISDDFGLPSSELEILPCNFAQNVMNEDDTERTFLSGCQHTVEKHFPQNLDNGHE
          +R  Y++L K+   EPFN +DL +LV  +D                                                         HF  N D  H 
Subjt:  GSVRRCYYALCKRICNEPFNPMDLNYLVGDSDYVGEEPMQENCVPPISDDFGLPSSELEILPCNFAQNVMNEDDTERTFLSGCQHTVEKHFPQNLDNGHE

Query:  GISHIIVDNLPFSANESHIEELAPSTSFPVHSIFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMQLWRDSSPELPI
        G+    +D +    + +  E LA       H +                         PEDN                                   L  
Subjt:  GISHIIVDNLPFSANESHIEELAPSTSFPVHSIFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMQLWRDSSPELPI

Query:  DVDFANTNIPTGDSFELPDDDDGNNNIQNARLAGYD-AHSNSKLKIEVQHDHLKSSNASAEGSYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYYEGL
        D+ +               + +     ++A L+  D  H +S+ K+E      K++ AS +  +LA+LS SL     ED   FM+VDG++V DKSYY+GL
Subjt:  DVDFANTNIPTGDSFELPDDDDGNNNIQNARLAGYD-AHSNSKLKIEVQHDHLKSSNASAEGSYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYYEGL

Query:  -SLLLSSPN----EVHPDQTTEAINGETVLPMDTMLDPPTACSGELYEKGSHCSNRNLDCSSETHPSPSASLNSQCSGRGDEPLFCSLNTEDPEIPSNDD
         SLL++S N    E  P+ T +  +     P +  LD                    LD +    P P       C         C LN EDP+IP NDD
Subjt:  -SLLLSSPN----EVHPDQTTEAINGETVLPMDTMLDPPTACSGELYEKGSHCSNRNLDCSSETHPSPSASLNSQCSGRGDEPLFCSLNTEDPEIPSNDD

Query:  VFLP----PLTTISSMGYHFQDCVDPTFSSIKDFFCKEKSGEKTQNLVQRERKNHGQPRVSSLSIGLHGLPERGERHLVGGAAVSLKLSHSNSIHGSSAN
        +FL     P++  S    +F+D   P  + ++D    ++  E     +Q ++K  G+ + S+      G P +G +     A+ S +L ++ +  GSS  
Subjt:  VFLP----PLTTISSMGYHFQDCVDPTFSSIKDFFCKEKSGEKTQNLVQRERKNHGQPRVSSLSIGLHGLPERGERHLVGGAAVSLKLSHSNSIHGSSAN

Query:  KTRSINANNDAILPVSLKEESNEISRVNHLGQNFLNT--HVEKPGFESVSRYPPSATCGIKQEPDTLTTVKDHQLSQELGARGVFNVEQDGVSLKSDQEE
           S  A ++ +L  S    + +  +    G  F+ +  H   P  +S +    +    + + P    T         L    V  +E      +++ E 
Subjt:  KTRSINANNDAILPVSLKEESNEISRVNHLGQNFLNT--HVEKPGFESVSRYPPSATCGIKQEPDTLTTVKDHQLSQELGARGVFNVEQDGVSLKSDQEE

Query:  LSIESDDDVPHFSDIEAMILDMDLDPDDQDLYSIEEVLKYQHMETKKKIIRLEQGANAFMQRSIASHGALAILYGRHAEYYIKKSEVLLGRATADCIVDI
           ESD+D+P++SDIEAMILDMDL+PDDQD + + EV KYQ  + K+ IIRLEQ A+++MQR+IAS GA A+LYGR++++YIKK EVL+GR+T D  VDI
Subjt:  LSIESDDDVPHFSDIEAMILDMDLDPDDQDLYSIEEVLKYQHMETKKKIIRLEQGANAFMQRSIASHGALAILYGRHAEYYIKKSEVLLGRATADCIVDI

Query:  DLGREGSGNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAWGHCLRLNSGCLIEIRGIPFIFESNPVRMKQYVENIDKTS
        DLGRE  G+KISRRQAII++  DG F +KNLGK SIS+N K+V  G  L L S CL+EIRG+PFIFE+N   M++Y++   K +
Subjt:  DLGREGSGNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAWGHCLRLNSGCLIEIRGIPFIFESNPVRMKQYVENIDKTS

AT3G54350.3 Forkhead-associated (FHA) domain-containing protein2.0e-10033.6Show/hide
Query:  MGALAPVAPWTPEDDILLKNAVEAGASLEALAKGAVQFSRRYTVRELQERWYSLLYDPIVSEEASMSMIDIECSSSILPSKFNKFGNPKETKYIGGKRKS
        MGALA V PW PEDD+LLKNAVEAGASLE+LAKGAVQFSRR+++RELQ+RW++LLYDP+VS EA+  M ++E ++   P+KF + G  KE K    KR +
Subjt:  MGALAPVAPWTPEDDILLKNAVEAGASLEALAKGAVQFSRRYTVRELQERWYSLLYDPIVSEEASMSMIDIECSSSILPSKFNKFGNPKETKYIGGKRKS

Query:  GSVRRCYYALCKRICNEPFNPMDLNYLVGDSDYVGEEPMQENCVPPISDDFGLPSSELEILPCNFAQNVMNEDDTERTFLSGCQHTVEKHFPQNLDNGHE
          +R  Y++L K+   EPFN +DL +LV  +D                                                         HF  N D  H 
Subjt:  GSVRRCYYALCKRICNEPFNPMDLNYLVGDSDYVGEEPMQENCVPPISDDFGLPSSELEILPCNFAQNVMNEDDTERTFLSGCQHTVEKHFPQNLDNGHE

Query:  GISHIIVDNLPFSANESHIEELAPSTSFPVHSIFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMQLWRDSSPELPI
        G+    +D +    + +  E LA       H +                         PEDN                                   L  
Subjt:  GISHIIVDNLPFSANESHIEELAPSTSFPVHSIFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMQLWRDSSPELPI

Query:  DVDFANTNIPTGDSFELPDDDDGNNNIQNARLAGYD-AHSNSKLKIEVQHDHLKSSNASAEGSYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYYEGL
        D+ +               + +     ++A L+  D  H +S+ K+E      K++ AS +  +LA+LS SL     ED   FM+VDG++V DKSYY+GL
Subjt:  DVDFANTNIPTGDSFELPDDDDGNNNIQNARLAGYD-AHSNSKLKIEVQHDHLKSSNASAEGSYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYYEGL

Query:  -SLLLSSPN----EVHPDQTTEAINGETVLPMDTMLDPPTACSGELYEKGSHCSNRNLDCSSETHPSPSASLNSQCSGRGDEPLFCSLNTEDPEIPSNDD
         SLL++S N    E  P+ T +  +     P +  LD                    LD +    P P       C         C LN EDP+IP NDD
Subjt:  -SLLLSSPN----EVHPDQTTEAINGETVLPMDTMLDPPTACSGELYEKGSHCSNRNLDCSSETHPSPSASLNSQCSGRGDEPLFCSLNTEDPEIPSNDD

Query:  VFLP----PLTTISSMGYHFQDCVDPTFSSIKDFFCKEKSGEKTQNLVQRERKNHGQPRVSSLSIGLHGLPERGERHLVGGAAVSLKLSHSNSIHGSSAN
        +FL     P++  S    +F+D   P  + ++D    ++  E     +Q ++K  G+ + S+      G P +G +     A+ S +L ++ +  GSS  
Subjt:  VFLP----PLTTISSMGYHFQDCVDPTFSSIKDFFCKEKSGEKTQNLVQRERKNHGQPRVSSLSIGLHGLPERGERHLVGGAAVSLKLSHSNSIHGSSAN

Query:  KTRSINANNDAILPVSLKEESNEISRVNHLGQNFLNT--HVEKPGFESVSRYPPSATCGIKQEPDTLTTVKDHQLSQELGARGVFNVEQDGVSLKSDQEE
           S  A ++ +L  S    + +  +    G  F+ +  H   P  +S +    +    + + P    T         L    V  +E      +++ E 
Subjt:  KTRSINANNDAILPVSLKEESNEISRVNHLGQNFLNT--HVEKPGFESVSRYPPSATCGIKQEPDTLTTVKDHQLSQELGARGVFNVEQDGVSLKSDQEE

Query:  LSIESDDDVPHFSDIEAMILDMDLDPDDQDLYSIEEVLKYQHMETKKKIIRLEQGANAFMQRSIASHGALAILYGRHAEYYIKKSEVLLGRATADCIVDI
           ESD+D+P++SDIEAMILDMDL+PDDQD + + EV KYQ  + K+ IIRLEQ A+++MQR+IAS GA A+LYGR++++YIKK EVL+GR+T D  VDI
Subjt:  LSIESDDDVPHFSDIEAMILDMDLDPDDQDLYSIEEVLKYQHMETKKKIIRLEQGANAFMQRSIASHGALAILYGRHAEYYIKKSEVLLGRATADCIVDI

Query:  DLGREGSGNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAWGHCLRLNSGCLIEIRGIPFIFESNPVRMKQYVENIDKTS
        DLGRE  G+KISRRQAII++  DG F +KNLGK SIS+N K+V  G  L L S CL+EIRG+PFIFE+N   M++Y++   K +
Subjt:  DLGREGSGNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAWGHCLRLNSGCLIEIRGIPFIFESNPVRMKQYVENIDKTS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGCTCTTGCCCCTGTCGCGCCTTGGACTCCTGAAGATGATATTCTGCTCAAGAACGCCGTCGAGGCGGGTGCTTCCTTGGAGGCCCTTGCCAAAGGTGCTGTGCA
GTTTTCTCGAAGATACACCGTAAGAGAATTGCAAGAACGATGGTATTCTTTACTTTATGATCCAATTGTTTCTGAAGAAGCATCTATGTCCATGATTGACATTGAGTGTT
CTTCCTCCATTCTTCCATCAAAGTTCAACAAGTTTGGGAATCCAAAAGAAACTAAATATATTGGTGGGAAGAGGAAGTCTGGGAGTGTACGACGTTGTTATTATGCTTTG
TGTAAAAGAATTTGCAACGAACCATTTAACCCTATGGACCTGAATTATCTTGTTGGTGATAGTGACTATGTAGGTGAAGAGCCTATGCAAGAAAATTGTGTCCCTCCAAT
ATCAGATGATTTTGGACTTCCGAGCTCAGAGCTGGAGATCTTGCCTTGTAATTTTGCTCAAAATGTTATGAATGAGGATGATACTGAGCGCACTTTTCTCTCTGGATGTC
AACATACAGTTGAAAAGCATTTTCCTCAGAACCTGGATAATGGGCACGAGGGAATTTCTCACATTATTGTAGACAATCTGCCTTTCTCTGCAAATGAATCGCATATTGAA
GAATTGGCTCCATCAACTAGCTTTCCAGTTCATAGTATCTTTGAAAATGATTTGGAGGTGAAGCCTTCTACTTTTGGGCAACTGAGCAATGACCAGAGAGCCATGGGCTC
TGAACCAGAAGACAACGATGTCTTTAATTCTCCTGTTTCTGATTCTGGCGCATCATTTCATAATGTTGAGTACTCATCTCCTCTTCCTGGTATGCAACTATGGAGAGATT
CCTCACCAGAATTGCCGATTGATGTTGACTTTGCAAATACTAATATACCTACAGGAGACTCTTTTGAACTCCCTGATGATGATGATGGGAACAACAACATTCAAAATGCA
AGATTAGCAGGCTATGATGCTCACTCTAACTCAAAGTTGAAGATTGAAGTGCAGCATGATCATTTGAAAAGTTCAAATGCCTCTGCTGAAGGTTCTTATCTCGCAGAGTT
GTCCAATTCTCTTTTGAACTTGAGCAATGAGGATGAGCTACTTTTCATGGATGTTGATGGAAGAGATGTGATTGATAAGTCATACTATGAGGGGTTGTCACTTTTGTTGA
GTTCACCAAATGAAGTTCATCCTGATCAAACAACTGAAGCAATTAATGGAGAAACAGTGTTACCAATGGATACAATGTTAGATCCCCCTACTGCATGTTCTGGAGAGTTA
TATGAGAAAGGATCCCACTGTAGTAATAGAAATTTGGATTGTAGTTCAGAAACTCATCCATCACCATCTGCATCTTTGAACAGTCAATGTTCTGGAAGAGGAGATGAGCC
TCTTTTCTGCAGTTTGAACACAGAAGACCCAGAAATCCCGAGCAATGATGATGTTTTTCTACCTCCATTGACAACAATTTCTAGTATGGGATATCATTTTCAAGATTGCG
TGGACCCCACCTTTTCATCTATTAAGGATTTCTTTTGTAAAGAAAAATCTGGTGAAAAGACTCAAAACCTTGTGCAAAGGGAGAGGAAAAATCATGGACAACCTCGTGTT
TCATCTCTGTCGATAGGATTGCATGGTTTGCCCGAAAGAGGTGAAAGACATCTGGTTGGTGGAGCTGCTGTTAGTTTGAAATTATCCCATAGCAACTCCATACATGGGTC
ATCTGCAAATAAAACCAGATCCATAAATGCAAATAACGATGCTATCTTACCTGTCTCACTCAAGGAAGAGAGCAATGAAATTTCCCGGGTAAATCATCTTGGCCAGAATT
TTTTAAATACTCATGTAGAGAAGCCAGGCTTTGAATCTGTTAGTAGATATCCACCAAGTGCTACTTGTGGCATTAAACAGGAACCAGATACATTGACTACTGTGAAAGAT
CATCAATTATCACAAGAACTGGGTGCTCGAGGTGTTTTCAATGTAGAACAAGATGGGGTATCTTTGAAATCTGATCAAGAAGAGTTATCTATTGAGAGTGATGATGATGT
GCCTCATTTTTCAGATATTGAGGCAATGATACTTGATATGGACTTGGATCCAGATGATCAGGATCTGTATTCAATTGAAGAAGTCTTAAAATACCAACATATGGAGACCA
AGAAGAAAATCATCAGACTGGAGCAAGGGGCTAATGCTTTCATGCAAAGATCTATTGCCTCCCATGGGGCATTAGCAATTCTGTATGGCCGACATGCAGAATATTACATT
AAGAAATCTGAGGTTCTGTTGGGTAGAGCGACGGCAGATTGCATTGTGGACATTGACTTGGGAAGGGAGGGAAGTGGTAATAAAATATCTCGACGGCAGGCAATTATAAA
AATAGATCAGGACGGATTTTTCTCCCTGAAGAATCTTGGCAAGTGCTCAATTTCCATAAATAACAAGGATGTGGCCTGGGGTCACTGCCTGCGACTCAATTCTGGTTGCT
TGATTGAGATAAGGGGAATTCCATTTATATTTGAGTCAAACCCAGTTCGTATGAAGCAGTATGTGGAGAACATAGACAAGACATCTCACAAACAAGAGTATCAATCATAG
mRNA sequenceShow/hide mRNA sequence
CTCAGAGGTCACCGTTTGGCCTCGCCGTGTTTCGAAACGCTCGCTTTCCTTTCTCCCCCAATTTTGAGTCTCTAGGGTTTCTCTTGCAACGCCGGACGCCATATCCTTCG
TCTTCTTCCTCTCAACCGTTCTCCGGCCACGGTCGGTCAGTGCCGAACATTGTTTTCCTTTCTAACCAGAGACTGTTTAGGGTTTTCTGCTTTGCGGAGATGGGAGCTCT
TGCCCCTGTCGCGCCTTGGACTCCTGAAGATGATATTCTGCTCAAGAACGCCGTCGAGGCGGGTGCTTCCTTGGAGGCCCTTGCCAAAGGTGCTGTGCAGTTTTCTCGAA
GATACACCGTAAGAGAATTGCAAGAACGATGGTATTCTTTACTTTATGATCCAATTGTTTCTGAAGAAGCATCTATGTCCATGATTGACATTGAGTGTTCTTCCTCCATT
CTTCCATCAAAGTTCAACAAGTTTGGGAATCCAAAAGAAACTAAATATATTGGTGGGAAGAGGAAGTCTGGGAGTGTACGACGTTGTTATTATGCTTTGTGTAAAAGAAT
TTGCAACGAACCATTTAACCCTATGGACCTGAATTATCTTGTTGGTGATAGTGACTATGTAGGTGAAGAGCCTATGCAAGAAAATTGTGTCCCTCCAATATCAGATGATT
TTGGACTTCCGAGCTCAGAGCTGGAGATCTTGCCTTGTAATTTTGCTCAAAATGTTATGAATGAGGATGATACTGAGCGCACTTTTCTCTCTGGATGTCAACATACAGTT
GAAAAGCATTTTCCTCAGAACCTGGATAATGGGCACGAGGGAATTTCTCACATTATTGTAGACAATCTGCCTTTCTCTGCAAATGAATCGCATATTGAAGAATTGGCTCC
ATCAACTAGCTTTCCAGTTCATAGTATCTTTGAAAATGATTTGGAGGTGAAGCCTTCTACTTTTGGGCAACTGAGCAATGACCAGAGAGCCATGGGCTCTGAACCAGAAG
ACAACGATGTCTTTAATTCTCCTGTTTCTGATTCTGGCGCATCATTTCATAATGTTGAGTACTCATCTCCTCTTCCTGGTATGCAACTATGGAGAGATTCCTCACCAGAA
TTGCCGATTGATGTTGACTTTGCAAATACTAATATACCTACAGGAGACTCTTTTGAACTCCCTGATGATGATGATGGGAACAACAACATTCAAAATGCAAGATTAGCAGG
CTATGATGCTCACTCTAACTCAAAGTTGAAGATTGAAGTGCAGCATGATCATTTGAAAAGTTCAAATGCCTCTGCTGAAGGTTCTTATCTCGCAGAGTTGTCCAATTCTC
TTTTGAACTTGAGCAATGAGGATGAGCTACTTTTCATGGATGTTGATGGAAGAGATGTGATTGATAAGTCATACTATGAGGGGTTGTCACTTTTGTTGAGTTCACCAAAT
GAAGTTCATCCTGATCAAACAACTGAAGCAATTAATGGAGAAACAGTGTTACCAATGGATACAATGTTAGATCCCCCTACTGCATGTTCTGGAGAGTTATATGAGAAAGG
ATCCCACTGTAGTAATAGAAATTTGGATTGTAGTTCAGAAACTCATCCATCACCATCTGCATCTTTGAACAGTCAATGTTCTGGAAGAGGAGATGAGCCTCTTTTCTGCA
GTTTGAACACAGAAGACCCAGAAATCCCGAGCAATGATGATGTTTTTCTACCTCCATTGACAACAATTTCTAGTATGGGATATCATTTTCAAGATTGCGTGGACCCCACC
TTTTCATCTATTAAGGATTTCTTTTGTAAAGAAAAATCTGGTGAAAAGACTCAAAACCTTGTGCAAAGGGAGAGGAAAAATCATGGACAACCTCGTGTTTCATCTCTGTC
GATAGGATTGCATGGTTTGCCCGAAAGAGGTGAAAGACATCTGGTTGGTGGAGCTGCTGTTAGTTTGAAATTATCCCATAGCAACTCCATACATGGGTCATCTGCAAATA
AAACCAGATCCATAAATGCAAATAACGATGCTATCTTACCTGTCTCACTCAAGGAAGAGAGCAATGAAATTTCCCGGGTAAATCATCTTGGCCAGAATTTTTTAAATACT
CATGTAGAGAAGCCAGGCTTTGAATCTGTTAGTAGATATCCACCAAGTGCTACTTGTGGCATTAAACAGGAACCAGATACATTGACTACTGTGAAAGATCATCAATTATC
ACAAGAACTGGGTGCTCGAGGTGTTTTCAATGTAGAACAAGATGGGGTATCTTTGAAATCTGATCAAGAAGAGTTATCTATTGAGAGTGATGATGATGTGCCTCATTTTT
CAGATATTGAGGCAATGATACTTGATATGGACTTGGATCCAGATGATCAGGATCTGTATTCAATTGAAGAAGTCTTAAAATACCAACATATGGAGACCAAGAAGAAAATC
ATCAGACTGGAGCAAGGGGCTAATGCTTTCATGCAAAGATCTATTGCCTCCCATGGGGCATTAGCAATTCTGTATGGCCGACATGCAGAATATTACATTAAGAAATCTGA
GGTTCTGTTGGGTAGAGCGACGGCAGATTGCATTGTGGACATTGACTTGGGAAGGGAGGGAAGTGGTAATAAAATATCTCGACGGCAGGCAATTATAAAAATAGATCAGG
ACGGATTTTTCTCCCTGAAGAATCTTGGCAAGTGCTCAATTTCCATAAATAACAAGGATGTGGCCTGGGGTCACTGCCTGCGACTCAATTCTGGTTGCTTGATTGAGATA
AGGGGAATTCCATTTATATTTGAGTCAAACCCAGTTCGTATGAAGCAGTATGTGGAGAACATAGACAAGACATCTCACAAACAAGAGTATCAATCATAGTGATAGATGAT
AACCAAGCCGGCACGGTCAGTTGTACAGTCGGGAAGGTGCTAATTTTTCTGCACTTTCTGGGTTCTTTAGGTTGCATTGGATTGCTAAAGATGGGAAACAATAGCAATGA
GATTATTTTGAGCAGTAACGTAAGCAAGGAGCTATCCTACTCCTACCGAATTGCGATGATATAGGTCAAAATGAAGGGGTAATGGTCCCTGAAATCGAATTTTCCTTGCT
CTGGTCCACCCCTAGAATCCAGCTTGGCAATTCCATACCTGTCCCTGCTAGAAAGAGTAACAACTTGACTGAAGTCGTAATGGATAACTTCGAAGGAGGTTCTCTTTAGC
TATATATACGGTATCATTAACTCTTAACATTGTTTAACATTATCATGTATGGAACTTACAGTTTCAAGAAGCTGATCTTAAGCTTTGCTAACAAGTTGTATGACTTAGTT
TGTTGAAATTCAGCATTTGCATCCGTTGTATGTACAGTCTTCATTGTAACTTTCTGCAAATATCTTGTAATTAAAAGGTCTTGCCTTCTCAAAATCTAATCATGTCCATT
GAAAAACCCCCTAGTGGACCTATTCATAGAGGGATGTAATTAGCTTGGATTTGTTAAGGGGGCGTTTGGGGGCAAGGTGAATGGAATGATTATGGAGAGGAATGAGAATA
CAAGAGAATATATGATTATCTTTGTTTGCAATGGAGGAATTGCGGGAAAAATCAGTATATCTGTTTTGATTGTTTGAG
Protein sequenceShow/hide protein sequence
MGALAPVAPWTPEDDILLKNAVEAGASLEALAKGAVQFSRRYTVRELQERWYSLLYDPIVSEEASMSMIDIECSSSILPSKFNKFGNPKETKYIGGKRKSGSVRRCYYAL
CKRICNEPFNPMDLNYLVGDSDYVGEEPMQENCVPPISDDFGLPSSELEILPCNFAQNVMNEDDTERTFLSGCQHTVEKHFPQNLDNGHEGISHIIVDNLPFSANESHIE
ELAPSTSFPVHSIFENDLEVKPSTFGQLSNDQRAMGSEPEDNDVFNSPVSDSGASFHNVEYSSPLPGMQLWRDSSPELPIDVDFANTNIPTGDSFELPDDDDGNNNIQNA
RLAGYDAHSNSKLKIEVQHDHLKSSNASAEGSYLAELSNSLLNLSNEDELLFMDVDGRDVIDKSYYEGLSLLLSSPNEVHPDQTTEAINGETVLPMDTMLDPPTACSGEL
YEKGSHCSNRNLDCSSETHPSPSASLNSQCSGRGDEPLFCSLNTEDPEIPSNDDVFLPPLTTISSMGYHFQDCVDPTFSSIKDFFCKEKSGEKTQNLVQRERKNHGQPRV
SSLSIGLHGLPERGERHLVGGAAVSLKLSHSNSIHGSSANKTRSINANNDAILPVSLKEESNEISRVNHLGQNFLNTHVEKPGFESVSRYPPSATCGIKQEPDTLTTVKD
HQLSQELGARGVFNVEQDGVSLKSDQEELSIESDDDVPHFSDIEAMILDMDLDPDDQDLYSIEEVLKYQHMETKKKIIRLEQGANAFMQRSIASHGALAILYGRHAEYYI
KKSEVLLGRATADCIVDIDLGREGSGNKISRRQAIIKIDQDGFFSLKNLGKCSISINNKDVAWGHCLRLNSGCLIEIRGIPFIFESNPVRMKQYVENIDKTSHKQEYQS