; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh19G003120 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh19G003120
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionFHA domain-containing protein
Genome locationCmo_Chr19:2347973..2355281
RNA-Seq ExpressionCmoCh19G003120
SyntenyCmoCh19G003120
Gene Ontology termsGO:0031011 - Ino80 complex (cellular component)
GO:0071339 - MLL1 complex (cellular component)
GO:0002151 - G-quadruplex RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR000253 - Forkhead-associated (FHA) domain
IPR008984 - SMAD/FHA domain superfamily
IPR025999 - Microspherule protein, N-terminal domain
IPR037912 - Microspherule protein 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6571705.1 Microspherule protein 1, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0099.77Show/hide
Query:  MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDFERSSSILPSKFNRFGNPKETKYIGGKRKS
        MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDFERSSSILPSKFN+FGNPKETKYIGGKRKS
Subjt:  MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDFERSSSILPSKFNRFGNPKETKYIGGKRKS

Query:  GSVRHCYYALRKRICNEPFNNPMDLNFLVGPSNSNYVVEEPMSGNCIPPISDDFGLQSSEMGILPCDFSQNVMNTDDVEHTFQSGCQGTVEKHFPRNLDN
        GSVRHCYYALRKRICNEPFNNPMDLNFLVGPSNSNYVVEEPMSGNCIPPISDDFGLQSSEMGILPCDFSQNVMNTDDVEHTFQSGCQGTVEKHFPRNLDN
Subjt:  GSVRHCYYALRKRICNEPFNNPMDLNFLVGPSNSNYVVEEPMSGNCIPPISDDFGLQSSEMGILPCDFSQNVMNTDDVEHTFQSGCQGTVEKHFPRNLDN

Query:  GQEGISHSMRESLPPSAIDSHVEELAPSTGFPVHSLFENDLEARPSTFGQLSNDQRAMGSELEDNNVFNSPVSESGASFHNVEYSSPLPGMPIWRNASAP
        GQEGISHSMRESLPPSAIDSHVEELAPSTGFPVHSLFENDLEARPSTFGQLSNDQRAMGSELEDNNVFNSPVSESGASFHNVEYSSPLPGMPIWRNASAP
Subjt:  GQEGISHSMRESLPPSAIDSHVEELAPSTGFPVHSLFENDLEARPSTFGQLSNDQRAMGSELEDNNVFNSPVSESGASFHNVEYSSPLPGMPIWRNASAP

Query:  ALPIDVGFADKDIPTSNSFELPDDDGNKNIQNARVAGYDAYSDLKLKIEVEQDHLKSPNATAEVYLAELSNSLMNMSNEDELLFMDVDGKDALDKSYYDG
        ALPIDVGFADKDIPTSNSFELPDDDGNKNIQNARVAGYDAYSDLKLKIEVEQDHLKSPNATAEVYLAELSNSLMNMSNEDELLFMDVDGKDALDKSYYDG
Subjt:  ALPIDVGFADKDIPTSNSFELPDDDGNKNIQNARVAGYDAYSDLKLKIEVEQDHLKSPNATAEVYLAELSNSLMNMSNEDELLFMDVDGKDALDKSYYDG

Query:  LSSLLLNSPNEINHDQTANAINAETVLPTDTMVDPPTACSGGLYEKGSHCGVGHLDCTSEAHSSPSASLNNQCPVKGDEPLFCTLNTEDPDIPSNDDVFL
        LSSLLLNSPNEINHDQTANAINAETVLPTDTMVDPPTACSGGLYEKGSHCGVGHLDCTSEAHSSPSASLN+QCPVKGDEPLFCTLNTEDPDIPSNDDVFL
Subjt:  LSSLLLNSPNEINHDQTANAINAETVLPTDTMVDPPTACSGGLYEKGSHCGVGHLDCTSEAHSSPSASLNNQCPVKGDEPLFCTLNTEDPDIPSNDDVFL

Query:  PPLSTMATMGYNFQDCINTTFSSTKDFTYNEKSGETQNLGRERKNHGAGVNYRLHGFSERGEKHPVGGAGVNYRSSHSNARHLPSVSNVGSINGNSDAAL
        PPLSTMATMGYNFQDCINTTFSSTKDFTYNEKSGETQNLGRERKNHGAGVNYRLHGFSERGEKHPVGGAGVNYRSSHSNARHLPSVSNVGSINGNSDAAL
Subjt:  PPLSTMATMGYNFQDCINTTFSSTKDFTYNEKSGETQNLGRERKNHGAGVNYRLHGFSERGEKHPVGGAGVNYRSSHSNARHLPSVSNVGSINGNSDAAL

Query:  PAVLKEENNEISRVNHLGENFLNAHAEKPGFDSDNVRIYPPSAACDIKQEPDILASLKDHRLSQEGGTRGTFGVEQGGLSSTSDQEELSIDSEDDVPHFS
        PAVLKEENNEISRVNHLGENFLNAHAEKPGFDSDNVRIYPPSAACDIKQEPDILASLKDHRLSQEGGTRGTFGVEQGGLSSTSDQEELSIDSEDDVPHFS
Subjt:  PAVLKEENNEISRVNHLGENFLNAHAEKPGFDSDNVRIYPPSAACDIKQEPDILASLKDHRLSQEGGTRGTFGVEQGGLSSTSDQEELSIDSEDDVPHFS

Query:  DIEAMILDMDLDPEDQDLYSSEEVLKYQHVDTKKRIIRLEQGANAYMQRSTASHGALAVLYGRYSKHYIKKSEVLLGRATEDVIVDIDLGREGSGNKISR
        DIEAMILDMDLDPEDQDLYSSEEVLKYQHVDTKKRIIRLEQGANAYMQRSTASHGALAVLYGRYSKHYIKKSEVLLGRATEDVIVDIDLGREGSGNKISR
Subjt:  DIEAMILDMDLDPEDQDLYSSEEVLKYQHVDTKKRIIRLEQGANAYMQRSTASHGALAVLYGRYSKHYIKKSEVLLGRATEDVIVDIDLGREGSGNKISR

Query:  RQAIIKLDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPTRMKQYVDNVGKISHKQEY
        RQAIIKLDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPTRMKQYVDNVGKISHKQEY
Subjt:  RQAIIKLDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPTRMKQYVDNVGKISHKQEY

KAG7011431.1 Microspherule protein 1 [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0099.41Show/hide
Query:  MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDFERSSSILPSKFNRFGNPKETKYIGGKRKS
        MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDFERSSSILPSKFN+FGNPKETKYIGGKRKS
Subjt:  MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDFERSSSILPSKFNRFGNPKETKYIGGKRKS

Query:  GSVRHCYYALRKRICNEPFNNPMDLNFLVGPSNSNYVVEEPMSGNCIPPISDDFGLQSSEMGILPCDFSQNVMNTDDVEHTFQSGCQGTVEKHFPRNLDN
        GSVRHCYYALRKRICNEPFNNPMDLNFLVGPSNSNYVVEEPMSGNCIPPISDDF LQSSEMGILPCDFSQNVMNTDDVEHT+QSGCQGTVEKHFPRNLDN
Subjt:  GSVRHCYYALRKRICNEPFNNPMDLNFLVGPSNSNYVVEEPMSGNCIPPISDDFGLQSSEMGILPCDFSQNVMNTDDVEHTFQSGCQGTVEKHFPRNLDN

Query:  GQEGISHSMRESLPPSAIDSHVEELAPSTGFPVHSLFENDLEARPSTFGQLSNDQRAMGSELEDNNVFNSPVSESGASFHNVEYSSPLPGMPIWRNASAP
        GQEGISHSMRESLPPSAIDSHVEELAPSTGFPVHSLFENDLEARPSTFGQLSNDQRAMGSELEDNNVFNSPVSESGASFHNVEYSSPLPGMPIWRNASAP
Subjt:  GQEGISHSMRESLPPSAIDSHVEELAPSTGFPVHSLFENDLEARPSTFGQLSNDQRAMGSELEDNNVFNSPVSESGASFHNVEYSSPLPGMPIWRNASAP

Query:  ALPIDVGFADKDIPTSNSFELPDDDGNKNIQNARVAGYDAYSDLKLKIEVEQDHLKSPNATAEVYLAELSNSLMNMSNEDELLFMDVDGKDALDKSYYDG
        ALPIDVGFADKDIPTSNSFELPDDDGNKNIQNARVAGYDAYSDLKLKIEVEQDHLKSPNATAEVYLAELSNSLMNMSNEDELLFMDVDGKDALDKSYYDG
Subjt:  ALPIDVGFADKDIPTSNSFELPDDDGNKNIQNARVAGYDAYSDLKLKIEVEQDHLKSPNATAEVYLAELSNSLMNMSNEDELLFMDVDGKDALDKSYYDG

Query:  LSSLLLNSPNEINHDQTANAINAETVLPTDTMVDPPTACSGGLYEKGSHCGVGHLDCTSEAHSSPSASLNNQCPVKGDEPLFCTLNTEDPDIPSNDDVFL
        LSSLLLNSPNEINHDQTANAINAETVLPTDTMVDPPTACSGGLYEKGSHCGVGHLDCTSEAHSSPSASLN+QCPVKGDEPLFCTLNTEDPDIPSNDDVFL
Subjt:  LSSLLLNSPNEINHDQTANAINAETVLPTDTMVDPPTACSGGLYEKGSHCGVGHLDCTSEAHSSPSASLNNQCPVKGDEPLFCTLNTEDPDIPSNDDVFL

Query:  PPLSTMATMGYNFQDCINTTFSSTKDFTYNEKSGETQNLGRERKNHGAGVNYRLHGFSERGEKHPVGGAGVNYRSSHSNARHLPSVSNVGSINGNSDAAL
        PPLSTMATMGYNFQDCINTTFSSTKDFTYNEKSGETQNLGRERKNHGAGVNYRLHGFSERGEKHPVGGAGVNYRSSHSNARHLPSVSNVGSINGNSDAAL
Subjt:  PPLSTMATMGYNFQDCINTTFSSTKDFTYNEKSGETQNLGRERKNHGAGVNYRLHGFSERGEKHPVGGAGVNYRSSHSNARHLPSVSNVGSINGNSDAAL

Query:  PAVLKEENNEISRVNHLGENFLNAHAEKPGFDSDNVRIYPPSAACDIKQEPDILASLKDHRLSQEGGTRGTFGVEQGGLSSTSDQEELSIDSEDDVPHFS
        PAVLKEENNEISRVNHLGENFLNAHAEKPGFDSDNVRIYPPSAACDIKQEPDILASLKDHRLSQEGGTRGTFGVEQGGLSSTSDQEELSIDSEDDVPHFS
Subjt:  PAVLKEENNEISRVNHLGENFLNAHAEKPGFDSDNVRIYPPSAACDIKQEPDILASLKDHRLSQEGGTRGTFGVEQGGLSSTSDQEELSIDSEDDVPHFS

Query:  DIEAMILDMDLDPEDQDLYSSEEVLKYQHVDTKKRIIRLEQGANAYMQRSTASHGALAVLYGRYSKHYIKKSEVLLGRATEDVIVDIDLGREGSGNKISR
        DIEAMILDMDLDPEDQDLYSSEEVLKYQHVDTKKRIIRLEQGANAYMQRSTASHGALAVLYGRYSKHYIKKSEVLLGRATEDVIVDIDLGREGSGNKISR
Subjt:  DIEAMILDMDLDPEDQDLYSSEEVLKYQHVDTKKRIIRLEQGANAYMQRSTASHGALAVLYGRYSKHYIKKSEVLLGRATEDVIVDIDLGREGSGNKISR

Query:  RQAIIKLDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEI
        RQAIIKLDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIE+
Subjt:  RQAIIKLDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEI

XP_022963643.1 uncharacterized protein LOC111463912 isoform X1 [Cucurbita moschata]0.0e+00100Show/hide
Query:  MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDFERSSSILPSKFNRFGNPKETKYIGGKRKS
        MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDFERSSSILPSKFNRFGNPKETKYIGGKRKS
Subjt:  MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDFERSSSILPSKFNRFGNPKETKYIGGKRKS

Query:  GSVRHCYYALRKRICNEPFNNPMDLNFLVGPSNSNYVVEEPMSGNCIPPISDDFGLQSSEMGILPCDFSQNVMNTDDVEHTFQSGCQGTVEKHFPRNLDN
        GSVRHCYYALRKRICNEPFNNPMDLNFLVGPSNSNYVVEEPMSGNCIPPISDDFGLQSSEMGILPCDFSQNVMNTDDVEHTFQSGCQGTVEKHFPRNLDN
Subjt:  GSVRHCYYALRKRICNEPFNNPMDLNFLVGPSNSNYVVEEPMSGNCIPPISDDFGLQSSEMGILPCDFSQNVMNTDDVEHTFQSGCQGTVEKHFPRNLDN

Query:  GQEGISHSMRESLPPSAIDSHVEELAPSTGFPVHSLFENDLEARPSTFGQLSNDQRAMGSELEDNNVFNSPVSESGASFHNVEYSSPLPGMPIWRNASAP
        GQEGISHSMRESLPPSAIDSHVEELAPSTGFPVHSLFENDLEARPSTFGQLSNDQRAMGSELEDNNVFNSPVSESGASFHNVEYSSPLPGMPIWRNASAP
Subjt:  GQEGISHSMRESLPPSAIDSHVEELAPSTGFPVHSLFENDLEARPSTFGQLSNDQRAMGSELEDNNVFNSPVSESGASFHNVEYSSPLPGMPIWRNASAP

Query:  ALPIDVGFADKDIPTSNSFELPDDDGNKNIQNARVAGYDAYSDLKLKIEVEQDHLKSPNATAEVYLAELSNSLMNMSNEDELLFMDVDGKDALDKSYYDG
        ALPIDVGFADKDIPTSNSFELPDDDGNKNIQNARVAGYDAYSDLKLKIEVEQDHLKSPNATAEVYLAELSNSLMNMSNEDELLFMDVDGKDALDKSYYDG
Subjt:  ALPIDVGFADKDIPTSNSFELPDDDGNKNIQNARVAGYDAYSDLKLKIEVEQDHLKSPNATAEVYLAELSNSLMNMSNEDELLFMDVDGKDALDKSYYDG

Query:  LSSLLLNSPNEINHDQTANAINAETVLPTDTMVDPPTACSGGLYEKGSHCGVGHLDCTSEAHSSPSASLNNQCPVKGDEPLFCTLNTEDPDIPSNDDVFL
        LSSLLLNSPNEINHDQTANAINAETVLPTDTMVDPPTACSGGLYEKGSHCGVGHLDCTSEAHSSPSASLNNQCPVKGDEPLFCTLNTEDPDIPSNDDVFL
Subjt:  LSSLLLNSPNEINHDQTANAINAETVLPTDTMVDPPTACSGGLYEKGSHCGVGHLDCTSEAHSSPSASLNNQCPVKGDEPLFCTLNTEDPDIPSNDDVFL

Query:  PPLSTMATMGYNFQDCINTTFSSTKDFTYNEKSGETQNLGRERKNHGAGVNYRLHGFSERGEKHPVGGAGVNYRSSHSNARHLPSVSNVGSINGNSDAAL
        PPLSTMATMGYNFQDCINTTFSSTKDFTYNEKSGETQNLGRERKNHGAGVNYRLHGFSERGEKHPVGGAGVNYRSSHSNARHLPSVSNVGSINGNSDAAL
Subjt:  PPLSTMATMGYNFQDCINTTFSSTKDFTYNEKSGETQNLGRERKNHGAGVNYRLHGFSERGEKHPVGGAGVNYRSSHSNARHLPSVSNVGSINGNSDAAL

Query:  PAVLKEENNEISRVNHLGENFLNAHAEKPGFDSDNVRIYPPSAACDIKQEPDILASLKDHRLSQEGGTRGTFGVEQGGLSSTSDQEELSIDSEDDVPHFS
        PAVLKEENNEISRVNHLGENFLNAHAEKPGFDSDNVRIYPPSAACDIKQEPDILASLKDHRLSQEGGTRGTFGVEQGGLSSTSDQEELSIDSEDDVPHFS
Subjt:  PAVLKEENNEISRVNHLGENFLNAHAEKPGFDSDNVRIYPPSAACDIKQEPDILASLKDHRLSQEGGTRGTFGVEQGGLSSTSDQEELSIDSEDDVPHFS

Query:  DIEAMILDMDLDPEDQDLYSSEEVLKYQHVDTKKRIIRLEQGANAYMQRSTASHGALAVLYGRYSKHYIKKSEVLLGRATEDVIVDIDLGREGSGNKISR
        DIEAMILDMDLDPEDQDLYSSEEVLKYQHVDTKKRIIRLEQGANAYMQRSTASHGALAVLYGRYSKHYIKKSEVLLGRATEDVIVDIDLGREGSGNKISR
Subjt:  DIEAMILDMDLDPEDQDLYSSEEVLKYQHVDTKKRIIRLEQGANAYMQRSTASHGALAVLYGRYSKHYIKKSEVLLGRATEDVIVDIDLGREGSGNKISR

Query:  RQAIIKLDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPTRMKQYVDNVGKISHKQEYQS
        RQAIIKLDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPTRMKQYVDNVGKISHKQEYQS
Subjt:  RQAIIKLDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPTRMKQYVDNVGKISHKQEYQS

XP_022967567.1 uncharacterized protein LOC111467030 isoform X1 [Cucurbita maxima]0.0e+0096.82Show/hide
Query:  MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDFERSSSILPSKFNRFGNPKETKYIGGKRKS
        MGALAPV PWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDFERSSSILPSKFN+FGNPKETKYIGGKRKS
Subjt:  MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDFERSSSILPSKFNRFGNPKETKYIGGKRKS

Query:  GSVRHCYYALRKRICNEPFNNPMDLNFLVGPSNSNYVVEEPMSGNCIPPISDDFGLQSSEMGILPCDFSQNVMNTDDVEHTFQSGCQGTVEKHFPRNLDN
        GSVRHCYYALRKRICNEPFNNPM+L+FLVGPSNSNYVVEEPMSGNCIPPISDDFGLQSSE+GILPCDFSQNVMNTDDV+HTFQSGCQGTVEKHFPRNLDN
Subjt:  GSVRHCYYALRKRICNEPFNNPMDLNFLVGPSNSNYVVEEPMSGNCIPPISDDFGLQSSEMGILPCDFSQNVMNTDDVEHTFQSGCQGTVEKHFPRNLDN

Query:  GQEGISHSMRESLPPSAIDSHVEELAPSTGFPVHSLFENDLEARPSTFGQLSNDQRAMGSELEDNNVFNSPVSESGASFHNVEYSSPLPGMPIWRNASAP
        GQEGISHSMRESLPPSAIDSHVEELAPST FPVHSLFENDLEARPSTFGQLSNDQRAMGSELEDNNVFNSPVSESGASFHNVEYSSPLPGMPIWRNASAP
Subjt:  GQEGISHSMRESLPPSAIDSHVEELAPSTGFPVHSLFENDLEARPSTFGQLSNDQRAMGSELEDNNVFNSPVSESGASFHNVEYSSPLPGMPIWRNASAP

Query:  ALPIDVGFADKDIPTSNSFELPDDDGNKNIQNARVAGYDAYSDLKLKIEVEQDHLKSPNATAEVYLAELSNSLMNMSNEDELLFMDVDGKDALDKSYYDG
        ALPIDVGFADKDIPTSNSFELPDDDGNKNIQNA VAGYDAY+DLKLK EVEQDHLKSPNATAEVYLAELSNSLMNMSNEDELLFMDVDGKDALDKSYYDG
Subjt:  ALPIDVGFADKDIPTSNSFELPDDDGNKNIQNARVAGYDAYSDLKLKIEVEQDHLKSPNATAEVYLAELSNSLMNMSNEDELLFMDVDGKDALDKSYYDG

Query:  LSSLLLNSPNEINHDQTANAINAETVLPTDTMVDPPTACSGGLYEKGSHCGVGHLDCTSEAHSSPSASLNNQCPVKGDEPLFCTLNTEDPDIPSNDDVFL
        LSSLLLNSPNEINHDQT NAIN+ET+LPTDTMVDPPTACS GLYEKGSHCGVGHLDCTSEAHSSPSASLN+ CPVK DEPLFCTLNTEDPDIPSNDDVFL
Subjt:  LSSLLLNSPNEINHDQTANAINAETVLPTDTMVDPPTACSGGLYEKGSHCGVGHLDCTSEAHSSPSASLNNQCPVKGDEPLFCTLNTEDPDIPSNDDVFL

Query:  PPLSTMATMGYNFQDCINTTFSSTKDFTYNEKSGETQNLGRERKNHG-AGVNYRLHGFSERGEKHPVGGAGVNYRSSHSNARHLPSVSNVGSINGNSDAA
        PPLSTMATMGYNFQDCI+TTFSSTKDFTYNEKSGETQNLGRERKNHG + V   L+GFSERGEKHPVGGAGVNYRSSHSNARHLPSVSNVGSINGNSDAA
Subjt:  PPLSTMATMGYNFQDCINTTFSSTKDFTYNEKSGETQNLGRERKNHG-AGVNYRLHGFSERGEKHPVGGAGVNYRSSHSNARHLPSVSNVGSINGNSDAA

Query:  LPAVLKEENNEISRVNHLGENFLNAHAEKPGFDSDNVRIYPPSAACDIKQEPDILASLKDHRLSQEGGTRGTFGVEQGGLSSTSDQEELSIDSEDDVPHF
        LPAVLKEENNEISRVNHLGENFLNAHAEKPGFDSDNVRIYPPSAACDIKQEP+ILASLKDHRLSQEGGTRGTFGVEQGGLSSTSDQEELSIDSEDDVPHF
Subjt:  LPAVLKEENNEISRVNHLGENFLNAHAEKPGFDSDNVRIYPPSAACDIKQEPDILASLKDHRLSQEGGTRGTFGVEQGGLSSTSDQEELSIDSEDDVPHF

Query:  SDIEAMILDMDLDPEDQDLYSSEEVLKYQHVDTKKRIIRLEQGANAYMQRSTASHGALAVLYGRYSKHYIKKSEVLLGRATEDVIVDIDLGREGSGNKIS
        SDIEAMILDMDLDPEDQDLYSSEEVLKYQHVDTKKRIIRLEQGANAYMQRSTASHGALAVLYGRYSKHYIKKSEVLLGRATEDVIVDIDLGREGSGNKIS
Subjt:  SDIEAMILDMDLDPEDQDLYSSEEVLKYQHVDTKKRIIRLEQGANAYMQRSTASHGALAVLYGRYSKHYIKKSEVLLGRATEDVIVDIDLGREGSGNKIS

Query:  RRQAIIKLDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPTRMKQYVDNVG-KISHKQEYQS
        RRQAIIKLDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPTRMKQYVDN+G KISHKQEYQS
Subjt:  RRQAIIKLDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPTRMKQYVDNVG-KISHKQEYQS

XP_023554018.1 uncharacterized protein LOC111811415 isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0098.07Show/hide
Query:  MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDFERSSSILPSKFNRFGNPKETKYIGGKRKS
        MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDFERSSSILPSKFN+FGNPKETKYIGGKRKS
Subjt:  MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDFERSSSILPSKFNRFGNPKETKYIGGKRKS

Query:  GSVRHCYYALRKRICNEPFNNPMDLNFLVGPSNSNYVVEEPMSGNCIPPISDDFGLQSSEMGILPCDFSQNVMNTDDVEHTFQSGCQGTVEKHFPRNLDN
        GSVRHCYYALRKR+CNEPFNNPMDLNFLVGPSNSNYVVEEPMSGNCIPPISDDFGLQSSEMGILPCDFSQNVMNTDDVEHTFQSGCQGTVEKHFPRNLDN
Subjt:  GSVRHCYYALRKRICNEPFNNPMDLNFLVGPSNSNYVVEEPMSGNCIPPISDDFGLQSSEMGILPCDFSQNVMNTDDVEHTFQSGCQGTVEKHFPRNLDN

Query:  GQEGISHSMRESLPPSAIDSHVEELAPSTGFPVHSLFENDLEARPSTFGQLSNDQRAMGSELEDNNVFNSPVSESGASFHNVEYSSPLPGMPIWRNASAP
        GQEGISHSMRESLPPSAIDSHVE LAPSTGFPVHS+FENDLEARPSTFGQLSNDQR MGSELEDNNVFNSPVSESGASFHNVEYSSPLPGMPIWRNAS P
Subjt:  GQEGISHSMRESLPPSAIDSHVEELAPSTGFPVHSLFENDLEARPSTFGQLSNDQRAMGSELEDNNVFNSPVSESGASFHNVEYSSPLPGMPIWRNASAP

Query:  ALPIDVGFADKDIPTSNSFELPDDDGNKNIQNARVAGYDAYSDLKLKIEVEQDHLKSPNATAEVYLAELSNSLMNMSNEDELLFMDVDGKDALDKSYYDG
        ALPIDVGFADKDIPTSNSFELPDDDGNKNIQNARVAGYDAYSDLKLKIEVEQDHLKSPNATAEVYLAELSNSLMNMSNEDELLFMDVDGKDALDKSYYDG
Subjt:  ALPIDVGFADKDIPTSNSFELPDDDGNKNIQNARVAGYDAYSDLKLKIEVEQDHLKSPNATAEVYLAELSNSLMNMSNEDELLFMDVDGKDALDKSYYDG

Query:  LSSLLLNSPNEINHDQTANAINAETVLPTDTMVDPPTACSGGLYEKGSHCGVGHLDCTSEAHSSPSASLNNQCPVKGDEPLFCTLNTEDPDIPSNDDVFL
        LSSLLLNSPNEINHDQT NAINAETVLPTDTMVDPPTACSGGLYEKGS CGVGHLDCTSEAHSSPSASLN+QCPVKGDEPLFCTLNTEDPDIPSNDDVFL
Subjt:  LSSLLLNSPNEINHDQTANAINAETVLPTDTMVDPPTACSGGLYEKGSHCGVGHLDCTSEAHSSPSASLNNQCPVKGDEPLFCTLNTEDPDIPSNDDVFL

Query:  PPLSTMATMGYNFQDCINTTFSSTKDFTYNEKSGETQNLGRERKNHG-AGVNYRLHGFSERGEKHPVGGAGVNYRSSHSNARHLPSVSNVGSINGNSDAA
        PPLSTMATMGYNFQDCINTTFSSTKDFTYNEKSGETQNLGRERKNHG   V   LHGFSERGEKHPVGGAGVNYRSSHSNARHLPSVSNVGSINGNSDAA
Subjt:  PPLSTMATMGYNFQDCINTTFSSTKDFTYNEKSGETQNLGRERKNHG-AGVNYRLHGFSERGEKHPVGGAGVNYRSSHSNARHLPSVSNVGSINGNSDAA

Query:  LPAVLKEENNEISRVNHLGENFLNAHAEKPGFDSDNVRIYPPSAACDIKQEPDILASLKDHRLSQEGGTRGTFGVEQGGLSSTSDQEELSIDSEDDVPHF
        LPAVLKEENNEISRVNHLGENFLNAHA+KPGFDSDNVR+YPPSAACDIKQEPDILASLKDHRLSQEGGTRGTFGVEQGGLSSTSDQEELSIDSEDDVPHF
Subjt:  LPAVLKEENNEISRVNHLGENFLNAHAEKPGFDSDNVRIYPPSAACDIKQEPDILASLKDHRLSQEGGTRGTFGVEQGGLSSTSDQEELSIDSEDDVPHF

Query:  SDIEAMILDMDLDPEDQDLYSSEEVLKYQHVDTKKRIIRLEQGANAYMQRSTASHGALAVLYGRYSKHYIKKSEVLLGRATEDVIVDIDLGREGSGNKIS
        SDIEAMILDMDLDPEDQDLYSSEEVLKYQHVDTKKRIIRLEQGANAYMQRSTASHGALAVLYGRYSKHYIKKSEVLLGRATEDVIVDIDLGREGSGNKIS
Subjt:  SDIEAMILDMDLDPEDQDLYSSEEVLKYQHVDTKKRIIRLEQGANAYMQRSTASHGALAVLYGRYSKHYIKKSEVLLGRATEDVIVDIDLGREGSGNKIS

Query:  RRQAIIKLDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPTRMKQYVDNVGKISHKQEYQS
        RRQAIIKLDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPTRMKQYVDNVGKISHKQEYQS
Subjt:  RRQAIIKLDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPTRMKQYVDNVGKISHKQEYQS

TrEMBL top hitse value%identityAlignment
A0A6J1C5Q1 uncharacterized protein LOC111007538 isoform X10.0e+0082.02Show/hide
Query:  MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDFERSSSILPSKFNRFGNPKETKYIGGKRKS
        MGALAPVAPWTPEDDILLKNA+EAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSE+ASMSMIDFERSSSILPSKFN+FGNPKETK IGGKRK 
Subjt:  MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDFERSSSILPSKFNRFGNPKETKYIGGKRKS

Query:  GSVRHCYYALRKRICNEPFNNPMDLNFLVGPSNSNYVVEEPMSGNCIPPISDDFGLQSSEMGILPCDFSQNVMNTDDVEHTFQSGCQGTVEKHFPRNLDN
        GSVR CYYALRKRICNEPF NPMDL+FLVGPS+SNYVVEEPMSG+CIPPIS DFGLQ SE+GILP +F+ N+MN DD E TF S CQ TVEKHFP NLDN
Subjt:  GSVRHCYYALRKRICNEPFNNPMDLNFLVGPSNSNYVVEEPMSGNCIPPISDDFGLQSSEMGILPCDFSQNVMNTDDVEHTFQSGCQGTVEKHFPRNLDN

Query:  GQEGISHSMRESLPPSAIDSHVEELAPSTGFPVHSLFENDLEARPSTFGQLSNDQRAMGSELEDNNVFNSPVSESGASFHNVEYSSPLPGMPIWRNASAP
          EGI H MRE+LP S  +S VEELAPS  FPVHSLFENDLE RPSTFGQ S DQRAMGSELEDN VFNSPVS+SGASFHNVEYSSPLPGMPIWRNASAP
Subjt:  GQEGISHSMRESLPPSAIDSHVEELAPSTGFPVHSLFENDLEARPSTFGQLSNDQRAMGSELEDNNVFNSPVSESGASFHNVEYSSPLPGMPIWRNASAP

Query:  ALPIDVGFADKDIPTSNSFELPDDDGNKNIQNARVAGYDAYSDLKLKIEVEQDHLKSPNATAEVYLAELSNSLMNMSNEDELLFMDVDGKDALDKSYYDG
        ALPIDVGF+DKD+PT +SFELPDDDGN NIQNAR+A YDA SD KLKIEV+ DHLKSPNATAEVYLAELSNSL+N++NEDELLFMD DGKD +DKSYYDG
Subjt:  ALPIDVGFADKDIPTSNSFELPDDDGNKNIQNARVAGYDAYSDLKLKIEVEQDHLKSPNATAEVYLAELSNSLMNMSNEDELLFMDVDGKDALDKSYYDG

Query:  LSSLLLNSPNEINHDQTANAINAETVLPTDTMVDPPTACSGGLYEKGSHCGVGHLDCTSEAHSSPSASLNNQCPVKGDEPLFCTLNTEDPDIPSNDDVFL
        LSSLLLNSPNE+NHDQTA+A+N ET+LPTD+MVDPPTACSG LYEKGSHC  GHLDC+ E H SPSASLN+QC  KGDEPLFCTLNTEDP+IPSNDDVFL
Subjt:  LSSLLLNSPNEINHDQTANAINAETVLPTDTMVDPPTACSGGLYEKGSHCGVGHLDCTSEAHSSPSASLNNQCPVKGDEPLFCTLNTEDPDIPSNDDVFL

Query:  PPLSTMATMGYNFQDCINTTFSSTKDFTYNEKSGE-TQNL-GRERKNHG----AGVNYRLHGFSERGEKHPVGGAGVNYRSSHSNARHLPSVSNVG----
        PPLST+++MGY+FQD I+ TFSS KDF+ NEKSGE TQNL  RERKNHG    + ++  LHG  ERGEKH VGGA VN +  HSN+ H+PS +N G    
Subjt:  PPLSTMATMGYNFQDCINTTFSSTKDFTYNEKSGE-TQNL-GRERKNHG----AGVNYRLHGFSERGEKHPVGGAGVNYRSSHSNARHLPSVSNVG----

Query:  --SINGNSDAALPAVLKEENNEISRVNHLGENFLNAHAEKPGFDSDNVRIYPPSAACDIKQEPDILASLKDHRLSQEGGTRGTFGVEQGGLSSTSDQEEL
          SIN N DA LP  LKEE+ EISRVNHLG+NFLN H EKPGFDS+N R YPPS A  IKQEPDIL  +KDHRLSQE G+RG FGVEQ G+SSTSDQEEL
Subjt:  --SINGNSDAALPAVLKEENNEISRVNHLGENFLNAHAEKPGFDSDNVRIYPPSAACDIKQEPDILASLKDHRLSQEGGTRGTFGVEQGGLSSTSDQEEL

Query:  SIDSEDDVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQHVDTKKRIIRLEQGANAYMQRSTASHGALAVLYGRYSKHYIKKSEVLLGRATEDVIVDID
        SIDSEDDVPHFSDIEAMILDMDLDPEDQDLY+SEEVL+YQH+DTKKRI+RLEQGA+A M+RS ASHGALAVLYGRYSKHYIKKSEVLLGRATEDVIVDID
Subjt:  SIDSEDDVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQHVDTKKRIIRLEQGANAYMQRSTASHGALAVLYGRYSKHYIKKSEVLLGRATEDVIVDID

Query:  LGREGSGNKISRRQAIIKLDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPTRMKQYVDNVGKISHKQEYQS
        LGREGSGNKISRRQAIIK+DQDGFFSLKNLGKCSISINNK+VAPGHCLRLNSGCLIEIRGM FIFES+P  MKQY+DN+GK SHKQEYQS
Subjt:  LGREGSGNKISRRQAIIKLDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPTRMKQYVDNVGKISHKQEYQS

A0A6J1HIH5 uncharacterized protein LOC111463912 isoform X10.0e+00100Show/hide
Query:  MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDFERSSSILPSKFNRFGNPKETKYIGGKRKS
        MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDFERSSSILPSKFNRFGNPKETKYIGGKRKS
Subjt:  MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDFERSSSILPSKFNRFGNPKETKYIGGKRKS

Query:  GSVRHCYYALRKRICNEPFNNPMDLNFLVGPSNSNYVVEEPMSGNCIPPISDDFGLQSSEMGILPCDFSQNVMNTDDVEHTFQSGCQGTVEKHFPRNLDN
        GSVRHCYYALRKRICNEPFNNPMDLNFLVGPSNSNYVVEEPMSGNCIPPISDDFGLQSSEMGILPCDFSQNVMNTDDVEHTFQSGCQGTVEKHFPRNLDN
Subjt:  GSVRHCYYALRKRICNEPFNNPMDLNFLVGPSNSNYVVEEPMSGNCIPPISDDFGLQSSEMGILPCDFSQNVMNTDDVEHTFQSGCQGTVEKHFPRNLDN

Query:  GQEGISHSMRESLPPSAIDSHVEELAPSTGFPVHSLFENDLEARPSTFGQLSNDQRAMGSELEDNNVFNSPVSESGASFHNVEYSSPLPGMPIWRNASAP
        GQEGISHSMRESLPPSAIDSHVEELAPSTGFPVHSLFENDLEARPSTFGQLSNDQRAMGSELEDNNVFNSPVSESGASFHNVEYSSPLPGMPIWRNASAP
Subjt:  GQEGISHSMRESLPPSAIDSHVEELAPSTGFPVHSLFENDLEARPSTFGQLSNDQRAMGSELEDNNVFNSPVSESGASFHNVEYSSPLPGMPIWRNASAP

Query:  ALPIDVGFADKDIPTSNSFELPDDDGNKNIQNARVAGYDAYSDLKLKIEVEQDHLKSPNATAEVYLAELSNSLMNMSNEDELLFMDVDGKDALDKSYYDG
        ALPIDVGFADKDIPTSNSFELPDDDGNKNIQNARVAGYDAYSDLKLKIEVEQDHLKSPNATAEVYLAELSNSLMNMSNEDELLFMDVDGKDALDKSYYDG
Subjt:  ALPIDVGFADKDIPTSNSFELPDDDGNKNIQNARVAGYDAYSDLKLKIEVEQDHLKSPNATAEVYLAELSNSLMNMSNEDELLFMDVDGKDALDKSYYDG

Query:  LSSLLLNSPNEINHDQTANAINAETVLPTDTMVDPPTACSGGLYEKGSHCGVGHLDCTSEAHSSPSASLNNQCPVKGDEPLFCTLNTEDPDIPSNDDVFL
        LSSLLLNSPNEINHDQTANAINAETVLPTDTMVDPPTACSGGLYEKGSHCGVGHLDCTSEAHSSPSASLNNQCPVKGDEPLFCTLNTEDPDIPSNDDVFL
Subjt:  LSSLLLNSPNEINHDQTANAINAETVLPTDTMVDPPTACSGGLYEKGSHCGVGHLDCTSEAHSSPSASLNNQCPVKGDEPLFCTLNTEDPDIPSNDDVFL

Query:  PPLSTMATMGYNFQDCINTTFSSTKDFTYNEKSGETQNLGRERKNHGAGVNYRLHGFSERGEKHPVGGAGVNYRSSHSNARHLPSVSNVGSINGNSDAAL
        PPLSTMATMGYNFQDCINTTFSSTKDFTYNEKSGETQNLGRERKNHGAGVNYRLHGFSERGEKHPVGGAGVNYRSSHSNARHLPSVSNVGSINGNSDAAL
Subjt:  PPLSTMATMGYNFQDCINTTFSSTKDFTYNEKSGETQNLGRERKNHGAGVNYRLHGFSERGEKHPVGGAGVNYRSSHSNARHLPSVSNVGSINGNSDAAL

Query:  PAVLKEENNEISRVNHLGENFLNAHAEKPGFDSDNVRIYPPSAACDIKQEPDILASLKDHRLSQEGGTRGTFGVEQGGLSSTSDQEELSIDSEDDVPHFS
        PAVLKEENNEISRVNHLGENFLNAHAEKPGFDSDNVRIYPPSAACDIKQEPDILASLKDHRLSQEGGTRGTFGVEQGGLSSTSDQEELSIDSEDDVPHFS
Subjt:  PAVLKEENNEISRVNHLGENFLNAHAEKPGFDSDNVRIYPPSAACDIKQEPDILASLKDHRLSQEGGTRGTFGVEQGGLSSTSDQEELSIDSEDDVPHFS

Query:  DIEAMILDMDLDPEDQDLYSSEEVLKYQHVDTKKRIIRLEQGANAYMQRSTASHGALAVLYGRYSKHYIKKSEVLLGRATEDVIVDIDLGREGSGNKISR
        DIEAMILDMDLDPEDQDLYSSEEVLKYQHVDTKKRIIRLEQGANAYMQRSTASHGALAVLYGRYSKHYIKKSEVLLGRATEDVIVDIDLGREGSGNKISR
Subjt:  DIEAMILDMDLDPEDQDLYSSEEVLKYQHVDTKKRIIRLEQGANAYMQRSTASHGALAVLYGRYSKHYIKKSEVLLGRATEDVIVDIDLGREGSGNKISR

Query:  RQAIIKLDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPTRMKQYVDNVGKISHKQEYQS
        RQAIIKLDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPTRMKQYVDNVGKISHKQEYQS
Subjt:  RQAIIKLDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPTRMKQYVDNVGKISHKQEYQS

A0A6J1HKT2 uncharacterized protein LOC111463912 isoform X20.0e+00100Show/hide
Query:  MSMIDFERSSSILPSKFNRFGNPKETKYIGGKRKSGSVRHCYYALRKRICNEPFNNPMDLNFLVGPSNSNYVVEEPMSGNCIPPISDDFGLQSSEMGILP
        MSMIDFERSSSILPSKFNRFGNPKETKYIGGKRKSGSVRHCYYALRKRICNEPFNNPMDLNFLVGPSNSNYVVEEPMSGNCIPPISDDFGLQSSEMGILP
Subjt:  MSMIDFERSSSILPSKFNRFGNPKETKYIGGKRKSGSVRHCYYALRKRICNEPFNNPMDLNFLVGPSNSNYVVEEPMSGNCIPPISDDFGLQSSEMGILP

Query:  CDFSQNVMNTDDVEHTFQSGCQGTVEKHFPRNLDNGQEGISHSMRESLPPSAIDSHVEELAPSTGFPVHSLFENDLEARPSTFGQLSNDQRAMGSELEDN
        CDFSQNVMNTDDVEHTFQSGCQGTVEKHFPRNLDNGQEGISHSMRESLPPSAIDSHVEELAPSTGFPVHSLFENDLEARPSTFGQLSNDQRAMGSELEDN
Subjt:  CDFSQNVMNTDDVEHTFQSGCQGTVEKHFPRNLDNGQEGISHSMRESLPPSAIDSHVEELAPSTGFPVHSLFENDLEARPSTFGQLSNDQRAMGSELEDN

Query:  NVFNSPVSESGASFHNVEYSSPLPGMPIWRNASAPALPIDVGFADKDIPTSNSFELPDDDGNKNIQNARVAGYDAYSDLKLKIEVEQDHLKSPNATAEVY
        NVFNSPVSESGASFHNVEYSSPLPGMPIWRNASAPALPIDVGFADKDIPTSNSFELPDDDGNKNIQNARVAGYDAYSDLKLKIEVEQDHLKSPNATAEVY
Subjt:  NVFNSPVSESGASFHNVEYSSPLPGMPIWRNASAPALPIDVGFADKDIPTSNSFELPDDDGNKNIQNARVAGYDAYSDLKLKIEVEQDHLKSPNATAEVY

Query:  LAELSNSLMNMSNEDELLFMDVDGKDALDKSYYDGLSSLLLNSPNEINHDQTANAINAETVLPTDTMVDPPTACSGGLYEKGSHCGVGHLDCTSEAHSSP
        LAELSNSLMNMSNEDELLFMDVDGKDALDKSYYDGLSSLLLNSPNEINHDQTANAINAETVLPTDTMVDPPTACSGGLYEKGSHCGVGHLDCTSEAHSSP
Subjt:  LAELSNSLMNMSNEDELLFMDVDGKDALDKSYYDGLSSLLLNSPNEINHDQTANAINAETVLPTDTMVDPPTACSGGLYEKGSHCGVGHLDCTSEAHSSP

Query:  SASLNNQCPVKGDEPLFCTLNTEDPDIPSNDDVFLPPLSTMATMGYNFQDCINTTFSSTKDFTYNEKSGETQNLGRERKNHGAGVNYRLHGFSERGEKHP
        SASLNNQCPVKGDEPLFCTLNTEDPDIPSNDDVFLPPLSTMATMGYNFQDCINTTFSSTKDFTYNEKSGETQNLGRERKNHGAGVNYRLHGFSERGEKHP
Subjt:  SASLNNQCPVKGDEPLFCTLNTEDPDIPSNDDVFLPPLSTMATMGYNFQDCINTTFSSTKDFTYNEKSGETQNLGRERKNHGAGVNYRLHGFSERGEKHP

Query:  VGGAGVNYRSSHSNARHLPSVSNVGSINGNSDAALPAVLKEENNEISRVNHLGENFLNAHAEKPGFDSDNVRIYPPSAACDIKQEPDILASLKDHRLSQE
        VGGAGVNYRSSHSNARHLPSVSNVGSINGNSDAALPAVLKEENNEISRVNHLGENFLNAHAEKPGFDSDNVRIYPPSAACDIKQEPDILASLKDHRLSQE
Subjt:  VGGAGVNYRSSHSNARHLPSVSNVGSINGNSDAALPAVLKEENNEISRVNHLGENFLNAHAEKPGFDSDNVRIYPPSAACDIKQEPDILASLKDHRLSQE

Query:  GGTRGTFGVEQGGLSSTSDQEELSIDSEDDVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQHVDTKKRIIRLEQGANAYMQRSTASHGALAVLYGRYS
        GGTRGTFGVEQGGLSSTSDQEELSIDSEDDVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQHVDTKKRIIRLEQGANAYMQRSTASHGALAVLYGRYS
Subjt:  GGTRGTFGVEQGGLSSTSDQEELSIDSEDDVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQHVDTKKRIIRLEQGANAYMQRSTASHGALAVLYGRYS

Query:  KHYIKKSEVLLGRATEDVIVDIDLGREGSGNKISRRQAIIKLDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPTRMKQYVD
        KHYIKKSEVLLGRATEDVIVDIDLGREGSGNKISRRQAIIKLDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPTRMKQYVD
Subjt:  KHYIKKSEVLLGRATEDVIVDIDLGREGSGNKISRRQAIIKLDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPTRMKQYVD

Query:  NVGKISHKQEYQS
        NVGKISHKQEYQS
Subjt:  NVGKISHKQEYQS

A0A6J1HR64 uncharacterized protein LOC111467030 isoform X10.0e+0096.82Show/hide
Query:  MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDFERSSSILPSKFNRFGNPKETKYIGGKRKS
        MGALAPV PWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDFERSSSILPSKFN+FGNPKETKYIGGKRKS
Subjt:  MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDFERSSSILPSKFNRFGNPKETKYIGGKRKS

Query:  GSVRHCYYALRKRICNEPFNNPMDLNFLVGPSNSNYVVEEPMSGNCIPPISDDFGLQSSEMGILPCDFSQNVMNTDDVEHTFQSGCQGTVEKHFPRNLDN
        GSVRHCYYALRKRICNEPFNNPM+L+FLVGPSNSNYVVEEPMSGNCIPPISDDFGLQSSE+GILPCDFSQNVMNTDDV+HTFQSGCQGTVEKHFPRNLDN
Subjt:  GSVRHCYYALRKRICNEPFNNPMDLNFLVGPSNSNYVVEEPMSGNCIPPISDDFGLQSSEMGILPCDFSQNVMNTDDVEHTFQSGCQGTVEKHFPRNLDN

Query:  GQEGISHSMRESLPPSAIDSHVEELAPSTGFPVHSLFENDLEARPSTFGQLSNDQRAMGSELEDNNVFNSPVSESGASFHNVEYSSPLPGMPIWRNASAP
        GQEGISHSMRESLPPSAIDSHVEELAPST FPVHSLFENDLEARPSTFGQLSNDQRAMGSELEDNNVFNSPVSESGASFHNVEYSSPLPGMPIWRNASAP
Subjt:  GQEGISHSMRESLPPSAIDSHVEELAPSTGFPVHSLFENDLEARPSTFGQLSNDQRAMGSELEDNNVFNSPVSESGASFHNVEYSSPLPGMPIWRNASAP

Query:  ALPIDVGFADKDIPTSNSFELPDDDGNKNIQNARVAGYDAYSDLKLKIEVEQDHLKSPNATAEVYLAELSNSLMNMSNEDELLFMDVDGKDALDKSYYDG
        ALPIDVGFADKDIPTSNSFELPDDDGNKNIQNA VAGYDAY+DLKLK EVEQDHLKSPNATAEVYLAELSNSLMNMSNEDELLFMDVDGKDALDKSYYDG
Subjt:  ALPIDVGFADKDIPTSNSFELPDDDGNKNIQNARVAGYDAYSDLKLKIEVEQDHLKSPNATAEVYLAELSNSLMNMSNEDELLFMDVDGKDALDKSYYDG

Query:  LSSLLLNSPNEINHDQTANAINAETVLPTDTMVDPPTACSGGLYEKGSHCGVGHLDCTSEAHSSPSASLNNQCPVKGDEPLFCTLNTEDPDIPSNDDVFL
        LSSLLLNSPNEINHDQT NAIN+ET+LPTDTMVDPPTACS GLYEKGSHCGVGHLDCTSEAHSSPSASLN+ CPVK DEPLFCTLNTEDPDIPSNDDVFL
Subjt:  LSSLLLNSPNEINHDQTANAINAETVLPTDTMVDPPTACSGGLYEKGSHCGVGHLDCTSEAHSSPSASLNNQCPVKGDEPLFCTLNTEDPDIPSNDDVFL

Query:  PPLSTMATMGYNFQDCINTTFSSTKDFTYNEKSGETQNLGRERKNHG-AGVNYRLHGFSERGEKHPVGGAGVNYRSSHSNARHLPSVSNVGSINGNSDAA
        PPLSTMATMGYNFQDCI+TTFSSTKDFTYNEKSGETQNLGRERKNHG + V   L+GFSERGEKHPVGGAGVNYRSSHSNARHLPSVSNVGSINGNSDAA
Subjt:  PPLSTMATMGYNFQDCINTTFSSTKDFTYNEKSGETQNLGRERKNHG-AGVNYRLHGFSERGEKHPVGGAGVNYRSSHSNARHLPSVSNVGSINGNSDAA

Query:  LPAVLKEENNEISRVNHLGENFLNAHAEKPGFDSDNVRIYPPSAACDIKQEPDILASLKDHRLSQEGGTRGTFGVEQGGLSSTSDQEELSIDSEDDVPHF
        LPAVLKEENNEISRVNHLGENFLNAHAEKPGFDSDNVRIYPPSAACDIKQEP+ILASLKDHRLSQEGGTRGTFGVEQGGLSSTSDQEELSIDSEDDVPHF
Subjt:  LPAVLKEENNEISRVNHLGENFLNAHAEKPGFDSDNVRIYPPSAACDIKQEPDILASLKDHRLSQEGGTRGTFGVEQGGLSSTSDQEELSIDSEDDVPHF

Query:  SDIEAMILDMDLDPEDQDLYSSEEVLKYQHVDTKKRIIRLEQGANAYMQRSTASHGALAVLYGRYSKHYIKKSEVLLGRATEDVIVDIDLGREGSGNKIS
        SDIEAMILDMDLDPEDQDLYSSEEVLKYQHVDTKKRIIRLEQGANAYMQRSTASHGALAVLYGRYSKHYIKKSEVLLGRATEDVIVDIDLGREGSGNKIS
Subjt:  SDIEAMILDMDLDPEDQDLYSSEEVLKYQHVDTKKRIIRLEQGANAYMQRSTASHGALAVLYGRYSKHYIKKSEVLLGRATEDVIVDIDLGREGSGNKIS

Query:  RRQAIIKLDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPTRMKQYVDNVG-KISHKQEYQS
        RRQAIIKLDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPTRMKQYVDN+G KISHKQEYQS
Subjt:  RRQAIIKLDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPTRMKQYVDNVG-KISHKQEYQS

A0A6J1HVE8 uncharacterized protein LOC111467030 isoform X20.0e+0096.69Show/hide
Query:  MSMIDFERSSSILPSKFNRFGNPKETKYIGGKRKSGSVRHCYYALRKRICNEPFNNPMDLNFLVGPSNSNYVVEEPMSGNCIPPISDDFGLQSSEMGILP
        MSMIDFERSSSILPSKFN+FGNPKETKYIGGKRKSGSVRHCYYALRKRICNEPFNNPM+L+FLVGPSNSNYVVEEPMSGNCIPPISDDFGLQSSE+GILP
Subjt:  MSMIDFERSSSILPSKFNRFGNPKETKYIGGKRKSGSVRHCYYALRKRICNEPFNNPMDLNFLVGPSNSNYVVEEPMSGNCIPPISDDFGLQSSEMGILP

Query:  CDFSQNVMNTDDVEHTFQSGCQGTVEKHFPRNLDNGQEGISHSMRESLPPSAIDSHVEELAPSTGFPVHSLFENDLEARPSTFGQLSNDQRAMGSELEDN
        CDFSQNVMNTDDV+HTFQSGCQGTVEKHFPRNLDNGQEGISHSMRESLPPSAIDSHVEELAPST FPVHSLFENDLEARPSTFGQLSNDQRAMGSELEDN
Subjt:  CDFSQNVMNTDDVEHTFQSGCQGTVEKHFPRNLDNGQEGISHSMRESLPPSAIDSHVEELAPSTGFPVHSLFENDLEARPSTFGQLSNDQRAMGSELEDN

Query:  NVFNSPVSESGASFHNVEYSSPLPGMPIWRNASAPALPIDVGFADKDIPTSNSFELPDDDGNKNIQNARVAGYDAYSDLKLKIEVEQDHLKSPNATAEVY
        NVFNSPVSESGASFHNVEYSSPLPGMPIWRNASAPALPIDVGFADKDIPTSNSFELPDDDGNKNIQNA VAGYDAY+DLKLK EVEQDHLKSPNATAEVY
Subjt:  NVFNSPVSESGASFHNVEYSSPLPGMPIWRNASAPALPIDVGFADKDIPTSNSFELPDDDGNKNIQNARVAGYDAYSDLKLKIEVEQDHLKSPNATAEVY

Query:  LAELSNSLMNMSNEDELLFMDVDGKDALDKSYYDGLSSLLLNSPNEINHDQTANAINAETVLPTDTMVDPPTACSGGLYEKGSHCGVGHLDCTSEAHSSP
        LAELSNSLMNMSNEDELLFMDVDGKDALDKSYYDGLSSLLLNSPNEINHDQT NAIN+ET+LPTDTMVDPPTACS GLYEKGSHCGVGHLDCTSEAHSSP
Subjt:  LAELSNSLMNMSNEDELLFMDVDGKDALDKSYYDGLSSLLLNSPNEINHDQTANAINAETVLPTDTMVDPPTACSGGLYEKGSHCGVGHLDCTSEAHSSP

Query:  SASLNNQCPVKGDEPLFCTLNTEDPDIPSNDDVFLPPLSTMATMGYNFQDCINTTFSSTKDFTYNEKSGETQNLGRERKNHG-AGVNYRLHGFSERGEKH
        SASLN+ CPVK DEPLFCTLNTEDPDIPSNDDVFLPPLSTMATMGYNFQDCI+TTFSSTKDFTYNEKSGETQNLGRERKNHG + V   L+GFSERGEKH
Subjt:  SASLNNQCPVKGDEPLFCTLNTEDPDIPSNDDVFLPPLSTMATMGYNFQDCINTTFSSTKDFTYNEKSGETQNLGRERKNHG-AGVNYRLHGFSERGEKH

Query:  PVGGAGVNYRSSHSNARHLPSVSNVGSINGNSDAALPAVLKEENNEISRVNHLGENFLNAHAEKPGFDSDNVRIYPPSAACDIKQEPDILASLKDHRLSQ
        PVGGAGVNYRSSHSNARHLPSVSNVGSINGNSDAALPAVLKEENNEISRVNHLGENFLNAHAEKPGFDSDNVRIYPPSAACDIKQEP+ILASLKDHRLSQ
Subjt:  PVGGAGVNYRSSHSNARHLPSVSNVGSINGNSDAALPAVLKEENNEISRVNHLGENFLNAHAEKPGFDSDNVRIYPPSAACDIKQEPDILASLKDHRLSQ

Query:  EGGTRGTFGVEQGGLSSTSDQEELSIDSEDDVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQHVDTKKRIIRLEQGANAYMQRSTASHGALAVLYGRY
        EGGTRGTFGVEQGGLSSTSDQEELSIDSEDDVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQHVDTKKRIIRLEQGANAYMQRSTASHGALAVLYGRY
Subjt:  EGGTRGTFGVEQGGLSSTSDQEELSIDSEDDVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQHVDTKKRIIRLEQGANAYMQRSTASHGALAVLYGRY

Query:  SKHYIKKSEVLLGRATEDVIVDIDLGREGSGNKISRRQAIIKLDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPTRMKQYV
        SKHYIKKSEVLLGRATEDVIVDIDLGREGSGNKISRRQAIIKLDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPTRMKQYV
Subjt:  SKHYIKKSEVLLGRATEDVIVDIDLGREGSGNKISRRQAIIKLDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPTRMKQYV

Query:  DNVG-KISHKQEYQS
        DN+G KISHKQEYQS
Subjt:  DNVG-KISHKQEYQS

SwissProt top hitse value%identityAlignment
Q96EZ8 Microspherule protein 11.6e-1434.43Show/hide
Query:  DQEELSIDSEDDVPHFSDIEAMILDMDL-DPEDQDLYSSEEVLKYQHVDTKKRIIRLEQGANAY---------MQRSTASHGALAVLYGRYSKHYIKKSE
        DQ    +   D V +FSD E +I D  L D  D+ L   E  L       K+ I +LEQ  + +         M      +  LAVL GR  ++ ++  E
Subjt:  DQEELSIDSEDDVPHFSDIEAMILDMDL-DPEDQDLYSSEEVLKYQHVDTKKRIIRLEQGANAY---------MQRSTASHGALAVLYGRYSKHYIKKSE

Query:  VLLGRATEDVIVDIDLGREGSGNKISRRQAIIKLDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESN
        + LGRAT+D  +D+DL  EG   KISR+Q +IKL  +G F + N G+  I I+ + V  G   RL++  ++EI  + F+F  N
Subjt:  VLLGRATEDVIVDIDLGREGSGNKISRRQAIIKLDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESN

Q96EZ8 Microspherule protein 13.0e-0852.54Show/hide
Query:  WTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSM
        W P DD+LL NAV     L S+  G V+FS R+T+RE+QERW++LLYDP++S+ A  +M
Subjt:  WTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSM

Q99L90 Microspherule protein 12.8e-1434.43Show/hide
Query:  DQEELSIDSEDDVPHFSDIEAMILDMDL-DPEDQDLYSSEEVLKYQHVDTKKRIIRLEQGANAY---------MQRSTASHGALAVLYGRYSKHYIKKSE
        DQ    +   D V +FSD E +I D  L D  D+ L   E  L       K+ I +LEQ  + +         M      +  LAVL GR  ++ ++  E
Subjt:  DQEELSIDSEDDVPHFSDIEAMILDMDL-DPEDQDLYSSEEVLKYQHVDTKKRIIRLEQGANAY---------MQRSTASHGALAVLYGRYSKHYIKKSE

Query:  VLLGRATEDVIVDIDLGREGSGNKISRRQAIIKLDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESN
        + LGRAT+D  +D+DL  EG   KISR+Q +IKL  +G F + N G+  I I+ + V  G   RL++  ++EI  + F+F  N
Subjt:  VLLGRATEDVIVDIDLGREGSGNKISRRQAIIKLDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESN

Q99L90 Microspherule protein 13.0e-0852.54Show/hide
Query:  WTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSM
        W P DD+LL NAV     L S+  G V+FS R+T+RE+QERW++LLYDP++S+ A  +M
Subjt:  WTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSM

Arabidopsis top hitse value%identityAlignment
AT1G60700.1 SMAD/FHA domain-containing protein5.3e-2942.25Show/hide
Query:  STSDQEELSIDSEDDVPHFSDIEAMILDMDLDPEDQD-LYSSEEVLKYQHVDTKKRIIRLEQGANAYMQRSTASHGALAVLYGRYSKHYIKKSEVLLGRA
        ST  QEE  +D E+++    DI+AMI  ++L P+D D  ++ EE    +H   +  +I LEQ     MQR+   HGA+AVL+   SKH+++K EV++GR+
Subjt:  STSDQEELSIDSEDDVPHFSDIEAMILDMDLDPEDQD-LYSSEEVLKYQHVDTKKRIIRLEQGANAYMQRSTASHGALAVLYGRYSKHYIKKSEVLLGRA

Query:  TEDVIVDIDLGREGSGNKISRRQAIIKLDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPTRMKQYVDN
        +  + VDIDLG+   G+KISRRQA++KL+  G FSLKNLGK  I +N   +  G  + L S   I IRG+ F+F+ N   + Q++ N
Subjt:  TEDVIVDIDLGREGSGNKISRRQAIIKLDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPTRMKQYVDN

AT1G75530.1 Forkhead-associated (FHA) domain-containing protein2.5e-4752.15Show/hide
Query:  DQEELSIDSEDDVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQHVDTKKRIIRLEQGANAYMQRSTASHGALAVLYGRYSKHYIKKSEVLLGRATEDV
        ++  + I+S++++P FSD+EAMILDMDL+P  QD Y   +  KY++ +  ++I+RLEQ A +YM R  A+HGA A+LYG  SKHYI K EVLLGRAT + 
Subjt:  DQEELSIDSEDDVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQHVDTKKRIIRLEQGANAYMQRSTASHGALAVLYGRYSKHYIKKSEVLLGRATEDV

Query:  IVDIDLGREGSGNKISRRQAIIKLDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPTRMKQYVDNVGK
         VDIDLGR GS  + SRRQA+IKL QDG F +KNLGK SI +N++++  G  + L + CLI+IR   FIFE N   +K+Y+D + K
Subjt:  IVDIDLGREGSGNKISRRQAIIKLDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPTRMKQYVDNVGK

AT1G75530.1 Forkhead-associated (FHA) domain-containing protein3.7e-1441.35Show/hide
Query:  WTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDFERSSSILPSKFNRFGNPKETKYIGGKRKSGSVRHCYYA
        W PEDD LL+ ++E G SLE+LAKGAV+FSR++T+ EL ERWH LLY+P V+  +S    + +  +  +P                    S  VR  YY 
Subjt:  WTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDFERSSSILPSKFNRFGNPKETKYIGGKRKSGSVRHCYYA

Query:  LRKR
         RKR
Subjt:  LRKR

AT3G54350.1 Forkhead-associated (FHA) domain-containing protein1.7e-11235.63Show/hide
Query:  MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDFERSSSILPSKFNRFGNPKETKYIGGKRKS
        MGALA V PW PEDD+LLKNAVEAGASLESLAKGAVQFSRR+++RELQ+RWH+LLYDP+VS +A+  M + ER++   P+KF R G  KE K    KR +
Subjt:  MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDFERSSSILPSKFNRFGNPKETKYIGGKRKS

Query:  GSVRHCYYALRKRICNEPFNNPMDLNFLVGPSNSNYVVEEPMSGNCIPPISDDFGLQSSEMGILPCDFSQNVMNTDDVEHTFQSGCQGTVEKHFPRNLDN
          +R  Y++LRK+   EPFN+ +DL FLV P++S+++     +G+     +   GL+ S M I                          +   FP  L  
Subjt:  GSVRHCYYALRKRICNEPFNNPMDLNFLVGPSNSNYVVEEPMSGNCIPPISDDFGLQSSEMGILPCDFSQNVMNTDDVEHTFQSGCQGTVEKHFPRNLDN

Query:  GQEGISHSMRESLPPSAIDSHVEELAPSTGFPVHSLFENDLEARPSTFGQLSNDQRAMGSELEDNNVFNSPVSESGASFHNVEYSSPLPGMPIWRNASAP
        G                + +HV         P                              EDN   + P  E      N+ ++               
Subjt:  GQEGISHSMRESLPPSAIDSHVEELAPSTGFPVHSLFENDLEARPSTFGQLSNDQRAMGSELEDNNVFNSPVSESGASFHNVEYSSPLPGMPIWRNASAP

Query:  ALPIDVGFADKDIPTSNSFELPDDDGNKNIQNARVAGYDAYSDLKLKIEVEQDHLKSPNATAEVYLAELSNSLMNMSNEDELLFMDVDGKDALDKSYYDG
              G +  D+                           + D + K+E      K+  A+ + +LA+LS SL     ED   FM+VDGK+ +DKSYYDG
Subjt:  ALPIDVGFADKDIPTSNSFELPDDDGNKNIQNARVAGYDAYSDLKLKIEVEQDHLKSPNATAEVYLAELSNSLMNMSNEDELLFMDVDGKDALDKSYYDG

Query:  LSSLLLNSPNEINHDQTANAINAETVLPTDTMVDPPTACSGGLYEKGSHCGVGHLDCTSEAHSSPSASLNNQCPVKGDEPLFCTLNTEDPDIPSNDDVFL
        LSSLL+NS N+ N +            P  T  +P  A         +H G   LD         + +L+    + G   + C LN EDPDIP NDD+FL
Subjt:  LSSLLLNSPNEINHDQTANAINAETVLPTDTMVDPPTACSGGLYEKGSHCGVGHLDCTSEAHSSPSASLNNQCPVKGDEPLFCTLNTEDPDIPSNDDVFL

Query:  P----PLSTMATMGYNFQDCINTTFSSTKDFTYNEKSGETQNLGRERKNHGAGVNYRLHGFSERGEKHPVGGAGVNYRSSHSNARH---LPSVSNVGSIN
             P+S  +    NF+D  +   +  +D + +++  E  +L  ++K  G     RL G ++   +      G  +R+S S        P  S+     
Subjt:  P----PLSTMATMGYNFQDCINTTFSSTKDFTYNEKSGETQNLGRERKNHGAGVNYRLHGFSERGEKHPVGGAGVNYRSSHSNARH---LPSVSNVGSIN

Query:  GNSDAALPAVLKEENNEISRVNHLGENFL--NAHAEKPGFDSDNVR----IYPPSAACDIKQEPDILASLKDHRLSQEGGTRGTFGVEQGGLSSTSDQEE
         N+  +     K+   E +     G  F+  + H   P  DS+N +    + P + +   K   D L  +    L            E     + ++ E 
Subjt:  GNSDAALPAVLKEENNEISRVNHLGENFL--NAHAEKPGFDSDNVR----IYPPSAACDIKQEPDILASLKDHRLSQEGGTRGTFGVEQGGLSSTSDQEE

Query:  LSIDSEDDVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQHVDTKKRIIRLEQGANAYMQRSTASHGALAVLYGRYSKHYIKKSEVLLGRATEDVIVDI
           +S++D+P++SDIEAMILDMDL+P+DQD +   EV KYQ  D K+ IIRLEQ A++YMQR+ AS GA AVLYGRYSKHYIKK EVL+GR+TED+ VDI
Subjt:  LSIDSEDDVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQHVDTKKRIIRLEQGANAYMQRSTASHGALAVLYGRYSKHYIKKSEVLLGRATEDVIVDI

Query:  DLGREGSGNKISRRQAIIKLDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPTRMKQYVDNVGKIS
        DLGRE  G+KISRRQAII+L  DG F +KNLGK SIS+N K+V PG  L L S CL+EIRGMPFIFE+N + M++Y+   GK++
Subjt:  DLGREGSGNKISRRQAIIKLDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPTRMKQYVDNVGKIS

AT3G54350.2 Forkhead-associated (FHA) domain-containing protein1.7e-11235.63Show/hide
Query:  MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDFERSSSILPSKFNRFGNPKETKYIGGKRKS
        MGALA V PW PEDD+LLKNAVEAGASLESLAKGAVQFSRR+++RELQ+RWH+LLYDP+VS +A+  M + ER++   P+KF R G  KE K    KR +
Subjt:  MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDFERSSSILPSKFNRFGNPKETKYIGGKRKS

Query:  GSVRHCYYALRKRICNEPFNNPMDLNFLVGPSNSNYVVEEPMSGNCIPPISDDFGLQSSEMGILPCDFSQNVMNTDDVEHTFQSGCQGTVEKHFPRNLDN
          +R  Y++LRK+   EPFN+ +DL FLV P++S+++     +G+     +   GL+ S M I                          +   FP  L  
Subjt:  GSVRHCYYALRKRICNEPFNNPMDLNFLVGPSNSNYVVEEPMSGNCIPPISDDFGLQSSEMGILPCDFSQNVMNTDDVEHTFQSGCQGTVEKHFPRNLDN

Query:  GQEGISHSMRESLPPSAIDSHVEELAPSTGFPVHSLFENDLEARPSTFGQLSNDQRAMGSELEDNNVFNSPVSESGASFHNVEYSSPLPGMPIWRNASAP
        G                + +HV         P                              EDN   + P  E      N+ ++               
Subjt:  GQEGISHSMRESLPPSAIDSHVEELAPSTGFPVHSLFENDLEARPSTFGQLSNDQRAMGSELEDNNVFNSPVSESGASFHNVEYSSPLPGMPIWRNASAP

Query:  ALPIDVGFADKDIPTSNSFELPDDDGNKNIQNARVAGYDAYSDLKLKIEVEQDHLKSPNATAEVYLAELSNSLMNMSNEDELLFMDVDGKDALDKSYYDG
              G +  D+                           + D + K+E      K+  A+ + +LA+LS SL     ED   FM+VDGK+ +DKSYYDG
Subjt:  ALPIDVGFADKDIPTSNSFELPDDDGNKNIQNARVAGYDAYSDLKLKIEVEQDHLKSPNATAEVYLAELSNSLMNMSNEDELLFMDVDGKDALDKSYYDG

Query:  LSSLLLNSPNEINHDQTANAINAETVLPTDTMVDPPTACSGGLYEKGSHCGVGHLDCTSEAHSSPSASLNNQCPVKGDEPLFCTLNTEDPDIPSNDDVFL
        LSSLL+NS N+ N +            P  T  +P  A         +H G   LD         + +L+    + G   + C LN EDPDIP NDD+FL
Subjt:  LSSLLLNSPNEINHDQTANAINAETVLPTDTMVDPPTACSGGLYEKGSHCGVGHLDCTSEAHSSPSASLNNQCPVKGDEPLFCTLNTEDPDIPSNDDVFL

Query:  P----PLSTMATMGYNFQDCINTTFSSTKDFTYNEKSGETQNLGRERKNHGAGVNYRLHGFSERGEKHPVGGAGVNYRSSHSNARH---LPSVSNVGSIN
             P+S  +    NF+D  +   +  +D + +++  E  +L  ++K  G     RL G ++   +      G  +R+S S        P  S+     
Subjt:  P----PLSTMATMGYNFQDCINTTFSSTKDFTYNEKSGETQNLGRERKNHGAGVNYRLHGFSERGEKHPVGGAGVNYRSSHSNARH---LPSVSNVGSIN

Query:  GNSDAALPAVLKEENNEISRVNHLGENFL--NAHAEKPGFDSDNVR----IYPPSAACDIKQEPDILASLKDHRLSQEGGTRGTFGVEQGGLSSTSDQEE
         N+  +     K+   E +     G  F+  + H   P  DS+N +    + P + +   K   D L  +    L            E     + ++ E 
Subjt:  GNSDAALPAVLKEENNEISRVNHLGENFL--NAHAEKPGFDSDNVR----IYPPSAACDIKQEPDILASLKDHRLSQEGGTRGTFGVEQGGLSSTSDQEE

Query:  LSIDSEDDVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQHVDTKKRIIRLEQGANAYMQRSTASHGALAVLYGRYSKHYIKKSEVLLGRATEDVIVDI
           +S++D+P++SDIEAMILDMDL+P+DQD +   EV KYQ  D K+ IIRLEQ A++YMQR+ AS GA AVLYGRYSKHYIKK EVL+GR+TED+ VDI
Subjt:  LSIDSEDDVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQHVDTKKRIIRLEQGANAYMQRSTASHGALAVLYGRYSKHYIKKSEVLLGRATEDVIVDI

Query:  DLGREGSGNKISRRQAIIKLDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPTRMKQYVDNVGKIS
        DLGRE  G+KISRRQAII+L  DG F +KNLGK SIS+N K+V PG  L L S CL+EIRGMPFIFE+N + M++Y+   GK++
Subjt:  DLGREGSGNKISRRQAIIKLDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPTRMKQYVDNVGKIS

AT3G54350.3 Forkhead-associated (FHA) domain-containing protein1.7e-11235.63Show/hide
Query:  MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDFERSSSILPSKFNRFGNPKETKYIGGKRKS
        MGALA V PW PEDD+LLKNAVEAGASLESLAKGAVQFSRR+++RELQ+RWH+LLYDP+VS +A+  M + ER++   P+KF R G  KE K    KR +
Subjt:  MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDFERSSSILPSKFNRFGNPKETKYIGGKRKS

Query:  GSVRHCYYALRKRICNEPFNNPMDLNFLVGPSNSNYVVEEPMSGNCIPPISDDFGLQSSEMGILPCDFSQNVMNTDDVEHTFQSGCQGTVEKHFPRNLDN
          +R  Y++LRK+   EPFN+ +DL FLV P++S+++     +G+     +   GL+ S M I                          +   FP  L  
Subjt:  GSVRHCYYALRKRICNEPFNNPMDLNFLVGPSNSNYVVEEPMSGNCIPPISDDFGLQSSEMGILPCDFSQNVMNTDDVEHTFQSGCQGTVEKHFPRNLDN

Query:  GQEGISHSMRESLPPSAIDSHVEELAPSTGFPVHSLFENDLEARPSTFGQLSNDQRAMGSELEDNNVFNSPVSESGASFHNVEYSSPLPGMPIWRNASAP
        G                + +HV         P                              EDN   + P  E      N+ ++               
Subjt:  GQEGISHSMRESLPPSAIDSHVEELAPSTGFPVHSLFENDLEARPSTFGQLSNDQRAMGSELEDNNVFNSPVSESGASFHNVEYSSPLPGMPIWRNASAP

Query:  ALPIDVGFADKDIPTSNSFELPDDDGNKNIQNARVAGYDAYSDLKLKIEVEQDHLKSPNATAEVYLAELSNSLMNMSNEDELLFMDVDGKDALDKSYYDG
              G +  D+                           + D + K+E      K+  A+ + +LA+LS SL     ED   FM+VDGK+ +DKSYYDG
Subjt:  ALPIDVGFADKDIPTSNSFELPDDDGNKNIQNARVAGYDAYSDLKLKIEVEQDHLKSPNATAEVYLAELSNSLMNMSNEDELLFMDVDGKDALDKSYYDG

Query:  LSSLLLNSPNEINHDQTANAINAETVLPTDTMVDPPTACSGGLYEKGSHCGVGHLDCTSEAHSSPSASLNNQCPVKGDEPLFCTLNTEDPDIPSNDDVFL
        LSSLL+NS N+ N +            P  T  +P  A         +H G   LD         + +L+    + G   + C LN EDPDIP NDD+FL
Subjt:  LSSLLLNSPNEINHDQTANAINAETVLPTDTMVDPPTACSGGLYEKGSHCGVGHLDCTSEAHSSPSASLNNQCPVKGDEPLFCTLNTEDPDIPSNDDVFL

Query:  P----PLSTMATMGYNFQDCINTTFSSTKDFTYNEKSGETQNLGRERKNHGAGVNYRLHGFSERGEKHPVGGAGVNYRSSHSNARH---LPSVSNVGSIN
             P+S  +    NF+D  +   +  +D + +++  E  +L  ++K  G     RL G ++   +      G  +R+S S        P  S+     
Subjt:  P----PLSTMATMGYNFQDCINTTFSSTKDFTYNEKSGETQNLGRERKNHGAGVNYRLHGFSERGEKHPVGGAGVNYRSSHSNARH---LPSVSNVGSIN

Query:  GNSDAALPAVLKEENNEISRVNHLGENFL--NAHAEKPGFDSDNVR----IYPPSAACDIKQEPDILASLKDHRLSQEGGTRGTFGVEQGGLSSTSDQEE
         N+  +     K+   E +     G  F+  + H   P  DS+N +    + P + +   K   D L  +    L            E     + ++ E 
Subjt:  GNSDAALPAVLKEENNEISRVNHLGENFL--NAHAEKPGFDSDNVR----IYPPSAACDIKQEPDILASLKDHRLSQEGGTRGTFGVEQGGLSSTSDQEE

Query:  LSIDSEDDVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQHVDTKKRIIRLEQGANAYMQRSTASHGALAVLYGRYSKHYIKKSEVLLGRATEDVIVDI
           +S++D+P++SDIEAMILDMDL+P+DQD +   EV KYQ  D K+ IIRLEQ A++YMQR+ AS GA AVLYGRYSKHYIKK EVL+GR+TED+ VDI
Subjt:  LSIDSEDDVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQHVDTKKRIIRLEQGANAYMQRSTASHGALAVLYGRYSKHYIKKSEVLLGRATEDVIVDI

Query:  DLGREGSGNKISRRQAIIKLDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPTRMKQYVDNVGKIS
        DLGRE  G+KISRRQAII+L  DG F +KNLGK SIS+N K+V PG  L L S CL+EIRGMPFIFE+N + M++Y+   GK++
Subjt:  DLGREGSGNKISRRQAIIKLDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEIRGMPFIFESNPTRMKQYVDNVGKIS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGCTCTTGCCCCCGTCGCGCCTTGGACTCCTGAAGATGATATTCTCCTCAAGAACGCCGTTGAGGCAGGTGCTTCCTTGGAGTCCCTTGCCAAAGGTGCT
GTGCAGTTTTCTCGAAGATACACAGTAAGAGAATTGCAAGAACGATGGCATTCTTTGCTTTATGATCCAATTGTATCCGAAGATGCATCTATGTCCATGATTGAC
TTCGAGCGTTCTTCTTCCATTCTTCCGTCAAAGTTCAACAGATTTGGGAATCCAAAAGAAACCAAATATATTGGTGGGAAGAGAAAATCTGGGAGTGTACGCCAT
TGCTACTATGCTTTGCGTAAAAGAATTTGCAATGAACCATTTAATAACCCTATGGACCTGAATTTTCTTGTTGGACCCAGTAATAGTAACTATGTTGTTGAAGAG
CCTATGTCCGGAAATTGTATCCCTCCAATATCTGATGATTTTGGACTTCAGAGCTCAGAGATGGGGATCTTGCCATGTGATTTCTCTCAGAATGTGATGAACACT
GATGATGTGGAGCACACTTTTCAATCTGGATGTCAAGGTACAGTTGAAAAGCATTTTCCCAGGAACCTGGATAATGGACAGGAGGGAATTTCTCACAGTATGAGA
GAGAGTCTGCCTCCTTCTGCAATTGATTCTCATGTTGAGGAATTGGCTCCATCGACTGGCTTTCCAGTCCATAGTCTCTTTGAAAATGATTTGGAGGCAAGACCT
TCTACTTTTGGGCAACTGAGCAATGATCAGAGAGCGATGGGCTCTGAACTAGAGGATAACAATGTCTTTAACTCTCCTGTTTCTGAATCTGGTGCATCATTTCAC
AATGTTGAGTACTCATCTCCGCTTCCTGGTATGCCAATATGGAGAAATGCCTCAGCACCAGCCTTGCCAATTGATGTTGGCTTTGCAGATAAGGATATACCTACA
AGTAACTCTTTTGAACTACCTGATGATGATGGGAACAAAAACATTCAAAATGCAAGAGTAGCAGGCTATGATGCTTACTCTGACTTAAAGTTGAAGATTGAAGTT
GAGCAAGATCATTTGAAAAGTCCAAATGCCACCGCTGAAGTTTATCTTGCAGAATTGTCCAATTCTCTTATGAACATGAGCAATGAGGACGAGCTACTTTTCATG
GATGTTGATGGAAAGGATGCGCTTGATAAGTCATATTATGATGGTTTGAGCTCGCTTTTGTTGAATTCACCAAATGAAATCAATCATGATCAAACAGCTAATGCA
ATTAATGCAGAAACAGTGTTACCAACTGATACAATGGTAGATCCCCCCACAGCATGTTCTGGAGGGTTATATGAAAAAGGATCCCACTGTGGTGTTGGACATTTG
GATTGTACTTCTGAAGCTCATTCTTCGCCATCTGCATCTTTGAACAATCAGTGTCCTGTAAAAGGTGATGAACCTCTTTTTTGCACTTTGAACACAGAAGACCCA
GACATCCCGAGCAATGATGATGTTTTCCTACCTCCATTGTCAACAATGGCTACGATGGGATACAATTTTCAAGATTGCATCAATACTACCTTTTCATCTACCAAG
GATTTCACTTATAATGAAAAATCTGGTGAGACTCAAAACCTTGGGAGGGAGAGGAAAAATCATGGTGCTGGTGTTAATTATAGATTGCATGGTTTTTCTGAAAGA
GGTGAAAAGCATCCAGTTGGTGGAGCTGGTGTTAATTATAGATCATCCCATAGCAACGCCCGACACTTGCCATCTGTGAGTAATGTTGGCTCCATAAATGGAAAT
AGCGATGCTGCTCTTCCTGCTGTGCTCAAGGAAGAGAACAATGAAATTTCCCGGGTAAATCATCTTGGTGAGAATTTTTTGAATGCTCATGCAGAGAAGCCAGGC
TTTGATTCTGACAATGTTAGAATATATCCACCAAGTGCTGCCTGTGACATTAAACAGGAACCAGATATATTGGCTTCTTTGAAAGATCATCGTTTATCACAGGAA
GGGGGTACTAGAGGTACTTTTGGTGTTGAACAAGGTGGACTATCTTCGACATCTGATCAAGAAGAGTTATCTATTGACAGTGAAGATGATGTACCTCATTTTTCA
GATATTGAAGCAATGATACTTGATATGGACTTGGATCCAGAAGATCAGGATTTGTATTCAAGTGAAGAAGTCTTAAAATATCAACATGTGGACACAAAGAAGAGA
ATCATACGACTGGAGCAAGGGGCTAATGCTTACATGCAAAGATCTACTGCCTCTCATGGGGCATTAGCAGTTCTATATGGCCGATATTCGAAGCATTACATTAAG
AAATCAGAGGTTCTATTAGGTAGAGCAACTGAGGATGTCATTGTGGACATTGACTTGGGAAGGGAGGGAAGTGGTAACAAAATATCTCGGCGGCAGGCAATTATA
AAATTAGATCAGGATGGATTTTTCTCCCTGAAGAATCTTGGTAAATGCTCAATCTCTATAAATAACAAGGATGTAGCCCCTGGTCACTGCCTGCGACTTAATTCT
GGCTGCTTGATTGAGATAAGGGGAATGCCATTTATATTTGAGTCAAACCCAACTCGTATGAAGCAGTATGTGGATAATGTAGGCAAGATATCTCACAAACAGGAG
TATCAATCATGA
mRNA sequenceShow/hide mRNA sequence
CCTAAGGCCCATGAGCCCCATCCTCTTTCTTCCAGCTCCCTCTCACGTGTGTTCGAGCCGCCAAACACTCGCCCTCGTCTTCTTCATTTCCGGCCACAGCGCCTC
CTCTCTGACGCTATCGCTGCAGTCTCAGGCGTGTGAAGAACCTCGATAACTCTGACCAGCGTTTTTATTTCTTCACCGGCGAGCCTAAATTCACTAACCACCACG
TCATTTTGATTACACGCGGCGTAAGGCGATACCGTACGAAGTGATGCGACGGGCAGTGTGCTGCCACCACGAGCATACTGCTGCGATCGACACGATTTTCTGATG
GCTGTGTGTGCGACGGACCTCTTCACAACCAGCGATTGTGCGTTTTGACTGCCACGGTGGAACAACTTGCTGCCGGGTAGAACTAGTGGCGATTCTCATACCTTT
CGTCTTCCTTCTCTCGCTGTTACGGTTAGTTTCAACAGTATACGCTTTCTATAGAGGGTCTGGTAAGGAATTGTTGCTTTAGGGAGATGGGAGCTCTTGCCCCCG
TCGCGCCTTGGACTCCTGAAGATGATATTCTCCTCAAGAACGCCGTTGAGGCAGGTGCTTCCTTGGAGTCCCTTGCCAAAGGTGCTGTGCAGTTTTCTCGAAGAT
ACACAGTAAGAGAATTGCAAGAACGATGGCATTCTTTGCTTTATGATCCAATTGTATCCGAAGATGCATCTATGTCCATGATTGACTTCGAGCGTTCTTCTTCCA
TTCTTCCGTCAAAGTTCAACAGATTTGGGAATCCAAAAGAAACCAAATATATTGGTGGGAAGAGAAAATCTGGGAGTGTACGCCATTGCTACTATGCTTTGCGTA
AAAGAATTTGCAATGAACCATTTAATAACCCTATGGACCTGAATTTTCTTGTTGGACCCAGTAATAGTAACTATGTTGTTGAAGAGCCTATGTCCGGAAATTGTA
TCCCTCCAATATCTGATGATTTTGGACTTCAGAGCTCAGAGATGGGGATCTTGCCATGTGATTTCTCTCAGAATGTGATGAACACTGATGATGTGGAGCACACTT
TTCAATCTGGATGTCAAGGTACAGTTGAAAAGCATTTTCCCAGGAACCTGGATAATGGACAGGAGGGAATTTCTCACAGTATGAGAGAGAGTCTGCCTCCTTCTG
CAATTGATTCTCATGTTGAGGAATTGGCTCCATCGACTGGCTTTCCAGTCCATAGTCTCTTTGAAAATGATTTGGAGGCAAGACCTTCTACTTTTGGGCAACTGA
GCAATGATCAGAGAGCGATGGGCTCTGAACTAGAGGATAACAATGTCTTTAACTCTCCTGTTTCTGAATCTGGTGCATCATTTCACAATGTTGAGTACTCATCTC
CGCTTCCTGGTATGCCAATATGGAGAAATGCCTCAGCACCAGCCTTGCCAATTGATGTTGGCTTTGCAGATAAGGATATACCTACAAGTAACTCTTTTGAACTAC
CTGATGATGATGGGAACAAAAACATTCAAAATGCAAGAGTAGCAGGCTATGATGCTTACTCTGACTTAAAGTTGAAGATTGAAGTTGAGCAAGATCATTTGAAAA
GTCCAAATGCCACCGCTGAAGTTTATCTTGCAGAATTGTCCAATTCTCTTATGAACATGAGCAATGAGGACGAGCTACTTTTCATGGATGTTGATGGAAAGGATG
CGCTTGATAAGTCATATTATGATGGTTTGAGCTCGCTTTTGTTGAATTCACCAAATGAAATCAATCATGATCAAACAGCTAATGCAATTAATGCAGAAACAGTGT
TACCAACTGATACAATGGTAGATCCCCCCACAGCATGTTCTGGAGGGTTATATGAAAAAGGATCCCACTGTGGTGTTGGACATTTGGATTGTACTTCTGAAGCTC
ATTCTTCGCCATCTGCATCTTTGAACAATCAGTGTCCTGTAAAAGGTGATGAACCTCTTTTTTGCACTTTGAACACAGAAGACCCAGACATCCCGAGCAATGATG
ATGTTTTCCTACCTCCATTGTCAACAATGGCTACGATGGGATACAATTTTCAAGATTGCATCAATACTACCTTTTCATCTACCAAGGATTTCACTTATAATGAAA
AATCTGGTGAGACTCAAAACCTTGGGAGGGAGAGGAAAAATCATGGTGCTGGTGTTAATTATAGATTGCATGGTTTTTCTGAAAGAGGTGAAAAGCATCCAGTTG
GTGGAGCTGGTGTTAATTATAGATCATCCCATAGCAACGCCCGACACTTGCCATCTGTGAGTAATGTTGGCTCCATAAATGGAAATAGCGATGCTGCTCTTCCTG
CTGTGCTCAAGGAAGAGAACAATGAAATTTCCCGGGTAAATCATCTTGGTGAGAATTTTTTGAATGCTCATGCAGAGAAGCCAGGCTTTGATTCTGACAATGTTA
GAATATATCCACCAAGTGCTGCCTGTGACATTAAACAGGAACCAGATATATTGGCTTCTTTGAAAGATCATCGTTTATCACAGGAAGGGGGTACTAGAGGTACTT
TTGGTGTTGAACAAGGTGGACTATCTTCGACATCTGATCAAGAAGAGTTATCTATTGACAGTGAAGATGATGTACCTCATTTTTCAGATATTGAAGCAATGATAC
TTGATATGGACTTGGATCCAGAAGATCAGGATTTGTATTCAAGTGAAGAAGTCTTAAAATATCAACATGTGGACACAAAGAAGAGAATCATACGACTGGAGCAAG
GGGCTAATGCTTACATGCAAAGATCTACTGCCTCTCATGGGGCATTAGCAGTTCTATATGGCCGATATTCGAAGCATTACATTAAGAAATCAGAGGTTCTATTAG
GTAGAGCAACTGAGGATGTCATTGTGGACATTGACTTGGGAAGGGAGGGAAGTGGTAACAAAATATCTCGGCGGCAGGCAATTATAAAATTAGATCAGGATGGAT
TTTTCTCCCTGAAGAATCTTGGTAAATGCTCAATCTCTATAAATAACAAGGATGTAGCCCCTGGTCACTGCCTGCGACTTAATTCTGGCTGCTTGATTGAGATAA
GGGGAATGCCATTTATATTTGAGTCAAACCCAACTCGTATGAAGCAGTATGTGGATAATGTAGGCAAGATATCTCACAAACAGGAGTATCAATCATGATGATAGA
CGACACCCTAGTGGCATGAACAGAGAAACATACCTCGATTTATTCAAAGGTGCTCGTTTTTTCTGCACTTCATGGGTTCTTTAGGTTTCCTTGGACTGCTAAAGA
TCAGAAACAATAGCAACACTTTTGCGGTGATCGTTTCGACCAGTAACATCAGCAAGGAACTGTCCAACGAAAGTGCGAAGATATAGCCCTTTGGATATACGAGTG
AGTGGTTTCGGTAGGCCCTCCGGTTTCACCGTACTGTTCCCATCGTTGCACGTCGTAGCCCAGCAGCTAAAGTTGCTATTTTCTTGGATAGAGCTTGAGGTCAAA
TTGGTGGTAATTGATGTAGGAAGTCTTTTGAGACTTTCTTATATTTCTTGTGAAGTCTCTTTAGGCGTGGTTAAATGTTATTTAACCGCAGTCCAAGTTTAGATC
TTAGGATTTGACTATTTGGGTTTGTTATGGCTTCGTAAAGGCCTATAAATGTGTATTCGAAATAGCTCATTGCAAATCAAAATAGAAGTGTCTTTAGTGTTTGAC
ATGCAAAAACGAACGATAAAGCATGAATCGTTCGAGACAACCAACAAGAACACGAGGGAAGAATCAAAATGGTTCGTTTGGTCAAGAAAACTCCGAGTTTCATAT
GAAGATCGACCAGCCAAGCTTTAATGAAAGGATGAATGTTGGCAAGAGTTTGTTCGAATGTGTTTGAAAGATCAAACTCGTAGCTTTCAAGCTTCAAAACAATCA
GGAATTGGCCAAAAATGCTTCGGTAATGTGTTAAAAGATCGAACTCGACCTGCCAAGCTTTATTGTATTTTTTTTTTTTTAATTTTGTGTTTAATAAAACATTGG
CTAATCAA
Protein sequenceShow/hide protein sequence
MGALAPVAPWTPEDDILLKNAVEAGASLESLAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDFERSSSILPSKFNRFGNPKETKYIGGKRKSGSVRH
CYYALRKRICNEPFNNPMDLNFLVGPSNSNYVVEEPMSGNCIPPISDDFGLQSSEMGILPCDFSQNVMNTDDVEHTFQSGCQGTVEKHFPRNLDNGQEGISHSMR
ESLPPSAIDSHVEELAPSTGFPVHSLFENDLEARPSTFGQLSNDQRAMGSELEDNNVFNSPVSESGASFHNVEYSSPLPGMPIWRNASAPALPIDVGFADKDIPT
SNSFELPDDDGNKNIQNARVAGYDAYSDLKLKIEVEQDHLKSPNATAEVYLAELSNSLMNMSNEDELLFMDVDGKDALDKSYYDGLSSLLLNSPNEINHDQTANA
INAETVLPTDTMVDPPTACSGGLYEKGSHCGVGHLDCTSEAHSSPSASLNNQCPVKGDEPLFCTLNTEDPDIPSNDDVFLPPLSTMATMGYNFQDCINTTFSSTK
DFTYNEKSGETQNLGRERKNHGAGVNYRLHGFSERGEKHPVGGAGVNYRSSHSNARHLPSVSNVGSINGNSDAALPAVLKEENNEISRVNHLGENFLNAHAEKPG
FDSDNVRIYPPSAACDIKQEPDILASLKDHRLSQEGGTRGTFGVEQGGLSSTSDQEELSIDSEDDVPHFSDIEAMILDMDLDPEDQDLYSSEEVLKYQHVDTKKR
IIRLEQGANAYMQRSTASHGALAVLYGRYSKHYIKKSEVLLGRATEDVIVDIDLGREGSGNKISRRQAIIKLDQDGFFSLKNLGKCSISINNKDVAPGHCLRLNS
GCLIEIRGMPFIFESNPTRMKQYVDNVGKISHKQEYQS