; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0015194 (gene) of Snake gourd v1 genome

Gene IDTan0015194
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionFHA domain-containing protein
Genome locationLG01:115598089..115605262
RNA-Seq ExpressionTan0015194
SyntenyTan0015194
Gene Ontology termsGO:0031011 - Ino80 complex (cellular component)
GO:0071339 - MLL1 complex (cellular component)
GO:0002151 - G-quadruplex RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6608568.1 Microspherule protein 1, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0075.93Show/hide
Query:  MGALAPIAPWNPEDDILLKNYVEAGASLDALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDCTR---------------------------
        MGALAPIAPW+PEDDILLKN VEAGASL+ALAKGAVQFSRR+TVRELQERWHSLLYDPIVSEDASMSMID  R                           
Subjt:  MGALAPIAPWNPEDDILLKNYVEAGASLDALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDCTR---------------------------

Query:  -----CYYALRKRICNEPFNPIDLSYLVGDSDYVGEESMSGNCIPPISDDFGFQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNGHE
             CYYALRKRICNEPFNP+  SYLVGDSDYV EE MSGNCIPP SDDFG QSSELGI+PCNFSQN MNND TEHTFHSGCQHTVE HFPQNLDNGHE
Subjt:  -----CYYALRKRICNEPFNPIDLSYLVGDSDYVGEESMSGNCIPPISDDFGFQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNGHE

Query:  RISRIMVDNLPFSANESKAKELAPSTL------------SQPSTFGQLSNDQRAMGSESEDSDVFNSPVSDSGASFHNVEYSSPLPGMPIFRNASPASPA
         IS IMVDNLPF ANES AKELAPS              + PSTFGQLS DQR MGSE ED+DVFNSPVSDSGASFHNVEYSSPLPGMPI+RN   A  A
Subjt:  RISRIMVDNLPFSANESKAKELAPSTL------------SQPSTFGQLSNDQRAMGSESEDSDVFNSPVSDSGASFHNVEYSSPLPGMPIFRNASPASPA

Query:  LPIDVGFPEKDIPTGDSFELPDDDGNNNIQNARLAGYDAHSNSKLRIEVQHDHLKSPNATAEGCYLAE----------LDVIDNSYYDGLSLLLSSPNEV
        LPIDVGF +KD+PTGDSFELPDDDGNNNIQNARLAGY+AHSNSKL+IEVQHDHLKSPNATAEGCYL E           D +     DGLSLLLSSPNEV
Subjt:  LPIDVGFPEKDIPTGDSFELPDDDGNNNIQNARLAGYDAHSNSKLRIEVQHDHLKSPNATAEGCYLAE----------LDVIDNSYYDGLSLLLSSPNEV

Query:  NHDQTTNAINAETVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCS-------------------DEPLFCTLNTEDPEISSNDDVFLPPLSTTATMGYH
        NHDQT NAIN+E VLPTDTMLDPPSACSGELYEKGSHCSDGHLDCS                   DEPLFCTLNTEDPEI SNDDVFLPPLST ATMGYH
Subjt:  NHDQTTNAINAETVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCS-------------------DEPLFCTLNTEDPEISSNDDVFLPPLSTTATMGYH

Query:  FQDCMDPTFSSIKDFSCKEKSGETIQNLVQRERKNH-GQPRVSSLSIGLRGLPERGEKHLIGGAAVNLKLSHSNIIHVPSANKTSSRNVNNDAILPVALK
        F DCM PTFSSI DFSCKE SGE  QNLVQRERKNH GQPRVS++S+GL  L ERGEKHL+ GA VNLKLSHSN IHVPSANKTSS NVN+D ILPV L 
Subjt:  FQDCMDPTFSSIKDFSCKEKSGETIQNLVQRERKNH-GQPRVSSLSIGLRGLPERGEKHLIGGAAVNLKLSHSNIIHVPSANKTSSRNVNNDAILPVALK

Query:  EENNEISQVNHLGQNFLNTHVEKPDFDSNNVRRYPPNTLTTVKDRRLSQEVGARAVFGVEQDGISSTSDQEDLSIDSEDDLPHFSDIEAMILDMDLDPED
        EENNEIS                            PNT T+VKD RLS+EVG + VFGVEQDG+ STSD E+L IDSEDDLPHFSDIEAMILDMDLDPED
Subjt:  EENNEISQVNHLGQNFLNTHVEKPDFDSNNVRRYPPNTLTTVKDRRLSQEVGARAVFGVEQDGISSTSDQEDLSIDSEDDLPHFSDIEAMILDMDLDPED

Query:  QDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSIASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVDIDLGREGSGNKISRWQAIIKIDEDGFFSL
        QDL+ SEEVLKYQH+DT+KRIIRLEQG NA MQRSIASHGALAVL GRHSKHYIKKSEVLLGRATAEFIVDIDLG EGSGNKISR QAIIKID+DGFFSL
Subjt:  QDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSIASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVDIDLGREGSGNKISRWQAIIKIDEDGFFSL

Query:  KNLGKCSISINNKDVAPGHCLRLNSGCLIEV
        KNLGKCSISINNKDVAPGHCLRLNSGCLIE+
Subjt:  KNLGKCSISINNKDVAPGHCLRLNSGCLIEV

KAG7037887.1 Microspherule protein 1 [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0076.05Show/hide
Query:  MGALAPIAPWNPEDDILLKNYVEAGASLDALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDCTR---------------------------
        MGALAPIAPW+PEDDILLKN VEAGASL+ALAKGAVQFSRR+TVRELQERWHSLLYDPIVSEDASMSMID  R                           
Subjt:  MGALAPIAPWNPEDDILLKNYVEAGASLDALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDCTR---------------------------

Query:  -----CYYALRKRICNEPFNPIDLSYLVGDSDYVGEESMSGNCIPPISDDFGFQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNGHE
             CYYALRKRICNEPFNP+  SYLVGDSDYV EE MSGNCIPP SDDFG QSSELGI+PCNFSQN MNND TEHTFHSGCQHTVE HFPQNLDNGHE
Subjt:  -----CYYALRKRICNEPFNPIDLSYLVGDSDYVGEESMSGNCIPPISDDFGFQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNGHE

Query:  RISRIMVDNLPFSANESKAKELAPSTL------------SQPSTFGQLSNDQRAMGSESEDSDVFNSPVSDSGASFHNVEYSSPLPGMPIFRNASPASPA
         IS IMVDNLPF ANES AKELAPS              + PSTFGQLS DQR MGSE ED+DVFNSPVSDSGASFHNVEYSSPLPGMPI+RN   A  A
Subjt:  RISRIMVDNLPFSANESKAKELAPSTL------------SQPSTFGQLSNDQRAMGSESEDSDVFNSPVSDSGASFHNVEYSSPLPGMPIFRNASPASPA

Query:  LPIDVGFPEKDIPTGDSFELPDDDGNNNIQNARLAGYDAHSNSKLRIEVQHDHLKSPNATAEGCYLAE----------LDVIDNSYYDGLSLLLSSPNEV
        LPIDVGF +KD+PTGDSFELPDDDGNNNIQNARLAGY+AHSNSKL+IEVQHDHLKSPNATAEGCYL E           D +     DGLSLLLSSPNEV
Subjt:  LPIDVGFPEKDIPTGDSFELPDDDGNNNIQNARLAGYDAHSNSKLRIEVQHDHLKSPNATAEGCYLAE----------LDVIDNSYYDGLSLLLSSPNEV

Query:  NHDQTTNAINAETVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCS-------------------DEPLFCTLNTEDPEISSNDDVFLPPLSTTATMGYH
        NHDQT NAIN+E VLPTDTMLDPPSACSGELYEKGSHCSDGHLDCS                   DEPLFCTLNTEDPEI SNDDVFLPPLST ATMGYH
Subjt:  NHDQTTNAINAETVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCS-------------------DEPLFCTLNTEDPEISSNDDVFLPPLSTTATMGYH

Query:  FQDCMDPTFSSIKDFSCKEKSGETIQNLVQRERKNH-GQPRVSSLSIGLRGLPERGEKHLIGGAAVNLKLSHSNIIHVPSANKTSSRNVNNDAILPVALK
        F DCM PTFSSI DFSCKE SGE  QNLVQRERKNH GQPRVS++S+GL  L ERGEKHLI GA VNLKLSHSN IHVPSANKTSS NVN+D ILPV L 
Subjt:  FQDCMDPTFSSIKDFSCKEKSGETIQNLVQRERKNH-GQPRVSSLSIGLRGLPERGEKHLIGGAAVNLKLSHSNIIHVPSANKTSSRNVNNDAILPVALK

Query:  EENNEISQVNHLGQNFLNTHVEKPDFDSNNVRRYPPNTLTTVKDRRLSQEVGARAVFGVEQDGISSTSDQEDLSIDSEDDLPHFSDIEAMILDMDLDPED
        EENNEIS                            PNT T+VKD RLS+EVG + VFGVEQDG+ STSD E+L IDSEDDLPHFSDIEAMILDMDLDPED
Subjt:  EENNEISQVNHLGQNFLNTHVEKPDFDSNNVRRYPPNTLTTVKDRRLSQEVGARAVFGVEQDGISSTSDQEDLSIDSEDDLPHFSDIEAMILDMDLDPED

Query:  QDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSIASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVDIDLGREGSGNKISRWQAIIKIDEDGFFSL
        QDL+ SEEVLKYQH+DT+KRIIRLEQG NA MQRSIASHGALAVL GRHSKHYIKKSEVLLGRATAEFIVDIDLG EGSGNKISR QAIIKID+DGFFSL
Subjt:  QDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSIASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVDIDLGREGSGNKISRWQAIIKIDEDGFFSL

Query:  KNLGKCSISINNKDVAPGHCLRLNSGCLIEV
        KNLGKCSISINNKDVAPGHCLRLNSGCLIE+
Subjt:  KNLGKCSISINNKDVAPGHCLRLNSGCLIEV

XP_022135664.1 uncharacterized protein LOC111007538 isoform X1 [Momordica charantia]0.0e+0073.72Show/hide
Query:  MGALAPIAPWNPEDDILLKNYVEAGASLDALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMID------------------------------
        MGALAP+APW PEDDILLKN +EAGASL++LAKGAVQFSRRYTVRELQERWHSLLYDPIVSE+ASMSMID                              
Subjt:  MGALAPIAPWNPEDDILLKNYVEAGASLDALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMID------------------------------

Query:  --CTRCYYALRKRICNEPFNPIDLSYLVG--DSDYVGEESMSGNCIPPISDDFGFQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNG
            RCYYALRKRICNEPFNP+DLS+LVG  DS+YV EE MSG+CIPPIS DFG Q SELGI+P NF+ N+MNND TE TFHS CQHTVEKHFP NLDN 
Subjt:  --CTRCYYALRKRICNEPFNPIDLSYLVG--DSDYVGEESMSGNCIPPISDDFGFQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNG

Query:  HERISRIMVDNLPFSANESKAKELAPSTL------------SQPSTFGQLSNDQRAMGSESEDSDVFNSPVSDSGASFHNVEYSSPLPGMPIFRNASPAS
        HE I  IM +NLP S NES+ +ELAPS               +PSTFGQ S DQRAMGSE ED++VFNSPVSDSGASFHNVEYSSPLPGMPI+RNAS  +
Subjt:  HERISRIMVDNLPFSANESKAKELAPSTL------------SQPSTFGQLSNDQRAMGSESEDSDVFNSPVSDSGASFHNVEYSSPLPGMPIFRNASPAS

Query:  PALPIDVGFPEKDIPTGDSFELPDDDGNNNIQNARLAGYDAHSNSKLRIEVQHDHLKSPNATAEGCYLAEL---------------------DVIDNSYY
        PALPIDVGF +KD+PTGDSFELPDDDGNNNIQNAR+A YDA S+SKL+IEVQHDHLKSPNATAE  YLAEL                     DVID SYY
Subjt:  PALPIDVGFPEKDIPTGDSFELPDDDGNNNIQNARLAGYDAHSNSKLRIEVQHDHLKSPNATAEGCYLAEL---------------------DVIDNSYY

Query:  DGL-SLLLSSPNEVNHDQTTNAINAETVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCS-------------------DEPLFCTLNTEDPEISSNDDV
        DGL SLLL+SPNEVNHDQT +A+N ET+LPTD+M+DPP+ACSGELYEKGSHCSDGHLDCS                   DEPLFCTLNTEDPEI SNDDV
Subjt:  DGL-SLLLSSPNEVNHDQTTNAINAETVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCS-------------------DEPLFCTLNTEDPEISSNDDV

Query:  FLPPLSTTATMGYHFQDCMDPTFSSIKDFSCKEKSGETIQNLVQRERKNHGQPRVSSLSIGLRGLPERGEKHLIGGAAVNLKLSHSNIIHVPSANK----
        FLPPLST ++MGYHFQD +D TFSSIKDFSC EKSGE  QNLVQRERKNHGQP VSSLSIGL GLPERGEKHL+GGAAVNLKL HSN IHVPSAN     
Subjt:  FLPPLSTTATMGYHFQDCMDPTFSSIKDFSCKEKSGETIQNLVQRERKNHGQPRVSSLSIGLRGLPERGEKHLIGGAAVNLKLSHSNIIHVPSANK----

Query:  --TSSRNVNNDAILPVALKEENNEISQVNHLGQNFLNTHVEKPDFDSNNVRRYPPNT----------LTTVKDRRLSQEVGARAVFGVEQDGISSTSDQE
          +SS N N DAILPV LKEE+ EIS+VNHLGQNFLNTHVEKP FDS N R+YPP+T          LT VKD RLSQE G+R VFGVEQDGISSTSDQE
Subjt:  --TSSRNVNNDAILPVALKEENNEISQVNHLGQNFLNTHVEKPDFDSNNVRRYPPNT----------LTTVKDRRLSQEVGARAVFGVEQDGISSTSDQE

Query:  DLSIDSEDDLPHFSDIEAMILDMDLDPEDQDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSIASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVD
        +LSIDSEDD+PHFSDIEAMILDMDLDPEDQDL++SEEVL+YQHMDTKKRI+RLEQGA+A M+RS+ASHGALAVLYGR+SKHYIKKSEVLLGRAT + IVD
Subjt:  DLSIDSEDDLPHFSDIEAMILDMDLDPEDQDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSIASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVD

Query:  IDLGREGSGNKISRWQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEV
        IDLGREGSGNKISR QAIIKID+DGFFSLKNLGKCSISINNK+VAPGHCLRLNSGCLIE+
Subjt:  IDLGREGSGNKISRWQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEV

XP_022940026.1 uncharacterized protein LOC111445785 [Cucurbita moschata]0.0e+0075.81Show/hide
Query:  MGALAPIAPWNPEDDILLKNYVEAGASLDALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDCTR---------------------------
        MGALAPIAPW+PEDDILLKN VEAGASL+ALAKGAVQFSRR+TVRELQERWHSLLYDPIVSEDASMSMID  R                           
Subjt:  MGALAPIAPWNPEDDILLKNYVEAGASLDALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDCTR---------------------------

Query:  -----CYYALRKRICNEPFNPIDLSYLVGDSDYVGEESMSGNCIPPISDDFGFQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNGHE
             CYYALRKRICNEPFNP+  SYLVGDSDYV EE MSGNCIPP SDDFG QSSELGI+PCNFSQN MNND TEHTFHSGCQHTVE HFPQNLDNGHE
Subjt:  -----CYYALRKRICNEPFNPIDLSYLVGDSDYVGEESMSGNCIPPISDDFGFQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNGHE

Query:  RISRIMVDNLPFSANESKAKELAPSTL------------SQPSTFGQLSNDQRAMGSESEDSDVFNSPVSDSGASFHNVEYSSPLPGMPIFRNASPASPA
         IS IMVDNLPF ANES AKELAPS              + PSTFGQLS DQR MGSE ED+DVFNSPVSDSGASFHNVEYSSPLPGMPI+RN   A  A
Subjt:  RISRIMVDNLPFSANESKAKELAPSTL------------SQPSTFGQLSNDQRAMGSESEDSDVFNSPVSDSGASFHNVEYSSPLPGMPIFRNASPASPA

Query:  LPIDVGFPEKDIPTGDSFELPDDDGNNNIQNARLAGYDAHSNSKLRIEVQHDHLKSPNATAEGCYLAE----------LDVIDNSYYDGLSLLLSSPNEV
        LPIDVGF +KD+PTGDSFELPDDDGNNNIQNARLAGY+AHSNSKL+IEVQHDHLKSPNATAEGCYL E           D +     DGLSLLLSSPNEV
Subjt:  LPIDVGFPEKDIPTGDSFELPDDDGNNNIQNARLAGYDAHSNSKLRIEVQHDHLKSPNATAEGCYLAE----------LDVIDNSYYDGLSLLLSSPNEV

Query:  NHDQTTNAINAETVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCS-------------------DEPLFCTLNTEDPEISSNDDVFLPPLSTTATMGYH
        NHDQT NAIN+E VLPTDTMLDPPSACSGELYEKGSHCSDGHLDCS                   DEPLFCTLNTEDPEI SNDDVFLPPLST ATMGYH
Subjt:  NHDQTTNAINAETVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCS-------------------DEPLFCTLNTEDPEISSNDDVFLPPLSTTATMGYH

Query:  FQDCMDPTFSSIKDFSCKEKSGETIQNLVQRERKNH-GQPRVSSLSIGLRGLPERGEKHLIGGAAVNLKLSHSNIIHVPSANKTSSRNVNNDAILPVALK
        F DCM PTFSSI DFSCKE SGE  QNLVQRERKNH GQPRVS++S+GL  L ERGEKHLI GA VNLKLSHSN IHVPSANKTSS NVN+D ILPV L 
Subjt:  FQDCMDPTFSSIKDFSCKEKSGETIQNLVQRERKNH-GQPRVSSLSIGLRGLPERGEKHLIGGAAVNLKLSHSNIIHVPSANKTSSRNVNNDAILPVALK

Query:  EENNEISQVNHLGQNFLNTHVEKPDFDSNNVRRYPPNTLTTVKDRRLSQEVGARAVFGVEQDGISSTSDQEDLSIDSEDDLPHFSDIEAMILDMDLDPED
        EENNEIS                            PNT T+VKD RLS+EVG + VFGVEQDG+ STSD E+L IDSEDDLPHFSDIEAMILDMDLDPED
Subjt:  EENNEISQVNHLGQNFLNTHVEKPDFDSNNVRRYPPNTLTTVKDRRLSQEVGARAVFGVEQDGISSTSDQEDLSIDSEDDLPHFSDIEAMILDMDLDPED

Query:  QDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSIASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVDIDLGREGSGNKISRWQAIIKIDEDGFFSL
        QDL+ SEEVLKYQH+DT+KRIIRLEQG NA MQRSIASHGALAVL G+HSKHYIKKSEVLLGRATAEFIVDIDLG EGSGNKISR QAIIKID+DG FSL
Subjt:  QDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSIASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVDIDLGREGSGNKISRWQAIIKIDEDGFFSL

Query:  KNLGKCSISINNKDVAPGHCLRLNSGCLIEV
        KNLGKCSISINNKDVAPGHCLRLNSGCLIE+
Subjt:  KNLGKCSISINNKDVAPGHCLRLNSGCLIEV

XP_022981262.1 uncharacterized protein LOC111480451 [Cucurbita maxima]0.0e+0075.69Show/hide
Query:  MGALAPIAPWNPEDDILLKNYVEAGASLDALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDCTR---------------------------
        MGALAPIAPW+PEDDILLKN VEA ASL+ALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMID  R                           
Subjt:  MGALAPIAPWNPEDDILLKNYVEAGASLDALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDCTR---------------------------

Query:  -----CYYALRKRICNEPFNPIDLSYLVGDSDYVGEESMSGNCIPPISDDFGFQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNGHE
             CYYALRKRICNEPFNP+  SYLVGDSDYV EE MSGNCIPP SDDFG QSSELG +PCNFSQN MNND TEHTFHSGCQHTVE HFPQNLDNGHE
Subjt:  -----CYYALRKRICNEPFNPIDLSYLVGDSDYVGEESMSGNCIPPISDDFGFQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNGHE

Query:  RISRIMVDNLPFSANESKAKELAPSTL------------SQPSTFGQLSNDQRAMGSESEDSDVFNSPVSDSGASFHNVEYSSPLPGMPIFRNASPASPA
         IS +MVDNLPF ANES AKELAPS              + PSTFGQLS DQR MGSE ED+DVFNSPVSDSGASFHNVEYSSPLPGMPI+R+   AS A
Subjt:  RISRIMVDNLPFSANESKAKELAPSTL------------SQPSTFGQLSNDQRAMGSESEDSDVFNSPVSDSGASFHNVEYSSPLPGMPIFRNASPASPA

Query:  LPIDVGFPEKDIPTGDSFELPDDDGNNNIQNARLAGYDAHSNSKLRIEVQHDHLKSPNATAEGCYLAELD--VIDNSY--------YDGLSLLLSSPNEV
        LPIDVGF +KD+PTGDSFELPDDDGNNNIQNARLAGY+AHSNSKL+IEVQHDHLKSPNATAEGCYL EL   V++ S          DGLSLLLSSPNEV
Subjt:  LPIDVGFPEKDIPTGDSFELPDDDGNNNIQNARLAGYDAHSNSKLRIEVQHDHLKSPNATAEGCYLAELD--VIDNSY--------YDGLSLLLSSPNEV

Query:  NHDQTTNAINAETVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCS-------------------DEPLFCTLNTEDPEISSNDDVFLPPLSTTATMGYH
        NHDQT NAI +ETVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCS                   DEPLFCTLNTEDPEI SNDDVFLPPLST ATMGYH
Subjt:  NHDQTTNAINAETVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCS-------------------DEPLFCTLNTEDPEISSNDDVFLPPLSTTATMGYH

Query:  FQDCMDPTFSSIKDFSCKEKSGETIQNLVQRERKNH-GQPRVSSLSIGLRGLPERGEKHLIGGAAVNLKLSHSNIIHVPSANKTSSRNVNNDAILPVALK
        F DCM PTFSS  DFSCKE SGE  QNLVQRERKNH GQP VS++S+GL  LPERGEKHLI G  VNLKLSHSN IHVPSANKTSS NVN+D ILPV L 
Subjt:  FQDCMDPTFSSIKDFSCKEKSGETIQNLVQRERKNH-GQPRVSSLSIGLRGLPERGEKHLIGGAAVNLKLSHSNIIHVPSANKTSSRNVNNDAILPVALK

Query:  EENNEISQVNHLGQNFLNTHVEKPDFDSNNVRRYPPNTLTTVKDRRLSQEVGARAVFGVEQDGISSTSDQEDLSIDSEDDLPHFSDIEAMILDMDLDPED
        EENNEIS                            PNT T+VKD RLS+EVG + VFGVEQDG+ STSD E+L IDSEDDLPHFSDIEAMILDMDLDPED
Subjt:  EENNEISQVNHLGQNFLNTHVEKPDFDSNNVRRYPPNTLTTVKDRRLSQEVGARAVFGVEQDGISSTSDQEDLSIDSEDDLPHFSDIEAMILDMDLDPED

Query:  QDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSIASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVDIDLGREGSGNKISRWQAIIKIDEDGFFSL
        QDL+ +EEVLKYQH+DT+KRIIRLEQG NA MQRSIASHGALAVL GRHSKHYIKKSEVLLGRATAEFIVDIDLG EGSGNKISR QAIIKID+DGFFSL
Subjt:  QDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSIASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVDIDLGREGSGNKISRWQAIIKIDEDGFFSL

Query:  KNLGKCSISINNKDVAPGHCLRLNSGCLIEV
        KNLGKCSISINNKDVAPGHCLRLNSGCLIE+
Subjt:  KNLGKCSISINNKDVAPGHCLRLNSGCLIEV

TrEMBL top hitse value%identityAlignment
A0A6J1C2Q8 uncharacterized protein LOC111007538 isoform X20.0e+0071.76Show/hide
Query:  MGALAPIAPWNPEDDILLKNYVEAGASLDALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMID------------------------------
        MGALAP+APW PEDDILLKN +EAGASL++LAKGAVQFSRRYTVRELQERWHSLLYDPIVSE+ASMSMID                              
Subjt:  MGALAPIAPWNPEDDILLKNYVEAGASLDALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMID------------------------------

Query:  --CTRCYYALRKRICNEPFNPIDLSYLVG--DSDYVGEESMSGNCIPPISDDFGFQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNG
            RCYYALRKRICNEPFNP+DLS+LVG  DS+YV EE MSG+CIPPIS DFG Q SELGI+P NF+ N+MNND TE TFHS CQHTVEKHFP NLDN 
Subjt:  --CTRCYYALRKRICNEPFNPIDLSYLVG--DSDYVGEESMSGNCIPPISDDFGFQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNG

Query:  HERISRIMVDNLPFSANESKAKELAPSTL------------SQPSTFGQLSNDQRAMGSESEDSDVFNSPVSDSGASFHNVEYSSPLPGMPIFRNASPAS
        HE I  IM +NLP S NES+ +ELAPS               +PSTFGQ S DQRAMGSE ED++VFNSPVSDSGASFHNVEYSSPLPGMPI+RNAS  +
Subjt:  HERISRIMVDNLPFSANESKAKELAPSTL------------SQPSTFGQLSNDQRAMGSESEDSDVFNSPVSDSGASFHNVEYSSPLPGMPIFRNASPAS

Query:  PALPIDVGFPEKDIPTGDSFELPDDDGNNNIQNARLAGYDAHSNSKLRIEVQHDHLKSPNATAEGCYLAEL---------------------DVIDNSYY
        PALPIDVGF +KD+PTGDSFELPDDDGNNNIQNAR+A YDA S+SKL+IEVQHDHLKSPNATAE  YLAEL                     DVID SYY
Subjt:  PALPIDVGFPEKDIPTGDSFELPDDDGNNNIQNARLAGYDAHSNSKLRIEVQHDHLKSPNATAEGCYLAEL---------------------DVIDNSYY

Query:  DGL-SLLLSSPNEVNHDQTTNAINAETVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCS-------------------DEPLFCTLNTEDPEISSNDDV
        DGL SLLL+SPNEVNHDQT +A+N ET+LPTD+M+DPP+ACSGELYEKGSHCSDGHLDCS                   DEPLFCTLNTEDPEI SNDDV
Subjt:  DGL-SLLLSSPNEVNHDQTTNAINAETVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCS-------------------DEPLFCTLNTEDPEISSNDDV

Query:  FLPPLSTTATMGYHFQDCMDPTFSSIKDFSCKEKSGETIQNLVQRERKNHGQPRVSSLSIGLRGLPERGEKHLIGGAAVNLKLSHSNIIHVPSANK----
        FLPPLST ++MGYHFQD +D TFSSIKDFSC EKSGE  QNLVQRERKNHGQP VSSLSIGL GLPERGEKHL+GGAAVNLKL HSN IHVPSAN     
Subjt:  FLPPLSTTATMGYHFQDCMDPTFSSIKDFSCKEKSGETIQNLVQRERKNHGQPRVSSLSIGLRGLPERGEKHLIGGAAVNLKLSHSNIIHVPSANK----

Query:  --TSSRNVNNDAILPVALKEENNEISQVNHLGQNFLNTHVEKPDFDSNNVRRYPPNTLTTVKDRRLSQEVGARAVFGVEQDGISSTSDQEDLSIDSEDDL
          +SS N N DAILPV LKEE+ EIS+                           P+ LT VKD RLSQE G+R VFGVEQDGISSTSDQE+LSIDSEDD+
Subjt:  --TSSRNVNNDAILPVALKEENNEISQVNHLGQNFLNTHVEKPDFDSNNVRRYPPNTLTTVKDRRLSQEVGARAVFGVEQDGISSTSDQEDLSIDSEDDL

Query:  PHFSDIEAMILDMDLDPEDQDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSIASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVDIDLGREGSGN
        PHFSDIEAMILDMDLDPEDQDL++SEEVL+YQHMDTKKRI+RLEQGA+A M+RS+ASHGALAVLYGR+SKHYIKKSEVLLGRAT + IVDIDLGREGSGN
Subjt:  PHFSDIEAMILDMDLDPEDQDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSIASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVDIDLGREGSGN

Query:  KISRWQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEV
        KISR QAIIKID+DGFFSLKNLGKCSISINNK+VAPGHCLRLNSGCLIE+
Subjt:  KISRWQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEV

A0A6J1C5Q1 uncharacterized protein LOC111007538 isoform X10.0e+0073.72Show/hide
Query:  MGALAPIAPWNPEDDILLKNYVEAGASLDALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMID------------------------------
        MGALAP+APW PEDDILLKN +EAGASL++LAKGAVQFSRRYTVRELQERWHSLLYDPIVSE+ASMSMID                              
Subjt:  MGALAPIAPWNPEDDILLKNYVEAGASLDALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMID------------------------------

Query:  --CTRCYYALRKRICNEPFNPIDLSYLVG--DSDYVGEESMSGNCIPPISDDFGFQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNG
            RCYYALRKRICNEPFNP+DLS+LVG  DS+YV EE MSG+CIPPIS DFG Q SELGI+P NF+ N+MNND TE TFHS CQHTVEKHFP NLDN 
Subjt:  --CTRCYYALRKRICNEPFNPIDLSYLVG--DSDYVGEESMSGNCIPPISDDFGFQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNG

Query:  HERISRIMVDNLPFSANESKAKELAPSTL------------SQPSTFGQLSNDQRAMGSESEDSDVFNSPVSDSGASFHNVEYSSPLPGMPIFRNASPAS
        HE I  IM +NLP S NES+ +ELAPS               +PSTFGQ S DQRAMGSE ED++VFNSPVSDSGASFHNVEYSSPLPGMPI+RNAS  +
Subjt:  HERISRIMVDNLPFSANESKAKELAPSTL------------SQPSTFGQLSNDQRAMGSESEDSDVFNSPVSDSGASFHNVEYSSPLPGMPIFRNASPAS

Query:  PALPIDVGFPEKDIPTGDSFELPDDDGNNNIQNARLAGYDAHSNSKLRIEVQHDHLKSPNATAEGCYLAEL---------------------DVIDNSYY
        PALPIDVGF +KD+PTGDSFELPDDDGNNNIQNAR+A YDA S+SKL+IEVQHDHLKSPNATAE  YLAEL                     DVID SYY
Subjt:  PALPIDVGFPEKDIPTGDSFELPDDDGNNNIQNARLAGYDAHSNSKLRIEVQHDHLKSPNATAEGCYLAEL---------------------DVIDNSYY

Query:  DGL-SLLLSSPNEVNHDQTTNAINAETVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCS-------------------DEPLFCTLNTEDPEISSNDDV
        DGL SLLL+SPNEVNHDQT +A+N ET+LPTD+M+DPP+ACSGELYEKGSHCSDGHLDCS                   DEPLFCTLNTEDPEI SNDDV
Subjt:  DGL-SLLLSSPNEVNHDQTTNAINAETVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCS-------------------DEPLFCTLNTEDPEISSNDDV

Query:  FLPPLSTTATMGYHFQDCMDPTFSSIKDFSCKEKSGETIQNLVQRERKNHGQPRVSSLSIGLRGLPERGEKHLIGGAAVNLKLSHSNIIHVPSANK----
        FLPPLST ++MGYHFQD +D TFSSIKDFSC EKSGE  QNLVQRERKNHGQP VSSLSIGL GLPERGEKHL+GGAAVNLKL HSN IHVPSAN     
Subjt:  FLPPLSTTATMGYHFQDCMDPTFSSIKDFSCKEKSGETIQNLVQRERKNHGQPRVSSLSIGLRGLPERGEKHLIGGAAVNLKLSHSNIIHVPSANK----

Query:  --TSSRNVNNDAILPVALKEENNEISQVNHLGQNFLNTHVEKPDFDSNNVRRYPPNT----------LTTVKDRRLSQEVGARAVFGVEQDGISSTSDQE
          +SS N N DAILPV LKEE+ EIS+VNHLGQNFLNTHVEKP FDS N R+YPP+T          LT VKD RLSQE G+R VFGVEQDGISSTSDQE
Subjt:  --TSSRNVNNDAILPVALKEENNEISQVNHLGQNFLNTHVEKPDFDSNNVRRYPPNT----------LTTVKDRRLSQEVGARAVFGVEQDGISSTSDQE

Query:  DLSIDSEDDLPHFSDIEAMILDMDLDPEDQDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSIASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVD
        +LSIDSEDD+PHFSDIEAMILDMDLDPEDQDL++SEEVL+YQHMDTKKRI+RLEQGA+A M+RS+ASHGALAVLYGR+SKHYIKKSEVLLGRAT + IVD
Subjt:  DLSIDSEDDLPHFSDIEAMILDMDLDPEDQDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSIASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVD

Query:  IDLGREGSGNKISRWQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEV
        IDLGREGSGNKISR QAIIKID+DGFFSLKNLGKCSISINNK+VAPGHCLRLNSGCLIE+
Subjt:  IDLGREGSGNKISRWQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEV

A0A6J1FHD3 uncharacterized protein LOC1114457850.0e+0075.81Show/hide
Query:  MGALAPIAPWNPEDDILLKNYVEAGASLDALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDCTR---------------------------
        MGALAPIAPW+PEDDILLKN VEAGASL+ALAKGAVQFSRR+TVRELQERWHSLLYDPIVSEDASMSMID  R                           
Subjt:  MGALAPIAPWNPEDDILLKNYVEAGASLDALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDCTR---------------------------

Query:  -----CYYALRKRICNEPFNPIDLSYLVGDSDYVGEESMSGNCIPPISDDFGFQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNGHE
             CYYALRKRICNEPFNP+  SYLVGDSDYV EE MSGNCIPP SDDFG QSSELGI+PCNFSQN MNND TEHTFHSGCQHTVE HFPQNLDNGHE
Subjt:  -----CYYALRKRICNEPFNPIDLSYLVGDSDYVGEESMSGNCIPPISDDFGFQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNGHE

Query:  RISRIMVDNLPFSANESKAKELAPSTL------------SQPSTFGQLSNDQRAMGSESEDSDVFNSPVSDSGASFHNVEYSSPLPGMPIFRNASPASPA
         IS IMVDNLPF ANES AKELAPS              + PSTFGQLS DQR MGSE ED+DVFNSPVSDSGASFHNVEYSSPLPGMPI+RN   A  A
Subjt:  RISRIMVDNLPFSANESKAKELAPSTL------------SQPSTFGQLSNDQRAMGSESEDSDVFNSPVSDSGASFHNVEYSSPLPGMPIFRNASPASPA

Query:  LPIDVGFPEKDIPTGDSFELPDDDGNNNIQNARLAGYDAHSNSKLRIEVQHDHLKSPNATAEGCYLAE----------LDVIDNSYYDGLSLLLSSPNEV
        LPIDVGF +KD+PTGDSFELPDDDGNNNIQNARLAGY+AHSNSKL+IEVQHDHLKSPNATAEGCYL E           D +     DGLSLLLSSPNEV
Subjt:  LPIDVGFPEKDIPTGDSFELPDDDGNNNIQNARLAGYDAHSNSKLRIEVQHDHLKSPNATAEGCYLAE----------LDVIDNSYYDGLSLLLSSPNEV

Query:  NHDQTTNAINAETVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCS-------------------DEPLFCTLNTEDPEISSNDDVFLPPLSTTATMGYH
        NHDQT NAIN+E VLPTDTMLDPPSACSGELYEKGSHCSDGHLDCS                   DEPLFCTLNTEDPEI SNDDVFLPPLST ATMGYH
Subjt:  NHDQTTNAINAETVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCS-------------------DEPLFCTLNTEDPEISSNDDVFLPPLSTTATMGYH

Query:  FQDCMDPTFSSIKDFSCKEKSGETIQNLVQRERKNH-GQPRVSSLSIGLRGLPERGEKHLIGGAAVNLKLSHSNIIHVPSANKTSSRNVNNDAILPVALK
        F DCM PTFSSI DFSCKE SGE  QNLVQRERKNH GQPRVS++S+GL  L ERGEKHLI GA VNLKLSHSN IHVPSANKTSS NVN+D ILPV L 
Subjt:  FQDCMDPTFSSIKDFSCKEKSGETIQNLVQRERKNH-GQPRVSSLSIGLRGLPERGEKHLIGGAAVNLKLSHSNIIHVPSANKTSSRNVNNDAILPVALK

Query:  EENNEISQVNHLGQNFLNTHVEKPDFDSNNVRRYPPNTLTTVKDRRLSQEVGARAVFGVEQDGISSTSDQEDLSIDSEDDLPHFSDIEAMILDMDLDPED
        EENNEIS                            PNT T+VKD RLS+EVG + VFGVEQDG+ STSD E+L IDSEDDLPHFSDIEAMILDMDLDPED
Subjt:  EENNEISQVNHLGQNFLNTHVEKPDFDSNNVRRYPPNTLTTVKDRRLSQEVGARAVFGVEQDGISSTSDQEDLSIDSEDDLPHFSDIEAMILDMDLDPED

Query:  QDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSIASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVDIDLGREGSGNKISRWQAIIKIDEDGFFSL
        QDL+ SEEVLKYQH+DT+KRIIRLEQG NA MQRSIASHGALAVL G+HSKHYIKKSEVLLGRATAEFIVDIDLG EGSGNKISR QAIIKID+DG FSL
Subjt:  QDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSIASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVDIDLGREGSGNKISRWQAIIKIDEDGFFSL

Query:  KNLGKCSISINNKDVAPGHCLRLNSGCLIEV
        KNLGKCSISINNKDVAPGHCLRLNSGCLIE+
Subjt:  KNLGKCSISINNKDVAPGHCLRLNSGCLIEV

A0A6J1HIH5 uncharacterized protein LOC111463912 isoform X10.0e+0070.64Show/hide
Query:  MGALAPIAPWNPEDDILLKNYVEAGASLDALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDCTR---------------------------
        MGALAP+APW PEDDILLKN VEAGASL++LAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMID  R                           
Subjt:  MGALAPIAPWNPEDDILLKNYVEAGASLDALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDCTR---------------------------

Query:  -----CYYALRKRICNEPF-NPIDLSYLVG--DSDYVGEESMSGNCIPPISDDFGFQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDN
             CYYALRKRICNEPF NP+DL++LVG  +S+YV EE MSGNCIPPISDDFG QSSE+GI+PC+FSQNVMN D  EHTF SGCQ TVEKHFP+NLDN
Subjt:  -----CYYALRKRICNEPF-NPIDLSYLVG--DSDYVGEESMSGNCIPPISDDFGFQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDN

Query:  GHERISRIMVDNLPFSANESKAKELAPST------------LSQPSTFGQLSNDQRAMGSESEDSDVFNSPVSDSGASFHNVEYSSPLPGMPIFRNASPA
        G E IS  M ++LP SA +S  +ELAPST             ++PSTFGQLSNDQRAMGSE ED++VFNSPVS+SGASFHNVEYSSPLPGMPI+RNAS  
Subjt:  GHERISRIMVDNLPFSANESKAKELAPST------------LSQPSTFGQLSNDQRAMGSESEDSDVFNSPVSDSGASFHNVEYSSPLPGMPIFRNASPA

Query:  SPALPIDVGFPEKDIPTGDSFELPDDDGNNNIQNARLAGYDAHSNSKLRIEVQHDHLKSPNATAEGCYLAEL---------------------DVIDNSY
        +PALPIDVGF +KDIPT +SFELPDDDGN NIQNAR+AGYDA+S+ KL+IEV+ DHLKSPNATAE  YLAEL                     D +D SY
Subjt:  SPALPIDVGFPEKDIPTGDSFELPDDDGNNNIQNARLAGYDAHSNSKLRIEVQHDHLKSPNATAEGCYLAEL---------------------DVIDNSY

Query:  YDGL-SLLLSSPNEVNHDQTTNAINAETVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCS-------------------DEPLFCTLNTEDPEISSNDD
        YDGL SLLL+SPNE+NHDQT NAINAETVLPTDTM+DPP+ACSG LYEKGSHC  GHLDC+                   DEPLFCTLNTEDP+I SNDD
Subjt:  YDGL-SLLLSSPNEVNHDQTTNAINAETVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCS-------------------DEPLFCTLNTEDPEISSNDD

Query:  VFLPPLSTTATMGYHFQDCMDPTFSSIKDFSCKEKSGETIQNLVQRERKNHGQPRVSSLSIGLRGLPERGEKHLIGGAAVNLKLSHSNIIHVPSANKTSS
        VFLPPLST ATMGY+FQDC++ TFSS KDF+  EKSGET QNL  RERKNHG    + ++  L G  ERGEKH +GGA VN + SHSN  H+PS +   S
Subjt:  VFLPPLSTTATMGYHFQDCMDPTFSSIKDFSCKEKSGETIQNLVQRERKNHGQPRVSSLSIGLRGLPERGEKHLIGGAAVNLKLSHSNIIHVPSANKTSS

Query:  RNVNNDAILPVALKEENNEISQVNHLGQNFLNTHVEKPDFDSNNVRRYPPNT----------LTTVKDRRLSQEVGARAVFGVEQDGISSTSDQEDLSID
         N N+DA LP  LKEENNEIS+VNHLG+NFLN H EKP FDS+NVR YPP+           L ++KD RLSQE G R  FGVEQ G+SSTSDQE+LSID
Subjt:  RNVNNDAILPVALKEENNEISQVNHLGQNFLNTHVEKPDFDSNNVRRYPPNT----------LTTVKDRRLSQEVGARAVFGVEQDGISSTSDQEDLSID

Query:  SEDDLPHFSDIEAMILDMDLDPEDQDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSIASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVDIDLGR
        SEDD+PHFSDIEAMILDMDLDPEDQDL+SSEEVLKYQH+DTKKRIIRLEQGANAYMQRS ASHGALAVLYGR+SKHYIKKSEVLLGRAT + IVDIDLGR
Subjt:  SEDDLPHFSDIEAMILDMDLDPEDQDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSIASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVDIDLGR

Query:  EGSGNKISRWQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEV
        EGSGNKISR QAIIK+D+DGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIE+
Subjt:  EGSGNKISRWQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEV

A0A6J1ITH7 uncharacterized protein LOC1114804510.0e+0075.69Show/hide
Query:  MGALAPIAPWNPEDDILLKNYVEAGASLDALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDCTR---------------------------
        MGALAPIAPW+PEDDILLKN VEA ASL+ALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMID  R                           
Subjt:  MGALAPIAPWNPEDDILLKNYVEAGASLDALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDCTR---------------------------

Query:  -----CYYALRKRICNEPFNPIDLSYLVGDSDYVGEESMSGNCIPPISDDFGFQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNGHE
             CYYALRKRICNEPFNP+  SYLVGDSDYV EE MSGNCIPP SDDFG QSSELG +PCNFSQN MNND TEHTFHSGCQHTVE HFPQNLDNGHE
Subjt:  -----CYYALRKRICNEPFNPIDLSYLVGDSDYVGEESMSGNCIPPISDDFGFQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNGHE

Query:  RISRIMVDNLPFSANESKAKELAPSTL------------SQPSTFGQLSNDQRAMGSESEDSDVFNSPVSDSGASFHNVEYSSPLPGMPIFRNASPASPA
         IS +MVDNLPF ANES AKELAPS              + PSTFGQLS DQR MGSE ED+DVFNSPVSDSGASFHNVEYSSPLPGMPI+R+   AS A
Subjt:  RISRIMVDNLPFSANESKAKELAPSTL------------SQPSTFGQLSNDQRAMGSESEDSDVFNSPVSDSGASFHNVEYSSPLPGMPIFRNASPASPA

Query:  LPIDVGFPEKDIPTGDSFELPDDDGNNNIQNARLAGYDAHSNSKLRIEVQHDHLKSPNATAEGCYLAELD--VIDNSY--------YDGLSLLLSSPNEV
        LPIDVGF +KD+PTGDSFELPDDDGNNNIQNARLAGY+AHSNSKL+IEVQHDHLKSPNATAEGCYL EL   V++ S          DGLSLLLSSPNEV
Subjt:  LPIDVGFPEKDIPTGDSFELPDDDGNNNIQNARLAGYDAHSNSKLRIEVQHDHLKSPNATAEGCYLAELD--VIDNSY--------YDGLSLLLSSPNEV

Query:  NHDQTTNAINAETVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCS-------------------DEPLFCTLNTEDPEISSNDDVFLPPLSTTATMGYH
        NHDQT NAI +ETVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCS                   DEPLFCTLNTEDPEI SNDDVFLPPLST ATMGYH
Subjt:  NHDQTTNAINAETVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCS-------------------DEPLFCTLNTEDPEISSNDDVFLPPLSTTATMGYH

Query:  FQDCMDPTFSSIKDFSCKEKSGETIQNLVQRERKNH-GQPRVSSLSIGLRGLPERGEKHLIGGAAVNLKLSHSNIIHVPSANKTSSRNVNNDAILPVALK
        F DCM PTFSS  DFSCKE SGE  QNLVQRERKNH GQP VS++S+GL  LPERGEKHLI G  VNLKLSHSN IHVPSANKTSS NVN+D ILPV L 
Subjt:  FQDCMDPTFSSIKDFSCKEKSGETIQNLVQRERKNH-GQPRVSSLSIGLRGLPERGEKHLIGGAAVNLKLSHSNIIHVPSANKTSSRNVNNDAILPVALK

Query:  EENNEISQVNHLGQNFLNTHVEKPDFDSNNVRRYPPNTLTTVKDRRLSQEVGARAVFGVEQDGISSTSDQEDLSIDSEDDLPHFSDIEAMILDMDLDPED
        EENNEIS                            PNT T+VKD RLS+EVG + VFGVEQDG+ STSD E+L IDSEDDLPHFSDIEAMILDMDLDPED
Subjt:  EENNEISQVNHLGQNFLNTHVEKPDFDSNNVRRYPPNTLTTVKDRRLSQEVGARAVFGVEQDGISSTSDQEDLSIDSEDDLPHFSDIEAMILDMDLDPED

Query:  QDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSIASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVDIDLGREGSGNKISRWQAIIKIDEDGFFSL
        QDL+ +EEVLKYQH+DT+KRIIRLEQG NA MQRSIASHGALAVL GRHSKHYIKKSEVLLGRATAEFIVDIDLG EGSGNKISR QAIIKID+DGFFSL
Subjt:  QDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSIASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVDIDLGREGSGNKISRWQAIIKIDEDGFFSL

Query:  KNLGKCSISINNKDVAPGHCLRLNSGCLIEV
        KNLGKCSISINNKDVAPGHCLRLNSGCLIE+
Subjt:  KNLGKCSISINNKDVAPGHCLRLNSGCLIEV

SwissProt top hitse value%identityAlignment
Q96EZ8 Microspherule protein 12.1e-1032.37Show/hide
Query:  DQEDLSIDSEDDLPHFSDIEAMILDMDL-DPEDQDLFSSEEVLKYQHMDTKKRIIRLEQGANAY---------MQRSIASHGALAVLYGRHSKHYIKKSE
        DQ    +   D + +FSD E +I D  L D  D+ L   E  L       K+ I +LEQ  + +         M      +  LAVL GR  ++ ++  E
Subjt:  DQEDLSIDSEDDLPHFSDIEAMILDMDL-DPEDQDLFSSEEVLKYQHMDTKKRIIRLEQGANAY---------MQRSIASHGALAVLYGRHSKHYIKKSE

Query:  VLLGRATAEFIVDIDLGREGSGNKISRWQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEV
        + LGRAT +  +D+DL  EG   KISR Q +IK+  +G F + N G+  I I+ + V  G   RL++  ++E+
Subjt:  VLLGRATAEFIVDIDLGREGSGNKISRWQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEV

Q96EZ8 Microspherule protein 15.8e-0849.15Show/hide
Query:  WNPEDDILLKNYVEAGASLDALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSM
        W P DD+LL N V     L ++  G V+FS R+T+RE+QERW++LLYDP++S+ A  +M
Subjt:  WNPEDDILLKNYVEAGASLDALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSM

Q99L90 Microspherule protein 19.5e-1133.53Show/hide
Query:  DQEDLSIDSEDDLPHFSDIEAMILDMDL-DPEDQDLFSSEEVLKYQHMDTKKRIIRLEQGANAY--MQRSIASHGA-------LAVLYGRHSKHYIKKSE
        DQ    +   D + +FSD E +I D  L D  D+ L   E  L       K+ I +LEQ  + +  +  SI   G+       LAVL GR  ++ ++  E
Subjt:  DQEDLSIDSEDDLPHFSDIEAMILDMDL-DPEDQDLFSSEEVLKYQHMDTKKRIIRLEQGANAY--MQRSIASHGA-------LAVLYGRHSKHYIKKSE

Query:  VLLGRATAEFIVDIDLGREGSGNKISRWQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEV
        + LGRAT +  +D+DL  EG   KISR Q +IK+  +G F + N G+  I I+ + V  G   RL++  ++E+
Subjt:  VLLGRATAEFIVDIDLGREGSGNKISRWQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEV

Q99L90 Microspherule protein 15.8e-0849.15Show/hide
Query:  WNPEDDILLKNYVEAGASLDALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSM
        W P DD+LL N V     L ++  G V+FS R+T+RE+QERW++LLYDP++S+ A  +M
Subjt:  WNPEDDILLKNYVEAGASLDALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSM

Arabidopsis top hitse value%identityAlignment
AT1G60700.1 SMAD/FHA domain-containing protein1.0e-2341.71Show/hide
Query:  STSDQEDLSIDSEDDLPHFSDIEAMILDMDLDPEDQD-LFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSIASHGALAVLYGRHSKHYIKKSEVLLGRA
        ST  QE+  +D E+++    DI+AMI  ++L P+D D  F+ EE    +H   +  +I LEQ     MQR+I  HGA+AVL+   SKH+++K EV++GR+
Subjt:  STSDQEDLSIDSEDDLPHFSDIEAMILDMDLDPEDQD-LFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSIASHGALAVLYGRHSKHYIKKSEVLLGRA

Query:  TAEFIVDIDLGREGSGNKISRWQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEVCCSSNSKG
        +    VDIDLG+   G+KISR QA++K++  G FSLKNLGK  I +N   +  G  + L S      C S N +G
Subjt:  TAEFIVDIDLGREGSGNKISRWQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEVCCSSNSKG

AT1G75530.1 Forkhead-associated (FHA) domain-containing protein3.4e-4039.15Show/hide
Query:  SANKTSSRNVNNDAILPV---ALKEENNEISQVNHLGQNFLNTHVEKPDFDSNNVRRYPPNTLTTVKDRRLS---QEVGARAVFGVEQDGISSTSDQEDL
        S N +S RN     + PV   +L+ ++ +I   +  G   + T        S++      +T+       L    +          E D   +  ++ ++
Subjt:  SANKTSSRNVNNDAILPV---ALKEENNEISQVNHLGQNFLNTHVEKPDFDSNNVRRYPPNTLTTVKDRRLS---QEVGARAVFGVEQDGISSTSDQEDL

Query:  SIDSEDDLPHFSDIEAMILDMDLDPEDQDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSIASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVDID
         I+S+++LP FSD+EAMILDMDL+P  QD +   +  KY++ +  ++I+RLEQ A +YM R IA+HGA A+LYG  SKHYI K EVLLGRAT E+ VDID
Subjt:  SIDSEDDLPHFSDIEAMILDMDLDPEDQDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQRSIASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVDID

Query:  LGREGSGNKISRWQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEV
        LGR GS  + SR QA+IK+ +DG F +KNLGK SI +N++++  G  + L + CLI++
Subjt:  LGREGSGNKISRWQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEV

AT1G75530.1 Forkhead-associated (FHA) domain-containing protein6.1e-1358.93Show/hide
Query:  WNPEDDILLKNYVEAGASLDALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDAS
        W PEDD LL+  +E G SL+ LAKGAV+FSR++T+ EL ERWH LLY+P V+  +S
Subjt:  WNPEDDILLKNYVEAGASLDALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDAS

AT3G54350.1 Forkhead-associated (FHA) domain-containing protein3.5e-8532.46Show/hide
Query:  MGALAPIAPWNPEDDILLKNYVEAGASLDALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDCTR---------------------------
        MGALA + PW PEDD+LLKN VEAGASL++LAKGAVQFSRR+++RELQ+RWH+LLYDP+VS +A+  M +  R                           
Subjt:  MGALAPIAPWNPEDDILLKNYVEAGASLDALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDCTR---------------------------

Query:  -----CYYALRKRICNEPFNPIDLSYLVGDSDYVGEESMSGNCIPPISDDFGFQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNGHE
              Y++LRK+   EPFN +DL +LV                                 P N S  + N D T           +   FP+ L  G  
Subjt:  -----CYYALRKRICNEPFNPIDLSYLVGDSDYVGEESMSGNCIPPISDDFGFQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNGHE

Query:  RISRIMVDNLPFSANESKAKELAPSTLSQPSTFGQLSNDQRAMGSESEDSDVFNSPVSDSGASFHNVEYSSPLPGMPIFRNASPASPAL-PIDVGFPEKD
            +    LP    E   +   P    +  TF           +E     V +    DS     N  + +        +N   ++  L  +     E+D
Subjt:  RISRIMVDNLPFSANESKAKELAPSTLSQPSTFGQLSNDQRAMGSESEDSDVFNSPVSDSGASFHNVEYSSPLPGMPIFRNASPASPAL-PIDVGFPEKD

Query:  IPTGDSFELPDDDGNNNIQNARLAGYDAHSNSKLRIEVQHDHLKSPNATAEGCYLAELDVIDNSYYDGL-SLLLSSPNEVNHDQTTNAINAETVLPTDTM
        +                                            P    +G        +D SYYDGL SLL++S N+ N +   N    E   P+   
Subjt:  IPTGDSFELPDDDGNNNIQNARLAGYDAHSNSKLRIEVQHDHLKSPNATAEGCYLAELDVIDNSYYDGL-SLLLSSPNEVNHDQTTNAINAETVLPTDTM

Query:  LDPPSACSGE---LYEKGSHCSDGHLDCSDEPLFCTLNTEDPEISSNDDVFLP----PLSTTATMGYHFQDCMDPTFSSIKDFSCKEKSGETIQNLVQRE
          P  A   +   L   G+   D H +     + C LN EDP+I  NDD+FL     P+S ++    +F+D   P  + ++D S  ++  E     +Q +
Subjt:  LDPPSACSGE---LYEKGSHCSDGHLDCSDEPLFCTLNTEDPEISSNDDVFLP----PLSTTATMGYHFQDCMDPTFSSIKDFSCKEKSGETIQNLVQRE

Query:  RKNHGQPRVSSLSIGLRGLPERGEKHLIGGAAVNLKLSHSNIIHVPSANKTSSRNVNNDAILPVALKEENNEISQVNHLGQNFLNT--HVEKPDFDSNNV
        +K  G+ + S+      G P +G K     A+ + +L ++     P  + ++    N         K+   E +     G  F+ +  H   P+ DS N 
Subjt:  RKNHGQPRVSSLSIGLRGLPERGEKHLIGGAAVNLKLSHSNIIHVPSANKTSSRNVNNDAILPVALKEENNEISQVNHLGQNFLNT--HVEKPDFDSNNV

Query:  RRY----PPNTLTTVKDRRLSQEVGARAVFGVEQDGISSTSDQEDLSIDSEDDLPHFSDIEAMILDMDLDPEDQDLFSSEEVLKYQHMDTKKRIIRLEQG
        +      P N     KD      +    V  +E     + ++ E    +S++DLP++SDIEAMILDMDL+P+DQD F   EV KYQ  D K+ IIRLEQ 
Subjt:  RRY----PPNTLTTVKDRRLSQEVGARAVFGVEQDGISSTSDQEDLSIDSEDDLPHFSDIEAMILDMDLDPEDQDLFSSEEVLKYQHMDTKKRIIRLEQG

Query:  ANAYMQRSIASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVDIDLGREGSGNKISRWQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGC
        A++YMQR+IAS GA AVLYGR+SKHYIKK EVL+GR+T +  VDIDLGRE  G+KISR QAII++ +DG F +KNLGK SIS+N K+V PG  L L S C
Subjt:  ANAYMQRSIASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVDIDLGREGSGNKISRWQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGC

Query:  LIEV
        L+E+
Subjt:  LIEV

AT3G54350.2 Forkhead-associated (FHA) domain-containing protein3.5e-8532.46Show/hide
Query:  MGALAPIAPWNPEDDILLKNYVEAGASLDALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDCTR---------------------------
        MGALA + PW PEDD+LLKN VEAGASL++LAKGAVQFSRR+++RELQ+RWH+LLYDP+VS +A+  M +  R                           
Subjt:  MGALAPIAPWNPEDDILLKNYVEAGASLDALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDCTR---------------------------

Query:  -----CYYALRKRICNEPFNPIDLSYLVGDSDYVGEESMSGNCIPPISDDFGFQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNGHE
              Y++LRK+   EPFN +DL +LV                                 P N S  + N D T           +   FP+ L  G  
Subjt:  -----CYYALRKRICNEPFNPIDLSYLVGDSDYVGEESMSGNCIPPISDDFGFQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNGHE

Query:  RISRIMVDNLPFSANESKAKELAPSTLSQPSTFGQLSNDQRAMGSESEDSDVFNSPVSDSGASFHNVEYSSPLPGMPIFRNASPASPAL-PIDVGFPEKD
            +    LP    E   +   P    +  TF           +E     V +    DS     N  + +        +N   ++  L  +     E+D
Subjt:  RISRIMVDNLPFSANESKAKELAPSTLSQPSTFGQLSNDQRAMGSESEDSDVFNSPVSDSGASFHNVEYSSPLPGMPIFRNASPASPAL-PIDVGFPEKD

Query:  IPTGDSFELPDDDGNNNIQNARLAGYDAHSNSKLRIEVQHDHLKSPNATAEGCYLAELDVIDNSYYDGL-SLLLSSPNEVNHDQTTNAINAETVLPTDTM
        +                                            P    +G        +D SYYDGL SLL++S N+ N +   N    E   P+   
Subjt:  IPTGDSFELPDDDGNNNIQNARLAGYDAHSNSKLRIEVQHDHLKSPNATAEGCYLAELDVIDNSYYDGL-SLLLSSPNEVNHDQTTNAINAETVLPTDTM

Query:  LDPPSACSGE---LYEKGSHCSDGHLDCSDEPLFCTLNTEDPEISSNDDVFLP----PLSTTATMGYHFQDCMDPTFSSIKDFSCKEKSGETIQNLVQRE
          P  A   +   L   G+   D H +     + C LN EDP+I  NDD+FL     P+S ++    +F+D   P  + ++D S  ++  E     +Q +
Subjt:  LDPPSACSGE---LYEKGSHCSDGHLDCSDEPLFCTLNTEDPEISSNDDVFLP----PLSTTATMGYHFQDCMDPTFSSIKDFSCKEKSGETIQNLVQRE

Query:  RKNHGQPRVSSLSIGLRGLPERGEKHLIGGAAVNLKLSHSNIIHVPSANKTSSRNVNNDAILPVALKEENNEISQVNHLGQNFLNT--HVEKPDFDSNNV
        +K  G+ + S+      G P +G K     A+ + +L ++     P  + ++    N         K+   E +     G  F+ +  H   P+ DS N 
Subjt:  RKNHGQPRVSSLSIGLRGLPERGEKHLIGGAAVNLKLSHSNIIHVPSANKTSSRNVNNDAILPVALKEENNEISQVNHLGQNFLNT--HVEKPDFDSNNV

Query:  RRY----PPNTLTTVKDRRLSQEVGARAVFGVEQDGISSTSDQEDLSIDSEDDLPHFSDIEAMILDMDLDPEDQDLFSSEEVLKYQHMDTKKRIIRLEQG
        +      P N     KD      +    V  +E     + ++ E    +S++DLP++SDIEAMILDMDL+P+DQD F   EV KYQ  D K+ IIRLEQ 
Subjt:  RRY----PPNTLTTVKDRRLSQEVGARAVFGVEQDGISSTSDQEDLSIDSEDDLPHFSDIEAMILDMDLDPEDQDLFSSEEVLKYQHMDTKKRIIRLEQG

Query:  ANAYMQRSIASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVDIDLGREGSGNKISRWQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGC
        A++YMQR+IAS GA AVLYGR+SKHYIKK EVL+GR+T +  VDIDLGRE  G+KISR QAII++ +DG F +KNLGK SIS+N K+V PG  L L S C
Subjt:  ANAYMQRSIASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVDIDLGREGSGNKISRWQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGC

Query:  LIEV
        L+E+
Subjt:  LIEV

AT3G54350.3 Forkhead-associated (FHA) domain-containing protein3.5e-8532.46Show/hide
Query:  MGALAPIAPWNPEDDILLKNYVEAGASLDALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDCTR---------------------------
        MGALA + PW PEDD+LLKN VEAGASL++LAKGAVQFSRR+++RELQ+RWH+LLYDP+VS +A+  M +  R                           
Subjt:  MGALAPIAPWNPEDDILLKNYVEAGASLDALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDCTR---------------------------

Query:  -----CYYALRKRICNEPFNPIDLSYLVGDSDYVGEESMSGNCIPPISDDFGFQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNGHE
              Y++LRK+   EPFN +DL +LV                                 P N S  + N D T           +   FP+ L  G  
Subjt:  -----CYYALRKRICNEPFNPIDLSYLVGDSDYVGEESMSGNCIPPISDDFGFQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNGHE

Query:  RISRIMVDNLPFSANESKAKELAPSTLSQPSTFGQLSNDQRAMGSESEDSDVFNSPVSDSGASFHNVEYSSPLPGMPIFRNASPASPAL-PIDVGFPEKD
            +    LP    E   +   P    +  TF           +E     V +    DS     N  + +        +N   ++  L  +     E+D
Subjt:  RISRIMVDNLPFSANESKAKELAPSTLSQPSTFGQLSNDQRAMGSESEDSDVFNSPVSDSGASFHNVEYSSPLPGMPIFRNASPASPAL-PIDVGFPEKD

Query:  IPTGDSFELPDDDGNNNIQNARLAGYDAHSNSKLRIEVQHDHLKSPNATAEGCYLAELDVIDNSYYDGL-SLLLSSPNEVNHDQTTNAINAETVLPTDTM
        +                                            P    +G        +D SYYDGL SLL++S N+ N +   N    E   P+   
Subjt:  IPTGDSFELPDDDGNNNIQNARLAGYDAHSNSKLRIEVQHDHLKSPNATAEGCYLAELDVIDNSYYDGL-SLLLSSPNEVNHDQTTNAINAETVLPTDTM

Query:  LDPPSACSGE---LYEKGSHCSDGHLDCSDEPLFCTLNTEDPEISSNDDVFLP----PLSTTATMGYHFQDCMDPTFSSIKDFSCKEKSGETIQNLVQRE
          P  A   +   L   G+   D H +     + C LN EDP+I  NDD+FL     P+S ++    +F+D   P  + ++D S  ++  E     +Q +
Subjt:  LDPPSACSGE---LYEKGSHCSDGHLDCSDEPLFCTLNTEDPEISSNDDVFLP----PLSTTATMGYHFQDCMDPTFSSIKDFSCKEKSGETIQNLVQRE

Query:  RKNHGQPRVSSLSIGLRGLPERGEKHLIGGAAVNLKLSHSNIIHVPSANKTSSRNVNNDAILPVALKEENNEISQVNHLGQNFLNT--HVEKPDFDSNNV
        +K  G+ + S+      G P +G K     A+ + +L ++     P  + ++    N         K+   E +     G  F+ +  H   P+ DS N 
Subjt:  RKNHGQPRVSSLSIGLRGLPERGEKHLIGGAAVNLKLSHSNIIHVPSANKTSSRNVNNDAILPVALKEENNEISQVNHLGQNFLNT--HVEKPDFDSNNV

Query:  RRY----PPNTLTTVKDRRLSQEVGARAVFGVEQDGISSTSDQEDLSIDSEDDLPHFSDIEAMILDMDLDPEDQDLFSSEEVLKYQHMDTKKRIIRLEQG
        +      P N     KD      +    V  +E     + ++ E    +S++DLP++SDIEAMILDMDL+P+DQD F   EV KYQ  D K+ IIRLEQ 
Subjt:  RRY----PPNTLTTVKDRRLSQEVGARAVFGVEQDGISSTSDQEDLSIDSEDDLPHFSDIEAMILDMDLDPEDQDLFSSEEVLKYQHMDTKKRIIRLEQG

Query:  ANAYMQRSIASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVDIDLGREGSGNKISRWQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGC
        A++YMQR+IAS GA AVLYGR+SKHYIKK EVL+GR+T +  VDIDLGRE  G+KISR QAII++ +DG F +KNLGK SIS+N K+V PG  L L S C
Subjt:  ANAYMQRSIASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVDIDLGREGSGNKISRWQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGC

Query:  LIEV
        L+E+
Subjt:  LIEV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGCTCTTGCCCCTATCGCGCCTTGGAATCCTGAAGACGATATTCTGCTCAAGAATTATGTCGAGGCAGGTGCTTCCTTGGATGCCCTTGCCAAAGGTGCTGTGCA
GTTTTCTCGAAGATACACAGTGAGAGAATTGCAAGAACGATGGCATTCTTTACTTTATGATCCAATTGTATCTGAAGATGCATCTATGTCCATGATTGACTGTACGCGTT
GCTATTATGCTTTGCGTAAAAGAATTTGCAACGAACCATTTAATCCTATAGACCTGAGTTATCTTGTTGGCGATAGTGACTATGTCGGTGAAGAGTCTATGTCAGGAAAT
TGTATCCCTCCAATATCAGATGATTTTGGATTTCAGAGCTCAGAGCTGGGGATCATGCCCTGTAATTTTTCCCAAAATGTGATGAATAATGATGGTACTGAGCACACTTT
TCATTCTGGATGTCAACATACAGTTGAAAAGCATTTTCCTCAGAACTTGGATAATGGACATGAAAGAATTTCTCGTATTATGGTAGATAATCTGCCTTTCTCTGCAAATG
AATCTAAAGCAAAAGAATTGGCTCCGTCAACGCTTTCCCAACCTTCCACTTTTGGGCAACTGAGCAATGACCAGAGAGCGATGGGCTCTGAATCAGAGGACAGTGATGTC
TTTAATTCTCCCGTTTCTGATTCTGGCGCATCATTTCATAATGTTGAGTACTCATCTCCTCTTCCTGGTATGCCAATATTTAGAAATGCCTCGCCAGCCTCGCCAGCCTT
GCCGATTGATGTTGGCTTTCCAGAGAAGGATATACCTACAGGAGACTCTTTTGAACTCCCTGATGATGATGGGAACAACAACATTCAAAATGCAAGATTAGCTGGCTATG
ATGCTCACTCTAACTCAAAGTTGAGGATTGAAGTGCAGCATGATCATTTGAAAAGTCCAAATGCCACTGCTGAAGGTTGTTATCTAGCGGAATTGGATGTAATTGATAAC
TCATACTATGATGGGTTGTCGCTTTTGTTGAGTTCACCAAATGAAGTTAATCATGATCAAACAACCAATGCAATTAATGCAGAAACAGTGTTACCAACGGATACAATGTT
AGATCCCCCCTCTGCATGTTCTGGAGAGTTATATGAAAAGGGATCCCACTGTAGTGATGGACATCTGGATTGTAGTGATGAACCTCTTTTCTGCACTTTGAACACAGAAG
ACCCAGAAATCTCGAGCAACGATGATGTTTTTCTACCTCCATTGTCAACAACTGCTACTATGGGATACCATTTTCAAGATTGCATGGATCCTACCTTTTCATCTATTAAG
GATTTCTCTTGTAAAGAAAAATCTGGTGAAACGATTCAAAACCTTGTGCAAAGGGAGAGGAAAAATCATGGACAACCCCGTGTTTCATCTCTATCAATTGGATTGCGTGG
TTTGCCTGAAAGAGGTGAAAAACATCTGATTGGTGGAGCTGCTGTTAATTTAAAATTATCCCATAGCAACATCATACACGTGCCATCTGCAAATAAAACCAGCTCCAGAA
ATGTAAATAACGATGCTATCCTACCTGTTGCACTCAAGGAAGAGAACAATGAAATTTCCCAGGTAAATCATCTTGGCCAGAATTTTTTAAATACTCATGTAGAGAAGCCA
GACTTTGATTCTAACAATGTTAGGAGATATCCACCAAATACATTGACTACAGTGAAAGATCGTCGATTGTCGCAAGAAGTGGGTGCTCGAGCTGTTTTTGGTGTAGAACA
AGATGGAATATCTTCGACATCTGATCAAGAAGATTTATCTATTGATAGTGAAGATGATTTACCTCATTTTTCAGATATTGAGGCAATGATTCTTGATATGGACTTGGATC
CAGAAGATCAGGATTTGTTTTCAAGTGAAGAAGTCTTAAAATATCAACATATGGACACCAAGAAGAGAATCATCAGACTGGAGCAAGGGGCTAACGCTTACATGCAAAGA
TCTATTGCCTCTCATGGGGCATTAGCAGTTCTGTATGGCCGACATTCAAAGCATTACATTAAGAAATCAGAGGTTCTATTGGGTAGAGCAACTGCAGAATTCATTGTGGA
CATTGACTTAGGAAGGGAGGGAAGCGGTAACAAAATATCTCGATGGCAGGCAATTATAAAAATAGATGAGGATGGATTTTTCTCCCTGAAGAATCTTGGCAAGTGCTCAA
TCTCTATAAATAACAAGGATGTGGCCCCTGGTCACTGCCTGCGACTTAATTCTGGTTGCTTGATTGAGGTATGTTGCTCTAGTAACTCGAAAGGACGATGTATCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAGCTCTTGCCCCTATCGCGCCTTGGAATCCTGAAGACGATATTCTGCTCAAGAATTATGTCGAGGCAGGTGCTTCCTTGGATGCCCTTGCCAAAGGTGCTGTGCA
GTTTTCTCGAAGATACACAGTGAGAGAATTGCAAGAACGATGGCATTCTTTACTTTATGATCCAATTGTATCTGAAGATGCATCTATGTCCATGATTGACTGTACGCGTT
GCTATTATGCTTTGCGTAAAAGAATTTGCAACGAACCATTTAATCCTATAGACCTGAGTTATCTTGTTGGCGATAGTGACTATGTCGGTGAAGAGTCTATGTCAGGAAAT
TGTATCCCTCCAATATCAGATGATTTTGGATTTCAGAGCTCAGAGCTGGGGATCATGCCCTGTAATTTTTCCCAAAATGTGATGAATAATGATGGTACTGAGCACACTTT
TCATTCTGGATGTCAACATACAGTTGAAAAGCATTTTCCTCAGAACTTGGATAATGGACATGAAAGAATTTCTCGTATTATGGTAGATAATCTGCCTTTCTCTGCAAATG
AATCTAAAGCAAAAGAATTGGCTCCGTCAACGCTTTCCCAACCTTCCACTTTTGGGCAACTGAGCAATGACCAGAGAGCGATGGGCTCTGAATCAGAGGACAGTGATGTC
TTTAATTCTCCCGTTTCTGATTCTGGCGCATCATTTCATAATGTTGAGTACTCATCTCCTCTTCCTGGTATGCCAATATTTAGAAATGCCTCGCCAGCCTCGCCAGCCTT
GCCGATTGATGTTGGCTTTCCAGAGAAGGATATACCTACAGGAGACTCTTTTGAACTCCCTGATGATGATGGGAACAACAACATTCAAAATGCAAGATTAGCTGGCTATG
ATGCTCACTCTAACTCAAAGTTGAGGATTGAAGTGCAGCATGATCATTTGAAAAGTCCAAATGCCACTGCTGAAGGTTGTTATCTAGCGGAATTGGATGTAATTGATAAC
TCATACTATGATGGGTTGTCGCTTTTGTTGAGTTCACCAAATGAAGTTAATCATGATCAAACAACCAATGCAATTAATGCAGAAACAGTGTTACCAACGGATACAATGTT
AGATCCCCCCTCTGCATGTTCTGGAGAGTTATATGAAAAGGGATCCCACTGTAGTGATGGACATCTGGATTGTAGTGATGAACCTCTTTTCTGCACTTTGAACACAGAAG
ACCCAGAAATCTCGAGCAACGATGATGTTTTTCTACCTCCATTGTCAACAACTGCTACTATGGGATACCATTTTCAAGATTGCATGGATCCTACCTTTTCATCTATTAAG
GATTTCTCTTGTAAAGAAAAATCTGGTGAAACGATTCAAAACCTTGTGCAAAGGGAGAGGAAAAATCATGGACAACCCCGTGTTTCATCTCTATCAATTGGATTGCGTGG
TTTGCCTGAAAGAGGTGAAAAACATCTGATTGGTGGAGCTGCTGTTAATTTAAAATTATCCCATAGCAACATCATACACGTGCCATCTGCAAATAAAACCAGCTCCAGAA
ATGTAAATAACGATGCTATCCTACCTGTTGCACTCAAGGAAGAGAACAATGAAATTTCCCAGGTAAATCATCTTGGCCAGAATTTTTTAAATACTCATGTAGAGAAGCCA
GACTTTGATTCTAACAATGTTAGGAGATATCCACCAAATACATTGACTACAGTGAAAGATCGTCGATTGTCGCAAGAAGTGGGTGCTCGAGCTGTTTTTGGTGTAGAACA
AGATGGAATATCTTCGACATCTGATCAAGAAGATTTATCTATTGATAGTGAAGATGATTTACCTCATTTTTCAGATATTGAGGCAATGATTCTTGATATGGACTTGGATC
CAGAAGATCAGGATTTGTTTTCAAGTGAAGAAGTCTTAAAATATCAACATATGGACACCAAGAAGAGAATCATCAGACTGGAGCAAGGGGCTAACGCTTACATGCAAAGA
TCTATTGCCTCTCATGGGGCATTAGCAGTTCTGTATGGCCGACATTCAAAGCATTACATTAAGAAATCAGAGGTTCTATTGGGTAGAGCAACTGCAGAATTCATTGTGGA
CATTGACTTAGGAAGGGAGGGAAGCGGTAACAAAATATCTCGATGGCAGGCAATTATAAAAATAGATGAGGATGGATTTTTCTCCCTGAAGAATCTTGGCAAGTGCTCAA
TCTCTATAAATAACAAGGATGTGGCCCCTGGTCACTGCCTGCGACTTAATTCTGGTTGCTTGATTGAGGTATGTTGCTCTAGTAACTCGAAAGGACGATGTATCTGATTA
GCTTTTTTGGAAAATGATCCCCGTCTCTTAACTCGCGAATTGAGCACCTTAAGATTACTGACAATGTTCAGTAGCATTACAACTGCGTCTTTGAGTTCTAAGATTTACCG
GTGGCAGAAGTATTGTTGTTGTTTCATGATTTCATTGACTCCCGCATCTTTTATGATTGACATCATTAGTATGGCCTTAGGTATATTGATGGGACAAAAAATTGCATTAG
GTATATAAAGAAAATACATACTCTCTCCCCATTGATTACTTATTGAAACCTAGACATTGTGCATAAAATTATGTTTAGAAGATACAGCAGTTTCATTTTTTTGTCTGTTA
TATTACTATTCTTCATGGGCCTGTAGCCTTGGAAAGGTTGGAACTCTGGATTTTTGACGTAATCATGTTGGACTGTACGTCTATGCTCAAGAATTTGAATGTTTTACTCT
TTTCTGAAGATAAGTGGAATGCCATTTATATTTGAGTCAAACTAATTTCGCATGAAGCAGTATGTGGGATAATATAGGCAAGACATCTCACAAACAGGAGTATCAATCAT
AATGATAGATGACAACCTAGCTGACATGGACGGTTGTACAGTCGGAAGGTGCTCATTTTTCTGCACTTTCTACTGGGTTCTTTAGATGGGAAACAATGGCAACACATTTG
TGGAGATTATTTCGAACGGTAACATCAGCAAGGAGCTGTCCAACCAAAGTGCGAAGATGTAGGTAAATTTACACTCGTTATCTACCCAACCCAGCCACGATACCTGACTC
AAACCTGCATTCTGGTTACTCCTTTAGACATCCAAGTGGGTTATGGCGGTCCCATAAGTTTCACCATTTGATGTTCCCATCGTTGCAGTTTGGAGCCCACCAACTAAAGT
TGATATCTTTTTGGGATATAGCTTCAGGTCGAAAATGAAGGTAATGGTCCCTGAAATCGAATCTGGGCTCCAGCTTGGCCAATTCCATACCTGTCTAGAAGGCGTAAGAA
GGGATAGCTTCGAAGGAGGTTCCTTCGGTTACGCTAGCATTAACCTCTTGCATTATTTAACAATATCATGTATGGAATATACAGTTACAAGAAACTGATCTTAACCTTAG
CTGACAAGTTGTGTGACACAGTGTTCTGTTGTACAATCTTGAATCTTTAATGTAATATATTTGCAAATATTATCTTGTAATAAA
Protein sequenceShow/hide protein sequence
MGALAPIAPWNPEDDILLKNYVEAGASLDALAKGAVQFSRRYTVRELQERWHSLLYDPIVSEDASMSMIDCTRCYYALRKRICNEPFNPIDLSYLVGDSDYVGEESMSGN
CIPPISDDFGFQSSELGIMPCNFSQNVMNNDGTEHTFHSGCQHTVEKHFPQNLDNGHERISRIMVDNLPFSANESKAKELAPSTLSQPSTFGQLSNDQRAMGSESEDSDV
FNSPVSDSGASFHNVEYSSPLPGMPIFRNASPASPALPIDVGFPEKDIPTGDSFELPDDDGNNNIQNARLAGYDAHSNSKLRIEVQHDHLKSPNATAEGCYLAELDVIDN
SYYDGLSLLLSSPNEVNHDQTTNAINAETVLPTDTMLDPPSACSGELYEKGSHCSDGHLDCSDEPLFCTLNTEDPEISSNDDVFLPPLSTTATMGYHFQDCMDPTFSSIK
DFSCKEKSGETIQNLVQRERKNHGQPRVSSLSIGLRGLPERGEKHLIGGAAVNLKLSHSNIIHVPSANKTSSRNVNNDAILPVALKEENNEISQVNHLGQNFLNTHVEKP
DFDSNNVRRYPPNTLTTVKDRRLSQEVGARAVFGVEQDGISSTSDQEDLSIDSEDDLPHFSDIEAMILDMDLDPEDQDLFSSEEVLKYQHMDTKKRIIRLEQGANAYMQR
SIASHGALAVLYGRHSKHYIKKSEVLLGRATAEFIVDIDLGREGSGNKISRWQAIIKIDEDGFFSLKNLGKCSISINNKDVAPGHCLRLNSGCLIEVCCSSNSKGRCI