; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0026547 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0026547
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionPHD-type domain-containing protein
Genome locationchr10:38900452..38905416
RNA-Seq ExpressionLag0026547
SyntenyLag0026547
Gene Ontology termsNA
InterPro domainsIPR001005 - SANT/Myb domain
IPR001965 - Zinc finger, PHD-type
IPR009057 - Homeobox-like domain superfamily
IPR011011 - Zinc finger, FYVE/PHD-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR017930 - Myb domain
IPR019786 - Zinc finger, PHD-type, conserved site
IPR019787 - Zinc finger, PHD-finger


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022154229.1 uncharacterized protein LOC111021537 isoform X1 [Momordica charantia]4.2e-29753.23Show/hide
Query:  SALAWHWTIEALASFKEIKPSLLH-----------GDEKERGEMVALKCLEGLFDPLNYIGENGPPTQEPKVTFDSSRSCEHVVKRICKETPKSALRVAG
        S LAW  TIEALASF ++KPSLLH              K  GEMVAL+CLEGLF  LN + E+ PP  + KVTFDSS SCE+VVKRI KETPKSALRVAG
Subjt:  SALAWHWTIEALASFKEIKPSLLH-----------GDEKERGEMVALKCLEGLFDPLNYIGENGPPTQEPKVTFDSSRSCEHVVKRICKETPKSALRVAG

Query:  PDMLKWDVRPFIDQRRASMRCTLHQLKDTILDGTHPYADFLMKKSGLTPINKRDSISPNNEDGIELSRRLDSSFPDPPGQKEKGKG--------------
        P+MLKWDV PFI Q+RASMR TLHQLKDTILDGTHPY DFL  KSGL P+NKRD I+ NN+D  ELSRRLDSS PD  GQKEKGKG              
Subjt:  PDMLKWDVRPFIDQRRASMRCTLHQLKDTILDGTHPYADFLMKKSGLTPINKRDSISPNNEDGIELSRRLDSSFPDPPGQKEKGKG--------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------------------------------SPLLPEDERIISVV
                                                                                              SPLL EDER  SV 
Subjt:  --------------------------------------------------------------------------------------SPLLPEDERIISVV

Query:  SQSSSSLLPSKR----------------------------------------------------------------------------------------
          SSSSLLPSKR                                                                                        
Subjt:  SQSSSSLLPSKR----------------------------------------------------------------------------------------

Query:  ------------SAVEFTSEDEARQLPGCDDGYINIKKLEQHSAHTLYSGQKVASSHGIKVLEDLPERIVPQNERDDTDHLDEHQMTL-GEDKLVEEERF
                    S V+  SEDEA QLPGCDDGY+N+ KL QHSA TLYSGQ+VASSHG ++LED  ER+VPQNE DD + LDE  MTL  EDKL +EE F
Subjt:  ------------SAVEFTSEDEARQLPGCDDGYINIKKLEQHSAHTLYSGQKVASSHGIKVLEDLPERIVPQNERDDTDHLDEHQMTL-GEDKLVEEERF

Query:  GSMKSGWCTATDVSHKGELGISCYTMQDCTQDGEMHEVVIIEKAKDGSELPIEPKGSSISPAEGNQHDIITDDPECDFGHDNHVNEMNNMSHSGFLPKTV
        G  +S  CTATD  H+ E GI C+TM    QD EMHE++ +EK  D S LP EPK S+ S AEGN H+   DD +CD GHDNHVN +N MSHS F P+TV
Subjt:  GSMKSGWCTATDVSHKGELGISCYTMQDCTQDGEMHEVVIIEKAKDGSELPIEPKGSSISPAEGNQHDIITDDPECDFGHDNHVNEMNNMSHSGFLPKTV

Query:  ATNIDVGTNLDEEEKDMLSDSDSDRYHNERIDIAMKKKEFFSSQCMVDHDSFLLADRRELTVCVKCNGGGHLLSCDTSDCPLVVHDKCLGSLTRMNDEGD
        AT+IDVG N DEEEKDMLSDSD   + N+ IDIAMKK EFFSSQC+VDHDSF LADR+ELTVCVKCN GG LLSC+ SDC LVVHDKCLG   RMNDEGD
Subjt:  ATNIDVGTNLDEEEKDMLSDSDSDRYHNERIDIAMKKKEFFSSQCMVDHDSFLLADRRELTVCVKCNGGGHLLSCDTSDCPLVVHDKCLGSLTRMNDEGD

Query:  FYCPFCSYSLAISEYLEAKKHAALAKKNVATLFDTALGHQSIVIKEVLQQKDLDPSRKAGVEDVAKIFEDVDLENKHNQVTLDGKQVNEVVDQSS---TD
        F CPFC YSLAISEYLEAK++AALAKKNVAT     L HQSI IKEVLQ+KDL PSRKAGVEDVAKI EDV+LENK NQVTLDG+ VNEV+D  S   TD
Subjt:  FYCPFCSYSLAISEYLEAKKHAALAKKNVATLFDTALGHQSIVIKEVLQQKDLDPSRKAGVEDVAKIFEDVDLENKHNQVTLDGKQVNEVVDQSS---TD

Query:  IEPIIELSNPLHITNSNHRENDASPLRVAPDVLSGEKNGDEL-------------VDQEHQGNTVAELVDQERRDNAVAELVDQERQGNAVTELVDQKCQ
        IE   +LS PL I+NSNHREN+A+PLRVAPDVL+GEK+GDEL             VDQE +GNTV ELVDQE + N V ELVDQE QGN V ELVDQ+CQ
Subjt:  IEPIIELSNPLHITNSNHRENDASPLRVAPDVLSGEKNGDEL-------------VDQEHQGNTVAELVDQERRDNAVAELVDQERQGNAVTELVDQKCQ

Query:  GNVAELEDGQKATEQHDISVSDILHEDQGPVEATTMREGLQYQTDDN-EELAYAINIEGEKSSDDENDESIISRYSIRFRRKYHHTCPETHQFRRKKLPW
        GNVAELEDG KATEQ+DI   + +HEDQGPVEA T +EGL+YQTDDN EE  YAINIEGEKSSDDENDESIISRYSIRFR++YH TCPET Q RRKKLPW
Subjt:  GNVAELEDGQKATEQHDISVSDILHEDQGPVEATTMREGLQYQTDDN-EELAYAINIEGEKSSDDENDESIISRYSIRFRRKYHHTCPETHQFRRKKLPW

Query:  TAEEEETLREGVRKFSSSVERSPNIPWKKILEFGSNVFLQDRTSIDLKDKWRKLCRSPKMK
        TAEEEETLREGVRKFSSSV+RSP IPWKKILEFGS VFL+ RTSIDLKDKWR LCRSPK K
Subjt:  TAEEEETLREGVRKFSSSVERSPNIPWKKILEFGSNVFLQDRTSIDLKDKWRKLCRSPKMK

XP_022154231.1 uncharacterized protein LOC111021537 isoform X3 [Momordica charantia]6.9e-30053.83Show/hide
Query:  SALAWHWTIEALASFKEIKPSLLH-----------GDEKERGEMVALKCLEGLFDPLNYIGENGPPTQEPKVTFDSSRSCEHVVKRICKETPKSALRVAG
        S LAW  TIEALASF ++KPSLLH              K  GEMVAL+CLEGLF  LN + E+ PP  + KVTFDSS SCE+VVKRI KETPKSALRVAG
Subjt:  SALAWHWTIEALASFKEIKPSLLH-----------GDEKERGEMVALKCLEGLFDPLNYIGENGPPTQEPKVTFDSSRSCEHVVKRICKETPKSALRVAG

Query:  PDMLKWDVRPFIDQRRASMRCTLHQLKDTILDGTHPYADFLMKKSGLTPINKRDSISPNNEDGIELSRRLDSSFPDPPGQKEKGKG--------------
        P+MLKWDV PFI Q+RASMR TLHQLKDTILDGTHPY DFL  KSGL P+NKRD I+ NN+D  ELSRRLDSS PD  GQKEKGKG              
Subjt:  PDMLKWDVRPFIDQRRASMRCTLHQLKDTILDGTHPYADFLMKKSGLTPINKRDSISPNNEDGIELSRRLDSSFPDPPGQKEKGKG--------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------------------------------SPLLPEDERIISVV
                                                                                              SPLL EDER  SV 
Subjt:  --------------------------------------------------------------------------------------SPLLPEDERIISVV

Query:  SQSSSSLLPSKR----------------------------------------------------------------------------------------
          SSSSLLPSKR                                                                                        
Subjt:  SQSSSSLLPSKR----------------------------------------------------------------------------------------

Query:  ------------SAVEFTSEDEARQLPGCDDGYINIKKLEQHSAHTLYSGQKVASSHGIKVLEDLPERIVPQNERDDTDHLDEHQMTL-GEDKLVEEERF
                    S V+  SEDEA QLPGCDDGY+N+ KL QHSA TLYSGQ+VASSHG ++LED  ER+VPQNE DD + LDE  MTL  EDKL +EE F
Subjt:  ------------SAVEFTSEDEARQLPGCDDGYINIKKLEQHSAHTLYSGQKVASSHGIKVLEDLPERIVPQNERDDTDHLDEHQMTL-GEDKLVEEERF

Query:  GSMKSGWCTATDVSHKGELGISCYTMQDCTQDGEMHEVVIIEKAKDGSELPIEPKGSSISPAEGNQHDIITDDPECDFGHDNHVNEMNNMSHSGFLPKTV
        G  +S  CTATD  H+ E GI C+TM    QD EMHE++ +EK  D S LP EPK S+ S AEGN H+   DD +CD GHDNHVN +N MSHS F P+TV
Subjt:  GSMKSGWCTATDVSHKGELGISCYTMQDCTQDGEMHEVVIIEKAKDGSELPIEPKGSSISPAEGNQHDIITDDPECDFGHDNHVNEMNNMSHSGFLPKTV

Query:  ATNIDVGTNLDEEEKDMLSDSDSDRYHNERIDIAMKKKEFFSSQCMVDHDSFLLADRRELTVCVKCNGGGHLLSCDTSDCPLVVHDKCLGSLTRMNDEGD
        AT+IDVG N DEEEKDMLSDSD   + N+ IDIAMKK EFFSSQC+VDHDSF LADR+ELTVCVKCN GG LLSC+ SDC LVVHDKCLG   RMNDEGD
Subjt:  ATNIDVGTNLDEEEKDMLSDSDSDRYHNERIDIAMKKKEFFSSQCMVDHDSFLLADRRELTVCVKCNGGGHLLSCDTSDCPLVVHDKCLGSLTRMNDEGD

Query:  FYCPFCSYSLAISEYLEAKKHAALAKKNVATLFDTALGHQSIVIKEVLQQKDLDPSRKAGVEDVAKIFEDVDLENKHNQVTLDGKQVNEVVDQSS---TD
        F CPFC YSLAISEYLEAK++AALAKKNVAT     L HQSI IKEVLQ+KDL PSRKAGVEDVAKI EDV+LENK NQVTLDG+ VNEV+D  S   TD
Subjt:  FYCPFCSYSLAISEYLEAKKHAALAKKNVATLFDTALGHQSIVIKEVLQQKDLDPSRKAGVEDVAKIFEDVDLENKHNQVTLDGKQVNEVVDQSS---TD

Query:  IEPIIELSNPLHITNSNHRENDASPLRVAPDVLSGEKNGDELVDQEHQGNTVAELVDQERRDNAVAELVDQERQGNAVTELVDQKCQGNVAELEDGQKAT
        IE   +LS PL I+NSNHREN+A+PLRVAPDVL+GEK+GDELVDQE +GNTV ELVDQE + N V ELVDQE QGN V ELVDQ+CQGNVAELEDG KAT
Subjt:  IEPIIELSNPLHITNSNHRENDASPLRVAPDVLSGEKNGDELVDQEHQGNTVAELVDQERRDNAVAELVDQERQGNAVTELVDQKCQGNVAELEDGQKAT

Query:  EQHDISVSDILHEDQGPVEATTMREGLQYQTDDN-EELAYAINIEGEKSSDDENDESIISRYSIRFRRKYHHTCPETHQFRRKKLPWTAEEEETLREGVR
        EQ+DI   + +HEDQGPVEA T +EGL+YQTDDN EE  YAINIEGEKSSDDENDESIISRYSIRFR++YH TCPET Q RRKKLPWTAEEEETLREGVR
Subjt:  EQHDISVSDILHEDQGPVEATTMREGLQYQTDDN-EELAYAINIEGEKSSDDENDESIISRYSIRFRRKYHHTCPETHQFRRKKLPWTAEEEETLREGVR

Query:  KFSSSVERSPNIPWKKILEFGSNVFLQDRTSIDLKDKWRKLCRSPKMK
        KFSSSV+RSP IPWKKILEFGS VFL+ RTSIDLKDKWR LCRSPK K
Subjt:  KFSSSVERSPNIPWKKILEFGSNVFLQDRTSIDLKDKWRKLCRSPKMK

XP_023527258.1 uncharacterized protein LOC111790548 isoform X1 [Cucurbita pepo subsp. pepo]1.8e-28468.14Show/hide
Query:  SALAWHWTIEALASFKEIKPSLLH-----------GDEKERGEMVALKCLEGLFDPLNYIGENGPPTQEPKVTFDSSRSCEHVVKRICKETPKSALRVAG
        S+LAW WTIEALASF+E+KPSLLH           G  K  GEMVALKCLEGLF  LNYIGEN  P QE KV FDSS SCE V KRI KETPKS+LRVAG
Subjt:  SALAWHWTIEALASFKEIKPSLLH-----------GDEKERGEMVALKCLEGLFDPLNYIGENGPPTQEPKVTFDSSRSCEHVVKRICKETPKSALRVAG

Query:  PDMLKWDVRPFIDQRRASMRCTLHQLKDTILDGTHPYADFLMKKSGLTPINKRDSISPNNEDGIELSRRLDSSFPDPPGQKEKGKGSPLLPEDERIISVV
        PD+LKWDVR F DQ+RASMRCTLH+LKD ILDGTHPYADFLM+KSGLTPINKRD+I  NNED IELS RLD S   P GQ EKGKGSPLL ED+R ISVV
Subjt:  PDMLKWDVRPFIDQRRASMRCTLHQLKDTILDGTHPYADFLMKKSGLTPINKRDSISPNNEDGIELSRRLDSSFPDPPGQKEKGKGSPLLPEDERIISVV

Query:  SQSSSSLLPSKRSAVEFTSEDEARQLPGCDDGYINIKKLEQHSAHTLYSGQKVASSHGIKVLEDLPERIVPQNERDDTDHLDEHQMTLGEDKLVEEERFG
        +  SSSLLPSKRS V+FTSEDEARQLPG DDGYIN+KKL+QHSAHT +SGQ+VASSH  +VLED PER VPQNERDDTD LDEHQ+T  +D+LVE+  FG
Subjt:  SQSSSSLLPSKRSAVEFTSEDEARQLPGCDDGYINIKKLEQHSAHTLYSGQKVASSHGIKVLEDLPERIVPQNERDDTDHLDEHQMTLGEDKLVEEERFG

Query:  SMKSGWCTATDVSHKGELGISCYTMQDCTQDGEMHEVVIIEKAKDGSELPIEPKGSSISPAEGNQHDIITDDPECDFGHDNHVNEMNNMSHSGFLPKTVA
        S K         SH+ + GI CYTM   T+D EM EVV +EK KDGSELP EPK S+ SPAEGN H+   D+ +CD GHD  VNE N MS SGF+ KTVA
Subjt:  SMKSGWCTATDVSHKGELGISCYTMQDCTQDGEMHEVVIIEKAKDGSELPIEPKGSSISPAEGNQHDIITDDPECDFGHDNHVNEMNNMSHSGFLPKTVA

Query:  TNIDVGTNLDEEEKDMLSDSDSDRYHNERIDIAMKKKEFFSSQCMVDHDSFLLADRRELTVCVKCNGGGHLLSCDTSDCPLVVHDKCLGSLTRMNDEGDF
        TN++VG   D +EKD+LSDSD   YH E IDIA +KKEF SSQCMVDHDSF LAD R L VCVKCN GG LL C+ SDCPLVVH KCL S   M DEGDF
Subjt:  TNIDVGTNLDEEEKDMLSDSDSDRYHNERIDIAMKKKEFFSSQCMVDHDSFLLADRRELTVCVKCNGGGHLLSCDTSDCPLVVHDKCLGSLTRMNDEGDF

Query:  YCPFCSYSLAISEYLEAKKHAALAKKNVATLFDTALGHQSIVIKEVLQQKDLDPSRKAGVEDVAKIFEDVDLENKHNQVTLDGKQVNEVVDQSS---TDI
         CPFC YSLAISEYLEAKKH A  KKNVA+ F TALGHQS V++EVLQQKD+D S++A VEDVAKI EDVDLE+K NQV+LDG++VNEVVD  S   TD 
Subjt:  YCPFCSYSLAISEYLEAKKHAALAKKNVATLFDTALGHQSIVIKEVLQQKDLDPSRKAGVEDVAKIFEDVDLENKHNQVTLDGKQVNEVVDQSS---TDI

Query:  EPIIELSNPLHITNSNHRENDASPLRVAPDVLSGEKNGDELVDQEHQGNTVAELVDQERRDNAVAELVDQERQGNAVTELVDQKCQGNVAELEDGQKATE
        E + ELS PLHI NSNHR+N ASP RVA D L G++NG ELVDQE QGNTVA                           +VDQKC+GNVAE EDGQK TE
Subjt:  EPIIELSNPLHITNSNHRENDASPLRVAPDVLSGEKNGDELVDQEHQGNTVAELVDQERRDNAVAELVDQERQGNAVTELVDQKCQGNVAELEDGQKATE

Query:  QHDISVSDILHEDQGPVEATTMREGLQYQTDDNE-ELAYAINIEGEKSSDDENDESIISRYSIRFRRKYHHTCPETHQFRRKKLPWTAEEEETL
        QHDI   +ILHE +GPVE    + GLQYQTDD+E + A AI  EGEKSSDD NDESIISRYSIRFR+K HHT PETH  RRKKL WTAEEEET+
Subjt:  QHDISVSDILHEDQGPVEATTMREGLQYQTDDNE-ELAYAINIEGEKSSDDENDESIISRYSIRFRRKYHHTCPETHQFRRKKLPWTAEEEETL

XP_023527259.1 uncharacterized protein LOC111790548 isoform X2 [Cucurbita pepo subsp. pepo]2.8e-28568.74Show/hide
Query:  SALAWHWTIEALASFKEIKPSLLH----GDEKERGEMVALKCLEGLFDPLNYIGENGPPTQEPKVTFDSSRSCEHVVKRICKETPKSALRVAGPDMLKWD
        S+LAW WTIEALASF+E+KPSLLH    G  K  GEMVALKCLEGLF  LNYIGEN  P QE KV FDSS SCE V KRI KETPKS+LRVAGPD+LKWD
Subjt:  SALAWHWTIEALASFKEIKPSLLH----GDEKERGEMVALKCLEGLFDPLNYIGENGPPTQEPKVTFDSSRSCEHVVKRICKETPKSALRVAGPDMLKWD

Query:  VRPFIDQRRASMRCTLHQLKDTILDGTHPYADFLMKKSGLTPINKRDSISPNNEDGIELSRRLDSSFPDPPGQKEKGKGSPLLPEDERIISVVSQSSSSL
        VR F DQ+RASMRCTLH+LKD ILDGTHPYADFLM+KSGLTPINKRD+I  NNED IELS RLD S   P GQ EKGKGSPLL ED+R ISVV+  SSSL
Subjt:  VRPFIDQRRASMRCTLHQLKDTILDGTHPYADFLMKKSGLTPINKRDSISPNNEDGIELSRRLDSSFPDPPGQKEKGKGSPLLPEDERIISVVSQSSSSL

Query:  LPSKRSAVEFTSEDEARQLPGCDDGYINIKKLEQHSAHTLYSGQKVASSHGIKVLEDLPERIVPQNERDDTDHLDEHQMTLGEDKLVEEERFGSMKSGWC
        LPSKRS V+FTSEDEARQLPG DDGYIN+KKL+QHSAHT +SGQ+VASSH  +VLED PER VPQNERDDTD LDEHQ+T  +D+LVE+  FGS K    
Subjt:  LPSKRSAVEFTSEDEARQLPGCDDGYINIKKLEQHSAHTLYSGQKVASSHGIKVLEDLPERIVPQNERDDTDHLDEHQMTLGEDKLVEEERFGSMKSGWC

Query:  TATDVSHKGELGISCYTMQDCTQDGEMHEVVIIEKAKDGSELPIEPKGSSISPAEGNQHDIITDDPECDFGHDNHVNEMNNMSHSGFLPKTVATNIDVGT
             SH+ + GI CYTM   T+D EM EVV +EK KDGSELP EPK S+ SPAEGN H+   D+ +CD GHD  VNE N MS SGF+ KTVATN++VG 
Subjt:  TATDVSHKGELGISCYTMQDCTQDGEMHEVVIIEKAKDGSELPIEPKGSSISPAEGNQHDIITDDPECDFGHDNHVNEMNNMSHSGFLPKTVATNIDVGT

Query:  NLDEEEKDMLSDSDSDRYHNERIDIAMKKKEFFSSQCMVDHDSFLLADRRELTVCVKCNGGGHLLSCDTSDCPLVVHDKCLGSLTRMNDEGDFYCPFCSY
          D +EKD+LSDSD   YH E IDIA +KKEF SSQCMVDHDSF LAD R L VCVKCN GG LL C+ SDCPLVVH KCL S   M DEGDF CPFC Y
Subjt:  NLDEEEKDMLSDSDSDRYHNERIDIAMKKKEFFSSQCMVDHDSFLLADRRELTVCVKCNGGGHLLSCDTSDCPLVVHDKCLGSLTRMNDEGDFYCPFCSY

Query:  SLAISEYLEAKKHAALAKKNVATLFDTALGHQSIVIKEVLQQKDLDPSRKAGVEDVAKIFEDVDLENKHNQVTLDGKQVNEVVDQSS---TDIEPIIELS
        SLAISEYLEAKKH A  KKNVA+ F TALGHQS V++EVLQQKD+D S++A VEDVAKI EDVDLE+K NQV+LDG++VNEVVD  S   TD E + ELS
Subjt:  SLAISEYLEAKKHAALAKKNVATLFDTALGHQSIVIKEVLQQKDLDPSRKAGVEDVAKIFEDVDLENKHNQVTLDGKQVNEVVDQSS---TDIEPIIELS

Query:  NPLHITNSNHRENDASPLRVAPDVLSGEKNGDELVDQEHQGNTVAELVDQERRDNAVAELVDQERQGNAVTELVDQKCQGNVAELEDGQKATEQHDISVS
         PLHI NSNHR+N ASP RVA D L G++NG ELVDQE QGNTVA                           +VDQKC+GNVAE EDGQK TEQHDI   
Subjt:  NPLHITNSNHRENDASPLRVAPDVLSGEKNGDELVDQEHQGNTVAELVDQERRDNAVAELVDQERQGNAVTELVDQKCQGNVAELEDGQKATEQHDISVS

Query:  DILHEDQGPVEATTMREGLQYQTDDNE-ELAYAINIEGEKSSDDENDESIISRYSIRFRRKYHHTCPETHQFRRKKLPWTAEEEETL
        +ILHE +GPVE    + GLQYQTDD+E + A AI  EGEKSSDD NDESIISRYSIRFR+K HHT PETH  RRKKL WTAEEEET+
Subjt:  DILHEDQGPVEATTMREGLQYQTDDNE-ELAYAINIEGEKSSDDENDESIISRYSIRFRRKYHHTCPETHQFRRKKLPWTAEEEETL

XP_038904579.1 uncharacterized protein LOC120090944 [Benincasa hispida]0.0e+0070.76Show/hide
Query:  SALAWHWTIEALASFKEIKPSLLH-----------GDEKERGEMVALKCLEGLFDPLNYIGENGPPTQEPKVTFDSSRSCEHVVKRICKETPKSALRVAG
        SALAW WTIEALA F+E+KPSLLH           G  K  GEMVALKCLEGLF PLN IGENGPP QE KV FDSS SCE+VVKRI KETP+SALRVAG
Subjt:  SALAWHWTIEALASFKEIKPSLLH-----------GDEKERGEMVALKCLEGLFDPLNYIGENGPPTQEPKVTFDSSRSCEHVVKRICKETPKSALRVAG

Query:  PDMLKWDVRPFIDQRRASMRCTLHQLKDTILDGTHPYADFLMKKSGLTPINKRDSISPNNEDGIELSRRLDSSFPDPPGQKEKGKGSPLLPEDERIISVV
        PDMLKWDV+PFIDQ+ ASMRCTLHQLKD+ILDGTHPYADFL++KSGLTPINKRD+IS NNED I+LSRRLD S   P G+KE+GKGSPLL +DE  IS+V
Subjt:  PDMLKWDVRPFIDQRRASMRCTLHQLKDTILDGTHPYADFLMKKSGLTPINKRDSISPNNEDGIELSRRLDSSFPDPPGQKEKGKGSPLLPEDERIISVV

Query:  SQSSSSLLPSKRSAVEFTSEDEARQLPGCDDGYINIKKLEQHSAHTLYSGQKVASSHGIKVLEDLPERIVPQNERDDTDHLDEHQMTLGEDKLVEEERFG
        + SSSSLLPSKRSAV+FTSEDEARQLPGCDDG+IN+KKL+ HSA TLYSGQ+VASSHG +++ED  ER  PQ ERDDT+HLD HQ+TL  DKLVEEE FG
Subjt:  SQSSSSLLPSKRSAVEFTSEDEARQLPGCDDGYINIKKLEQHSAHTLYSGQKVASSHGIKVLEDLPERIVPQNERDDTDHLDEHQMTLGEDKLVEEERFG

Query:  SMKSGWCTATDVSHKGELGISCYTMQDCTQDGEMHEVVIIEKAKDGSELPIEPKGSSISPAEGNQHDIITDDPECDFGHDNHVNEMNNMSHSGFLPKTVA
        S KSG CTATD  H+ ++    YT+   TQDGEM EVV  EK  DG ELP EPK S+ SPAEG       ++ +CD GHD HVNEM  +SHSGFL  TVA
Subjt:  SMKSGWCTATDVSHKGELGISCYTMQDCTQDGEMHEVVIIEKAKDGSELPIEPKGSSISPAEGNQHDIITDDPECDFGHDNHVNEMNNMSHSGFLPKTVA

Query:  TNIDVGTNLDEEEKDMLSDSDSDRYHNERIDIAMKKKEFFSSQCMVDHDSFLLADRRELTVCVKCNGGGHLLSCDTSDCPLVVHDKCLGSLTRMNDEGDF
        TNIDVG N DE+EKD+LSDSD   YH E IDIAM+KKEF SSQCMVD DSFLLADRRE+TVCVKCN GG LLSC+ SDCPLVVH KCLGS  RMNDEG+F
Subjt:  TNIDVGTNLDEEEKDMLSDSDSDRYHNERIDIAMKKKEFFSSQCMVDHDSFLLADRRELTVCVKCNGGGHLLSCDTSDCPLVVHDKCLGSLTRMNDEGDF

Query:  YCPFCSYSLAISEYLEAKKHAALAKKNVATLFDT-ALGHQSIVIKEVLQQKDLDPSRKAGVEDVAKIFEDVDLENKHNQVTLDGKQVNEVVD-QSSTDIE
         CPFC YSLAIS+YLEAKKHAALAKKNVA      AL  QSI I+EVLQQKDLDPSR+AGVEDVAKI EDVDLENK N+VTLDG+ VNE VD QS TD E
Subjt:  YCPFCSYSLAISEYLEAKKHAALAKKNVATLFDT-ALGHQSIVIKEVLQQKDLDPSRKAGVEDVAKIFEDVDLENKHNQVTLDGKQVNEVVD-QSSTDIE

Query:  PIIELSNPLHITNSNHRENDASPLRVAPDVLSGEKNGDELVDQEHQGNTVAELVDQERRDNAVAELVDQERQG---------------------------
         IIELS P+H  NSNHREN++S LRVAPDVLSGEK+ +ELVD+E  GNT AELVDQE   N VAELVDQE QG                           
Subjt:  PIIELSNPLHITNSNHRENDASPLRVAPDVLSGEKNGDELVDQEHQGNTVAELVDQERRDNAVAELVDQERQG---------------------------

Query:  -----NAVTELVDQKCQGNVAELEDGQKATEQHDISVSDILHEDQGPVEATTMREGLQYQTDDNE-ELAYAINIEGEKSSDDENDESIISRYSIRFRRKY
             N V ELVDQ+CQGNVAEL+DGQKATEQH+I    ILH+D+GP+E   M + LQYQTDDNE E A AI  EGEKSSDD ND+SIISRYSIRFR+KY
Subjt:  -----NAVTELVDQKCQGNVAELEDGQKATEQHDISVSDILHEDQGPVEATTMREGLQYQTDDNE-ELAYAINIEGEKSSDDENDESIISRYSIRFRRKY

Query:  HHTCPETHQFRRKKLPWTAEEEETLREGVRKFSSSVERSPNIPWKKILEFGSNVF
        HHT  ETH  RRKKLPWTAEEEE + EGVRKFSSSV+RSP IPWKKILEFGS+VF
Subjt:  HHTCPETHQFRRKKLPWTAEEEETLREGVRKFSSSVERSPNIPWKKILEFGSNVF

TrEMBL top hitse value%identityAlignment
A0A1S3B7A1 uncharacterized protein LOC1034868085.0e-27263.2Show/hide
Query:  SALAWHWTIEALASFKEIKPSLLH-----------GDEKERGEMVALKCLEGLFDPLNYIGENGPPTQEPKVTFDSSRSCEHVVKRICKETPKSALRVAG
        SALAW WTIEALASF ++KPSLLH           G     GEMVAL+CLEGLF PL+ IGENG P QE KV FDSS SC  VVKRI  ETP+SAL VAG
Subjt:  SALAWHWTIEALASFKEIKPSLLH-----------GDEKERGEMVALKCLEGLFDPLNYIGENGPPTQEPKVTFDSSRSCEHVVKRICKETPKSALRVAG

Query:  PDMLKWDVRPFIDQRRASMRCTLHQLKDTILDGTHPYADFLMKKSGLTPINKRDSISPNNEDGIELSRRLDSSFPDPPGQKEKGKGSPLLPEDERIISVV
        PDM KWDV+PFIDQ+RASMRCTL QLKD+ILDGTHPYA+FLM KSGLTPINKRD  S NNED +EL RRLD+S   P G+KE GKGSPLL EDER +SVV
Subjt:  PDMLKWDVRPFIDQRRASMRCTLHQLKDTILDGTHPYADFLMKKSGLTPINKRDSISPNNEDGIELSRRLDSSFPDPPGQKEKGKGSPLLPEDERIISVV

Query:  SQSSSSLLPSKRSAVEFTSEDEARQLPGCDDGYINIKKLEQHSAHTLYSGQKVASSHGIKVLED-LPERIVPQNERDDTDHLDEHQMTLGEDKLVEEERF
          SSSSLLP+KRS++EFTSEDEA QLPGCDDG+IN+KKL+ HSAH LYSGQ+VASSHG +V+E+   ER  PQ ERDDT+HLD HQ+TL EDKLVEEE F
Subjt:  SQSSSSLLPSKRSAVEFTSEDEARQLPGCDDGYINIKKLEQHSAHTLYSGQKVASSHGIKVLED-LPERIVPQNERDDTDHLDEHQMTLGEDKLVEEERF

Query:  GSMKSG-------------------------------------------------WCTATDVSHKGELGISCYTMQDCTQDGEMHEVVIIEKAKDGSELP
        GS KSG                                                  CTATD  H GE GI CYT+   TQDGE  EVV  EK  DGSELP
Subjt:  GSMKSG-------------------------------------------------WCTATDVSHKGELGISCYTMQDCTQDGEMHEVVIIEKAKDGSELP

Query:  IEPKGSSISPAEGNQHDIITDDPECDFGHDNHVNEMNNMSHSGFLPKTVATNIDVGTNLDEEEKDMLSDSDSDRYHNERIDIAMKKKEFFSSQCMVDHDS
         EPK  + SPAEGN  +   ++ + DFGHD+HVNEMN +SHSGF+  TVAT+ DVG   DEEEKDML  SD+D YH E +DIAM+KKEF SSQCMVD DS
Subjt:  IEPKGSSISPAEGNQHDIITDDPECDFGHDNHVNEMNNMSHSGFLPKTVATNIDVGTNLDEEEKDMLSDSDSDRYHNERIDIAMKKKEFFSSQCMVDHDS

Query:  FLLADRRELTVCVKCNGGGHLLSCDTSDCPLVVHDKCLGSLTRMNDEGDFYCPFCSYSLAISEYLEAKKHAALAKKNVATLFDTALGHQSIVIKEVLQQK
        FL+ADR ELTVCVKCN GG LLSC++ DCPLVVH KCLGSL  MNDE DF CPFC YS AISEYLEAKKHAALAKKNV + F +AL H SI  K VLQ K
Subjt:  FLLADRRELTVCVKCNGGGHLLSCDTSDCPLVVHDKCLGSLTRMNDEGDFYCPFCSYSLAISEYLEAKKHAALAKKNVATLFDTALGHQSIVIKEVLQQK

Query:  DLDPSRKAGVEDVAKIFEDVDLENKHNQVTLDGKQVNEVVDQSS--------------TDIEPIIELSNPLHITNSNHRENDASPLRVAPDVLSGEKNGD
        DLDPSR+AGVEDVAKI EDVD+ENK NQVT+DG+ VNEVVD  S              TD E  I LS  ++I N+NHREN++S LRVAPDVLS EK+ +
Subjt:  DLDPSRKAGVEDVAKIFEDVDLENKHNQVTLDGKQVNEVVDQSS--------------TDIEPIIELSNPLHITNSNHRENDASPLRVAPDVLSGEKNGD

Query:  ELVDQEHQGNTVAELVDQERRDNAVAELVDQERQGNAVTELVDQKCQGNVAELEDGQKATEQHDISVSDILHEDQGPVEATTMREGLQYQTDDNE-ELAY
        E VDQE   NT AELVDQE   N   ELVDQE QGNA             A+LEDG+ +T+QH   + +ILHED+ PVE   + E LQYQT+DNE E A 
Subjt:  ELVDQEHQGNTVAELVDQERRDNAVAELVDQERQGNAVTELVDQKCQGNVAELEDGQKATEQHDISVSDILHEDQGPVEATTMREGLQYQTDDNE-ELAY

Query:  AINIEGEKSSDDENDESIISRYSIRFRRKYHHTCPETHQFRRKKL
        AI  E EKSSDD NDESIISRYSIRFR+KYHHT  ETH   RKKL
Subjt:  AINIEGEKSSDDENDESIISRYSIRFRRKYHHTCPETHQFRRKKL

A0A5A7TK87 PHD domain-containing protein2.2e-26763.18Show/hide
Query:  SALAWHWTIEALASFKEIKPSLLH-----------GDEKERGEMVALKCLEGLFDPLNYIGENGPPTQEPKVTFDSSRSCEHVVKRICKETPKSALRVAG
        SALAW WTIEALASF ++KPSLLH           G     GEMVAL+CLEGLF PL+ IGENG P QE KV FDSS SC  VVKRI  ETP+SAL VAG
Subjt:  SALAWHWTIEALASFKEIKPSLLH-----------GDEKERGEMVALKCLEGLFDPLNYIGENGPPTQEPKVTFDSSRSCEHVVKRICKETPKSALRVAG

Query:  PDMLKWDVRPFIDQRRASMRCTLHQLKDTILDGTHPYADFLMKKSGLTPINKRDSISPNNEDGIELSRRLDSSFPDPPGQKEKGKGSPLLPEDERIISVV
        PDM KWDV+PFIDQ+RASMRCTL QLKD+ILDGTHPYA+FLM KSGLTPINKRD  S NNED +EL RRLD+S   P G+KE GKGSPLL EDER +SVV
Subjt:  PDMLKWDVRPFIDQRRASMRCTLHQLKDTILDGTHPYADFLMKKSGLTPINKRDSISPNNEDGIELSRRLDSSFPDPPGQKEKGKGSPLLPEDERIISVV

Query:  SQSSSSLLPSKRSAVEFTSEDEARQLPGCDDGYINIKKLEQHSAHTLYSGQKVASSHGIKVLED-LPERIVPQNERDDTDHLDEHQMTLGEDKLVEEERF
          SSSSLLP+KRS++EFTSEDEA QLPGCDDG+IN+KKL+ HSAH LYSGQ+VASSHG +V+E+   ER  PQ ERDDT+HLD HQ+TL EDKLVEEE F
Subjt:  SQSSSSLLPSKRSAVEFTSEDEARQLPGCDDGYINIKKLEQHSAHTLYSGQKVASSHGIKVLED-LPERIVPQNERDDTDHLDEHQMTLGEDKLVEEERF

Query:  GSMKSG-------------------------------------------------WCTATDVSHKGELGISCYTMQDCTQDGEMHEVVIIEKAKDGSELP
        GS KSG                                                  CTATD  H GE GI CYT+   TQDGE  EVV  EK  DGSELP
Subjt:  GSMKSG-------------------------------------------------WCTATDVSHKGELGISCYTMQDCTQDGEMHEVVIIEKAKDGSELP

Query:  IEPKGSSISPAEGNQHDIITDDPECDFGHDNHVNEMNNMSHSGFLPKTVATNIDVGTNLDEEEKDMLSDSDSDRYHNERIDIAMKKKEFFSSQCMVDHDS
         EPK  + SPAEGN  +   ++ + DFGHD+HVNEMN +SHSGF+  TVAT+ DVG   DEEEKDML  SD+D YH E +DIAM+KKEF SSQCMVD DS
Subjt:  IEPKGSSISPAEGNQHDIITDDPECDFGHDNHVNEMNNMSHSGFLPKTVATNIDVGTNLDEEEKDMLSDSDSDRYHNERIDIAMKKKEFFSSQCMVDHDS

Query:  FLLADRRELTVCVKCNGGGHLLSCDTSDCPLVVHDKCLGSLTRMNDEGDFYCPFCSYSLAISEYLEAKKHAALAKKNVATLFDTALGHQSIVIKEVLQQK
        FL+ADR ELTVCVKCN GG LLSC++ DCPLVVH KCLGSL  MNDE DF CPFC YS AISEYLEAKKHAALAKKNV + F +AL H SI  K VLQ K
Subjt:  FLLADRRELTVCVKCNGGGHLLSCDTSDCPLVVHDKCLGSLTRMNDEGDFYCPFCSYSLAISEYLEAKKHAALAKKNVATLFDTALGHQSIVIKEVLQQK

Query:  DLDPSRKAGVEDVAKIFEDVDLENKHNQVTLDGKQVNEVVDQSS--------------TDIEPIIELSNPLHITNSNHRENDASPLRVAPDVLSGEKNGD
        DLDPSR+AGVEDVAKI EDVD+ENK NQVT+DG+ VNEVVD  S              TD E  I LS  ++I N+NHREN++S LRVAPDVLS EK+ +
Subjt:  DLDPSRKAGVEDVAKIFEDVDLENKHNQVTLDGKQVNEVVDQSS--------------TDIEPIIELSNPLHITNSNHRENDASPLRVAPDVLSGEKNGD

Query:  ELVDQEHQGNTVAELVDQERRDNAVAELVDQERQGNAVTELVDQKCQGNVAELEDGQKATEQHDISVSDILHEDQGPVEATTMREGLQYQTDDNE-ELAY
        E VDQE   NT AELVDQE   N   ELVDQE QGNA             A+LEDG+ +T+QH   + +ILHED+ PVE   + E LQYQT+DNE E A 
Subjt:  ELVDQEHQGNTVAELVDQERRDNAVAELVDQERQGNAVTELVDQKCQGNVAELEDGQKATEQHDISVSDILHEDQGPVEATTMREGLQYQTDDNE-ELAY

Query:  AINIEGEKSSDDENDESIISRYSIRFRRKYH
        AI  E EKSSDD NDESIISRYSIRFR+KYH
Subjt:  AINIEGEKSSDDENDESIISRYSIRFRRKYH

A0A6J1DLF4 uncharacterized protein LOC111021537 isoform X12.0e-29753.23Show/hide
Query:  SALAWHWTIEALASFKEIKPSLLH-----------GDEKERGEMVALKCLEGLFDPLNYIGENGPPTQEPKVTFDSSRSCEHVVKRICKETPKSALRVAG
        S LAW  TIEALASF ++KPSLLH              K  GEMVAL+CLEGLF  LN + E+ PP  + KVTFDSS SCE+VVKRI KETPKSALRVAG
Subjt:  SALAWHWTIEALASFKEIKPSLLH-----------GDEKERGEMVALKCLEGLFDPLNYIGENGPPTQEPKVTFDSSRSCEHVVKRICKETPKSALRVAG

Query:  PDMLKWDVRPFIDQRRASMRCTLHQLKDTILDGTHPYADFLMKKSGLTPINKRDSISPNNEDGIELSRRLDSSFPDPPGQKEKGKG--------------
        P+MLKWDV PFI Q+RASMR TLHQLKDTILDGTHPY DFL  KSGL P+NKRD I+ NN+D  ELSRRLDSS PD  GQKEKGKG              
Subjt:  PDMLKWDVRPFIDQRRASMRCTLHQLKDTILDGTHPYADFLMKKSGLTPINKRDSISPNNEDGIELSRRLDSSFPDPPGQKEKGKG--------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------------------------------SPLLPEDERIISVV
                                                                                              SPLL EDER  SV 
Subjt:  --------------------------------------------------------------------------------------SPLLPEDERIISVV

Query:  SQSSSSLLPSKR----------------------------------------------------------------------------------------
          SSSSLLPSKR                                                                                        
Subjt:  SQSSSSLLPSKR----------------------------------------------------------------------------------------

Query:  ------------SAVEFTSEDEARQLPGCDDGYINIKKLEQHSAHTLYSGQKVASSHGIKVLEDLPERIVPQNERDDTDHLDEHQMTL-GEDKLVEEERF
                    S V+  SEDEA QLPGCDDGY+N+ KL QHSA TLYSGQ+VASSHG ++LED  ER+VPQNE DD + LDE  MTL  EDKL +EE F
Subjt:  ------------SAVEFTSEDEARQLPGCDDGYINIKKLEQHSAHTLYSGQKVASSHGIKVLEDLPERIVPQNERDDTDHLDEHQMTL-GEDKLVEEERF

Query:  GSMKSGWCTATDVSHKGELGISCYTMQDCTQDGEMHEVVIIEKAKDGSELPIEPKGSSISPAEGNQHDIITDDPECDFGHDNHVNEMNNMSHSGFLPKTV
        G  +S  CTATD  H+ E GI C+TM    QD EMHE++ +EK  D S LP EPK S+ S AEGN H+   DD +CD GHDNHVN +N MSHS F P+TV
Subjt:  GSMKSGWCTATDVSHKGELGISCYTMQDCTQDGEMHEVVIIEKAKDGSELPIEPKGSSISPAEGNQHDIITDDPECDFGHDNHVNEMNNMSHSGFLPKTV

Query:  ATNIDVGTNLDEEEKDMLSDSDSDRYHNERIDIAMKKKEFFSSQCMVDHDSFLLADRRELTVCVKCNGGGHLLSCDTSDCPLVVHDKCLGSLTRMNDEGD
        AT+IDVG N DEEEKDMLSDSD   + N+ IDIAMKK EFFSSQC+VDHDSF LADR+ELTVCVKCN GG LLSC+ SDC LVVHDKCLG   RMNDEGD
Subjt:  ATNIDVGTNLDEEEKDMLSDSDSDRYHNERIDIAMKKKEFFSSQCMVDHDSFLLADRRELTVCVKCNGGGHLLSCDTSDCPLVVHDKCLGSLTRMNDEGD

Query:  FYCPFCSYSLAISEYLEAKKHAALAKKNVATLFDTALGHQSIVIKEVLQQKDLDPSRKAGVEDVAKIFEDVDLENKHNQVTLDGKQVNEVVDQSS---TD
        F CPFC YSLAISEYLEAK++AALAKKNVAT     L HQSI IKEVLQ+KDL PSRKAGVEDVAKI EDV+LENK NQVTLDG+ VNEV+D  S   TD
Subjt:  FYCPFCSYSLAISEYLEAKKHAALAKKNVATLFDTALGHQSIVIKEVLQQKDLDPSRKAGVEDVAKIFEDVDLENKHNQVTLDGKQVNEVVDQSS---TD

Query:  IEPIIELSNPLHITNSNHRENDASPLRVAPDVLSGEKNGDEL-------------VDQEHQGNTVAELVDQERRDNAVAELVDQERQGNAVTELVDQKCQ
        IE   +LS PL I+NSNHREN+A+PLRVAPDVL+GEK+GDEL             VDQE +GNTV ELVDQE + N V ELVDQE QGN V ELVDQ+CQ
Subjt:  IEPIIELSNPLHITNSNHRENDASPLRVAPDVLSGEKNGDEL-------------VDQEHQGNTVAELVDQERRDNAVAELVDQERQGNAVTELVDQKCQ

Query:  GNVAELEDGQKATEQHDISVSDILHEDQGPVEATTMREGLQYQTDDN-EELAYAINIEGEKSSDDENDESIISRYSIRFRRKYHHTCPETHQFRRKKLPW
        GNVAELEDG KATEQ+DI   + +HEDQGPVEA T +EGL+YQTDDN EE  YAINIEGEKSSDDENDESIISRYSIRFR++YH TCPET Q RRKKLPW
Subjt:  GNVAELEDGQKATEQHDISVSDILHEDQGPVEATTMREGLQYQTDDN-EELAYAINIEGEKSSDDENDESIISRYSIRFRRKYHHTCPETHQFRRKKLPW

Query:  TAEEEETLREGVRKFSSSVERSPNIPWKKILEFGSNVFLQDRTSIDLKDKWRKLCRSPKMK
        TAEEEETLREGVRKFSSSV+RSP IPWKKILEFGS VFL+ RTSIDLKDKWR LCRSPK K
Subjt:  TAEEEETLREGVRKFSSSVERSPNIPWKKILEFGSNVFLQDRTSIDLKDKWRKLCRSPKMK

A0A6J1DN48 uncharacterized protein LOC111021537 isoform X33.4e-30053.83Show/hide
Query:  SALAWHWTIEALASFKEIKPSLLH-----------GDEKERGEMVALKCLEGLFDPLNYIGENGPPTQEPKVTFDSSRSCEHVVKRICKETPKSALRVAG
        S LAW  TIEALASF ++KPSLLH              K  GEMVAL+CLEGLF  LN + E+ PP  + KVTFDSS SCE+VVKRI KETPKSALRVAG
Subjt:  SALAWHWTIEALASFKEIKPSLLH-----------GDEKERGEMVALKCLEGLFDPLNYIGENGPPTQEPKVTFDSSRSCEHVVKRICKETPKSALRVAG

Query:  PDMLKWDVRPFIDQRRASMRCTLHQLKDTILDGTHPYADFLMKKSGLTPINKRDSISPNNEDGIELSRRLDSSFPDPPGQKEKGKG--------------
        P+MLKWDV PFI Q+RASMR TLHQLKDTILDGTHPY DFL  KSGL P+NKRD I+ NN+D  ELSRRLDSS PD  GQKEKGKG              
Subjt:  PDMLKWDVRPFIDQRRASMRCTLHQLKDTILDGTHPYADFLMKKSGLTPINKRDSISPNNEDGIELSRRLDSSFPDPPGQKEKGKG--------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------------------------------SPLLPEDERIISVV
                                                                                              SPLL EDER  SV 
Subjt:  --------------------------------------------------------------------------------------SPLLPEDERIISVV

Query:  SQSSSSLLPSKR----------------------------------------------------------------------------------------
          SSSSLLPSKR                                                                                        
Subjt:  SQSSSSLLPSKR----------------------------------------------------------------------------------------

Query:  ------------SAVEFTSEDEARQLPGCDDGYINIKKLEQHSAHTLYSGQKVASSHGIKVLEDLPERIVPQNERDDTDHLDEHQMTL-GEDKLVEEERF
                    S V+  SEDEA QLPGCDDGY+N+ KL QHSA TLYSGQ+VASSHG ++LED  ER+VPQNE DD + LDE  MTL  EDKL +EE F
Subjt:  ------------SAVEFTSEDEARQLPGCDDGYINIKKLEQHSAHTLYSGQKVASSHGIKVLEDLPERIVPQNERDDTDHLDEHQMTL-GEDKLVEEERF

Query:  GSMKSGWCTATDVSHKGELGISCYTMQDCTQDGEMHEVVIIEKAKDGSELPIEPKGSSISPAEGNQHDIITDDPECDFGHDNHVNEMNNMSHSGFLPKTV
        G  +S  CTATD  H+ E GI C+TM    QD EMHE++ +EK  D S LP EPK S+ S AEGN H+   DD +CD GHDNHVN +N MSHS F P+TV
Subjt:  GSMKSGWCTATDVSHKGELGISCYTMQDCTQDGEMHEVVIIEKAKDGSELPIEPKGSSISPAEGNQHDIITDDPECDFGHDNHVNEMNNMSHSGFLPKTV

Query:  ATNIDVGTNLDEEEKDMLSDSDSDRYHNERIDIAMKKKEFFSSQCMVDHDSFLLADRRELTVCVKCNGGGHLLSCDTSDCPLVVHDKCLGSLTRMNDEGD
        AT+IDVG N DEEEKDMLSDSD   + N+ IDIAMKK EFFSSQC+VDHDSF LADR+ELTVCVKCN GG LLSC+ SDC LVVHDKCLG   RMNDEGD
Subjt:  ATNIDVGTNLDEEEKDMLSDSDSDRYHNERIDIAMKKKEFFSSQCMVDHDSFLLADRRELTVCVKCNGGGHLLSCDTSDCPLVVHDKCLGSLTRMNDEGD

Query:  FYCPFCSYSLAISEYLEAKKHAALAKKNVATLFDTALGHQSIVIKEVLQQKDLDPSRKAGVEDVAKIFEDVDLENKHNQVTLDGKQVNEVVDQSS---TD
        F CPFC YSLAISEYLEAK++AALAKKNVAT     L HQSI IKEVLQ+KDL PSRKAGVEDVAKI EDV+LENK NQVTLDG+ VNEV+D  S   TD
Subjt:  FYCPFCSYSLAISEYLEAKKHAALAKKNVATLFDTALGHQSIVIKEVLQQKDLDPSRKAGVEDVAKIFEDVDLENKHNQVTLDGKQVNEVVDQSS---TD

Query:  IEPIIELSNPLHITNSNHRENDASPLRVAPDVLSGEKNGDELVDQEHQGNTVAELVDQERRDNAVAELVDQERQGNAVTELVDQKCQGNVAELEDGQKAT
        IE   +LS PL I+NSNHREN+A+PLRVAPDVL+GEK+GDELVDQE +GNTV ELVDQE + N V ELVDQE QGN V ELVDQ+CQGNVAELEDG KAT
Subjt:  IEPIIELSNPLHITNSNHRENDASPLRVAPDVLSGEKNGDELVDQEHQGNTVAELVDQERRDNAVAELVDQERQGNAVTELVDQKCQGNVAELEDGQKAT

Query:  EQHDISVSDILHEDQGPVEATTMREGLQYQTDDN-EELAYAINIEGEKSSDDENDESIISRYSIRFRRKYHHTCPETHQFRRKKLPWTAEEEETLREGVR
        EQ+DI   + +HEDQGPVEA T +EGL+YQTDDN EE  YAINIEGEKSSDDENDESIISRYSIRFR++YH TCPET Q RRKKLPWTAEEEETLREGVR
Subjt:  EQHDISVSDILHEDQGPVEATTMREGLQYQTDDN-EELAYAINIEGEKSSDDENDESIISRYSIRFRRKYHHTCPETHQFRRKKLPWTAEEEETLREGVR

Query:  KFSSSVERSPNIPWKKILEFGSNVFLQDRTSIDLKDKWRKLCRSPKMK
        KFSSSV+RSP IPWKKILEFGS VFL+ RTSIDLKDKWR LCRSPK K
Subjt:  KFSSSVERSPNIPWKKILEFGSNVFLQDRTSIDLKDKWRKLCRSPKMK

A0A6J1FB82 uncharacterized protein LOC1114424395.9e-28167.63Show/hide
Query:  SALAWHWTIEALASFKEIKPSLLH-----------GDEKERGEMVALKCLEGLFDPLNYIGENGPPTQEPKVTFDSSRSCEHVVKRICKETPKSALRVAG
        S+LAW WTIEALAS +E+KPSLLH           G  K  GEMVALKCLEGLF  L+YIGEN  P QE KV FDSS  CE VVKRI KETPKS+LRVAG
Subjt:  SALAWHWTIEALASFKEIKPSLLH-----------GDEKERGEMVALKCLEGLFDPLNYIGENGPPTQEPKVTFDSSRSCEHVVKRICKETPKSALRVAG

Query:  PDMLKWDVRPFIDQRRASMRCTLHQLKDTILDGTHPYADFLMKKSGLTPINKRDSISPNNEDGIELSRRLDSSFPDPPGQKEKGKGSPLLPEDERIISVV
        PD+LKWDVR F DQ+RASMRCTLH+LKD ILDGTHP ADFLM+KSGLTPINKR  I  NNED IELS RLD S   P GQ EKGKGSPLL ED+R ISVV
Subjt:  PDMLKWDVRPFIDQRRASMRCTLHQLKDTILDGTHPYADFLMKKSGLTPINKRDSISPNNEDGIELSRRLDSSFPDPPGQKEKGKGSPLLPEDERIISVV

Query:  SQSSSSLLPSKRSAVEFTSEDEARQLPGCDDGYINIKKLEQHSAHTLYSGQKVASSHGIKVLEDLPERIVPQNERDDTDHLDEHQMTLGEDKLVEEERFG
        +  SSSLLPSKRS V+FTSEDEARQLPGC DGYIN+KKL+QHSAHT +SGQ+VASSH  +VLED  ER VPQNERDDTD LDEHQ+T  +D+ VE+  FG
Subjt:  SQSSSSLLPSKRSAVEFTSEDEARQLPGCDDGYINIKKLEQHSAHTLYSGQKVASSHGIKVLEDLPERIVPQNERDDTDHLDEHQMTLGEDKLVEEERFG

Query:  SMKSGWCTATDVSHKGELGISCYTMQDCTQDGEMHEVVIIEKAKDGSELPIEPKGSSISPAEGNQHDIITDDPECDFGHDNHVNEMNNMSHSGFLPKTVA
        S K          H+ + GISCYTM   TQD EM EVV +EK KDGSELP EPK S+ SPAE N H+   D+ +CD GHD HVNE N MS SGF+ KTVA
Subjt:  SMKSGWCTATDVSHKGELGISCYTMQDCTQDGEMHEVVIIEKAKDGSELPIEPKGSSISPAEGNQHDIITDDPECDFGHDNHVNEMNNMSHSGFLPKTVA

Query:  TNIDVGTNLDEEEKDMLSDSDSDRYHNERIDIAMKKKEFFSSQCMVDHDSFLLADRRELTVCVKCNGGGHLLSCDTSDCPLVVHDKCLGSLTRMNDEGDF
        TN++VG   D +EKD+LSDSD   YH E IDIA +KKEF SSQCMVDHDSF LAD R L VCVKCN GG LL C+ SDCPLVVH KCL S   M DEGDF
Subjt:  TNIDVGTNLDEEEKDMLSDSDSDRYHNERIDIAMKKKEFFSSQCMVDHDSFLLADRRELTVCVKCNGGGHLLSCDTSDCPLVVHDKCLGSLTRMNDEGDF

Query:  YCPFCSYSLAISEYLEAKKHAALAKKNVATLFDTALGHQSIVIKEVLQQKDLDPSRKAGVEDVAKIFEDVDLENKHNQVTLDGKQVNEVVDQSS---TDI
         CPFC YSLAISEYLEAKKH A  KKNVA+ F TALGHQS  ++EVLQQ D+DPS++  VEDVAKI EDV+LE+K NQV+LDG++VNEVVD  S   TD 
Subjt:  YCPFCSYSLAISEYLEAKKHAALAKKNVATLFDTALGHQSIVIKEVLQQKDLDPSRKAGVEDVAKIFEDVDLENKHNQVTLDGKQVNEVVDQSS---TDI

Query:  EPIIELSNPLHITNSNHRENDASPLRVAPDVLSGEKNGDELVDQEHQGNTVAELVDQERRDNAVAELVDQERQGNAVTELVDQKCQGNVAELEDGQKATE
        E I ELS PLHI NSNHRE  ASP RVA D L GE+NG ELVDQE QGNTVA                           +VDQKC+GNVAE EDGQK TE
Subjt:  EPIIELSNPLHITNSNHRENDASPLRVAPDVLSGEKNGDELVDQEHQGNTVAELVDQERRDNAVAELVDQERQGNAVTELVDQKCQGNVAELEDGQKATE

Query:  QHDISVSDILHEDQGPVEATTMREGLQYQTDDNE-ELAYAINIEGEKSSDDENDESIISRYSIRFRRKYHHTCPETHQFRRKKLPWTAEEEETL
        QHDI   +ILHE +GPVE    + GLQYQTDD+E + A AI  EGEKSSDD NDESIISRYSIRFR+K HHT PETH  RRKKLPWTAEEEETL
Subjt:  QHDISVSDILHEDQGPVEATTMREGLQYQTDDNE-ELAYAINIEGEKSSDDENDESIISRYSIRFRRKYHHTCPETHQFRRKKLPWTAEEEETL

SwissProt top hitse value%identityAlignment
F4I7L1 Telomere repeat-binding factor 42.7e-0446.97Show/hide
Query:  KKLPWTAEEEETLREGVRKFSSSVERSPNIPWKKIL---EFGSNVFLQDRTSIDLKDKWRKLCRSP
        +KL WTAEEEE L  GVRK            WK IL   E      L  R++IDLKDKWR L  +P
Subjt:  KKLPWTAEEEETLREGVRKFSSSVERSPNIPWKKIL---EFGSNVFLQDRTSIDLKDKWRKLCRSP

F4IEY4 Telomere repeat-binding factor 53.5e-0446.97Show/hide
Query:  KKLPWTAEEEETLREGVRKFSSSVERSPNIPWKKIL---EFGSNVFLQDRTSIDLKDKWRKLCRSP
        +KL WTAEEEE L  G+RK            WK IL   EF     L  R++IDLKDKWR L   P
Subjt:  KKLPWTAEEEETLREGVRKFSSSVERSPNIPWKKIL---EFGSNVFLQDRTSIDLKDKWRKLCRSP

Arabidopsis top hitse value%identityAlignment
AT1G01150.1 Homeodomain-like protein with RING/FYVE/PHD-type zinc finger domain6.1e-1254.24Show/hide
Query:  KKLPWTAEEEETLREGVRKFSSSVERSPNIPWKKILEFGSNVFLQDRTSIDLKDKWRKL
        K++ WT  EE+ LREGV KFS ++ +  N+PWKKILE G  +F   R S DLKDKWR +
Subjt:  KKLPWTAEEEETLREGVRKFSSSVERSPNIPWKKILEFGSNVFLQDRTSIDLKDKWRKL

AT1G14770.1 RING/FYVE/PHD zinc finger superfamily protein3.7e-0927.59Show/hide
Query:  WHWTIEALASFKEIKPSLL--------HGDE--KERGEMVALKCLEGLFDP--LNYIGENGPPTQEPKVTFDSSRSCEHVVKRICKETPKSALRVAGPDM
        W W IE +A F +    LL        + D+  K   E+++L+ LE +FDP   +  G       E KV FD S S   V++ I KE P + LRV  P++
Subjt:  WHWTIEALASFKEIKPSLL--------HGDE--KERGEMVALKCLEGLFDP--LNYIGENGPPTQEPKVTFDSSRSCEHVVKRICKETPKSALRVAGPDM

Query:  LKWDVRPFIDQRRASM-RCTLHQLKDTILDGTHPYADFLMKKSGLTPINKRDSISPNNEDGIELSRRLDSSFPDPPGQKEKGKGSPLLPEDERIISVVSQ
         K++V PFI  +   + +C L +L+D  L         +  ++  TP    ++  P   D  + S  +D    +P  +++   G+   P DE+ + + ++
Subjt:  LKWDVRPFIDQRRASM-RCTLHQLKDTILDGTHPYADFLMKKSGLTPINKRDSISPNNEDGIELSRRLDSSFPDPPGQKEKGKGSPLLPEDERIISVVSQ

Query:  SSS
          S
Subjt:  SSS

AT1G14770.2 RING/FYVE/PHD zinc finger superfamily protein3.7e-0927.59Show/hide
Query:  WHWTIEALASFKEIKPSLL--------HGDE--KERGEMVALKCLEGLFDP--LNYIGENGPPTQEPKVTFDSSRSCEHVVKRICKETPKSALRVAGPDM
        W W IE +A F +    LL        + D+  K   E+++L+ LE +FDP   +  G       E KV FD S S   V++ I KE P + LRV  P++
Subjt:  WHWTIEALASFKEIKPSLL--------HGDE--KERGEMVALKCLEGLFDP--LNYIGENGPPTQEPKVTFDSSRSCEHVVKRICKETPKSALRVAGPDM

Query:  LKWDVRPFIDQRRASM-RCTLHQLKDTILDGTHPYADFLMKKSGLTPINKRDSISPNNEDGIELSRRLDSSFPDPPGQKEKGKGSPLLPEDERIISVVSQ
         K++V PFI  +   + +C L +L+D  L         +  ++  TP    ++  P   D  + S  +D    +P  +++   G+   P DE+ + + ++
Subjt:  LKWDVRPFIDQRRASM-RCTLHQLKDTILDGTHPYADFLMKKSGLTPINKRDSISPNNEDGIELSRRLDSSFPDPPGQKEKGKGSPLLPEDERIISVVSQ

Query:  SSS
          S
Subjt:  SSS

AT1G68030.1 RING/FYVE/PHD zinc finger superfamily protein4.3e-1325Show/hide
Query:  PPSSTLIAATSSPFVVANLFYCSALAWHWTIEALASFK----------EIKPSLLHGDEKERGEMVALKCLEGLFDPLNYIGENGPPTQEPKVTFDSSRS
        PP+S  +      FV  NL   +   W W IE  A FK          E+ P L     +   EMVA +CL  LFD  + +      T    + FDSS S
Subjt:  PPSSTLIAATSSPFVVANLFYCSALAWHWTIEALASFK----------EIKPSLLHGDEKERGEMVALKCLEGLFDPLNYIGENGPPTQEPKVTFDSSRS

Query:  CEHVVKRICKETPKSALRVAGPDMLKWDVRPFIDQRRASM-RCTLHQLKDTILDGTHPYADFLMKKSGLTPINKRDSISPNNEDGIELSRRLDSSFPDPP
        CE+V++ I  E P S L+   P + KW+++PFI  +  S+ +C L  + +                + ++P ++ +++  + ++   ++  +D   PD  
Subjt:  CEHVVKRICKETPKSALRVAGPDMLKWDVRPFIDQRRASM-RCTLHQLKDTILDGTHPYADFLMKKSGLTPINKRDSISPNNEDGIELSRRLDSSFPDPP

Query:  GQKEKGKGSPLLPEDERIISVVSQSSSSLLPSKRSAVEFTSEDEARQL
        G+ E+G+               S+ +  LL  + +   + SE++   L
Subjt:  GQKEKGKGSPLLPEDERIISVVSQSSSSLLPSKRSAVEFTSEDEARQL

AT1G68030.1 RING/FYVE/PHD zinc finger superfamily protein1.3e-0937.68Show/hide
Query:  CVKCNGGGHLLSCDTSDCPLVVHDKCLGSLTRMNDEGDFYCPFCSYSLAISEYLEAKKHAALAKKNVAT
        CV C   G LL C +  C ++VH KCL S    +D GDFYC  C+ +   +EY++ +   A AK+ + +
Subjt:  CVKCNGGGHLLSCDTSDCPLVVHDKCLGSLTRMNDEGDFYCPFCSYSLAISEYLEAKKHAALAKKNVAT

AT5G03780.1 TRF-like 103.1e-1122.75Show/hide
Query:  MLSDSDSDRYHNERIDIAMKKKEFFSSQCMVDHDSFLLADRRELTVCVKCNGGGHLLS-CDTSDCPLVVHDKCL---------GSLTRMNDEGDFYCPFC
        +L D  ++    +  D+  KK+    +    D     +    +L  C+ C      +S C   DC L  H +CL          S +   D  + +CP+C
Subjt:  MLSDSDSDRYHNERIDIAMKKKEFFSSQCMVDHDSFLLADRRELTVCVKCNGGGHLLS-CDTSDCPLVVHDKCL---------GSLTRMNDEGDFYCPFC

Query:  SYSLAISEYLEAKKHAALAKKNVATLFDTALGHQSIVIKEVLQQKDLDPSRKAGVEDVAKIFEDVDLENKHNQVTLDGKQVNEVVDQSSTDIEPIIELSN
           +       A K   L +K V                                E     + D +++++   +TL G ++     + STDI    EL  
Subjt:  SYSLAISEYLEAKKHAALAKKNVATLFDTALGHQSIVIKEVLQQKDLDPSRKAGVEDVAKIFEDVDLENKHNQVTLDGKQVNEVVDQSSTDIEPIIELSN

Query:  PLHITNSNHRENDASPLRVAPDVLSGEKNGDELVDQEHQGNTVAELVDQERRDNAVAELVDQERQGNAVTELVDQKCQGNVAELEDGQKATEQHDISVSD
                  E D    +  PD   G+    +++D+      VA    Q+  D+  A     + QG  +         G   + E     + Q   S  +
Subjt:  PLHITNSNHRENDASPLRVAPDVLSGEKNGDELVDQEHQGNTVAELVDQERRDNAVAELVDQERQGNAVTELVDQKCQGNVAELEDGQKATEQHDISVSD

Query:  ILHEDQGPVEATTMREGLQYQTDDNEELAYAINIEGEKSSDDENDESIISRYSIRFRRKY--------HHTCPETHQFR---RKKLPWTAEEEETLREGV
           +DQ        R GL+    D      +    GE  ++       ++  S R R +           T  +   F+   R++L WT EEEE L+ GV
Subjt:  ILHEDQGPVEATTMREGLQYQTDDNEELAYAINIEGEKSSDDENDESIISRYSIRFRRKY--------HHTCPETHQFR---RKKLPWTAEEEETLREGV

Query:  RKFSSSVERSPNIPWKKILEFGSNVFLQDRTSIDLKDKWRKLCR
         KF++  E + N+PW+KILE G  VF + RT  DLKDKWR + +
Subjt:  RKFSSSVERSPNIPWKKILEFGSNVFLQDRTSIDLKDKWRKLCR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTTGGTGTTCAACAATCTTCTCGTGCATCACATCCCGCTGAAGGAAGTCCAAGGCGAACAAATGGATGCCTTGAGCTTCGACCTTAGAGGGACGGTTGTGACTTT
CTCGAAAGAAGAATTCCTTCTTATGATTGGATGGGTTTTCGTTTTTCATTCTTCTTCATATCCTCCTTCATCCACACTAATCGCGGCAACTTCTTCTCCGTTCGTTGTCG
CCAACCTCTTCTACTGTTCAGCTCTTGCTTGGCATTGGACCATCGAAGCCCTTGCAAGTTTCAAGGAAATAAAACCATCTCTATTACATGGAGACGAGAAAGAACGCGGG
GAAATGGTTGCTTTGAAATGCTTAGAGGGTTTGTTTGATCCTTTAAATTATATTGGAGAAAATGGTCCTCCTACTCAAGAACCAAAAGTTACGTTTGATTCATCTAGGAG
CTGTGAACATGTTGTTAAACGCATATGTAAGGAGACTCCAAAGTCTGCCTTAAGAGTGGCTGGACCAGATATGTTAAAATGGGATGTTAGACCCTTTATTGACCAAAGAA
GAGCATCCATGCGTTGTACATTACATCAGCTGAAAGATACTATTCTTGATGGTACACATCCATATGCTGATTTCTTAATGAAGAAGAGTGGCTTGACACCCATAAATAAG
AGGGACAGCATTTCTCCAAATAATGAGGATGGTATTGAGCTTAGCAGGAGACTTGATAGTAGCTTCCCTGATCCTCCAGGTCAAAAAGAAAAAGGCAAAGGAAGCCCTCT
ACTTCCTGAGGATGAAAGAATAATATCAGTGGTGAGCCAATCTAGCTCTAGTTTGTTACCTTCTAAAAGGAGTGCCGTTGAGTTTACATCCGAAGATGAGGCTAGACAGT
TACCCGGTTGTGATGATGGTTACATAAACATTAAGAAACTTGAGCAACATTCTGCACATACTCTGTATTCCGGGCAGAAAGTGGCTTCTTCACATGGAATAAAGGTGTTG
GAAGATTTACCTGAAAGAATCGTGCCGCAAAATGAGAGAGATGATACTGATCACTTGGATGAACATCAGATGACTTTGGGGGAAGACAAACTTGTAGAAGAAGAACGTTT
TGGGTCAATGAAGTCTGGATGGTGTACTGCTACTGATGTATCGCATAAGGGTGAATTAGGTATTTCCTGTTATACTATGCAGGACTGTACACAAGATGGTGAGATGCATG
AAGTTGTTATTATCGAGAAAGCAAAAGATGGAAGTGAACTGCCTATTGAGCCAAAAGGATCTAGTATTTCTCCTGCCGAAGGAAATCAGCATGATATCATCACTGATGAT
CCCGAGTGTGATTTTGGGCATGATAATCATGTAAACGAAATGAATAATATGTCTCATAGTGGGTTTCTGCCGAAGACTGTTGCTACCAATATTGATGTTGGCACGAATCT
TGATGAGGAAGAGAAAGACATGTTAAGTGATAGTGATAGTGATAGATATCATAATGAAAGGATTGATATTGCCATGAAAAAAAAAGAATTCTTTAGTTCTCAATGTATGG
TCGATCACGATTCTTTTCTATTAGCTGACAGGAGAGAGCTAACTGTTTGTGTGAAGTGTAATGGAGGTGGTCATTTGTTGTCTTGTGATACTAGTGATTGTCCTTTGGTG
GTTCATGATAAGTGCTTGGGTTCCTTGACTAGGATGAATGATGAAGGCGATTTTTATTGTCCTTTCTGCTCATATTCACTTGCTATATCAGAATACCTTGAAGCTAAGAA
GCACGCTGCATTGGCAAAGAAAAATGTTGCTACTTTGTTTGATACTGCTTTGGGACATCAGTCAATAGTCATTAAAGAGGTATTGCAACAAAAAGATCTTGATCCTTCAC
GAAAAGCTGGAGTTGAGGACGTTGCTAAAATTTTTGAAGATGTGGACTTGGAAAATAAACACAATCAAGTAACTTTAGATGGAAAACAAGTAAATGAAGTTGTTGATCAA
TCCTCAACAGATATTGAGCCAATAATAGAGCTGTCAAACCCATTGCATATTACCAATTCCAATCATAGAGAAAACGATGCAAGTCCTTTGAGAGTAGCACCTGATGTTTT
ATCTGGAGAGAAAAATGGCGATGAATTGGTGGACCAAGAGCATCAAGGAAATACAGTAGCAGAATTGGTGGACCAAGAGCGTCGAGACAATGCAGTAGCGGAATTGGTGG
ACCAAGAGCGTCAAGGCAATGCAGTAACTGAATTGGTGGACCAAAAGTGTCAAGGCAATGTTGCAGAATTGGAAGATGGTCAAAAAGCCACAGAGCAGCATGACATTTCT
GTTTCTGATATTCTCCACGAAGATCAAGGGCCAGTTGAAGCAACGACTATGCGAGAAGGTTTACAGTATCAAACCGATGACAATGAAGAACTTGCTTATGCAATTAATAT
TGAAGGAGAAAAATCTTCTGATGATGAAAACGACGAGTCTATCATTTCTAGATACTCCATCAGATTTCGACGGAAATATCATCATACGTGTCCAGAAACTCATCAATTCA
GACGGAAGAAACTACCCTGGACAGCTGAGGAGGAAGAGACACTGAGGGAGGGAGTTCGAAAATTCTCTAGTTCTGTTGAAAGAAGTCCAAACATACCTTGGAAAAAGATT
TTAGAATTTGGTAGTAATGTGTTTCTGCAAGATCGTACATCAATAGATCTTAAAGATAAATGGAGGAAGTTGTGCAGAAGTCCAAAGATGAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGATTTGGTGTTCAACAATCTTCTCGTGCATCACATCCCGCTGAAGGAAGTCCAAGGCGAACAAATGGATGCCTTGAGCTTCGACCTTAGAGGGACGGTTGTGACTTT
CTCGAAAGAAGAATTCCTTCTTATGATTGGATGGGTTTTCGTTTTTCATTCTTCTTCATATCCTCCTTCATCCACACTAATCGCGGCAACTTCTTCTCCGTTCGTTGTCG
CCAACCTCTTCTACTGTTCAGCTCTTGCTTGGCATTGGACCATCGAAGCCCTTGCAAGTTTCAAGGAAATAAAACCATCTCTATTACATGGAGACGAGAAAGAACGCGGG
GAAATGGTTGCTTTGAAATGCTTAGAGGGTTTGTTTGATCCTTTAAATTATATTGGAGAAAATGGTCCTCCTACTCAAGAACCAAAAGTTACGTTTGATTCATCTAGGAG
CTGTGAACATGTTGTTAAACGCATATGTAAGGAGACTCCAAAGTCTGCCTTAAGAGTGGCTGGACCAGATATGTTAAAATGGGATGTTAGACCCTTTATTGACCAAAGAA
GAGCATCCATGCGTTGTACATTACATCAGCTGAAAGATACTATTCTTGATGGTACACATCCATATGCTGATTTCTTAATGAAGAAGAGTGGCTTGACACCCATAAATAAG
AGGGACAGCATTTCTCCAAATAATGAGGATGGTATTGAGCTTAGCAGGAGACTTGATAGTAGCTTCCCTGATCCTCCAGGTCAAAAAGAAAAAGGCAAAGGAAGCCCTCT
ACTTCCTGAGGATGAAAGAATAATATCAGTGGTGAGCCAATCTAGCTCTAGTTTGTTACCTTCTAAAAGGAGTGCCGTTGAGTTTACATCCGAAGATGAGGCTAGACAGT
TACCCGGTTGTGATGATGGTTACATAAACATTAAGAAACTTGAGCAACATTCTGCACATACTCTGTATTCCGGGCAGAAAGTGGCTTCTTCACATGGAATAAAGGTGTTG
GAAGATTTACCTGAAAGAATCGTGCCGCAAAATGAGAGAGATGATACTGATCACTTGGATGAACATCAGATGACTTTGGGGGAAGACAAACTTGTAGAAGAAGAACGTTT
TGGGTCAATGAAGTCTGGATGGTGTACTGCTACTGATGTATCGCATAAGGGTGAATTAGGTATTTCCTGTTATACTATGCAGGACTGTACACAAGATGGTGAGATGCATG
AAGTTGTTATTATCGAGAAAGCAAAAGATGGAAGTGAACTGCCTATTGAGCCAAAAGGATCTAGTATTTCTCCTGCCGAAGGAAATCAGCATGATATCATCACTGATGAT
CCCGAGTGTGATTTTGGGCATGATAATCATGTAAACGAAATGAATAATATGTCTCATAGTGGGTTTCTGCCGAAGACTGTTGCTACCAATATTGATGTTGGCACGAATCT
TGATGAGGAAGAGAAAGACATGTTAAGTGATAGTGATAGTGATAGATATCATAATGAAAGGATTGATATTGCCATGAAAAAAAAAGAATTCTTTAGTTCTCAATGTATGG
TCGATCACGATTCTTTTCTATTAGCTGACAGGAGAGAGCTAACTGTTTGTGTGAAGTGTAATGGAGGTGGTCATTTGTTGTCTTGTGATACTAGTGATTGTCCTTTGGTG
GTTCATGATAAGTGCTTGGGTTCCTTGACTAGGATGAATGATGAAGGCGATTTTTATTGTCCTTTCTGCTCATATTCACTTGCTATATCAGAATACCTTGAAGCTAAGAA
GCACGCTGCATTGGCAAAGAAAAATGTTGCTACTTTGTTTGATACTGCTTTGGGACATCAGTCAATAGTCATTAAAGAGGTATTGCAACAAAAAGATCTTGATCCTTCAC
GAAAAGCTGGAGTTGAGGACGTTGCTAAAATTTTTGAAGATGTGGACTTGGAAAATAAACACAATCAAGTAACTTTAGATGGAAAACAAGTAAATGAAGTTGTTGATCAA
TCCTCAACAGATATTGAGCCAATAATAGAGCTGTCAAACCCATTGCATATTACCAATTCCAATCATAGAGAAAACGATGCAAGTCCTTTGAGAGTAGCACCTGATGTTTT
ATCTGGAGAGAAAAATGGCGATGAATTGGTGGACCAAGAGCATCAAGGAAATACAGTAGCAGAATTGGTGGACCAAGAGCGTCGAGACAATGCAGTAGCGGAATTGGTGG
ACCAAGAGCGTCAAGGCAATGCAGTAACTGAATTGGTGGACCAAAAGTGTCAAGGCAATGTTGCAGAATTGGAAGATGGTCAAAAAGCCACAGAGCAGCATGACATTTCT
GTTTCTGATATTCTCCACGAAGATCAAGGGCCAGTTGAAGCAACGACTATGCGAGAAGGTTTACAGTATCAAACCGATGACAATGAAGAACTTGCTTATGCAATTAATAT
TGAAGGAGAAAAATCTTCTGATGATGAAAACGACGAGTCTATCATTTCTAGATACTCCATCAGATTTCGACGGAAATATCATCATACGTGTCCAGAAACTCATCAATTCA
GACGGAAGAAACTACCCTGGACAGCTGAGGAGGAAGAGACACTGAGGGAGGGAGTTCGAAAATTCTCTAGTTCTGTTGAAAGAAGTCCAAACATACCTTGGAAAAAGATT
TTAGAATTTGGTAGTAATGTGTTTCTGCAAGATCGTACATCAATAGATCTTAAAGATAAATGGAGGAAGTTGTGCAGAAGTCCAAAGATGAAATGA
Protein sequenceShow/hide protein sequence
MDLVFNNLLVHHIPLKEVQGEQMDALSFDLRGTVVTFSKEEFLLMIGWVFVFHSSSYPPSSTLIAATSSPFVVANLFYCSALAWHWTIEALASFKEIKPSLLHGDEKERG
EMVALKCLEGLFDPLNYIGENGPPTQEPKVTFDSSRSCEHVVKRICKETPKSALRVAGPDMLKWDVRPFIDQRRASMRCTLHQLKDTILDGTHPYADFLMKKSGLTPINK
RDSISPNNEDGIELSRRLDSSFPDPPGQKEKGKGSPLLPEDERIISVVSQSSSSLLPSKRSAVEFTSEDEARQLPGCDDGYINIKKLEQHSAHTLYSGQKVASSHGIKVL
EDLPERIVPQNERDDTDHLDEHQMTLGEDKLVEEERFGSMKSGWCTATDVSHKGELGISCYTMQDCTQDGEMHEVVIIEKAKDGSELPIEPKGSSISPAEGNQHDIITDD
PECDFGHDNHVNEMNNMSHSGFLPKTVATNIDVGTNLDEEEKDMLSDSDSDRYHNERIDIAMKKKEFFSSQCMVDHDSFLLADRRELTVCVKCNGGGHLLSCDTSDCPLV
VHDKCLGSLTRMNDEGDFYCPFCSYSLAISEYLEAKKHAALAKKNVATLFDTALGHQSIVIKEVLQQKDLDPSRKAGVEDVAKIFEDVDLENKHNQVTLDGKQVNEVVDQ
SSTDIEPIIELSNPLHITNSNHRENDASPLRVAPDVLSGEKNGDELVDQEHQGNTVAELVDQERRDNAVAELVDQERQGNAVTELVDQKCQGNVAELEDGQKATEQHDIS
VSDILHEDQGPVEATTMREGLQYQTDDNEELAYAINIEGEKSSDDENDESIISRYSIRFRRKYHHTCPETHQFRRKKLPWTAEEEETLREGVRKFSSSVERSPNIPWKKI
LEFGSNVFLQDRTSIDLKDKWRKLCRSPKMK