; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0006926 (gene) of Snake gourd v1 genome

Gene IDTan0006926
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPHD-type domain-containing protein
Genome locationLG08:58141759..58145631
RNA-Seq ExpressionTan0006926
SyntenyTan0006926
Gene Ontology termsGO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR001005 - SANT/Myb domain
IPR001965 - Zinc finger, PHD-type
IPR009057 - Homeobox-like domain superfamily
IPR011011 - Zinc finger, FYVE/PHD-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR017930 - Myb domain
IPR019786 - Zinc finger, PHD-type, conserved site
IPR019787 - Zinc finger, PHD-finger


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6596217.1 hypothetical protein SDJN03_09397, partial [Cucurbita argyrosperma subsp. sororia]1.7e-30276.86Show/hide
Query:  MENETESSSSSALAWYWTIEALASFTDVKPSLLHDVIDKALELPEDTRKNAGEMVALKCLEGLFGPLNYSGENGPPAQESKVMFDSSESCEDVVKRVYKE
        MENE ES+SSSALAW WT+EALASF +VK SLL DVID+  ELPE TRKNAGEM+AL+CLEGLFGPLNYSGENGPPA++SKVMFDSSE CEDV+KRV KE
Subjt:  MENETESSSSSALAWYWTIEALASFTDVKPSLLHDVIDKALELPEDTRKNAGEMVALKCLEGLFGPLNYSGENGPPAQESKVMFDSSESCEDVVKRVYKE

Query:  TPKSALRVAGPYMSKWDVSPFIAQKRASMRCTLLQLKDSLLDGTHPYADFLKQKSGLMPVKKRDSISLNDDDLIERSKRLDSSFSDPQGQKEKHKGSPLL
        T KSALRVA P MSKWDVSPFIAQKRASMRCTLLQL+DS+LDGTHPYADFL QKSGLMPVKK+D I LN+ DL +R +RLD+S SDPQ QKEK +GSP+L
Subjt:  TPKSALRVAGPYMSKWDVSPFIAQKRASMRCTLLQLKDSLLDGTHPYADFLKQKSGLMPVKKRDSISLNDDDLIERSKRLDSSFSDPQGQKEKHKGSPLL

Query:  LEDE-RRLSVADPSSSTLLPSKRSSVDLTSEDEARQLSGCDEGCTNAKKLKQHSAHTLYSGQEVASSHGTEVLADLSERVVPQNERDDTDHLDEHQMTSV
        LE+E RRLSVADPS+S+LLPSKRS VDLTSEDEARQL GC++G  NAK+ K   AHT +SG EVASSHG EV     E VVPQ ERDD D    HQMTSV
Subjt:  LEDE-RRLSVADPSSSTLLPSKRSSVDLTSEDEARQLSGCDEGCTNAKKLKQHSAHTLYSGQEVASSHGTEVLADLSERVVPQNERDDTDHLDEHQMTSV

Query:  EDKVV-EEHFGS-NNGQYTATDELHQVESGAPCYTTPDSTQDGGMHEVISAEKAKDGEELPIEPKASNNSPAEGNVHDIITGNSEHDFGDDNHVHEMNTL
        EDKVV EE+FGS  +GQYTATDELHQVES  PCYTT   + D  M EVIS EK+KDG ELP+E KASNNSP EGN HDIITG+SEHDFG DNHV+EMNTL
Subjt:  EDKVV-EEHFGS-NNGQYTATDELHQVESGAPCYTTPDSTQDGGMHEVISAEKAKDGEELPIEPKASNNSPAEGNVHDIITGNSEHDFGDDNHVHEMNTL

Query:  SHSGFLAQAVATDIDVGMNPNEEEKDMLSDNDGYHNERTDIAMKKKEFFSSQCIVDHDSFLLADGRELTVCVKCNEGGQLLSCNISDCPLVVHDKCLGSS
        SH GFL + V TDIDVGMNP+EEEKDMLSD+DGYHN    IA KKKEF SSQC+ D DSFLLA+G   +VCVKCNEGGQLLSCN S CPLVVHDKCL SS
Subjt:  SHSGFLAQAVATDIDVGMNPNEEEKDMLSDNDGYHNERTDIAMKKKEFFSSQCIVDHDSFLLADGRELTVCVKCNEGGQLLSCNISDCPLVVHDKCLGSS

Query:  ARMDDEGDFNCPFCFYSLAISEYLEAKKNAALAKKNVATFIRTGLGHQSIVIKEVLQQKDLDPSRKAGGEDVAKICEDVDLENKGNQVTLDGEHVKEVVD
        ARM  EG F+CPFC  SLAISEYLEAKKNAALAKKN+ATF RT LGH SIVI+E LQQKDLDPSR A  ED AKI E   LEN+ NQVTLDGEHVKE V+
Subjt:  ARMDDEGDFNCPFCFYSLAISEYLEAKKNAALAKKNVATFIRTGLGHQSIVIKEVLQQKDLDPSRKAGGEDVAKICEDVDLENKGNQVTLDGEHVKEVVD

Query:  HQSPIVTDIERIIELSRPLHVANSNHRENEASPLRVAPDVLSAGKDGDELVDQECQDNVAELEDGLKATEQHDIYDTLHEDQGPVEAAATQEGLQNQTDG
        HQS  VTDIE  IELS+PLH+ NSNHRENEAS  RVAPDVLS  KDGDELV+QEC  +VAEL+DGL+ATEQHDIYDTLHEDQGPVEAAA QEGLQ QTD 
Subjt:  HQSPIVTDIERIIELSRPLHVANSNHRENEASPLRVAPDVLSAGKDGDELVDQECQDNVAELEDGLKATEQHDIYDTLHEDQGPVEAAATQEGLQNQTDG

Query:  NGEEPAYAINIEGEK-SSDDDNDESIISRYSIRFRRKYH
          EEP+YAINI GEK S DDDND+SIISRYSIRFRRKYH
Subjt:  NGEEPAYAINIEGEK-SSDDDNDESIISRYSIRFRRKYH

KAG7027757.1 Dehydration-responsive element-binding protein 2F [Cucurbita argyrosperma subsp. argyrosperma]1.7e-30276.86Show/hide
Query:  MENETESSSSSALAWYWTIEALASFTDVKPSLLHDVIDKALELPEDTRKNAGEMVALKCLEGLFGPLNYSGENGPPAQESKVMFDSSESCEDVVKRVYKE
        MENE ES+SSSALAW WT+EALASF +VK SLL DVID+  ELPE TRKNAGEM+AL+CLEGLFGPLNYSGENGPPA++SKVMFDSSE CEDV+KRV KE
Subjt:  MENETESSSSSALAWYWTIEALASFTDVKPSLLHDVIDKALELPEDTRKNAGEMVALKCLEGLFGPLNYSGENGPPAQESKVMFDSSESCEDVVKRVYKE

Query:  TPKSALRVAGPYMSKWDVSPFIAQKRASMRCTLLQLKDSLLDGTHPYADFLKQKSGLMPVKKRDSISLNDDDLIERSKRLDSSFSDPQGQKEKHKGSPLL
        T KSALRVA P MSKWDVSPFIAQKRASMRCTLLQL+DS+LDGTHPYADFL QKSGLMPVKK+D I LN+ DL +R +RLD+S SDPQ QKEK +GSP+L
Subjt:  TPKSALRVAGPYMSKWDVSPFIAQKRASMRCTLLQLKDSLLDGTHPYADFLKQKSGLMPVKKRDSISLNDDDLIERSKRLDSSFSDPQGQKEKHKGSPLL

Query:  LEDE-RRLSVADPSSSTLLPSKRSSVDLTSEDEARQLSGCDEGCTNAKKLKQHSAHTLYSGQEVASSHGTEVLADLSERVVPQNERDDTDHLDEHQMTSV
        LE+E RRLSVADPS+S+LLPSKRS VDLTSEDEARQL GC++G  NAK+ K   AHT +SG EVASSHG EV     E VVPQ ERDD D    HQMTSV
Subjt:  LEDE-RRLSVADPSSSTLLPSKRSSVDLTSEDEARQLSGCDEGCTNAKKLKQHSAHTLYSGQEVASSHGTEVLADLSERVVPQNERDDTDHLDEHQMTSV

Query:  EDKVV-EEHFGS-NNGQYTATDELHQVESGAPCYTTPDSTQDGGMHEVISAEKAKDGEELPIEPKASNNSPAEGNVHDIITGNSEHDFGDDNHVHEMNTL
        EDKVV EE+FGS  +GQYTATDELHQVES  PCYTT   + D  M EVIS EK+KDG ELP+E KASNNSP EGN HDIITG+SEHDFG DNHV+EMNTL
Subjt:  EDKVV-EEHFGS-NNGQYTATDELHQVESGAPCYTTPDSTQDGGMHEVISAEKAKDGEELPIEPKASNNSPAEGNVHDIITGNSEHDFGDDNHVHEMNTL

Query:  SHSGFLAQAVATDIDVGMNPNEEEKDMLSDNDGYHNERTDIAMKKKEFFSSQCIVDHDSFLLADGRELTVCVKCNEGGQLLSCNISDCPLVVHDKCLGSS
        SH GFL + V TDIDVGMNP+EEEKDMLSD+DGYHN    IA KKKEF SSQC+ D DSFLLA+G   +VCVKCNEGGQLLSCN S CPLVVHDKCL SS
Subjt:  SHSGFLAQAVATDIDVGMNPNEEEKDMLSDNDGYHNERTDIAMKKKEFFSSQCIVDHDSFLLADGRELTVCVKCNEGGQLLSCNISDCPLVVHDKCLGSS

Query:  ARMDDEGDFNCPFCFYSLAISEYLEAKKNAALAKKNVATFIRTGLGHQSIVIKEVLQQKDLDPSRKAGGEDVAKICEDVDLENKGNQVTLDGEHVKEVVD
        ARM  EG F+CPFC  SLAISEYLEAKKNAALAKKN+ATF RT LGH SIVI+E LQQKDLDPSR A  ED AKI E   LEN+ NQVTLDGEHVKE V+
Subjt:  ARMDDEGDFNCPFCFYSLAISEYLEAKKNAALAKKNVATFIRTGLGHQSIVIKEVLQQKDLDPSRKAGGEDVAKICEDVDLENKGNQVTLDGEHVKEVVD

Query:  HQSPIVTDIERIIELSRPLHVANSNHRENEASPLRVAPDVLSAGKDGDELVDQECQDNVAELEDGLKATEQHDIYDTLHEDQGPVEAAATQEGLQNQTDG
        HQS  VTDIE  IELS+PLH+ NSNHRENEAS  RVAPDVLS  KDGDELV+QEC  +VAEL+DGL+ATEQHDIYDTLHEDQGPVEAAA QEGLQ QTD 
Subjt:  HQSPIVTDIERIIELSRPLHVANSNHRENEASPLRVAPDVLSAGKDGDELVDQECQDNVAELEDGLKATEQHDIYDTLHEDQGPVEAAATQEGLQNQTDG

Query:  NGEEPAYAINIEGEK-SSDDDNDESIISRYSIRFRRKYH
          EEP+YAINI GEK S DDDND+SIISRYSIRFRRKYH
Subjt:  NGEEPAYAINIEGEK-SSDDDNDESIISRYSIRFRRKYH

XP_023538898.1 uncharacterized protein LOC111799689 isoform X1 [Cucurbita pepo subsp. pepo]1.7e-30276.59Show/hide
Query:  MENETESSSSSALAWYWTIEALASFTDVKPSLLHDVIDKALELPEDTRKNAGEMVALKCLEGLFGPLNYSGENGPPAQESKVMFDSSESCEDVVKRVYKE
        MENE ES+SSSALAW WT+EALASF +VK SLL DVID+  ELPE TRKNAGEM+AL+CLEGLFGPLNYSGENGPPA++SKVMFDSSE CEDV+KRV KE
Subjt:  MENETESSSSSALAWYWTIEALASFTDVKPSLLHDVIDKALELPEDTRKNAGEMVALKCLEGLFGPLNYSGENGPPAQESKVMFDSSESCEDVVKRVYKE

Query:  TPKSALRVAGPYMSKWDVSPFIAQKRASMRCTLLQLKDSLLDGTHPYADFLKQKSGLMPVKKRDSISLNDDDLIERSKRLDSSFSDPQGQKEKHKGSPLL
        T KSALR+A P MSKWDVSPFIAQKRASMRCTLLQL+DS+LDGTHPYADFL QKSGLMPVK +D I LN+ DL E  +RLD+S SDPQ QKE  +GSP+L
Subjt:  TPKSALRVAGPYMSKWDVSPFIAQKRASMRCTLLQLKDSLLDGTHPYADFLKQKSGLMPVKKRDSISLNDDDLIERSKRLDSSFSDPQGQKEKHKGSPLL

Query:  LEDE-RRLSVADPSSSTLLPSKRSSVDLTSEDEARQLSGCDEGCTNAKKLKQHSAHTLYSGQEVASSHGTEVLADLSERVVPQNERDDTDHLDEHQMTSV
        LE+E RRLSVADPS+S+LLPSKRS VDLTSEDEARQL GC++G  NAK+ K   AHT +SG EVASSHG EV     ERVVPQ ERDD D    HQMTSV
Subjt:  LEDE-RRLSVADPSSSTLLPSKRSSVDLTSEDEARQLSGCDEGCTNAKKLKQHSAHTLYSGQEVASSHGTEVLADLSERVVPQNERDDTDHLDEHQMTSV

Query:  EDKVV-EEHFGS-NNGQYTATDELHQVESGAPCYTTPDSTQDGGMHEVISAEKAKDGEELPIEPKASNNSPAEGNVHDIITGNSEHDFGDDNHVHEMNTL
        EDKVV EEHFGS  +GQYTATDELHQVES  PCYTT   +QD  M EVIS EK+KDG ELP+E KASNNSP EGN+HDIITG+S+HDFG DNHV+EMNTL
Subjt:  EDKVV-EEHFGS-NNGQYTATDELHQVESGAPCYTTPDSTQDGGMHEVISAEKAKDGEELPIEPKASNNSPAEGNVHDIITGNSEHDFGDDNHVHEMNTL

Query:  SHSGFLAQAVATDIDVGMNPNEEEKDMLSDNDGYHNERTDIAMKKKEFFSSQCIVDHDSFLLADGRELTVCVKCNEGGQLLSCNISDCPLVVHDKCLGSS
        SH GFL + VATDIDVGMNP+E+EKDMLSD+DGYH +   IA KKKEF SSQC+ D DSFLLA+G   +VCVKCNEGGQLLSCN+SDCPLVVHDKCL SS
Subjt:  SHSGFLAQAVATDIDVGMNPNEEEKDMLSDNDGYHNERTDIAMKKKEFFSSQCIVDHDSFLLADGRELTVCVKCNEGGQLLSCNISDCPLVVHDKCLGSS

Query:  ARMDDEGDFNCPFCFYSLAISEYLEAKKNAALAKKNVATFIRTGLGHQSIVIKEVLQQKDLDPSRKAGGEDVAKICEDVDLENKGNQVTLDGEHVKEVVD
        ARM  EG F+CPFC  SLAISEYLEAKKNAALAKKN+ATF RT LG  SIVI+E LQQKDLDPSR A  ED AKI E   LEN+ NQVTLDGEHVKE V+
Subjt:  ARMDDEGDFNCPFCFYSLAISEYLEAKKNAALAKKNVATFIRTGLGHQSIVIKEVLQQKDLDPSRKAGGEDVAKICEDVDLENKGNQVTLDGEHVKEVVD

Query:  HQSPIVTDIERIIELSRPLHVANSNHRENEASPLRVAPDVLSAGKDGDELVDQECQDNVAELEDGLKATEQHDIYDTLHEDQGPVEAAATQEGLQNQTDG
        HQSP VTDIER IELS+ L + NSN RENEAS  RVAPDVLS  KDGDELV+QECQ NVAEL+DGL+ATEQHDIYDTLHEDQGPVEAA  QEGLQ QTD 
Subjt:  HQSPIVTDIERIIELSRPLHVANSNHRENEASPLRVAPDVLSAGKDGDELVDQECQDNVAELEDGLKATEQHDIYDTLHEDQGPVEAAATQEGLQNQTDG

Query:  NGEEPAYAINIEGEK-SSDDDNDESIISRYSIRFRRKYH
          EEP+YAINI GEK S DDDND+SIISRYSIRFRRKYH
Subjt:  NGEEPAYAINIEGEK-SSDDDNDESIISRYSIRFRRKYH

XP_023538899.1 uncharacterized protein LOC111799689 isoform X2 [Cucurbita pepo subsp. pepo]1.7e-30276.59Show/hide
Query:  MENETESSSSSALAWYWTIEALASFTDVKPSLLHDVIDKALELPEDTRKNAGEMVALKCLEGLFGPLNYSGENGPPAQESKVMFDSSESCEDVVKRVYKE
        MENE ES+SSSALAW WT+EALASF +VK SLL DVID+  ELPE TRKNAGEM+AL+CLEGLFGPLNYSGENGPPA++SKVMFDSSE CEDV+KRV KE
Subjt:  MENETESSSSSALAWYWTIEALASFTDVKPSLLHDVIDKALELPEDTRKNAGEMVALKCLEGLFGPLNYSGENGPPAQESKVMFDSSESCEDVVKRVYKE

Query:  TPKSALRVAGPYMSKWDVSPFIAQKRASMRCTLLQLKDSLLDGTHPYADFLKQKSGLMPVKKRDSISLNDDDLIERSKRLDSSFSDPQGQKEKHKGSPLL
        T KSALR+A P MSKWDVSPFIAQKRASMRCTLLQL+DS+LDGTHPYADFL QKSGLMPVK +D I LN+ DL E  +RLD+S SDPQ QKE  +GSP+L
Subjt:  TPKSALRVAGPYMSKWDVSPFIAQKRASMRCTLLQLKDSLLDGTHPYADFLKQKSGLMPVKKRDSISLNDDDLIERSKRLDSSFSDPQGQKEKHKGSPLL

Query:  LEDE-RRLSVADPSSSTLLPSKRSSVDLTSEDEARQLSGCDEGCTNAKKLKQHSAHTLYSGQEVASSHGTEVLADLSERVVPQNERDDTDHLDEHQMTSV
        LE+E RRLSVADPS+S+LLPSKRS VDLTSEDEARQL GC++G  NAK+ K   AHT +SG EVASSHG EV     ERVVPQ ERDD D    HQMTSV
Subjt:  LEDE-RRLSVADPSSSTLLPSKRSSVDLTSEDEARQLSGCDEGCTNAKKLKQHSAHTLYSGQEVASSHGTEVLADLSERVVPQNERDDTDHLDEHQMTSV

Query:  EDKVV-EEHFGS-NNGQYTATDELHQVESGAPCYTTPDSTQDGGMHEVISAEKAKDGEELPIEPKASNNSPAEGNVHDIITGNSEHDFGDDNHVHEMNTL
        EDKVV EEHFGS  +GQYTATDELHQVES  PCYTT   +QD  M EVIS EK+KDG ELP+E KASNNSP EGN+HDIITG+S+HDFG DNHV+EMNTL
Subjt:  EDKVV-EEHFGS-NNGQYTATDELHQVESGAPCYTTPDSTQDGGMHEVISAEKAKDGEELPIEPKASNNSPAEGNVHDIITGNSEHDFGDDNHVHEMNTL

Query:  SHSGFLAQAVATDIDVGMNPNEEEKDMLSDNDGYHNERTDIAMKKKEFFSSQCIVDHDSFLLADGRELTVCVKCNEGGQLLSCNISDCPLVVHDKCLGSS
        SH GFL + VATDIDVGMNP+E+EKDMLSD+DGYH +   IA KKKEF SSQC+ D DSFLLA+G   +VCVKCNEGGQLLSCN+SDCPLVVHDKCL SS
Subjt:  SHSGFLAQAVATDIDVGMNPNEEEKDMLSDNDGYHNERTDIAMKKKEFFSSQCIVDHDSFLLADGRELTVCVKCNEGGQLLSCNISDCPLVVHDKCLGSS

Query:  ARMDDEGDFNCPFCFYSLAISEYLEAKKNAALAKKNVATFIRTGLGHQSIVIKEVLQQKDLDPSRKAGGEDVAKICEDVDLENKGNQVTLDGEHVKEVVD
        ARM  EG F+CPFC  SLAISEYLEAKKNAALAKKN+ATF RT LG  SIVI+E LQQKDLDPSR A  ED AKI E   LEN+ NQVTLDGEHVKE V+
Subjt:  ARMDDEGDFNCPFCFYSLAISEYLEAKKNAALAKKNVATFIRTGLGHQSIVIKEVLQQKDLDPSRKAGGEDVAKICEDVDLENKGNQVTLDGEHVKEVVD

Query:  HQSPIVTDIERIIELSRPLHVANSNHRENEASPLRVAPDVLSAGKDGDELVDQECQDNVAELEDGLKATEQHDIYDTLHEDQGPVEAAATQEGLQNQTDG
        HQSP VTDIER IELS+ L + NSN RENEAS  RVAPDVLS  KDGDELV+QECQ NVAEL+DGL+ATEQHDIYDTLHEDQGPVEAA  QEGLQ QTD 
Subjt:  HQSPIVTDIERIIELSRPLHVANSNHRENEASPLRVAPDVLSAGKDGDELVDQECQDNVAELEDGLKATEQHDIYDTLHEDQGPVEAAATQEGLQNQTDG

Query:  NGEEPAYAINIEGEK-SSDDDNDESIISRYSIRFRRKYH
          EEP+YAINI GEK S DDDND+SIISRYSIRFRRKYH
Subjt:  NGEEPAYAINIEGEK-SSDDDNDESIISRYSIRFRRKYH

XP_038904579.1 uncharacterized protein LOC120090944 [Benincasa hispida]3.0e-30768.83Show/hide
Query:  MENETESSSSSALAWYWTIEALASFTDVKPSLLHDVIDKALELPEDTRKNAGEMVALKCLEGLFGPLNYSGENGPPAQESKVMFDSSESCEDVVKRVYKE
        MEN TES+SSSALAW WTIEALA F +VKPSLLHDVIDKA EL + TRKNAGEMVALKCLEGLFGPLN  GENGPPAQESKVMFDSSESCE+VVKR+YKE
Subjt:  MENETESSSSSALAWYWTIEALASFTDVKPSLLHDVIDKALELPEDTRKNAGEMVALKCLEGLFGPLNYSGENGPPAQESKVMFDSSESCEDVVKRVYKE

Query:  TPKSALRVAGPYMSKWDVSPFIAQKRASMRCTLLQLKDSLLDGTHPYADFLKQKSGLMPVKKRDSISLNDDDLIERSKRLDSSFSDPQGQKEKHKGSPLL
        TP+SALRVAGP M KWDV PFI QK ASMRCTL QLKDS+LDGTHPYADFL QKSGL P+ KRD+ISLN++D I+ S+RLD S S PQG+KE+ KGSP L
Subjt:  TPKSALRVAGPYMSKWDVSPFIAQKRASMRCTLLQLKDSLLDGTHPYADFLKQKSGLMPVKKRDSISLNDDDLIERSKRLDSSFSDPQGQKEKHKGSPLL

Query:  LEDERRLSVADPSSSTLLPSKRSSVDLTSEDEARQLSGCDEGCTNAKKLKQHSAHTLYSGQEVASSHGTEVLADLSERVVPQNERDDTDHLDEHQMTSVE
        L+DE R+S+ +PSSS+LLPSKRS+VD TSEDEARQL GCD+G  N KKLK HSA TLYSGQEVASSHGTE++ D SER  PQ ERDDT+HLD HQ+T V 
Subjt:  LEDERRLSVADPSSSTLLPSKRSSVDLTSEDEARQLSGCDEGCTNAKKLKQHSAHTLYSGQEVASSHGTEVLADLSERVVPQNERDDTDHLDEHQMTSVE

Query:  DKVV-EEHFGS-NNGQYTATDELHQVESGAPCYTTPDSTQDGGMHEVISAEKAKDGEELPIEPKASNNSPAEGNVHDIITGNSEHDFGDDNHVHEMNTLS
        DK+V EEHFGS  +GQ TATDELH  ES  P YT   STQDG M EV+S EK  DG+ELP EPKASN+SPAEG        NS+ D G D HV+EM T+S
Subjt:  DKVV-EEHFGS-NNGQYTATDELHQVESGAPCYTTPDSTQDGGMHEVISAEKAKDGEELPIEPKASNNSPAEGNVHDIITGNSEHDFGDDNHVHEMNTLS

Query:  HSGFLAQAVATDIDVGMNPNEEEKDMLSDNDGYHNERTDIAMKKKEFFSSQCIVDHDSFLLADGRELTVCVKCNEGGQLLSCNISDCPLVVHDKCLGSSA
        HSGFL+  VAT+IDVGMNP+E+EKD+LSD+DGYH E  DIAM+KKEF SSQC+VD DSFLLAD RE+TVCVKCNEGGQLLSCNISDCPLVVH KCLGSSA
Subjt:  HSGFLAQAVATDIDVGMNPNEEEKDMLSDNDGYHNERTDIAMKKKEFFSSQCIVDHDSFLLADGRELTVCVKCNEGGQLLSCNISDCPLVVHDKCLGSSA

Query:  RMDDEGDFNCPFCFYSLAISEYLEAKKNAALAKKNVATFIRT-GLGHQSIVIKEVLQQKDLDPSRKAGGEDVAKICEDVDLENKGNQVTLDGEHVKEVVD
        RM+DEG+F CPFC YSLAIS+YLEAKK+AALAKKNVA F+    L  QSI I+EVLQQKDLDPSR+AG EDVAKI EDVDLENK N+VTLDGEHV E VD
Subjt:  RMDDEGDFNCPFCFYSLAISEYLEAKKNAALAKKNVATFIRT-GLGHQSIVIKEVLQQKDLDPSRKAGGEDVAKICEDVDLENKGNQVTLDGEHVKEVVD

Query:  HQSPIVTDIERIIELSRPLHVANSNHRENEASPLRVAPDVLSAGKDGD----------------------------------------------------
         QS  +TD ERIIELS+P+H ANSNHRENE+S LRVAPDVLS  KD +                                                    
Subjt:  HQSPIVTDIERIIELSRPLHVANSNHRENEASPLRVAPDVLSAGKDGD----------------------------------------------------

Query:  -------------------ELVDQECQDNVAELEDGLKATEQHDIYDTLHEDQGPVEAAATQEGLQNQTDGNGEEPAYAINIEGEKSSDDDNDESIISRY
                           ELVDQECQ NVAEL+DG KATEQH+IY  LH+D+GP+E A  ++ LQ QTD N +E A AI  EGEKSSDD ND+SIISRY
Subjt:  -------------------ELVDQECQDNVAELEDGLKATEQHDIYDTLHEDQGPVEAAATQEGLQNQTDGNGEEPAYAINIEGEKSSDDDNDESIISRY

Query:  SIRFRRKYHHT--ETHQLRRKKLPWTAEEEETLREGVRKFSSSVERSPNIPWKKILEFGSNVF
        SIRFR+KYHHT  ETH  RRKKLPWTAEEEE + EGVRKFSSSV+RSP IPWKKILEFGS+VF
Subjt:  SIRFRRKYHHT--ETHQLRRKKLPWTAEEEETLREGVRKFSSSVERSPNIPWKKILEFGSNVF

TrEMBL top hitse value%identityAlignment
A0A6J1DN48 uncharacterized protein LOC111021537 isoform X35.1e-30054.42Show/hide
Query:  MENETESSSSSALAWYWTIEALASFTDVKPSLLHDVIDKALELPEDTRKNAGEMVALKCLEGLFGPLNYSGENGPPAQESKVMFDSSESCEDVVKRVYKE
        MENE E  S+S LAW  TIEALASF  VKPSLLHDVI+KA EL +D RKNAGEMVAL+CLEGLFG LN   E+ PPA +SKV FDSSESCE VVKR+YKE
Subjt:  MENETESSSSSALAWYWTIEALASFTDVKPSLLHDVIDKALELPEDTRKNAGEMVALKCLEGLFGPLNYSGENGPPAQESKVMFDSSESCEDVVKRVYKE

Query:  TPKSALRVAGPYMSKWDVSPFIAQKRASMRCTLLQLKDSLLDGTHPYADFLKQKSGLMPVKKRDSISLNDDDLIERSKRLDSSFSDPQGQKEKHKG----
        TPKSALRVAGP M KWDVSPFIAQKRASMR TL QLKD++LDGTHPY DFLK KSGL+PV KRD I+LN+DD  E S+RLDSS  D QGQKEK KG    
Subjt:  TPKSALRVAGPYMSKWDVSPFIAQKRASMRCTLLQLKDSLLDGTHPYADFLKQKSGLMPVKKRDSISLNDDDLIERSKRLDSSFSDPQGQKEKHKG----

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------------------------------------------------SPLL
                                                                                                        SPLL
Subjt:  ------------------------------------------------------------------------------------------------SPLL

Query:  LEDERRLSVADPSSSTLLPSKR------------------------------------------------------------------------------
        LEDERR SVADPSSS+LLPSKR                                                                              
Subjt:  LEDERRLSVADPSSSTLLPSKR------------------------------------------------------------------------------

Query:  ----------------------SSVDLTSEDEARQLSGCDEGCTNAKKLKQHSAHTLYSGQEVASSHGTEVLADLSERVVPQNERDDTDHLDEHQMT-SV
                              S VD  SEDEA QL GCD+G  N  KL QHSA TLYSGQEVASSHGTE+L D +ERVVPQNE DD + LDE  MT  V
Subjt:  ----------------------SSVDLTSEDEARQLSGCDEGCTNAKKLKQHSAHTLYSGQEVASSHGTEVLADLSERVVPQNERDDTDHLDEHQMT-SV

Query:  EDKVV-EEHFGSNNGQ-YTATDELHQVESGAPCYTTPDSTQDGGMHEVISAEKAKDGEELPIEPKASNNSPAEGNVHDIITGNSEHDFGDDNHVHEMNTL
        EDK+  EEHFG    +  TATDELHQ ESG PC+T P   QD  MHE+IS EK  D   LP EPKASN+S AEGN+H+    +S+ D G DNHV+ +NT+
Subjt:  EDKVV-EEHFGSNNGQ-YTATDELHQVESGAPCYTTPDSTQDGGMHEVISAEKAKDGEELPIEPKASNNSPAEGNVHDIITGNSEHDFGDDNHVHEMNTL

Query:  SHSGFLAQAVATDIDVGMNPNEEEKDMLSDNDGYHNERTDIAMKKKEFFSSQCIVDHDSFLLADGRELTVCVKCNEGGQLLSCNISDCPLVVHDKCLGSS
        SHS F  + VATDIDVGMNP+EEEKDMLSD+DG+ N+  DIAMKK EFFSSQC+VDHDSF LAD +ELTVCVKCNEGGQLLSCNISDC LVVHDKCLG S
Subjt:  SHSGFLAQAVATDIDVGMNPNEEEKDMLSDNDGYHNERTDIAMKKKEFFSSQCIVDHDSFLLADGRELTVCVKCNEGGQLLSCNISDCPLVVHDKCLGSS

Query:  ARMDDEGDFNCPFCFYSLAISEYLEAKKNAALAKKNVATFIRTGLGHQSIVIKEVLQQKDLDPSRKAGGEDVAKICEDVDLENKGNQVTLDGEHVKEVVD
        ARM+DEGDF CPFCFYSLAISEYLEAK+NAALAKKNVATFIR GL HQSI IKEVLQ+KDL PSRKAG EDVAKICEDV+LENK NQVTLDGEHV EV+D
Subjt:  ARMDDEGDFNCPFCFYSLAISEYLEAKKNAALAKKNVATFIRTGLGHQSIVIKEVLQQKDLDPSRKAGGEDVAKICEDVDLENKGNQVTLDGEHVKEVVD

Query:  HQSPIVTDIERIIELSRPLHVANSNHRENEASPLRVAPDVLSAGKDGD---------------------------------------ELVDQECQDNVAE
        HQSP  TDIER  +LS+PL ++NSNHRENEA+PLRVAPDVL+  KDGD                                       ELVDQECQ NVAE
Subjt:  HQSPIVTDIERIIELSRPLHVANSNHRENEASPLRVAPDVLSAGKDGD---------------------------------------ELVDQECQDNVAE

Query:  LEDGLKATEQHDIYDTLHEDQGPVEAAATQEGLQNQTDGNGEEPAYAINIEGEKSSDDDNDESIISRYSIRFRRKYHHT--ETHQLRRKKLPWTAEEEET
        LEDGLKATEQ+DIY+ +HEDQGPVEA   QEGL+ QTD N EEP YAINIEGEKSSDD+NDESIISRYSIRFR++YH T  ET Q RRKKLPWTAEEEET
Subjt:  LEDGLKATEQHDIYDTLHEDQGPVEAAATQEGLQNQTDGNGEEPAYAINIEGEKSSDDDNDESIISRYSIRFRRKYHHT--ETHQLRRKKLPWTAEEEET

Query:  LREGVRKFSSSVERSPNIPWKKILEFGSNVFLKDRTSIDLKDKWRNLCRSPKFK
        LREGVRKFSSSV+RSP IPWKKILEFGS VFLK RTSIDLKDKWRNLCRSPKFK
Subjt:  LREGVRKFSSSVERSPNIPWKKILEFGSNVFLKDRTSIDLKDKWRNLCRSPKFK

A0A6J1FTW0 uncharacterized protein LOC111448741 isoform X25.4e-30276.59Show/hide
Query:  MENETESSSSSALAWYWTIEALASFTDVKPSLLHDVIDKALELPEDTRKNAGEMVALKCLEGLFGPLNYSGENGPPAQESKVMFDSSESCEDVVKRVYKE
        MENE ES+SSSALAW WT+EALASF +VK SLL DVID+  ELPE TRKNAGEM+AL+CLEGLFGPLNYSGENGPPA++SKVMFDSSE CEDV+KRV KE
Subjt:  MENETESSSSSALAWYWTIEALASFTDVKPSLLHDVIDKALELPEDTRKNAGEMVALKCLEGLFGPLNYSGENGPPAQESKVMFDSSESCEDVVKRVYKE

Query:  TPKSALRVAGPYMSKWDVSPFIAQKRASMRCTLLQLKDSLLDGTHPYADFLKQKSGLMPVKKRDSISLNDDDLIERSKRLDSSFSDPQGQKEKHKGSPLL
        T KSALRVA P MSKWDVSPFIAQKRASMRCTLLQL+DS+LDGTHPYADFL QKSGLMPVKK+D I LN+ DL +R +RLD+S SDPQ QKEK +GSP+L
Subjt:  TPKSALRVAGPYMSKWDVSPFIAQKRASMRCTLLQLKDSLLDGTHPYADFLKQKSGLMPVKKRDSISLNDDDLIERSKRLDSSFSDPQGQKEKHKGSPLL

Query:  LEDE-RRLSVADPSSSTLLPSKRSSVDLTSEDEARQLSGCDEGCTNAKKLKQHSAHTLYSGQEVASSHGTEVLADLSERVVPQNERDDTDHLDEHQMTSV
        LE+E RRLSVADPS+S+LLPSKRS VDLTSEDEARQL GC++G  NAK+ K   AHT +SG EVASSHG EV     E VVPQ ERDD D    HQMTSV
Subjt:  LEDE-RRLSVADPSSSTLLPSKRSSVDLTSEDEARQLSGCDEGCTNAKKLKQHSAHTLYSGQEVASSHGTEVLADLSERVVPQNERDDTDHLDEHQMTSV

Query:  EDKVV-EEHFGS-NNGQYTATDELHQVESGAPCYTTPDSTQDGGMHEVISAEKAKDGEELPIEPKASNNSPAEGNVHDIITGNSEHDFGDDNHVHEMNTL
        EDKVV EE+FGS  +GQYTATDELHQVES  PCYT    + D  M EVIS EK+KDG ELP+E KASNNSP EGN+HDIITG+SEHDFG DNHV+EMNTL
Subjt:  EDKVV-EEHFGS-NNGQYTATDELHQVESGAPCYTTPDSTQDGGMHEVISAEKAKDGEELPIEPKASNNSPAEGNVHDIITGNSEHDFGDDNHVHEMNTL

Query:  SHSGFLAQAVATDIDVGMNPNEEEKDMLSDNDGYHNERTDIAMKKKEFFSSQCIVDHDSFLLADGRELTVCVKCNEGGQLLSCNISDCPLVVHDKCLGSS
        SH GFL + V TDIDVGMNP+EEEKDMLSD+DGYHN    IA KKKEF SSQC+ D DSFLLA+G   +VCVKCNEGGQLLSCN S CPLVVHDKCL SS
Subjt:  SHSGFLAQAVATDIDVGMNPNEEEKDMLSDNDGYHNERTDIAMKKKEFFSSQCIVDHDSFLLADGRELTVCVKCNEGGQLLSCNISDCPLVVHDKCLGSS

Query:  ARMDDEGDFNCPFCFYSLAISEYLEAKKNAALAKKNVATFIRTGLGHQSIVIKEVLQQKDLDPSRKAGGEDVAKICEDVDLENKGNQVTLDGEHVKEVVD
        ARM  EG F+CPFC  SLAISEYLEAKKNAALAKKN+ATF RT LGH SIVI+E LQQKDLDPSR A  ED AKI E   LEN+ NQVTLDGEHVKE V+
Subjt:  ARMDDEGDFNCPFCFYSLAISEYLEAKKNAALAKKNVATFIRTGLGHQSIVIKEVLQQKDLDPSRKAGGEDVAKICEDVDLENKGNQVTLDGEHVKEVVD

Query:  HQSPIVTDIERIIELSRPLHVANSNHRENEASPLRVAPDVLSAGKDGDELVDQECQDNVAELEDGLKATEQHDIYDTLHEDQGPVEAAATQEGLQNQTDG
        HQS  VTDIE  IELS+PLH+ NSNHRENEAS  RVAPDVLS  KDGDELV+QEC  +VAEL+DGL+ATEQHDIYDTLHEDQGPVEAAA QEGLQ  TD 
Subjt:  HQSPIVTDIERIIELSRPLHVANSNHRENEASPLRVAPDVLSAGKDGDELVDQECQDNVAELEDGLKATEQHDIYDTLHEDQGPVEAAATQEGLQNQTDG

Query:  NGEEPAYAINIEGEK-SSDDDNDESIISRYSIRFRRKYH
          EEP+YAINI GEK S DDDND+SIISRYSIRFRRKYH
Subjt:  NGEEPAYAINIEGEK-SSDDDNDESIISRYSIRFRRKYH

A0A6J1FXU9 uncharacterized protein LOC111448741 isoform X15.4e-30276.59Show/hide
Query:  MENETESSSSSALAWYWTIEALASFTDVKPSLLHDVIDKALELPEDTRKNAGEMVALKCLEGLFGPLNYSGENGPPAQESKVMFDSSESCEDVVKRVYKE
        MENE ES+SSSALAW WT+EALASF +VK SLL DVID+  ELPE TRKNAGEM+AL+CLEGLFGPLNYSGENGPPA++SKVMFDSSE CEDV+KRV KE
Subjt:  MENETESSSSSALAWYWTIEALASFTDVKPSLLHDVIDKALELPEDTRKNAGEMVALKCLEGLFGPLNYSGENGPPAQESKVMFDSSESCEDVVKRVYKE

Query:  TPKSALRVAGPYMSKWDVSPFIAQKRASMRCTLLQLKDSLLDGTHPYADFLKQKSGLMPVKKRDSISLNDDDLIERSKRLDSSFSDPQGQKEKHKGSPLL
        T KSALRVA P MSKWDVSPFIAQKRASMRCTLLQL+DS+LDGTHPYADFL QKSGLMPVKK+D I LN+ DL +R +RLD+S SDPQ QKEK +GSP+L
Subjt:  TPKSALRVAGPYMSKWDVSPFIAQKRASMRCTLLQLKDSLLDGTHPYADFLKQKSGLMPVKKRDSISLNDDDLIERSKRLDSSFSDPQGQKEKHKGSPLL

Query:  LEDE-RRLSVADPSSSTLLPSKRSSVDLTSEDEARQLSGCDEGCTNAKKLKQHSAHTLYSGQEVASSHGTEVLADLSERVVPQNERDDTDHLDEHQMTSV
        LE+E RRLSVADPS+S+LLPSKRS VDLTSEDEARQL GC++G  NAK+ K   AHT +SG EVASSHG EV     E VVPQ ERDD D    HQMTSV
Subjt:  LEDE-RRLSVADPSSSTLLPSKRSSVDLTSEDEARQLSGCDEGCTNAKKLKQHSAHTLYSGQEVASSHGTEVLADLSERVVPQNERDDTDHLDEHQMTSV

Query:  EDKVV-EEHFGS-NNGQYTATDELHQVESGAPCYTTPDSTQDGGMHEVISAEKAKDGEELPIEPKASNNSPAEGNVHDIITGNSEHDFGDDNHVHEMNTL
        EDKVV EE+FGS  +GQYTATDELHQVES  PCYT    + D  M EVIS EK+KDG ELP+E KASNNSP EGN+HDIITG+SEHDFG DNHV+EMNTL
Subjt:  EDKVV-EEHFGS-NNGQYTATDELHQVESGAPCYTTPDSTQDGGMHEVISAEKAKDGEELPIEPKASNNSPAEGNVHDIITGNSEHDFGDDNHVHEMNTL

Query:  SHSGFLAQAVATDIDVGMNPNEEEKDMLSDNDGYHNERTDIAMKKKEFFSSQCIVDHDSFLLADGRELTVCVKCNEGGQLLSCNISDCPLVVHDKCLGSS
        SH GFL + V TDIDVGMNP+EEEKDMLSD+DGYHN    IA KKKEF SSQC+ D DSFLLA+G   +VCVKCNEGGQLLSCN S CPLVVHDKCL SS
Subjt:  SHSGFLAQAVATDIDVGMNPNEEEKDMLSDNDGYHNERTDIAMKKKEFFSSQCIVDHDSFLLADGRELTVCVKCNEGGQLLSCNISDCPLVVHDKCLGSS

Query:  ARMDDEGDFNCPFCFYSLAISEYLEAKKNAALAKKNVATFIRTGLGHQSIVIKEVLQQKDLDPSRKAGGEDVAKICEDVDLENKGNQVTLDGEHVKEVVD
        ARM  EG F+CPFC  SLAISEYLEAKKNAALAKKN+ATF RT LGH SIVI+E LQQKDLDPSR A  ED AKI E   LEN+ NQVTLDGEHVKE V+
Subjt:  ARMDDEGDFNCPFCFYSLAISEYLEAKKNAALAKKNVATFIRTGLGHQSIVIKEVLQQKDLDPSRKAGGEDVAKICEDVDLENKGNQVTLDGEHVKEVVD

Query:  HQSPIVTDIERIIELSRPLHVANSNHRENEASPLRVAPDVLSAGKDGDELVDQECQDNVAELEDGLKATEQHDIYDTLHEDQGPVEAAATQEGLQNQTDG
        HQS  VTDIE  IELS+PLH+ NSNHRENEAS  RVAPDVLS  KDGDELV+QEC  +VAEL+DGL+ATEQHDIYDTLHEDQGPVEAAA QEGLQ  TD 
Subjt:  HQSPIVTDIERIIELSRPLHVANSNHRENEASPLRVAPDVLSAGKDGDELVDQECQDNVAELEDGLKATEQHDIYDTLHEDQGPVEAAATQEGLQNQTDG

Query:  NGEEPAYAINIEGEK-SSDDDNDESIISRYSIRFRRKYH
          EEP+YAINI GEK S DDDND+SIISRYSIRFRRKYH
Subjt:  NGEEPAYAINIEGEK-SSDDDNDESIISRYSIRFRRKYH

A0A6J1I5T3 uncharacterized protein LOC111470170 isoform X21.3e-29875.34Show/hide
Query:  MENETESSSSSALAWYWTIEALASFTDVKPSLLHDVIDKALELPEDTRKNAGEMVALKCLEGLFGPLNYSGENGPPAQESKVMFDSSESCEDVVKRVYKE
        MENE ES+SSSALAW WT+EALASF +VK SLL DVID+  E PE TRKNAGEM+AL+CLEGLFGPLNY GENGPPA++SKVMFDSSE CEDV+KRV KE
Subjt:  MENETESSSSSALAWYWTIEALASFTDVKPSLLHDVIDKALELPEDTRKNAGEMVALKCLEGLFGPLNYSGENGPPAQESKVMFDSSESCEDVVKRVYKE

Query:  TPKSALRVAGPYMSKWDVSPFIAQKRASMRCTLLQLKDSLLDGTHPYADFLKQKSGLMPVKKRDSISLNDDDLIERSKRLDSSFSDPQGQKEKHKGSPLL
        T KSALRVA P MSKWDVSPFIAQKRASMRCTLLQL+DS+LDGTHPYADFL QKSGLMPVKK D I LN+ DL ER +RLD+S SDPQ QKEK +GSP+L
Subjt:  TPKSALRVAGPYMSKWDVSPFIAQKRASMRCTLLQLKDSLLDGTHPYADFLKQKSGLMPVKKRDSISLNDDDLIERSKRLDSSFSDPQGQKEKHKGSPLL

Query:  LEDE-RRLSVADPSSSTLLPSKRSSVDLTSEDEARQLSGCDEGCTNAKKLKQHSAHTLYSGQEVASSHGTEVLADLSERVVPQNERDDTDHLDEHQMTSV
        LE E RRLSVADPS+S+LLPSKRS VDLTSEDEARQL GC++G  NAK+ K   AHTL+SG  VASSHG +V     ERVVPQ ERDD D    HQMTSV
Subjt:  LEDE-RRLSVADPSSSTLLPSKRSSVDLTSEDEARQLSGCDEGCTNAKKLKQHSAHTLYSGQEVASSHGTEVLADLSERVVPQNERDDTDHLDEHQMTSV

Query:  EDKVV-EEHFGS-NNGQYTATDELHQVESGAPCYTTPDSTQDGGMHEVISAEKAKDGEELPIEPKASNNSPAEGNVHDIITGNSEHDFGDDNHVHEMNTL
        EDK+V EEHFGS  +GQYTATDELHQVES  PCYT    +QD  M EVIS EK+KDG ELP+E KA NNSP EGN+HDIITG+SEHDFG DNHV+EMNT+
Subjt:  EDKVV-EEHFGS-NNGQYTATDELHQVESGAPCYTTPDSTQDGGMHEVISAEKAKDGEELPIEPKASNNSPAEGNVHDIITGNSEHDFGDDNHVHEMNTL

Query:  SHSGFLAQAVATDIDVGMNPNEEEKDMLSDNDGYHNERTDIAMKKKEFFSSQCIVDHDSFLLADGRELTVCVKCNEGGQLLSCNISDCPLVVHDKCLGSS
        SH GFL + VA DIDV MNP+E+EKDMLSD+DGYH     IA +KKEF SSQC+ D DSFLLA+G   +VCVKCNEGGQLLSCN+S CPLVVHDKCL SS
Subjt:  SHSGFLAQAVATDIDVGMNPNEEEKDMLSDNDGYHNERTDIAMKKKEFFSSQCIVDHDSFLLADGRELTVCVKCNEGGQLLSCNISDCPLVVHDKCLGSS

Query:  ARMDDEGDFNCPFCFYSLAISEYLEAKKNAALAKKNVATFIRTGLGHQSIVIKEVLQQKDLDPSRKAGGEDVAKICEDVDLENKGNQVTLDGEHVKEVVD
        ARM  EG F+CPFC  SLAISEYLEAKKNAALAKKN+ATF RT LGH SIVI+E LQQKDLDPSR A  ED AK+ E+  LEN+ NQVT DGEHVKE V+
Subjt:  ARMDDEGDFNCPFCFYSLAISEYLEAKKNAALAKKNVATFIRTGLGHQSIVIKEVLQQKDLDPSRKAGGEDVAKICEDVDLENKGNQVTLDGEHVKEVVD

Query:  HQSPIVTDIERIIELSRPLHVANSNHRENEASPLRVAPDVLSAGKDGDELVDQECQDNVAELEDGLKATEQHDIYDTLHEDQGPVEAAATQEGLQNQTDG
        H+S  VTDIERIIELS+PLH+ +SNHRENEAS  RVAPDVLS  KDGD+LV+QECQ NVAEL+DGL+ATE+HDIYDTLHEDQGPVEAAA QEGLQ QTD 
Subjt:  HQSPIVTDIERIIELSRPLHVANSNHRENEASPLRVAPDVLSAGKDGDELVDQECQDNVAELEDGLKATEQHDIYDTLHEDQGPVEAAATQEGLQNQTDG

Query:  NGEEPAYAINIEGEKSSDDDNDESIISRYSIRFRRKYH
          EEP+YAINI  + S DDDND+SIISRYSIRFRRKYH
Subjt:  NGEEPAYAINIEGEKSSDDDNDESIISRYSIRFRRKYH

A0A6J1I8L2 uncharacterized protein LOC111470170 isoform X11.3e-29875.34Show/hide
Query:  MENETESSSSSALAWYWTIEALASFTDVKPSLLHDVIDKALELPEDTRKNAGEMVALKCLEGLFGPLNYSGENGPPAQESKVMFDSSESCEDVVKRVYKE
        MENE ES+SSSALAW WT+EALASF +VK SLL DVID+  E PE TRKNAGEM+AL+CLEGLFGPLNY GENGPPA++SKVMFDSSE CEDV+KRV KE
Subjt:  MENETESSSSSALAWYWTIEALASFTDVKPSLLHDVIDKALELPEDTRKNAGEMVALKCLEGLFGPLNYSGENGPPAQESKVMFDSSESCEDVVKRVYKE

Query:  TPKSALRVAGPYMSKWDVSPFIAQKRASMRCTLLQLKDSLLDGTHPYADFLKQKSGLMPVKKRDSISLNDDDLIERSKRLDSSFSDPQGQKEKHKGSPLL
        T KSALRVA P MSKWDVSPFIAQKRASMRCTLLQL+DS+LDGTHPYADFL QKSGLMPVKK D I LN+ DL ER +RLD+S SDPQ QKEK +GSP+L
Subjt:  TPKSALRVAGPYMSKWDVSPFIAQKRASMRCTLLQLKDSLLDGTHPYADFLKQKSGLMPVKKRDSISLNDDDLIERSKRLDSSFSDPQGQKEKHKGSPLL

Query:  LEDE-RRLSVADPSSSTLLPSKRSSVDLTSEDEARQLSGCDEGCTNAKKLKQHSAHTLYSGQEVASSHGTEVLADLSERVVPQNERDDTDHLDEHQMTSV
        LE E RRLSVADPS+S+LLPSKRS VDLTSEDEARQL GC++G  NAK+ K   AHTL+SG  VASSHG +V     ERVVPQ ERDD D    HQMTSV
Subjt:  LEDE-RRLSVADPSSSTLLPSKRSSVDLTSEDEARQLSGCDEGCTNAKKLKQHSAHTLYSGQEVASSHGTEVLADLSERVVPQNERDDTDHLDEHQMTSV

Query:  EDKVV-EEHFGS-NNGQYTATDELHQVESGAPCYTTPDSTQDGGMHEVISAEKAKDGEELPIEPKASNNSPAEGNVHDIITGNSEHDFGDDNHVHEMNTL
        EDK+V EEHFGS  +GQYTATDELHQVES  PCYT    +QD  M EVIS EK+KDG ELP+E KA NNSP EGN+HDIITG+SEHDFG DNHV+EMNT+
Subjt:  EDKVV-EEHFGS-NNGQYTATDELHQVESGAPCYTTPDSTQDGGMHEVISAEKAKDGEELPIEPKASNNSPAEGNVHDIITGNSEHDFGDDNHVHEMNTL

Query:  SHSGFLAQAVATDIDVGMNPNEEEKDMLSDNDGYHNERTDIAMKKKEFFSSQCIVDHDSFLLADGRELTVCVKCNEGGQLLSCNISDCPLVVHDKCLGSS
        SH GFL + VA DIDV MNP+E+EKDMLSD+DGYH     IA +KKEF SSQC+ D DSFLLA+G   +VCVKCNEGGQLLSCN+S CPLVVHDKCL SS
Subjt:  SHSGFLAQAVATDIDVGMNPNEEEKDMLSDNDGYHNERTDIAMKKKEFFSSQCIVDHDSFLLADGRELTVCVKCNEGGQLLSCNISDCPLVVHDKCLGSS

Query:  ARMDDEGDFNCPFCFYSLAISEYLEAKKNAALAKKNVATFIRTGLGHQSIVIKEVLQQKDLDPSRKAGGEDVAKICEDVDLENKGNQVTLDGEHVKEVVD
        ARM  EG F+CPFC  SLAISEYLEAKKNAALAKKN+ATF RT LGH SIVI+E LQQKDLDPSR A  ED AK+ E+  LEN+ NQVT DGEHVKE V+
Subjt:  ARMDDEGDFNCPFCFYSLAISEYLEAKKNAALAKKNVATFIRTGLGHQSIVIKEVLQQKDLDPSRKAGGEDVAKICEDVDLENKGNQVTLDGEHVKEVVD

Query:  HQSPIVTDIERIIELSRPLHVANSNHRENEASPLRVAPDVLSAGKDGDELVDQECQDNVAELEDGLKATEQHDIYDTLHEDQGPVEAAATQEGLQNQTDG
        H+S  VTDIERIIELS+PLH+ +SNHRENEAS  RVAPDVLS  KDGD+LV+QECQ NVAEL+DGL+ATE+HDIYDTLHEDQGPVEAAA QEGLQ QTD 
Subjt:  HQSPIVTDIERIIELSRPLHVANSNHRENEASPLRVAPDVLSAGKDGDELVDQECQDNVAELEDGLKATEQHDIYDTLHEDQGPVEAAATQEGLQNQTDG

Query:  NGEEPAYAINIEGEKSSDDDNDESIISRYSIRFRRKYH
          EEP+YAINI  + S DDDND+SIISRYSIRFRRKYH
Subjt:  NGEEPAYAINIEGEKSSDDDNDESIISRYSIRFRRKYH

SwissProt top hitse value%identityAlignment
F4I7L1 Telomere repeat-binding factor 46.3e-0548.48Show/hide
Query:  KKLPWTAEEEETLREGVRKFSSSVERSPNIPWKKIL---EFGSNVFLKDRTSIDLKDKWRNLCRSP
        +KL WTAEEEE L  GVRK            WK IL   E      L  R++IDLKDKWRNL  +P
Subjt:  KKLPWTAEEEETLREGVRKFSSSVERSPNIPWKKIL---EFGSNVFLKDRTSIDLKDKWRNLCRSP

F4IEY4 Telomere repeat-binding factor 58.2e-0548.48Show/hide
Query:  KKLPWTAEEEETLREGVRKFSSSVERSPNIPWKKIL---EFGSNVFLKDRTSIDLKDKWRNLCRSP
        +KL WTAEEEE L  G+RK            WK IL   EF     L  R++IDLKDKWRNL   P
Subjt:  KKLPWTAEEEETLREGVRKFSSSVERSPNIPWKKIL---EFGSNVFLKDRTSIDLKDKWRNLCRSP

Arabidopsis top hitse value%identityAlignment
AT1G01150.1 Homeodomain-like protein with RING/FYVE/PHD-type zinc finger domain1.4e-1255.93Show/hide
Query:  KKLPWTAEEEETLREGVRKFSSSVERSPNIPWKKILEFGSNVFLKDRTSIDLKDKWRNL
        K++ WT  EE+ LREGV KFS ++ +  N+PWKKILE G  +F   R S DLKDKWRN+
Subjt:  KKLPWTAEEEETLREGVRKFSSSVERSPNIPWKKILEFGSNVFLKDRTSIDLKDKWRNL

AT1G14770.1 RING/FYVE/PHD zinc finger superfamily protein7.4e-1722.01Show/hide
Query:  NETESSSSSALAWYWTIEALASFTDVKPSLLHDVIDKALELPEDTRKNAGEMVALKCLEGLFGPLNYSGENG---PPAQESKVMFDSSESCEDVVKRVYK
        +E  S  +    W W IE +A F      LL D+++   +  +D  K   E+++L+ LE +F P + S  NG     A E KV FD S S  DV++ + K
Subjt:  NETESSSSSALAWYWTIEALASFTDVKPSLLHDVIDKALELPEDTRKNAGEMVALKCLEGLFGPLNYSGENG---PPAQESKVMFDSSESCEDVVKRVYK

Query:  ETPKSALRVAGPYMSKWDVSPFIAQKRASM-RCTLLQLKDSLLDGTHPYADFLKQKSGLMPVKKRDSISLNDDDLIERSKRLDSSFSDPQGQKEKHKGSP
        E P + LRV  P +SK++V PFIA K   + +C L +L+D            ++ ++   P  + +     DD    RS  +D    +P  +++ H G+ 
Subjt:  ETPKSALRVAGPYMSKWDVSPFIAQKRASM-RCTLLQLKDSLLDGTHPYADFLKQKSGLMPVKKRDSISLNDDDLIERSKRLDSSFSDPQGQKEKHKGSP

Query:  LLLEDERRLSVADPSSSTLLPSKRSSVDLTSEDEARQLSGCDEGCTNAKKLKQHSAHTLYSGQEVASSHGTEVLADLSERVVPQNERDDTDHLDEHQMTS
            DE+ + + +   S         ++L   ++ + ++  DE             HT   G+ V  +                   DDT   +  Q T+
Subjt:  LLLEDERRLSVADPSSSTLLPSKRSSVDLTSEDEARQLSGCDEGCTNAKKLKQHSAHTLYSGQEVASSHGTEVLADLSERVVPQNERDDTDHLDEHQMTS

Query:  VEDKVVEEHFGSNNGQYTATDELHQVESGAPCYTTPDSTQDGGMHEVISAEKAKDGEELPIEPKASNNSPAEGNVHDIITGNSEHDFGDDNHVHEMNTLS
         +      H G+   + T +    +    A    T D T                                      +I+G+ +    D   + +     
Subjt:  VEDKVVEEHFGSNNGQYTATDELHQVESGAPCYTTPDSTQDGGMHEVISAEKAKDGEELPIEPKASNNSPAEGNVHDIITGNSEHDFGDDNHVHEMNTLS

Query:  HSGFLAQAVATDIDVGMNPNEEEKDMLSDNDGYHNERTDIAMKKKEFFSSQCIVDHDSFLLADGRELTVCVKCNEGGQLLSCNISDCPLVVHDKCLGSSA
                         NPN            Y N     +  + E                      VC KC + G LL C+ S+C   VH +CL    
Subjt:  HSGFLAQAVATDIDVGMNPNEEEKDMLSDNDGYHNERTDIAMKKKEFFSSQCIVDHDSFLLADGRELTVCVKCNEGGQLLSCNISDCPLVVHDKCLGSSA

Query:  RMDDEGDFNCPFCFYSLAISEYLEAKKNAALAKKNV
         +D+ G+F CP C+Y     EY E++K  + AK+ +
Subjt:  RMDDEGDFNCPFCFYSLAISEYLEAKKNAALAKKNV

AT1G14770.2 RING/FYVE/PHD zinc finger superfamily protein7.4e-1722.01Show/hide
Query:  NETESSSSSALAWYWTIEALASFTDVKPSLLHDVIDKALELPEDTRKNAGEMVALKCLEGLFGPLNYSGENG---PPAQESKVMFDSSESCEDVVKRVYK
        +E  S  +    W W IE +A F      LL D+++   +  +D  K   E+++L+ LE +F P + S  NG     A E KV FD S S  DV++ + K
Subjt:  NETESSSSSALAWYWTIEALASFTDVKPSLLHDVIDKALELPEDTRKNAGEMVALKCLEGLFGPLNYSGENG---PPAQESKVMFDSSESCEDVVKRVYK

Query:  ETPKSALRVAGPYMSKWDVSPFIAQKRASM-RCTLLQLKDSLLDGTHPYADFLKQKSGLMPVKKRDSISLNDDDLIERSKRLDSSFSDPQGQKEKHKGSP
        E P + LRV  P +SK++V PFIA K   + +C L +L+D            ++ ++   P  + +     DD    RS  +D    +P  +++ H G+ 
Subjt:  ETPKSALRVAGPYMSKWDVSPFIAQKRASM-RCTLLQLKDSLLDGTHPYADFLKQKSGLMPVKKRDSISLNDDDLIERSKRLDSSFSDPQGQKEKHKGSP

Query:  LLLEDERRLSVADPSSSTLLPSKRSSVDLTSEDEARQLSGCDEGCTNAKKLKQHSAHTLYSGQEVASSHGTEVLADLSERVVPQNERDDTDHLDEHQMTS
            DE+ + + +   S         ++L   ++ + ++  DE             HT   G+ V  +                   DDT   +  Q T+
Subjt:  LLLEDERRLSVADPSSSTLLPSKRSSVDLTSEDEARQLSGCDEGCTNAKKLKQHSAHTLYSGQEVASSHGTEVLADLSERVVPQNERDDTDHLDEHQMTS

Query:  VEDKVVEEHFGSNNGQYTATDELHQVESGAPCYTTPDSTQDGGMHEVISAEKAKDGEELPIEPKASNNSPAEGNVHDIITGNSEHDFGDDNHVHEMNTLS
         +      H G+   + T +    +    A    T D T                                      +I+G+ +    D   + +     
Subjt:  VEDKVVEEHFGSNNGQYTATDELHQVESGAPCYTTPDSTQDGGMHEVISAEKAKDGEELPIEPKASNNSPAEGNVHDIITGNSEHDFGDDNHVHEMNTLS

Query:  HSGFLAQAVATDIDVGMNPNEEEKDMLSDNDGYHNERTDIAMKKKEFFSSQCIVDHDSFLLADGRELTVCVKCNEGGQLLSCNISDCPLVVHDKCLGSSA
                         NPN            Y N     +  + E                      VC KC + G LL C+ S+C   VH +CL    
Subjt:  HSGFLAQAVATDIDVGMNPNEEEKDMLSDNDGYHNERTDIAMKKKEFFSSQCIVDHDSFLLADGRELTVCVKCNEGGQLLSCNISDCPLVVHDKCLGSSA

Query:  RMDDEGDFNCPFCFYSLAISEYLEAKKNAALAKKNV
         +D+ G+F CP C+Y     EY E++K  + AK+ +
Subjt:  RMDDEGDFNCPFCFYSLAISEYLEAKKNAALAKKNV

AT1G68030.1 RING/FYVE/PHD zinc finger superfamily protein3.8e-1330.23Show/hide
Query:  WYWTIEALASFTDVKPSLLHDVIDKALELPEDTRKNAGEMVALKCLEGLFGPLNYSGENGPPAQESKVMFDSSESCEDVVKRVYKETPKSALRVAGPYMS
        W W IE  A F   +  +L+DV + A +LP+   +   EMVA +CL  LF   +           S + FDSSESCE V++ +  E P S L+   P ++
Subjt:  WYWTIEALASFTDVKPSLLHDVIDKALELPEDTRKNAGEMVALKCLEGLFGPLNYSGENGPPAQESKVMFDSSESCEDVVKRVYKETPKSALRVAGPYMS

Query:  KWDVSPFIAQKRASM-RCTL-LQLKDSLLDGTHPYADFLKQKSGLMPVKKRDSIS--LNDDDLIERSKRLDS
        KW++ PFI  K  S+ +C L L ++ S  +     +   ++++     K+ D ++  +++ DL  R++  +S
Subjt:  KWDVSPFIAQKRASM-RCTL-LQLKDSLLDGTHPYADFLKQKSGLMPVKKRDSIS--LNDDDLIERSKRLDS

AT1G68030.1 RING/FYVE/PHD zinc finger superfamily protein7.9e-1138.89Show/hide
Query:  CVKCNEGGQLLSCNISDCPLVVHDKCLGSSARMDDEGDFNCPFCFYSLAISEYLEAKKNAALAKKNVATFIR
        CV C E G+LL C+   C ++VH KCL S     D GDF C  C  +   +EY++ +   A AK+ + +F+R
Subjt:  CVKCNEGGQLLSCNISDCPLVVHDKCLGSSARMDDEGDFNCPFCFYSLAISEYLEAKKNAALAKKNVATFIR

AT5G03780.1 TRF-like 104.2e-1220.86Show/hide
Query:  VGMNPNEEEKDMLSDNDGYHN---ERTDIAMKKKEFFSSQCIVDHDSFLLADGRELTVCVKCNEGGQLLS-CNISDCPLVVHDKCL---------GSSAR
        + +  + +  D L D++   N   +  D+  KK+    +    D     +    +L  C+ C    + +S C   DC L  H +CL          SS+ 
Subjt:  VGMNPNEEEKDMLSDNDGYHN---ERTDIAMKKKEFFSSQCIVDHDSFLLADGRELTVCVKCNEGGQLLS-CNISDCPLVVHDKCL---------GSSAR

Query:  MDDEGDFNCPFCFYSLAISEYLEAKKNAALAKKNVATFIRTGLGHQSIVIKEVLQQKDLDPSRKAGGEDVAKICEDVDLENKGNQVTLDGEHVKEVVDHQ
         +D  +  CP+C+  +       A K+  L +K V                E    K LD   K+  ED+    +++  + +   +  D E   E     
Subjt:  MDDEGDFNCPFCFYSLAISEYLEAKKNAALAKKNVATFIRTGLGHQSIVIKEVLQQKDLDPSRKAGGEDVAKICEDVDLENKGNQVTLDGEHVKEVVDHQ

Query:  SPIVTDIERI--------IELSRPLHVANSNHRENEASPLRVAPDVLSAGKDGDELVDQECQDNVAELEDGLKATEQHDIYDTLHEDQGP---VEAAATQ
        S    D  ++        +  S  +        E++ +       +L+ G      V      +   +++   A EQ  +       +     +++  + 
Subjt:  SPIVTDIERI--------IELSRPLHVANSNHRENEASPLRVAPDVLSAGKDGDELVDQECQDNVAELEDGLKATEQHDIYDTLHEDQGP---VEAAATQ

Query:  EGLQNQTDGNGEEPAYAINIEGEKSSDDDNDESIISRYSIRFRRKYHHTETHQLRRKKLPWTAEEEETLREGVRKFSSSVERSPNIPWKKILEFGSNVFL
        +G  N+ +G         +++    S    ++   ++   + +     +     +R++L WT EEEE L+ GV KF++  E + N+PW+KILE G  VF 
Subjt:  EGLQNQTDGNGEEPAYAINIEGEKSSDDDNDESIISRYSIRFRRKYHHTETHQLRRKKLPWTAEEEETLREGVRKFSSSVERSPNIPWKKILEFGSNVFL

Query:  KDRTSIDLKDKWRNLCR
        + RT  DLKDKWR++ +
Subjt:  KDRTSIDLKDKWRNLCR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAATGAAACCGAATCTTCTTCAAGTTCAGCTCTTGCTTGGTATTGGACCATCGAAGCCCTTGCAAGTTTTACAGACGTGAAACCGTCTCTATTACATGATGTAAT
TGATAAGGCTTTAGAATTACCTGAGGATACAAGAAAGAATGCCGGGGAAATGGTTGCTTTGAAATGCTTGGAGGGTTTGTTTGGTCCTTTAAATTATAGTGGAGAAAATG
GTCCTCCTGCTCAAGAATCAAAAGTTATGTTTGATTCATCTGAGAGCTGTGAAGATGTTGTTAAACGCGTATATAAGGAGACTCCAAAATCTGCCTTAAGAGTGGCTGGA
CCATATATGTCTAAATGGGATGTTAGTCCCTTTATCGCACAAAAAAGAGCATCCATGCGTTGTACATTACTTCAGCTGAAAGATTCACTTCTTGATGGTACACATCCATA
TGCTGATTTCTTAAAGCAGAAGAGTGGGTTGATGCCTGTAAAGAAGCGGGATAGCATTTCTCTGAATGATGATGATCTTATCGAGCGCAGCAAGAGGCTTGATAGTAGCT
TCTCTGATCCTCAAGGTCAAAAAGAAAAACACAAAGGAAGCCCTCTACTTCTTGAGGATGAAAGAAGACTGTCGGTGGCAGATCCATCTAGTTCTACTTTGTTACCATCT
AAAAGAAGTAGTGTTGACTTAACATCTGAAGATGAGGCAAGACAGTTATCTGGTTGTGATGAGGGTTGCACAAATGCTAAGAAGCTTAAGCAGCATTCTGCCCATACTTT
GTATTCAGGACAGGAAGTGGCTTCTTCACATGGTACAGAGGTGTTAGCAGATTTATCTGAAAGAGTTGTGCCACAAAACGAGAGAGATGATACTGATCACTTGGATGAAC
ATCAGATGACTTCGGTGGAAGACAAAGTTGTAGAGGAGCATTTTGGGTCAAATAATGGACAGTATACTGCTACTGATGAATTGCATCAGGTTGAATCAGGTGCTCCTTGT
TATACTACGCCGGATTCTACACAAGATGGTGGGATGCATGAAGTTATTAGTGCCGAGAAAGCAAAAGATGGAGAGGAACTGCCTATTGAACCAAAAGCATCTAATAATTC
TCCTGCTGAAGGAAACGTGCATGATATCATCACTGGTAATTCCGAGCATGACTTTGGGGATGATAATCATGTACATGAAATGAATACTCTGTCTCATAGTGGATTTCTGG
CACAGGCTGTTGCTACCGATATTGATGTTGGCATGAATCCCAATGAGGAAGAGAAAGATATGTTAAGTGATAATGATGGATATCATAATGAAAGGACAGATATTGCAATG
AAAAAAAAAGAGTTCTTTAGTTCTCAATGTATAGTCGATCATGATTCCTTTCTATTAGCTGATGGGAGAGAGCTAACCGTTTGTGTGAAATGCAATGAAGGTGGTCAGTT
GTTGTCTTGTAATATTAGTGATTGTCCTTTGGTGGTTCATGATAAGTGCTTGGGTTCCTCAGCTAGGATGGATGATGAAGGCGATTTTAATTGTCCTTTCTGCTTTTATT
CACTTGCTATATCAGAATACCTTGAAGCTAAGAAGAATGCTGCATTGGCAAAGAAAAATGTTGCTACTTTTATTCGTACTGGTTTAGGACACCAGTCAATTGTTATTAAA
GAGGTATTGCAACAAAAAGATCTTGATCCTTCACGAAAAGCTGGAGGTGAGGATGTTGCTAAAATTTGTGAAGATGTTGACTTGGAAAATAAAGGCAATCAAGTAACTCT
AGATGGAGAACATGTAAAGGAAGTTGTTGACCATCAATCCCCAATAGTTACGGATATTGAGCGAATAATAGAGCTGTCAAGGCCATTGCATGTTGCCAATTCCAATCATA
GAGAAAACGAGGCAAGTCCTTTGAGAGTAGCTCCTGATGTTTTATCTGCAGGTAAAGACGGTGATGAATTGGTGGACCAAGAGTGTCAAGATAATGTTGCAGAACTGGAA
GATGGTCTAAAAGCCACAGAGCAGCATGACATTTATGATACTCTCCACGAAGATCAAGGGCCAGTTGAAGCAGCGGCTACACAAGAAGGATTACAGAACCAGACTGATGG
TAACGGAGAGGAACCTGCTTATGCAATTAATATTGAAGGAGAAAAATCTTCTGATGATGACAATGATGAGTCTATCATTTCTAGATACTCCATAAGATTTCGACGGAAAT
ATCATCATACGGAAACTCATCAATTAAGACGGAAGAAACTACCCTGGACAGCTGAGGAGGAAGAGACTCTAAGGGAGGGAGTTCGAAAATTCTCTAGTTCAGTTGAAAGA
AGTCCAAACATACCTTGGAAAAAGATTTTAGAATTTGGTAGTAATGTGTTTTTGAAAGATCGTACATCGATAGATCTTAAAGATAAATGGAGGAACTTGTGCAGAAGTCC
AAAGTTTAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGAATGAAACCGAATCTTCTTCAAGTTCAGCTCTTGCTTGGTATTGGACCATCGAAGCCCTTGCAAGTTTTACAGACGTGAAACCGTCTCTATTACATGATGTAAT
TGATAAGGCTTTAGAATTACCTGAGGATACAAGAAAGAATGCCGGGGAAATGGTTGCTTTGAAATGCTTGGAGGGTTTGTTTGGTCCTTTAAATTATAGTGGAGAAAATG
GTCCTCCTGCTCAAGAATCAAAAGTTATGTTTGATTCATCTGAGAGCTGTGAAGATGTTGTTAAACGCGTATATAAGGAGACTCCAAAATCTGCCTTAAGAGTGGCTGGA
CCATATATGTCTAAATGGGATGTTAGTCCCTTTATCGCACAAAAAAGAGCATCCATGCGTTGTACATTACTTCAGCTGAAAGATTCACTTCTTGATGGTACACATCCATA
TGCTGATTTCTTAAAGCAGAAGAGTGGGTTGATGCCTGTAAAGAAGCGGGATAGCATTTCTCTGAATGATGATGATCTTATCGAGCGCAGCAAGAGGCTTGATAGTAGCT
TCTCTGATCCTCAAGGTCAAAAAGAAAAACACAAAGGAAGCCCTCTACTTCTTGAGGATGAAAGAAGACTGTCGGTGGCAGATCCATCTAGTTCTACTTTGTTACCATCT
AAAAGAAGTAGTGTTGACTTAACATCTGAAGATGAGGCAAGACAGTTATCTGGTTGTGATGAGGGTTGCACAAATGCTAAGAAGCTTAAGCAGCATTCTGCCCATACTTT
GTATTCAGGACAGGAAGTGGCTTCTTCACATGGTACAGAGGTGTTAGCAGATTTATCTGAAAGAGTTGTGCCACAAAACGAGAGAGATGATACTGATCACTTGGATGAAC
ATCAGATGACTTCGGTGGAAGACAAAGTTGTAGAGGAGCATTTTGGGTCAAATAATGGACAGTATACTGCTACTGATGAATTGCATCAGGTTGAATCAGGTGCTCCTTGT
TATACTACGCCGGATTCTACACAAGATGGTGGGATGCATGAAGTTATTAGTGCCGAGAAAGCAAAAGATGGAGAGGAACTGCCTATTGAACCAAAAGCATCTAATAATTC
TCCTGCTGAAGGAAACGTGCATGATATCATCACTGGTAATTCCGAGCATGACTTTGGGGATGATAATCATGTACATGAAATGAATACTCTGTCTCATAGTGGATTTCTGG
CACAGGCTGTTGCTACCGATATTGATGTTGGCATGAATCCCAATGAGGAAGAGAAAGATATGTTAAGTGATAATGATGGATATCATAATGAAAGGACAGATATTGCAATG
AAAAAAAAAGAGTTCTTTAGTTCTCAATGTATAGTCGATCATGATTCCTTTCTATTAGCTGATGGGAGAGAGCTAACCGTTTGTGTGAAATGCAATGAAGGTGGTCAGTT
GTTGTCTTGTAATATTAGTGATTGTCCTTTGGTGGTTCATGATAAGTGCTTGGGTTCCTCAGCTAGGATGGATGATGAAGGCGATTTTAATTGTCCTTTCTGCTTTTATT
CACTTGCTATATCAGAATACCTTGAAGCTAAGAAGAATGCTGCATTGGCAAAGAAAAATGTTGCTACTTTTATTCGTACTGGTTTAGGACACCAGTCAATTGTTATTAAA
GAGGTATTGCAACAAAAAGATCTTGATCCTTCACGAAAAGCTGGAGGTGAGGATGTTGCTAAAATTTGTGAAGATGTTGACTTGGAAAATAAAGGCAATCAAGTAACTCT
AGATGGAGAACATGTAAAGGAAGTTGTTGACCATCAATCCCCAATAGTTACGGATATTGAGCGAATAATAGAGCTGTCAAGGCCATTGCATGTTGCCAATTCCAATCATA
GAGAAAACGAGGCAAGTCCTTTGAGAGTAGCTCCTGATGTTTTATCTGCAGGTAAAGACGGTGATGAATTGGTGGACCAAGAGTGTCAAGATAATGTTGCAGAACTGGAA
GATGGTCTAAAAGCCACAGAGCAGCATGACATTTATGATACTCTCCACGAAGATCAAGGGCCAGTTGAAGCAGCGGCTACACAAGAAGGATTACAGAACCAGACTGATGG
TAACGGAGAGGAACCTGCTTATGCAATTAATATTGAAGGAGAAAAATCTTCTGATGATGACAATGATGAGTCTATCATTTCTAGATACTCCATAAGATTTCGACGGAAAT
ATCATCATACGGAAACTCATCAATTAAGACGGAAGAAACTACCCTGGACAGCTGAGGAGGAAGAGACTCTAAGGGAGGGAGTTCGAAAATTCTCTAGTTCAGTTGAAAGA
AGTCCAAACATACCTTGGAAAAAGATTTTAGAATTTGGTAGTAATGTGTTTTTGAAAGATCGTACATCGATAGATCTTAAAGATAAATGGAGGAACTTGTGCAGAAGTCC
AAAGTTTAAATGA
Protein sequenceShow/hide protein sequence
MENETESSSSSALAWYWTIEALASFTDVKPSLLHDVIDKALELPEDTRKNAGEMVALKCLEGLFGPLNYSGENGPPAQESKVMFDSSESCEDVVKRVYKETPKSALRVAG
PYMSKWDVSPFIAQKRASMRCTLLQLKDSLLDGTHPYADFLKQKSGLMPVKKRDSISLNDDDLIERSKRLDSSFSDPQGQKEKHKGSPLLLEDERRLSVADPSSSTLLPS
KRSSVDLTSEDEARQLSGCDEGCTNAKKLKQHSAHTLYSGQEVASSHGTEVLADLSERVVPQNERDDTDHLDEHQMTSVEDKVVEEHFGSNNGQYTATDELHQVESGAPC
YTTPDSTQDGGMHEVISAEKAKDGEELPIEPKASNNSPAEGNVHDIITGNSEHDFGDDNHVHEMNTLSHSGFLAQAVATDIDVGMNPNEEEKDMLSDNDGYHNERTDIAM
KKKEFFSSQCIVDHDSFLLADGRELTVCVKCNEGGQLLSCNISDCPLVVHDKCLGSSARMDDEGDFNCPFCFYSLAISEYLEAKKNAALAKKNVATFIRTGLGHQSIVIK
EVLQQKDLDPSRKAGGEDVAKICEDVDLENKGNQVTLDGEHVKEVVDHQSPIVTDIERIIELSRPLHVANSNHRENEASPLRVAPDVLSAGKDGDELVDQECQDNVAELE
DGLKATEQHDIYDTLHEDQGPVEAAATQEGLQNQTDGNGEEPAYAINIEGEKSSDDDNDESIISRYSIRFRRKYHHTETHQLRRKKLPWTAEEEETLREGVRKFSSSVER
SPNIPWKKILEFGSNVFLKDRTSIDLKDKWRNLCRSPKFK