; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0017727 (gene) of Chayote v1 genome

Gene IDSed0017727
OrganismSechium edule (Chayote v1)
DescriptionProtein POLLENLESS 3-LIKE 1-like
Genome locationLG01:13956859..13960054
RNA-Seq ExpressionSed0017727
SyntenySed0017727
Gene Ontology termsGO:0009536 - plastid (cellular component)
GO:0005515 - protein binding (molecular function)
GO:0016491 - oxidoreductase activity (molecular function)
GO:0051536 - iron-sulfur cluster binding (molecular function)
InterPro domainsIPR011990 - Tetratricopeptide-like helical domain superfamily
IPR019734 - Tetratricopeptide repeat
IPR044961 - Tetratricopeptide repeat protein POLLENLESS 3/SULFUR DEFICIENCY-INDUCED 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008445490.1 PREDICTED: uncharacterized protein LOC103488488 [Cucumis melo]5.5e-23965.74Show/hide
Query:  RGFSTPPPPWKSRPFRSPKTAPFSERK------PNKSDLFHVIHKVPAGGSPYVKAKQVQLIDKDPSRAISLFWAAINTGDRVDSALKDMAVVMKQLDRS
        +GFSTPPP WKS+PFR PKTAPFSE K       NKSDLFHVIHKVPAG SPYVKAKQVQLIDKDP+RA+SLFWAAIN GDRVDSALKDMAVVMKQLDRS
Subjt:  RGFSTPPPPWKSRPFRSPKTAPFSERK------PNKSDLFHVIHKVPAGGSPYVKAKQVQLIDKDPSRAISLFWAAINTGDRVDSALKDMAVVMKQLDRS

Query:  DEAIEAIKSFRHLCPNESQESIDNVLIELYKRSGRIEEEIDMLQCKLKQIEEGSVFGGKRTKPARSQGKKVQITIEQEKSRVLGNLAWAFLQLDDYYIAE
        DEAIEAIKSFRHLCP +SQESIDNVLIELYKRSGRIEEEIDMLQCKLKQIE+G+VFGGKRTK ARSQGKKVQIT+EQEKSRVLGNLAWAFLQLD+ YIAE
Subjt:  DEAIEAIKSFRHLCPNESQESIDNVLIELYKRSGRIEEEIDMLQCKLKQIEEGSVFGGKRTKPARSQGKKVQITIEQEKSRVLGNLAWAFLQLDDYYIAE

Query:  DYYREALSLELDNNKKCNLAICLISTNRLMEAKSLLQSVRASAGGKPMEEVYAKSFERASQMLAEKESKSPLDSAGKEEDKSSHATTITSKSI--RAGPS
        +YYR+ALSLE DNNKKCNLAICLI TNRL EAKSLLQSVRAS+GGKPMEE YAKSFERAS MLAEKE K P +S   EED ++ A TITSK+   ++G  
Subjt:  DYYREALSLELDNNKKCNLAICLISTNRLMEAKSLLQSVRASAGGKPMEEVYAKSFERASQMLAEKESKSPLDSAGKEEDKSSHATTITSKSI--RAGPS

Query:  VSSQFTASTRWTPDDEEVYINENSRHY-------ENKSFGAVNSSHNYLHCD--------------NSCIPNKMKRNQEQDGLLRTE-ESFNCSSLYSSP
        V  Q TAST+WT DD+++YINENS  Y       ENKS GAVNSSHNYLHCD              +SCIP K+K ++ Q  L R E ESFNC SLYSSP
Subjt:  VSSQFTASTRWTPDDEEVYINENSRHY-------ENKSFGAVNSSHNYLHCD--------------NSCIPNKMKRNQEQDGLLRTE-ESFNCSSLYSSP

Query:  ILARGNTEVPSTRLKNSIREFDTRHGLNERRQQKGTTGGSSIRKVLFEKPSMKDQNFAA----YASSESEGTRTPSNYDMKYKSAASNAFELEVPFTQPR
          A+ + EVP T+ KNSI EF+ R G  ERRQQ+       IRKVLF  PS K+++FA+     ASSESEGT+  SNY  KY+SAA ++ ELEVPFTQPR
Subjt:  ILARGNTEVPSTRLKNSIREFDTRHGLNERRQQKGTTGGSSIRKVLFEKPSMKDQNFAA----YASSESEGTRTPSNYDMKYKSAASNAFELEVPFTQPR

Query:  SCSRGINGGGHQRKATECFRGL-PGSSIRKLSFEPTTSTEKIQ--------RSELSRAVSDEPRHLVVVRKQISRGDIEYEE--AAVSYDPMKIKVE---
        SC+  +N  GH RKATECFR L   SS RKLSFEP TSTE IQ        RSELSRAVSDEP+ L     Q S GDIEYEE  + + Y  MKIK E   
Subjt:  SCSRGINGGGHQRKATECFRGL-PGSSIRKLSFEPTTSTEKIQ--------RSELSRAVSDEPRHLVVVRKQISRGDIEYEE--AAVSYDPMKIKVE---

Query:  ------HNSSTVGGKKSWADMVEEEEEEGGNNEKKDDATDETSSSGRCQVNCFGGDWN-SSSDNEE--FNDENLNSNILHQ-GHSPSNNLVQ-VIQFGSL
              HNS TV GKKSWADMVEEEEEE    E+ +   + +SSSG  QVNCF  +W+  SSDN E  FNDENLNSNILHQ  H PS+N V+ +++FGSL
Subjt:  ------HNSSTVGGKKSWADMVEEEEEEGGNNEKKDDATDETSSSGRCQVNCFGGDWN-SSSDNEE--FNDENLNSNILHQ-GHSPSNNLVQ-VIQFGSL

Query:  DIKYGAKNTVDVDSSRNPAVRRPLCFNEQPTLDSTDNRCSSLLPRKDLTTEVSSNSVTTMAGENVNLIIRRNRLQVFQEI-TVHQELE
        +IK    ++ +V S RN  VR   C  +Q  L+S DN  +S LPRKDLTTEVS        G+  N ++RRNRLQVF EI TVHQELE
Subjt:  DIKYGAKNTVDVDSSRNPAVRRPLCFNEQPTLDSTDNRCSSLLPRKDLTTEVSSNSVTTMAGENVNLIIRRNRLQVFQEI-TVHQELE

XP_022139857.1 uncharacterized protein LOC111010669 isoform X1 [Momordica charantia]1.3e-24364.6Show/hide
Query:  MWANG------GRGFSTPPPPWKSRPFRSPKTAPFSERK------PNKSDLFHVIHKVPAGGSPYVKAKQVQLIDKDPSRAISLFWAAINTGDRVDSALK
        MW N       GRGFSTPPP WKSRPFR  KT PFSERK       NKS+LFHV+HKVPAG SPYVKAKQVQLIDKDP++A+SLFWAAIN GDRVDSALK
Subjt:  MWANG------GRGFSTPPPPWKSRPFRSPKTAPFSERK------PNKSDLFHVIHKVPAGGSPYVKAKQVQLIDKDPSRAISLFWAAINTGDRVDSALK

Query:  DMAVVMKQLDRSDEAIEAIKSFRHLCPNESQESIDNVLIELYKRSGRIEEEIDMLQCKLKQIEEGSVFGGKRTKPARSQGKKVQITIEQEKSRVLGNLAW
        DMAVVMKQLDRSDEAIEAIKSFRHLCP +SQESIDNVLIELYKRSGRIEEEI+MLQCKL+QIEEG+VFGGKRTK ARSQGKKVQITIEQEKSRVLGNLAW
Subjt:  DMAVVMKQLDRSDEAIEAIKSFRHLCPNESQESIDNVLIELYKRSGRIEEEIDMLQCKLKQIEEGSVFGGKRTKPARSQGKKVQITIEQEKSRVLGNLAW

Query:  AFLQLDDYYIAEDYYREALSLELDNNKKCNLAICLISTNRLMEAKSLLQSVRASAGGKPMEEVYAKSFERASQMLAEKESKSPLDSAGKEEDKSSHATTI
        AFLQLD+  +AEDYYR+ALSLE DNNKKCNLAICLI TNRL EAKSLLQ+VRAS+GGK MEE YAKSFERAS MLAEKESKS  +S G+EED SS +TT 
Subjt:  AFLQLDDYYIAEDYYREALSLELDNNKKCNLAICLISTNRLMEAKSLLQSVRASAGGKPMEEVYAKSFERASQMLAEKESKSPLDSAGKEEDKSSHATTI

Query:  TSKSIRAGPSVSSQFTASTRWTPDDEEVYINENSR---HY-------------------ENKSFGAVNSSHNYLHCD--------------NSCIPNKMK
             +A P V SQ TAST+WT DDEE+Y+NENSR   H+                   +NKS GAV SSHNYL+CD              NSCIP +MK
Subjt:  TSKSIRAGPSVSSQFTASTRWTPDDEEVYINENSR---HY-------------------ENKSFGAVNSSHNYLHCD--------------NSCIPNKMK

Query:  RNQEQDGLLR-TEESFNCSSLYSSPILARGNTEVPSTRLKNSIREFDTRHGLNERRQQKGTTGGSSIRKVLFEKPSMKDQN----FAAYASSESEGTRTP
         N+  DGL R   E FNCS+LYSSP  A+ N EVP T+ KN   EF+ RH   ER+Q+KGTTG S+ RKVLF  PSM DQ+    F+  ASSESEGT T 
Subjt:  RNQEQDGLLR-TEESFNCSSLYSSPILARGNTEVPSTRLKNSIREFDTRHGLNERRQQKGTTGGSSIRKVLFEKPSMKDQN----FAAYASSESEGTRTP

Query:  SNYDMKYKSAASNAFELEVPFTQPRSCSRG-INGGGHQRKATECFRGLPGSSIRKLSFEPTTSTEKIQ--------RSELSRAVSDEPRHLVVVRKQIS-
        SNY  KY+SAA +A ELEVPFTQPRSCS G ING    RKA EC    P SS RKLSFEP TSTE IQ        RS+LSRAVSDEP  L    KQ S 
Subjt:  SNYDMKYKSAASNAFELEVPFTQPRSCSRG-INGGGHQRKATECFRGLPGSSIRKLSFEPTTSTEKIQ--------RSELSRAVSDEPRHLVVVRKQIS-

Query:  -RGDIEYEEAAVSYDPMKIKVEHNSSTVGGKKSWADMVEEEEEEGGNNEKKDDATDETSSSGRCQVNCFGGDWN---SSSDNEEFNDENLNSNILHQGHS
            ++ +E  ++ D    K +HNSSTVGGKKSWADMVEEEEE+G + ++ +   +E SSSG C      GDW+   SSS +EEFNDENLN NIL Q   
Subjt:  -RGDIEYEEAAVSYDPMKIKVEHNSSTVGGKKSWADMVEEEEEEGGNNEKKDDATDETSSSGRCQVNCFGGDWN---SSSDNEEFNDENLNSNILHQGHS

Query:  PSNNLVQVIQFGSLDIKYGAKNTVDVDSSRNPAVRRPLCFNEQPTLDSTDNRCSSLLPRKDLTTEVSSNS-------VTTMAGENVNLIIRRNRLQVFQE
        P    V+ I+F SLD+K GA+++ D+ SSRN AVRRPL F+EQPTLDS DN  SS LP   LTTE+S NS        TTM GENV+L  R NRLQVFQE
Subjt:  PSNNLVQVIQFGSLDIKYGAKNTVDVDSSRNPAVRRPLCFNEQPTLDSTDNRCSSLLPRKDLTTEVSSNS-------VTTMAGENVNLIIRRNRLQVFQE

Query:  ITVHQELE
        +TVHQEL+
Subjt:  ITVHQELE

XP_022927078.1 uncharacterized protein LOC111434012 [Cucurbita moschata]2.5e-23964.99Show/hide
Query:  MWANG------GRGFSTPPPPWKSRPFRSPKTAPFSERK------PNKSDLFHVIHKVPAGGSPYVKAKQVQLIDKDPSRAISLFWAAINTGDRVDSALK
        MW N       GRGFSTPPP W+SRPFRSPKTAPF +RK       NKSDLFHVIHK+PAG SPYVKAKQVQLIDKDPSRA+SLFWAAIN GDRVDSALK
Subjt:  MWANG------GRGFSTPPPPWKSRPFRSPKTAPFSERK------PNKSDLFHVIHKVPAGGSPYVKAKQVQLIDKDPSRAISLFWAAINTGDRVDSALK

Query:  DMAVVMKQLDRSDEAIEAIKSFRHLCPNESQESIDNVLIELYKRSGRIEEEIDMLQCKLKQIEEGSVFGGKRTKPARSQGKKVQITIEQEKSRVLGNLAW
        DMAVVMKQLDRSDEAIEAI+SFRHLC  +SQESIDNVLIELYKRSGRIEEEIDML+CKLKQIE+G+VFGGK+TK ARSQGKKVQIT+EQEKSRVLGNLAW
Subjt:  DMAVVMKQLDRSDEAIEAIKSFRHLCPNESQESIDNVLIELYKRSGRIEEEIDMLQCKLKQIEEGSVFGGKRTKPARSQGKKVQITIEQEKSRVLGNLAW

Query:  AFLQLDDYYIAEDYYREALSLELDNNKKCNLAICLISTNRLMEAKSLLQSVRASAGGKPMEEVYAKSFERASQMLAEKESKSPLDSAGKEEDKSSHATTI
        AFLQ D+  +AE+YYR+AL LE DNNK+CNLAICLI  NRL EAKS+LQS+RAS+GG  MEE YAKSFERAS MLAEKESK  L ++ ++E+ +S AT  
Subjt:  AFLQLDDYYIAEDYYREALSLELDNNKKCNLAICLISTNRLMEAKSLLQSVRASAGGKPMEEVYAKSFERASQMLAEKESKSPLDSAGKEEDKSSHATTI

Query:  TSKSIRAGPSVSSQFTASTRWTPDDEEVYINENSR------HYENKSF-GAVNSSHNYLHCDNSCIPNKMK----------RNQEQDGLLR-TEESFNCS
            + AG  V  Q TASTRWT  DEE+Y+NENSR       +EN+SF  + +     L   +SCI  KMK          R+++QDGLLR  +E  NC 
Subjt:  TSKSIRAGPSVSSQFTASTRWTPDDEEVYINENSR------HYENKSF-GAVNSSHNYLHCDNSCIPNKMK----------RNQEQDGLLR-TEESFNCS

Query:  SLYSSPILARGNTEVPSTRLKNSIREFDTRHGLNERRQQKGTTGGSSIRKVLFEKPSMKDQNFAAYASSESEGTRTPSNYDMKYKSAASNAFELEVPFTQ
        SLYSSP  A+ N EVP T+ KNS+ EF+ R  LNE RQ+K T+  SS RKVLF+     D  FA  ASSESE +   SNY  KY+SAAS+A ELEVPFTQ
Subjt:  SLYSSPILARGNTEVPSTRLKNSIREFDTRHGLNERRQQKGTTGGSSIRKVLFEKPSMKDQNFAAYASSESEGTRTPSNYDMKYKSAASNAFELEVPFTQ

Query:  PRSCSRGINGGGHQRKATECFRG-LPGSSIRKLSFEPTTSTEKIQ--------RSELSRAVSDEPRHLVVVRKQISRGDIEYEEAAVSYDPMKIKVEH--
        PRSCS GINGG  Q+K +ECFR  L  SS RKLSFEP TSTE  Q        RSELSRAVSDE              D+EYEE A+ YD MKI+ EH  
Subjt:  PRSCSRGINGGGHQRKATECFRG-LPGSSIRKLSFEPTTSTEKIQ--------RSELSRAVSDEPRHLVVVRKQISRGDIEYEEAAVSYDPMKIKVEH--

Query:  NSSTVGGKKSWADMVEEEEEEGGNNEKKDDATDETSSSGRCQVNCFGGDWNSSSDNEE--FNDENLNSNILHQ-GHSPSNNLVQVIQFGSLDIKYGAKNT
        NSS VGGKKSWADMVEEEEEE   + +K+D T+ETSSS R +VNCF  DW SSSDNEE  FNDENLNSNILHQ  HSP +         S  ++ GA+++
Subjt:  NSSTVGGKKSWADMVEEEEEEGGNNEKKDDATDETSSSGRCQVNCFGGDWNSSSDNEE--FNDENLNSNILHQ-GHSPSNNLVQVIQFGSLDIKYGAKNT

Query:  VDVDSSRNPAVRRPLCFNEQPTLDSTDNRCSSLLPRKDLTTEVSSNSVTTMAGENVNLIIRRNRLQVFQEITVHQEL
         DV SSRNPAVRRPLCF++QPTL+S DNR SS LP+KDLTTE          GENVNL IRRNRLQ+FQEITVHQEL
Subjt:  VDVDSSRNPAVRRPLCFNEQPTLDSTDNRCSSLLPRKDLTTEVSSNSVTTMAGENVNLIIRRNRLQVFQEITVHQEL

XP_038894110.1 uncharacterized protein LOC120082846 isoform X1 [Benincasa hispida]2.6e-26569.55Show/hide
Query:  MWANGG------RGFSTPPPPWKSRPFRSPKTAPFSERK------PNKSDLFHVIHKVPAGGSPYVKAKQVQLIDKDPSRAISLFWAAINTGDRVDSALK
        MW N G      +GFSTPPP WKSRPFRS KT+PFSERK       NKSDLFHVIHKVPAG SPYVKAKQVQLIDKDPSRA+SLFWAAIN GDRVDSALK
Subjt:  MWANGG------RGFSTPPPPWKSRPFRSPKTAPFSERK------PNKSDLFHVIHKVPAGGSPYVKAKQVQLIDKDPSRAISLFWAAINTGDRVDSALK

Query:  DMAVVMKQLDRSDEAIEAIKSFRHLCPNESQESIDNVLIELYKRSGRIEEEIDMLQCKLKQIEEGSVFGGKRTKPARSQGKKVQITIEQEKSRVLGNLAW
        DMAVVMKQLDRSDEAIEAI+SFRHLCP +SQESIDNVLIELYKRSGRIEEEIDMLQCKLKQIEEG+VFGGKRTK ARSQGKKVQITIEQEKSRVLGNLAW
Subjt:  DMAVVMKQLDRSDEAIEAIKSFRHLCPNESQESIDNVLIELYKRSGRIEEEIDMLQCKLKQIEEGSVFGGKRTKPARSQGKKVQITIEQEKSRVLGNLAW

Query:  AFLQLDDYYIAEDYYREALSLELDNNKKCNLAICLISTNRLMEAKSLLQSVRASAGGKPMEEVYAKSFERASQMLAEKESKSPLDSAGKEEDKSSHATTI
        AFLQLD+ Y+AEDYYR+ALSLE DNNKKCNLAICLI TNRL+EAKSLLQSVRAS+GGKPMEE YAKSFERAS MLAEKES S  +S G EED  +  T I
Subjt:  AFLQLDDYYIAEDYYREALSLELDNNKKCNLAICLISTNRLMEAKSLLQSVRASAGGKPMEEVYAKSFERASQMLAEKESKSPLDSAGKEEDKSSHATTI

Query:  TSK--SIRAGPSVSSQFTASTRWTPDDEEVYINENSRH-------YENKSFGAVNSSHNYLHCD--------------NSCI-PNKMKRNQ-EQDGLLR-
        TSK  + RAGP V  Q T STRWT DDE++YINENSR        YENKS GAVNSSHNYLHCD              +SCI P K K N+  QDGLLR 
Subjt:  TSK--SIRAGPSVSSQFTASTRWTPDDEEVYINENSRH-------YENKSFGAVNSSHNYLHCD--------------NSCI-PNKMKRNQ-EQDGLLR-

Query:  TEESFNCSSLYSSPILARGNTEVPSTRLKNSIREFDTRHGLNERRQQKGTTGGSSIRKVLFEKPSMKDQN----FAAYASSESEGTRTP-SNYDMKYKSA
         +ESFNC SLYSSPI A+ N EVP T+ KNS  EF+ R    ERRQQ+  +     RKVLFE PSMKDQ+    F   ASSESEGT  P SNY  KY+SA
Subjt:  TEESFNCSSLYSSPILARGNTEVPSTRLKNSIREFDTRHGLNERRQQKGTTGGSSIRKVLFEKPSMKDQN----FAAYASSESEGTRTP-SNYDMKYKSA

Query:  ASNAFELEVPFTQPRSCSRGINGGGHQRKATECFRGL-PGSSIRKLSFEPTTSTEKIQ-----RSELSRAVSDEPRHLVVVRKQISRGDIEYEEAAVSYD
        A +  ELEVPFTQPRSCS G+NGG H RKATECFR L   SS RKLSFEP T+TE IQ     RSELSRAVSDEP+ L    K+ S GDI+Y E AV Y 
Subjt:  ASNAFELEVPFTQPRSCSRGINGGGHQRKATECFRGL-PGSSIRKLSFEPTTSTEKIQ-----RSELSRAVSDEPRHLVVVRKQISRGDIEYEEAAVSYD

Query:  PMKIKVEH---------NSSTVGGKKSWADMVEEEEEEGGNNEKKDDATDETSSSGRCQVNCFGGDWNSSSDNEE--FNDENLNSNILHQ-GHSPSNN--
         +KIK EH         NSSTVGGKKSWADMVEEEEE+  + EK++D  + +SSSGR QVNCF  +W+SSSDN E  FNDENLNSNILHQ   SPS+N  
Subjt:  PMKIKVEH---------NSSTVGGKKSWADMVEEEEEEGGNNEKKDDATDETSSSGRCQVNCFGGDWNSSSDNEE--FNDENLNSNILHQ-GHSPSNN--

Query:  LVQVIQFGSLDIKYGAKNTVDVDSSRNPAVRRPLCFNEQPTLDSTDNRCSSLLPRKDLTTEVSSNSVTTMAGENVNLIIRRNRLQVFQEITVHQELEC
        +  +I F SL+IK GAK++ DV   RNPAVRRPL F++QP L+ST+NRC+S LPRKDLTTEV  NS     G+  NL +RRNRLQVF EITVHQELEC
Subjt:  LVQVIQFGSLDIKYGAKNTVDVDSSRNPAVRRPLCFNEQPTLDSTDNRCSSLLPRKDLTTEVSSNSVTTMAGENVNLIIRRNRLQVFQEITVHQELEC

XP_038894111.1 uncharacterized protein LOC120082846 isoform X2 [Benincasa hispida]1.2e-24966.29Show/hide
Query:  MWANGG------RGFSTPPPPWKSRPFRSPKTAPFSERK------PNKSDLFHVIHKVPAGGSPYVKAKQVQLIDKDPSRAISLFWAAINTGDRVDSALK
        MW N G      +GFSTPPP WKSRPFRS KT+PFSERK       NKSDLFHVIHKVPAG SPYVKAKQVQLIDKDPSRA+SLFWAAIN GDRVDSALK
Subjt:  MWANGG------RGFSTPPPPWKSRPFRSPKTAPFSERK------PNKSDLFHVIHKVPAGGSPYVKAKQVQLIDKDPSRAISLFWAAINTGDRVDSALK

Query:  DMAVVMKQLDRSDEAIEAIKSFRHLCPNESQESIDNVLIELYKRSGRIEEEIDMLQCKLKQIEEGSVFGGKRTKPARSQGKKVQITIEQEKSRVLGNLAW
        DMAVVMKQLDRSDEAIEAI+SFRHLCP +SQESIDNVLIELYKRSGRIEEEIDMLQCKLKQIEEG+VFGGKRTK ARSQGKKVQITIEQEKSRVLGNLAW
Subjt:  DMAVVMKQLDRSDEAIEAIKSFRHLCPNESQESIDNVLIELYKRSGRIEEEIDMLQCKLKQIEEGSVFGGKRTKPARSQGKKVQITIEQEKSRVLGNLAW

Query:  AFLQLDDYYIAEDYYREALSLELDNNKKCNLAICLISTNRLMEAKSLLQSVRASAGGKPMEEVYAKSFERASQMLAEKESKSPLDSAGKEEDKSSHATTI
        AFLQLD+ Y+AEDYYR+ALSLE DNNKKCNLAICLI TNRL+EAKSLLQSVRAS+GG                                 E+ +   T I
Subjt:  AFLQLDDYYIAEDYYREALSLELDNNKKCNLAICLISTNRLMEAKSLLQSVRASAGGKPMEEVYAKSFERASQMLAEKESKSPLDSAGKEEDKSSHATTI

Query:  TSK--SIRAGPSVSSQFTASTRWTPDDEEVYINENSRH-------YENKSFGAVNSSHNYLHCD--------------NSCI-PNKMKRNQ-EQDGLLR-
        TSK  + RAGP V  Q T STRWT DDE++YINENSR        YENKS GAVNSSHNYLHCD              +SCI P K K N+  QDGLLR 
Subjt:  TSK--SIRAGPSVSSQFTASTRWTPDDEEVYINENSRH-------YENKSFGAVNSSHNYLHCD--------------NSCI-PNKMKRNQ-EQDGLLR-

Query:  TEESFNCSSLYSSPILARGNTEVPSTRLKNSIREFDTRHGLNERRQQKGTTGGSSIRKVLFEKPSMKDQN----FAAYASSESEGTRTP-SNYDMKYKSA
         +ESFNC SLYSSPI A+ N EVP T+ KNS  EF+ R    ERRQQ+  +     RKVLFE PSMKDQ+    F   ASSESEGT  P SNY  KY+SA
Subjt:  TEESFNCSSLYSSPILARGNTEVPSTRLKNSIREFDTRHGLNERRQQKGTTGGSSIRKVLFEKPSMKDQN----FAAYASSESEGTRTP-SNYDMKYKSA

Query:  ASNAFELEVPFTQPRSCSRGINGGGHQRKATECFRGL-PGSSIRKLSFEPTTSTEKIQ-----RSELSRAVSDEPRHLVVVRKQISRGDIEYEEAAVSYD
        A +  ELEVPFTQPRSCS G+NGG H RKATECFR L   SS RKLSFEP T+TE IQ     RSELSRAVSDEP+ L    K+ S GDI+Y E AV Y 
Subjt:  ASNAFELEVPFTQPRSCSRGINGGGHQRKATECFRGL-PGSSIRKLSFEPTTSTEKIQ-----RSELSRAVSDEPRHLVVVRKQISRGDIEYEEAAVSYD

Query:  PMKIKVEH---------NSSTVGGKKSWADMVEEEEEEGGNNEKKDDATDETSSSGRCQVNCFGGDWNSSSDNEE--FNDENLNSNILHQ-GHSPSNN--
         +KIK EH         NSSTVGGKKSWADMVEEEEE+  + EK++D  + +SSSGR QVNCF  +W+SSSDN E  FNDENLNSNILHQ   SPS+N  
Subjt:  PMKIKVEH---------NSSTVGGKKSWADMVEEEEEEGGNNEKKDDATDETSSSGRCQVNCFGGDWNSSSDNEE--FNDENLNSNILHQ-GHSPSNN--

Query:  LVQVIQFGSLDIKYGAKNTVDVDSSRNPAVRRPLCFNEQPTLDSTDNRCSSLLPRKDLTTEVSSNSVTTMAGENVNLIIRRNRLQVFQEITVHQELEC
        +  +I F SL+IK GAK++ DV   RNPAVRRPL F++QP L+ST+NRC+S LPRKDLTTEV  NS     G+  NL +RRNRLQVF EITVHQELEC
Subjt:  LVQVIQFGSLDIKYGAKNTVDVDSSRNPAVRRPLCFNEQPTLDSTDNRCSSLLPRKDLTTEVSSNSVTTMAGENVNLIIRRNRLQVFQEITVHQELEC

TrEMBL top hitse value%identityAlignment
A0A1S3BDQ0 uncharacterized protein LOC1034884882.7e-23965.74Show/hide
Query:  RGFSTPPPPWKSRPFRSPKTAPFSERK------PNKSDLFHVIHKVPAGGSPYVKAKQVQLIDKDPSRAISLFWAAINTGDRVDSALKDMAVVMKQLDRS
        +GFSTPPP WKS+PFR PKTAPFSE K       NKSDLFHVIHKVPAG SPYVKAKQVQLIDKDP+RA+SLFWAAIN GDRVDSALKDMAVVMKQLDRS
Subjt:  RGFSTPPPPWKSRPFRSPKTAPFSERK------PNKSDLFHVIHKVPAGGSPYVKAKQVQLIDKDPSRAISLFWAAINTGDRVDSALKDMAVVMKQLDRS

Query:  DEAIEAIKSFRHLCPNESQESIDNVLIELYKRSGRIEEEIDMLQCKLKQIEEGSVFGGKRTKPARSQGKKVQITIEQEKSRVLGNLAWAFLQLDDYYIAE
        DEAIEAIKSFRHLCP +SQESIDNVLIELYKRSGRIEEEIDMLQCKLKQIE+G+VFGGKRTK ARSQGKKVQIT+EQEKSRVLGNLAWAFLQLD+ YIAE
Subjt:  DEAIEAIKSFRHLCPNESQESIDNVLIELYKRSGRIEEEIDMLQCKLKQIEEGSVFGGKRTKPARSQGKKVQITIEQEKSRVLGNLAWAFLQLDDYYIAE

Query:  DYYREALSLELDNNKKCNLAICLISTNRLMEAKSLLQSVRASAGGKPMEEVYAKSFERASQMLAEKESKSPLDSAGKEEDKSSHATTITSKSI--RAGPS
        +YYR+ALSLE DNNKKCNLAICLI TNRL EAKSLLQSVRAS+GGKPMEE YAKSFERAS MLAEKE K P +S   EED ++ A TITSK+   ++G  
Subjt:  DYYREALSLELDNNKKCNLAICLISTNRLMEAKSLLQSVRASAGGKPMEEVYAKSFERASQMLAEKESKSPLDSAGKEEDKSSHATTITSKSI--RAGPS

Query:  VSSQFTASTRWTPDDEEVYINENSRHY-------ENKSFGAVNSSHNYLHCD--------------NSCIPNKMKRNQEQDGLLRTE-ESFNCSSLYSSP
        V  Q TAST+WT DD+++YINENS  Y       ENKS GAVNSSHNYLHCD              +SCIP K+K ++ Q  L R E ESFNC SLYSSP
Subjt:  VSSQFTASTRWTPDDEEVYINENSRHY-------ENKSFGAVNSSHNYLHCD--------------NSCIPNKMKRNQEQDGLLRTE-ESFNCSSLYSSP

Query:  ILARGNTEVPSTRLKNSIREFDTRHGLNERRQQKGTTGGSSIRKVLFEKPSMKDQNFAA----YASSESEGTRTPSNYDMKYKSAASNAFELEVPFTQPR
          A+ + EVP T+ KNSI EF+ R G  ERRQQ+       IRKVLF  PS K+++FA+     ASSESEGT+  SNY  KY+SAA ++ ELEVPFTQPR
Subjt:  ILARGNTEVPSTRLKNSIREFDTRHGLNERRQQKGTTGGSSIRKVLFEKPSMKDQNFAA----YASSESEGTRTPSNYDMKYKSAASNAFELEVPFTQPR

Query:  SCSRGINGGGHQRKATECFRGL-PGSSIRKLSFEPTTSTEKIQ--------RSELSRAVSDEPRHLVVVRKQISRGDIEYEE--AAVSYDPMKIKVE---
        SC+  +N  GH RKATECFR L   SS RKLSFEP TSTE IQ        RSELSRAVSDEP+ L     Q S GDIEYEE  + + Y  MKIK E   
Subjt:  SCSRGINGGGHQRKATECFRGL-PGSSIRKLSFEPTTSTEKIQ--------RSELSRAVSDEPRHLVVVRKQISRGDIEYEE--AAVSYDPMKIKVE---

Query:  ------HNSSTVGGKKSWADMVEEEEEEGGNNEKKDDATDETSSSGRCQVNCFGGDWN-SSSDNEE--FNDENLNSNILHQ-GHSPSNNLVQ-VIQFGSL
              HNS TV GKKSWADMVEEEEEE    E+ +   + +SSSG  QVNCF  +W+  SSDN E  FNDENLNSNILHQ  H PS+N V+ +++FGSL
Subjt:  ------HNSSTVGGKKSWADMVEEEEEEGGNNEKKDDATDETSSSGRCQVNCFGGDWN-SSSDNEE--FNDENLNSNILHQ-GHSPSNNLVQ-VIQFGSL

Query:  DIKYGAKNTVDVDSSRNPAVRRPLCFNEQPTLDSTDNRCSSLLPRKDLTTEVSSNSVTTMAGENVNLIIRRNRLQVFQEI-TVHQELE
        +IK    ++ +V S RN  VR   C  +Q  L+S DN  +S LPRKDLTTEVS        G+  N ++RRNRLQVF EI TVHQELE
Subjt:  DIKYGAKNTVDVDSSRNPAVRRPLCFNEQPTLDSTDNRCSSLLPRKDLTTEVSSNSVTTMAGENVNLIIRRNRLQVFQEI-TVHQELE

A0A5A7VD19 Protein POLLENLESS 3-LIKE 1-like2.7e-23965.74Show/hide
Query:  RGFSTPPPPWKSRPFRSPKTAPFSERK------PNKSDLFHVIHKVPAGGSPYVKAKQVQLIDKDPSRAISLFWAAINTGDRVDSALKDMAVVMKQLDRS
        +GFSTPPP WKS+PFR PKTAPFSE K       NKSDLFHVIHKVPAG SPYVKAKQVQLIDKDP+RA+SLFWAAIN GDRVDSALKDMAVVMKQLDRS
Subjt:  RGFSTPPPPWKSRPFRSPKTAPFSERK------PNKSDLFHVIHKVPAGGSPYVKAKQVQLIDKDPSRAISLFWAAINTGDRVDSALKDMAVVMKQLDRS

Query:  DEAIEAIKSFRHLCPNESQESIDNVLIELYKRSGRIEEEIDMLQCKLKQIEEGSVFGGKRTKPARSQGKKVQITIEQEKSRVLGNLAWAFLQLDDYYIAE
        DEAIEAIKSFRHLCP +SQESIDNVLIELYKRSGRIEEEIDMLQCKLKQIE+G+VFGGKRTK ARSQGKKVQIT+EQEKSRVLGNLAWAFLQLD+ YIAE
Subjt:  DEAIEAIKSFRHLCPNESQESIDNVLIELYKRSGRIEEEIDMLQCKLKQIEEGSVFGGKRTKPARSQGKKVQITIEQEKSRVLGNLAWAFLQLDDYYIAE

Query:  DYYREALSLELDNNKKCNLAICLISTNRLMEAKSLLQSVRASAGGKPMEEVYAKSFERASQMLAEKESKSPLDSAGKEEDKSSHATTITSKSI--RAGPS
        +YYR+ALSLE DNNKKCNLAICLI TNRL EAKSLLQSVRAS+GGKPMEE YAKSFERAS MLAEKE K P +S   EED ++ A TITSK+   ++G  
Subjt:  DYYREALSLELDNNKKCNLAICLISTNRLMEAKSLLQSVRASAGGKPMEEVYAKSFERASQMLAEKESKSPLDSAGKEEDKSSHATTITSKSI--RAGPS

Query:  VSSQFTASTRWTPDDEEVYINENSRHY-------ENKSFGAVNSSHNYLHCD--------------NSCIPNKMKRNQEQDGLLRTE-ESFNCSSLYSSP
        V  Q TAST+WT DD+++YINENS  Y       ENKS GAVNSSHNYLHCD              +SCIP K+K ++ Q  L R E ESFNC SLYSSP
Subjt:  VSSQFTASTRWTPDDEEVYINENSRHY-------ENKSFGAVNSSHNYLHCD--------------NSCIPNKMKRNQEQDGLLRTE-ESFNCSSLYSSP

Query:  ILARGNTEVPSTRLKNSIREFDTRHGLNERRQQKGTTGGSSIRKVLFEKPSMKDQNFAA----YASSESEGTRTPSNYDMKYKSAASNAFELEVPFTQPR
          A+ + EVP T+ KNSI EF+ R G  ERRQQ+       IRKVLF  PS K+++FA+     ASSESEGT+  SNY  KY+SAA ++ ELEVPFTQPR
Subjt:  ILARGNTEVPSTRLKNSIREFDTRHGLNERRQQKGTTGGSSIRKVLFEKPSMKDQNFAA----YASSESEGTRTPSNYDMKYKSAASNAFELEVPFTQPR

Query:  SCSRGINGGGHQRKATECFRGL-PGSSIRKLSFEPTTSTEKIQ--------RSELSRAVSDEPRHLVVVRKQISRGDIEYEE--AAVSYDPMKIKVE---
        SC+  +N  GH RKATECFR L   SS RKLSFEP TSTE IQ        RSELSRAVSDEP+ L     Q S GDIEYEE  + + Y  MKIK E   
Subjt:  SCSRGINGGGHQRKATECFRGL-PGSSIRKLSFEPTTSTEKIQ--------RSELSRAVSDEPRHLVVVRKQISRGDIEYEE--AAVSYDPMKIKVE---

Query:  ------HNSSTVGGKKSWADMVEEEEEEGGNNEKKDDATDETSSSGRCQVNCFGGDWN-SSSDNEE--FNDENLNSNILHQ-GHSPSNNLVQ-VIQFGSL
              HNS TV GKKSWADMVEEEEEE    E+ +   + +SSSG  QVNCF  +W+  SSDN E  FNDENLNSNILHQ  H PS+N V+ +++FGSL
Subjt:  ------HNSSTVGGKKSWADMVEEEEEEGGNNEKKDDATDETSSSGRCQVNCFGGDWN-SSSDNEE--FNDENLNSNILHQ-GHSPSNNLVQ-VIQFGSL

Query:  DIKYGAKNTVDVDSSRNPAVRRPLCFNEQPTLDSTDNRCSSLLPRKDLTTEVSSNSVTTMAGENVNLIIRRNRLQVFQEI-TVHQELE
        +IK    ++ +V S RN  VR   C  +Q  L+S DN  +S LPRKDLTTEVS        G+  N ++RRNRLQVF EI TVHQELE
Subjt:  DIKYGAKNTVDVDSSRNPAVRRPLCFNEQPTLDSTDNRCSSLLPRKDLTTEVSSNSVTTMAGENVNLIIRRNRLQVFQEI-TVHQELE

A0A6J1CF35 uncharacterized protein LOC111010669 isoform X16.1e-24464.6Show/hide
Query:  MWANG------GRGFSTPPPPWKSRPFRSPKTAPFSERK------PNKSDLFHVIHKVPAGGSPYVKAKQVQLIDKDPSRAISLFWAAINTGDRVDSALK
        MW N       GRGFSTPPP WKSRPFR  KT PFSERK       NKS+LFHV+HKVPAG SPYVKAKQVQLIDKDP++A+SLFWAAIN GDRVDSALK
Subjt:  MWANG------GRGFSTPPPPWKSRPFRSPKTAPFSERK------PNKSDLFHVIHKVPAGGSPYVKAKQVQLIDKDPSRAISLFWAAINTGDRVDSALK

Query:  DMAVVMKQLDRSDEAIEAIKSFRHLCPNESQESIDNVLIELYKRSGRIEEEIDMLQCKLKQIEEGSVFGGKRTKPARSQGKKVQITIEQEKSRVLGNLAW
        DMAVVMKQLDRSDEAIEAIKSFRHLCP +SQESIDNVLIELYKRSGRIEEEI+MLQCKL+QIEEG+VFGGKRTK ARSQGKKVQITIEQEKSRVLGNLAW
Subjt:  DMAVVMKQLDRSDEAIEAIKSFRHLCPNESQESIDNVLIELYKRSGRIEEEIDMLQCKLKQIEEGSVFGGKRTKPARSQGKKVQITIEQEKSRVLGNLAW

Query:  AFLQLDDYYIAEDYYREALSLELDNNKKCNLAICLISTNRLMEAKSLLQSVRASAGGKPMEEVYAKSFERASQMLAEKESKSPLDSAGKEEDKSSHATTI
        AFLQLD+  +AEDYYR+ALSLE DNNKKCNLAICLI TNRL EAKSLLQ+VRAS+GGK MEE YAKSFERAS MLAEKESKS  +S G+EED SS +TT 
Subjt:  AFLQLDDYYIAEDYYREALSLELDNNKKCNLAICLISTNRLMEAKSLLQSVRASAGGKPMEEVYAKSFERASQMLAEKESKSPLDSAGKEEDKSSHATTI

Query:  TSKSIRAGPSVSSQFTASTRWTPDDEEVYINENSR---HY-------------------ENKSFGAVNSSHNYLHCD--------------NSCIPNKMK
             +A P V SQ TAST+WT DDEE+Y+NENSR   H+                   +NKS GAV SSHNYL+CD              NSCIP +MK
Subjt:  TSKSIRAGPSVSSQFTASTRWTPDDEEVYINENSR---HY-------------------ENKSFGAVNSSHNYLHCD--------------NSCIPNKMK

Query:  RNQEQDGLLR-TEESFNCSSLYSSPILARGNTEVPSTRLKNSIREFDTRHGLNERRQQKGTTGGSSIRKVLFEKPSMKDQN----FAAYASSESEGTRTP
         N+  DGL R   E FNCS+LYSSP  A+ N EVP T+ KN   EF+ RH   ER+Q+KGTTG S+ RKVLF  PSM DQ+    F+  ASSESEGT T 
Subjt:  RNQEQDGLLR-TEESFNCSSLYSSPILARGNTEVPSTRLKNSIREFDTRHGLNERRQQKGTTGGSSIRKVLFEKPSMKDQN----FAAYASSESEGTRTP

Query:  SNYDMKYKSAASNAFELEVPFTQPRSCSRG-INGGGHQRKATECFRGLPGSSIRKLSFEPTTSTEKIQ--------RSELSRAVSDEPRHLVVVRKQIS-
        SNY  KY+SAA +A ELEVPFTQPRSCS G ING    RKA EC    P SS RKLSFEP TSTE IQ        RS+LSRAVSDEP  L    KQ S 
Subjt:  SNYDMKYKSAASNAFELEVPFTQPRSCSRG-INGGGHQRKATECFRGLPGSSIRKLSFEPTTSTEKIQ--------RSELSRAVSDEPRHLVVVRKQIS-

Query:  -RGDIEYEEAAVSYDPMKIKVEHNSSTVGGKKSWADMVEEEEEEGGNNEKKDDATDETSSSGRCQVNCFGGDWN---SSSDNEEFNDENLNSNILHQGHS
            ++ +E  ++ D    K +HNSSTVGGKKSWADMVEEEEE+G + ++ +   +E SSSG C      GDW+   SSS +EEFNDENLN NIL Q   
Subjt:  -RGDIEYEEAAVSYDPMKIKVEHNSSTVGGKKSWADMVEEEEEEGGNNEKKDDATDETSSSGRCQVNCFGGDWN---SSSDNEEFNDENLNSNILHQGHS

Query:  PSNNLVQVIQFGSLDIKYGAKNTVDVDSSRNPAVRRPLCFNEQPTLDSTDNRCSSLLPRKDLTTEVSSNS-------VTTMAGENVNLIIRRNRLQVFQE
        P    V+ I+F SLD+K GA+++ D+ SSRN AVRRPL F+EQPTLDS DN  SS LP   LTTE+S NS        TTM GENV+L  R NRLQVFQE
Subjt:  PSNNLVQVIQFGSLDIKYGAKNTVDVDSSRNPAVRRPLCFNEQPTLDSTDNRCSSLLPRKDLTTEVSSNS-------VTTMAGENVNLIIRRNRLQVFQE

Query:  ITVHQELE
        +TVHQEL+
Subjt:  ITVHQELE

A0A6J1CGP2 uncharacterized protein LOC111010669 isoform X25.6e-22962.05Show/hide
Query:  MWANG------GRGFSTPPPPWKSRPFRSPKTAPFSERK------PNKSDLFHVIHKVPAGGSPYVKAKQVQLIDKDPSRAISLFWAAINTGDRVDSALK
        MW N       GRGFSTPPP WKSRPFR  KT PFSERK       NKS+LFHV+HKVPAG SPYVKAKQVQLIDKDP++A+SLFWAAIN GDRVDSALK
Subjt:  MWANG------GRGFSTPPPPWKSRPFRSPKTAPFSERK------PNKSDLFHVIHKVPAGGSPYVKAKQVQLIDKDPSRAISLFWAAINTGDRVDSALK

Query:  DMAVVMKQLDRSDEAIEAIKSFRHLCPNESQESIDNVLIELYKRSGRIEEEIDMLQCKLKQIEEGSVFGGKRTKPARSQGKKVQITIEQEKSRVLGNLAW
        DMAVVMKQLDRSDEAIEAIKSFRHLCP +SQESIDNVLIELYKRSGRIEEEI+MLQCKL+QIEEG+VFGGKRTK ARSQGKKVQITIEQEKSRVLGNLAW
Subjt:  DMAVVMKQLDRSDEAIEAIKSFRHLCPNESQESIDNVLIELYKRSGRIEEEIDMLQCKLKQIEEGSVFGGKRTKPARSQGKKVQITIEQEKSRVLGNLAW

Query:  AFLQLDDYYIAEDYYREALSLELDNNKKCNLAICLISTNRLMEAKSLLQSVRASAGGKPMEEVYAKSFERASQMLAEKESKSPLDSAGKEEDKSSHATTI
        AFLQLD+  +AEDYYR+ALSLE DNNKKCNLAICLI TNRL EAKSLLQ+VRAS+GGK MEE YAKSFERAS MLAEKESKS  +S G+EED SS +TT 
Subjt:  AFLQLDDYYIAEDYYREALSLELDNNKKCNLAICLISTNRLMEAKSLLQSVRASAGGKPMEEVYAKSFERASQMLAEKESKSPLDSAGKEEDKSSHATTI

Query:  TSKSIRAGPSVSSQFTASTRWTPDDEEVYINENSR---HY-------------------ENKSFGAVNSSHNYLHCD--------------NSCIPNKMK
             +A P V SQ TAST+WT DDEE+Y+NENSR   H+                   +NKS GAV SSHNYL+CD              NSCIP +MK
Subjt:  TSKSIRAGPSVSSQFTASTRWTPDDEEVYINENSR---HY-------------------ENKSFGAVNSSHNYLHCD--------------NSCIPNKMK

Query:  RNQEQDGLLR-TEESFNCSSLYSSPILARGNTEVPSTRLKNSIREFDTRHGLNERRQQKGTTGGSSIRKVLFEKPSMKDQN----FAAYASSESEGTRTP
         N+  DGL R   E FNCS+LYSSP  A+ N EVP T+ KN   EF+ RH   ER+Q+KGTTG S+ RKVLF  PSM DQ+    F+  ASSESEGT T 
Subjt:  RNQEQDGLLR-TEESFNCSSLYSSPILARGNTEVPSTRLKNSIREFDTRHGLNERRQQKGTTGGSSIRKVLFEKPSMKDQN----FAAYASSESEGTRTP

Query:  SNYDMKYKSAASNAFELEVPFTQPRSCSRG-INGGGHQRKATECFRGLPGSSIRKLSFEPTTSTEKIQ--------RSELSRAVSDEPRHLVVVRKQIS-
        SNY  KY+SAA +A ELEVPFTQPRSCS G ING    RKA EC    P SS RKLSFEP TSTE IQ        RS+LSRAVSDEP  L    KQ S 
Subjt:  SNYDMKYKSAASNAFELEVPFTQPRSCSRG-INGGGHQRKATECFRGLPGSSIRKLSFEPTTSTEKIQ--------RSELSRAVSDEPRHLVVVRKQIS-

Query:  -RGDIEYEEAAVSYDPMKIKVEHNSSTVGGKKSWADMVEEEEEEGGNNEKKDDATDETSSSGRCQVNCFGGDWN---SSSDNEEFNDENLNSNILHQGHS
            ++ +E  ++ D    K +HNSSTVGGKKSWADMVEEEEE+G + ++ +   +E SSSG C      GDW+   SSS +EEFNDENLN NIL Q   
Subjt:  -RGDIEYEEAAVSYDPMKIKVEHNSSTVGGKKSWADMVEEEEEEGGNNEKKDDATDETSSSGRCQVNCFGGDWN---SSSDNEEFNDENLNSNILHQGHS

Query:  PSNNLVQVIQFGSLDIKYGAKNTVDVDSSRNPAVRRPLCFNEQPTLDSTDNRCSSLLPRKDLTTEVSSNSVTTMAGENVNLIIRRNRLQVFQEITVHQEL
        P    V+ I+F SLD+K GA+++ D+ SSRN A                                 +S   TTM GENV+L  R NRLQVFQE+TVHQEL
Subjt:  PSNNLVQVIQFGSLDIKYGAKNTVDVDSSRNPAVRRPLCFNEQPTLDSTDNRCSSLLPRKDLTTEVSSNSVTTMAGENVNLIIRRNRLQVFQEITVHQEL

Query:  E
        +
Subjt:  E

A0A6J1EK01 uncharacterized protein LOC1114340121.2e-23964.99Show/hide
Query:  MWANG------GRGFSTPPPPWKSRPFRSPKTAPFSERK------PNKSDLFHVIHKVPAGGSPYVKAKQVQLIDKDPSRAISLFWAAINTGDRVDSALK
        MW N       GRGFSTPPP W+SRPFRSPKTAPF +RK       NKSDLFHVIHK+PAG SPYVKAKQVQLIDKDPSRA+SLFWAAIN GDRVDSALK
Subjt:  MWANG------GRGFSTPPPPWKSRPFRSPKTAPFSERK------PNKSDLFHVIHKVPAGGSPYVKAKQVQLIDKDPSRAISLFWAAINTGDRVDSALK

Query:  DMAVVMKQLDRSDEAIEAIKSFRHLCPNESQESIDNVLIELYKRSGRIEEEIDMLQCKLKQIEEGSVFGGKRTKPARSQGKKVQITIEQEKSRVLGNLAW
        DMAVVMKQLDRSDEAIEAI+SFRHLC  +SQESIDNVLIELYKRSGRIEEEIDML+CKLKQIE+G+VFGGK+TK ARSQGKKVQIT+EQEKSRVLGNLAW
Subjt:  DMAVVMKQLDRSDEAIEAIKSFRHLCPNESQESIDNVLIELYKRSGRIEEEIDMLQCKLKQIEEGSVFGGKRTKPARSQGKKVQITIEQEKSRVLGNLAW

Query:  AFLQLDDYYIAEDYYREALSLELDNNKKCNLAICLISTNRLMEAKSLLQSVRASAGGKPMEEVYAKSFERASQMLAEKESKSPLDSAGKEEDKSSHATTI
        AFLQ D+  +AE+YYR+AL LE DNNK+CNLAICLI  NRL EAKS+LQS+RAS+GG  MEE YAKSFERAS MLAEKESK  L ++ ++E+ +S AT  
Subjt:  AFLQLDDYYIAEDYYREALSLELDNNKKCNLAICLISTNRLMEAKSLLQSVRASAGGKPMEEVYAKSFERASQMLAEKESKSPLDSAGKEEDKSSHATTI

Query:  TSKSIRAGPSVSSQFTASTRWTPDDEEVYINENSR------HYENKSF-GAVNSSHNYLHCDNSCIPNKMK----------RNQEQDGLLR-TEESFNCS
            + AG  V  Q TASTRWT  DEE+Y+NENSR       +EN+SF  + +     L   +SCI  KMK          R+++QDGLLR  +E  NC 
Subjt:  TSKSIRAGPSVSSQFTASTRWTPDDEEVYINENSR------HYENKSF-GAVNSSHNYLHCDNSCIPNKMK----------RNQEQDGLLR-TEESFNCS

Query:  SLYSSPILARGNTEVPSTRLKNSIREFDTRHGLNERRQQKGTTGGSSIRKVLFEKPSMKDQNFAAYASSESEGTRTPSNYDMKYKSAASNAFELEVPFTQ
        SLYSSP  A+ N EVP T+ KNS+ EF+ R  LNE RQ+K T+  SS RKVLF+     D  FA  ASSESE +   SNY  KY+SAAS+A ELEVPFTQ
Subjt:  SLYSSPILARGNTEVPSTRLKNSIREFDTRHGLNERRQQKGTTGGSSIRKVLFEKPSMKDQNFAAYASSESEGTRTPSNYDMKYKSAASNAFELEVPFTQ

Query:  PRSCSRGINGGGHQRKATECFRG-LPGSSIRKLSFEPTTSTEKIQ--------RSELSRAVSDEPRHLVVVRKQISRGDIEYEEAAVSYDPMKIKVEH--
        PRSCS GINGG  Q+K +ECFR  L  SS RKLSFEP TSTE  Q        RSELSRAVSDE              D+EYEE A+ YD MKI+ EH  
Subjt:  PRSCSRGINGGGHQRKATECFRG-LPGSSIRKLSFEPTTSTEKIQ--------RSELSRAVSDEPRHLVVVRKQISRGDIEYEEAAVSYDPMKIKVEH--

Query:  NSSTVGGKKSWADMVEEEEEEGGNNEKKDDATDETSSSGRCQVNCFGGDWNSSSDNEE--FNDENLNSNILHQ-GHSPSNNLVQVIQFGSLDIKYGAKNT
        NSS VGGKKSWADMVEEEEEE   + +K+D T+ETSSS R +VNCF  DW SSSDNEE  FNDENLNSNILHQ  HSP +         S  ++ GA+++
Subjt:  NSSTVGGKKSWADMVEEEEEEGGNNEKKDDATDETSSSGRCQVNCFGGDWNSSSDNEE--FNDENLNSNILHQ-GHSPSNNLVQVIQFGSLDIKYGAKNT

Query:  VDVDSSRNPAVRRPLCFNEQPTLDSTDNRCSSLLPRKDLTTEVSSNSVTTMAGENVNLIIRRNRLQVFQEITVHQEL
         DV SSRNPAVRRPLCF++QPTL+S DNR SS LP+KDLTTE          GENVNL IRRNRLQ+FQEITVHQEL
Subjt:  VDVDSSRNPAVRRPLCFNEQPTLDSTDNRCSSLLPRKDLTTEVSSNSVTTMAGENVNLIIRRNRLQVFQEITVHQEL

SwissProt top hitse value%identityAlignment
Q8GXU5 Protein SULFUR DEFICIENCY-INDUCED 19.9e-6655.56Show/hide
Query:  DLFHVIHKVPAGGSPYVKAKQVQLIDKDPSRAISLFWAAINTGDRVDSALKDMAVVMKQLDRSDEAIEAIKSFRHLCPNESQESIDNVLIELYKRSGRIE
        +LFHVIHKVP G +PYV+AK  QLI+K+P  AI  FW AINTGDRVDSALKDMAVVMKQLDRS+EAIEAIKSFR  C   SQ+S+DNVLI+LYK+ GR+E
Subjt:  DLFHVIHKVPAGGSPYVKAKQVQLIDKDPSRAISLFWAAINTGDRVDSALKDMAVVMKQLDRSDEAIEAIKSFRHLCPNESQESIDNVLIELYKRSGRIE

Query:  EEIDMLQCKLKQIEEGSVFGGKRTKPARSQGKKVQITIEQEKSRVLGNLAWAFLQLDDYYIAEDYYREALSLELDNNKKCNLAICLISTNRLMEAKSLLQ
        E++++L+ KL+QI +G  F GK TK ARS GKK Q+T++QE SR+LGNL WA++Q   Y  AE  YR+A  +E D NK CNLA+CLI   R  E + +L 
Subjt:  EEIDMLQCKLKQIEEGSVFGGKRTKPARSQGKKVQITIEQEKSRVLGNLAWAFLQLDDYYIAEDYYREALSLELDNNKKCNLAICLISTNRLMEAKSLLQ

Query:  SVRASAGGKPMEEVYAKSFERASQMLAEKESKSPLDSAGKEED
         V      + +     ++ +RA ++L+E ES  P     + ED
Subjt:  SVRASAGGKPMEEVYAKSFERASQMLAEKESKSPLDSAGKEED

Q8L730 Protein SULFUR DEFICIENCY-INDUCED 24.3e-6156.8Show/hide
Query:  ERKPNKSDLFHVIHKVPAGGSPYVKAKQVQLIDKDPSRAISLFWAAINTGDRVDSALKDMAVVMKQLDRSDEAIEAIKSFRHLCPNESQESIDNVLIELY
        ER+ + +  ++V+HK+P G SPYV+AK VQL++KD   AI LFW AI   DRVDSALKDMA++MKQ +R++EAI+AI+SFR LC  ++QES+DNVLI+LY
Subjt:  ERKPNKSDLFHVIHKVPAGGSPYVKAKQVQLIDKDPSRAISLFWAAINTGDRVDSALKDMAVVMKQLDRSDEAIEAIKSFRHLCPNESQESIDNVLIELY

Query:  KRSGRIEEEIDMLQCKLKQIEEGSVFGGKRTKPARSQGKKVQITIEQEKSRVLGNLAWAFLQLDDYYIAEDYYREALSLELDNNKKCNLAICLISTNRLM
        K+ GRIEE++++L+ KL  I +G  F GK TK ARS GKK Q+T+E+E SR+LGNL WA++QL DY  AE  YR+A  +E D NK CNL  CLI   +  
Subjt:  KRSGRIEEEIDMLQCKLKQIEEGSVFGGKRTKPARSQGKKVQITIEQEKSRVLGNLAWAFLQLDDYYIAEDYYREALSLELDNNKKCNLAICLISTNRLM

Query:  EAKSLL
        EA+S+L
Subjt:  EAKSLL

Q9FKV5 Protein POLLENLESS 3-LIKE 12.2e-7357.45Show/hide
Query:  GRGFSTPPPPWKSRPFRSPKTAPFSERKPNKSDLFHVIHKVPAGGSPYVKAKQVQLIDKDPSRAISLFWAAINTGDRVDSALKDMAVVMKQLDRSDEAIE
        G GF TPPP W +   R     P SERK +         +V  G SPYV+AK  QL+ KDP+RAISLFWAAIN GDRVDSALKDM VV+KQL+R DE IE
Subjt:  GRGFSTPPPPWKSRPFRSPKTAPFSERKPNKSDLFHVIHKVPAGGSPYVKAKQVQLIDKDPSRAISLFWAAINTGDRVDSALKDMAVVMKQLDRSDEAIE

Query:  AIKSFRHLCPNESQESIDNVLIELYKRSGRIEEEIDMLQCKLKQIEEGSVFGGKRTKPARSQGKKVQITIEQEKSRVLGNLAWAFLQLDDYYIAEDYYRE
        AIKSFR+LCP ESQ+SIDN+L+ELY +SGRI E  ++L+ KL+ +E+   +GG+     RS  ++   TIEQEK+R+LGNLAW  LQL +Y IAE YYR 
Subjt:  AIKSFRHLCPNESQESIDNVLIELYKRSGRIEEEIDMLQCKLKQIEEGSVFGGKRTKPARSQGKKVQITIEQEKSRVLGNLAWAFLQLDDYYIAEDYYRE

Query:  ALSLELDNNKKCNLAICLISTNRLMEAKSLLQSVRASAGGKPMEEVYAKSFERASQMLAEKESKSPLDSAGKEED
        ALSLE DNNK CNLAICLI   R  EAKSLL+ V+ S G +   E + KSFERA++MLAE+E  +  D   K ED
Subjt:  ALSLELDNNKKCNLAICLISTNRLMEAKSLLQSVRASAGGKPMEEVYAKSFERASQMLAEKESKSPLDSAGKEED

Q9SD20 Protein POLLENLESS 3-LIKE 21.6e-7156.08Show/hide
Query:  FRSPKTAPFSERKP-----NKSDLFHVIHKVPAGGSPYVKAKQVQLIDKDPSRAISLFWAAINTGDRVDSALKDMAVVMKQLDRSDEAIEAIKSFRHLCP
        FR  K+AP S  KP      +S+ FH IHKVP G SPYV+AK VQL++KDP RAI LFW AIN GDRVDSALKDMA+VMKQ +R++EAIEAIKS R  C 
Subjt:  FRSPKTAPFSERKP-----NKSDLFHVIHKVPAGGSPYVKAKQVQLIDKDPSRAISLFWAAINTGDRVDSALKDMAVVMKQLDRSDEAIEAIKSFRHLCP

Query:  NESQESIDNVLIELYKRSGRIEEEIDMLQCKLKQIEEGSVFGGKRTKPARSQGKKVQITIEQEKSRVLGNLAWAFLQLDDYYIAEDYYREALSLELDNNK
        +++QES+DN+L++LYKR GR++++I +L+ KL  I++G  F GKRTK ARSQGKK Q+++EQE +R+LGNL WA +Q D++  AED YR ALS+  DNNK
Subjt:  NESQESIDNVLIELYKRSGRIEEEIDMLQCKLKQIEEGSVFGGKRTKPARSQGKKVQITIEQEKSRVLGNLAWAFLQLDDYYIAEDYYREALSLELDNNK

Query:  KCNLAICLISTNRLMEAKSLLQSVR-ASAGGKPMEEVYAKSFERASQMLAEKESK
         CNL ICL+   R+ EAK  L+ V+ A   G    + + K++ERA QML +  S+
Subjt:  KCNLAICLISTNRLMEAKSLLQSVR-ASAGGKPMEEVYAKSFERASQMLAEKESK

Q9SUC3 Protein POLLENLESS 31.0e-7860.37Show/hide
Query:  FSTPPPPWKS-----RPFRSPKTAPFS-ERKPNKSDLFHVIHKVPAGGSPYVKAKQVQLIDKDPSRAISLFWAAINTGDRVDSALKDMAVVMKQLDRSDE
        + TPPP   S      P    +  P+S      + D FH++HKVP+G SPYV+AK  QLIDKDP+RAISLFW AIN GDRVDSALKDMAVVMKQL RSDE
Subjt:  FSTPPPPWKS-----RPFRSPKTAPFS-ERKPNKSDLFHVIHKVPAGGSPYVKAKQVQLIDKDPSRAISLFWAAINTGDRVDSALKDMAVVMKQLDRSDE

Query:  AIEAIKSFRHLCPNESQESIDNVLIELYKRSGRIEEEIDMLQCKLKQIEEGSVFGGKRTKPARSQGKKVQITIEQEKSRVLGNLAWAFLQLDDYYIAEDY
         IEAIKSFR+LC  ESQ+SIDN+L+ELYK+SGRIEEE  +L+ KL+ +E+G  FGG+ ++  R QGK V +TIEQEK+R+LGNL W  LQL +Y IAE +
Subjt:  AIEAIKSFRHLCPNESQESIDNVLIELYKRSGRIEEEIDMLQCKLKQIEEGSVFGGKRTKPARSQGKKVQITIEQEKSRVLGNLAWAFLQLDDYYIAEDY

Query:  YREALSLELDNNKKCNLAICLISTNRLMEAKSLLQSVRAS-AGGKPMEEVYAKSFERASQMLAEKESKSP
        YR AL LE D NK CNLAICL+  +R+ EAKSLL  VR S A  +  +E +AKS++RA +MLAE ESK P
Subjt:  YREALSLELDNNKKCNLAICLISTNRLMEAKSLLQSVRAS-AGGKPMEEVYAKSFERASQMLAEKESKSP

Arabidopsis top hitse value%identityAlignment
AT1G04770.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.1e-6256.8Show/hide
Query:  ERKPNKSDLFHVIHKVPAGGSPYVKAKQVQLIDKDPSRAISLFWAAINTGDRVDSALKDMAVVMKQLDRSDEAIEAIKSFRHLCPNESQESIDNVLIELY
        ER+ + +  ++V+HK+P G SPYV+AK VQL++KD   AI LFW AI   DRVDSALKDMA++MKQ +R++EAI+AI+SFR LC  ++QES+DNVLI+LY
Subjt:  ERKPNKSDLFHVIHKVPAGGSPYVKAKQVQLIDKDPSRAISLFWAAINTGDRVDSALKDMAVVMKQLDRSDEAIEAIKSFRHLCPNESQESIDNVLIELY

Query:  KRSGRIEEEIDMLQCKLKQIEEGSVFGGKRTKPARSQGKKVQITIEQEKSRVLGNLAWAFLQLDDYYIAEDYYREALSLELDNNKKCNLAICLISTNRLM
        K+ GRIEE++++L+ KL  I +G  F GK TK ARS GKK Q+T+E+E SR+LGNL WA++QL DY  AE  YR+A  +E D NK CNL  CLI   +  
Subjt:  KRSGRIEEEIDMLQCKLKQIEEGSVFGGKRTKPARSQGKKVQITIEQEKSRVLGNLAWAFLQLDDYYIAEDYYREALSLELDNNKKCNLAICLISTNRLM

Query:  EAKSLL
        EA+S+L
Subjt:  EAKSLL

AT3G51280.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.1e-7256.08Show/hide
Query:  FRSPKTAPFSERKP-----NKSDLFHVIHKVPAGGSPYVKAKQVQLIDKDPSRAISLFWAAINTGDRVDSALKDMAVVMKQLDRSDEAIEAIKSFRHLCP
        FR  K+AP S  KP      +S+ FH IHKVP G SPYV+AK VQL++KDP RAI LFW AIN GDRVDSALKDMA+VMKQ +R++EAIEAIKS R  C 
Subjt:  FRSPKTAPFSERKP-----NKSDLFHVIHKVPAGGSPYVKAKQVQLIDKDPSRAISLFWAAINTGDRVDSALKDMAVVMKQLDRSDEAIEAIKSFRHLCP

Query:  NESQESIDNVLIELYKRSGRIEEEIDMLQCKLKQIEEGSVFGGKRTKPARSQGKKVQITIEQEKSRVLGNLAWAFLQLDDYYIAEDYYREALSLELDNNK
        +++QES+DN+L++LYKR GR++++I +L+ KL  I++G  F GKRTK ARSQGKK Q+++EQE +R+LGNL WA +Q D++  AED YR ALS+  DNNK
Subjt:  NESQESIDNVLIELYKRSGRIEEEIDMLQCKLKQIEEGSVFGGKRTKPARSQGKKVQITIEQEKSRVLGNLAWAFLQLDDYYIAEDYYREALSLELDNNK

Query:  KCNLAICLISTNRLMEAKSLLQSVR-ASAGGKPMEEVYAKSFERASQMLAEKESK
         CNL ICL+   R+ EAK  L+ V+ A   G    + + K++ERA QML +  S+
Subjt:  KCNLAICLISTNRLMEAKSLLQSVR-ASAGGKPMEEVYAKSFERASQMLAEKESK

AT4G20900.1 Tetratricopeptide repeat (TPR)-like superfamily protein9.8e-7756.99Show/hide
Query:  FSTPPPPWKS-----RPFRSPKTAPFS-ERKPNKSDLFHVIHKVPAGGSPYVKAKQVQLIDKDPSRAISLFWAAINTGDRVDSALKDMAVVMKQLDRSDE
        + TPPP   S      P    +  P+S      + D FH++HKVP+G SPYV+AK  QLIDKDP+RAISLFW AIN GDRVDSALKDMAVVMKQL RSDE
Subjt:  FSTPPPPWKS-----RPFRSPKTAPFS-ERKPNKSDLFHVIHKVPAGGSPYVKAKQVQLIDKDPSRAISLFWAAINTGDRVDSALKDMAVVMKQLDRSDE

Query:  AIEAIKSFRHLCPNESQESIDNVLIELYKRSGRIEEEIDMLQCKLKQIEEGSVFGGKRTKPARSQGKKVQITIEQEKSRVLGNLAWAFLQLDDYYIAEDY
         IEAIKSFR+LC  ESQ+SIDN+L+ELYK+SGRIEEE  +L+ KL+ +E+G  FGG+ ++  R QGK V +TIEQEK+R+LGNL W  LQL +Y IAE +
Subjt:  AIEAIKSFRHLCPNESQESIDNVLIELYKRSGRIEEEIDMLQCKLKQIEEGSVFGGKRTKPARSQGKKVQITIEQEKSRVLGNLAWAFLQLDDYYIAEDY

Query:  YR----------------EALSLELDNNKKCNLAICLISTNRLMEAKSLLQSVRAS-AGGKPMEEVYAKSFERASQMLAEKESKSP
        YR                 AL LE D NK CNLAICL+  +R+ EAKSLL  VR S A  +  +E +AKS++RA +MLAE ESK P
Subjt:  YR----------------EALSLELDNNKKCNLAICLISTNRLMEAKSLLQSVRAS-AGGKPMEEVYAKSFERASQMLAEKESKSP

AT5G44330.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.6e-7457.45Show/hide
Query:  GRGFSTPPPPWKSRPFRSPKTAPFSERKPNKSDLFHVIHKVPAGGSPYVKAKQVQLIDKDPSRAISLFWAAINTGDRVDSALKDMAVVMKQLDRSDEAIE
        G GF TPPP W +   R     P SERK +         +V  G SPYV+AK  QL+ KDP+RAISLFWAAIN GDRVDSALKDM VV+KQL+R DE IE
Subjt:  GRGFSTPPPPWKSRPFRSPKTAPFSERKPNKSDLFHVIHKVPAGGSPYVKAKQVQLIDKDPSRAISLFWAAINTGDRVDSALKDMAVVMKQLDRSDEAIE

Query:  AIKSFRHLCPNESQESIDNVLIELYKRSGRIEEEIDMLQCKLKQIEEGSVFGGKRTKPARSQGKKVQITIEQEKSRVLGNLAWAFLQLDDYYIAEDYYRE
        AIKSFR+LCP ESQ+SIDN+L+ELY +SGRI E  ++L+ KL+ +E+   +GG+     RS  ++   TIEQEK+R+LGNLAW  LQL +Y IAE YYR 
Subjt:  AIKSFRHLCPNESQESIDNVLIELYKRSGRIEEEIDMLQCKLKQIEEGSVFGGKRTKPARSQGKKVQITIEQEKSRVLGNLAWAFLQLDDYYIAEDYYRE

Query:  ALSLELDNNKKCNLAICLISTNRLMEAKSLLQSVRASAGGKPMEEVYAKSFERASQMLAEKESKSPLDSAGKEED
        ALSLE DNNK CNLAICLI   R  EAKSLL+ V+ S G +   E + KSFERA++MLAE+E  +  D   K ED
Subjt:  ALSLELDNNKKCNLAICLISTNRLMEAKSLLQSVRASAGGKPMEEVYAKSFERASQMLAEKESKSPLDSAGKEED

AT5G48850.1 Tetratricopeptide repeat (TPR)-like superfamily protein7.0e-6755.56Show/hide
Query:  DLFHVIHKVPAGGSPYVKAKQVQLIDKDPSRAISLFWAAINTGDRVDSALKDMAVVMKQLDRSDEAIEAIKSFRHLCPNESQESIDNVLIELYKRSGRIE
        +LFHVIHKVP G +PYV+AK  QLI+K+P  AI  FW AINTGDRVDSALKDMAVVMKQLDRS+EAIEAIKSFR  C   SQ+S+DNVLI+LYK+ GR+E
Subjt:  DLFHVIHKVPAGGSPYVKAKQVQLIDKDPSRAISLFWAAINTGDRVDSALKDMAVVMKQLDRSDEAIEAIKSFRHLCPNESQESIDNVLIELYKRSGRIE

Query:  EEIDMLQCKLKQIEEGSVFGGKRTKPARSQGKKVQITIEQEKSRVLGNLAWAFLQLDDYYIAEDYYREALSLELDNNKKCNLAICLISTNRLMEAKSLLQ
        E++++L+ KL+QI +G  F GK TK ARS GKK Q+T++QE SR+LGNL WA++Q   Y  AE  YR+A  +E D NK CNLA+CLI   R  E + +L 
Subjt:  EEIDMLQCKLKQIEEGSVFGGKRTKPARSQGKKVQITIEQEKSRVLGNLAWAFLQLDDYYIAEDYYREALSLELDNNKKCNLAICLISTNRLMEAKSLLQ

Query:  SVRASAGGKPMEEVYAKSFERASQMLAEKESKSPLDSAGKEED
         V      + +     ++ +RA ++L+E ES  P     + ED
Subjt:  SVRASAGGKPMEEVYAKSFERASQMLAEKESKSPLDSAGKEED


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGGCGAACGGCGGCAGGGGGTTTTCGACCCCTCCTCCGCCGTGGAAATCGAGGCCTTTCCGGTCACCAAAAACGGCGCCGTTTTCCGAGAGGAAACCGAACAAATC
TGACCTTTTTCATGTCATTCACAAAGTTCCCGCTGGGGGTTCTCCCTATGTTAAGGCTAAACAAGTTCAGTTGATAGACAAAGATCCGAGTAGGGCGATATCTTTGTTTT
GGGCTGCGATAAATACCGGAGATCGGGTCGACAGTGCACTGAAAGACATGGCTGTAGTGATGAAGCAGTTGGACCGTTCCGATGAGGCGATTGAGGCGATCAAATCGTTT
CGCCATCTCTGCCCTAATGAATCTCAGGAATCTATTGATAATGTCTTGATTGAATTATACAAGAGATCTGGAAGGATAGAAGAAGAGATTGATATGCTTCAATGCAAACT
GAAACAAATCGAAGAAGGCTCGGTTTTTGGAGGGAAGAGGACGAAGCCTGCAAGATCTCAAGGGAAGAAGGTGCAAATTACCATTGAGCAAGAGAAATCAAGAGTTCTTG
GAAACTTGGCCTGGGCTTTCTTGCAGCTGGACGACTACTACATCGCTGAAGATTACTACCGGGAAGCTTTGTCTCTAGAGTTGGATAACAACAAGAAGTGTAATCTTGCA
ATCTGTCTGATCTCTACGAATCGGCTAATGGAAGCGAAGTCTCTGCTTCAGTCTGTAAGAGCTTCTGCTGGAGGCAAGCCAATGGAAGAAGTATATGCCAAATCATTTGA
ACGCGCGTCTCAGATGCTGGCTGAAAAAGAATCAAAGTCGCCGCTGGATTCAGCAGGGAAGGAAGAAGATAAAAGCAGCCACGCGACCACGATAACCTCGAAGAGCATTA
GAGCTGGGCCTTCTGTTTCTTCTCAGTTCACTGCATCTACAAGGTGGACTCCTGATGATGAAGAGGTGTACATTAATGAAAATAGTCGGCATTACGAGAACAAGTCATTT
GGAGCTGTGAATTCTTCACATAATTATCTGCATTGTGATAACTCTTGCATTCCCAACAAAATGAAGAGAAACCAAGAGCAGGATGGTTTATTGAGAACAGAAGAGAGTTT
CAACTGCAGCTCATTGTATTCATCTCCGATTCTAGCGAGAGGAAATACCGAAGTTCCCTCCACTCGACTGAAAAACTCCATTCGAGAATTTGATACTCGACACGGGCTGA
ATGAAAGGAGGCAGCAAAAAGGAACAACTGGTGGTTCAAGTATCAGGAAAGTTTTGTTTGAGAAGCCTTCAATGAAGGACCAAAACTTTGCTGCATATGCTTCTTCTGAA
TCTGAAGGAACTCGTACGCCTTCCAATTACGATATGAAGTACAAGTCTGCAGCTTCAAATGCATTTGAACTGGAAGTTCCATTTACACAACCAAGGAGTTGTTCAAGGGG
TATTAATGGAGGAGGACATCAAAGAAAGGCGACCGAGTGCTTCCGAGGTTTGCCCGGTAGTTCGATTAGAAAACTTTCGTTCGAGCCTACAACAAGCACTGAAAAAATTC
AAAGATCTGAACTTTCCAGAGCAGTGAGCGATGAACCTCGACATCTTGTTGTAGTCCGGAAGCAGATTTCTCGTGGAGACATCGAGTATGAAGAAGCGGCAGTGTCGTAC
GACCCGATGAAGATAAAGGTAGAACACAATTCATCAACAGTTGGTGGGAAGAAGAGTTGGGCAGATATGGTTGAAGAAGAAGAGGAGGAAGGCGGTAACAATGAAAAGAA
AGATGATGCAACGGATGAAACATCGTCGAGCGGAAGATGTCAAGTCAATTGCTTTGGTGGCGATTGGAATAGCAGCAGTGACAATGAAGAGTTCAATGATGAAAATCTAA
ATTCCAACATACTCCACCAGGGTCATAGTCCAAGCAACAATCTGGTTCAAGTAATACAATTTGGCTCACTTGATATAAAATATGGAGCTAAGAACACAGTCGATGTAGAT
TCATCGAGAAATCCAGCAGTACGGCGCCCTTTATGTTTCAATGAACAGCCTACGTTAGACTCGACCGATAACCGTTGCTCCTCACTGCTGCCAAGGAAAGATTTAACAAC
TGAAGTCTCCAGTAATTCTGTGACCACCATGGCTGGAGAGAATGTGAACTTGATAATTAGAAGAAACAGGTTGCAGGTATTTCAGGAGATAACAGTGCATCAAGAGCTAG
AATGTTAA
mRNA sequenceShow/hide mRNA sequence
ATCGCTTATCTTTCACGCCTTTTCACTTCGCTTCTCTCTGTATATTTGTTCTAACCACAATCACCCCATGCAACATCCAAACAACAATTACTGACCCAAGAACTTAAATC
TGAATAAAACCCATAAAAGAAAACTCGTTTTTACTCGCCGATTTTCAAACCCAAAATTTTTCTCGGGAAAATTCAGTCGGAGTTTCATGTGGGCGAACGGCGGCAGGGGG
TTTTCGACCCCTCCTCCGCCGTGGAAATCGAGGCCTTTCCGGTCACCAAAAACGGCGCCGTTTTCCGAGAGGAAACCGAACAAATCTGACCTTTTTCATGTCATTCACAA
AGTTCCCGCTGGGGGTTCTCCCTATGTTAAGGCTAAACAAGTTCAGTTGATAGACAAAGATCCGAGTAGGGCGATATCTTTGTTTTGGGCTGCGATAAATACCGGAGATC
GGGTCGACAGTGCACTGAAAGACATGGCTGTAGTGATGAAGCAGTTGGACCGTTCCGATGAGGCGATTGAGGCGATCAAATCGTTTCGCCATCTCTGCCCTAATGAATCT
CAGGAATCTATTGATAATGTCTTGATTGAATTATACAAGAGATCTGGAAGGATAGAAGAAGAGATTGATATGCTTCAATGCAAACTGAAACAAATCGAAGAAGGCTCGGT
TTTTGGAGGGAAGAGGACGAAGCCTGCAAGATCTCAAGGGAAGAAGGTGCAAATTACCATTGAGCAAGAGAAATCAAGAGTTCTTGGAAACTTGGCCTGGGCTTTCTTGC
AGCTGGACGACTACTACATCGCTGAAGATTACTACCGGGAAGCTTTGTCTCTAGAGTTGGATAACAACAAGAAGTGTAATCTTGCAATCTGTCTGATCTCTACGAATCGG
CTAATGGAAGCGAAGTCTCTGCTTCAGTCTGTAAGAGCTTCTGCTGGAGGCAAGCCAATGGAAGAAGTATATGCCAAATCATTTGAACGCGCGTCTCAGATGCTGGCTGA
AAAAGAATCAAAGTCGCCGCTGGATTCAGCAGGGAAGGAAGAAGATAAAAGCAGCCACGCGACCACGATAACCTCGAAGAGCATTAGAGCTGGGCCTTCTGTTTCTTCTC
AGTTCACTGCATCTACAAGGTGGACTCCTGATGATGAAGAGGTGTACATTAATGAAAATAGTCGGCATTACGAGAACAAGTCATTTGGAGCTGTGAATTCTTCACATAAT
TATCTGCATTGTGATAACTCTTGCATTCCCAACAAAATGAAGAGAAACCAAGAGCAGGATGGTTTATTGAGAACAGAAGAGAGTTTCAACTGCAGCTCATTGTATTCATC
TCCGATTCTAGCGAGAGGAAATACCGAAGTTCCCTCCACTCGACTGAAAAACTCCATTCGAGAATTTGATACTCGACACGGGCTGAATGAAAGGAGGCAGCAAAAAGGAA
CAACTGGTGGTTCAAGTATCAGGAAAGTTTTGTTTGAGAAGCCTTCAATGAAGGACCAAAACTTTGCTGCATATGCTTCTTCTGAATCTGAAGGAACTCGTACGCCTTCC
AATTACGATATGAAGTACAAGTCTGCAGCTTCAAATGCATTTGAACTGGAAGTTCCATTTACACAACCAAGGAGTTGTTCAAGGGGTATTAATGGAGGAGGACATCAAAG
AAAGGCGACCGAGTGCTTCCGAGGTTTGCCCGGTAGTTCGATTAGAAAACTTTCGTTCGAGCCTACAACAAGCACTGAAAAAATTCAAAGATCTGAACTTTCCAGAGCAG
TGAGCGATGAACCTCGACATCTTGTTGTAGTCCGGAAGCAGATTTCTCGTGGAGACATCGAGTATGAAGAAGCGGCAGTGTCGTACGACCCGATGAAGATAAAGGTAGAA
CACAATTCATCAACAGTTGGTGGGAAGAAGAGTTGGGCAGATATGGTTGAAGAAGAAGAGGAGGAAGGCGGTAACAATGAAAAGAAAGATGATGCAACGGATGAAACATC
GTCGAGCGGAAGATGTCAAGTCAATTGCTTTGGTGGCGATTGGAATAGCAGCAGTGACAATGAAGAGTTCAATGATGAAAATCTAAATTCCAACATACTCCACCAGGGTC
ATAGTCCAAGCAACAATCTGGTTCAAGTAATACAATTTGGCTCACTTGATATAAAATATGGAGCTAAGAACACAGTCGATGTAGATTCATCGAGAAATCCAGCAGTACGG
CGCCCTTTATGTTTCAATGAACAGCCTACGTTAGACTCGACCGATAACCGTTGCTCCTCACTGCTGCCAAGGAAAGATTTAACAACTGAAGTCTCCAGTAATTCTGTGAC
CACCATGGCTGGAGAGAATGTGAACTTGATAATTAGAAGAAACAGGTTGCAGGTATTTCAGGAGATAACAGTGCATCAAGAGCTAGAATGTTAAAAAAGAAGAAAAAAAA
GCTATCACCAAGAAGTTTGATGGCACTTCTAAGTTTGTAAAGAAT
Protein sequenceShow/hide protein sequence
MWANGGRGFSTPPPPWKSRPFRSPKTAPFSERKPNKSDLFHVIHKVPAGGSPYVKAKQVQLIDKDPSRAISLFWAAINTGDRVDSALKDMAVVMKQLDRSDEAIEAIKSF
RHLCPNESQESIDNVLIELYKRSGRIEEEIDMLQCKLKQIEEGSVFGGKRTKPARSQGKKVQITIEQEKSRVLGNLAWAFLQLDDYYIAEDYYREALSLELDNNKKCNLA
ICLISTNRLMEAKSLLQSVRASAGGKPMEEVYAKSFERASQMLAEKESKSPLDSAGKEEDKSSHATTITSKSIRAGPSVSSQFTASTRWTPDDEEVYINENSRHYENKSF
GAVNSSHNYLHCDNSCIPNKMKRNQEQDGLLRTEESFNCSSLYSSPILARGNTEVPSTRLKNSIREFDTRHGLNERRQQKGTTGGSSIRKVLFEKPSMKDQNFAAYASSE
SEGTRTPSNYDMKYKSAASNAFELEVPFTQPRSCSRGINGGGHQRKATECFRGLPGSSIRKLSFEPTTSTEKIQRSELSRAVSDEPRHLVVVRKQISRGDIEYEEAAVSY
DPMKIKVEHNSSTVGGKKSWADMVEEEEEEGGNNEKKDDATDETSSSGRCQVNCFGGDWNSSSDNEEFNDENLNSNILHQGHSPSNNLVQVIQFGSLDIKYGAKNTVDVD
SSRNPAVRRPLCFNEQPTLDSTDNRCSSLLPRKDLTTEVSSNSVTTMAGENVNLIIRRNRLQVFQEITVHQELEC