; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg035503 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg035503
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionPHD-type domain-containing protein
Genome locationscaffold3:736519..740171
RNA-Seq ExpressionSpg035503
SyntenySpg035503
Gene Ontology termsGO:0005524 - ATP binding (molecular function)
GO:0016887 - ATPase activity (molecular function)
InterPro domainsIPR003593 - AAA+ ATPase domain
IPR003959 - ATPase, AAA-type, core
IPR027417 - P-loop containing nucleoside triphosphate hydrolase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142881.1 uncharacterized protein LOC111012888 isoform X2 [Momordica charantia]3.7e-20851.32Show/hide
Query:  LTPPRGLLLHGYPGTSKTHVVRALVGSCARGNKRIAYFSRKGADGLGKYVGDSEKHLKQLFQVAEECQPSIIFFDEIYGLAASQTRQQDQTHTSVVSTLL
        +TPPRG+LLHGYPGT KTHVVRAL+GSCARG+KRIAYF+RKGAD LGKYVGD+E+ L+ LFQVAE CQPSIIFFDEI GLA  +TRQQDQTH SVVSTLL
Subjt:  LTPPRGLLLHGYPGTSKTHVVRALVGSCARGNKRIAYFSRKGADGLGKYVGDSEKHLKQLFQVAEECQPSIIFFDEIYGLAASQTRQQDQTHTSVVSTLL

Query:  ALLDGLKSR-------------------------------------------------------------------GFAGADLQALCTQTTIIALKRTFP
        ALLDGLKSR                                                                   GFAGADLQALCTQ  + ALKR FP
Subjt:  ALLDGLKSR-------------------------------------------------------------------GFAGADLQALCTQTTIIALKRTFP

Query:  LKQVLSASRDTCD-DHPTDLPVVVVEERDWLEALSSCPPPCSRREASLAANDVASSPLPSHLIPCLIRPLSTLLVSLYMDECICLPNSLSTAATIIKSVV
        LK+VLS S +    D+   LP ++VEERDWLEAL   PPPCSRREA +AANDVASSPLPSHLIP L++PLSTLLVSLY+DE I LP +L  AAT+IKSV+
Subjt:  LKQVLSASRDTCD-DHPTDLPVVVVEERDWLEALSSCPPPCSRREASLAANDVASSPLPSHLIPCLIRPLSTLLVSLYMDECICLPNSLSTAATIIKSVV

Query:  VSTLERKKIVSNKWWFHVDDFIQDNEIANEIERKWQGS---------------------EANIIEDSDKLIPEDNTTNAKQTSFDLGNNHGFRIMIAGNP
        +S L+ +KIV++ WW HV DF++D +IANEIE K QGS                     E +  E S       +TT  + TSFDLGN  GFRI+IAGNP
Subjt:  VSTLERKKIVSNKWWFHVDDFIQDNEIANEIERKWQGS---------------------EANIIEDSDKLIPEDNTTNAKQTSFDLGNNHGFRIMIAGNP

Query:  RSGQRHLASCIVNCFVEHAEIRKVDMATISQ-GYSNLGQGITYILANCLSRGSCLIFMSRVDLWAIESHDDQNLEHDTHSTEEQCIR-------------
        RSG RHLASC+++C+++H EIRKVD+ATISQ G+ +L QGI+ IL NC S GSC++FM R+DLWA+E  +    E D    E+Q                
Subjt:  RSGQRHLASCIVNCFVEHAEIRKVDMATISQ-GYSNLGQGITYILANCLSRGSCLIFMSRVDLWAIESHDDQNLEHDTHSTEEQCIR-------------

Query:  -------------------KATVQRASSAWTLFIDQ-----------ATSEVPFLLLPQEIRHFFRKDLSNCKKLVAQGHNIPRFSIQINEAFNHDLVIN
                           +  +  AS AW+ F++Q           ATSEVPFLLLPQEIR FFR DLS C+   ++ H +PRFS+QI+  FNHD+VIN
Subjt:  -------------------KATVQRASSAWTLFIDQ-----------ATSEVPFLLLPQEIRHFFRKDLSNCKKLVAQGHNIPRFSIQINEAFNHDLVIN

Query:  QSAAKLSKEMVKLLVLLIHQKNHVSCTSACNKKPPCDLIEEQARTSNYQKDKESI---SEDKELNDTNNT---------TSIEDTNLLSAISIFGYQILQ
        QSAA+LS+++ KLLV LIHQK+H   TSAC K     +I+ ++  +N Q DKE+    + +K+  D N           T    +NL+S IS FGYQILQ
Subjt:  QSAAKLSKEMVKLLVLLIHQKNHVSCTSACNKKPPCDLIEEQARTSNYQKDKESI---SEDKELNDTNNT---------TSIEDTNLLSAISIFGYQILQ

Query:  NPQFSVLCWVTSKFKEGPSANISKAQWEVWKFNSCVLHP--------TSSSDNVQTKENYGMVKGLVAVGLSATRGIYTSLQQVCSDVRAVLNVLVEKVN
         P F+ LCWVTSK KEGPS ++S   W+ W FNSC++ P        +SS  N ++KE  G+V+GLVAVGLSA RG YTSL++V  DVR VL++LVE++N
Subjt:  NPQFSVLCWVTSKFKEGPSANISKAQWEVWKFNSCVLHP--------TSSSDNVQTKENYGMVKGLVAVGLSATRGIYTSLQQVCSDVRAVLNVLVEKVN

Query:  AKVAAGKDRLQFIGILSQVASLEDTVHSWAYALQ
        AK++AGKDR Q++ +LSQVA LED V++WA+ LQ
Subjt:  AKVAAGKDRLQFIGILSQVASLEDTVHSWAYALQ

XP_022156988.1 uncharacterized protein LOC111023817 [Momordica charantia]2.6e-26264.25Show/hide
Query:  LTPPRGLLLHGYPGTSKTHVVRALVGSCARGNKRIAYFSRKGADGLGKYVGDSEKHLKQLFQVAEECQPSIIFFDEIYGLAASQTRQQDQTHTSVVSTLL
        + PPRG+LLHGYPGT KTHVVRALVGSCARGNK IAYFSRKGAD LGKYVGDSE+ L++LFQVA++CQPSIIFFDEI GLA  +TR+QDQTHTSVVSTLL
Subjt:  LTPPRGLLLHGYPGTSKTHVVRALVGSCARGNKRIAYFSRKGADGLGKYVGDSEKHLKQLFQVAEECQPSIIFFDEIYGLAASQTRQQDQTHTSVVSTLL

Query:  ALLDGLKSR---------------------------------------------------------------------GFAGADLQALCTQTTIIALKRT
        ALLDGLKSR                                                                     GFAGADLQALCTQT IIALKRT
Subjt:  ALLDGLKSR---------------------------------------------------------------------GFAGADLQALCTQTTIIALKRT

Query:  FPLKQVLSASRDTCDDHPTDLPVVVVEERDWLEALSSCPPPCSRREASLAANDVASSPLPSHLIPCLIRPLSTLLVSLYMDECICLPNSLSTAATIIKSV
        FPLKQVL AS+DT    PT LP V+VEERDWLEALSSCPPPCSRREA++AANDV SSPLPSHLIPCLIRPLS LLVSLY+++CICLP +LS AA II++V
Subjt:  FPLKQVLSASRDTCDDHPTDLPVVVVEERDWLEALSSCPPPCSRREASLAANDVASSPLPSHLIPCLIRPLSTLLVSLYMDECICLPNSLSTAATIIKSV

Query:  VVSTLERKKIVSNKWWFHVDDFIQDNEIANEIERKWQGSEANIIE-----DSDKLIPEDNTTNAKQTSFDLGNNHGFRIMIAGNPRSGQRHLASCIVNCF
        ++S LERK   SNKWW HVDDFIQ+++IA+EIERKWQ    NI+E     +S + IPED   NAK TSFDLGN+ GFRIMIAGNP  GQRHLASCI+NCF
Subjt:  VVSTLERKKIVSNKWWFHVDDFIQDNEIANEIERKWQGSEANIIE-----DSDKLIPEDNTTNAKQTSFDLGNNHGFRIMIAGNPRSGQRHLASCIVNCF

Query:  VEHAEIRKVDMATISQGYSNLGQGITYILANCLSRGSCLIFMSRVDLWAIESHDDQNLEHD-THSTEEQCIRKATVQRASSAWTLFIDQ-----------
        V H EIRKVDMAT SQGY+NL QGI++ LANCLSRG CLIFM R+DLWA+E+H+DQ LEHD TH T+EQCIRKA V RASSAWTLFI Q           
Subjt:  VEHAEIRKVDMATISQGYSNLGQGITYILANCLSRGSCLIFMSRVDLWAIESHDDQNLEHD-THSTEEQCIRKATVQRASSAWTLFIDQ-----------

Query:  --ATSEVPFLLLPQEIRHFFRKDLSNCKKLVAQGHNIPRFSIQINEAFNHDLVINQSAAKLSKEMVKLLVLLIHQKNHVSCTSACNKKPPCDLIEEQART
          ATSEVPFLLLPQEIR FF  DL++CK+LVA  H IPRFSIQI+E FNHDLVI++SAA LSK+MVKLL LLIHQK HV+           +   E  R+
Subjt:  --ATSEVPFLLLPQEIRHFFRKDLSNCKKLVAQGHNIPRFSIQINEAFNHDLVINQSAAKLSKEMVKLLVLLIHQKNHVSCTSACNKKPPCDLIEEQART

Query:  SNYQKDKESISEDKELNDTNNTTSIEDTNLLSAISIFGYQILQNPQFSVLCWVTSKFKEGPSANISKAQWEVWKFNSCVLHPTSSSDNVQTKENYGMVKG
        SNYQKDKE                         IS FG QILQN  +S LC  TSK KEGPSANISK+ WE W+FN CVLH TSS D V+T+ENYGMVKG
Subjt:  SNYQKDKESISEDKELNDTNNTTSIEDTNLLSAISIFGYQILQNPQFSVLCWVTSKFKEGPSANISKAQWEVWKFNSCVLHPTSSSDNVQTKENYGMVKG

Query:  LVAVGLSATRGIYTSLQQVCSDVRAVLNVLVEKVNAKVAAGKDRLQFIGILSQVASLEDTVHSWAYALQMYY
        LV++GL ATRG+YTSLQQVCSDVR VL++ VEK+NAKV AGK+ LQ+  ILSQVASLEDTV++WAY LQ YY
Subjt:  LVAVGLSATRGIYTSLQQVCSDVRAVLNVLVEKVNAKVAAGKDRLQFIGILSQVASLEDTVHSWAYALQMYY

XP_022922672.1 uncharacterized protein LOC111430605 [Cucurbita moschata]2.9e-20550.3Show/hide
Query:  LTPPRGLLLHGYPGTSKTHVVRALVGSCARGNKRIAYFSRKGADGLGKYVGDSEKHLKQLFQVAEECQPSIIFFDEIYGLAASQTRQQDQTHTSVVSTLL
        +TPPRG+LLHGYPGT KTHVVRAL+GSCARG+KRIAYF+RKGAD LGKYVGD+E+ L+ LFQVAE+CQPSIIFFDEI GLA  +TRQQDQTH SVVSTLL
Subjt:  LTPPRGLLLHGYPGTSKTHVVRALVGSCARGNKRIAYFSRKGADGLGKYVGDSEKHLKQLFQVAEECQPSIIFFDEIYGLAASQTRQQDQTHTSVVSTLL

Query:  ALLDGLKSR-------------------------------------------------------------------GFAGADLQALCTQTTIIALKRTFP
        ALLDGLKSR                                                                   GFAGADLQALCTQ  + ALKR FP
Subjt:  ALLDGLKSR-------------------------------------------------------------------GFAGADLQALCTQTTIIALKRTFP

Query:  LKQVLSASRDTCD-DHPTDLPVVVVEERDWLEALSSCPPPCSRREASLAANDVASSPLPSHLIPCLIRPLSTLLVSLYMDECICLPNSLSTAATIIKSVV
        LK+VLSAS +    D+   LP ++VEERDWLEAL   PPPCSRREA +AANDVASSPLPSHLIPCL++PLSTLLVSLY+DE I LP +L  AAT+IKSV+
Subjt:  LKQVLSASRDTCD-DHPTDLPVVVVEERDWLEALSSCPPPCSRREASLAANDVASSPLPSHLIPCLIRPLSTLLVSLYMDECICLPNSLSTAATIIKSVV

Query:  VSTLERKKIVSNKWWFHVDDFIQDNEIANEIERKWQGS----EANIIEDSDKLIPEDNTTNAK---------------QTSFDLGNNHGFRIMIAGNPRS
        V+ L+ +KIV++ WW +V DF+QD ++ANEIE K QGS    E +    S  L  E    ++K               ++ F L N  GFRI+IAGNPRS
Subjt:  VSTLERKKIVSNKWWFHVDDFIQDNEIANEIERKWQGS----EANIIEDSDKLIPEDNTTNAK---------------QTSFDLGNNHGFRIMIAGNPRS

Query:  GQRHLASCIVNCFVEHAEIRKVDMATISQ-GYSNLGQGITYILANCLSRGSCLIFMSRVDLWAIESHDDQNLE------------------------HDT
        G RHLASC+++C+++H EIRKVD+ATISQ G+ +L QGI+ IL NC S GSCL+FM R+DLWAIES +  + E                         + 
Subjt:  GQRHLASCIVNCFVEHAEIRKVDMATISQ-GYSNLGQGITYILANCLSRGSCLIFMSRVDLWAIESHDDQNLE------------------------HDT

Query:  HSTEEQC-------IRKATVQRASSAWTLFIDQ-----------ATSEVPFLLLPQEIRHFFRKDLSNCKKLVAQGHNIPRFSIQINEAFNHDLVINQSA
        HS  +Q        ++  ++  AS AW+ F++Q           ATSEVPFLLLPQEIR FFR DLS C+    + H++PRFS+QI+  F+HD+VINQSA
Subjt:  HSTEEQC-------IRKATVQRASSAWTLFIDQ-----------ATSEVPFLLLPQEIRHFFRKDLSNCKKLVAQGHNIPRFSIQINEAFNHDLVINQSA

Query:  AKLSKEMVKLLVLLIHQKNHVSCTSACNKKPPCDLIEEQARTSNYQKDKESISE---DKELNDTNNT---------TSIEDTNLLSAISIFGYQILQNPQ
         +LS++++KLLV LIHQK H   TS C K     + ++++  ++ + DKE+ SE   +K+  D ++          T    +NL S IS FGYQIL+ P 
Subjt:  AKLSKEMVKLLVLLIHQKNHVSCTSACNKKPPCDLIEEQARTSNYQKDKESISE---DKELNDTNNT---------TSIEDTNLLSAISIFGYQILQNPQ

Query:  FSVLCWVTSKFKEGPSANISKAQWEVWKFNSCVLHP--------TSSSDNVQTKENYGMVKGLVAVGLSATRGIYTSLQQVCSDVRAVLNVLVEKVNAKV
        F+ LCWVTSK KEGPSA++S   W+ W FNSC++ P        +SS+ N ++KE  GMV+GL+AVGLSA RG YTSL++V  DVR VL++LVE++NAK+
Subjt:  FSVLCWVTSKFKEGPSANISKAQWEVWKFNSCVLHP--------TSSSDNVQTKENYGMVKGLVAVGLSATRGIYTSLQQVCSDVRAVLNVLVEKVNAKV

Query:  AAGKDRLQFIGILSQVASLEDTVHSWAYALQMYYPLLINSHTSQN
         +GKDR Q+  +LSQVA LED V+SW + L        N  TS N
Subjt:  AAGKDRLQFIGILSQVASLEDTVHSWAYALQMYYPLLINSHTSQN

XP_031739139.1 uncharacterized protein LOC101208571 [Cucumis sativus]5.3e-20750.84Show/hide
Query:  LTPPRGLLLHGYPGTSKTHVVRALVGSCARGNKRIAYFSRKGADGLGKYVGDSEKHLKQLFQVAEECQPSIIFFDEIYGLAASQTRQQDQTHTSVVSTLL
        +TPPRG+LLHGYPGT KTHVVRAL+GSCARG+KRIAYF+RKGAD LGKYVGD+E+ L+ LFQVAE+CQPSIIFFDEI GLA  +TRQQDQTH SVVSTLL
Subjt:  LTPPRGLLLHGYPGTSKTHVVRALVGSCARGNKRIAYFSRKGADGLGKYVGDSEKHLKQLFQVAEECQPSIIFFDEIYGLAASQTRQQDQTHTSVVSTLL

Query:  ALLDGLKSR-------------------------------------------------------------------GFAGADLQALCTQTTIIALKRTFP
        ALLDGLKSR                                                                   GFAGADLQALCTQ  + ALKR FP
Subjt:  ALLDGLKSR-------------------------------------------------------------------GFAGADLQALCTQTTIIALKRTFP

Query:  LKQVLSASRDTCDD-HPTDLPVVVVEERDWLEALSSCPPPCSRREASLAANDVASSPLPSHLIPCLIRPLSTLLVSLYMDECICLPNSLSTAATIIKSVV
        LK+VLSAS +     +   LP ++VEERDWLEAL   PPPCSRREA +AANDV SSPLP HLIPCL++PLSTLLVSLY+DE I LP +L  AAT+IKSV+
Subjt:  LKQVLSASRDTCDD-HPTDLPVVVVEERDWLEALSSCPPPCSRREASLAANDVASSPLPSHLIPCLIRPLSTLLVSLYMDECICLPNSLSTAATIIKSVV

Query:  VSTLERKKIVSNKWWFHVDDFIQDNEIANEIERKWQGSEANI------------IEDSDKLIPEDN--------TTNAKQTSFDLGNNHGFRIMIAGNPR
        VS L+ KKIV++ WW HV DF+QD +IANEIE K QGS   +            ++ S++    +N        +T  + +SF LGN  GFRI+IAGNPR
Subjt:  VSTLERKKIVSNKWWFHVDDFIQDNEIANEIERKWQGSEANI------------IEDSDKLIPEDN--------TTNAKQTSFDLGNNHGFRIMIAGNPR

Query:  SGQRHLASCIVNCFVEHAEIRKVDMATISQ-GYSNLGQGITYILANCLSRGSCLIFMSRVDLWAIES------------HDDQNLEHDT-----------
        SG RHLASC+++C+++H E+RKVD+ATISQ G+ +L QGI+ IL NC S GSCL+FM R+DLWAIE+            ++DQ LE  T           
Subjt:  SGQRHLASCIVNCFVEHAEIRKVDMATISQ-GYSNLGQGITYILANCLSRGSCLIFMSRVDLWAIES------------HDDQNLEHDT-----------

Query:  ---------HSTEEQCIRKATVQRASSAWTLFIDQ-----------ATSEVPFLLLPQEIRHFFRKDLSNCKKLVAQGHNIPRFSIQINEAFNHDLVINQ
                  STE   ++   +  AS AW+ F++Q           ATSEVPFLLLPQEIR FFR DLS C+   ++ H++PRFS+QI+  F+HD+VINQ
Subjt:  ---------HSTEEQCIRKATVQRASSAWTLFIDQ-----------ATSEVPFLLLPQEIRHFFRKDLSNCKKLVAQGHNIPRFSIQINEAFNHDLVINQ

Query:  SAAKLSKEMVKLLVLLIHQKNHVSCTSACNKKPPCDLIEEQARTSNYQKDKESISEDKELNDTNNTTSIE------------DTNLLSAISIFGYQILQN
        SAA+LS+++VKLLV LIHQK+H   T  C K     +I+++    N Q DKE+ SE      + + +S+              +NL+S IS FG+QIL+ 
Subjt:  SAAKLSKEMVKLLVLLIHQKNHVSCTSACNKKPPCDLIEEQARTSNYQKDKESISEDKELNDTNNTTSIE------------DTNLLSAISIFGYQILQN

Query:  PQFSVLCWVTSKFKEGPSANISKAQWEVWKFNSCVLHP---------TSSSDNVQTKENYGMVKGLVAVGLSATRGIYTSLQQVCSDVRAVLNVLVEKVN
        P F+ LCWVTSK KEGP A++S   W+ W FNSC++ P         +SS  N ++KE  G+V+GL+AVGLSA RG YTSL++V  DVR VL +LVE++N
Subjt:  PQFSVLCWVTSKFKEGPSANISKAQWEVWKFNSCVLHP---------TSSSDNVQTKENYGMVKGLVAVGLSATRGIYTSLQQVCSDVRAVLNVLVEKVN

Query:  AKVAAGKDRLQFIGILSQVASLEDTVHSWAYALQ
        AK+ +GK+R Q+  +LSQVA LED V+SWA+ LQ
Subjt:  AKVAAGKDRLQFIGILSQVASLEDTVHSWAYALQ

XP_038898386.1 uncharacterized protein LOC120086038 [Benincasa hispida]3.3e-20950.96Show/hide
Query:  LTPPRGLLLHGYPGTSKTHVVRALVGSCARGNKRIAYFSRKGADGLGKYVGDSEKHLKQLFQVAEECQPSIIFFDEIYGLAASQTRQQDQTHTSVVSTLL
        +TPPRG+LLHGYPGT KTHVVRAL+GSCARG+KRIAYF+RKGAD LGKYVGD+E+ L+ LFQVAE CQPSIIFFDEI GLA  +TRQQDQTH SVVSTLL
Subjt:  LTPPRGLLLHGYPGTSKTHVVRALVGSCARGNKRIAYFSRKGADGLGKYVGDSEKHLKQLFQVAEECQPSIIFFDEIYGLAASQTRQQDQTHTSVVSTLL

Query:  ALLDGLKSR-------------------------------------------------------------------GFAGADLQALCTQTTIIALKRTFP
        ALLDGLKSR                                                                   GFAGADLQALCTQ  + ALKR FP
Subjt:  ALLDGLKSR-------------------------------------------------------------------GFAGADLQALCTQTTIIALKRTFP

Query:  LKQVLSASRDTCD-DHPTDLPVVVVEERDWLEALSSCPPPCSRREASLAANDVASSPLPSHLIPCLIRPLSTLLVSLYMDECICLPNSLSTAATIIKSVV
        LK+VLSAS +    D+   LP ++VEERDWLEAL   PPPCSRREA +AANDVASSPLP HLIPCL+RPLSTLL+SLY+DE I LP +L  AAT+IKSV+
Subjt:  LKQVLSASRDTCD-DHPTDLPVVVVEERDWLEALSSCPPPCSRREASLAANDVASSPLPSHLIPCLIRPLSTLLVSLYMDECICLPNSLSTAATIIKSVV

Query:  VSTLERKKIVSNKWWFHVDDFIQDNEIANEIERKWQGSEANIIEDS---DKLIPEDNTTN------------------AKQTSFDLGNNHGFRIMIAGNP
        VS L+ ++IV++ WW HV DF+Q+ +IANEIE K QGS   ++EDS     ++  ++T+N                   +QTSF LGN  GFRI+IAGNP
Subjt:  VSTLERKKIVSNKWWFHVDDFIQDNEIANEIERKWQGSEANIIEDS---DKLIPEDNTTN------------------AKQTSFDLGNNHGFRIMIAGNP

Query:  RSGQRHLASCIVNCFVEHAEIRKVDMATISQ-GYSNLGQGITYILANCLSRGSCLIFMSRVDLWAIESH------------------------DDQNL--
        RSG RHLASC+++C+++H EIRKVD+ATI Q G+ +L QGI+ IL NC S GSCL+FM R+DLWAIE+                         DD +L  
Subjt:  RSGQRHLASCIVNCFVEHAEIRKVDMATISQ-GYSNLGQGITYILANCLSRGSCLIFMSRVDLWAIESH------------------------DDQNL--

Query:  ------EHDTHSTEEQCIRKATVQRASSAWTLFIDQ-----------ATSEVPFLLLPQEIRHFFRKDLSNCKKLVAQGHNIPRFSIQINEAFNHDLVIN
                 + STE   ++  ++  +S AW+ F++Q           ATSEVPFLLLPQEIR FFR DLS C+   ++ H++PRFS+QI+  F+HD+VIN
Subjt:  ------EHDTHSTEEQCIRKATVQRASSAWTLFIDQ-----------ATSEVPFLLLPQEIRHFFRKDLSNCKKLVAQGHNIPRFSIQINEAFNHDLVIN

Query:  QSAAKLSKEMVKLLVLLIHQKNHVSCTSACNKKPPCDLIEEQARTSNYQKDKESISEDKELNDTNNTTSIE------------DTNLLSAISIFGYQILQ
        QSAA+LS+++VKLLV LIHQK+H   +S C K     +++ ++  +N Q DKE+ SE      +++ +S+              +NL+S IS FGYQIL+
Subjt:  QSAAKLSKEMVKLLVLLIHQKNHVSCTSACNKKPPCDLIEEQARTSNYQKDKESISEDKELNDTNNTTSIE------------DTNLLSAISIFGYQILQ

Query:  NPQFSVLCWVTSKFKEGPSANISKAQWEVWKFNSCVLHP--------TSSSDNVQTKENYGMVKGLVAVGLSATRGIYTSLQQVCSDVRAVLNVLVEKVN
         P F+ LCWVTSK KEGP A++S   W+ W FNSC++ P        +SS  N ++KE  G+V+GL+AVGLSA RG YTSL++V  DVR VL++LVE++N
Subjt:  NPQFSVLCWVTSKFKEGPSANISKAQWEVWKFNSCVLHP--------TSSSDNVQTKENYGMVKGLVAVGLSATRGIYTSLQQVCSDVRAVLNVLVEKVN

Query:  AKVAAGKDRLQFIGILSQVASLEDTVHSWAYALQ
        AK+ +GKDR Q+  +LSQVA LED V+SWA+ LQ
Subjt:  AKVAAGKDRLQFIGILSQVASLEDTVHSWAYALQ

TrEMBL top hitse value%identityAlignment
A0A0A0L9H9 PHD-type domain-containing protein2.6e-20750.84Show/hide
Query:  LTPPRGLLLHGYPGTSKTHVVRALVGSCARGNKRIAYFSRKGADGLGKYVGDSEKHLKQLFQVAEECQPSIIFFDEIYGLAASQTRQQDQTHTSVVSTLL
        +TPPRG+LLHGYPGT KTHVVRAL+GSCARG+KRIAYF+RKGAD LGKYVGD+E+ L+ LFQVAE+CQPSIIFFDEI GLA  +TRQQDQTH SVVSTLL
Subjt:  LTPPRGLLLHGYPGTSKTHVVRALVGSCARGNKRIAYFSRKGADGLGKYVGDSEKHLKQLFQVAEECQPSIIFFDEIYGLAASQTRQQDQTHTSVVSTLL

Query:  ALLDGLKSR-------------------------------------------------------------------GFAGADLQALCTQTTIIALKRTFP
        ALLDGLKSR                                                                   GFAGADLQALCTQ  + ALKR FP
Subjt:  ALLDGLKSR-------------------------------------------------------------------GFAGADLQALCTQTTIIALKRTFP

Query:  LKQVLSASRDTCDD-HPTDLPVVVVEERDWLEALSSCPPPCSRREASLAANDVASSPLPSHLIPCLIRPLSTLLVSLYMDECICLPNSLSTAATIIKSVV
        LK+VLSAS +     +   LP ++VEERDWLEAL   PPPCSRREA +AANDV SSPLP HLIPCL++PLSTLLVSLY+DE I LP +L  AAT+IKSV+
Subjt:  LKQVLSASRDTCDD-HPTDLPVVVVEERDWLEALSSCPPPCSRREASLAANDVASSPLPSHLIPCLIRPLSTLLVSLYMDECICLPNSLSTAATIIKSVV

Query:  VSTLERKKIVSNKWWFHVDDFIQDNEIANEIERKWQGSEANI------------IEDSDKLIPEDN--------TTNAKQTSFDLGNNHGFRIMIAGNPR
        VS L+ KKIV++ WW HV DF+QD +IANEIE K QGS   +            ++ S++    +N        +T  + +SF LGN  GFRI+IAGNPR
Subjt:  VSTLERKKIVSNKWWFHVDDFIQDNEIANEIERKWQGSEANI------------IEDSDKLIPEDN--------TTNAKQTSFDLGNNHGFRIMIAGNPR

Query:  SGQRHLASCIVNCFVEHAEIRKVDMATISQ-GYSNLGQGITYILANCLSRGSCLIFMSRVDLWAIES------------HDDQNLEHDT-----------
        SG RHLASC+++C+++H E+RKVD+ATISQ G+ +L QGI+ IL NC S GSCL+FM R+DLWAIE+            ++DQ LE  T           
Subjt:  SGQRHLASCIVNCFVEHAEIRKVDMATISQ-GYSNLGQGITYILANCLSRGSCLIFMSRVDLWAIES------------HDDQNLEHDT-----------

Query:  ---------HSTEEQCIRKATVQRASSAWTLFIDQ-----------ATSEVPFLLLPQEIRHFFRKDLSNCKKLVAQGHNIPRFSIQINEAFNHDLVINQ
                  STE   ++   +  AS AW+ F++Q           ATSEVPFLLLPQEIR FFR DLS C+   ++ H++PRFS+QI+  F+HD+VINQ
Subjt:  ---------HSTEEQCIRKATVQRASSAWTLFIDQ-----------ATSEVPFLLLPQEIRHFFRKDLSNCKKLVAQGHNIPRFSIQINEAFNHDLVINQ

Query:  SAAKLSKEMVKLLVLLIHQKNHVSCTSACNKKPPCDLIEEQARTSNYQKDKESISEDKELNDTNNTTSIE------------DTNLLSAISIFGYQILQN
        SAA+LS+++VKLLV LIHQK+H   T  C K     +I+++    N Q DKE+ SE      + + +S+              +NL+S IS FG+QIL+ 
Subjt:  SAAKLSKEMVKLLVLLIHQKNHVSCTSACNKKPPCDLIEEQARTSNYQKDKESISEDKELNDTNNTTSIE------------DTNLLSAISIFGYQILQN

Query:  PQFSVLCWVTSKFKEGPSANISKAQWEVWKFNSCVLHP---------TSSSDNVQTKENYGMVKGLVAVGLSATRGIYTSLQQVCSDVRAVLNVLVEKVN
        P F+ LCWVTSK KEGP A++S   W+ W FNSC++ P         +SS  N ++KE  G+V+GL+AVGLSA RG YTSL++V  DVR VL +LVE++N
Subjt:  PQFSVLCWVTSKFKEGPSANISKAQWEVWKFNSCVLHP---------TSSSDNVQTKENYGMVKGLVAVGLSATRGIYTSLQQVCSDVRAVLNVLVEKVN

Query:  AKVAAGKDRLQFIGILSQVASLEDTVHSWAYALQ
        AK+ +GK+R Q+  +LSQVA LED V+SWA+ LQ
Subjt:  AKVAAGKDRLQFIGILSQVASLEDTVHSWAYALQ

A0A6J1CM60 uncharacterized protein LOC111012888 isoform X14.1e-20549.77Show/hide
Query:  LTPPRGLLLHGYPGTSKTHVVRALVGSCARGNKRIAYFSRKGADGLGKYVGDSEKHLKQLFQVAEECQPSIIFFDEIYGLAASQTRQQDQTHTSVVSTLL
        +TPPRG+LLHGYPGT KTHVVRAL+GSCARG+KRIAYF+RKGAD LGKYVGD+E+ L+ LFQVAE CQPSIIFFDEI GLA  +TRQQDQTH SVVSTLL
Subjt:  LTPPRGLLLHGYPGTSKTHVVRALVGSCARGNKRIAYFSRKGADGLGKYVGDSEKHLKQLFQVAEECQPSIIFFDEIYGLAASQTRQQDQTHTSVVSTLL

Query:  ALLDGLKSR-------------------------------------------------------------------GFAGADLQALCTQTTIIALKRTFP
        ALLDGLKSR                                                                   GFAGADLQALCTQ  + ALKR FP
Subjt:  ALLDGLKSR-------------------------------------------------------------------GFAGADLQALCTQTTIIALKRTFP

Query:  LKQVLSASRDTCD-DHPTDLPVVVVEERDWLEALSSCPPPCSRREASLAANDVASSPLPSHLIPCLIRPLSTLLVSLYMDECICLPNSLSTAATIIKSVV
        LK+VLS S +    D+   LP ++VEERDWLEAL   PPPCSRREA +AANDVASSPLPSHLIP L++PLSTLLVSLY+DE I LP +L  AAT+IKSV+
Subjt:  LKQVLSASRDTCD-DHPTDLPVVVVEERDWLEALSSCPPPCSRREASLAANDVASSPLPSHLIPCLIRPLSTLLVSLYMDECICLPNSLSTAATIIKSVV

Query:  VSTLERKKIVSNKWWFHVDDFIQDNEIANEIERKWQGS---------------------EANIIEDSDKLIPEDNTTNAKQTSFDLGNNHGFRIMIAGNP
        +S L+ +KIV++ WW HV DF++D +IANEIE K QGS                     E +  E S       +TT  + TSFDLGN  GFRI+IAGNP
Subjt:  VSTLERKKIVSNKWWFHVDDFIQDNEIANEIERKWQGS---------------------EANIIEDSDKLIPEDNTTNAKQTSFDLGNNHGFRIMIAGNP

Query:  RSGQRHLASCIVNCFVEHAEIRKVDMATISQ-GYSNLGQGITYILANCLSRGSCLIFMSRVDLWAIESHDDQNLEHD-----------------------
        RSG RHLASC+++C+++H EIRKVD+ATISQ G+ +L QGI+ IL NC S GSC++FM R+DLWA+E  +    E D                       
Subjt:  RSGQRHLASCIVNCFVEHAEIRKVDMATISQ-GYSNLGQGITYILANCLSRGSCLIFMSRVDLWAIESHDDQNLEHD-----------------------

Query:  ----------THSTE-----------------------------EQCIRKATVQRASSAWTLFIDQ-----------ATSEVPFLLLPQEIRHFFRKDLS
                  + STE                             E+   +  +  AS AW+ F++Q           ATSEVPFLLLPQEIR FFR DLS
Subjt:  ----------THSTE-----------------------------EQCIRKATVQRASSAWTLFIDQ-----------ATSEVPFLLLPQEIRHFFRKDLS

Query:  NCKKLVAQGHNIPRFSIQINEAFNHDLVINQSAAKLSKEMVKLLVLLIHQKNHVSCTSACNKKPPCDLIEEQARTSNYQKDKESI---SEDKELNDTNNT
         C+   ++ H +PRFS+QI+  FNHD+VINQSAA+LS+++ KLLV LIHQK+H   TSAC K     +I+ ++  +N Q DKE+    + +K+  D N  
Subjt:  NCKKLVAQGHNIPRFSIQINEAFNHDLVINQSAAKLSKEMVKLLVLLIHQKNHVSCTSACNKKPPCDLIEEQARTSNYQKDKESI---SEDKELNDTNNT

Query:  ---------TSIEDTNLLSAISIFGYQILQNPQFSVLCWVTSKFKEGPSANISKAQWEVWKFNSCVLHP--------TSSSDNVQTKENYGMVKGLVAVG
                 T    +NL+S IS FGYQILQ P F+ LCWVTSK KEGPS ++S   W+ W FNSC++ P        +SS  N ++KE  G+V+GLVAVG
Subjt:  ---------TSIEDTNLLSAISIFGYQILQNPQFSVLCWVTSKFKEGPSANISKAQWEVWKFNSCVLHP--------TSSSDNVQTKENYGMVKGLVAVG

Query:  LSATRGIYTSLQQVCSDVRAVLNVLVEKVNAKVAAGKDRLQFIGILSQVASLEDTVHSWAYALQ
        LSA RG YTSL++V  DVR VL++LVE++NAK++AGKDR Q++ +LSQVA LED V++WA+ LQ
Subjt:  LSATRGIYTSLQQVCSDVRAVLNVLVEKVNAKVAAGKDRLQFIGILSQVASLEDTVHSWAYALQ

A0A6J1CP50 uncharacterized protein LOC111012888 isoform X21.8e-20851.32Show/hide
Query:  LTPPRGLLLHGYPGTSKTHVVRALVGSCARGNKRIAYFSRKGADGLGKYVGDSEKHLKQLFQVAEECQPSIIFFDEIYGLAASQTRQQDQTHTSVVSTLL
        +TPPRG+LLHGYPGT KTHVVRAL+GSCARG+KRIAYF+RKGAD LGKYVGD+E+ L+ LFQVAE CQPSIIFFDEI GLA  +TRQQDQTH SVVSTLL
Subjt:  LTPPRGLLLHGYPGTSKTHVVRALVGSCARGNKRIAYFSRKGADGLGKYVGDSEKHLKQLFQVAEECQPSIIFFDEIYGLAASQTRQQDQTHTSVVSTLL

Query:  ALLDGLKSR-------------------------------------------------------------------GFAGADLQALCTQTTIIALKRTFP
        ALLDGLKSR                                                                   GFAGADLQALCTQ  + ALKR FP
Subjt:  ALLDGLKSR-------------------------------------------------------------------GFAGADLQALCTQTTIIALKRTFP

Query:  LKQVLSASRDTCD-DHPTDLPVVVVEERDWLEALSSCPPPCSRREASLAANDVASSPLPSHLIPCLIRPLSTLLVSLYMDECICLPNSLSTAATIIKSVV
        LK+VLS S +    D+   LP ++VEERDWLEAL   PPPCSRREA +AANDVASSPLPSHLIP L++PLSTLLVSLY+DE I LP +L  AAT+IKSV+
Subjt:  LKQVLSASRDTCD-DHPTDLPVVVVEERDWLEALSSCPPPCSRREASLAANDVASSPLPSHLIPCLIRPLSTLLVSLYMDECICLPNSLSTAATIIKSVV

Query:  VSTLERKKIVSNKWWFHVDDFIQDNEIANEIERKWQGS---------------------EANIIEDSDKLIPEDNTTNAKQTSFDLGNNHGFRIMIAGNP
        +S L+ +KIV++ WW HV DF++D +IANEIE K QGS                     E +  E S       +TT  + TSFDLGN  GFRI+IAGNP
Subjt:  VSTLERKKIVSNKWWFHVDDFIQDNEIANEIERKWQGS---------------------EANIIEDSDKLIPEDNTTNAKQTSFDLGNNHGFRIMIAGNP

Query:  RSGQRHLASCIVNCFVEHAEIRKVDMATISQ-GYSNLGQGITYILANCLSRGSCLIFMSRVDLWAIESHDDQNLEHDTHSTEEQCIR-------------
        RSG RHLASC+++C+++H EIRKVD+ATISQ G+ +L QGI+ IL NC S GSC++FM R+DLWA+E  +    E D    E+Q                
Subjt:  RSGQRHLASCIVNCFVEHAEIRKVDMATISQ-GYSNLGQGITYILANCLSRGSCLIFMSRVDLWAIESHDDQNLEHDTHSTEEQCIR-------------

Query:  -------------------KATVQRASSAWTLFIDQ-----------ATSEVPFLLLPQEIRHFFRKDLSNCKKLVAQGHNIPRFSIQINEAFNHDLVIN
                           +  +  AS AW+ F++Q           ATSEVPFLLLPQEIR FFR DLS C+   ++ H +PRFS+QI+  FNHD+VIN
Subjt:  -------------------KATVQRASSAWTLFIDQ-----------ATSEVPFLLLPQEIRHFFRKDLSNCKKLVAQGHNIPRFSIQINEAFNHDLVIN

Query:  QSAAKLSKEMVKLLVLLIHQKNHVSCTSACNKKPPCDLIEEQARTSNYQKDKESI---SEDKELNDTNNT---------TSIEDTNLLSAISIFGYQILQ
        QSAA+LS+++ KLLV LIHQK+H   TSAC K     +I+ ++  +N Q DKE+    + +K+  D N           T    +NL+S IS FGYQILQ
Subjt:  QSAAKLSKEMVKLLVLLIHQKNHVSCTSACNKKPPCDLIEEQARTSNYQKDKESI---SEDKELNDTNNT---------TSIEDTNLLSAISIFGYQILQ

Query:  NPQFSVLCWVTSKFKEGPSANISKAQWEVWKFNSCVLHP--------TSSSDNVQTKENYGMVKGLVAVGLSATRGIYTSLQQVCSDVRAVLNVLVEKVN
         P F+ LCWVTSK KEGPS ++S   W+ W FNSC++ P        +SS  N ++KE  G+V+GLVAVGLSA RG YTSL++V  DVR VL++LVE++N
Subjt:  NPQFSVLCWVTSKFKEGPSANISKAQWEVWKFNSCVLHP--------TSSSDNVQTKENYGMVKGLVAVGLSATRGIYTSLQQVCSDVRAVLNVLVEKVN

Query:  AKVAAGKDRLQFIGILSQVASLEDTVHSWAYALQ
        AK++AGKDR Q++ +LSQVA LED V++WA+ LQ
Subjt:  AKVAAGKDRLQFIGILSQVASLEDTVHSWAYALQ

A0A6J1DS84 uncharacterized protein LOC1110238171.3e-26264.25Show/hide
Query:  LTPPRGLLLHGYPGTSKTHVVRALVGSCARGNKRIAYFSRKGADGLGKYVGDSEKHLKQLFQVAEECQPSIIFFDEIYGLAASQTRQQDQTHTSVVSTLL
        + PPRG+LLHGYPGT KTHVVRALVGSCARGNK IAYFSRKGAD LGKYVGDSE+ L++LFQVA++CQPSIIFFDEI GLA  +TR+QDQTHTSVVSTLL
Subjt:  LTPPRGLLLHGYPGTSKTHVVRALVGSCARGNKRIAYFSRKGADGLGKYVGDSEKHLKQLFQVAEECQPSIIFFDEIYGLAASQTRQQDQTHTSVVSTLL

Query:  ALLDGLKSR---------------------------------------------------------------------GFAGADLQALCTQTTIIALKRT
        ALLDGLKSR                                                                     GFAGADLQALCTQT IIALKRT
Subjt:  ALLDGLKSR---------------------------------------------------------------------GFAGADLQALCTQTTIIALKRT

Query:  FPLKQVLSASRDTCDDHPTDLPVVVVEERDWLEALSSCPPPCSRREASLAANDVASSPLPSHLIPCLIRPLSTLLVSLYMDECICLPNSLSTAATIIKSV
        FPLKQVL AS+DT    PT LP V+VEERDWLEALSSCPPPCSRREA++AANDV SSPLPSHLIPCLIRPLS LLVSLY+++CICLP +LS AA II++V
Subjt:  FPLKQVLSASRDTCDDHPTDLPVVVVEERDWLEALSSCPPPCSRREASLAANDVASSPLPSHLIPCLIRPLSTLLVSLYMDECICLPNSLSTAATIIKSV

Query:  VVSTLERKKIVSNKWWFHVDDFIQDNEIANEIERKWQGSEANIIE-----DSDKLIPEDNTTNAKQTSFDLGNNHGFRIMIAGNPRSGQRHLASCIVNCF
        ++S LERK   SNKWW HVDDFIQ+++IA+EIERKWQ    NI+E     +S + IPED   NAK TSFDLGN+ GFRIMIAGNP  GQRHLASCI+NCF
Subjt:  VVSTLERKKIVSNKWWFHVDDFIQDNEIANEIERKWQGSEANIIE-----DSDKLIPEDNTTNAKQTSFDLGNNHGFRIMIAGNPRSGQRHLASCIVNCF

Query:  VEHAEIRKVDMATISQGYSNLGQGITYILANCLSRGSCLIFMSRVDLWAIESHDDQNLEHD-THSTEEQCIRKATVQRASSAWTLFIDQ-----------
        V H EIRKVDMAT SQGY+NL QGI++ LANCLSRG CLIFM R+DLWA+E+H+DQ LEHD TH T+EQCIRKA V RASSAWTLFI Q           
Subjt:  VEHAEIRKVDMATISQGYSNLGQGITYILANCLSRGSCLIFMSRVDLWAIESHDDQNLEHD-THSTEEQCIRKATVQRASSAWTLFIDQ-----------

Query:  --ATSEVPFLLLPQEIRHFFRKDLSNCKKLVAQGHNIPRFSIQINEAFNHDLVINQSAAKLSKEMVKLLVLLIHQKNHVSCTSACNKKPPCDLIEEQART
          ATSEVPFLLLPQEIR FF  DL++CK+LVA  H IPRFSIQI+E FNHDLVI++SAA LSK+MVKLL LLIHQK HV+           +   E  R+
Subjt:  --ATSEVPFLLLPQEIRHFFRKDLSNCKKLVAQGHNIPRFSIQINEAFNHDLVINQSAAKLSKEMVKLLVLLIHQKNHVSCTSACNKKPPCDLIEEQART

Query:  SNYQKDKESISEDKELNDTNNTTSIEDTNLLSAISIFGYQILQNPQFSVLCWVTSKFKEGPSANISKAQWEVWKFNSCVLHPTSSSDNVQTKENYGMVKG
        SNYQKDKE                         IS FG QILQN  +S LC  TSK KEGPSANISK+ WE W+FN CVLH TSS D V+T+ENYGMVKG
Subjt:  SNYQKDKESISEDKELNDTNNTTSIEDTNLLSAISIFGYQILQNPQFSVLCWVTSKFKEGPSANISKAQWEVWKFNSCVLHPTSSSDNVQTKENYGMVKG

Query:  LVAVGLSATRGIYTSLQQVCSDVRAVLNVLVEKVNAKVAAGKDRLQFIGILSQVASLEDTVHSWAYALQMYY
        LV++GL ATRG+YTSLQQVCSDVR VL++ VEK+NAKV AGK+ LQ+  ILSQVASLEDTV++WAY LQ YY
Subjt:  LVAVGLSATRGIYTSLQQVCSDVRAVLNVLVEKVNAKVAAGKDRLQFIGILSQVASLEDTVHSWAYALQMYY

A0A6J1E3Z1 uncharacterized protein LOC1114306051.4e-20550.3Show/hide
Query:  LTPPRGLLLHGYPGTSKTHVVRALVGSCARGNKRIAYFSRKGADGLGKYVGDSEKHLKQLFQVAEECQPSIIFFDEIYGLAASQTRQQDQTHTSVVSTLL
        +TPPRG+LLHGYPGT KTHVVRAL+GSCARG+KRIAYF+RKGAD LGKYVGD+E+ L+ LFQVAE+CQPSIIFFDEI GLA  +TRQQDQTH SVVSTLL
Subjt:  LTPPRGLLLHGYPGTSKTHVVRALVGSCARGNKRIAYFSRKGADGLGKYVGDSEKHLKQLFQVAEECQPSIIFFDEIYGLAASQTRQQDQTHTSVVSTLL

Query:  ALLDGLKSR-------------------------------------------------------------------GFAGADLQALCTQTTIIALKRTFP
        ALLDGLKSR                                                                   GFAGADLQALCTQ  + ALKR FP
Subjt:  ALLDGLKSR-------------------------------------------------------------------GFAGADLQALCTQTTIIALKRTFP

Query:  LKQVLSASRDTCD-DHPTDLPVVVVEERDWLEALSSCPPPCSRREASLAANDVASSPLPSHLIPCLIRPLSTLLVSLYMDECICLPNSLSTAATIIKSVV
        LK+VLSAS +    D+   LP ++VEERDWLEAL   PPPCSRREA +AANDVASSPLPSHLIPCL++PLSTLLVSLY+DE I LP +L  AAT+IKSV+
Subjt:  LKQVLSASRDTCD-DHPTDLPVVVVEERDWLEALSSCPPPCSRREASLAANDVASSPLPSHLIPCLIRPLSTLLVSLYMDECICLPNSLSTAATIIKSVV

Query:  VSTLERKKIVSNKWWFHVDDFIQDNEIANEIERKWQGS----EANIIEDSDKLIPEDNTTNAK---------------QTSFDLGNNHGFRIMIAGNPRS
        V+ L+ +KIV++ WW +V DF+QD ++ANEIE K QGS    E +    S  L  E    ++K               ++ F L N  GFRI+IAGNPRS
Subjt:  VSTLERKKIVSNKWWFHVDDFIQDNEIANEIERKWQGS----EANIIEDSDKLIPEDNTTNAK---------------QTSFDLGNNHGFRIMIAGNPRS

Query:  GQRHLASCIVNCFVEHAEIRKVDMATISQ-GYSNLGQGITYILANCLSRGSCLIFMSRVDLWAIESHDDQNLE------------------------HDT
        G RHLASC+++C+++H EIRKVD+ATISQ G+ +L QGI+ IL NC S GSCL+FM R+DLWAIES +  + E                         + 
Subjt:  GQRHLASCIVNCFVEHAEIRKVDMATISQ-GYSNLGQGITYILANCLSRGSCLIFMSRVDLWAIESHDDQNLE------------------------HDT

Query:  HSTEEQC-------IRKATVQRASSAWTLFIDQ-----------ATSEVPFLLLPQEIRHFFRKDLSNCKKLVAQGHNIPRFSIQINEAFNHDLVINQSA
        HS  +Q        ++  ++  AS AW+ F++Q           ATSEVPFLLLPQEIR FFR DLS C+    + H++PRFS+QI+  F+HD+VINQSA
Subjt:  HSTEEQC-------IRKATVQRASSAWTLFIDQ-----------ATSEVPFLLLPQEIRHFFRKDLSNCKKLVAQGHNIPRFSIQINEAFNHDLVINQSA

Query:  AKLSKEMVKLLVLLIHQKNHVSCTSACNKKPPCDLIEEQARTSNYQKDKESISE---DKELNDTNNT---------TSIEDTNLLSAISIFGYQILQNPQ
         +LS++++KLLV LIHQK H   TS C K     + ++++  ++ + DKE+ SE   +K+  D ++          T    +NL S IS FGYQIL+ P 
Subjt:  AKLSKEMVKLLVLLIHQKNHVSCTSACNKKPPCDLIEEQARTSNYQKDKESISE---DKELNDTNNT---------TSIEDTNLLSAISIFGYQILQNPQ

Query:  FSVLCWVTSKFKEGPSANISKAQWEVWKFNSCVLHP--------TSSSDNVQTKENYGMVKGLVAVGLSATRGIYTSLQQVCSDVRAVLNVLVEKVNAKV
        F+ LCWVTSK KEGPSA++S   W+ W FNSC++ P        +SS+ N ++KE  GMV+GL+AVGLSA RG YTSL++V  DVR VL++LVE++NAK+
Subjt:  FSVLCWVTSKFKEGPSANISKAQWEVWKFNSCVLHP--------TSSSDNVQTKENYGMVKGLVAVGLSATRGIYTSLQQVCSDVRAVLNVLVEKVNAKV

Query:  AAGKDRLQFIGILSQVASLEDTVHSWAYALQMYYPLLINSHTSQN
         +GKDR Q+  +LSQVA LED V+SW + L        N  TS N
Subjt:  AAGKDRLQFIGILSQVASLEDTVHSWAYALQMYYPLLINSHTSQN

SwissProt top hitse value%identityAlignment
F4IAE9 ATPase family AAA domain-containing protein At1g059109.0e-3234.81Show/hide
Query:  SLTPPRGLLLHGYPGTSKTHVVRALVGSCARGNKRIAYFSRKGADGLGKYVGDSEKHLKQLFQVAEECQPSIIFFDEIYGLAASQTRQQDQTHTSVVSTL
        S+TPPRG+LL G PGT KT + RAL  + ++  ++++++ RKGAD L K+VG++E+ LK LF+ A+  QPSIIFFDEI GLA  ++ +Q+Q H S+VSTL
Subjt:  SLTPPRGLLLHGYPGTSKTHVVRALVGSCARGNKRIAYFSRKGADGLGKYVGDSEKHLKQLFQVAEECQPSIIFFDEIYGLAASQTRQQDQTHTSVVSTL

Query:  LALLDGLKSR-------------------------------------------------------------------GFAGADLQALCTQTTIIALKRTF
        LAL+DGL SR                                                                   G+ GADL+ALCT+  I A +  +
Subjt:  LALLDGLKSR-------------------------------------------------------------------GFAGADLQALCTQTTIIALKRTF

Query:  PLKQVLSASRDTCDDHPTDLPVVVVEERDWLEALSSCPPPCSRREASLAANDVASSPLPSHLIPCLIRPL
        P  QV ++     D +  D+ +V VE+  ++EA+S+  P   R       + V S PL   ++PCL R L
Subjt:  PLKQVLSASRDTCDDHPTDLPVVVVEERDWLEALSSCPPPCSRREASLAANDVASSPLPSHLIPCLIRPL

P40340 Tat-binding homolog 77.3e-3426.85Show/hide
Query:  SLTPPRGLLLHGYPGTSKTHVVRALVGSCARGNKRIAYFSRKGADGLGKYVGDSEKHLKQLFQVAEECQPSIIFFDEIYGLAASQTRQQDQTHTSVVSTL
        ++TPPRG+L HG PGT KT + RAL  SC+   ++I +F RKGAD L K+VG++E+ L+ LF+ A++ QPSIIFFDEI GLA  ++ +Q+Q H S+VSTL
Subjt:  SLTPPRGLLLHGYPGTSKTHVVRALVGSCARGNKRIAYFSRKGADGLGKYVGDSEKHLKQLFQVAEECQPSIIFFDEIYGLAASQTRQQDQTHTSVVSTL

Query:  LALLDGLKSR-------------------------------------------------------------------GFAGADLQALCTQTTIIALKRTF
        LAL+DG+ +R                                                                   G+ GADL++LCT+  +I+++R+F
Subjt:  LALLDGLKSR-------------------------------------------------------------------GFAGADLQALCTQTTIIALKRTF

Query:  PLKQVLSASRDTCDDHPTDLPVVVVEERDWLEALSSCPPPCSRREASLAANDVASSPLPSHLIPCLIRPLSTLLVSLYMDECICLPNSLSTAATIIKSVV
        P  Q+  ++    D    D   + V+  D++ AL    P         +A    SSP P   +P LI+P       L  D+     N+L      + ++ 
Subjt:  PLKQVLSASRDTCDDHPTDLPVVVVEERDWLEALSSCPPPCSRREASLAANDVASSPLPSHLIPCLIRPLSTLLVSLYMDECICLPNSLSTAATIIKSVV

Query:  VSTLERKKIVSNKWWFHVDDFIQDNEIANEIERKWQGSEANIIEDSDKLIPEDNTTNAKQTSFDLGNNHGF----RIMIAGNPRSGQRHLASCIVNCFVE
         +T +R   +       + +FI   E + E             E+ DK    ++T++ +   F            R++I G   +GQ+++ + I+N ++E
Subjt:  VSTLERKKIVSNKWWFHVDDFIQDNEIANEIERKWQGSEANIIEDSDKLIPEDNTTNAKQTSFDLGNNHGF----RIMIAGNPRSGQRHLASCIVNCFVE

Query:  HAEIRKVDMAT-ISQGYSNLGQGITYILANCLSRGSCLIFMSRVDLW
           ++ +D+A+ +S+    +   +         R   ++F+  +D+W
Subjt:  HAEIRKVDMAT-ISQGYSNLGQGITYILANCLSRGSCLIFMSRVDLW

Q5RDX4 ATPase family AAA domain-containing protein 21.3e-3060Show/hide
Query:  LTPPRGLLLHGYPGTSKTHVVRALVGSCARGNKRIAYFSRKGADGLGKYVGDSEKHLKQLFQVAEECQPSIIFFDEIYGLAASQTRQQDQTHTSVVSTLL
        + PPRG L +G PGT KT V RAL   C++G+KR+A+F RKGAD L K+VG+SE+ L+ LF  A + +PSIIFFDEI GLA  ++ +QDQ H+S+VSTLL
Subjt:  LTPPRGLLLHGYPGTSKTHVVRALVGSCARGNKRIAYFSRKGADGLGKYVGDSEKHLKQLFQVAEECQPSIIFFDEIYGLAASQTRQQDQTHTSVVSTLL

Query:  ALLDGLKSRG
        AL+DGL SRG
Subjt:  ALLDGLKSRG

Q6PL18 ATPase family AAA domain-containing protein 21.3e-3060Show/hide
Query:  LTPPRGLLLHGYPGTSKTHVVRALVGSCARGNKRIAYFSRKGADGLGKYVGDSEKHLKQLFQVAEECQPSIIFFDEIYGLAASQTRQQDQTHTSVVSTLL
        + PPRG L +G PGT KT V RAL   C++G+KR+A+F RKGAD L K+VG+SE+ L+ LF  A + +PSIIFFDEI GLA  ++ +QDQ H+S+VSTLL
Subjt:  LTPPRGLLLHGYPGTSKTHVVRALVGSCARGNKRIAYFSRKGADGLGKYVGDSEKHLKQLFQVAEECQPSIIFFDEIYGLAASQTRQQDQTHTSVVSTLL

Query:  ALLDGLKSRG
        AL+DGL SRG
Subjt:  ALLDGLKSRG

Q8CDM1 ATPase family AAA domain-containing protein 29.9e-3160Show/hide
Query:  LTPPRGLLLHGYPGTSKTHVVRALVGSCARGNKRIAYFSRKGADGLGKYVGDSEKHLKQLFQVAEECQPSIIFFDEIYGLAASQTRQQDQTHTSVVSTLL
        + PPRG L +G PGT KT V RAL   C+RG+KR+A+F RKGAD L K+VG+SE+ L+ LF  A + +P+IIFFDEI GLA  ++ +QDQ H+S+VSTLL
Subjt:  LTPPRGLLLHGYPGTSKTHVVRALVGSCARGNKRIAYFSRKGADGLGKYVGDSEKHLKQLFQVAEECQPSIIFFDEIYGLAASQTRQQDQTHTSVVSTLL

Query:  ALLDGLKSRG
        AL+DGL SRG
Subjt:  ALLDGLKSRG

Arabidopsis top hitse value%identityAlignment
AT1G05910.1 cell division cycle protein 48-related / CDC48-related6.4e-3334.81Show/hide
Query:  SLTPPRGLLLHGYPGTSKTHVVRALVGSCARGNKRIAYFSRKGADGLGKYVGDSEKHLKQLFQVAEECQPSIIFFDEIYGLAASQTRQQDQTHTSVVSTL
        S+TPPRG+LL G PGT KT + RAL  + ++  ++++++ RKGAD L K+VG++E+ LK LF+ A+  QPSIIFFDEI GLA  ++ +Q+Q H S+VSTL
Subjt:  SLTPPRGLLLHGYPGTSKTHVVRALVGSCARGNKRIAYFSRKGADGLGKYVGDSEKHLKQLFQVAEECQPSIIFFDEIYGLAASQTRQQDQTHTSVVSTL

Query:  LALLDGLKSR-------------------------------------------------------------------GFAGADLQALCTQTTIIALKRTF
        LAL+DGL SR                                                                   G+ GADL+ALCT+  I A +  +
Subjt:  LALLDGLKSR-------------------------------------------------------------------GFAGADLQALCTQTTIIALKRTF

Query:  PLKQVLSASRDTCDDHPTDLPVVVVEERDWLEALSSCPPPCSRREASLAANDVASSPLPSHLIPCLIRPL
        P  QV ++     D +  D+ +V VE+  ++EA+S+  P   R       + V S PL   ++PCL R L
Subjt:  PLKQVLSASRDTCDDHPTDLPVVVVEERDWLEALSSCPPPCSRREASLAANDVASSPLPSHLIPCLIRPL

AT3G09840.1 cell division cycle 482.7e-1539.25Show/hide
Query:  PPRGLLLHGYPGTSKTHVVRALVGSCARGNKRIAYFSRKGADGLGKYVGDSEKHLKQLFQVAEECQPSIIFFDEIYGLAASQTRQQDQTHTSVVSTLLAL
        PP+G+LL+G PG+ KT + RA+            +F   G + + K  G+SE +L++ F+ AE+  PSIIF DEI  +A  + +   +    +VS LL L
Subjt:  PPRGLLLHGYPGTSKTHVVRALVGSCARGNKRIAYFSRKGADGLGKYVGDSEKHLKQLFQVAEECQPSIIFFDEIYGLAASQTRQQDQTHTSVVSTLLAL

Query:  LDGLKSR
        +DGLKSR
Subjt:  LDGLKSR

AT3G15120.1 P-loop containing nucleoside triphosphate hydrolases superfamily protein6.3e-16642.3Show/hide
Query:  LTPPRGLLLHGYPGTSKTHVVRALVGSCARGNKRIAYFSRKGADGLGKYVGDSEKHLKQLFQVAEECQPSIIFFDEIYGLAASQTRQQDQTHTSVVSTLL
        LTPPRG+LLHG+PGT KT VVRAL+GS ARGN+RIAYF+RKGAD LGKYVGD+E+ L+ LFQVAE+CQPSIIFFDEI GLA  ++RQQDQTH+SVVSTLL
Subjt:  LTPPRGLLLHGYPGTSKTHVVRALVGSCARGNKRIAYFSRKGADGLGKYVGDSEKHLKQLFQVAEECQPSIIFFDEIYGLAASQTRQQDQTHTSVVSTLL

Query:  ALLDGLKSR-------------------------------------------------------------------GFAGADLQALCTQTTIIALKRTFP
        ALLDGLKSR                                                                   GFAGAD+QALCTQ  +IAL R+FP
Subjt:  ALLDGLKSR-------------------------------------------------------------------GFAGADLQALCTQTTIIALKRTFP

Query:  LKQVLSASR-DTCDDHPTDLPVVVVEERDWLEALSSCPPPCSRREASLAANDVASSPLPSHLIPCLIRPLSTLLVSLYMDECICLPNSLSTAATIIKSVV
        L++ L+A+       +   LP   VEERDWLEALS  PPPCSRR A +AA+D+ SSPLP++L+P L+ PL +LLV+L++DE I LP  LS AA  +++V+
Subjt:  LKQVLSASR-DTCDDHPTDLPVVVVEERDWLEALSSCPPPCSRREASLAANDVASSPLPSHLIPCLIRPLSTLLVSLYMDECICLPNSLSTAATIIKSVV

Query:  VSTLERKKIVSNKWWFHVDDFIQDNEIANEIERKWQ------------GSEANI-------IEDSDKLIPE--DNTTNAKQTSFDLGNNHGFRIMIAGNP
         S L  KKI    WW HVD  + + ++  +I ++              GS A+I       +  +  ++P    +       S +  +  GF+++IAG P
Subjt:  VSTLERKKIVSNKWWFHVDDFIQDNEIANEIERKWQ------------GSEANI-------IEDSDKLIPE--DNTTNAKQTSFDLGNNHGFRIMIAGNP

Query:  RSGQRHLASCIVNCFVEHAEIRKVDMATISQ-GYSNLGQGITYILANCLSRGSCLIFMSRVDLWAIESHD--DQNLEHDTHSTEEQCIRKATVQ------
        +SGQRHLASC+++CF+ +AE+ K+D ATISQ G  +L  G+T++L  C S+ SC++FM RVDLWA+++    ++ +E D  S +E C      +      
Subjt:  RSGQRHLASCIVNCFVEHAEIRKVDMATISQ-GYSNLGQGITYILANCLSRGSCLIFMSRVDLWAIESHD--DQNLEHDTHSTEEQCIRKATVQ------

Query:  RASSAWTLFIDQ-------------ATSEVPFLLLPQEIRHFFRKDLS-NCKKLVAQGHNIPRFSIQINEAFNHDLVINQSAAKLSKEMVKLLVLLIHQK
        R S AW  F +Q             ATS +P+ LLP +I+ FF+ DLS  C+  +++   +P+F++Q+ E+ + D+ I+ SA +L +  +++ + L+HQ 
Subjt:  RASSAWTLFIDQ-------------ATSEVPFLLLPQEIRHFFRKDLS-NCKKLVAQGHNIPRFSIQINEAFNHDLVINQSAAKLSKEMVKLLVLLIHQK

Query:  NHVSCTSACNKKPPCDLIEEQAR------TSNYQKDKESISEDKELNDTN--------NTTSIEDTNLLSAISIFGYQILQNPQFSVLCWVTSKFKEGPS
        +H  C     KK   + +++  R       ++++  +E++ + K L+D +        N      ++L  A+S FGYQILQ PQF+ LCWVTSK KEGPS
Subjt:  NHVSCTSACNKKPPCDLIEEQAR------TSNYQKDKESISEDKELNDTN--------NTTSIEDTNLLSAISIFGYQILQNPQFSVLCWVTSKFKEGPS

Query:  ANISKAQWEVWKFNSCVLHP---------TSSSDNVQTKENYGMVKGLVAVGLSATRGIYTSLQQVCSDVRAVLNVLVEKVNAKVAAGKDRLQFIGILSQ
        A++S   W  W FNSC+  P         +S S+NV+ K++ G+V+GL AVGLSA RG Y SL++V  +VR VL +LV +++ K+ AGKDR ++I ILSQ
Subjt:  ANISKAQWEVWKFNSCVLHP---------TSSSDNVQTKENYGMVKGLVAVGLSATRGIYTSLQQVCSDVRAVLNVLVEKVNAKVAAGKDRLQFIGILSQ

Query:  VASLEDTVHSWAYALQMY
        VA LED V+SW YA++ +
Subjt:  VASLEDTVHSWAYALQMY

AT3G53230.1 ATPase, AAA-type, CDC48 protein2.7e-1539.25Show/hide
Query:  PPRGLLLHGYPGTSKTHVVRALVGSCARGNKRIAYFSRKGADGLGKYVGDSEKHLKQLFQVAEECQPSIIFFDEIYGLAASQTRQQDQTHTSVVSTLLAL
        PP+G+LL+G PG+ KT + RA+            +F   G + + K  G+SE +L++ F+ AE+  PSIIF DEI  +A  + +   +    +VS LL L
Subjt:  PPRGLLLHGYPGTSKTHVVRALVGSCARGNKRIAYFSRKGADGLGKYVGDSEKHLKQLFQVAEECQPSIIFFDEIYGLAASQTRQQDQTHTSVVSTLLAL

Query:  LDGLKSR
        +DGLKSR
Subjt:  LDGLKSR

AT5G03340.1 ATPase, AAA-type, CDC48 protein2.7e-1539.25Show/hide
Query:  PPRGLLLHGYPGTSKTHVVRALVGSCARGNKRIAYFSRKGADGLGKYVGDSEKHLKQLFQVAEECQPSIIFFDEIYGLAASQTRQQDQTHTSVVSTLLAL
        PP+G+LL+G PG+ KT + RA+            +F   G + + K  G+SE +L++ F+ AE+  PSIIF DEI  +A  + +   +    +VS LL L
Subjt:  PPRGLLLHGYPGTSKTHVVRALVGSCARGNKRIAYFSRKGADGLGKYVGDSEKHLKQLFQVAEECQPSIIFFDEIYGLAASQTRQQDQTHTSVVSTLLAL

Query:  LDGLKSR
        +DGLKSR
Subjt:  LDGLKSR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTGAAATCTCTTACACCACCCAGAGGTCTTCTCCTCCATGGATATCCTGGAACAAGTAAAACACATGTCGTGCGCGCCTTGGTTGGTTCTTGTGCTCGTGGCAACAA
AAGGATAGCGTATTTTTCGCGTAAAGGAGCTGATGGCTTAGGAAAGTATGTTGGCGACTCAGAGAAGCATCTAAAGCAGCTATTTCAAGTTGCAGAGGAATGTCAACCTT
CAATCATATTCTTTGATGAGATATATGGACTGGCAGCTTCTCAAACGAGGCAACAAGATCAAACACATACTTCTGTCGTTTCCACGTTACTTGCTTTGCTAGATGGTTTG
AAATCTCGAGGCTTTGCTGGTGCTGATCTTCAGGCTCTATGTACTCAAACAACCATCATTGCTCTCAAAAGGACTTTTCCATTAAAACAAGTCTTATCTGCTTCTAGAGA
TACTTGTGATGATCATCCAACTGATCTTCCTGTTGTGGTAGTAGAGGAGAGAGATTGGTTGGAAGCTCTATCAAGTTGTCCACCTCCATGCTCTCGTAGAGAAGCATCGC
TAGCAGCTAATGATGTGGCATCCTCTCCTCTTCCTTCTCACCTTATCCCATGTCTCATTCGACCACTTTCCACCCTACTTGTTTCACTTTATATGGACGAATGCATATGC
CTACCTAATTCTCTTTCCACAGCTGCAACCATAATTAAGAGTGTGGTGGTTTCTACCTTGGAAAGAAAGAAAATAGTCAGTAATAAGTGGTGGTTCCATGTCGATGACTT
TATTCAGGACAATGAAATCGCAAATGAAATTGAAAGAAAATGGCAAGGTTCTGAAGCAAATATAATTGAAGATTCAGATAAACTTATTCCAGAAGACAATACTACAAATG
CAAAGCAAACATCATTTGACTTGGGAAATAATCATGGGTTTCGAATTATGATCGCAGGAAATCCAAGATCTGGGCAGAGACATCTTGCTTCATGCATTGTTAACTGCTTT
GTTGAACATGCTGAAATAAGAAAAGTTGATATGGCTACAATTTCTCAAGGATATAGTAATTTGGGACAAGGCATAACATATATATTAGCTAATTGTTTAAGTAGAGGATC
GTGCTTGATATTCATGTCGAGAGTTGATTTATGGGCTATTGAGTCACATGATGATCAAAATTTGGAGCATGATACACATTCAACTGAAGAACAATGTATTAGAAAGGCTA
CAGTTCAAAGAGCTTCATCTGCTTGGACTTTATTTATTGACCAAGCTACATCCGAAGTTCCATTTTTACTACTTCCTCAAGAAATAAGGCATTTCTTCCGAAAGGATTTA
TCAAATTGTAAAAAGTTGGTTGCACAAGGGCACAATATACCTCGCTTCTCCATCCAGATAAATGAGGCTTTCAACCATGACCTCGTGATCAATCAATCTGCAGCAAAACT
ATCCAAAGAAATGGTGAAATTACTTGTTCTATTGATTCACCAAAAAAATCACGTCTCCTGTACAAGTGCTTGCAACAAGAAGCCACCTTGTGATTTGATTGAAGAGCAGG
CCCGTACATCTAATTACCAAAAGGACAAAGAATCAATCAGTGAAGACAAAGAACTAAATGACACAAACAATACAACTTCCATAGAAGATACAAATTTGCTTTCAGCAATA
TCTATCTTTGGATATCAAATTCTACAAAATCCTCAATTTTCTGTTCTATGTTGGGTGACATCAAAGTTCAAAGAAGGCCCCTCAGCAAACATAAGTAAAGCTCAATGGGA
AGTTTGGAAATTCAACTCTTGTGTTCTTCATCCAACTAGCTCTTCGGATAATGTTCAAACTAAAGAAAATTATGGTATGGTAAAAGGCTTAGTTGCTGTTGGTTTATCAG
CAACCAGAGGTATTTATACATCACTGCAGCAAGTGTGTTCTGATGTTCGGGCGGTTCTTAATGTCTTAGTTGAAAAGGTCAATGCAAAAGTAGCTGCAGGCAAAGACAGA
CTCCAATTTATAGGCATTTTATCCCAAGTGGCATCACTAGAAGATACTGTTCATAGTTGGGCTTATGCACTACAAATGTACTATCCTCTCCTAATAAATTCACATACATC
ACAAAATGCATACCTCTGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCTGAAATCTCTTACACCACCCAGAGGTCTTCTCCTCCATGGATATCCTGGAACAAGTAAAACACATGTCGTGCGCGCCTTGGTTGGTTCTTGTGCTCGTGGCAACAA
AAGGATAGCGTATTTTTCGCGTAAAGGAGCTGATGGCTTAGGAAAGTATGTTGGCGACTCAGAGAAGCATCTAAAGCAGCTATTTCAAGTTGCAGAGGAATGTCAACCTT
CAATCATATTCTTTGATGAGATATATGGACTGGCAGCTTCTCAAACGAGGCAACAAGATCAAACACATACTTCTGTCGTTTCCACGTTACTTGCTTTGCTAGATGGTTTG
AAATCTCGAGGCTTTGCTGGTGCTGATCTTCAGGCTCTATGTACTCAAACAACCATCATTGCTCTCAAAAGGACTTTTCCATTAAAACAAGTCTTATCTGCTTCTAGAGA
TACTTGTGATGATCATCCAACTGATCTTCCTGTTGTGGTAGTAGAGGAGAGAGATTGGTTGGAAGCTCTATCAAGTTGTCCACCTCCATGCTCTCGTAGAGAAGCATCGC
TAGCAGCTAATGATGTGGCATCCTCTCCTCTTCCTTCTCACCTTATCCCATGTCTCATTCGACCACTTTCCACCCTACTTGTTTCACTTTATATGGACGAATGCATATGC
CTACCTAATTCTCTTTCCACAGCTGCAACCATAATTAAGAGTGTGGTGGTTTCTACCTTGGAAAGAAAGAAAATAGTCAGTAATAAGTGGTGGTTCCATGTCGATGACTT
TATTCAGGACAATGAAATCGCAAATGAAATTGAAAGAAAATGGCAAGGTTCTGAAGCAAATATAATTGAAGATTCAGATAAACTTATTCCAGAAGACAATACTACAAATG
CAAAGCAAACATCATTTGACTTGGGAAATAATCATGGGTTTCGAATTATGATCGCAGGAAATCCAAGATCTGGGCAGAGACATCTTGCTTCATGCATTGTTAACTGCTTT
GTTGAACATGCTGAAATAAGAAAAGTTGATATGGCTACAATTTCTCAAGGATATAGTAATTTGGGACAAGGCATAACATATATATTAGCTAATTGTTTAAGTAGAGGATC
GTGCTTGATATTCATGTCGAGAGTTGATTTATGGGCTATTGAGTCACATGATGATCAAAATTTGGAGCATGATACACATTCAACTGAAGAACAATGTATTAGAAAGGCTA
CAGTTCAAAGAGCTTCATCTGCTTGGACTTTATTTATTGACCAAGCTACATCCGAAGTTCCATTTTTACTACTTCCTCAAGAAATAAGGCATTTCTTCCGAAAGGATTTA
TCAAATTGTAAAAAGTTGGTTGCACAAGGGCACAATATACCTCGCTTCTCCATCCAGATAAATGAGGCTTTCAACCATGACCTCGTGATCAATCAATCTGCAGCAAAACT
ATCCAAAGAAATGGTGAAATTACTTGTTCTATTGATTCACCAAAAAAATCACGTCTCCTGTACAAGTGCTTGCAACAAGAAGCCACCTTGTGATTTGATTGAAGAGCAGG
CCCGTACATCTAATTACCAAAAGGACAAAGAATCAATCAGTGAAGACAAAGAACTAAATGACACAAACAATACAACTTCCATAGAAGATACAAATTTGCTTTCAGCAATA
TCTATCTTTGGATATCAAATTCTACAAAATCCTCAATTTTCTGTTCTATGTTGGGTGACATCAAAGTTCAAAGAAGGCCCCTCAGCAAACATAAGTAAAGCTCAATGGGA
AGTTTGGAAATTCAACTCTTGTGTTCTTCATCCAACTAGCTCTTCGGATAATGTTCAAACTAAAGAAAATTATGGTATGGTAAAAGGCTTAGTTGCTGTTGGTTTATCAG
CAACCAGAGGTATTTATACATCACTGCAGCAAGTGTGTTCTGATGTTCGGGCGGTTCTTAATGTCTTAGTTGAAAAGGTCAATGCAAAAGTAGCTGCAGGCAAAGACAGA
CTCCAATTTATAGGCATTTTATCCCAAGTGGCATCACTAGAAGATACTGTTCATAGTTGGGCTTATGCACTACAAATGTACTATCCTCTCCTAATAAATTCACATACATC
ACAAAATGCATACCTCTGTTAA
Protein sequenceShow/hide protein sequence
MLKSLTPPRGLLLHGYPGTSKTHVVRALVGSCARGNKRIAYFSRKGADGLGKYVGDSEKHLKQLFQVAEECQPSIIFFDEIYGLAASQTRQQDQTHTSVVSTLLALLDGL
KSRGFAGADLQALCTQTTIIALKRTFPLKQVLSASRDTCDDHPTDLPVVVVEERDWLEALSSCPPPCSRREASLAANDVASSPLPSHLIPCLIRPLSTLLVSLYMDECIC
LPNSLSTAATIIKSVVVSTLERKKIVSNKWWFHVDDFIQDNEIANEIERKWQGSEANIIEDSDKLIPEDNTTNAKQTSFDLGNNHGFRIMIAGNPRSGQRHLASCIVNCF
VEHAEIRKVDMATISQGYSNLGQGITYILANCLSRGSCLIFMSRVDLWAIESHDDQNLEHDTHSTEEQCIRKATVQRASSAWTLFIDQATSEVPFLLLPQEIRHFFRKDL
SNCKKLVAQGHNIPRFSIQINEAFNHDLVINQSAAKLSKEMVKLLVLLIHQKNHVSCTSACNKKPPCDLIEEQARTSNYQKDKESISEDKELNDTNNTTSIEDTNLLSAI
SIFGYQILQNPQFSVLCWVTSKFKEGPSANISKAQWEVWKFNSCVLHPTSSSDNVQTKENYGMVKGLVAVGLSATRGIYTSLQQVCSDVRAVLNVLVEKVNAKVAAGKDR
LQFIGILSQVASLEDTVHSWAYALQMYYPLLINSHTSQNAYLC