; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg28031 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg28031
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Descriptionsulfoquinovosidase-like isoform X1
Genome locationCarg_Chr03:1848333..1851705
RNA-Seq ExpressionCarg28031
SyntenyCarg28031
Gene Ontology termsGO:0008152 - metabolic process (biological process)
GO:0016798 - hydrolase activity, acting on glycosyl bonds (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6603308.1 hypothetical protein SDJN03_03917, partial [Cucurbita argyrosperma subsp. sororia]8.5e-23069.87Show/hide
Query:  MKNLKITKKHHIHLNNPFPSPPTSSPLVQGDLSANFQALPPYRVFSIGQDFQLLWRSENGGSLSIHHLSQPTRSIWSTIPGRAFVSAAMVETEVEESRGS
        MKNLKITKKHHIHLNNPFPSPPTSSPLVQGDLSANFQALPPYRVFSIGQDFQLLWRSENGGSLSIHHLSQPTRSIWSTIPGRAFVSAAMVETEVEESRGS
Subjt:  MKNLKITKKHHIHLNNPFPSPPTSSPLVQGDLSANFQALPPYRVFSIGQDFQLLWRSENGGSLSIHHLSQPTRSIWSTIPGRAFVSAAMVETEVEESRGS

Query:  FAVKDGAIRLVCNHQTIDDIKEINGCDYELEVRDHHFPSGYLDLDQKSHKKDAQFPIMLL---ITWQNIQYQEEETTTVFKKLVSNVPPASARIWVLFDR
        FAVKDGAIRLVCNHQTIDDIKEINGCDYELEVRDHHFPSGYLDLDQKSHKKDAQFP++L+   I     +  +++     ++   NVPPASAR WVLF++
Subjt:  FAVKDGAIRLVCNHQTIDDIKEINGCDYELEVRDHHFPSGYLDLDQKSHKKDAQFPIMLL---ITWQNIQYQEEETTTVFKKLVSNVPPASARIWVLFDR

Query:  KNSSQIGFQVMLGQPSYEYHSRGRLT-------GLRKGKFEWHWSLAKLKGCVRVSSSRRGKRRLESTRGYFEAFN-------TEKEEILWVLESSSLTW
        KNSSQIGFQVMLGQPSYEYHSRGRL         LRKGK EWHWSLAKLKGCVRVSSS   +  L++    FEAFN       +E++E  +         
Subjt:  KNSSQIGFQVMLGQPSYEYHSRGRLT-------GLRKGKFEWHWSLAKLKGCVRVSSSRRGKRRLESTRGYFEAFN-------TEKEEILWVLESSSLTW

Query:  TSRGKRVPILVQEQGIGRGDQPITFAANLVSYRAGGDWSTTYAPSPFYITSRMRSLYLEGYEYSVFDLTKNDMVQIQIYGNSIQGRILHGNSPSELIERF
          +GKRVPILVQEQGIGRGDQPITFAANLVSYRAGGDWSTTYAPSPFYITSRMRSLYLEGYEYSVFDLTKNDMVQIQIYGNSIQGRILHGNSPSELIERF
Subjt:  TSRGKRVPILVQEQGIGRGDQPITFAANLVSYRAGGDWSTTYAPSPFYITSRMRSLYLEGYEYSVFDLTKNDMVQIQIYGNSIQGRILHGNSPSELIERF

Query:  TETIGRPPELPGWIISGAVVGMQGGTDAVRQIWDVLKAHEVPISAFWL--------------------------------------QHIKVMTYCNPCLA
        TETIGRPPELPGWIISGAVVGMQGGTDAVRQIWDVLKAHEVPISAFWL                                      QHIKVMTYCNPCLA
Subjt:  TETIGRPPELPGWIISGAVVGMQGGTDAVRQIWDVLKAHEVPISAFWL--------------------------------------QHIKVMTYCNPCLA

Query:  PVSLFFNTSKSEILLIRSRNRRQRNLYEEAKGLGDLGKEEMENPTWFPTQLLMWECWILRTQITFQLVQGKSTEMV-------MMEF-------RTIISG
        P+                +  R+RNLYEEAK LG L K+  E P   P          L    T    +    EMV       M +F        T+ SG
Subjt:  PVSLFFNTSKSEILLIRSRNRRQRNLYEEAKGLGDLGKEEMENPTWFPTQLLMWECWILRTQITFQLVQGKSTEMV-------MMEF-------RTIISG

Query:  EDPITAHNRYPEMWAQINREFADE
        EDPITAHNRYPEMWAQINREFADE
Subjt:  EDPITAHNRYPEMWAQINREFADE

KAG7033580.1 yihQ, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+00100Show/hide
Query:  RIAYHPRNGSNQGLLFLVKISKAFQCLYELSELPWSSSPLFQHNTSMKNLKITKKHHIHLNNPFPSPPTSSPLVQGDLSANFQALPPYRVFSIGQDFQLL
        RIAYHPRNGSNQGLLFLVKISKAFQCLYELSELPWSSSPLFQHNTSMKNLKITKKHHIHLNNPFPSPPTSSPLVQGDLSANFQALPPYRVFSIGQDFQLL
Subjt:  RIAYHPRNGSNQGLLFLVKISKAFQCLYELSELPWSSSPLFQHNTSMKNLKITKKHHIHLNNPFPSPPTSSPLVQGDLSANFQALPPYRVFSIGQDFQLL

Query:  WRSENGGSLSIHHLSQPTRSIWSTIPGRAFVSAAMVETEVEESRGSFAVKDGAIRLVCNHQTIDDIKEINGCDYELEVRDHHFPSGYLDLDQKSHKKDAQ
        WRSENGGSLSIHHLSQPTRSIWSTIPGRAFVSAAMVETEVEESRGSFAVKDGAIRLVCNHQTIDDIKEINGCDYELEVRDHHFPSGYLDLDQKSHKKDAQ
Subjt:  WRSENGGSLSIHHLSQPTRSIWSTIPGRAFVSAAMVETEVEESRGSFAVKDGAIRLVCNHQTIDDIKEINGCDYELEVRDHHFPSGYLDLDQKSHKKDAQ

Query:  FPIMLLITWQNIQYQEEETTTVFKKLVSNVPPASARIWVLFDRKNSSQIGFQVMLGQPSYEYHSRGRLTGLRKGKFEWHWSLAKLKGCVRVSSSRRGKRR
        FPIMLLITWQNIQYQEEETTTVFKKLVSNVPPASARIWVLFDRKNSSQIGFQVMLGQPSYEYHSRGRLTGLRKGKFEWHWSLAKLKGCVRVSSSRRGKRR
Subjt:  FPIMLLITWQNIQYQEEETTTVFKKLVSNVPPASARIWVLFDRKNSSQIGFQVMLGQPSYEYHSRGRLTGLRKGKFEWHWSLAKLKGCVRVSSSRRGKRR

Query:  LESTRGYFEAFNTEKEEILWVLESSSLTWTSRGKRVPILVQEQGIGRGDQPITFAANLVSYRAGGDWSTTYAPSPFYITSRMRSLYLEGYEYSVFDLTKN
        LESTRGYFEAFNTEKEEILWVLESSSLTWTSRGKRVPILVQEQGIGRGDQPITFAANLVSYRAGGDWSTTYAPSPFYITSRMRSLYLEGYEYSVFDLTKN
Subjt:  LESTRGYFEAFNTEKEEILWVLESSSLTWTSRGKRVPILVQEQGIGRGDQPITFAANLVSYRAGGDWSTTYAPSPFYITSRMRSLYLEGYEYSVFDLTKN

Query:  DMVQIQIYGNSIQGRILHGNSPSELIERFTETIGRPPELPGWIISGAVVGMQGGTDAVRQIWDVLKAHEVPISAFWLQHIKVMTYCNPCLAPVSLFFNTS
        DMVQIQIYGNSIQGRILHGNSPSELIERFTETIGRPPELPGWIISGAVVGMQGGTDAVRQIWDVLKAHEVPISAFWLQHIKVMTYCNPCLAPVSLFFNTS
Subjt:  DMVQIQIYGNSIQGRILHGNSPSELIERFTETIGRPPELPGWIISGAVVGMQGGTDAVRQIWDVLKAHEVPISAFWLQHIKVMTYCNPCLAPVSLFFNTS

Query:  KSEILLIRSRNRRQRNLYEEAKGLGDLGKEEMENPTWFPTQLLMWECWILRTQITFQLVQGKSTEMVMMEFRTIISGEDPITAHNRYPEMWAQINREFAD
        KSEILLIRSRNRRQRNLYEEAKGLGDLGKEEMENPTWFPTQLLMWECWILRTQITFQLVQGKSTEMVMMEFRTIISGEDPITAHNRYPEMWAQINREFAD
Subjt:  KSEILLIRSRNRRQRNLYEEAKGLGDLGKEEMENPTWFPTQLLMWECWILRTQITFQLVQGKSTEMVMMEFRTIISGEDPITAHNRYPEMWAQINREFAD

Query:  E
        E
Subjt:  E

XP_022932849.1 uncharacterized protein LOC111439390 isoform X1 [Cucurbita moschata]3.2e-22969.87Show/hide
Query:  MKNLKITKKHHIHLNNPFPSPPTSSPLVQGDLSANFQALPPYRVFSIGQDFQLLWRSENGGSLSIHHLSQPTRSIWSTIPGRAFVSAAMVETEVEESRGS
        MKNLKITKKHHIHLNNPFPSPPTSSPLVQGDLSANFQALPPYRVFSIGQDFQLLWRSENGGSLSIHHLSQPTRSIWSTIPGRAFVSAAMVETEVEESRGS
Subjt:  MKNLKITKKHHIHLNNPFPSPPTSSPLVQGDLSANFQALPPYRVFSIGQDFQLLWRSENGGSLSIHHLSQPTRSIWSTIPGRAFVSAAMVETEVEESRGS

Query:  FAVKDGAIRLVCNHQTIDDIKEINGCDYELEVRDHHFPSGYLDLDQKSHKKDAQFPIMLL---ITWQNIQYQEEETTTVFKKLVSNVPPASARIWVLFDR
        FAVKDGAIRLVCNHQTIDDIKEINGCDYELEVRDHHFPSGYLDLDQKSHKKDAQFP++L+   I     +  +++     ++   NVPPASAR WVLF++
Subjt:  FAVKDGAIRLVCNHQTIDDIKEINGCDYELEVRDHHFPSGYLDLDQKSHKKDAQFPIMLL---ITWQNIQYQEEETTTVFKKLVSNVPPASARIWVLFDR

Query:  KNSSQIGFQVMLGQPSYEYHSRGRLT-------GLRKGKFEWHWSLAKLKGCVRVSSSRRGKRRLESTRGYFEAFN-------TEKEEILWVLESSSLTW
        KNSSQIGFQVMLGQPSYEYHSRGRL         LRKGK EWH SLAKLKGCVRVSSS   +  L++    FEAFN       +E++E  +         
Subjt:  KNSSQIGFQVMLGQPSYEYHSRGRLT-------GLRKGKFEWHWSLAKLKGCVRVSSSRRGKRRLESTRGYFEAFN-------TEKEEILWVLESSSLTW

Query:  TSRGKRVPILVQEQGIGRGDQPITFAANLVSYRAGGDWSTTYAPSPFYITSRMRSLYLEGYEYSVFDLTKNDMVQIQIYGNSIQGRILHGNSPSELIERF
          +GKRVPILVQEQGIGRGDQPITFAANLVSYRAGGDWSTTYAPSPFYITSRMRSLYLEGYEYSVFDLTKNDMVQIQIYGNSIQGRILHGNSPSELIERF
Subjt:  TSRGKRVPILVQEQGIGRGDQPITFAANLVSYRAGGDWSTTYAPSPFYITSRMRSLYLEGYEYSVFDLTKNDMVQIQIYGNSIQGRILHGNSPSELIERF

Query:  TETIGRPPELPGWIISGAVVGMQGGTDAVRQIWDVLKAHEVPISAFWL--------------------------------------QHIKVMTYCNPCLA
        TETIGRPPELPGWIISGAVVGMQGGTDAVRQIWDVLKAHEVPISAFWL                                      QHIKVMTYCNPCLA
Subjt:  TETIGRPPELPGWIISGAVVGMQGGTDAVRQIWDVLKAHEVPISAFWL--------------------------------------QHIKVMTYCNPCLA

Query:  PVSLFFNTSKSEILLIRSRNRRQRNLYEEAKGLGDLGKEEMENPTWFPTQLLMWECWILRTQITFQLVQGKSTEMV-------MMEF-------RTIISG
        P+                +  R+RNLYEEAKGLG L K+  E P   P          L    T    +    EMV       M +F        T+ SG
Subjt:  PVSLFFNTSKSEILLIRSRNRRQRNLYEEAKGLGDLGKEEMENPTWFPTQLLMWECWILRTQITFQLVQGKSTEMV-------MMEF-------RTIISG

Query:  EDPITAHNRYPEMWAQINREFADE
        EDPITAHNRYPEMWAQINREFADE
Subjt:  EDPITAHNRYPEMWAQINREFADE

XP_022932850.1 uncharacterized protein LOC111439390 isoform X2 [Cucurbita moschata]3.2e-22969.87Show/hide
Query:  MKNLKITKKHHIHLNNPFPSPPTSSPLVQGDLSANFQALPPYRVFSIGQDFQLLWRSENGGSLSIHHLSQPTRSIWSTIPGRAFVSAAMVETEVEESRGS
        MKNLKITKKHHIHLNNPFPSPPTSSPLVQGDLSANFQALPPYRVFSIGQDFQLLWRSENGGSLSIHHLSQPTRSIWSTIPGRAFVSAAMVETEVEESRGS
Subjt:  MKNLKITKKHHIHLNNPFPSPPTSSPLVQGDLSANFQALPPYRVFSIGQDFQLLWRSENGGSLSIHHLSQPTRSIWSTIPGRAFVSAAMVETEVEESRGS

Query:  FAVKDGAIRLVCNHQTIDDIKEINGCDYELEVRDHHFPSGYLDLDQKSHKKDAQFPIMLL---ITWQNIQYQEEETTTVFKKLVSNVPPASARIWVLFDR
        FAVKDGAIRLVCNHQTIDDIKEINGCDYELEVRDHHFPSGYLDLDQKSHKKDAQFP++L+   I     +  +++     ++   NVPPASAR WVLF++
Subjt:  FAVKDGAIRLVCNHQTIDDIKEINGCDYELEVRDHHFPSGYLDLDQKSHKKDAQFPIMLL---ITWQNIQYQEEETTTVFKKLVSNVPPASARIWVLFDR

Query:  KNSSQIGFQVMLGQPSYEYHSRGRLT-------GLRKGKFEWHWSLAKLKGCVRVSSSRRGKRRLESTRGYFEAFN-------TEKEEILWVLESSSLTW
        KNSSQIGFQVMLGQPSYEYHSRGRL         LRKGK EWH SLAKLKGCVRVSSS   +  L++    FEAFN       +E++E  +         
Subjt:  KNSSQIGFQVMLGQPSYEYHSRGRLT-------GLRKGKFEWHWSLAKLKGCVRVSSSRRGKRRLESTRGYFEAFN-------TEKEEILWVLESSSLTW

Query:  TSRGKRVPILVQEQGIGRGDQPITFAANLVSYRAGGDWSTTYAPSPFYITSRMRSLYLEGYEYSVFDLTKNDMVQIQIYGNSIQGRILHGNSPSELIERF
          +GKRVPILVQEQGIGRGDQPITFAANLVSYRAGGDWSTTYAPSPFYITSRMRSLYLEGYEYSVFDLTKNDMVQIQIYGNSIQGRILHGNSPSELIERF
Subjt:  TSRGKRVPILVQEQGIGRGDQPITFAANLVSYRAGGDWSTTYAPSPFYITSRMRSLYLEGYEYSVFDLTKNDMVQIQIYGNSIQGRILHGNSPSELIERF

Query:  TETIGRPPELPGWIISGAVVGMQGGTDAVRQIWDVLKAHEVPISAFWL--------------------------------------QHIKVMTYCNPCLA
        TETIGRPPELPGWIISGAVVGMQGGTDAVRQIWDVLKAHEVPISAFWL                                      QHIKVMTYCNPCLA
Subjt:  TETIGRPPELPGWIISGAVVGMQGGTDAVRQIWDVLKAHEVPISAFWL--------------------------------------QHIKVMTYCNPCLA

Query:  PVSLFFNTSKSEILLIRSRNRRQRNLYEEAKGLGDLGKEEMENPTWFPTQLLMWECWILRTQITFQLVQGKSTEMV-------MMEF-------RTIISG
        P+                +  R+RNLYEEAKGLG L K+  E P   P          L    T    +    EMV       M +F        T+ SG
Subjt:  PVSLFFNTSKSEILLIRSRNRRQRNLYEEAKGLGDLGKEEMENPTWFPTQLLMWECWILRTQITFQLVQGKSTEMV-------MMEF-------RTIISG

Query:  EDPITAHNRYPEMWAQINREFADE
        EDPITAHNRYPEMWAQINREFADE
Subjt:  EDPITAHNRYPEMWAQINREFADE

XP_023544824.1 uncharacterized protein LOC111804301 [Cucurbita pepo subsp. pepo]3.7e-23369.89Show/hide
Query:  LFQHNTSMKNLKITKKHHIHLNNPFPSPPTSSPLVQGDLSANFQALPPYRVFSIGQDFQLLWRSENGGSLSIHHLSQPTRSIWSTIPGRAFVSAAMVETE
        LFQHNTSMKNLKITKKHHIHLNNPFPSPPTSSPLVQGDLSANFQALPPYRVFSIGQDFQLLWRSENGGSLSIHHLSQP+RSIWSTIPGRAFVSAAMVETE
Subjt:  LFQHNTSMKNLKITKKHHIHLNNPFPSPPTSSPLVQGDLSANFQALPPYRVFSIGQDFQLLWRSENGGSLSIHHLSQPTRSIWSTIPGRAFVSAAMVETE

Query:  VEESRGSFAVKDGAIRLVCNHQTIDDIKEINGCDYELEVRDHHFPSGYLDLDQKSHKKDAQFPIMLL---ITWQNIQYQEEETTTVFKKLVSNVPPASAR
        VEESRGSFAVKDGAIRLVCNHQTIDDIKEINGCDYELEVRDHHFPSGYLDLDQKSHKKDAQFP++L+   I     +  ++      ++   NVPPASAR
Subjt:  VEESRGSFAVKDGAIRLVCNHQTIDDIKEINGCDYELEVRDHHFPSGYLDLDQKSHKKDAQFPIMLL---ITWQNIQYQEEETTTVFKKLVSNVPPASAR

Query:  IWVLFDRKNSSQIGFQVMLGQPSYEYHSRGRLT-------GLRKGKFEWHWSLAKLKGCVRVSSSRRGKRRLESTRGYFEAFN-------TEKEEILWVL
         WVLF++KNSSQIGFQVMLGQPSYEYHSRGRL+        LRKGK EWHWSLAKLKGCVRVSSS + +  L++    FEAFN       +E++E  +  
Subjt:  IWVLFDRKNSSQIGFQVMLGQPSYEYHSRGRLT-------GLRKGKFEWHWSLAKLKGCVRVSSSRRGKRRLESTRGYFEAFN-------TEKEEILWVL

Query:  ESSSLTWTSRGKRVPILVQEQGIGRGDQPITFAANLVSYRAGGDWSTTYAPSPFYITSRMRSLYLEGYEYSVFDLTKNDMVQIQIYGNSIQGRILHGNSP
                 +GKRVPILVQEQGIGRGDQPITFAANLVSYRAGGDWSTTYAPSPFYITSRMRSLYLEGYEYSVFDLTKND VQIQIYGNSIQGRILHGNSP
Subjt:  ESSSLTWTSRGKRVPILVQEQGIGRGDQPITFAANLVSYRAGGDWSTTYAPSPFYITSRMRSLYLEGYEYSVFDLTKNDMVQIQIYGNSIQGRILHGNSP

Query:  SELIERFTETIGRPPELPGWIISGAVVGMQGGTDAVRQIWDVLKAHEVPISAFWL--------------------------------------QHIKVMT
        SELIERFTETIGRPPELPGWIISGAVVGMQGGTDAVRQIWDVLKAHEVPISAFWL                                      QHIKVMT
Subjt:  SELIERFTETIGRPPELPGWIISGAVVGMQGGTDAVRQIWDVLKAHEVPISAFWL--------------------------------------QHIKVMT

Query:  YCNPCLAPVSLFFNTSKSEILLIRSRNRRQRNLYEEAKGLGDLGKEEMENPTWFPTQLLMWECWILRTQITFQLVQGKSTEMV-------MMEF------
        YCNPCLAP+                +  R+RNLYEEAK LG L K+  E P   P          L    T    +    EMV       M +F      
Subjt:  YCNPCLAPVSLFFNTSKSEILLIRSRNRRQRNLYEEAKGLGDLGKEEMENPTWFPTQLLMWECWILRTQITFQLVQGKSTEMV-------MMEF------

Query:  -RTIISGEDPITAHNRYPEMWAQINREFADE
          T+ SGEDPITAHNRYPEMWAQINREFADE
Subjt:  -RTIISGEDPITAHNRYPEMWAQINREFADE

TrEMBL top hitse value%identityAlignment
A0A0A0L3I8 Uncharacterized protein4.7e-20260.64Show/hide
Query:  ELPWSSSPLF-QHNTSMKNLKITKKHHIHLNNPFPSPPTSSPLVQGDLSANFQALPPYRVFSIGQDFQLLWRSENGGSLSIHHLSQPTRSIWSTIPGRAF
        +LP   S LF QHN +M NLK+TKKHHIHLNNPFPSPP S PL+QG+LSAN+QAL  Y+ FSIG+DFQLLWRS+NGGSLSI+HLS PTRSIWSTI G+AF
Subjt:  ELPWSSSPLF-QHNTSMKNLKITKKHHIHLNNPFPSPPTSSPLVQGDLSANFQALPPYRVFSIGQDFQLLWRSENGGSLSIHHLSQPTRSIWSTIPGRAF

Query:  VSAAMVETEVEESRGSFAVKDGAIRLVCNHQTIDDIKEINGCDYELEVRDHHFPSGYLDLDQKSH-KKDAQFPIMLLITWQNIQYQEEETTTVFKKLV--
        VSAAMVETEVEESRGSFAVKDGA+ L+CNHQTIDDIKEINGCD+E EV++HHFPSGYL LD K++ K+DAQFP MLLI+ +    +++       KL   
Subjt:  VSAAMVETEVEESRGSFAVKDGAIRLVCNHQTIDDIKEINGCDYELEVRDHHFPSGYLDLDQKSH-KKDAQFPIMLLITWQNIQYQEEETTTVFKKLV--

Query:  ---------SNVPPASARIWVLFDRKNSSQIGFQVMLGQPSYEY----HSRG-------RLTGLRKGKFEWHWSLAKLKGCVRVSSSRRGKRRLESTRGY
                 S V  ASAR WV F++K+SSQIGFQVMLGQPSYE+    HSRG       RL  LRK KFEWHWSL KLKG VRV SS +    L +    
Subjt:  ---------SNVPPASARIWVLFDRKNSSQIGFQVMLGQPSYEY----HSRG-------RLTGLRKGKFEWHWSLAKLKGCVRVSSSRRGKRRLESTRGY

Query:  FEAFN-------TEKEEILWVLESSSLTWTSRGKRVPILVQEQGIGRGDQPITFAANLVSYRAGGDWSTTYAPSPFYITSRMRSLYLEGYEYSVFDLTKN
        FEAFN       +E++E  +           +GKRVPI VQEQGIGRGDQPITFAANL+SYRAGGDWSTTYAPSPFY+TS+MRSLYLEGYEYS+FDLTKN
Subjt:  FEAFN-------TEKEEILWVLESSSLTWTSRGKRVPILVQEQGIGRGDQPITFAANLVSYRAGGDWSTTYAPSPFYITSRMRSLYLEGYEYSVFDLTKN

Query:  DMVQIQIYGNSIQGRILHGNSPSELIERFTETIGRPPELPGWIISGAVVGMQGGTDAVRQIWDVLKAHEVPISAFWLQ----------------------
        D VQIQI+GNS+QGRILHGNSPSELIERFTETIGRPPELPGWIISGAVVGMQGGT+ VR+IWD LKAHEVPISAFWLQ                      
Subjt:  DMVQIQIYGNSIQGRILHGNSPSELIERFTETIGRPPELPGWIISGAVVGMQGGTDAVRQIWDVLKAHEVPISAFWLQ----------------------

Query:  ----------------HIKVMTYCNPCLAPVSLFFNTSKSEILLIRSRNRRQRNLYEEAKGLGDLGKEEMENPTWFPTQLLMWECWILRTQITFQLVQGK
                        HIKVMTYCNPCLAP                 +  R+RNLYEEAK LG L K++   P   P          L    T    +  
Subjt:  ----------------HIKVMTYCNPCLAPVSLFFNTSKSEILLIRSRNRRQRNLYEEAKGLGDLGKEEMENPTWFPTQLLMWECWILRTQITFQLVQGK

Query:  STEMV-------MMEF-------RTIISGEDPITAHNRYPEMWAQINREFADE
          EMV       M +F        T+ SGEDPITAHNRYPE+WAQINREF DE
Subjt:  STEMV-------MMEF-------RTIISGEDPITAHNRYPEMWAQINREFADE

A0A6J1EY61 uncharacterized protein LOC111439390 isoform X11.6e-22969.87Show/hide
Query:  MKNLKITKKHHIHLNNPFPSPPTSSPLVQGDLSANFQALPPYRVFSIGQDFQLLWRSENGGSLSIHHLSQPTRSIWSTIPGRAFVSAAMVETEVEESRGS
        MKNLKITKKHHIHLNNPFPSPPTSSPLVQGDLSANFQALPPYRVFSIGQDFQLLWRSENGGSLSIHHLSQPTRSIWSTIPGRAFVSAAMVETEVEESRGS
Subjt:  MKNLKITKKHHIHLNNPFPSPPTSSPLVQGDLSANFQALPPYRVFSIGQDFQLLWRSENGGSLSIHHLSQPTRSIWSTIPGRAFVSAAMVETEVEESRGS

Query:  FAVKDGAIRLVCNHQTIDDIKEINGCDYELEVRDHHFPSGYLDLDQKSHKKDAQFPIMLL---ITWQNIQYQEEETTTVFKKLVSNVPPASARIWVLFDR
        FAVKDGAIRLVCNHQTIDDIKEINGCDYELEVRDHHFPSGYLDLDQKSHKKDAQFP++L+   I     +  +++     ++   NVPPASAR WVLF++
Subjt:  FAVKDGAIRLVCNHQTIDDIKEINGCDYELEVRDHHFPSGYLDLDQKSHKKDAQFPIMLL---ITWQNIQYQEEETTTVFKKLVSNVPPASARIWVLFDR

Query:  KNSSQIGFQVMLGQPSYEYHSRGRLT-------GLRKGKFEWHWSLAKLKGCVRVSSSRRGKRRLESTRGYFEAFN-------TEKEEILWVLESSSLTW
        KNSSQIGFQVMLGQPSYEYHSRGRL         LRKGK EWH SLAKLKGCVRVSSS   +  L++    FEAFN       +E++E  +         
Subjt:  KNSSQIGFQVMLGQPSYEYHSRGRLT-------GLRKGKFEWHWSLAKLKGCVRVSSSRRGKRRLESTRGYFEAFN-------TEKEEILWVLESSSLTW

Query:  TSRGKRVPILVQEQGIGRGDQPITFAANLVSYRAGGDWSTTYAPSPFYITSRMRSLYLEGYEYSVFDLTKNDMVQIQIYGNSIQGRILHGNSPSELIERF
          +GKRVPILVQEQGIGRGDQPITFAANLVSYRAGGDWSTTYAPSPFYITSRMRSLYLEGYEYSVFDLTKNDMVQIQIYGNSIQGRILHGNSPSELIERF
Subjt:  TSRGKRVPILVQEQGIGRGDQPITFAANLVSYRAGGDWSTTYAPSPFYITSRMRSLYLEGYEYSVFDLTKNDMVQIQIYGNSIQGRILHGNSPSELIERF

Query:  TETIGRPPELPGWIISGAVVGMQGGTDAVRQIWDVLKAHEVPISAFWL--------------------------------------QHIKVMTYCNPCLA
        TETIGRPPELPGWIISGAVVGMQGGTDAVRQIWDVLKAHEVPISAFWL                                      QHIKVMTYCNPCLA
Subjt:  TETIGRPPELPGWIISGAVVGMQGGTDAVRQIWDVLKAHEVPISAFWL--------------------------------------QHIKVMTYCNPCLA

Query:  PVSLFFNTSKSEILLIRSRNRRQRNLYEEAKGLGDLGKEEMENPTWFPTQLLMWECWILRTQITFQLVQGKSTEMV-------MMEF-------RTIISG
        P+                +  R+RNLYEEAKGLG L K+  E P   P          L    T    +    EMV       M +F        T+ SG
Subjt:  PVSLFFNTSKSEILLIRSRNRRQRNLYEEAKGLGDLGKEEMENPTWFPTQLLMWECWILRTQITFQLVQGKSTEMV-------MMEF-------RTIISG

Query:  EDPITAHNRYPEMWAQINREFADE
        EDPITAHNRYPEMWAQINREFADE
Subjt:  EDPITAHNRYPEMWAQINREFADE

A0A6J1F389 uncharacterized protein LOC111439390 isoform X21.6e-22969.87Show/hide
Query:  MKNLKITKKHHIHLNNPFPSPPTSSPLVQGDLSANFQALPPYRVFSIGQDFQLLWRSENGGSLSIHHLSQPTRSIWSTIPGRAFVSAAMVETEVEESRGS
        MKNLKITKKHHIHLNNPFPSPPTSSPLVQGDLSANFQALPPYRVFSIGQDFQLLWRSENGGSLSIHHLSQPTRSIWSTIPGRAFVSAAMVETEVEESRGS
Subjt:  MKNLKITKKHHIHLNNPFPSPPTSSPLVQGDLSANFQALPPYRVFSIGQDFQLLWRSENGGSLSIHHLSQPTRSIWSTIPGRAFVSAAMVETEVEESRGS

Query:  FAVKDGAIRLVCNHQTIDDIKEINGCDYELEVRDHHFPSGYLDLDQKSHKKDAQFPIMLL---ITWQNIQYQEEETTTVFKKLVSNVPPASARIWVLFDR
        FAVKDGAIRLVCNHQTIDDIKEINGCDYELEVRDHHFPSGYLDLDQKSHKKDAQFP++L+   I     +  +++     ++   NVPPASAR WVLF++
Subjt:  FAVKDGAIRLVCNHQTIDDIKEINGCDYELEVRDHHFPSGYLDLDQKSHKKDAQFPIMLL---ITWQNIQYQEEETTTVFKKLVSNVPPASARIWVLFDR

Query:  KNSSQIGFQVMLGQPSYEYHSRGRLT-------GLRKGKFEWHWSLAKLKGCVRVSSSRRGKRRLESTRGYFEAFN-------TEKEEILWVLESSSLTW
        KNSSQIGFQVMLGQPSYEYHSRGRL         LRKGK EWH SLAKLKGCVRVSSS   +  L++    FEAFN       +E++E  +         
Subjt:  KNSSQIGFQVMLGQPSYEYHSRGRLT-------GLRKGKFEWHWSLAKLKGCVRVSSSRRGKRRLESTRGYFEAFN-------TEKEEILWVLESSSLTW

Query:  TSRGKRVPILVQEQGIGRGDQPITFAANLVSYRAGGDWSTTYAPSPFYITSRMRSLYLEGYEYSVFDLTKNDMVQIQIYGNSIQGRILHGNSPSELIERF
          +GKRVPILVQEQGIGRGDQPITFAANLVSYRAGGDWSTTYAPSPFYITSRMRSLYLEGYEYSVFDLTKNDMVQIQIYGNSIQGRILHGNSPSELIERF
Subjt:  TSRGKRVPILVQEQGIGRGDQPITFAANLVSYRAGGDWSTTYAPSPFYITSRMRSLYLEGYEYSVFDLTKNDMVQIQIYGNSIQGRILHGNSPSELIERF

Query:  TETIGRPPELPGWIISGAVVGMQGGTDAVRQIWDVLKAHEVPISAFWL--------------------------------------QHIKVMTYCNPCLA
        TETIGRPPELPGWIISGAVVGMQGGTDAVRQIWDVLKAHEVPISAFWL                                      QHIKVMTYCNPCLA
Subjt:  TETIGRPPELPGWIISGAVVGMQGGTDAVRQIWDVLKAHEVPISAFWL--------------------------------------QHIKVMTYCNPCLA

Query:  PVSLFFNTSKSEILLIRSRNRRQRNLYEEAKGLGDLGKEEMENPTWFPTQLLMWECWILRTQITFQLVQGKSTEMV-------MMEF-------RTIISG
        P+                +  R+RNLYEEAKGLG L K+  E P   P          L    T    +    EMV       M +F        T+ SG
Subjt:  PVSLFFNTSKSEILLIRSRNRRQRNLYEEAKGLGDLGKEEMENPTWFPTQLLMWECWILRTQITFQLVQGKSTEMV-------MMEF-------RTIISG

Query:  EDPITAHNRYPEMWAQINREFADE
        EDPITAHNRYPEMWAQINREFADE
Subjt:  EDPITAHNRYPEMWAQINREFADE

A0A6J1HRK3 uncharacterized protein LOC111467157 isoform X27.8e-22968.62Show/hide
Query:  LFQHNTSMKNLKITKKHHIHLNNPFPSPPTSSPLVQGDLSANFQALPPYRVFSIGQDFQLLWRSENGGSLSIHHLSQPTRSIWSTIPGRAFVSAAMVETE
        LFQHN SMKNLKITKKHHIHLNNPFPS PTS PLVQGDLSANFQALPPYRVFSIGQDFQLLWRSENGGSLSIHHLSQPTRSIWSTIPGRAFVSAAMVETE
Subjt:  LFQHNTSMKNLKITKKHHIHLNNPFPSPPTSSPLVQGDLSANFQALPPYRVFSIGQDFQLLWRSENGGSLSIHHLSQPTRSIWSTIPGRAFVSAAMVETE

Query:  VEESRGSFAVKDGAIRLVCNHQTIDDIKEINGCDYELEVRDHHFPSGYLDLDQKSHKKDAQFPIMLL---ITWQNIQYQEEETTTVFKKLVSNVPPASAR
        VEESRGSF+VKDGAIRLVCNHQTIDDIKEINGCDYELEVRDHHFPSGYLDLDQKSHKKDAQFP++L+   I     +  +++     ++   NVPPASAR
Subjt:  VEESRGSFAVKDGAIRLVCNHQTIDDIKEINGCDYELEVRDHHFPSGYLDLDQKSHKKDAQFPIMLL---ITWQNIQYQEEETTTVFKKLVSNVPPASAR

Query:  IWVLFDRKNSSQIGFQVMLGQPSYEYHSRGRLT-------GLRKGKFEWHWSLAKLKGCVRVSSSRRGKRRLESTRGYFEAFN-------TEKEEILWVL
         WVLF++KN+SQIGFQVMLGQPSYEYHSRGRL         LRKGK EWHWSLAKLK CVRVSSS   +  L++    F+AFN       +E++E  +  
Subjt:  IWVLFDRKNSSQIGFQVMLGQPSYEYHSRGRLT-------GLRKGKFEWHWSLAKLKGCVRVSSSRRGKRRLESTRGYFEAFN-------TEKEEILWVL

Query:  ESSSLTWTSRGKRVPILVQEQGIGRGDQPITFAANLVSYRAGGDWSTTYAPSPFYITSRMRSLYLEGYEYSVFDLTKNDMVQIQIYGNSIQGRILHGNSP
                 +GKRVPILVQEQGIGRGDQPITFAANLVSYRAGGDWSTTYAPSPFYITSR+RSLYLEGYEYSVFDLTKN+MVQIQIYGNSIQGRILHGNSP
Subjt:  ESSSLTWTSRGKRVPILVQEQGIGRGDQPITFAANLVSYRAGGDWSTTYAPSPFYITSRMRSLYLEGYEYSVFDLTKNDMVQIQIYGNSIQGRILHGNSP

Query:  SELIERFTETIGRPPELPGWIISGAVVGMQGGTDAVRQIWDVLKAHEVPISAFWL--------------------------------------QHIKVMT
        SELIERFTETIGRPPELPGWIISGAVVGMQGGTDAVRQIWDVLKAHEVPISAFWL                                      QHIKVMT
Subjt:  SELIERFTETIGRPPELPGWIISGAVVGMQGGTDAVRQIWDVLKAHEVPISAFWL--------------------------------------QHIKVMT

Query:  YCNPCLAPVSLFFNTSKSEILLIRSRNRRQRNLYEEAKGLGDLGKEEMENPTWFPTQLLMWECWILRTQITFQLVQGKSTEMV-------MMEF------
        YCNPCLAP+                +  R+RNLY+EAK LG L K+  E P   P          L    T    +    EMV       M +F      
Subjt:  YCNPCLAPVSLFFNTSKSEILLIRSRNRRQRNLYEEAKGLGDLGKEEMENPTWFPTQLLMWECWILRTQITFQLVQGKSTEMV-------MMEF------

Query:  -RTIISGEDPITAHNRYPEMWAQINREFADE
          T+ SGEDPITAHNRYPEMWAQINREFADE
Subjt:  -RTIISGEDPITAHNRYPEMWAQINREFADE

A0A6J1HSX1 uncharacterized protein LOC111467157 isoform X17.8e-22968.62Show/hide
Query:  LFQHNTSMKNLKITKKHHIHLNNPFPSPPTSSPLVQGDLSANFQALPPYRVFSIGQDFQLLWRSENGGSLSIHHLSQPTRSIWSTIPGRAFVSAAMVETE
        LFQHN SMKNLKITKKHHIHLNNPFPS PTS PLVQGDLSANFQALPPYRVFSIGQDFQLLWRSENGGSLSIHHLSQPTRSIWSTIPGRAFVSAAMVETE
Subjt:  LFQHNTSMKNLKITKKHHIHLNNPFPSPPTSSPLVQGDLSANFQALPPYRVFSIGQDFQLLWRSENGGSLSIHHLSQPTRSIWSTIPGRAFVSAAMVETE

Query:  VEESRGSFAVKDGAIRLVCNHQTIDDIKEINGCDYELEVRDHHFPSGYLDLDQKSHKKDAQFPIMLL---ITWQNIQYQEEETTTVFKKLVSNVPPASAR
        VEESRGSF+VKDGAIRLVCNHQTIDDIKEINGCDYELEVRDHHFPSGYLDLDQKSHKKDAQFP++L+   I     +  +++     ++   NVPPASAR
Subjt:  VEESRGSFAVKDGAIRLVCNHQTIDDIKEINGCDYELEVRDHHFPSGYLDLDQKSHKKDAQFPIMLL---ITWQNIQYQEEETTTVFKKLVSNVPPASAR

Query:  IWVLFDRKNSSQIGFQVMLGQPSYEYHSRGRLT-------GLRKGKFEWHWSLAKLKGCVRVSSSRRGKRRLESTRGYFEAFN-------TEKEEILWVL
         WVLF++KN+SQIGFQVMLGQPSYEYHSRGRL         LRKGK EWHWSLAKLK CVRVSSS   +  L++    F+AFN       +E++E  +  
Subjt:  IWVLFDRKNSSQIGFQVMLGQPSYEYHSRGRLT-------GLRKGKFEWHWSLAKLKGCVRVSSSRRGKRRLESTRGYFEAFN-------TEKEEILWVL

Query:  ESSSLTWTSRGKRVPILVQEQGIGRGDQPITFAANLVSYRAGGDWSTTYAPSPFYITSRMRSLYLEGYEYSVFDLTKNDMVQIQIYGNSIQGRILHGNSP
                 +GKRVPILVQEQGIGRGDQPITFAANLVSYRAGGDWSTTYAPSPFYITSR+RSLYLEGYEYSVFDLTKN+MVQIQIYGNSIQGRILHGNSP
Subjt:  ESSSLTWTSRGKRVPILVQEQGIGRGDQPITFAANLVSYRAGGDWSTTYAPSPFYITSRMRSLYLEGYEYSVFDLTKNDMVQIQIYGNSIQGRILHGNSP

Query:  SELIERFTETIGRPPELPGWIISGAVVGMQGGTDAVRQIWDVLKAHEVPISAFWL--------------------------------------QHIKVMT
        SELIERFTETIGRPPELPGWIISGAVVGMQGGTDAVRQIWDVLKAHEVPISAFWL                                      QHIKVMT
Subjt:  SELIERFTETIGRPPELPGWIISGAVVGMQGGTDAVRQIWDVLKAHEVPISAFWL--------------------------------------QHIKVMT

Query:  YCNPCLAPVSLFFNTSKSEILLIRSRNRRQRNLYEEAKGLGDLGKEEMENPTWFPTQLLMWECWILRTQITFQLVQGKSTEMV-------MMEF------
        YCNPCLAP+                +  R+RNLY+EAK LG L K+  E P   P          L    T    +    EMV       M +F      
Subjt:  YCNPCLAPVSLFFNTSKSEILLIRSRNRRQRNLYEEAKGLGDLGKEEMENPTWFPTQLLMWECWILRTQITFQLVQGKSTEMV-------MMEF------

Query:  -RTIISGEDPITAHNRYPEMWAQINREFADE
          T+ SGEDPITAHNRYPEMWAQINREFADE
Subjt:  -RTIISGEDPITAHNRYPEMWAQINREFADE

SwissProt top hitse value%identityAlignment
P32138 Sulfoquinovosidase2.6e-1629.73Show/hide
Query:  RGKRVPILVQEQGIGRGDQP-ITFAANLVSYRAGGDWSTTYAPSPFYITSRMRSLYLEGYEYSVFDLTKNDMVQIQIYGNSIQGRILHGNSPSELIERFT
        RGK  P+   EQG+GR  Q  +T+ A+     AGGD+  T+ P P +++++    +++   Y  FD +  +  ++ ++ +    R    ++   L+E+ T
Subjt:  RGKRVPILVQEQGIGRGDQP-ITFAANLVSYRAGGDWSTTYAPSPFYITSRMRSLYLEGYEYSVFDLTKNDMVQIQIYGNSIQGRILHGNSPSELIERFT

Query:  ETIGRPPELPGWIISGAVVGMQGGTDAVRQIWDVLKAHEVPISAFWLQ
          +GR PELP WI  G  +G+QGGT+  ++  D ++   V ++  W Q
Subjt:  ETIGRPPELPGWIISGAVVGMQGGTDAVRQIWDVLKAHEVPISAFWLQ

P96793 Alpha-xylosidase XylQ6.8e-0422.96Show/hide
Query:  YAPSPFYITSRMRSLYLEGYEYSVFDLTKN--DMVQIQIYGNSIQGRILHGNSPSELIERFTETIGRPPELPGWIISGAVVGMQGGTD----AVRQIWDV
        Y   PFYI+S    ++++  +   F++     D VQ    G S+Q  +++G +P E++ R+T+  G     P W   G  +     TD     V +  D 
Subjt:  YAPSPFYITSRMRSLYLEGYEYSVFDLTKN--DMVQIQIYGNSIQGRILHGNSPSELIERFTETIGRPPELPGWIISGAVVGMQGGTD----AVRQIWDV

Query:  LKAHEVPISAF-----------WL---------------------QHIKVMTYCNPCLAPVSLFFNTSKSEILLIRSRNRR--QRNLYEEAKGLGD
        ++ H +P+  F           W                      + IKV  + NP +A  S  F  +K +  L+   N    Q +L++   G  D
Subjt:  LKAHEVPISAF-----------WL---------------------QHIKVMTYCNPCLAPVSLFFNTSKSEILLIRSRNRR--QRNLYEEAKGLGD

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
AGAATTGCGTACCATCCCAGAAATGGAAGCAACCAAGGACTTCTCTTCCTCGTAAAGATTTCAAAAGCCTTCCAGTGTCTATATGAACTTTCGGAACTTCCATGGTCTTC
TTCTCCCTTATTCCAACACAACACAAGCATGAAAAACCTCAAAATCACCAAAAAACATCACATACACCTCAACAATCCTTTCCCTTCTCCTCCAACTTCATCTCCTTTAG
TGCAGGGGGACCTCTCTGCAAACTTTCAAGCACTTCCTCCGTACAGAGTCTTCTCAATCGGACAGGATTTTCAGCTTCTATGGAGGTCCGAAAATGGTGGGTCTCTCTCA
ATTCATCATCTCTCTCAACCCACCAGATCAATCTGGTCAACAATCCCAGGGCGAGCTTTCGTATCTGCAGCAATGGTGGAAACTGAGGTGGAAGAAAGCCGAGGTTCGTT
TGCTGTCAAAGATGGGGCTATACGTTTGGTTTGTAATCATCAAACAATTGATGACATTAAGGAGATCAATGGCTGCGATTATGAATTGGAGGTTAGGGATCATCATTTTC
CATCTGGGTATTTGGATTTGGACCAGAAAAGCCATAAAAAAGATGCCCAGTTCCCTATTATGTTGCTCATTACATGGCAGAATATTCAATACCAAGAAGAAGAAACAACA
ACAGTCTTCAAGAAACTGGTTTCCAATGTTCCGCCTGCTTCTGCAAGGATTTGGGTATTGTTTGACAGAAAAAACAGCAGCCAAATTGGGTTTCAAGTGATGCTTGGGCA
ACCCAGCTATGAATATCATTCAAGAGGGAGACTCACAGGCTTAAGAAAGGGAAAGTTTGAATGGCATTGGTCTTTGGCAAAACTGAAAGGGTGTGTCAGAGTTTCTTCTT
CCAGAAGAGGAAAGAGAAGACTTGAAAGCACCCGAGGATATTTTGAAGCATTCAACACAGAGAAGGAGGAGATTCTTTGGGTTTTGGAGAGCAGTTCTCTCACATGGACT
TCAAGGGGAAAAAGAGTGCCAATCTTAGTTCAAGAACAGGGTATTGGTAGAGGAGATCAACCTATCACTTTTGCAGCTAATCTGGTTAGTTACAGGGCTGGAGGTGATTG
GAGTACTACCTATGCTCCTTCTCCATTTTACATAACATCCAGAATGAGGTCTCTGTACCTTGAAGGATACGAGTACTCTGTATTTGATCTAACAAAGAACGATATGGTTC
AGATTCAGATTTATGGAAATTCCATTCAAGGAAGGATATTGCATGGGAACTCGCCATCAGAGCTTATTGAACGTTTCACTGAGACCATTGGGAGGCCTCCAGAGCTTCCT
GGATGGATAATATCAGGTGCTGTGGTAGGGATGCAGGGTGGCACTGATGCCGTCCGCCAAATCTGGGATGTGCTAAAAGCTCATGAAGTTCCTATTTCTGCATTTTGGTT
GCAGCATATCAAAGTAATGACATACTGCAATCCCTGCCTAGCTCCGGTATCTCTCTTCTTCAACACCTCAAAATCAGAAATATTATTGATCAGAAGCAGAAACAGAAGGC
AGAGAAACCTTTATGAGGAAGCAAAGGGGTTGGGGGATCTTGGTAAAGAAGAAATGGAGAACCCTACATGGTTCCCAACACAGCTTTTGATGTGGGAATGTTGGATCTTA
CGCACCCAAATAACTTTCCAGTTGGTTCAAGGAAAATCTACAGAAATGGTGATGATGGAATTCAGAACCATAATTTCAGGTGAAGATCCTATTACTGCACATAACAGATA
CCCAGAAATGTGGGCGCAGATAAACAGAGAATTTGCAGATGAATGA
mRNA sequenceShow/hide mRNA sequence
AGAATTGCGTACCATCCCAGAAATGGAAGCAACCAAGGACTTCTCTTCCTCGTAAAGATTTCAAAAGCCTTCCAGTGTCTATATGAACTTTCGGAACTTCCATGGTCTTC
TTCTCCCTTATTCCAACACAACACAAGCATGAAAAACCTCAAAATCACCAAAAAACATCACATACACCTCAACAATCCTTTCCCTTCTCCTCCAACTTCATCTCCTTTAG
TGCAGGGGGACCTCTCTGCAAACTTTCAAGCACTTCCTCCGTACAGAGTCTTCTCAATCGGACAGGATTTTCAGCTTCTATGGAGGTCCGAAAATGGTGGGTCTCTCTCA
ATTCATCATCTCTCTCAACCCACCAGATCAATCTGGTCAACAATCCCAGGGCGAGCTTTCGTATCTGCAGCAATGGTGGAAACTGAGGTGGAAGAAAGCCGAGGTTCGTT
TGCTGTCAAAGATGGGGCTATACGTTTGGTTTGTAATCATCAAACAATTGATGACATTAAGGAGATCAATGGCTGCGATTATGAATTGGAGGTTAGGGATCATCATTTTC
CATCTGGGTATTTGGATTTGGACCAGAAAAGCCATAAAAAAGATGCCCAGTTCCCTATTATGTTGCTCATTACATGGCAGAATATTCAATACCAAGAAGAAGAAACAACA
ACAGTCTTCAAGAAACTGGTTTCCAATGTTCCGCCTGCTTCTGCAAGGATTTGGGTATTGTTTGACAGAAAAAACAGCAGCCAAATTGGGTTTCAAGTGATGCTTGGGCA
ACCCAGCTATGAATATCATTCAAGAGGGAGACTCACAGGCTTAAGAAAGGGAAAGTTTGAATGGCATTGGTCTTTGGCAAAACTGAAAGGGTGTGTCAGAGTTTCTTCTT
CCAGAAGAGGAAAGAGAAGACTTGAAAGCACCCGAGGATATTTTGAAGCATTCAACACAGAGAAGGAGGAGATTCTTTGGGTTTTGGAGAGCAGTTCTCTCACATGGACT
TCAAGGGGAAAAAGAGTGCCAATCTTAGTTCAAGAACAGGGTATTGGTAGAGGAGATCAACCTATCACTTTTGCAGCTAATCTGGTTAGTTACAGGGCTGGAGGTGATTG
GAGTACTACCTATGCTCCTTCTCCATTTTACATAACATCCAGAATGAGGTCTCTGTACCTTGAAGGATACGAGTACTCTGTATTTGATCTAACAAAGAACGATATGGTTC
AGATTCAGATTTATGGAAATTCCATTCAAGGAAGGATATTGCATGGGAACTCGCCATCAGAGCTTATTGAACGTTTCACTGAGACCATTGGGAGGCCTCCAGAGCTTCCT
GGATGGATAATATCAGGTGCTGTGGTAGGGATGCAGGGTGGCACTGATGCCGTCCGCCAAATCTGGGATGTGCTAAAAGCTCATGAAGTTCCTATTTCTGCATTTTGGTT
GCAGCATATCAAAGTAATGACATACTGCAATCCCTGCCTAGCTCCGGTATCTCTCTTCTTCAACACCTCAAAATCAGAAATATTATTGATCAGAAGCAGAAACAGAAGGC
AGAGAAACCTTTATGAGGAAGCAAAGGGGTTGGGGGATCTTGGTAAAGAAGAAATGGAGAACCCTACATGGTTCCCAACACAGCTTTTGATGTGGGAATGTTGGATCTTA
CGCACCCAAATAACTTTCCAGTTGGTTCAAGGAAAATCTACAGAAATGGTGATGATGGAATTCAGAACCATAATTTCAGGTGAAGATCCTATTACTGCACATAACAGATA
CCCAGAAATGTGGGCGCAGATAAACAGAGAATTTGCAGATGAATGA
Protein sequenceShow/hide protein sequence
RIAYHPRNGSNQGLLFLVKISKAFQCLYELSELPWSSSPLFQHNTSMKNLKITKKHHIHLNNPFPSPPTSSPLVQGDLSANFQALPPYRVFSIGQDFQLLWRSENGGSLS
IHHLSQPTRSIWSTIPGRAFVSAAMVETEVEESRGSFAVKDGAIRLVCNHQTIDDIKEINGCDYELEVRDHHFPSGYLDLDQKSHKKDAQFPIMLLITWQNIQYQEEETT
TVFKKLVSNVPPASARIWVLFDRKNSSQIGFQVMLGQPSYEYHSRGRLTGLRKGKFEWHWSLAKLKGCVRVSSSRRGKRRLESTRGYFEAFNTEKEEILWVLESSSLTWT
SRGKRVPILVQEQGIGRGDQPITFAANLVSYRAGGDWSTTYAPSPFYITSRMRSLYLEGYEYSVFDLTKNDMVQIQIYGNSIQGRILHGNSPSELIERFTETIGRPPELP
GWIISGAVVGMQGGTDAVRQIWDVLKAHEVPISAFWLQHIKVMTYCNPCLAPVSLFFNTSKSEILLIRSRNRRQRNLYEEAKGLGDLGKEEMENPTWFPTQLLMWECWIL
RTQITFQLVQGKSTEMVMMEFRTIISGEDPITAHNRYPEMWAQINREFADE