; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc07G10780 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc07G10780
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionULP_PROTEASE domain-containing protein
Genome locationClcChr07:25408331..25418678
RNA-Seq ExpressionClc07G10780
SyntenyClc07G10780
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0016020 - membrane (cellular component)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8649224.1 hypothetical protein Csa_014966 [Cucumis sativus]5.1e-20371.86Show/hide
Query:  EMSFVVDLSSKHYILQSASKKFRTFKSTLTQMYILPYKDEPSRLQNPPEKYSHIDKKQWESFVNARLSEEWE----------------------------
        +MSFVVDLSSKHYILQSASKKFR+FKSTLTQMYILPYKDEPSRLQ PPEKYSHIDKKQWESFV ARLSEEWE                            
Subjt:  EMSFVVDLSSKHYILQSASKKFRTFKSTLTQMYILPYKDEPSRLQNPPEKYSHIDKKQWESFVNARLSEEWE----------------------------

Query:  ---ELSSDPSNRAILWKEARKRKNNEYSDDATRECAKRIDELAAERKGQDILTEALGTPEHRGRIRGVGEFVSPALYLNVARGKLKLSPESQDEDETQHS
           ELSSDP NRA LWKEARKRKNN   DDAT EC KRIDELAA RKGQDILTEALGTPEHRGRIRGVGEFVSPAL++NVARG LKLS +SQDEDETQ S
Subjt:  ---ELSSDPSNRAILWKEARKRKNNEYSDDATRECAKRIDELAAERKGQDILTEALGTPEHRGRIRGVGEFVSPALYLNVARGKLKLSPESQDEDETQHS

Query:  IDEYETQQSQ-----------------KSSVLKKKTKGKKVQKGKKVQKGKKVPKGK------MVVKECEEILEGIPCHLAIGSMDNIVAVGTMFESDVQ
          E ETQQSQ                 +SSVL+KKTK KKVQKGKKV KGK V K         V++E E IL+GIPCHLAIGS+DN+VA+G MFESDVQ
Subjt:  IDEYETQQSQ-----------------KSSVLKKKTKGKKVQKGKKVQKGKKVPKGK------MVVKECEEILEGIPCHLAIGSMDNIVAVGTMFESDVQ

Query:  CPTIHEIPLGADNVRVMVDLVMGEDIEIPIPQNGEIKTSNQMIGNFVVWPRKLVIMTKEKKPPPSTTSKSITESSKYTDVHVTIKLLNRYAMLSMQVEDM
        CPTIH IPLGADN+RV VD++M ED+ +PIP  GEI+T NQ IGNFV WPRKLVI+T+EKK P    ++S T+SSKYTDVHVTIKLLNRYA+ +MQV+DM
Subjt:  CPTIHEIPLGADNVRVMVDLVMGEDIEIPIPQNGEIKTSNQMIGNFVVWPRKLVIMTKEKKPPPSTTSKSITESSKYTDVHVTIKLLNRYAMLSMQVEDM

Query:  IQIKLSEHIFGKEKTIYLRRDDIMQYCGMAEIGYSCILAYIACLWNACDCEVTKKFMIVDPATISSHVKSQEHRSRNLVDRLEMVSLEKLVLIPYNTG-C
        IQI L+EHIFGKEKTIYLR DDI+QYCGM EIGYSCIL YIACLWNACD E+TK+F++VD ATISSH+KSQE+RSRNL+ RLEM +L++LVLIPYNTG C
Subjt:  IQIKLSEHIFGKEKTIYLRRDDIMQYCGMAEIGYSCILAYIACLWNACDCEVTKKFMIVDPATISSHVKSQEHRSRNLVDRLEMVSLEKLVLIPYNTG-C

Query:  HWILIIIDLQENCVYVMDSLRSKILEDFQGVIN
        HWILI+IDL+ENCVYVMD LR+KIL +FQGVIN
Subjt:  HWILIIIDLQENCVYVMDSLRSKILEDFQGVIN

XP_008451868.1 PREDICTED: uncharacterized protein LOC103493028 isoform X1 [Cucumis melo]6.5e-20672.54Show/hide
Query:  EMSFVVDLSSKHYILQSASKKFRTFKSTLTQMYILPYKDEPSRLQNPPEKYSHIDKKQWESFVNARLSEEWE----------------------------
        +MSFVVDLSSKHYILQSASKKFR+FKSTLTQMYILPYKDEPSRLQ PPEKYSHIDKKQWESFV ARLSEEWE                            
Subjt:  EMSFVVDLSSKHYILQSASKKFRTFKSTLTQMYILPYKDEPSRLQNPPEKYSHIDKKQWESFVNARLSEEWE----------------------------

Query:  ---ELSSDPSNRAILWKEARKRKNNEYSDDATRECAKRIDELAAERKGQDILTEALGTPEHRGRIRGVGEFVSPALYLNVARGKLKLSPESQDEDETQHS
           ELSSDP NRA LWKEARKRKNN   DDATREC KRIDELAA RKGQDILTEALGTPEHRGRIRGVGEFVSPAL++NVARG LKLS +SQD+DETQ S
Subjt:  ---ELSSDPSNRAILWKEARKRKNNEYSDDATRECAKRIDELAAERKGQDILTEALGTPEHRGRIRGVGEFVSPALYLNVARGKLKLSPESQDEDETQHS

Query:  IDEYETQQSQ-----------------KSSVLKKKTKGKKVQKGKKVQKGKKVPKGKMVVKECEEILE------------GIPCHLAIGSMDNIVAVGTM
         DE ETQQS+                 +SSV +KKT      KGKKVQKGKK PKGKMVVKE EE LE            GIPCHLAIGS+DN+VAVG M
Subjt:  IDEYETQQSQ-----------------KSSVLKKKTKGKKVQKGKKVQKGKKVPKGKMVVKECEEILE------------GIPCHLAIGSMDNIVAVGTM

Query:  FESDVQCPTIHEIPLGADNVRVMVDLVMGEDIEIPIPQNGEIKTSNQMIGNFVVWPRKLVIMTKEKKPPPSTTSKSITESSKYTDVHVTIKLLNRYAMLS
        FESDVQCPTIH IPLGA+N+RV VD+ M ED+ +PIP  G+I+T NQ IGNFV WPRKLVI+TKEKK P  T S+S T+SSKYTDVHVTIKLLNRYAM +
Subjt:  FESDVQCPTIHEIPLGADNVRVMVDLVMGEDIEIPIPQNGEIKTSNQMIGNFVVWPRKLVIMTKEKKPPPSTTSKSITESSKYTDVHVTIKLLNRYAMLS

Query:  MQVEDMIQIKLSEHIFGKEKTIYLRRDDIMQYCGMAEIGYSCILAYIACLWNACDCEVTKKFMIVDPATISSHVKSQEHRSRNLVDRLEMVSLEKLVLIP
        MQVED+IQI LSEHIFGKEKTIYLRRDDI+QYCGM EIGYSCIL YIACLWN C+ E+TK+F++VD ATISSH+KSQE+RSRNL++RLEM +L++LVLIP
Subjt:  MQVEDMIQIKLSEHIFGKEKTIYLRRDDIMQYCGMAEIGYSCILAYIACLWNACDCEVTKKFMIVDPATISSHVKSQEHRSRNLVDRLEMVSLEKLVLIP

Query:  YNTG-CHWILIIIDLQENCVYVMDSLRSKILEDFQGVIN
        YNTG CHWILIIIDLQENCVYVMD LRSKIL +FQGVIN
Subjt:  YNTG-CHWILIIIDLQENCVYVMDSLRSKILEDFQGVIN

XP_031740251.1 uncharacterized protein LOC101213947 [Cucumis sativus]5.1e-20371.86Show/hide
Query:  EMSFVVDLSSKHYILQSASKKFRTFKSTLTQMYILPYKDEPSRLQNPPEKYSHIDKKQWESFVNARLSEEWE----------------------------
        +MSFVVDLSSKHYILQSASKKFR+FKSTLTQMYILPYKDEPSRLQ PPEKYSHIDKKQWESFV ARLSEEWE                            
Subjt:  EMSFVVDLSSKHYILQSASKKFRTFKSTLTQMYILPYKDEPSRLQNPPEKYSHIDKKQWESFVNARLSEEWE----------------------------

Query:  ---ELSSDPSNRAILWKEARKRKNNEYSDDATRECAKRIDELAAERKGQDILTEALGTPEHRGRIRGVGEFVSPALYLNVARGKLKLSPESQDEDETQHS
           ELSSDP NRA LWKEARKRKNN   DDAT EC KRIDELAA RKGQDILTEALGTPEHRGRIRGVGEFVSPAL++NVARG LKLS +SQDEDETQ S
Subjt:  ---ELSSDPSNRAILWKEARKRKNNEYSDDATRECAKRIDELAAERKGQDILTEALGTPEHRGRIRGVGEFVSPALYLNVARGKLKLSPESQDEDETQHS

Query:  IDEYETQQSQ-----------------KSSVLKKKTKGKKVQKGKKVQKGKKVPKGK------MVVKECEEILEGIPCHLAIGSMDNIVAVGTMFESDVQ
          E ETQQSQ                 +SSVL+KKTK KKVQKGKKV KGK V K         V++E E IL+GIPCHLAIGS+DN+VA+G MFESDVQ
Subjt:  IDEYETQQSQ-----------------KSSVLKKKTKGKKVQKGKKVQKGKKVPKGK------MVVKECEEILEGIPCHLAIGSMDNIVAVGTMFESDVQ

Query:  CPTIHEIPLGADNVRVMVDLVMGEDIEIPIPQNGEIKTSNQMIGNFVVWPRKLVIMTKEKKPPPSTTSKSITESSKYTDVHVTIKLLNRYAMLSMQVEDM
        CPTIH IPLGADN+RV VD++M ED+ +PIP  GEI+T NQ IGNFV WPRKLVI+T+EKK P    ++S T+SSKYTDVHVTIKLLNRYA+ +MQV+DM
Subjt:  CPTIHEIPLGADNVRVMVDLVMGEDIEIPIPQNGEIKTSNQMIGNFVVWPRKLVIMTKEKKPPPSTTSKSITESSKYTDVHVTIKLLNRYAMLSMQVEDM

Query:  IQIKLSEHIFGKEKTIYLRRDDIMQYCGMAEIGYSCILAYIACLWNACDCEVTKKFMIVDPATISSHVKSQEHRSRNLVDRLEMVSLEKLVLIPYNTG-C
        IQI L+EHIFGKEKTIYLR DDI+QYCGM EIGYSCIL YIACLWNACD E+TK+F++VD ATISSH+KSQE+RSRNL+ RLEM +L++LVLIPYNTG C
Subjt:  IQIKLSEHIFGKEKTIYLRRDDIMQYCGMAEIGYSCILAYIACLWNACDCEVTKKFMIVDPATISSHVKSQEHRSRNLVDRLEMVSLEKLVLIPYNTG-C

Query:  HWILIIIDLQENCVYVMDSLRSKILEDFQGVIN
        HWILI+IDL+ENCVYVMD LR+KIL +FQGVIN
Subjt:  HWILIIIDLQENCVYVMDSLRSKILEDFQGVIN

XP_038895921.1 uncharacterized protein LOC120084092 isoform X1 [Benincasa hispida]3.1e-21678.38Show/hide
Query:  EMSFVVDLSSKHYILQSASKKFRTFKSTLTQMYILPYKDEPSRLQNPPEKYSHIDKKQWESFVNARLSEEWE----------------------------
        +MSFVVDL SKHYILQSASKKFRTFKSTLTQ YILPYKDEPSRLQNPPEKYSHIDKKQWESFV ARLSEEWE                            
Subjt:  EMSFVVDLSSKHYILQSASKKFRTFKSTLTQMYILPYKDEPSRLQNPPEKYSHIDKKQWESFVNARLSEEWE----------------------------

Query:  ---ELSSDPSNRAILWKEARKRKNNEYSDDATRECAKRIDELAAERKGQDILTEALGTPEHRGRIRGVGEFVSPALYLNVARGKLKLSPESQDEDETQHS
           ELSSDP NRA LWKEARKRKNNEYSD ATRECAKRIDELAA RKGQDILTEALGTPEHRGRIRGVGEFVSPAL+ NVA+GKLKL  ESQ+E ETQ S
Subjt:  ---ELSSDPSNRAILWKEARKRKNNEYSDDATRECAKRIDELAAERKGQDILTEALGTPEHRGRIRGVGEFVSPALYLNVARGKLKLSPESQDEDETQHS

Query:  IDEYETQQSQ--------KSSVLKKKTKGKKVQKGKKVQKGKKVPKGKMVVKECEEILEGIPCHLAIGSMDNIVAVGTMFESDVQCPTIHEIPLGADNVR
         D+ ETQQSQ        +SSV++KKTK K+VQKG+ VQK KKVPKGKMVVK+ EEILEGIPCHLAIGS+DNIVAVGTMFESD QCP+I+EIPLG DNVR
Subjt:  IDEYETQQSQ--------KSSVLKKKTKGKKVQKGKKVQKGKKVPKGKMVVKECEEILEGIPCHLAIGSMDNIVAVGTMFESDVQCPTIHEIPLGADNVR

Query:  VMVDLVMGEDIEIPIPQNGEIKTSNQMIGNFVVWPRKLVIMTKEKKPPPSTTSKSITESSKYTDVHVTIKLLNRYAMLSMQVEDMIQIKLSEHIFGKEKT
         MVD+VMGED+ +PIPQ  +IKT +Q IGNFV WPRKLVI TKEKK P  TTSKSI +SSKYTDVHVTIKLLNRYAM SMQV+DMIQI LSE I GKEKT
Subjt:  VMVDLVMGEDIEIPIPQNGEIKTSNQMIGNFVVWPRKLVIMTKEKKPPPSTTSKSITESSKYTDVHVTIKLLNRYAMLSMQVEDMIQIKLSEHIFGKEKT

Query:  IYLRRDDIMQYCGMAEIGYSCILAYIACLWNACDCEVTKKFMIVDPATISSHVKSQEHRSRNLVDRLEMVSLEKLVLIPYNTG-CHWILIIIDLQENCVY
        IYL+RDDI+QYCGMAEIGYSCILAYIACLWNACD E+TKKF+IVD ATISSHVK QE RS+NL++RLEMVSL++LVLIPYNTG CHWILIII+LQENCVY
Subjt:  IYLRRDDIMQYCGMAEIGYSCILAYIACLWNACDCEVTKKFMIVDPATISSHVKSQEHRSRNLVDRLEMVSLEKLVLIPYNTG-CHWILIIIDLQENCVY

Query:  VMDSLRSKILEDFQGVIN
        VMDSLRSKILE+FQGVIN
Subjt:  VMDSLRSKILEDFQGVIN

XP_038895930.1 uncharacterized protein LOC120084092 isoform X2 [Benincasa hispida]1.3e-21778.53Show/hide
Query:  EMSFVVDLSSKHYILQSASKKFRTFKSTLTQMYILPYKDEPSRLQNPPEKYSHIDKKQWESFVNARLSEEWE----------------------------
        +MSFVVDL SKHYILQSASKKFRTFKSTLTQ YILPYKDEPSRLQNPPEKYSHIDKKQWESFV ARLSEEWE                            
Subjt:  EMSFVVDLSSKHYILQSASKKFRTFKSTLTQMYILPYKDEPSRLQNPPEKYSHIDKKQWESFVNARLSEEWE----------------------------

Query:  ---ELSSDPSNRAILWKEARKRKNNEYSDDATRECAKRIDELAAERKGQDILTEALGTPEHRGRIRGVGEFVSPALYLNVARGKLKLSPESQDEDETQHS
           ELSSDP NRA LWKEARKRKNNEYSD ATRECAKRIDELAA RKGQDILTEALGTPEHRGRIRGVGEFVSPAL+ NVA+GKLKL  ESQ+E ETQ S
Subjt:  ---ELSSDPSNRAILWKEARKRKNNEYSDDATRECAKRIDELAAERKGQDILTEALGTPEHRGRIRGVGEFVSPALYLNVARGKLKLSPESQDEDETQHS

Query:  IDEYETQQSQ--------KSSVLKKKTKGKKVQKGKKVQKGKKVPKGKMVVKECEEILEGIPCHLAIGSMDNIVAVGTMFESDVQCPTIHEIPLGADNVR
         D+ ETQQSQ        +SSV++KKTK K+VQKG+ VQK KKVPKGKMVVK+ EEILEGIPCHLAIGS+DNIVAVGTMFESD QCP+I+EIPLG DNVR
Subjt:  IDEYETQQSQ--------KSSVLKKKTKGKKVQKGKKVQKGKKVPKGKMVVKECEEILEGIPCHLAIGSMDNIVAVGTMFESDVQCPTIHEIPLGADNVR

Query:  VMVDLVMGEDIEIPIPQNGEIKTSNQMIGNFVVWPRKLVIMTKEKKPPPSTTSKSITESSKYTDVHVTIKLLNRYAMLSMQVEDMIQIKLSEHIFGKEKT
         MVD+VMGED+ +PIPQ  +IKT +Q IGNFV WPRKLVI TKEKK P  TTSKSI +SSKYTDVHVTIKLLNRYAM SMQV+DMIQI LSE I GKEKT
Subjt:  VMVDLVMGEDIEIPIPQNGEIKTSNQMIGNFVVWPRKLVIMTKEKKPPPSTTSKSITESSKYTDVHVTIKLLNRYAMLSMQVEDMIQIKLSEHIFGKEKT

Query:  IYLRRDDIMQYCGMAEIGYSCILAYIACLWNACDCEVTKKFMIVDPATISSHVKSQEHRSRNLVDRLEMVSLEKLVLIPYNTGCHWILIIIDLQENCVYV
        IYL+RDDI+QYCGMAEIGYSCILAYIACLWNACD E+TKKF+IVD ATISSHVK QE RS+NL++RLEMVSL++LVLIPYNTGCHWILIII+LQENCVYV
Subjt:  IYLRRDDIMQYCGMAEIGYSCILAYIACLWNACDCEVTKKFMIVDPATISSHVKSQEHRSRNLVDRLEMVSLEKLVLIPYNTGCHWILIIIDLQENCVYV

Query:  MDSLRSKILEDFQGVIN
        MDSLRSKILE+FQGVIN
Subjt:  MDSLRSKILEDFQGVIN

TrEMBL top hitse value%identityAlignment
A0A1S3BRX5 uncharacterized protein LOC103493028 isoform X13.1e-20672.54Show/hide
Query:  EMSFVVDLSSKHYILQSASKKFRTFKSTLTQMYILPYKDEPSRLQNPPEKYSHIDKKQWESFVNARLSEEWE----------------------------
        +MSFVVDLSSKHYILQSASKKFR+FKSTLTQMYILPYKDEPSRLQ PPEKYSHIDKKQWESFV ARLSEEWE                            
Subjt:  EMSFVVDLSSKHYILQSASKKFRTFKSTLTQMYILPYKDEPSRLQNPPEKYSHIDKKQWESFVNARLSEEWE----------------------------

Query:  ---ELSSDPSNRAILWKEARKRKNNEYSDDATRECAKRIDELAAERKGQDILTEALGTPEHRGRIRGVGEFVSPALYLNVARGKLKLSPESQDEDETQHS
           ELSSDP NRA LWKEARKRKNN   DDATREC KRIDELAA RKGQDILTEALGTPEHRGRIRGVGEFVSPAL++NVARG LKLS +SQD+DETQ S
Subjt:  ---ELSSDPSNRAILWKEARKRKNNEYSDDATRECAKRIDELAAERKGQDILTEALGTPEHRGRIRGVGEFVSPALYLNVARGKLKLSPESQDEDETQHS

Query:  IDEYETQQSQ-----------------KSSVLKKKTKGKKVQKGKKVQKGKKVPKGKMVVKECEEILE------------GIPCHLAIGSMDNIVAVGTM
         DE ETQQS+                 +SSV +KKT      KGKKVQKGKK PKGKMVVKE EE LE            GIPCHLAIGS+DN+VAVG M
Subjt:  IDEYETQQSQ-----------------KSSVLKKKTKGKKVQKGKKVQKGKKVPKGKMVVKECEEILE------------GIPCHLAIGSMDNIVAVGTM

Query:  FESDVQCPTIHEIPLGADNVRVMVDLVMGEDIEIPIPQNGEIKTSNQMIGNFVVWPRKLVIMTKEKKPPPSTTSKSITESSKYTDVHVTIKLLNRYAMLS
        FESDVQCPTIH IPLGA+N+RV VD+ M ED+ +PIP  G+I+T NQ IGNFV WPRKLVI+TKEKK P  T S+S T+SSKYTDVHVTIKLLNRYAM +
Subjt:  FESDVQCPTIHEIPLGADNVRVMVDLVMGEDIEIPIPQNGEIKTSNQMIGNFVVWPRKLVIMTKEKKPPPSTTSKSITESSKYTDVHVTIKLLNRYAMLS

Query:  MQVEDMIQIKLSEHIFGKEKTIYLRRDDIMQYCGMAEIGYSCILAYIACLWNACDCEVTKKFMIVDPATISSHVKSQEHRSRNLVDRLEMVSLEKLVLIP
        MQVED+IQI LSEHIFGKEKTIYLRRDDI+QYCGM EIGYSCIL YIACLWN C+ E+TK+F++VD ATISSH+KSQE+RSRNL++RLEM +L++LVLIP
Subjt:  MQVEDMIQIKLSEHIFGKEKTIYLRRDDIMQYCGMAEIGYSCILAYIACLWNACDCEVTKKFMIVDPATISSHVKSQEHRSRNLVDRLEMVSLEKLVLIP

Query:  YNTG-CHWILIIIDLQENCVYVMDSLRSKILEDFQGVIN
        YNTG CHWILIIIDLQENCVYVMD LRSKIL +FQGVIN
Subjt:  YNTG-CHWILIIIDLQENCVYVMDSLRSKILEDFQGVIN

A0A1S4DZN2 uncharacterized protein LOC103493028 isoform X26.0e-18970.98Show/hide
Query:  EMSFVVDLSSKHYILQSASKKFRTFKSTLTQMYILPYKDEPSRLQNPPEKYSHIDKKQWESFVNARLSEEWE----------------------------
        +MSFVVDLSSKHYILQSASKKFR+FKSTLTQMYILPYKDEPSRLQ PPEKYSHIDKKQWESFV ARLSEEWE                            
Subjt:  EMSFVVDLSSKHYILQSASKKFRTFKSTLTQMYILPYKDEPSRLQNPPEKYSHIDKKQWESFVNARLSEEWE----------------------------

Query:  ---ELSSDPSNRAILWKEARKRKNNEYSDDATRECAKRIDELAAERKGQDILTEALGTPEHRGRIRGVGEFVSPALYLNVARGKLKLSPESQDEDETQHS
           ELSSDP NRA LWKEARKRKNN   DDATREC KRIDELAA RKGQDILTEALGTPEHRGRIRGVGEFVSPAL++NVARG LKLS +SQD+DETQ S
Subjt:  ---ELSSDPSNRAILWKEARKRKNNEYSDDATRECAKRIDELAAERKGQDILTEALGTPEHRGRIRGVGEFVSPALYLNVARGKLKLSPESQDEDETQHS

Query:  IDEYETQQSQ-----------------KSSVLKKKTKGKKVQKGKKVQKGKKVPKGKMVVKECEEILE------------GIPCHLAIGSMDNIVAVGTM
         DE ETQQS+                 +SSV +KKT      KGKKVQKGKK PKGKMVVKE EE LE            GIPCHLAIGS+DN+VAVG M
Subjt:  IDEYETQQSQ-----------------KSSVLKKKTKGKKVQKGKKVQKGKKVPKGKMVVKECEEILE------------GIPCHLAIGSMDNIVAVGTM

Query:  FESDVQCPTIHEIPLGADNVRVMVDLVMGEDIEIPIPQNGEIKTSNQMIGNFVVWPRKLVIMTKEKKPPPSTTSKSITESSKYTDVHVTIKLLNRYAMLS
        FESDVQCPTIH IPLGA+N+RV VD+ M ED+ +PIP  G+I+T NQ IGNFV WPRKLVI+TKEKK P  T S+S T+SSKYTDVHVTIKLLNRYAM +
Subjt:  FESDVQCPTIHEIPLGADNVRVMVDLVMGEDIEIPIPQNGEIKTSNQMIGNFVVWPRKLVIMTKEKKPPPSTTSKSITESSKYTDVHVTIKLLNRYAMLS

Query:  MQVEDMIQIKLSEHIFGKEKTIYLRRDDIMQYCGMAEIGYSCILAYIACLWNACDCEVTKKFMIVDPATISSHVKSQEHRSRNLVDRLEMVSLEKLVLIP
        MQVED+IQI LSEHIFGKEKTIYLRRDDI+QYCGM EIGYSCIL YIACLWN C+ E+TK+F++VD ATISSH+KSQE+RSRNL++RLEM +L++LVLIP
Subjt:  MQVEDMIQIKLSEHIFGKEKTIYLRRDDIMQYCGMAEIGYSCILAYIACLWNACDCEVTKKFMIVDPATISSHVKSQEHRSRNLVDRLEMVSLEKLVLIP

Query:  YNTG----CH
        YNTG    CH
Subjt:  YNTG----CH

A0A5D3CYL9 ULP_PROTEASE domain-containing protein3.1e-20672.54Show/hide
Query:  EMSFVVDLSSKHYILQSASKKFRTFKSTLTQMYILPYKDEPSRLQNPPEKYSHIDKKQWESFVNARLSEEWE----------------------------
        +MSFVVDLSSKHYILQSASKKFR+FKSTLTQMYILPYKDEPSRLQ PPEKYSHIDKKQWESFV ARLSEEWE                            
Subjt:  EMSFVVDLSSKHYILQSASKKFRTFKSTLTQMYILPYKDEPSRLQNPPEKYSHIDKKQWESFVNARLSEEWE----------------------------

Query:  ---ELSSDPSNRAILWKEARKRKNNEYSDDATRECAKRIDELAAERKGQDILTEALGTPEHRGRIRGVGEFVSPALYLNVARGKLKLSPESQDEDETQHS
           ELSSDP NRA LWKEARKRKNN   DDATREC KRIDELAA RKGQDILTEALGTPEHRGRIRGVGEFVSPAL++NVARG LKLS +SQD+DETQ S
Subjt:  ---ELSSDPSNRAILWKEARKRKNNEYSDDATRECAKRIDELAAERKGQDILTEALGTPEHRGRIRGVGEFVSPALYLNVARGKLKLSPESQDEDETQHS

Query:  IDEYETQQSQ-----------------KSSVLKKKTKGKKVQKGKKVQKGKKVPKGKMVVKECEEILE------------GIPCHLAIGSMDNIVAVGTM
         DE ETQQS+                 +SSV +KKT      KGKKVQKGKK PKGKMVVKE EE LE            GIPCHLAIGS+DN+VAVG M
Subjt:  IDEYETQQSQ-----------------KSSVLKKKTKGKKVQKGKKVQKGKKVPKGKMVVKECEEILE------------GIPCHLAIGSMDNIVAVGTM

Query:  FESDVQCPTIHEIPLGADNVRVMVDLVMGEDIEIPIPQNGEIKTSNQMIGNFVVWPRKLVIMTKEKKPPPSTTSKSITESSKYTDVHVTIKLLNRYAMLS
        FESDVQCPTIH IPLGA+N+RV VD+ M ED+ +PIP  G+I+T NQ IGNFV WPRKLVI+TKEKK P  T S+S T+SSKYTDVHVTIKLLNRYAM +
Subjt:  FESDVQCPTIHEIPLGADNVRVMVDLVMGEDIEIPIPQNGEIKTSNQMIGNFVVWPRKLVIMTKEKKPPPSTTSKSITESSKYTDVHVTIKLLNRYAMLS

Query:  MQVEDMIQIKLSEHIFGKEKTIYLRRDDIMQYCGMAEIGYSCILAYIACLWNACDCEVTKKFMIVDPATISSHVKSQEHRSRNLVDRLEMVSLEKLVLIP
        MQVED+IQI LSEHIFGKEKTIYLRRDDI+QYCGM EIGYSCIL YIACLWN C+ E+TK+F++VD ATISSH+KSQE+RSRNL++RLEM +L++LVLIP
Subjt:  MQVEDMIQIKLSEHIFGKEKTIYLRRDDIMQYCGMAEIGYSCILAYIACLWNACDCEVTKKFMIVDPATISSHVKSQEHRSRNLVDRLEMVSLEKLVLIP

Query:  YNTG-CHWILIIIDLQENCVYVMDSLRSKILEDFQGVIN
        YNTG CHWILIIIDLQENCVYVMD LRSKIL +FQGVIN
Subjt:  YNTG-CHWILIIIDLQENCVYVMDSLRSKILEDFQGVIN

A0A6J1C2H7 uncharacterized protein LOC111007859 isoform X12.5e-16362.08Show/hide
Query:  EMSFVVDLSSKHYILQSASKKFRTFKSTLTQMYILPYKDEPSRLQNPPEKYSHIDKKQWESFVNARLSEEWE----------------------------
        E SFV+D  SKH+ILQSASKKFRTFKSTLT+ YILP+KDEP  LQNPPEKY HID++QW SFVNARLSEEWE                            
Subjt:  EMSFVVDLSSKHYILQSASKKFRTFKSTLTQMYILPYKDEPSRLQNPPEKYSHIDKKQWESFVNARLSEEWE----------------------------

Query:  ---ELSSDPSNRAILWKEARKRKNNEYSDDATRECAKRIDELAAERKGQDILTEALGTPEHRGRIRGVGEFVSPALYLNVARGKLKLSPESQDEDETQHS
           +LSSDPSNRAILWKEARK KNNEY DDATRECA RIDELAA  KG+DILTEALGT EH GR+RGVGEFVSP+LY NV +GK K + E Q    T   
Subjt:  ---ELSSDPSNRAILWKEARKRKNNEYSDDATRECAKRIDELAAERKGQDILTEALGTPEHRGRIRGVGEFVSPALYLNVARGKLKLSPESQDEDETQHS

Query:  IDEYETQQSQKSSVLKKKTKGKKVQKGKKVQKGKKVPKGKMVVKECEEILEGIPCHLAIGSMDNIVAVGTMFESDVQCPTIHEIPLGADNVRVMVDLVMG
                ++ S+  KKK+KGK++           V   + +    E+ +EG PCHLA+ S+DNIVAVGT+F+++VQCPT+H +PLG DNVRVMVD+V+ 
Subjt:  IDEYETQQSQKSSVLKKKTKGKKVQKGKKVQKGKKVPKGKMVVKECEEILEGIPCHLAIGSMDNIVAVGTMFESDVQCPTIHEIPLGADNVRVMVDLVMG

Query:  EDIEIPIPQNGEIKTSNQMIGNFVVWPRKLVIMTKEKKPPPSTTSKSITESSKYTDVHVTIKLLNRYAMLSMQVEDMIQIKLSEHIFGKEKTIYLRRDDI
        E   IPIP  GEI+T NQ IG FV WPR+LVI+++EK    S TS++ T+ SK+TDVHV+IKLLNRY MLSMQ ED ++I LS+ IFGKEK IYL R+DI
Subjt:  EDIEIPIPQNGEIKTSNQMIGNFVVWPRKLVIMTKEKKPPPSTTSKSITESSKYTDVHVTIKLLNRYAMLSMQVEDMIQIKLSEHIFGKEKTIYLRRDDI

Query:  MQYCGMAEIGYSCILAYIACLWNACDCEVTKKFMIVDPATISSHVKSQEHRSRNLVDRLEMVSLEKLVLIPYNTGCHWILIIIDLQENCVYVMDSLRSKI
        MQYC M EIGYSCIL YIA LW+  + E+TKKF+IVDPATIS +VKSQE+R RNL +RLEMV+LE+LVLIPY +GCHW+LIII+L+ENCVYV+DSLR KI
Subjt:  MQYCGMAEIGYSCILAYIACLWNACDCEVTKKFMIVDPATISSHVKSQEHRSRNLVDRLEMVSLEKLVLIPYNTGCHWILIIIDLQENCVYVMDSLRSKI

Query:  LEDFQGVIN
         ED+Q VIN
Subjt:  LEDFQGVIN

A0A6J1C2V2 uncharacterized protein LOC111007859 isoform X42.5e-16362.08Show/hide
Query:  EMSFVVDLSSKHYILQSASKKFRTFKSTLTQMYILPYKDEPSRLQNPPEKYSHIDKKQWESFVNARLSEEWE----------------------------
        E SFV+D  SKH+ILQSASKKFRTFKSTLT+ YILP+KDEP  LQNPPEKY HID++QW SFVNARLSEEWE                            
Subjt:  EMSFVVDLSSKHYILQSASKKFRTFKSTLTQMYILPYKDEPSRLQNPPEKYSHIDKKQWESFVNARLSEEWE----------------------------

Query:  ---ELSSDPSNRAILWKEARKRKNNEYSDDATRECAKRIDELAAERKGQDILTEALGTPEHRGRIRGVGEFVSPALYLNVARGKLKLSPESQDEDETQHS
           +LSSDPSNRAILWKEARK KNNEY DDATRECA RIDELAA  KG+DILTEALGT EH GR+RGVGEFVSP+LY NV +GK K + E Q    T   
Subjt:  ---ELSSDPSNRAILWKEARKRKNNEYSDDATRECAKRIDELAAERKGQDILTEALGTPEHRGRIRGVGEFVSPALYLNVARGKLKLSPESQDEDETQHS

Query:  IDEYETQQSQKSSVLKKKTKGKKVQKGKKVQKGKKVPKGKMVVKECEEILEGIPCHLAIGSMDNIVAVGTMFESDVQCPTIHEIPLGADNVRVMVDLVMG
                ++ S+  KKK+KGK++           V   + +    E+ +EG PCHLA+ S+DNIVAVGT+F+++VQCPT+H +PLG DNVRVMVD+V+ 
Subjt:  IDEYETQQSQKSSVLKKKTKGKKVQKGKKVQKGKKVPKGKMVVKECEEILEGIPCHLAIGSMDNIVAVGTMFESDVQCPTIHEIPLGADNVRVMVDLVMG

Query:  EDIEIPIPQNGEIKTSNQMIGNFVVWPRKLVIMTKEKKPPPSTTSKSITESSKYTDVHVTIKLLNRYAMLSMQVEDMIQIKLSEHIFGKEKTIYLRRDDI
        E   IPIP  GEI+T NQ IG FV WPR+LVI+++EK    S TS++ T+ SK+TDVHV+IKLLNRY MLSMQ ED ++I LS+ IFGKEK IYL R+DI
Subjt:  EDIEIPIPQNGEIKTSNQMIGNFVVWPRKLVIMTKEKKPPPSTTSKSITESSKYTDVHVTIKLLNRYAMLSMQVEDMIQIKLSEHIFGKEKTIYLRRDDI

Query:  MQYCGMAEIGYSCILAYIACLWNACDCEVTKKFMIVDPATISSHVKSQEHRSRNLVDRLEMVSLEKLVLIPYNTGCHWILIIIDLQENCVYVMDSLRSKI
        MQYC M EIGYSCIL YIA LW+  + E+TKKF+IVDPATIS +VKSQE+R RNL +RLEMV+LE+LVLIPY +GCHW+LIII+L+ENCVYV+DSLR KI
Subjt:  MQYCGMAEIGYSCILAYIACLWNACDCEVTKKFMIVDPATISSHVKSQEHRSRNLVDRLEMVSLEKLVLIPYNTGCHWILIIIDLQENCVYVMDSLRSKI

Query:  LEDFQGVIN
         ED+Q VIN
Subjt:  LEDFQGVIN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G13050.1 unknown protein1.6e-1325.91Show/hide
Query:  KEQKSPPDVDGPIKSPISDRPMPPKLISDAKRRTHPLVWLAAVLCTVVSIAVIIGGIVIFIGYLVIHPRIPTISVLDAHLDNFQNDIAGRLEVQLTIVVE
        +EQ+ P    G   +P    P  P       +RT P+   A + C ++ I +I+ G+++ + YL   PR P   +  A L+    D+   L   L +VV 
Subjt:  KEQKSPPDVDGPIKSPISDRPMPPKLISDAKRRTHPLVWLAAVLCTVVSIAVIIGGIVIFIGYLVIHPRIPTISVLDAHLDNFQNDIAGRLEVQLTIVVE

Query:  AENDNAKAHASFSDSSFFLHFLGIKIAQLVADPFEVRKNSSLKFHYAVVSNSIPLNPEQMEKVDLDLKADLSRFDLVGNTRVQWRVGLLGSMKYECHLHC
          N + K+   FS   F L+F    IA    +PF V K  S+   + +VS+ + +   Q + + L L       +L G    +  +G L    Y  H  C
Subjt:  AENDNAKAHASFSDSSFFLHFLGIKIAQLVADPFEVRKNSSLKFHYAVVSNSIPLNPEQMEKVDLDLKADLSRFDLVGNTRVQWRVGLLGSMKYECHLHC

Query:  DLKFH-PSNGTYLRSPCSSR
         +  + P  GT     C+++
Subjt:  DLKFH-PSNGTYLRSPCSSR

AT1G13050.2 unknown protein8.9e-1226.11Show/hide
Query:  AAVLCTVVSIAVIIGGIVIFIGYLVIHPRIPTISVLDAHLDNFQNDIAGRLEVQLTIVVEAENDNAKAHASFSDSSFFLHFLGIKIAQLVADPFEVRKNS
        A + C ++ I +I+ G+++ + YL   PR P   +  A L+    D+   L   L +VV   N + K+   FS   F L+F    IA    +PF V K  
Subjt:  AAVLCTVVSIAVIIGGIVIFIGYLVIHPRIPTISVLDAHLDNFQNDIAGRLEVQLTIVVEAENDNAKAHASFSDSSFFLHFLGIKIAQLVADPFEVRKNS

Query:  SLKFHYAVVSNSIPLNPEQMEKVDLDLKADLSRFDLVGNTRVQWRVGLLGSMKYECHLHCDLKFH-PSNGTYLRSPCSSR
        S+   + +VS+ + +   Q + + L L       +L G    +  +G L    Y  H  C +  + P  GT     C+++
Subjt:  SLKFHYAVVSNSIPLNPEQMEKVDLDLKADLSRFDLVGNTRVQWRVGLLGSMKYECHLHCDLKFH-PSNGTYLRSPCSSR

AT3G26350.1 LOCATED IN: chloroplast7.1e-1727.31Show/hide
Query:  PIKSPISDRPMP-------PKLISDAKRRTHPLVWLAAVLCTVVSIAVIIGGIVIFIGYLVIHPRIPTISVLDAHLDNFQNDIAGRLEVQLTIVVEAEND
        P  SP    P P       P+L     R T+ + W AA  C +  + +I+GG++I I YLV  PR P + +  A+L+    D+   L   LTI+    N 
Subjt:  PIKSPISDRPMP-------PKLISDAKRRTHPLVWLAAVLCTVVSIAVIIGGIVIFIGYLVIHPRIPTISVLDAHLDNFQNDIAGRLEVQLTIVVEAEND

Query:  NAKAHASFSDSSFFLHFLGIKIAQLVADPFEVRKNSSLKFHYAVVSNSIPLNPEQMEKVDLDLKADLSRFDLVGNTRVQWRVGLLGSMKYECHLHCDLKF
        + K+   FS  +F L++    IA    +PF+V K +S+  +  +VS+ + L   Q  ++   ++      +L G    +  +G L    Y+ H HC +  
Subjt:  NAKAHASFSDSSFFLHFLGIKIAQLVADPFEVRKNSSLKFHYAVVSNSIPLNPEQMEKVDLDLKADLSRFDLVGNTRVQWRVGLLGSMKYECHLHCDLKF

Query:  H-PSNGTYLRSPCSSR
        + P  G      C+++
Subjt:  H-PSNGTYLRSPCSSR

AT4G26490.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family1.1e-0924.47Show/hide
Query:  RTHPLVWLAAVLCTVVSIAVIIGGIVIFIGYLVIHPRIPTISVLDAHLDNFQNDIAGRLEVQLTIVVEAENDNAKAHASFSDSSFFLHFLGIKIAQLVAD
        RT   +W  A  C V S+ +I   I   I +L I PRIP   + +A+L     D        L+++V   N N K    F      L F    IA  V  
Subjt:  RTHPLVWLAAVLCTVVSIAVIIGGIVIFIGYLVIHPRIPTISVLDAHLDNFQNDIAGRLEVQLTIVVEAENDNAKAHASFSDSSFFLHFLGIKIAQLVAD

Query:  PFEVRKNSSLKFHYAVVSNSIPLNPEQMEKVDLDLKADLSRFDLVGNTRVQWRVGLLGSMKYECHLHCDLKF-HPSNGTYLRSPCSSR
        PF  +K+ +      ++S+ + L      ++   L+ +   +++ G  +V+   G++    Y+ H  C L+   P  G  +   C+++
Subjt:  PFEVRKNSSLKFHYAVVSNSIPLNPEQMEKVDLDLKADLSRFDLVGNTRVQWRVGLLGSMKYECHLHCDLKF-HPSNGTYLRSPCSSR

AT5G45320.1 FUNCTIONS IN: molecular_function unknown4.2e-5452.53Show/hide
Query:  PKLISDAKRRTHPLVWLAAVLCTVVSIAVIIGGIVIFIGYLVIHPRIPTISVLDAHLDNFQNDIAGRLEVQLTIVVEAENDNAKAHASFSDSSFFLHFLG
        P+L S  +  T P +W AA++C ++SI VI+GGI++F+GYLVIHPR+P ISV DAHLD  + DI G L+ QLTIV+  ENDNAKAHA F ++ F L + G
Subjt:  PKLISDAKRRTHPLVWLAAVLCTVVSIAVIIGGIVIFIGYLVIHPRIPTISVLDAHLDNFQNDIAGRLEVQLTIVVEAENDNAKAHASFSDSSFFLHFLG

Query:  IKIAQLVADPFEVRKNSSLKFHYAVVSNSIPLNPEQMEKVDLDLKADLSRFDLVGNTRVQWRVGLLGSMKYECHLHCDLKFHPSNGTYLRSPCSSRAK
          IA L A  FEV K  S+   Y V S  IPLNP  M+ VD  +K D+  F+L G +R +WRVG LGS+K+EC+L C L+F PS+ +Y+ SPC+S  K
Subjt:  IKIAQLVADPFEVRKNSSLKFHYAVVSNSIPLNPEQMEKVDLDLKADLSRFDLVGNTRVQWRVGLLGSMKYECHLHCDLKFHPSNGTYLRSPCSSRAK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGATTCCGATAACAGCAGCCAAGATGAGAGAAATGTTCTTATTCGTTATGAGATGTCGTTCGTGGTGGACCTTAGTTCCAAACATTATATTCTTCAATCTGCATC
AAAGAAGTTTCGAACGTTTAAGTCCACATTAACTCAGATGTACATACTTCCATATAAAGATGAACCATCTCGCTTGCAGAATCCTCCTGAAAAATATTCACATATTGATA
AGAAACAATGGGAGTCATTCGTTAATGCAAGACTAAGTGAAGAGTGGGAGGAATTGTCGAGTGATCCTTCCAATCGAGCAATTTTATGGAAAGAAGCACGAAAAAGAAAG
AATAATGAATACTCTGATGATGCCACCAGAGAATGTGCTAAGCGAATTGATGAATTAGCTGCGGAACGTAAGGGACAAGATATTTTGACTGAAGCATTAGGCACGCCAGA
ACATAGAGGGCGTATAAGGGGAGTGGGCGAGTTTGTTTCACCAGCCCTATATCTCAATGTTGCTAGAGGAAAGTTGAAGTTGAGTCCAGAATCTCAAGACGAAGATGAGA
CTCAACATTCTATAGACGAATATGAGACTCAACAATCTCAGAAGAGTAGTGTCTTAAAGAAGAAGACAAAAGGAAAAAAGGTTCAAAAAGGAAAAAAGGTTCAGAAAGGA
AAAAAGGTTCCAAAAGGAAAAATGGTTGTCAAAGAGTGTGAAGAAATTTTGGAGGGAATACCATGTCACTTAGCTATAGGATCAATGGATAACATTGTTGCTGTAGGCAC
AATGTTTGAGTCTGATGTTCAATGTCCAACCATTCATGAAATTCCACTAGGAGCCGATAATGTTAGAGTGATGGTCGATCTTGTAATGGGCGAAGACATTGAGATACCAA
TTCCTCAAAATGGTGAAATAAAGACTTCTAATCAAATGATTGGCAACTTTGTGGTATGGCCTCGCAAGCTTGTAATTATGACTAAGGAGAAAAAGCCTCCTCCTTCGACT
ACGTCTAAGAGTATTACAGAATCTTCCAAATATACGGATGTCCACGTGACCATTAAACTCCTAAATAGATATGCTATGCTCTCCATGCAAGTGGAGGATATGATTCAAAT
CAAGTTGAGTGAGCATATCTTTGGAAAGGAGAAGACAATTTACTTGCGACGTGATGACATCATGCAATACTGTGGCATGGCTGAAATTGGTTATTCATGTATACTTGCAT
ACATTGCGTGTCTTTGGAATGCATGTGACTGTGAGGTAACAAAAAAGTTTATGATAGTTGATCCAGCAACCATTTCATCACATGTAAAGTCTCAAGAACATCGTTCTAGA
AATCTAGTCGACAGGTTAGAAATGGTAAGCTTGGAAAAACTAGTACTCATCCCGTATAACACTGGTTGCCATTGGATATTGATTATTATCGATCTTCAAGAAAATTGTGT
TTATGTCATGGACTCCTTGCGAAGTAAGATTCTAGAAGATTTTCAAGGAGTCATAAATATTTTAACACGAGACATGCATATGAACAAGAAGAAATCGATACTGTTCGGTT
GGAATGGGCAGCTTTTGTTGGACAATTCGTGTAGAGCTACTAAAAGATTTGTTAATTTTCTATATCTTAGAAACAAACGATCAAGGAACTACGCCCCCACAATTAAATCA
AATTATCTCACAAATTTCAAAGAACAAAAATCGCCGCCGGATGTTGACGGCCCCATCAAATCCCCAATCTCCGACCGCCCAATGCCACCGAAACTAATCTCCGACGCAAA
GCGCCGCACTCATCCCCTCGTTTGGCTCGCCGCCGTGCTCTGTACTGTTGTTTCAATCGCCGTCATAATCGGAGGTATCGTCATCTTCATCGGTTACCTAGTAATCCACC
CCAGGATTCCGACGATTAGCGTTCTCGACGCTCATCTCGATAACTTCCAGAACGACATCGCCGGCCGCCTCGAAGTTCAGTTAACGATCGTCGTCGAGGCGGAGAATGAT
AACGCCAAAGCTCACGCCAGTTTCTCCGATTCCAGTTTCTTCCTTCATTTCTTAGGGATCAAGATCGCGCAACTTGTGGCAGATCCATTCGAAGTTCGGAAGAACAGTTC
TCTGAAATTCCATTACGCTGTCGTTTCGAATTCGATTCCTCTGAATCCAGAGCAAATGGAGAAAGTCGATTTGGATTTGAAGGCAGATCTGAGTCGATTTGATTTGGTAG
GGAATACGAGAGTTCAATGGCGAGTTGGATTGCTTGGATCTATGAAGTATGAGTGTCATCTTCATTGCGACCTTAAATTTCATCCTTCTAATGGAACTTACTTGAGATCG
CCTTGCAGTTCCAGAGCTAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGATGATTCCGATAACAGCAGCCAAGATGAGAGAAATGTTCTTATTCGTTATGAGATGTCGTTCGTGGTGGACCTTAGTTCCAAACATTATATTCTTCAATCTGCATC
AAAGAAGTTTCGAACGTTTAAGTCCACATTAACTCAGATGTACATACTTCCATATAAAGATGAACCATCTCGCTTGCAGAATCCTCCTGAAAAATATTCACATATTGATA
AGAAACAATGGGAGTCATTCGTTAATGCAAGACTAAGTGAAGAGTGGGAGGAATTGTCGAGTGATCCTTCCAATCGAGCAATTTTATGGAAAGAAGCACGAAAAAGAAAG
AATAATGAATACTCTGATGATGCCACCAGAGAATGTGCTAAGCGAATTGATGAATTAGCTGCGGAACGTAAGGGACAAGATATTTTGACTGAAGCATTAGGCACGCCAGA
ACATAGAGGGCGTATAAGGGGAGTGGGCGAGTTTGTTTCACCAGCCCTATATCTCAATGTTGCTAGAGGAAAGTTGAAGTTGAGTCCAGAATCTCAAGACGAAGATGAGA
CTCAACATTCTATAGACGAATATGAGACTCAACAATCTCAGAAGAGTAGTGTCTTAAAGAAGAAGACAAAAGGAAAAAAGGTTCAAAAAGGAAAAAAGGTTCAGAAAGGA
AAAAAGGTTCCAAAAGGAAAAATGGTTGTCAAAGAGTGTGAAGAAATTTTGGAGGGAATACCATGTCACTTAGCTATAGGATCAATGGATAACATTGTTGCTGTAGGCAC
AATGTTTGAGTCTGATGTTCAATGTCCAACCATTCATGAAATTCCACTAGGAGCCGATAATGTTAGAGTGATGGTCGATCTTGTAATGGGCGAAGACATTGAGATACCAA
TTCCTCAAAATGGTGAAATAAAGACTTCTAATCAAATGATTGGCAACTTTGTGGTATGGCCTCGCAAGCTTGTAATTATGACTAAGGAGAAAAAGCCTCCTCCTTCGACT
ACGTCTAAGAGTATTACAGAATCTTCCAAATATACGGATGTCCACGTGACCATTAAACTCCTAAATAGATATGCTATGCTCTCCATGCAAGTGGAGGATATGATTCAAAT
CAAGTTGAGTGAGCATATCTTTGGAAAGGAGAAGACAATTTACTTGCGACGTGATGACATCATGCAATACTGTGGCATGGCTGAAATTGGTTATTCATGTATACTTGCAT
ACATTGCGTGTCTTTGGAATGCATGTGACTGTGAGGTAACAAAAAAGTTTATGATAGTTGATCCAGCAACCATTTCATCACATGTAAAGTCTCAAGAACATCGTTCTAGA
AATCTAGTCGACAGGTTAGAAATGGTAAGCTTGGAAAAACTAGTACTCATCCCGTATAACACTGGTTGCCATTGGATATTGATTATTATCGATCTTCAAGAAAATTGTGT
TTATGTCATGGACTCCTTGCGAAGTAAGATTCTAGAAGATTTTCAAGGAGTCATAAATATTTTAACACGAGACATGCATATGAACAAGAAGAAATCGATACTGTTCGGTT
GGAATGGGCAGCTTTTGTTGGACAATTCGTGTAGAGCTACTAAAAGATTTGTTAATTTTCTATATCTTAGAAACAAACGATCAAGGAACTACGCCCCCACAATTAAATCA
AATTATCTCACAAATTTCAAAGAACAAAAATCGCCGCCGGATGTTGACGGCCCCATCAAATCCCCAATCTCCGACCGCCCAATGCCACCGAAACTAATCTCCGACGCAAA
GCGCCGCACTCATCCCCTCGTTTGGCTCGCCGCCGTGCTCTGTACTGTTGTTTCAATCGCCGTCATAATCGGAGGTATCGTCATCTTCATCGGTTACCTAGTAATCCACC
CCAGGATTCCGACGATTAGCGTTCTCGACGCTCATCTCGATAACTTCCAGAACGACATCGCCGGCCGCCTCGAAGTTCAGTTAACGATCGTCGTCGAGGCGGAGAATGAT
AACGCCAAAGCTCACGCCAGTTTCTCCGATTCCAGTTTCTTCCTTCATTTCTTAGGGATCAAGATCGCGCAACTTGTGGCAGATCCATTCGAAGTTCGGAAGAACAGTTC
TCTGAAATTCCATTACGCTGTCGTTTCGAATTCGATTCCTCTGAATCCAGAGCAAATGGAGAAAGTCGATTTGGATTTGAAGGCAGATCTGAGTCGATTTGATTTGGTAG
GGAATACGAGAGTTCAATGGCGAGTTGGATTGCTTGGATCTATGAAGTATGAGTGTCATCTTCATTGCGACCTTAAATTTCATCCTTCTAATGGAACTTACTTGAGATCG
CCTTGCAGTTCCAGAGCTAAATGA
Protein sequenceShow/hide protein sequence
MDDSDNSSQDERNVLIRYEMSFVVDLSSKHYILQSASKKFRTFKSTLTQMYILPYKDEPSRLQNPPEKYSHIDKKQWESFVNARLSEEWEELSSDPSNRAILWKEARKRK
NNEYSDDATRECAKRIDELAAERKGQDILTEALGTPEHRGRIRGVGEFVSPALYLNVARGKLKLSPESQDEDETQHSIDEYETQQSQKSSVLKKKTKGKKVQKGKKVQKG
KKVPKGKMVVKECEEILEGIPCHLAIGSMDNIVAVGTMFESDVQCPTIHEIPLGADNVRVMVDLVMGEDIEIPIPQNGEIKTSNQMIGNFVVWPRKLVIMTKEKKPPPST
TSKSITESSKYTDVHVTIKLLNRYAMLSMQVEDMIQIKLSEHIFGKEKTIYLRRDDIMQYCGMAEIGYSCILAYIACLWNACDCEVTKKFMIVDPATISSHVKSQEHRSR
NLVDRLEMVSLEKLVLIPYNTGCHWILIIIDLQENCVYVMDSLRSKILEDFQGVINILTRDMHMNKKKSILFGWNGQLLLDNSCRATKRFVNFLYLRNKRSRNYAPTIKS
NYLTNFKEQKSPPDVDGPIKSPISDRPMPPKLISDAKRRTHPLVWLAAVLCTVVSIAVIIGGIVIFIGYLVIHPRIPTISVLDAHLDNFQNDIAGRLEVQLTIVVEAEND
NAKAHASFSDSSFFLHFLGIKIAQLVADPFEVRKNSSLKFHYAVVSNSIPLNPEQMEKVDLDLKADLSRFDLVGNTRVQWRVGLLGSMKYECHLHCDLKFHPSNGTYLRS
PCSSRAK