CuGenDBv2

Lsi04G021450 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi04G021450
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: protein OVEREXPRESSOR OF CATIONIC PEROXIDASE 3
Genome location: chr04:28564684..28574086
RNA-Seq Expression: Lsi04G021450
Synteny: Lsi04G021450
Gene Ontology terms:
GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0098869 - cellular oxidant detoxification (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0003677 - DNA binding (molecular function)
GO:0004601 - peroxidase activity (molecular function)
InterPro domains: NA
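
The genome location in this record is written as `chromosome:start..end`. As a minimal sketch, the snippet below parses such a string and reports the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state; `Region` and `parse_location` are hypothetical names introduced only for illustration.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    """A genomic interval; coordinates ASSUMED 1-based and inclusive."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both endpoints are counted.
        return self.end - self.start + 1


def parse_location(text: str) -> Region:
    """Parse a 'chrom:start..end' string (hypothetical helper)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return Region(chrom, start, end)


region = parse_location("chr04:28564684..28574086")
print(region.chrom, region.length)  # chr04 9403
```

Under the inclusive-coordinate assumption the gene spans 9,403 bp on chr04; with a half-open convention the length would be one less.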


Homology
GenBank top hits (e value, % identity, alignment)
XP_004136473.1 uncharacterized protein LOC101217803 [Cucumis sativus]; e value: 5.5e-174; % identity: 77.91
Query:  MACKHLQRLFFISRRKHLNNTRFGALQSNSMLYHSAEHSSADQEVLPSEWYENAFRKIKNLSCSLKNVDLIDGRLFNVNDDSTIIDERIEQRMHTFKSLV
        M    LQRLFFISR KHL NTR GA QSNSMLYHSAE SSA QEVLPSEWYE AF KIK LSC L+NVDL+DGR+ N +DDSTI DERIEQ M TFKSLV
Subjt:  MACKHLQRLFFISRRKHLNNTRFGALQSNSMLYHSAEHSSADQEVLPSEWYENAFRKIKNLSCSLKNVDLIDGRLFNVNDDSTIIDERIEQRMHTFKSLV

Query:  RVLIGSPSAQRRITEMAVSSSIDCQPHAWFRNSSEREPIVVDSLTKVSNFLNVSAQQRKLVRHNICPQVTQHHIWTGALDHMLKELNLELDPLSHQSTNK
        R+LIGSPSAQRRITE+A SSSI+CQPHAWFRNSSERE +VVDSLTKV N L V+ QQRKLVRH ICPQVTQHHIWTGALD +LKELNLEL PLSH+ST+K
Subjt:  RVLIGSPSAQRRITEMAVSSSIDCQPHAWFRNSSEREPIVVDSLTKVSNFLNVSAQQRKLVRHNICPQVTQHHIWTGALDHMLKELNLELDPLSHQSTNK

Query:  GIKMGHQIVSSCLKFLNDATTNSNAHFTSWMRPAPLRAIVDSSAPPRWEDMLEMFTDLIGCLKDEKFLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKEAS
        GIKM  QIVSSCLKFL D  TNSN HF+SW+RPAP R +V SS PPRWEDMLEMF DLIG LKDEK LVHYVTKLEVMKEGLSQIKDV +D+SIG++EA 
Subjt:  GIKMGHQIVSSCLKFLNDATTNSNAHFTSWMRPAPLRAIVDSSAPPRWEDMLEMFTDLIGCLKDEKFLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKEAS

Query:  HQESLVQKKLSKTLGHSS---------------RDIEVDLSGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRQLDRVMGLFKFVWEIAGMMGELELQG
         QESLVQKKLSKTLGHSS               RDIEVD  GGLLK DGNDKFLLFMGRVLS DEEKIVWNGVRQLDR MG+FK VWE AGM GEL L+G
Subjt:  HQESLVQKKLSKTLGHSS---------------RDIEVDLSGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRQLDRVMGLFKFVWEIAGMMGELELQG

Query:  HLFCVGAEDRQL
        HLFCVG E RQL
Subjt:  HLFCVGAEDRQL

XP_008466389.1 PREDICTED: uncharacterized protein LOC103503810 [Cucumis melo]; e value: 6.7e-172; % identity: 77.35
Query:  LIMACKHLQRLFFISRRKHLNNTRFGALQSNSMLYHSAEHSSADQEVLPSEWYENAFRKIKNLSCSLKNVDLIDGRLFNVNDDSTIIDERIEQRMHTFKS
        +IM   HLQR FFISR KHL +TR GA QSNSMLYHS E SS DQEVLPSEWYE AF KIK LSC L+NVDL+DGR+ N +DDSTIIDERIEQ+M TFKS
Subjt:  LIMACKHLQRLFFISRRKHLNNTRFGALQSNSMLYHSAEHSSADQEVLPSEWYENAFRKIKNLSCSLKNVDLIDGRLFNVNDDSTIIDERIEQRMHTFKS

Query:  LVRVLIGSPSAQRRITEMAVSSSIDCQPHAWFRNSSEREPIVVDSLTKVSNFLNVSAQQRKLVRHNICPQVTQHHIWTGALDHMLKELNLELDPLSHQST
        LVR+LIGSPSAQRRITEMA SSSI+ Q HAWFRNSSERE +VVDSLTK  NFL V+ QQRKL+RH ICPQ+TQHHIWTGALD +LKELNLEL PLS++ST
Subjt:  LVRVLIGSPSAQRRITEMAVSSSIDCQPHAWFRNSSEREPIVVDSLTKVSNFLNVSAQQRKLVRHNICPQVTQHHIWTGALDHMLKELNLELDPLSHQST

Query:  NKGIKMGHQIVSSCLKFLNDATTNSNAHFTS-WMRPAPLRAIVDSSAPPRWEDMLEMFTDLIGCLKDEKFLVHYVTKLEVMKEGLSQIKDVLTDKSIGYK
        NKGI M  QIVSSCLKFL+DA TNSN HFTS W+RPAP R IV+SS PPRWEDMLEMF DLIG LKDEK LVHYVTKLEVMKEGLSQIKDV +D+SIG+K
Subjt:  NKGIKMGHQIVSSCLKFLNDATTNSNAHFTS-WMRPAPLRAIVDSSAPPRWEDMLEMFTDLIGCLKDEKFLVHYVTKLEVMKEGLSQIKDVLTDKSIGYK

Query:  EASHQESLVQKKLSKTLGHSS---------------RDIEVDLSGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRQLDRVMGLFKFVWEIAGMMGELE
        EA  QESLVQKKLSKTLGHSS               RDIEVD  GGLLK DGNDKFLLFMGRVLS DEEKIVWNGVRQLDR MG+FK VWE AGM GEL 
Subjt:  EASHQESLVQKKLSKTLGHSS---------------RDIEVDLSGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRQLDRVMGLFKFVWEIAGMMGELE

Query:  LQGHLFCVGAEDRQL
        LQGHLFCV  E RQL
Subjt:  LQGHLFCVGAEDRQL

XP_022940414.1 uncharacterized protein LOC111446029 [Cucurbita moschata]; e value: 5.3e-169; % identity: 75.18
Query:  LIMACKHLQRLFFISRRKHLNNTRFGALQSNSMLYHSAEHSSADQEVLPSEWYENAFRKIKNLSCSLKNVDLIDGRLFNVNDDSTIIDERIEQRMHTFKS
        +IMA KH QRL F+ R  HLN TR  A  SN MLYH  E S  D E LP++WYE AF KIK LSCSLKNVDLIDGRL NVNDDSTI+DERIEQRM  FKS
Subjt:  LIMACKHLQRLFFISRRKHLNNTRFGALQSNSMLYHSAEHSSADQEVLPSEWYENAFRKIKNLSCSLKNVDLIDGRLFNVNDDSTIIDERIEQRMHTFKS

Query:  LVRVLIGSPSAQRRITEMAVSSSIDCQPHAWFRNSSEREPIVVDSLTKVSNFLNVSAQQRKLVRHNICPQVTQHHIWTGALDHMLKELNLELDPLSHQST
        LVRV IGSPS QRR+TEMA S++ + QP   FRNSSEREP+VVDSLTKVSNFLNVSAQQRKLVRH ICPQ TQHHIWTGALDH+LKEL +ELDPL+H S 
Subjt:  LVRVLIGSPSAQRRITEMAVSSSIDCQPHAWFRNSSEREPIVVDSLTKVSNFLNVSAQQRKLVRHNICPQVTQHHIWTGALDHMLKELNLELDPLSHQST

Query:  NKGIKMGHQIVSSCLKFLNDATTNSNAHFTSWMRPAPLRAIVDSSAPPRWEDMLEMFTDLIGCLKDEKFLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKE
        NKGIKMG QIVSSCL FLNDA TNSNAH TSWMRPAPL+  VDSS  P+WEDMLEMFTDLI  LKDEK L  YVTKLEVMKEGL+QI+DVLTDKSIG+KE
Subjt:  NKGIKMGHQIVSSCLKFLNDATTNSNAHFTSWMRPAPLRAIVDSSAPPRWEDMLEMFTDLIGCLKDEKFLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKE

Query:  ASHQESLVQKKLSKTLGHSS---------------RDIEVDLSGGLLKA-DGNDKFLLFMGRVLSSDEEKIVWNGVRQLDRVMGLFKFVWEIAGMMGELE
        A HQESLVQKKLSKTLGHSS               RD+EVDL GGLLKA +  +K+L+FMGR+LS DEE++VWNGVRQLDR MGLFKFVWE AGM G+L 
Subjt:  ASHQESLVQKKLSKTLGHSS---------------RDIEVDLSGGLLKA-DGNDKFLLFMGRVLSSDEEKIVWNGVRQLDRVMGLFKFVWEIAGMMGELE

Query:  LQGHLFCVGAEDRQL
        LQGHLFCVGAEDRQL
Subjt:  LQGHLFCVGAEDRQL

XP_022982191.1 uncharacterized protein LOC111481093 [Cucurbita maxima]; e value: 1.3e-170; % identity: 75.85
Query:  LIMACKHLQRLFFISRRKHLNNTRFGALQSNSMLYHSAEHSSADQEVLPSEWYENAFRKIKNLSCSLKNVDLIDGRLFNVNDDSTIIDERIEQRMHTFKS
        +IMA KH QRL F+ R  HLN TR  AL SN MLYH +E S  DQE LP++WYE AF KIK LSCSLKNVDLIDGRL NVNDDSTI+DERIEQRM  FKS
Subjt:  LIMACKHLQRLFFISRRKHLNNTRFGALQSNSMLYHSAEHSSADQEVLPSEWYENAFRKIKNLSCSLKNVDLIDGRLFNVNDDSTIIDERIEQRMHTFKS

Query:  LVRVLIGSPSAQRRITEMAVSSSIDCQPHAWFRNSSEREPIVVDSLTKVSNFLNVSAQQRKLVRHNICPQVTQHHIWTGALDHMLKELNLELDPLSHQST
        LVRV IGS S QRR+TEMA S++I+ QP A FRNSSEREP+VVDS TKVSNFLNVSAQQRKLVRH ICPQ TQHHIWTGALDH+LKEL +ELDPL+H S 
Subjt:  LVRVLIGSPSAQRRITEMAVSSSIDCQPHAWFRNSSEREPIVVDSLTKVSNFLNVSAQQRKLVRHNICPQVTQHHIWTGALDHMLKELNLELDPLSHQST

Query:  NKGIKMGHQIVSSCLKFLNDATTNSNAHFTSWMRPAPLRAIVDSSAPPRWEDMLEMFTDLIGCLKDEKFLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKE
        NKGIKMG QIVSSCLKFLNDA TNSNAH TSWMRPAPL+  VDSS  P+WEDMLEMFTDLIG LKDEK L  YVTKLEVMKEGL+QI+DVL DKSIG+KE
Subjt:  NKGIKMGHQIVSSCLKFLNDATTNSNAHFTSWMRPAPLRAIVDSSAPPRWEDMLEMFTDLIGCLKDEKFLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKE

Query:  ASHQESLVQKKLSKTLGHSS---------------RDIEVDLSGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRQLDRVMGLFKFVWEIAGMMGELEL
        A HQESLVQKKLSKTLGHSS               RD+EVDL GGLLKA   +K+L+FMGR+LS DEE+ VWNGVRQLDR MGLFKFVWE AGM G+L L
Subjt:  ASHQESLVQKKLSKTLGHSS---------------RDIEVDLSGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRQLDRVMGLFKFVWEIAGMMGELEL

Query:  QGHLFCVGAEDRQL
        +GHLFCVGAEDRQL
Subjt:  QGHLFCVGAEDRQL

XP_038896888.1 uncharacterized protein LOC120085101 isoform X1 [Benincasa hispida]; e value: 6.2e-202; % identity: 88.19
Query:  LIMACKHLQRLFFISRRKHLNNTRFGALQSNSMLYHSAEHSSADQEVLPSEWYENAFRKIKNLSCSLKNVDLIDGRLFNVNDDSTIIDERIEQRMHTFKS
        +IMA K+LQRLFFISR KHLNNTRFGALQSNSMLYH AEHSSADQEVLPSEWYENAFRKIK LSCSLKNVDLIDGRL NVNDDSTIIDE IEQRM TFKS
Subjt:  LIMACKHLQRLFFISRRKHLNNTRFGALQSNSMLYHSAEHSSADQEVLPSEWYENAFRKIKNLSCSLKNVDLIDGRLFNVNDDSTIIDERIEQRMHTFKS

Query:  LVRVLIGSPSAQRRITEMAVSSSIDCQPHAWFRNSSEREPIVVDSLTKVSNFLNVSAQQRKLVRHNICPQVTQHHIWTGALDHMLKELNLELDPLSHQST
        LV VLIGSP+A+RRITEMAVSSSI CQPHAWFRN SEREP++VDSLTK+SNFLNVSAQQRKLVRH ICPQVTQHHIWTGALDHMLKELNLEL PLS QST
Subjt:  LVRVLIGSPSAQRRITEMAVSSSIDCQPHAWFRNSSEREPIVVDSLTKVSNFLNVSAQQRKLVRHNICPQVTQHHIWTGALDHMLKELNLELDPLSHQST

Query:  NKGIKMGHQIVSSCLKFLNDATTNSNAHFTSWMRPAPLR-AIVDSSAPPRWEDMLEMFTDLIGCLKDEKFLVHYVTKLEVMKEGLSQIKDVLTDKSIGYK
        NKGIKMGHQIVSSCLKFL+DA TNSNAHFTSWMRPAPLR A+VDSSAPPRWEDMLEMFTDLI CLK+EK LVHYVTKL+VMKEGLSQIKDVLTDKSIGYK
Subjt:  NKGIKMGHQIVSSCLKFLNDATTNSNAHFTSWMRPAPLR-AIVDSSAPPRWEDMLEMFTDLIGCLKDEKFLVHYVTKLEVMKEGLSQIKDVLTDKSIGYK

Query:  EASHQESLVQKKLSKTLGHSS---------------RDIEVDLSGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRQLDRVMGLFKFVWEIAGMMGELE
        EASHQESLVQKKLSKTLGHSS               RDIEVDL GGLLKADGNDKFLLFMGRVLSSDEEKIVWNG+RQLDRVMGLFKFVWE AGM G+LE
Subjt:  EASHQESLVQKKLSKTLGHSS---------------RDIEVDLSGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRQLDRVMGLFKFVWEIAGMMGELE

Query:  LQGHLFCVGAEDRQL
        LQGHLFCVG EDRQL
Subjt:  LQGHLFCVGAEDRQL

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LED2 Uncharacterized protein; e value: 2.6e-174; % identity: 77.91
Query:  MACKHLQRLFFISRRKHLNNTRFGALQSNSMLYHSAEHSSADQEVLPSEWYENAFRKIKNLSCSLKNVDLIDGRLFNVNDDSTIIDERIEQRMHTFKSLV
        M    LQRLFFISR KHL NTR GA QSNSMLYHSAE SSA QEVLPSEWYE AF KIK LSC L+NVDL+DGR+ N +DDSTI DERIEQ M TFKSLV
Subjt:  MACKHLQRLFFISRRKHLNNTRFGALQSNSMLYHSAEHSSADQEVLPSEWYENAFRKIKNLSCSLKNVDLIDGRLFNVNDDSTIIDERIEQRMHTFKSLV

Query:  RVLIGSPSAQRRITEMAVSSSIDCQPHAWFRNSSEREPIVVDSLTKVSNFLNVSAQQRKLVRHNICPQVTQHHIWTGALDHMLKELNLELDPLSHQSTNK
        R+LIGSPSAQRRITE+A SSSI+CQPHAWFRNSSERE +VVDSLTKV N L V+ QQRKLVRH ICPQVTQHHIWTGALD +LKELNLEL PLSH+ST+K
Subjt:  RVLIGSPSAQRRITEMAVSSSIDCQPHAWFRNSSEREPIVVDSLTKVSNFLNVSAQQRKLVRHNICPQVTQHHIWTGALDHMLKELNLELDPLSHQSTNK

Query:  GIKMGHQIVSSCLKFLNDATTNSNAHFTSWMRPAPLRAIVDSSAPPRWEDMLEMFTDLIGCLKDEKFLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKEAS
        GIKM  QIVSSCLKFL D  TNSN HF+SW+RPAP R +V SS PPRWEDMLEMF DLIG LKDEK LVHYVTKLEVMKEGLSQIKDV +D+SIG++EA 
Subjt:  GIKMGHQIVSSCLKFLNDATTNSNAHFTSWMRPAPLRAIVDSSAPPRWEDMLEMFTDLIGCLKDEKFLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKEAS

Query:  HQESLVQKKLSKTLGHSS---------------RDIEVDLSGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRQLDRVMGLFKFVWEIAGMMGELELQG
         QESLVQKKLSKTLGHSS               RDIEVD  GGLLK DGNDKFLLFMGRVLS DEEKIVWNGVRQLDR MG+FK VWE AGM GEL L+G
Subjt:  HQESLVQKKLSKTLGHSS---------------RDIEVDLSGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRQLDRVMGLFKFVWEIAGMMGELELQG

Query:  HLFCVGAEDRQL
        HLFCVG E RQL
Subjt:  HLFCVGAEDRQL

A0A1S3CR45 uncharacterized protein LOC103503810; e value: 3.2e-172; % identity: 77.35
Query:  LIMACKHLQRLFFISRRKHLNNTRFGALQSNSMLYHSAEHSSADQEVLPSEWYENAFRKIKNLSCSLKNVDLIDGRLFNVNDDSTIIDERIEQRMHTFKS
        +IM   HLQR FFISR KHL +TR GA QSNSMLYHS E SS DQEVLPSEWYE AF KIK LSC L+NVDL+DGR+ N +DDSTIIDERIEQ+M TFKS
Subjt:  LIMACKHLQRLFFISRRKHLNNTRFGALQSNSMLYHSAEHSSADQEVLPSEWYENAFRKIKNLSCSLKNVDLIDGRLFNVNDDSTIIDERIEQRMHTFKS

Query:  LVRVLIGSPSAQRRITEMAVSSSIDCQPHAWFRNSSEREPIVVDSLTKVSNFLNVSAQQRKLVRHNICPQVTQHHIWTGALDHMLKELNLELDPLSHQST
        LVR+LIGSPSAQRRITEMA SSSI+ Q HAWFRNSSERE +VVDSLTK  NFL V+ QQRKL+RH ICPQ+TQHHIWTGALD +LKELNLEL PLS++ST
Subjt:  LVRVLIGSPSAQRRITEMAVSSSIDCQPHAWFRNSSEREPIVVDSLTKVSNFLNVSAQQRKLVRHNICPQVTQHHIWTGALDHMLKELNLELDPLSHQST

Query:  NKGIKMGHQIVSSCLKFLNDATTNSNAHFTS-WMRPAPLRAIVDSSAPPRWEDMLEMFTDLIGCLKDEKFLVHYVTKLEVMKEGLSQIKDVLTDKSIGYK
        NKGI M  QIVSSCLKFL+DA TNSN HFTS W+RPAP R IV+SS PPRWEDMLEMF DLIG LKDEK LVHYVTKLEVMKEGLSQIKDV +D+SIG+K
Subjt:  NKGIKMGHQIVSSCLKFLNDATTNSNAHFTS-WMRPAPLRAIVDSSAPPRWEDMLEMFTDLIGCLKDEKFLVHYVTKLEVMKEGLSQIKDVLTDKSIGYK

Query:  EASHQESLVQKKLSKTLGHSS---------------RDIEVDLSGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRQLDRVMGLFKFVWEIAGMMGELE
        EA  QESLVQKKLSKTLGHSS               RDIEVD  GGLLK DGNDKFLLFMGRVLS DEEKIVWNGVRQLDR MG+FK VWE AGM GEL 
Subjt:  EASHQESLVQKKLSKTLGHSS---------------RDIEVDLSGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRQLDRVMGLFKFVWEIAGMMGELE

Query:  LQGHLFCVGAEDRQL
        LQGHLFCV  E RQL
Subjt:  LQGHLFCVGAEDRQL

A0A6J1CBU2 uncharacterized protein LOC111010055 isoform X1; e value: 9.1e-167; % identity: 75.6
Query:  LIMACKHLQRLFFISRRKHLNNTRFGALQSNSMLYHSAEHSSADQEVLPSEWYENAFRKIKNLSCSLKNVDLIDGRLFNVNDDSTIIDERIEQRMHTFKS
        +I+A K  QRL FI R  HLNNTR+GAL SN MLYHSAE+SSADQE+LPSEWYENA+RKI+ LSCSLKNVDLIDGRL NV DDSTI DERIEQRM  FKS
Subjt:  LIMACKHLQRLFFISRRKHLNNTRFGALQSNSMLYHSAEHSSADQEVLPSEWYENAFRKIKNLSCSLKNVDLIDGRLFNVNDDSTIIDERIEQRMHTFKS

Query:  LVRVLIGSPSAQRRITE--MAVSSSIDCQPHAWFRNSSEREPIVVDSLTKVSNFLNVSAQQRKLVRHNICPQVTQHHIWTGALDHMLKELNLELDPLSHQ
        LVRV +GSPSA+RR+TE  MA SS+ +CQP   F NSSEREP+VVDSLTK+SNFLNVSAQQRKLVRH ICPQVTQHHIWTGALDHMLKEL LELDPL+HQ
Subjt:  LVRVLIGSPSAQRRITE--MAVSSSIDCQPHAWFRNSSEREPIVVDSLTKVSNFLNVSAQQRKLVRHNICPQVTQHHIWTGALDHMLKELNLELDPLSHQ

Query:  ST-NKGIKMGHQIVSSCLKFLNDATTNSNAHFTSWMRPAPLRAIVDSSAPPRWEDMLEMFTDLIGCLKDEKFLVHYVTKLEVMKEGLSQIKDVLTD-KSI
        ST NKGIKMG QIVSSCLKFL+DA TNSNAHFTSWMRPAP + +VD SA PRWEDMLEMF DLIG LK EK L+ +V KLEVMKEGLSQIKDVL+D KSI
Subjt:  ST-NKGIKMGHQIVSSCLKFLNDATTNSNAHFTSWMRPAPLRAIVDSSAPPRWEDMLEMFTDLIGCLKDEKFLVHYVTKLEVMKEGLSQIKDVLTD-KSI

Query:  GYKEASHQESLVQKKLSKTLGHSS---------------RDIEVDLSGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRQLDRVMGLFKFVWEIAGMMG
        G+KE+ HQESLVQ+KLSKTLGHSS               RDIEVD  GG+LK   N+KF L MGR+LS DEEK+VWNGV+QLDR MG+FKFVWE AGM G
Subjt:  GYKEASHQESLVQKKLSKTLGHSS---------------RDIEVDLSGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRQLDRVMGLFKFVWEIAGMMG

Query:  ELELQGHLFCVGAEDRQL
         LELQGHL+ VGA+ RQL
Subjt:  ELELQGHLFCVGAEDRQL

A0A6J1FK17 uncharacterized protein LOC111446029; e value: 2.6e-169; % identity: 75.18
Query:  LIMACKHLQRLFFISRRKHLNNTRFGALQSNSMLYHSAEHSSADQEVLPSEWYENAFRKIKNLSCSLKNVDLIDGRLFNVNDDSTIIDERIEQRMHTFKS
        +IMA KH QRL F+ R  HLN TR  A  SN MLYH  E S  D E LP++WYE AF KIK LSCSLKNVDLIDGRL NVNDDSTI+DERIEQRM  FKS
Subjt:  LIMACKHLQRLFFISRRKHLNNTRFGALQSNSMLYHSAEHSSADQEVLPSEWYENAFRKIKNLSCSLKNVDLIDGRLFNVNDDSTIIDERIEQRMHTFKS

Query:  LVRVLIGSPSAQRRITEMAVSSSIDCQPHAWFRNSSEREPIVVDSLTKVSNFLNVSAQQRKLVRHNICPQVTQHHIWTGALDHMLKELNLELDPLSHQST
        LVRV IGSPS QRR+TEMA S++ + QP   FRNSSEREP+VVDSLTKVSNFLNVSAQQRKLVRH ICPQ TQHHIWTGALDH+LKEL +ELDPL+H S 
Subjt:  LVRVLIGSPSAQRRITEMAVSSSIDCQPHAWFRNSSEREPIVVDSLTKVSNFLNVSAQQRKLVRHNICPQVTQHHIWTGALDHMLKELNLELDPLSHQST

Query:  NKGIKMGHQIVSSCLKFLNDATTNSNAHFTSWMRPAPLRAIVDSSAPPRWEDMLEMFTDLIGCLKDEKFLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKE
        NKGIKMG QIVSSCL FLNDA TNSNAH TSWMRPAPL+  VDSS  P+WEDMLEMFTDLI  LKDEK L  YVTKLEVMKEGL+QI+DVLTDKSIG+KE
Subjt:  NKGIKMGHQIVSSCLKFLNDATTNSNAHFTSWMRPAPLRAIVDSSAPPRWEDMLEMFTDLIGCLKDEKFLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKE

Query:  ASHQESLVQKKLSKTLGHSS---------------RDIEVDLSGGLLKA-DGNDKFLLFMGRVLSSDEEKIVWNGVRQLDRVMGLFKFVWEIAGMMGELE
        A HQESLVQKKLSKTLGHSS               RD+EVDL GGLLKA +  +K+L+FMGR+LS DEE++VWNGVRQLDR MGLFKFVWE AGM G+L 
Subjt:  ASHQESLVQKKLSKTLGHSS---------------RDIEVDLSGGLLKA-DGNDKFLLFMGRVLSSDEEKIVWNGVRQLDRVMGLFKFVWEIAGMMGELE

Query:  LQGHLFCVGAEDRQL
        LQGHLFCVGAEDRQL
Subjt:  LQGHLFCVGAEDRQL

A0A6J1IW03 uncharacterized protein LOC111481093; e value: 6.1e-171; % identity: 75.85
Query:  LIMACKHLQRLFFISRRKHLNNTRFGALQSNSMLYHSAEHSSADQEVLPSEWYENAFRKIKNLSCSLKNVDLIDGRLFNVNDDSTIIDERIEQRMHTFKS
        +IMA KH QRL F+ R  HLN TR  AL SN MLYH +E S  DQE LP++WYE AF KIK LSCSLKNVDLIDGRL NVNDDSTI+DERIEQRM  FKS
Subjt:  LIMACKHLQRLFFISRRKHLNNTRFGALQSNSMLYHSAEHSSADQEVLPSEWYENAFRKIKNLSCSLKNVDLIDGRLFNVNDDSTIIDERIEQRMHTFKS

Query:  LVRVLIGSPSAQRRITEMAVSSSIDCQPHAWFRNSSEREPIVVDSLTKVSNFLNVSAQQRKLVRHNICPQVTQHHIWTGALDHMLKELNLELDPLSHQST
        LVRV IGS S QRR+TEMA S++I+ QP A FRNSSEREP+VVDS TKVSNFLNVSAQQRKLVRH ICPQ TQHHIWTGALDH+LKEL +ELDPL+H S 
Subjt:  LVRVLIGSPSAQRRITEMAVSSSIDCQPHAWFRNSSEREPIVVDSLTKVSNFLNVSAQQRKLVRHNICPQVTQHHIWTGALDHMLKELNLELDPLSHQST

Query:  NKGIKMGHQIVSSCLKFLNDATTNSNAHFTSWMRPAPLRAIVDSSAPPRWEDMLEMFTDLIGCLKDEKFLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKE
        NKGIKMG QIVSSCLKFLNDA TNSNAH TSWMRPAPL+  VDSS  P+WEDMLEMFTDLIG LKDEK L  YVTKLEVMKEGL+QI+DVL DKSIG+KE
Subjt:  NKGIKMGHQIVSSCLKFLNDATTNSNAHFTSWMRPAPLRAIVDSSAPPRWEDMLEMFTDLIGCLKDEKFLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKE

Query:  ASHQESLVQKKLSKTLGHSS---------------RDIEVDLSGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRQLDRVMGLFKFVWEIAGMMGELEL
        A HQESLVQKKLSKTLGHSS               RD+EVDL GGLLKA   +K+L+FMGR+LS DEE+ VWNGVRQLDR MGLFKFVWE AGM G+L L
Subjt:  ASHQESLVQKKLSKTLGHSS---------------RDIEVDLSGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRQLDRVMGLFKFVWEIAGMMGELEL

Query:  QGHLFCVGAEDRQL
        +GHLFCVGAEDRQL
Subjt:  QGHLFCVGAEDRQL

SwissProt top hits (e value, % identity, alignment)
Q8H0V5 Protein OVEREXPRESSOR OF CATIONIC PEROXIDASE 3; e value: 5.9e-62; % identity: 46.26
Query:  PEAFPGSPSIS-GRESFFFRQPFPSNFSLLRLPPVRHTLSLTFARRRNQNSSVSPSSSSSKKKKRNLIPKEARGEEEDVEEDALELLFSQLEEDLKNDAP
        P +F  S  +S  R  F  R   P   SL  L P R    L FAR +N+   VS SSSS KK K+  +     G  E+ EED  E LF+ LEEDLKN   
Subjt:  PEAFPGSPSIS-GRESFFFRQPFPSNFSLLRLPPVRHTLSLTFARRRNQNSSVSPSSSSSKKKKRNLIPKEARGEEEDVEEDALELLFSQLEEDLKNDAP

Query:  SLDDGEEDEFSEEHLARLERELGLALG---------------------IDVDDEEEEETENEEVLEDTEEEEMPVKLKNWQLRRLASALKKGRRKTSIKS
          D+ +++E SEE L  L  EL  ALG                     +D DD++ ++ +N++  +D+EE+E P KLKNWQL+RLA ALK GRRKTSIK+
Subjt:  SLDDGEEDEFSEEHLARLERELGLALG---------------------IDVDDEEEEETENEEVLEDTEEEEMPVKLKNWQLRRLASALKKGRRKTSIKS

Query:  LAAELCLDRAIVLHLLREPPPSLLMLSATLPD-----------TPTPSIRETKTLHTDEELIVVDTAKEAEGVTVPVHVMQQSWAAQKRLKKVQIETLES
        LAAE+CLDRA VL LLR+PPP LLMLSATLPD           +P PS  E+       E +VV+  ++ +     VHVMQQ W+AQKR+KK  IETLE 
Subjt:  LAAELCLDRAIVLHLLREPPPSLLMLSATLPD-----------TPTPSIRETKTLHTDEELIVVDTAKEAEGVTVPVHVMQQSWAAQKRLKKVQIETLES

Query:  VYRKTKRPTVSIVSSIICRQKKSIKSLNSEYLIPVPNAMISSIVQVTNLPRKRIVKWFEDRRVEDGVPDQRVPY
        VYR++KRPT                           NA++SSIVQVTNLPRKR++KWFED+R EDGVPD+R PY
Subjt:  VYRKTKRPTVSIVSSIICRQKKSIKSLNSEYLIPVPNAMISSIVQVTNLPRKRIVKWFEDRRVEDGVPDQRVPY

Arabidopsis top hits (e value, % identity, alignment)
AT5G11270.1 overexpressor of cationic peroxidase 3; e value: 4.2e-63; % identity: 46.26
Query:  PEAFPGSPSIS-GRESFFFRQPFPSNFSLLRLPPVRHTLSLTFARRRNQNSSVSPSSSSSKKKKRNLIPKEARGEEEDVEEDALELLFSQLEEDLKNDAP
        P +F  S  +S  R  F  R   P   SL  L P R    L FAR +N+   VS SSSS KK K+  +     G  E+ EED  E LF+ LEEDLKN   
Subjt:  PEAFPGSPSIS-GRESFFFRQPFPSNFSLLRLPPVRHTLSLTFARRRNQNSSVSPSSSSSKKKKRNLIPKEARGEEEDVEEDALELLFSQLEEDLKNDAP

Query:  SLDDGEEDEFSEEHLARLERELGLALG---------------------IDVDDEEEEETENEEVLEDTEEEEMPVKLKNWQLRRLASALKKGRRKTSIKS
          D+ +++E SEE L  L  EL  ALG                     +D DD++ ++ +N++  +D+EE+E P KLKNWQL+RLA ALK GRRKTSIK+
Subjt:  SLDDGEEDEFSEEHLARLERELGLALG---------------------IDVDDEEEEETENEEVLEDTEEEEMPVKLKNWQLRRLASALKKGRRKTSIKS

Query:  LAAELCLDRAIVLHLLREPPPSLLMLSATLPD-----------TPTPSIRETKTLHTDEELIVVDTAKEAEGVTVPVHVMQQSWAAQKRLKKVQIETLES
        LAAE+CLDRA VL LLR+PPP LLMLSATLPD           +P PS  E+       E +VV+  ++ +     VHVMQQ W+AQKR+KK  IETLE 
Subjt:  LAAELCLDRAIVLHLLREPPPSLLMLSATLPD-----------TPTPSIRETKTLHTDEELIVVDTAKEAEGVTVPVHVMQQSWAAQKRLKKVQIETLES

Query:  VYRKTKRPTVSIVSSIICRQKKSIKSLNSEYLIPVPNAMISSIVQVTNLPRKRIVKWFEDRRVEDGVPDQRVPY
        VYR++KRPT                           NA++SSIVQVTNLPRKR++KWFED+R EDGVPD+R PY
Subjt:  VYRKTKRPTVSIVSSIICRQKKSIKSLNSEYLIPVPNAMISSIVQVTNLPRKRIVKWFEDRRVEDGVPDQRVPY

AT5G25500.1 unknown protein; e value: 4.3e-84; % identity: 45.23
Query:  NNTRFGALQSNSMLYHSAEHSSADQEVLPSEWYENAFRKIKNLSCSLKNVDLIDGRLFNVNDDSTIIDERIEQRMHTFKSLVRVLIGSPSAQRRITEMAV
        N +RF   +S  +LYH +  S  D  VLP EWYE     +K L+ +L++VDL+DG+L ++N    + D+ I ++M  FKSL R+ IGSPS Q+++ E   
Subjt:  NNTRFGALQSNSMLYHSAEHSSADQEVLPSEWYENAFRKIKNLSCSLKNVDLIDGRLFNVNDDSTIIDERIEQRMHTFKSLVRVLIGSPSAQRRITEMAV

Query:  SSSIDCQPHAWFRNSSEREPIVVDSLTKVSNFLNVSAQQRKLVRHNICPQVTQHHIWTGALDHMLKELNLELDPL-SHQSTNKGIKMGHQIVSSCLKFLN
                  +F + SEREP+VV+SLTKV NFLNVSAQQRKLVR  +C QVTQ+ IW G L+ +L  L  E+D L  H+  ++G  +  Q++ SCL+FL+
Subjt:  SSSIDCQPHAWFRNSSEREPIVVDSLTKVSNFLNVSAQQRKLVRHNICPQVTQHHIWTGALDHMLKELNLELDPL-SHQSTNKGIKMGHQIVSSCLKFLN

Query:  DATTNSNAH-FTSWMRPAPLRAIVDSSAPPRWEDMLEMFTDLIGCLK--DEKFLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKEASHQESLVQKKLSKTL
        +++ +      TSWMRP P R    ++A  +WED+L+M  DL   L+  +E  +++++ KL  MKEGL QIKDV  D +IG++E  HQE LV +KLSK L
Subjt:  DATTNSNAH-FTSWMRPAPLRAIVDSSAPPRWEDMLEMFTDLIGCLK--DEKFLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKEASHQESLVQKKLSKTL

Query:  GHSS---------------RDIEVDLSGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRQLDRVMGLFKFVWEIAGMMGELELQGHLFCVGAEDRQL
        G  S               RDIEVDL GG  K + ++   L MGR+L+S +EK++  G++QLDR +GLF+FVWE AGM   L LQGHL+C+GAE+R +
Subjt:  GHSS---------------RDIEVDLSGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRQLDRVMGLFKFVWEIAGMMGELELQGHLFCVGAEDRQL


Sequences
CDS sequence
ATGGAAACGGTAGGCGAACTTTGCCCGAGGGCTAAAATTCCCCACTCAATTCGCTTTATGATTATTCAACTTAATCGGATTTTTTCCCGCTTGATTTCGCTAATGATTTC
AACGACCCAGATTCTCCATTTTCATTTCTGTAATGGTACACCCTCCAACTGTTCGACGAAATGCTGCAATGAAGTTCCAGGCTATGGCAGACTTGCTCGGAGTTCAACTA
GGGATGACTTCCGAGTCCTGATTATGGCGTGTAAGCATTTGCAGAGGCTGTTCTTCATCTCTCGCCGAAAGCATCTCAATAACACGAGATTTGGAGCTTTGCAATCAAAT
TCTATGTTGTATCATTCCGCAGAGCACTCCTCCGCTGATCAAGAGGTGTTACCATCTGAATGGTACGAGAATGCTTTTCGGAAGATAAAAAATTTGAGCTGCTCGTTGAA
GAATGTGGATTTGATCGATGGACGACTTTTTAATGTTAATGATGATTCGACCATTATCGACGAGCGAATTGAACAGAGAATGCATACTTTCAAGTCCCTTGTAAGGGTCT
TGATTGGTTCTCCATCGGCTCAGAGGAGAATAACAGAGATGGCCGTATCGAGTTCTATAGATTGTCAGCCTCACGCATGGTTCAGAAATTCGAGTGAACGAGAGCCAATT
GTTGTTGATTCACTCACCAAGGTCAGCAACTTCCTCAACGTCTCTGCCCAACAAAGGAAACTGGTGCGCCACAACATATGCCCACAGGTTACACAACATCACATTTGGAC
TGGTGCATTGGATCACATGCTGAAAGAGTTAAATTTGGAGTTGGATCCATTATCTCATCAGTCAACCAACAAAGGGATCAAAATGGGGCATCAGATAGTTTCAAGTTGCC
TAAAGTTTTTGAATGATGCTACTACCAATTCAAATGCTCACTTCACTTCATGGATGCGGCCAGCGCCATTACGAGCAATTGTCGATTCGTCAGCACCGCCAAGATGGGAA
GACATGCTCGAGATGTTCACCGATCTGATTGGCTGTCTGAAAGACGAGAAATTTTTGGTCCATTATGTGACAAAGCTTGAAGTTATGAAAGAGGGGCTTTCCCAGATCAA
AGATGTATTGACTGATAAAAGCATTGGATACAAGGAAGCCAGTCATCAAGAAAGCTTAGTGCAGAAGAAGCTTTCAAAGACATTGGGCCACTCATCCAGGGATATTGAAG
TGGATCTTTCTGGTGGGTTGTTGAAGGCTGATGGGAATGACAAGTTTTTGTTGTTCATGGGGAGGGTTTTGAGTTCTGATGAAGAGAAAATTGTTTGGAATGGGGTGAGG
CAGCTTGATAGAGTTATGGGGCTTTTTAAATTTGTTTGGGAAATAGCTGGAATGATGGGAGAATTGGAATTGCAAGGCCATTTATTTTGTGTTGGGGCTGAGGATAGGCA
GCTTATGTCTGTCGACATAAATCGACCGACGATGGTTCTTTCAGCGCCGGTGAAGGCACCGGAAGCATTTCCAGGTTCGCCGTCGATTTCTGGCCGAGAGAGCTTCTTCT
TTCGCCAGCCCTTTCCATCCAATTTTAGTCTTCTGCGTCTCCCTCCGGTTCGTCACACTCTATCGCTTACATTTGCTCGCCGTCGGAACCAAAATTCATCAGTCAGTCCG
TCTTCATCGTCTTCGAAGAAAAAGAAGAGAAATTTGATTCCAAAAGAAGCTAGGGGCGAGGAGGAGGATGTAGAGGAGGATGCTCTTGAGTTGTTGTTTAGTCAACTGGA
AGAAGATCTCAAAAATGACGCCCCTTCCTTGGATGACGGTGAAGAGGATGAATTTAGCGAAGAGCACCTTGCCAGACTAGAGCGCGAGTTAGGGTTAGCACTTGGCATCG
ATGTTGACGATGAAGAAGAAGAAGAAACAGAAAATGAAGAAGTTCTCGAGGATACTGAAGAAGAGGAAATGCCTGTAAAACTTAAGAACTGGCAACTTCGTCGACTAGCC
TCGGCTTTGAAGAAAGGCCGCCGTAAAACTAGCATCAAGAGTCTTGCTGCGGAGCTTTGTCTCGATAGGGCCATCGTTCTTCATTTGCTTCGGGAACCACCACCTAGTCT
TCTGATGTTGAGTGCTACTCTTCCAGACACTCCTACACCATCAATTCGAGAAACTAAAACATTACATACTGATGAAGAACTCATAGTAGTAGACACTGCAAAAGAGGCAG
AAGGGGTGACGGTGCCTGTTCATGTCATGCAACAGAGTTGGGCTGCTCAAAAGAGACTGAAGAAGGTTCAGATTGAAACTCTTGAAAGTGTTTATAGAAAAACAAAGCGG
CCCACTGTAAGTATAGTTTCCTCCATCATTTGTCGTCAAAAGAAGTCGATAAAATCTTTAAATTCCGAGTATCTGATTCCTGTGCCAAATGCGATGATTAGTAGCATCGT
CCAAGTGACAAATTTGCCTCGCAAAAGAATAGTGAAATGGTTTGAAGATAGGCGAGTTGAAGATGGGGTTCCTGATCAACGCGTGCCTTATGATCGGTCTGCTCCTAAAT
CTGTTTGA
mRNA sequence
ATGGAAACGGTAGGCGAACTTTGCCCGAGGGCTAAAATTCCCCACTCAATTCGCTTTATGATTATTCAACTTAATCGGATTTTTTCCCGCTTGATTTCGCTAATGATTTC
AACGACCCAGATTCTCCATTTTCATTTCTGTAATGGTACACCCTCCAACTGTTCGACGAAATGCTGCAATGAAGTTCCAGGCTATGGCAGACTTGCTCGGAGTTCAACTA
GGGATGACTTCCGAGTCCTGATTATGGCGTGTAAGCATTTGCAGAGGCTGTTCTTCATCTCTCGCCGAAAGCATCTCAATAACACGAGATTTGGAGCTTTGCAATCAAAT
TCTATGTTGTATCATTCCGCAGAGCACTCCTCCGCTGATCAAGAGGTGTTACCATCTGAATGGTACGAGAATGCTTTTCGGAAGATAAAAAATTTGAGCTGCTCGTTGAA
GAATGTGGATTTGATCGATGGACGACTTTTTAATGTTAATGATGATTCGACCATTATCGACGAGCGAATTGAACAGAGAATGCATACTTTCAAGTCCCTTGTAAGGGTCT
TGATTGGTTCTCCATCGGCTCAGAGGAGAATAACAGAGATGGCCGTATCGAGTTCTATAGATTGTCAGCCTCACGCATGGTTCAGAAATTCGAGTGAACGAGAGCCAATT
GTTGTTGATTCACTCACCAAGGTCAGCAACTTCCTCAACGTCTCTGCCCAACAAAGGAAACTGGTGCGCCACAACATATGCCCACAGGTTACACAACATCACATTTGGAC
TGGTGCATTGGATCACATGCTGAAAGAGTTAAATTTGGAGTTGGATCCATTATCTCATCAGTCAACCAACAAAGGGATCAAAATGGGGCATCAGATAGTTTCAAGTTGCC
TAAAGTTTTTGAATGATGCTACTACCAATTCAAATGCTCACTTCACTTCATGGATGCGGCCAGCGCCATTACGAGCAATTGTCGATTCGTCAGCACCGCCAAGATGGGAA
GACATGCTCGAGATGTTCACCGATCTGATTGGCTGTCTGAAAGACGAGAAATTTTTGGTCCATTATGTGACAAAGCTTGAAGTTATGAAAGAGGGGCTTTCCCAGATCAA
AGATGTATTGACTGATAAAAGCATTGGATACAAGGAAGCCAGTCATCAAGAAAGCTTAGTGCAGAAGAAGCTTTCAAAGACATTGGGCCACTCATCCAGGGATATTGAAG
TGGATCTTTCTGGTGGGTTGTTGAAGGCTGATGGGAATGACAAGTTTTTGTTGTTCATGGGGAGGGTTTTGAGTTCTGATGAAGAGAAAATTGTTTGGAATGGGGTGAGG
CAGCTTGATAGAGTTATGGGGCTTTTTAAATTTGTTTGGGAAATAGCTGGAATGATGGGAGAATTGGAATTGCAAGGCCATTTATTTTGTGTTGGGGCTGAGGATAGGCA
GCTTATGTCTGTCGACATAAATCGACCGACGATGGTTCTTTCAGCGCCGGTGAAGGCACCGGAAGCATTTCCAGGTTCGCCGTCGATTTCTGGCCGAGAGAGCTTCTTCT
TTCGCCAGCCCTTTCCATCCAATTTTAGTCTTCTGCGTCTCCCTCCGGTTCGTCACACTCTATCGCTTACATTTGCTCGCCGTCGGAACCAAAATTCATCAGTCAGTCCG
TCTTCATCGTCTTCGAAGAAAAAGAAGAGAAATTTGATTCCAAAAGAAGCTAGGGGCGAGGAGGAGGATGTAGAGGAGGATGCTCTTGAGTTGTTGTTTAGTCAACTGGA
AGAAGATCTCAAAAATGACGCCCCTTCCTTGGATGACGGTGAAGAGGATGAATTTAGCGAAGAGCACCTTGCCAGACTAGAGCGCGAGTTAGGGTTAGCACTTGGCATCG
ATGTTGACGATGAAGAAGAAGAAGAAACAGAAAATGAAGAAGTTCTCGAGGATACTGAAGAAGAGGAAATGCCTGTAAAACTTAAGAACTGGCAACTTCGTCGACTAGCC
TCGGCTTTGAAGAAAGGCCGCCGTAAAACTAGCATCAAGAGTCTTGCTGCGGAGCTTTGTCTCGATAGGGCCATCGTTCTTCATTTGCTTCGGGAACCACCACCTAGTCT
TCTGATGTTGAGTGCTACTCTTCCAGACACTCCTACACCATCAATTCGAGAAACTAAAACATTACATACTGATGAAGAACTCATAGTAGTAGACACTGCAAAAGAGGCAG
AAGGGGTGACGGTGCCTGTTCATGTCATGCAACAGAGTTGGGCTGCTCAAAAGAGACTGAAGAAGGTTCAGATTGAAACTCTTGAAAGTGTTTATAGAAAAACAAAGCGG
CCCACTGTAAGTATAGTTTCCTCCATCATTTGTCGTCAAAAGAAGTCGATAAAATCTTTAAATTCCGAGTATCTGATTCCTGTGCCAAATGCGATGATTAGTAGCATCGT
CCAAGTGACAAATTTGCCTCGCAAAAGAATAGTGAAATGGTTTGAAGATAGGCGAGTTGAAGATGGGGTTCCTGATCAACGCGTGCCTTATGATCGGTCTGCTCCTAAAT
CTGTTTGATCTTCTTCATCTACCAAACGCCTCAATGGTGTATATCAATTAAAGAGAGAATTTTGGATGCTCATTCATCGTCAAAGGATTACCAAAATGATAAAAAGAGGC
AAAATATTTTGTACGACGGTGCTTACATATTTTAAATACTTATCTATAATAATAATAGTAGGGCCTACTTTATTGATTGACACCAAATAATTTCGGGCCTACTTTATTGA
TTGACACCAAATAATTTTACCGAAATAGTATGACGCATGTGATTGTCATTTTTACACTATTTAATTTAATATGTGCTGAGAAATTTAGAAACATATCT
Protein sequence
METVGELCPRAKIPHSIRFMIIQLNRIFSRLISLMISTTQILHFHFCNGTPSNCSTKCCNEVPGYGRLARSSTRDDFRVLIMACKHLQRLFFISRRKHLNNTRFGALQSN
SMLYHSAEHSSADQEVLPSEWYENAFRKIKNLSCSLKNVDLIDGRLFNVNDDSTIIDERIEQRMHTFKSLVRVLIGSPSAQRRITEMAVSSSIDCQPHAWFRNSSEREPI
VVDSLTKVSNFLNVSAQQRKLVRHNICPQVTQHHIWTGALDHMLKELNLELDPLSHQSTNKGIKMGHQIVSSCLKFLNDATTNSNAHFTSWMRPAPLRAIVDSSAPPRWE
DMLEMFTDLIGCLKDEKFLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKEASHQESLVQKKLSKTLGHSSRDIEVDLSGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVR
QLDRVMGLFKFVWEIAGMMGELELQGHLFCVGAEDRQLMSVDINRPTMVLSAPVKAPEAFPGSPSISGRESFFFRQPFPSNFSLLRLPPVRHTLSLTFARRRNQNSSVSP
SSSSSKKKKRNLIPKEARGEEEDVEEDALELLFSQLEEDLKNDAPSLDDGEEDEFSEEHLARLERELGLALGIDVDDEEEEETENEEVLEDTEEEEMPVKLKNWQLRRLA
SALKKGRRKTSIKSLAAELCLDRAIVLHLLREPPPSLLMLSATLPDTPTPSIRETKTLHTDEELIVVDTAKEAEGVTVPVHVMQQSWAAQKRLKKVQIETLESVYRKTKR
PTVSIVSSIICRQKKSIKSLNSEYLIPVPNAMISSIVQVTNLPRKRIVKWFEDRRVEDGVPDQRVPYDRSAPKSV