; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG05G009730 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG05G009730
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Descriptionleucine-rich repeat extensin-like protein 2
Genome locationCG_Chr05:10773652..10776584
RNA-Seq ExpressionClCG05G009730
SyntenyClCG05G009730
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039065.1 synaptojanin-1 [Cucumis melo var. makuwa]9.6e-22689.58Show/hide
Query:  MGKGEEQNLPLQQRREVAPSGDSSGFLCGQCSTAFHRVCKELNFKCFFVLILGFVVFVPGFFWLLPLHERNSGFEAKDTIKLSATVQVYFVLEKPVNELL
        MGKGEEQNLPLQQRREVA SGDSSGFLCGQCS AFHRVCKELNFKCFFVL+LGFVVFVPG FWLLPLHERNSGFEAK+ +KLSATVQVYFVLEKPV ELL
Subjt:  MGKGEEQNLPLQQRREVAPSGDSSGFLCGQCSTAFHRVCKELNFKCFFVLILGFVVFVPGFFWLLPLHERNSGFEAKDTIKLSATVQVYFVLEKPVNELL

Query:  PHIKRLEFDINGELDIPNLKVSILSMHGIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFGQPSTFQILKFPGGISIIPFQ
        PHIKRLEFDINGELDIP++KVSILSMH IGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFGQPST QILKFPGGISIIPFQ
Subjt:  PHIKRLEFDINGELDIPNLKVSILSMHGIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFGQPSTFQILKFPGGISIIPFQ

Query:  HASIWQFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLSLRTYENVYLQITNKIGSTMQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS
        HASIW+FPQIVFNFTLTNSISEILDNFAKFKS+LKFGL LR+YENVYLQITNKIGST+QPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS
Subjt:  HASIWQFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLSLRTYENVYLQITNKIGSTMQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS

Query:  VFGEVKSVSLSSYSKRPAKAMPPSFSPAPAPVPGDHVKLPSSPHPSRSAQSPATHSPPHANCETSSPTPSMVPPHSPREHSIPPTSYPKSTRLIVPPAEQ
        VFGEVKSVSLSSY KR +KAMPPSFSPAPAP PGDHV++PS PH  RS + PA HSPPHANC++ SP P MVP HSP EHSIPP SYPKSTRL+VPPA Q
Subjt:  VFGEVKSVSLSSYSKRPAKAMPPSFSPAPAPVPGDHVKLPSSPHPSRSAQSPATHSPPHANCETSSPTPSMVPPHSPREHSIPPTSYPKSTRLIVPPAEQ

Query:  PRVSSPRASPVEFSPLLPPDLLPKPKPSFHSKPGQTKEDTSHPDHVSFDIF
        PRVSSPRASP+EF PLLPPDLLPKPKPSFHSK GQT ED SHP HVSFD+F
Subjt:  PRVSSPRASPVEFSPLLPPDLLPKPKPSFHSKPGQTKEDTSHPDHVSFDIF

XP_004149972.2 uncharacterized protein LOC101222031 isoform X2 [Cucumis sativus]7.6e-22390.56Show/hide
Query:  MGKGEEQNLPLQQRREVAPSGDSSGFLCGQCSTAFHRVCKELNFKCFFVLILGFVVFVPGFFWLLPLHERNSGFEAKDTIKLSATVQVYFVLEKPVNELL
        MGKGEEQNLPLQQRREVA +GDSSGFLCGQCS AFHRVCKELNFKCFFVL+LGFVVFVPGFFWLLPLHERNSGFEAKD IKLSATVQVYFVLEKPV ELL
Subjt:  MGKGEEQNLPLQQRREVAPSGDSSGFLCGQCSTAFHRVCKELNFKCFFVLILGFVVFVPGFFWLLPLHERNSGFEAKDTIKLSATVQVYFVLEKPVNELL

Query:  PHIKRLEFDINGELDIPNLKVSILSMHGIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFGQPSTFQILKFPGGISIIPFQ
        PHIKRLEFDINGELDIPN+KVSILSMH IGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFGQPST QILKFPGGISIIPFQ
Subjt:  PHIKRLEFDINGELDIPNLKVSILSMHGIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFGQPSTFQILKFPGGISIIPFQ

Query:  HASIWQFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLSLRTYENVYLQITNKIGSTMQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS
        HASIW+FPQIVFNFTLTNSISEILDNFAKFKSQLKFGL LR+YENVYLQITNKIGST+QPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS
Subjt:  HASIWQFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLSLRTYENVYLQITNKIGSTMQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS

Query:  VFGEVKSVSLSSYSKRPAKAMPPSFSPAPAPVPGDHVKLPSSPHPSRSAQSPATHSPPHANCETSSPTPSMVPPHSPREHSIPPTSYPKSTRLIVPPAEQ
        VFGEVKSVSLSSY KR +KAMPPSFSPAPAP PG+HV++PS PHP RS + PA HSPPHANC++SSP PSMVP +SP EHSIPP SYPKSTRLIVPPA Q
Subjt:  VFGEVKSVSLSSYSKRPAKAMPPSFSPAPAPVPGDHVKLPSSPHPSRSAQSPATHSPPHANCETSSPTPSMVPPHSPREHSIPPTSYPKSTRLIVPPAEQ

Query:  PRVSSPRASPVEFSPLLPPDLLPKPKPSFHSKPGQTKEDTSHPDH
        PRV SPRASPVE  PLLPPDLLPKPKPSF SK GQT ED SHP H
Subjt:  PRVSSPRASPVEFSPLLPPDLLPKPKPSFHSKPGQTKEDTSHPDH

XP_008456084.1 PREDICTED: uncharacterized protein LOC103496125 isoform X1 [Cucumis melo]4.4e-22389.89Show/hide
Query:  MGKGEEQNLPLQQRREVAPSGDSSGFLCGQCSTAFHRVCKELNFKCFFVLILGFVVFVPGFFWLLPLHERNSGFEAKDTIKLSATVQVYFVLEKPVNELL
        MGKGEEQNLPLQQRREVA SGDSSGFLCGQCS AFHRVCKELNFKCFFVL+LGFVVFVPG FWLLPLHERNSGFEAK+ +KLSATVQVYFVLEKPV ELL
Subjt:  MGKGEEQNLPLQQRREVAPSGDSSGFLCGQCSTAFHRVCKELNFKCFFVLILGFVVFVPGFFWLLPLHERNSGFEAKDTIKLSATVQVYFVLEKPVNELL

Query:  PHIKRLEFDINGELDIPNLKVSILSMHGIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFGQPSTFQILKFPGGISIIPFQ
        PHIKRLEFDINGELDIP++KVSILSMH IGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFGQPST QILKFPGGISIIPFQ
Subjt:  PHIKRLEFDINGELDIPNLKVSILSMHGIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFGQPSTFQILKFPGGISIIPFQ

Query:  HASIWQFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLSLRTYENVYLQITNKIGSTMQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS
        HASIW+FPQIVFNFTLTNSISEILDNFAKFKS+LKFGL LR+YENVYLQITNKIGST+QPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS
Subjt:  HASIWQFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLSLRTYENVYLQITNKIGSTMQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS

Query:  VFGEVKSVSLSSYSKRPAKAMPPSFSPAPAPVPGDHVKLPSSPHPSRSAQSPATHSPPHANCETSSPTPSMVPPHSPREHSIPPTSYPKSTRLIVPPAEQ
        VFGEVKSVSLSSY KR +KAMPPSFSPAPAP PGDHV++PS PH  RS + PA HSPPHANC++ SP PSMVP HSP EHSIPP SYPKSTRL+VPPA Q
Subjt:  VFGEVKSVSLSSYSKRPAKAMPPSFSPAPAPVPGDHVKLPSSPHPSRSAQSPATHSPPHANCETSSPTPSMVPPHSPREHSIPPTSYPKSTRLIVPPAEQ

Query:  PRVSSPRASPVEFSPLLPPDLLPKPKPSFHSKPGQTKEDTSHPDH
        PRVSSPRASP+EF PLLPPDLLPKPKPSFHSK GQT ED SHP H
Subjt:  PRVSSPRASPVEFSPLLPPDLLPKPKPSFHSKPGQTKEDTSHPDH

XP_011651267.1 uncharacterized protein LOC101222031 isoform X1 [Cucumis sativus]2.0e-22390.58Show/hide
Query:  MGKGEEQNLPLQQRREVAPSGDSSGFLCGQCSTAFHRVCKELNFKCFFVLILGFVVFVPGFFWLLPLHERNSGFEAKDTIKLSATVQVYFVLEKPVNELL
        MGKGEEQNLPLQQRREVA +GDSSGFLCGQCS AFHRVCKELNFKCFFVL+LGFVVFVPGFFWLLPLHERNSGFEAKD IKLSATVQVYFVLEKPV ELL
Subjt:  MGKGEEQNLPLQQRREVAPSGDSSGFLCGQCSTAFHRVCKELNFKCFFVLILGFVVFVPGFFWLLPLHERNSGFEAKDTIKLSATVQVYFVLEKPVNELL

Query:  PHIKRLEFDINGELDIPNLKVSILSMHGIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFGQPSTFQILKFPGGISIIPFQ
        PHIKRLEFDINGELDIPN+KVSILSMH IGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFGQPST QILKFPGGISIIPFQ
Subjt:  PHIKRLEFDINGELDIPNLKVSILSMHGIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFGQPSTFQILKFPGGISIIPFQ

Query:  HASIWQFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLSLRTYENVYLQITNKIGSTMQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS
        HASIW+FPQIVFNFTLTNSISEILDNFAKFKSQLKFGL LR+YENVYLQITNKIGST+QPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS
Subjt:  HASIWQFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLSLRTYENVYLQITNKIGSTMQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS

Query:  VFGEVKSVSLSSYSKRPAKAMPPSFSPAPAPVPGDHVKLPSSPHPSRSAQSPATHSPPHANCETSSPTPSMVPPHSPREHSIPPTSYPKSTRLIVPPAEQ
        VFGEVKSVSLSSY KR +KAMPPSFSPAPAP PG+HV++PS PHP RS + PA HSPPHANC++SSP PSMVP +SP EHSIPP SYPKSTRLIVPPA Q
Subjt:  VFGEVKSVSLSSYSKRPAKAMPPSFSPAPAPVPGDHVKLPSSPHPSRSAQSPATHSPPHANCETSSPTPSMVPPHSPREHSIPPTSYPKSTRLIVPPAEQ

Query:  PRVSSPRASPVEFSPLLPPDLLPKPKPSFHSKPGQTKEDTSHPDHV
        PRV SPRASPVE  PLLPPDLLPKPKPSF SK GQT ED SHP HV
Subjt:  PRVSSPRASPVEFSPLLPPDLLPKPKPSFHSKPGQTKEDTSHPDHV

XP_038891823.1 uncharacterized protein LOC120081196 isoform X1 [Benincasa hispida]2.7e-22089.66Show/hide
Query:  MGKGEEQNLPLQQRREVAPSGDSSGFLCGQCSTAFHRVCKELNFKCFFVLILGFVVFVPGFFWLLPLHERNSGFEAKDTIKLSATVQVYFVLEKPVNELL
        MGKGEEQNLPLQQRR+VAPSGDSSGFLCGQCS AFHRV  E NFKCFFVLILGFVVFVPGFFWLLPLHERNSGFEAKDTIKL AT QVYFVL+KPVNELL
Subjt:  MGKGEEQNLPLQQRREVAPSGDSSGFLCGQCSTAFHRVCKELNFKCFFVLILGFVVFVPGFFWLLPLHERNSGFEAKDTIKLSATVQVYFVLEKPVNELL

Query:  PHIKRLEFDINGELDIPNLKVSILSMHGIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFGQPSTFQILKFPGGISIIPFQ
        PHIKRLEFDINGELDIPNLKVSILSMH +GESNRTYVVFGLLSEYIT PINPVSLSLLRSSLYD FLSESNLTLTTSIFGQPSTFQILKFPGGISIIPFQ
Subjt:  PHIKRLEFDINGELDIPNLKVSILSMHGIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFGQPSTFQILKFPGGISIIPFQ

Query:  HASIWQFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLSLRTYENVYLQITNKIGSTMQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS
        HASIW+FPQIVFNFTLTNSISEILDNFAKFKSQL FGL LR YENVYLQITNKIGSTMQPLVIVQASITSELGRI+SQRLQQLAAIINTSP+ NLGLDY+
Subjt:  HASIWQFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLSLRTYENVYLQITNKIGSTMQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS

Query:  VFGEVKSVSLSSYSKRPAKAMPPSFSPAPAPVPGDHVKLPSSPHPSRSAQSPATHSPPHANCETSSPTPSMVPPHSPREHSIPPTSYPKSTRLIVPPAEQ
        VFGEVKSVSLSSY KRP+KAMPPSFSPA AP   DHV+LPS+PHPSRSA+    HSP HANCETSSPTPSMVP H+PREHSIPP SYPKSTRLIVPPA +
Subjt:  VFGEVKSVSLSSYSKRPAKAMPPSFSPAPAPVPGDHVKLPSSPHPSRSAQSPATHSPPHANCETSSPTPSMVPPHSPREHSIPPTSYPKSTRLIVPPAEQ

Query:  PRVSSPRASPVEFSPLLPPDLLPKPKPSFHSKPGQTKEDTSHPDH
        P VSSPRASP+ FSPLLPPDLLPKPK SF SKPGQ KED+SHPDH
Subjt:  PRVSSPRASPVEFSPLLPPDLLPKPKPSFHSKPGQTKEDTSHPDH

TrEMBL top hitse value%identityAlignment
A0A0A0L6J0 Uncharacterized protein3.7e-22390.56Show/hide
Query:  MGKGEEQNLPLQQRREVAPSGDSSGFLCGQCSTAFHRVCKELNFKCFFVLILGFVVFVPGFFWLLPLHERNSGFEAKDTIKLSATVQVYFVLEKPVNELL
        MGKGEEQNLPLQQRREVA +GDSSGFLCGQCS AFHRVCKELNFKCFFVL+LGFVVFVPGFFWLLPLHERNSGFEAKD IKLSATVQVYFVLEKPV ELL
Subjt:  MGKGEEQNLPLQQRREVAPSGDSSGFLCGQCSTAFHRVCKELNFKCFFVLILGFVVFVPGFFWLLPLHERNSGFEAKDTIKLSATVQVYFVLEKPVNELL

Query:  PHIKRLEFDINGELDIPNLKVSILSMHGIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFGQPSTFQILKFPGGISIIPFQ
        PHIKRLEFDINGELDIPN+KVSILSMH IGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFGQPST QILKFPGGISIIPFQ
Subjt:  PHIKRLEFDINGELDIPNLKVSILSMHGIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFGQPSTFQILKFPGGISIIPFQ

Query:  HASIWQFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLSLRTYENVYLQITNKIGSTMQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS
        HASIW+FPQIVFNFTLTNSISEILDNFAKFKSQLKFGL LR+YENVYLQITNKIGST+QPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS
Subjt:  HASIWQFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLSLRTYENVYLQITNKIGSTMQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS

Query:  VFGEVKSVSLSSYSKRPAKAMPPSFSPAPAPVPGDHVKLPSSPHPSRSAQSPATHSPPHANCETSSPTPSMVPPHSPREHSIPPTSYPKSTRLIVPPAEQ
        VFGEVKSVSLSSY KR +KAMPPSFSPAPAP PG+HV++PS PHP RS + PA HSPPHANC++SSP PSMVP +SP EHSIPP SYPKSTRLIVPPA Q
Subjt:  VFGEVKSVSLSSYSKRPAKAMPPSFSPAPAPVPGDHVKLPSSPHPSRSAQSPATHSPPHANCETSSPTPSMVPPHSPREHSIPPTSYPKSTRLIVPPAEQ

Query:  PRVSSPRASPVEFSPLLPPDLLPKPKPSFHSKPGQTKEDTSHPDH
        PRV SPRASPVE  PLLPPDLLPKPKPSF SK GQT ED SHP H
Subjt:  PRVSSPRASPVEFSPLLPPDLLPKPKPSFHSKPGQTKEDTSHPDH

A0A1S3C2E0 uncharacterized protein LOC103496125 isoform X12.1e-22389.89Show/hide
Query:  MGKGEEQNLPLQQRREVAPSGDSSGFLCGQCSTAFHRVCKELNFKCFFVLILGFVVFVPGFFWLLPLHERNSGFEAKDTIKLSATVQVYFVLEKPVNELL
        MGKGEEQNLPLQQRREVA SGDSSGFLCGQCS AFHRVCKELNFKCFFVL+LGFVVFVPG FWLLPLHERNSGFEAK+ +KLSATVQVYFVLEKPV ELL
Subjt:  MGKGEEQNLPLQQRREVAPSGDSSGFLCGQCSTAFHRVCKELNFKCFFVLILGFVVFVPGFFWLLPLHERNSGFEAKDTIKLSATVQVYFVLEKPVNELL

Query:  PHIKRLEFDINGELDIPNLKVSILSMHGIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFGQPSTFQILKFPGGISIIPFQ
        PHIKRLEFDINGELDIP++KVSILSMH IGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFGQPST QILKFPGGISIIPFQ
Subjt:  PHIKRLEFDINGELDIPNLKVSILSMHGIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFGQPSTFQILKFPGGISIIPFQ

Query:  HASIWQFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLSLRTYENVYLQITNKIGSTMQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS
        HASIW+FPQIVFNFTLTNSISEILDNFAKFKS+LKFGL LR+YENVYLQITNKIGST+QPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS
Subjt:  HASIWQFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLSLRTYENVYLQITNKIGSTMQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS

Query:  VFGEVKSVSLSSYSKRPAKAMPPSFSPAPAPVPGDHVKLPSSPHPSRSAQSPATHSPPHANCETSSPTPSMVPPHSPREHSIPPTSYPKSTRLIVPPAEQ
        VFGEVKSVSLSSY KR +KAMPPSFSPAPAP PGDHV++PS PH  RS + PA HSPPHANC++ SP PSMVP HSP EHSIPP SYPKSTRL+VPPA Q
Subjt:  VFGEVKSVSLSSYSKRPAKAMPPSFSPAPAPVPGDHVKLPSSPHPSRSAQSPATHSPPHANCETSSPTPSMVPPHSPREHSIPPTSYPKSTRLIVPPAEQ

Query:  PRVSSPRASPVEFSPLLPPDLLPKPKPSFHSKPGQTKEDTSHPDH
        PRVSSPRASP+EF PLLPPDLLPKPKPSFHSK GQT ED SHP H
Subjt:  PRVSSPRASPVEFSPLLPPDLLPKPKPSFHSKPGQTKEDTSHPDH

A0A5A7TCD6 Synaptojanin-14.6e-22689.58Show/hide
Query:  MGKGEEQNLPLQQRREVAPSGDSSGFLCGQCSTAFHRVCKELNFKCFFVLILGFVVFVPGFFWLLPLHERNSGFEAKDTIKLSATVQVYFVLEKPVNELL
        MGKGEEQNLPLQQRREVA SGDSSGFLCGQCS AFHRVCKELNFKCFFVL+LGFVVFVPG FWLLPLHERNSGFEAK+ +KLSATVQVYFVLEKPV ELL
Subjt:  MGKGEEQNLPLQQRREVAPSGDSSGFLCGQCSTAFHRVCKELNFKCFFVLILGFVVFVPGFFWLLPLHERNSGFEAKDTIKLSATVQVYFVLEKPVNELL

Query:  PHIKRLEFDINGELDIPNLKVSILSMHGIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFGQPSTFQILKFPGGISIIPFQ
        PHIKRLEFDINGELDIP++KVSILSMH IGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFGQPST QILKFPGGISIIPFQ
Subjt:  PHIKRLEFDINGELDIPNLKVSILSMHGIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFGQPSTFQILKFPGGISIIPFQ

Query:  HASIWQFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLSLRTYENVYLQITNKIGSTMQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS
        HASIW+FPQIVFNFTLTNSISEILDNFAKFKS+LKFGL LR+YENVYLQITNKIGST+QPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS
Subjt:  HASIWQFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLSLRTYENVYLQITNKIGSTMQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS

Query:  VFGEVKSVSLSSYSKRPAKAMPPSFSPAPAPVPGDHVKLPSSPHPSRSAQSPATHSPPHANCETSSPTPSMVPPHSPREHSIPPTSYPKSTRLIVPPAEQ
        VFGEVKSVSLSSY KR +KAMPPSFSPAPAP PGDHV++PS PH  RS + PA HSPPHANC++ SP P MVP HSP EHSIPP SYPKSTRL+VPPA Q
Subjt:  VFGEVKSVSLSSYSKRPAKAMPPSFSPAPAPVPGDHVKLPSSPHPSRSAQSPATHSPPHANCETSSPTPSMVPPHSPREHSIPPTSYPKSTRLIVPPAEQ

Query:  PRVSSPRASPVEFSPLLPPDLLPKPKPSFHSKPGQTKEDTSHPDHVSFDIF
        PRVSSPRASP+EF PLLPPDLLPKPKPSFHSK GQT ED SHP HVSFD+F
Subjt:  PRVSSPRASPVEFSPLLPPDLLPKPKPSFHSKPGQTKEDTSHPDHVSFDIF

A0A6J1D566 uncharacterized protein LOC1110177074.7e-18676.86Show/hide
Query:  MGKGEEQNLPLQQRREVAPSGDSSGFLCGQCSTAFHRVCKELNFKCFFVLILGFVVFVPGFFWLLPLHERNSGFEAKDTIKLSATVQVYFVLEKPVNELL
        MGKGEEQNLP+Q+RREVA SGDSSGFLCGQCS A  RV +ELNFKC FVLILGF+VFVPGFFWLLPL ERNSGFEAKD IKLSATVQVYFVLEKPV ELL
Subjt:  MGKGEEQNLPLQQRREVAPSGDSSGFLCGQCSTAFHRVCKELNFKCFFVLILGFVVFVPGFFWLLPLHERNSGFEAKDTIKLSATVQVYFVLEKPVNELL

Query:  PHIKRLEFDINGELDIPNLKVSILSMHGIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFGQPSTFQILKFPGGISIIPFQ
        PHIKRLEFDINGELDI N+KVS+LSMH +GESNRTYVVFG+LSEYITAPINPVSLSL+RS+LYD FL ESNLTLTT IFGQPSTF+ILKFPGGISIIPFQ
Subjt:  PHIKRLEFDINGELDIPNLKVSILSMHGIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFGQPSTFQILKFPGGISIIPFQ

Query:  HASIWQFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLSLRTYENVYLQITNKIGSTMQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS
        HASIWQFPQIVFNFTL+NSISE+LDNFAKF+S+L FGL LR YENVY QITNKIGSTMQP +IVQASI+SELGR+TSQRLQQLAAIIN SPERNLGLDYS
Subjt:  HASIWQFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLSLRTYENVYLQITNKIGSTMQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS

Query:  VFGEVKSVSLSSYSKRPAKAMPPSFSPAPAPVPGDHVKLPSSPHPSRSA-----QSPATHSPPHANCETSSPTPSMVPPHSPREHSIPPTSYPKSTRLIV
        VFGEVKSVSLSSY K  + ++PPS SPAPAP PGDH +  S+P  SRS+     Q PA  SPP A C   SP PS+VP HSP  HS+PP+SYP STRLIV
Subjt:  VFGEVKSVSLSSYSKRPAKAMPPSFSPAPAPVPGDHVKLPSSPHPSRSA-----QSPATHSPPHANCETSSPTPSMVPPHSPREHSIPPTSYPKSTRLIV

Query:  PPAEQPRVSSPRASPVEFSPLLPPDLLPKPKPSFHSKPGQTKED--------TSHPDH
         P            PV F+PLLPPDLLPKPKP F  KPG  KE+        +SHPDH
Subjt:  PPAEQPRVSSPRASPVEFSPLLPPDLLPKPKPSFHSKPGQTKED--------TSHPDH

A0A6J1E5G3 uncharacterized protein LOC111430758 isoform X15.2e-18578.02Show/hide
Query:  MGKGEEQNLPLQQRREVAPSGDSSGFLCGQCSTAFHRVCKELNFKCFFVLILGFVVFVPGFFWLLPLHERNSGFEAKDTIKLSATVQVYFVLEKPVNELL
        MGKGE+QNLP Q RRE     DSSGF+C +CS +F R   ELNFKC FVLILGF VF+PGFFWLLPLHERN GFEAKD IKLSATVQVYFVLEKPV ELL
Subjt:  MGKGEEQNLPLQQRREVAPSGDSSGFLCGQCSTAFHRVCKELNFKCFFVLILGFVVFVPGFFWLLPLHERNSGFEAKDTIKLSATVQVYFVLEKPVNELL

Query:  PHIKRLEFDINGELDIPNLKVSILSMHGIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFGQPSTFQILKFPGGISIIPFQ
        PHIKRLEFDINGELDIPN+KVS+LSMH +GESNRTYVVFGLLSEYIT PINPVSLSLLRSSLYD FL +SNLTLTTSIFGQPS FQILKFPGGISIIPFQ
Subjt:  PHIKRLEFDINGELDIPNLKVSILSMHGIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFGQPSTFQILKFPGGISIIPFQ

Query:  HASIWQFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLSLRTYENVYLQITNKIGSTMQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS
         ASIWQFPQIVFNFTLTNSISEIL+ FAKF SQLK  L LR YENVYLQITNKIGSTMQP+V+VQASI+SELGRIT+QRLQQLAAIINTS ERNLGLDYS
Subjt:  HASIWQFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLSLRTYENVYLQITNKIGSTMQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS

Query:  VFGEVKSVSLSSYSKRPAKAMPPSFSPAPAPVPGDHVKLPSSPHPSRSAQSPATHSPPHANCETSSPTPSMVPPHSPREHSIPPTSYPKSTRLI-VPPAE
        VFGEVK +SLSSY K  + AMPPSFSPAPAP PGDHV+LPS+PHPSRSA+SPA  SPP ANCETSSP  SMVP  S  EHS+PP  YPKSTRLI VPPA+
Subjt:  VFGEVKSVSLSSYSKRPAKAMPPSFSPAPAPVPGDHVKLPSSPHPSRSAQSPATHSPPHANCETSSPTPSMVPPHSPREHSIPPTSYPKSTRLI-VPPAE

Query:  QPRVSSPRASPVEFSPLLPPDLLPKPKPSFHSKPGQTKED---------TSHPDH
        QPRVSSPRASPV                 FH KPG+TKED         +SH DH
Subjt:  QPRVSSPRASPVEFSPLLPPDLLPKPKPSFHSKPGQTKED---------TSHPDH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10790.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT3G56590.2)9.7e-5138.6Show/hide
Query:  EEQNLPLQQRREVAPSGDSSGFLCGQ-CSTAFHRVCKELNFKCFFVLILGFVVFVPGFFWLLPLHERNSGFEAKDTIKLSATVQVYFVLEKPVNELLPHI
        +E  L LQQ      + +SS    G+ CS+AF R+   +  +C  VL+L   + +   FWL P     S F+A  T+KL+A+VQ  F L+KPV+E++ H 
Subjt:  EEQNLPLQQRREVAPSGDSSGFLCGQ-CSTAFHRVCKELNFKCFFVLILGFVVFVPGFFWLLPLHERNSGFEAKDTIKLSATVQVYFVLEKPVNELLPHI

Query:  KRLEFDINGELDIP-NLKVSILSMHGIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFGQPSTFQILKFPGGISIIPFQHA
         ++E DI   + +  N KV++LS++  G SN T V F +L       I+  SLSLLRSS    F   S L LTTS FG+P++FQ+LKFPGGI++ P + A
Subjt:  KRLEFDINGELDIP-NLKVSILSMHGIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFGQPSTFQILKFPGGISIIPFQHA

Query:  SIWQFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLSLRTYENVYLQITNKIGSTMQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYSVF
         +     ++F+ T+  SIS + D         +  LSL  YE+V+ Q+TNK GST+ P +  Q  +   + +   QRL     II TS  +NLGLD +VF
Subjt:  SIWQFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLSLRTYENVYLQITNKIGSTMQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYSVF

Query:  GEVKSVSLSSYSKRPAKAMPPSFSPAPAP
        GEVK ++ S+Y            +PAP P
Subjt:  GEVKSVSLSSYSKRPAKAMPPSFSPAPAP

AT3G10810.1 zinc finger (C3HC4-type RING finger) family protein4.5e-4033.91Show/hide
Query:  SGDSS--GFLCGQCSTAFHRVCKELNFKCFFVLILGFVVFVPGFFWLLPLHERNSGFEAKDTIKLSATVQVYFVLEKPVNELLPHIKRLEFDINGELDIP
        +GDS+     CG C      +   + FKC FVL+L   +F+   F LLP              +  A V   F + +  + L  +  +L+ DI  E+   
Subjt:  SGDSS--GFLCGQCSTAFHRVCKELNFKCFFVLILGFVVFVPGFFWLLPLHERNSGFEAKDTIKLSATVQVYFVLEKPVNELLPHIKRLEFDINGELDIP

Query:  NLKVSILSMHGIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFGQPSTFQILKFPGGISIIPFQHASIWQFPQIVFNFTLT
        ++KV+IL++    E N T VVFG+  +     I P+SLS ++       +++S L LT S+FG+   F++LKFPGGI++IP Q A   Q  +IVFNFTL 
Subjt:  NLKVSILSMHGIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFGQPSTFQILKFPGGISIIPFQHASIWQFPQIVFNFTLT

Query:  NSISEILDNFAKFKSQLKFGLSLRTYENVYLQITNKIGSTMQPLVIVQASITSELGRI-TSQRLQQLAAIINTSPERNLGLDYSVFGEVKSVSLSSYSKR
         SI +I  NF    SQLK GL+L  YEN+Y+ ++N  GST+ P   V +S+   +G   +S RL+QL   I  S  +NLGL+ ++FG+VK V LSS+   
Subjt:  NSISEILDNFAKFKSQLKFGLSLRTYENVYLQITNKIGSTMQPLVIVQASITSELGRI-TSQRLQQLAAIINTSPERNLGLDYSVFGEVKSVSLSSYSKR

Query:  PAKAMPPSFSPAPAPVPGDHVKLPSSPHPSRSAQSPATHSPPHANCETSSP--TPSMVPPHSPREH-------SIPPTSYP--------KSTRLIVPPAE
         + +   S SP+P+P    H K     H          H+  H +    SP   P + P  SP  H       S PP   P        K  +    PA 
Subjt:  PAKAMPPSFSPAPAPVPGDHVKLPSSPHPSRSAQSPATHSPPHANCETSSP--TPSMVPPHSPREH-------SIPPTSYP--------KSTRLIVPPAE

Query:  QPRVSSP
         P   +P
Subjt:  QPRVSSP

AT3G56590.1 hydroxyproline-rich glycoprotein family protein2.7e-4533.4Show/hide
Query:  MGKG--EEQNLPLQQRREVAPSGDSSGF-LCGQCSTAFHRVCKELNFKCFFVLILGFVVFVPGFFWLLP-LHERNSGFEAKDTIKLSATVQVYFVLEKPV
        MGK   EEQNLP+      A +    G   C  C      +    + +C  +L     VF+   FWL P L   + G    D       +   F + KP+
Subjt:  MGKG--EEQNLPLQQRREVAPSGDSSGF-LCGQCSTAFHRVCKELNFKCFFVLILGFVVFVPGFFWLLP-LHERNSGFEAKDTIKLSATVQVYFVLEKPV

Query:  NELLPHIKRLEFDINGELDIPNLKVSILSMHGIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFGQPSTFQILKFPGGISI
        + +  ++ +LE DI  E+  P  KV +L++  +G+ NRT V+F +  E   + I     SL++++       + +  LT S+FG+P  F++LKFPGGI++
Subjt:  NELLPHIKRLEFDINGELDIPNLKVSILSMHGIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFGQPSTFQILKFPGGISI

Query:  IPFQHASIWQFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLSLRTYENVYLQITNKIGSTMQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLG
        IP Q     Q  Q++FNFTL  SI +I  NF +  SQLK G++L +YEN+Y+ ++N  GST+ P  IV +S+    G  +S RL+QLA  I +S  +NLG
Subjt:  IPFQHASIWQFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLSLRTYENVYLQITNKIGSTMQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLG

Query:  LDYSVFGEVKSVSLSS-YSKRPAKAMPPSFSPAPAPVPGDHVKLPSSPHPSRSAQSPATHSPPHANCETSSPTP-SMVPPHSP---------------RE
        L+++VFG+VK V LSS     PA +  PS SP P      H       H    A  P+   P       S+PT  S +PP +P                 
Subjt:  LDYSVFGEVKSVSLSS-YSKRPAKAMPPSFSPAPAPVPGDHVKLPSSPHPSRSAQSPATHSPPHANCETSSPTP-SMVPPHSP---------------RE

Query:  HSIPPTSYPKSTRLIVPPAEQPRVSSPRASPVEFSPL---LPPDLLPKPKPSFHSKPGQTKEDTSHP
        H+ PPT  P  ++   PPA  P      A PV  SPL   +   + P  K S  S+P   K  +  P
Subjt:  HSIPPTSYPKSTRLIVPPAEQPRVSSPRASPVEFSPL---LPPDLLPKPKPSFHSKPGQTKEDTSHP

AT3G56590.2 hydroxyproline-rich glycoprotein family protein2.1e-4533.33Show/hide
Query:  MGKG--EEQNLPLQQRREVAPSGDSSGF-LCGQCSTAFHRVCKELNFKCFFVLILGFVVFVPGFFWLLP-LHERNSGFEAKDTIKLSATVQVYFVLEKPV
        MGK   EEQNLP+      A +    G   C  C      +    + +C  +L     VF+   FWL P L   + G    D       +   F + KP+
Subjt:  MGKG--EEQNLPLQQRREVAPSGDSSGF-LCGQCSTAFHRVCKELNFKCFFVLILGFVVFVPGFFWLLP-LHERNSGFEAKDTIKLSATVQVYFVLEKPV

Query:  NELLPHIKRLEFDINGELDIPNLKVSILSMHGIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFGQPSTFQILKFPGGISI
        + +  ++ +LE DI  E+  P  KV +L++  +G+ NRT V+F +  E   + I     SL++++       + +  LT S+FG+P  F++LKFPGGI++
Subjt:  NELLPHIKRLEFDINGELDIPNLKVSILSMHGIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFGQPSTFQILKFPGGISI

Query:  IPFQHASIWQFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLSLRTYENVYLQITNKIGSTMQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLG
        IP Q     Q  Q++FNFTL  SI +I  NF +  SQLK G++L +YEN+Y+ ++N  GST+ P  IV +S+    G  +S RL+QLA  I +S  +NLG
Subjt:  IPFQHASIWQFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLSLRTYENVYLQITNKIGSTMQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLG

Query:  LDYSVFGEVKSVSLSS-YSKRPAKAMPPSFSPAPAPVPGDHVKLPSSPHPSRSAQSPATHSPPHANCETSSPTP-SMVPPHSP---------------RE
        L+++VFG+VK V LSS     PA +  PS SP P      H       H    A  P+   P       S+PT  S +PP +P                 
Subjt:  LDYSVFGEVKSVSLSS-YSKRPAKAMPPSFSPAPAPVPGDHVKLPSSPHPSRSAQSPATHSPPHANCETSSPTP-SMVPPHSP---------------RE

Query:  HSIPPTSYPKSTRLIVPPAEQPRVSSPRASPVEFSPL---LPPDLLPKPKPSFHSKPGQTKEDTSHPDHVSFDI
        H+ PPT  P  ++   PPA  P      A PV  SPL   +   + P  K S  S+P   K  +  P   S  I
Subjt:  HSIPPTSYPKSTRLIVPPAEQPRVSSPRASPVEFSPL---LPPDLLPKPKPSFHSKPGQTKEDTSHPDHVSFDI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAAAGGGGAAGAGCAAAATCTGCCGTTGCAGCAGCGTCGTGAGGTGGCTCCAAGTGGGGATTCTTCTGGGTTTCTTTGTGGTCAATGTTCGACTGCTTTTCATAG
AGTTTGTAAGGAGTTGAATTTCAAGTGTTTCTTCGTTTTGATTTTGGGGTTTGTGGTGTTTGTCCCTGGATTCTTTTGGCTTCTTCCTCTTCATGAAAGAAATTCTGGGT
TTGAGGCAAAAGACACCATTAAACTCAGTGCTACAGTTCAGGTGTATTTCGTTCTTGAAAAGCCCGTGAATGAGCTTCTCCCTCACATCAAGAGATTAGAGTTTGATATC
AATGGTGAATTAGACATTCCAAACTTGAAGGTTTCCATTCTATCCATGCATGGTATAGGTGAGTCGAACAGGACTTACGTGGTTTTTGGTCTTCTTTCTGAATACATAAC
TGCTCCAATAAATCCAGTGTCCTTAAGTCTGCTGAGATCGTCTTTATATGACTTTTTCCTTTCCGAATCCAACCTTACTTTGACGACATCGATTTTTGGACAGCCATCGA
CATTTCAAATTCTCAAGTTTCCAGGGGGAATTTCTATAATCCCATTTCAACATGCTTCAATTTGGCAGTTTCCCCAGATTGTATTTAACTTCACTCTTACTAACTCCATT
TCTGAAATACTCGACAACTTTGCCAAGTTCAAGAGCCAACTAAAGTTTGGATTGAGTCTGAGGACTTATGAGAATGTGTATTTGCAAATAACAAACAAGATTGGCTCGAC
GATGCAACCACTTGTAATTGTTCAGGCTTCTATTACGTCGGAATTGGGACGCATAACGTCACAGAGATTACAGCAGTTGGCTGCAATCATCAACACCTCTCCTGAAAGAA
ATCTTGGCCTTGATTATTCTGTTTTTGGAGAAGTCAAGAGTGTCAGTTTGTCTTCTTATTCGAAGAGACCTGCCAAGGCAATGCCTCCGAGTTTTTCTCCCGCTCCTGCC
CCAGTGCCTGGTGACCATGTAAAACTACCGAGTAGCCCACATCCATCGAGATCCGCGCAATCACCTGCAACTCATTCCCCACCTCATGCAAATTGTGAAACCTCGTCTCC
AACCCCTTCAATGGTTCCTCCACATTCCCCTCGTGAACATTCAATACCCCCAACCTCCTATCCAAAGTCTACAAGACTGATCGTTCCTCCGGCTGAACAACCTCGAGTTT
CTTCTCCACGTGCATCTCCGGTAGAGTTTTCACCGCTTTTGCCCCCCGATCTGTTACCTAAACCAAAGCCTTCTTTTCACTCCAAACCAGGGCAGACAAAGGAAGATACG
TCACATCCAGATCATGTAAGCTTTGACATATTTTGTTAG
mRNA sequenceShow/hide mRNA sequence
GTTCATCAATCTCCGCCAAACCCAAAAATCAAAACCCAATTTAATATTCATTCTCAATTCAATGGACAGTGAAAAAGAAGTTGCAGGAAGGAAGTGAATGAGCGTTCCAT
TAGATCTCCATTTGTTGGTCTTCGTGGACAAACTTTTCTTTAGCCTTTTCCAATTGTTATTATTTTAAATCCTTCAAGTGGGGTTTCTCTCTTTATTCTCATCTTCCCCT
TTACCCTCTCTTTGCCTTTCACCTTTGCTTCTCTTCAACGTTGGCTTGCCCCTAAAAATACCGATCCATTTCCATTCAGTTTGGTTCTTTCTAAGAAACCCACTAGGGGT
GCCTTTGAGTGTGCTCTTTTCGAGTTCGATTTTGTTTTCAGTGGTGGATTGTTACAGATGCTCTGCAAATGGGGAAAGGGGAAGAGCAAAATCTGCCGTTGCAGCAGCGT
CGTGAGGTGGCTCCAAGTGGGGATTCTTCTGGGTTTCTTTGTGGTCAATGTTCGACTGCTTTTCATAGAGTTTGTAAGGAGTTGAATTTCAAGTGTTTCTTCGTTTTGAT
TTTGGGGTTTGTGGTGTTTGTCCCTGGATTCTTTTGGCTTCTTCCTCTTCATGAAAGAAATTCTGGGTTTGAGGCAAAAGACACCATTAAACTCAGTGCTACAGTTCAGG
TGTATTTCGTTCTTGAAAAGCCCGTGAATGAGCTTCTCCCTCACATCAAGAGATTAGAGTTTGATATCAATGGTGAATTAGACATTCCAAACTTGAAGGTTTCCATTCTA
TCCATGCATGGTATAGGTGAGTCGAACAGGACTTACGTGGTTTTTGGTCTTCTTTCTGAATACATAACTGCTCCAATAAATCCAGTGTCCTTAAGTCTGCTGAGATCGTC
TTTATATGACTTTTTCCTTTCCGAATCCAACCTTACTTTGACGACATCGATTTTTGGACAGCCATCGACATTTCAAATTCTCAAGTTTCCAGGGGGAATTTCTATAATCC
CATTTCAACATGCTTCAATTTGGCAGTTTCCCCAGATTGTATTTAACTTCACTCTTACTAACTCCATTTCTGAAATACTCGACAACTTTGCCAAGTTCAAGAGCCAACTA
AAGTTTGGATTGAGTCTGAGGACTTATGAGAATGTGTATTTGCAAATAACAAACAAGATTGGCTCGACGATGCAACCACTTGTAATTGTTCAGGCTTCTATTACGTCGGA
ATTGGGACGCATAACGTCACAGAGATTACAGCAGTTGGCTGCAATCATCAACACCTCTCCTGAAAGAAATCTTGGCCTTGATTATTCTGTTTTTGGAGAAGTCAAGAGTG
TCAGTTTGTCTTCTTATTCGAAGAGACCTGCCAAGGCAATGCCTCCGAGTTTTTCTCCCGCTCCTGCCCCAGTGCCTGGTGACCATGTAAAACTACCGAGTAGCCCACAT
CCATCGAGATCCGCGCAATCACCTGCAACTCATTCCCCACCTCATGCAAATTGTGAAACCTCGTCTCCAACCCCTTCAATGGTTCCTCCACATTCCCCTCGTGAACATTC
AATACCCCCAACCTCCTATCCAAAGTCTACAAGACTGATCGTTCCTCCGGCTGAACAACCTCGAGTTTCTTCTCCACGTGCATCTCCGGTAGAGTTTTCACCGCTTTTGC
CCCCCGATCTGTTACCTAAACCAAAGCCTTCTTTTCACTCCAAACCAGGGCAGACAAAGGAAGATACGTCACATCCAGATCATGTAAGCTTTGACATATTTTGTTAG
Protein sequenceShow/hide protein sequence
MGKGEEQNLPLQQRREVAPSGDSSGFLCGQCSTAFHRVCKELNFKCFFVLILGFVVFVPGFFWLLPLHERNSGFEAKDTIKLSATVQVYFVLEKPVNELLPHIKRLEFDI
NGELDIPNLKVSILSMHGIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFGQPSTFQILKFPGGISIIPFQHASIWQFPQIVFNFTLTNSI
SEILDNFAKFKSQLKFGLSLRTYENVYLQITNKIGSTMQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYSVFGEVKSVSLSSYSKRPAKAMPPSFSPAPA
PVPGDHVKLPSSPHPSRSAQSPATHSPPHANCETSSPTPSMVPPHSPREHSIPPTSYPKSTRLIVPPAEQPRVSSPRASPVEFSPLLPPDLLPKPKPSFHSKPGQTKEDT
SHPDHVSFDIFC