CuGenDBv2

Tan0016052 (gene) of Snake gourd v1 genome

Gene ID: Tan0016052
Organism: Trichosanthes anguina (Snake gourd v1)
Description: protein PHLOEM PROTEIN 2-LIKE A10
Genome location: LG01:112679763..112682064
RNA-Seq Expression: Tan0016052
Synteny: Tan0016052
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
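
The Genome location value above follows a `SEQID:START..END` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state this), the string can be split and the span computed as follows. The helper name `parse_location` is hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


seqid, start, end = parse_location("LG01:112679763..112682064")
# Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene spans 2302 bases on LG01.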


Homology
GenBank top hits | e value | % identity
KAG6591436.1 Protein PHLOEM PROTEIN 2-LIKE A10, partial [Cucurbita argyrosperma subsp. sororia] | 1.3e-203 | 89.91
Query:  MDFELVRRALGFSQRRKKWLILLAVMGFSGYGAYRVYHLPSVERKRKRLMKLFGAMISVAEMVADSSEAIGVVSKDLKEFLKSDSDQIPNSLKQISKIAK
        MD ELVRR LGFSQRRKKWLILL VMG SGYGAY+VYHLPSVER+RKRLMKLFGAMISVAEMVADSSEAIGVVSKDLKEFLKSDSDQIPNSLKQISKIAK
Subjt:  MDFELVRRALGFSQRRKKWLILLAVMGFSGYGAYRVYHLPSVERKRKRLMKLFGAMISVAEMVADSSEAIGVVSKDLKEFLKSDSDQIPNSLKQISKIAK

Query:  SEEFSESLEKVTEAFTVGMMRGYKSVTMNEQNLEAGSANSSFSSGVVEKLFSTAGTGFASVVVGSFAKNLVMGYYSIPGSVDDASRNLQTGAYKSGSEFP
        SEEFSESLEK+TEAFT+GMMRGYKSV+ NEQN EAG ANSSF+S VVEKLFSTAGTGFASVVVGSFAKNLVMGYYSIPGSVDDASRN  + +YKSGSEFP
Subjt:  SEEFSESLEKVTEAFTVGMMRGYKSVTMNEQNLEAGSANSSFSSGVVEKLFSTAGTGFASVVVGSFAKNLVMGYYSIPGSVDDASRNLQTGAYKSGSEFP

Query:  DRPVPRWVTVASDEKCKNVIADCIQVFVSTAVAVYLDKTMDVNVYNDLFSGLTNPTHQDKVKDMLVSVCNGAVETLVKTSHQVLTSSRTTSNLSPVPSSN
        D  VPRWVTVAS+EK KN+IADCIQVFVSTAVAVYLDKTMDVNVYNDLFSGLTNPTHQDKVKDMLVSVCNGAVETLVKTSHQVLTSSR+TS LSPV S N
Subjt:  DRPVPRWVTVASDEKCKNVIADCIQVFVSTAVAVYLDKTMDVNVYNDLFSGLTNPTHQDKVKDMLVSVCNGAVETLVKTSHQVLTSSRTTSNLSPVPSSN

Query:  ELSKIGDDPFSEEASPKKMAVASSIESSQNGWVGTVSSTLAVPRNRKFVLDLTGRVTFETTRSVVEFLMWKLMDGLKRSLDIFHDEVVGRGLEVIRYFSA
         LSKIG++PFSE A PKKMA+ASS E SQNGWVG VSSTLAVPRNRKFVLDLTGRVTFETTRS VEFL+WKLMDGLKRSLDI HDEVVGRGL+VIRYFSA
Subjt:  ELSKIGDDPFSEEASPKKMAVASSIESSQNGWVGTVSSTLAVPRNRKFVLDLTGRVTFETTRSVVEFLMWKLMDGLKRSLDIFHDEVVGRGLEVIRYFSA

Query:  KSSVIVTICLALYLHIFGGTRLLLPA
        KSSV+VTICLALYLH+FGGTRLLLPA
Subjt:  KSSVIVTICLALYLHIFGGTRLLLPA

XP_022142088.1 protein PHLOEM PROTEIN 2-LIKE A10 [Momordica charantia] | 1.3e-203 | 88.73
Query:  MDFELVRRALGFSQRRKKWLILLAVMGFSGYGAYRVYHLPSVERKRKRLMKLFGAMISVAEMVADSSEAIGVVSKDLKEFLKSDSDQIPNSLKQISKIAK
        MD  LVR  LGFSQRRKKWLILLAVMG SGYGAY+VYHLPSVERKRKRLMKLFGA+ISVAEMVADSSEAIGVVSKDLKEFLKSDSDQIPNSLKQISKIA+
Subjt:  MDFELVRRALGFSQRRKKWLILLAVMGFSGYGAYRVYHLPSVERKRKRLMKLFGAMISVAEMVADSSEAIGVVSKDLKEFLKSDSDQIPNSLKQISKIAK

Query:  SEEFSESLEKVTEAFTVGMMRGYKSVTMNEQNLEAGSANSSFSSGVVEKLFSTAGTGFASVVVGSFAKNLVMGYYSIPGSVDDASRNLQTGAYKSGSEFP
        SEEFSES+EKVTEAFTVGMMRGYKSVT NEQN+EAGS NSSF++GVVEKLFSTAGTGFASVVVGSFAKNLVMGYYS+PGS DDASRNLQTGAY+SGSE  
Subjt:  SEEFSESLEKVTEAFTVGMMRGYKSVTMNEQNLEAGSANSSFSSGVVEKLFSTAGTGFASVVVGSFAKNLVMGYYSIPGSVDDASRNLQTGAYKSGSEFP

Query:  DRPVPRWVTVASDEKCKNVIADCIQVFVSTAVAVYLDKTMDVNVYNDLFSGLTNPTHQDKVKDMLVSVCNGAVETLVKTSHQVLTSSRTTSNLSPVPSSN
        +  VPRWV+VA++EKCKNV+ADCIQVFVSTAVAVYLDKTMDVNVY++LFSGLTNPTHQ+KVKDMLVS+CNGAVETLVKTSHQVLTSS+T+SNLSPV S N
Subjt:  DRPVPRWVTVASDEKCKNVIADCIQVFVSTAVAVYLDKTMDVNVYNDLFSGLTNPTHQDKVKDMLVSVCNGAVETLVKTSHQVLTSSRTTSNLSPVPSSN

Query:  ELSKIGDDPFSEEASPKKMAVASSIESSQNGWVGTVSSTLAVPRNRKFVLDLTGRVTFETTRSVVEFLMWKLMDGLKRSLDIFHDEVVGRGLEVIRYFSA
         LSKIGDDPFSEE  PK++AVA+SIE SQ+ WVG VSSTLAVPRNRKFVLDLTGRVTFETTRSVVEFL+WKLMDGLKRSLDI HDEVVGRGLEVIRYF+A
Subjt:  ELSKIGDDPFSEEASPKKMAVASSIESSQNGWVGTVSSTLAVPRNRKFVLDLTGRVTFETTRSVVEFLMWKLMDGLKRSLDIFHDEVVGRGLEVIRYFSA

Query:  KSSVIVTICLALYLHIFGGTRLLLPA
        KSSVIVTICLALYLH+FGGTRLLLPA
Subjt:  KSSVIVTICLALYLHIFGGTRLLLPA

XP_022937332.1 protein PHLOEM PROTEIN 2-LIKE A10-like [Cucurbita moschata] | 1.1e-202 | 89.44
Query:  MDFELVRRALGFSQRRKKWLILLAVMGFSGYGAYRVYHLPSVERKRKRLMKLFGAMISVAEMVADSSEAIGVVSKDLKEFLKSDSDQIPNSLKQISKIAK
        MD ELVRR LGFSQRRKKWLILL VMG SGYGAY+VYHLPSVER+RKRLMKLFGAMISVAEMVADSSEAIGVVSKDLKEFLKSDSDQIPNSLKQISKIAK
Subjt:  MDFELVRRALGFSQRRKKWLILLAVMGFSGYGAYRVYHLPSVERKRKRLMKLFGAMISVAEMVADSSEAIGVVSKDLKEFLKSDSDQIPNSLKQISKIAK

Query:  SEEFSESLEKVTEAFTVGMMRGYKSVTMNEQNLEAGSANSSFSSGVVEKLFSTAGTGFASVVVGSFAKNLVMGYYSIPGSVDDASRNLQTGAYKSGSEFP
        SEEFSESLEK+TEAFT+GMMRGYKSV+ N+QN EAG ANSSF+S VVEKLFSTAGTGFASVVVGSFAKNLVMGYYSIPGSVDDASRN  + +YKSGSEFP
Subjt:  SEEFSESLEKVTEAFTVGMMRGYKSVTMNEQNLEAGSANSSFSSGVVEKLFSTAGTGFASVVVGSFAKNLVMGYYSIPGSVDDASRNLQTGAYKSGSEFP

Query:  DRPVPRWVTVASDEKCKNVIADCIQVFVSTAVAVYLDKTMDVNVYNDLFSGLTNPTHQDKVKDMLVSVCNGAVETLVKTSHQVLTSSRTTSNLSPVPSSN
        D  VPRWVTVAS+EK KN+IADCIQVFVSTAVAVYLDKTMDVNVYNDLFSGLTNPTHQDKVKDMLVSVCNGAVETLVKTSHQVLTSSR+TS LSPV S N
Subjt:  DRPVPRWVTVASDEKCKNVIADCIQVFVSTAVAVYLDKTMDVNVYNDLFSGLTNPTHQDKVKDMLVSVCNGAVETLVKTSHQVLTSSRTTSNLSPVPSSN

Query:  ELSKIGDDPFSEEASPKKMAVASSIESSQNGWVGTVSSTLAVPRNRKFVLDLTGRVTFETTRSVVEFLMWKLMDGLKRSLDIFHDEVVGRGLEVIRYFSA
         LSKIG++PFSE A PKKMA+AS  E SQNGWVG VSSTLAVPRNRKFVLDLTGRVTFETTRS VEFL+WKLMDGLKRSLDI HDEVVGRGL+VIRYFSA
Subjt:  ELSKIGDDPFSEEASPKKMAVASSIESSQNGWVGTVSSTLAVPRNRKFVLDLTGRVTFETTRSVVEFLMWKLMDGLKRSLDIFHDEVVGRGLEVIRYFSA

Query:  KSSVIVTICLALYLHIFGGTRLLLPA
        KSSV+VTICLALYLH+FGGTRLLLPA
Subjt:  KSSVIVTICLALYLHIFGGTRLLLPA

XP_022977036.1 protein PHLOEM PROTEIN 2-LIKE A10-like [Cucurbita maxima] | 6.9e-202 | 88.97
Query:  MDFELVRRALGFSQRRKKWLILLAVMGFSGYGAYRVYHLPSVERKRKRLMKLFGAMISVAEMVADSSEAIGVVSKDLKEFLKSDSDQIPNSLKQISKIAK
        MD ELVRR LGFSQRRKKWLILL VMG SGYGAY+VYHLPSVER+RK+LMKL GAMISVAEMVADSSEAIGVVSKDLKEFLKSDSDQIPNSLKQISKIAK
Subjt:  MDFELVRRALGFSQRRKKWLILLAVMGFSGYGAYRVYHLPSVERKRKRLMKLFGAMISVAEMVADSSEAIGVVSKDLKEFLKSDSDQIPNSLKQISKIAK

Query:  SEEFSESLEKVTEAFTVGMMRGYKSVTMNEQNLEAGSANSSFSSGVVEKLFSTAGTGFASVVVGSFAKNLVMGYYSIPGSVDDASRNLQTGAYKSGSEFP
        SEEFSESLEK+TEAFT+GMMRGYKS++ NEQN EAG ANSSF+S VV+KLFSTAGTGFASVVVGSFAKNLV+GYYSIPGSVDDASRN  + +YKSGSEFP
Subjt:  SEEFSESLEKVTEAFTVGMMRGYKSVTMNEQNLEAGSANSSFSSGVVEKLFSTAGTGFASVVVGSFAKNLVMGYYSIPGSVDDASRNLQTGAYKSGSEFP

Query:  DRPVPRWVTVASDEKCKNVIADCIQVFVSTAVAVYLDKTMDVNVYNDLFSGLTNPTHQDKVKDMLVSVCNGAVETLVKTSHQVLTSSRTTSNLSPVPSSN
        D  VPRWVTVAS+EK KN+IADCIQVFVSTAVAVYLDKTMDVNVYNDLFSGLTNPTHQDKVKDMLVSVCNGAVETLVKTSHQVLTSSR+TS LSPV S N
Subjt:  DRPVPRWVTVASDEKCKNVIADCIQVFVSTAVAVYLDKTMDVNVYNDLFSGLTNPTHQDKVKDMLVSVCNGAVETLVKTSHQVLTSSRTTSNLSPVPSSN

Query:  ELSKIGDDPFSEEASPKKMAVASSIESSQNGWVGTVSSTLAVPRNRKFVLDLTGRVTFETTRSVVEFLMWKLMDGLKRSLDIFHDEVVGRGLEVIRYFSA
         LSKIGD+PFSE A PKKMA+ASS E SQNGWVG VSSTLAVPRNRKFVLDLTGRVTFETTRS VEFL+WKLMDGLKRSLDI HDEVVGRGL+VIRYFSA
Subjt:  ELSKIGDDPFSEEASPKKMAVASSIESSQNGWVGTVSSTLAVPRNRKFVLDLTGRVTFETTRSVVEFLMWKLMDGLKRSLDIFHDEVVGRGLEVIRYFSA

Query:  KSSVIVTICLALYLHIFGGTRLLLPA
        KSSV+VTICLALYLH+FGGTRLLLPA
Subjt:  KSSVIVTICLALYLHIFGGTRLLLPA

XP_038897733.1 protein PHLOEM PROTEIN 2-LIKE A10-like [Benincasa hispida] | 2.4e-202 | 88.26
Query:  MDFELVRRALGFSQRRKKWLILLAVMGFSGYGAYRVYHLPSVERKRKRLMKLFGAMISVAEMVADSSEAIGVVSKDLKEFLKSDSDQIPNSLKQISKIAK
        MDFELVRR LGFSQR++KWLILLAVMG SGYGAY+VYHLPSVERKRKRLMKLFGAMISVAEMVADSSEAIGV+SKDLKEFLKSDSD+IPNSLKQISKIAK
Subjt:  MDFELVRRALGFSQRRKKWLILLAVMGFSGYGAYRVYHLPSVERKRKRLMKLFGAMISVAEMVADSSEAIGVVSKDLKEFLKSDSDQIPNSLKQISKIAK

Query:  SEEFSESLEKVTEAFTVGMMRGYKSVTMNEQNLEAGSANSSFSSGVVEKLFSTAGTGFASVVVGSFAKNLVMGYYSIPGSVDDASRNLQTGAYKSGSEFP
        S EFS SLEKVTEAFT+GMM+GYK+VT NEQN EA SANSSFSSGV EKLFS AGTGFASVVVGSFAKNLVMGYYS+PGSV+DASRN QT  YKSGSE P
Subjt:  SEEFSESLEKVTEAFTVGMMRGYKSVTMNEQNLEAGSANSSFSSGVVEKLFSTAGTGFASVVVGSFAKNLVMGYYSIPGSVDDASRNLQTGAYKSGSEFP

Query:  DRPVPRWVTVASDEKCKNVIADCIQVFVSTAVAVYLDKTMDVNVYNDLFSGLTNPTHQDKVKDMLVSVCNGAVETLVKTSHQVLTSSRTTSNLSPVPSSN
        D  VPRWVTVAS+EKCKNVIADCIQVFVSTAV VYLDKTMDVNVYNDLFSGLTNPTHQDKVKDMLVSVCNGAVETLVK+SHQVLTSS++TSNLSPV S N
Subjt:  DRPVPRWVTVASDEKCKNVIADCIQVFVSTAVAVYLDKTMDVNVYNDLFSGLTNPTHQDKVKDMLVSVCNGAVETLVKTSHQVLTSSRTTSNLSPVPSSN

Query:  ELSKIGDDPFSEEASPKKMAVASSIESSQNGWVGTVSSTLAVPRNRKFVLDLTGRVTFETTRSVVEFLMWKLMDGLKRSLDIFHDEVVGRGLEVIRYFSA
         +SKIGD+PFSE A PKKMA+ +S E S+NGWV TVSSTLAVPRNRKFVLDLTGRVTFETTRSVVEF +WKLMDGLKRSLDI HDEVVGRGLEVIRYF A
Subjt:  ELSKIGDDPFSEEASPKKMAVASSIESSQNGWVGTVSSTLAVPRNRKFVLDLTGRVTFETTRSVVEFLMWKLMDGLKRSLDIFHDEVVGRGLEVIRYFSA

Query:  KSSVIVTICLALYLHIFGGTRLLLPA
        KSSVIVTICLALYLH+FGGTRLL+PA
Subjt:  KSSVIVTICLALYLHIFGGTRLLLPA

TrEMBL top hits | e value | % identity
A0A1S3BUC6 protein PHLOEM PROTEIN 2-LIKE A10 | 1.9e-197 | 87.56
Query:  MDFELVRRALGFSQRRKKWLILLAVMGFSGYGAYRVYHLPSVERKRKRLMKLFGAMISVAEMVADSSEAIGVVSKDLKEFLKSDSDQIPNSLKQISKIAK
        MDFELVRR LGFSQR+KKWLILLA+MG SGYGAY+VYHLPSVERKR+RLMKLFGAMISVAEMVADSSEAIGV+SKDLKEFLKSDSDQIPNSLKQISKIAK
Subjt:  MDFELVRRALGFSQRRKKWLILLAVMGFSGYGAYRVYHLPSVERKRKRLMKLFGAMISVAEMVADSSEAIGVVSKDLKEFLKSDSDQIPNSLKQISKIAK

Query:  SEEFSESLEKVTEAFTVGMMRGYKSVTMNEQNLEAGSANSSFSSGVVEKLFSTAGTGFASVVVGSFAKNLVMGYYSIPGSVDDASRNLQTGAYKSGSEFP
        S EFSESLEKVTEAFTVGMMRGYKSVT N+QN EA SANS FSSG+VEKLFST GTGFASVVVGSFA+NLVMGYYSI GSVDDAS N Q     SGSEFP
Subjt:  SEEFSESLEKVTEAFTVGMMRGYKSVTMNEQNLEAGSANSSFSSGVVEKLFSTAGTGFASVVVGSFAKNLVMGYYSIPGSVDDASRNLQTGAYKSGSEFP

Query:  DRPVPRWVTVASDEKCKNVIADCIQVFVSTAVAVYLDKTMDVNVYNDLFSGLTNPTHQDKVKDMLVSVCNGAVETLVKTSHQVLTSSRTTSNLSPVPSSN
        D  VPRWVTVAS+EKCKNVIADCIQVFVSTAV VYLDKTMDVNVYNDLFSGLTNPTHQDKVKDMLVSVCNGAVETLVK+SHQVLTSSR+TSNLS V   N
Subjt:  DRPVPRWVTVASDEKCKNVIADCIQVFVSTAVAVYLDKTMDVNVYNDLFSGLTNPTHQDKVKDMLVSVCNGAVETLVKTSHQVLTSSRTTSNLSPVPSSN

Query:  ELSKIGDDPFSEEASPKKMAVASSIESSQNGWVGTVSSTLAVPRNRKFVLDLTGRVTFETTRSVVEFLMWKLMDGLKRSLDIFHDEVVGRGLEVIRYFSA
         +SKIGD PFSE   PKK+A+ASS E SQNGWV TVSSTLAVPRNRKFVLDLTGRVTFETTRSVVE+ +WKLMDGLKRSLD  HDEVVGRGLEVIRYF A
Subjt:  ELSKIGDDPFSEEASPKKMAVASSIESSQNGWVGTVSSTLAVPRNRKFVLDLTGRVTFETTRSVVEFLMWKLMDGLKRSLDIFHDEVVGRGLEVIRYFSA

Query:  KSSVIVTICLALYLHIFGGTRLLLPA
        KSSVIVTICLALYLH+FGGTRLL+PA
Subjt:  KSSVIVTICLALYLHIFGGTRLLLPA

A0A5A7V8K7 Protein PHLOEM PROTEIN 2-LIKE A10 | 1.9e-197 | 87.56
Query:  MDFELVRRALGFSQRRKKWLILLAVMGFSGYGAYRVYHLPSVERKRKRLMKLFGAMISVAEMVADSSEAIGVVSKDLKEFLKSDSDQIPNSLKQISKIAK
        MDFELVRR LGFSQR+KKWLILLA+MG SGYGAY+VYHLPSVERKR+RLMKLFGAMISVAEMVADSSEAIGV+SKDLKEFLKSDSDQIPNSLKQISKIAK
Subjt:  MDFELVRRALGFSQRRKKWLILLAVMGFSGYGAYRVYHLPSVERKRKRLMKLFGAMISVAEMVADSSEAIGVVSKDLKEFLKSDSDQIPNSLKQISKIAK

Query:  SEEFSESLEKVTEAFTVGMMRGYKSVTMNEQNLEAGSANSSFSSGVVEKLFSTAGTGFASVVVGSFAKNLVMGYYSIPGSVDDASRNLQTGAYKSGSEFP
        S EFSESLEKVTEAFTVGMMRGYKSVT N+QN EA SANS FSSG+VEKLFST GTGFASVVVGSFA+NLVMGYYSI GSVDDAS N Q     SGSEFP
Subjt:  SEEFSESLEKVTEAFTVGMMRGYKSVTMNEQNLEAGSANSSFSSGVVEKLFSTAGTGFASVVVGSFAKNLVMGYYSIPGSVDDASRNLQTGAYKSGSEFP

Query:  DRPVPRWVTVASDEKCKNVIADCIQVFVSTAVAVYLDKTMDVNVYNDLFSGLTNPTHQDKVKDMLVSVCNGAVETLVKTSHQVLTSSRTTSNLSPVPSSN
        D  VPRWVTVAS+EKCKNVIADCIQVFVSTAV VYLDKTMDVNVYNDLFSGLTNPTHQDKVKDMLVSVCNGAVETLVK+SHQVLTSSR+TSNLS V   N
Subjt:  DRPVPRWVTVASDEKCKNVIADCIQVFVSTAVAVYLDKTMDVNVYNDLFSGLTNPTHQDKVKDMLVSVCNGAVETLVKTSHQVLTSSRTTSNLSPVPSSN

Query:  ELSKIGDDPFSEEASPKKMAVASSIESSQNGWVGTVSSTLAVPRNRKFVLDLTGRVTFETTRSVVEFLMWKLMDGLKRSLDIFHDEVVGRGLEVIRYFSA
         +SKIGD PFSE   PKK+A+ASS E SQNGWV TVSSTLAVPRNRKFVLDLTGRVTFETTRSVVE+ +WKLMDGLKRSLD  HDEVVGRGLEVIRYF A
Subjt:  ELSKIGDDPFSEEASPKKMAVASSIESSQNGWVGTVSSTLAVPRNRKFVLDLTGRVTFETTRSVVEFLMWKLMDGLKRSLDIFHDEVVGRGLEVIRYFSA

Query:  KSSVIVTICLALYLHIFGGTRLLLPA
        KSSVIVTICLALYLH+FGGTRLL+PA
Subjt:  KSSVIVTICLALYLHIFGGTRLLLPA

A0A6J1CMC9 protein PHLOEM PROTEIN 2-LIKE A10 | 6.1e-204 | 88.73
Query:  MDFELVRRALGFSQRRKKWLILLAVMGFSGYGAYRVYHLPSVERKRKRLMKLFGAMISVAEMVADSSEAIGVVSKDLKEFLKSDSDQIPNSLKQISKIAK
        MD  LVR  LGFSQRRKKWLILLAVMG SGYGAY+VYHLPSVERKRKRLMKLFGA+ISVAEMVADSSEAIGVVSKDLKEFLKSDSDQIPNSLKQISKIA+
Subjt:  MDFELVRRALGFSQRRKKWLILLAVMGFSGYGAYRVYHLPSVERKRKRLMKLFGAMISVAEMVADSSEAIGVVSKDLKEFLKSDSDQIPNSLKQISKIAK

Query:  SEEFSESLEKVTEAFTVGMMRGYKSVTMNEQNLEAGSANSSFSSGVVEKLFSTAGTGFASVVVGSFAKNLVMGYYSIPGSVDDASRNLQTGAYKSGSEFP
        SEEFSES+EKVTEAFTVGMMRGYKSVT NEQN+EAGS NSSF++GVVEKLFSTAGTGFASVVVGSFAKNLVMGYYS+PGS DDASRNLQTGAY+SGSE  
Subjt:  SEEFSESLEKVTEAFTVGMMRGYKSVTMNEQNLEAGSANSSFSSGVVEKLFSTAGTGFASVVVGSFAKNLVMGYYSIPGSVDDASRNLQTGAYKSGSEFP

Query:  DRPVPRWVTVASDEKCKNVIADCIQVFVSTAVAVYLDKTMDVNVYNDLFSGLTNPTHQDKVKDMLVSVCNGAVETLVKTSHQVLTSSRTTSNLSPVPSSN
        +  VPRWV+VA++EKCKNV+ADCIQVFVSTAVAVYLDKTMDVNVY++LFSGLTNPTHQ+KVKDMLVS+CNGAVETLVKTSHQVLTSS+T+SNLSPV S N
Subjt:  DRPVPRWVTVASDEKCKNVIADCIQVFVSTAVAVYLDKTMDVNVYNDLFSGLTNPTHQDKVKDMLVSVCNGAVETLVKTSHQVLTSSRTTSNLSPVPSSN

Query:  ELSKIGDDPFSEEASPKKMAVASSIESSQNGWVGTVSSTLAVPRNRKFVLDLTGRVTFETTRSVVEFLMWKLMDGLKRSLDIFHDEVVGRGLEVIRYFSA
         LSKIGDDPFSEE  PK++AVA+SIE SQ+ WVG VSSTLAVPRNRKFVLDLTGRVTFETTRSVVEFL+WKLMDGLKRSLDI HDEVVGRGLEVIRYF+A
Subjt:  ELSKIGDDPFSEEASPKKMAVASSIESSQNGWVGTVSSTLAVPRNRKFVLDLTGRVTFETTRSVVEFLMWKLMDGLKRSLDIFHDEVVGRGLEVIRYFSA

Query:  KSSVIVTICLALYLHIFGGTRLLLPA
        KSSVIVTICLALYLH+FGGTRLLLPA
Subjt:  KSSVIVTICLALYLHIFGGTRLLLPA

A0A6J1FAX1 protein PHLOEM PROTEIN 2-LIKE A10-like | 5.2e-203 | 89.44
Query:  MDFELVRRALGFSQRRKKWLILLAVMGFSGYGAYRVYHLPSVERKRKRLMKLFGAMISVAEMVADSSEAIGVVSKDLKEFLKSDSDQIPNSLKQISKIAK
        MD ELVRR LGFSQRRKKWLILL VMG SGYGAY+VYHLPSVER+RKRLMKLFGAMISVAEMVADSSEAIGVVSKDLKEFLKSDSDQIPNSLKQISKIAK
Subjt:  MDFELVRRALGFSQRRKKWLILLAVMGFSGYGAYRVYHLPSVERKRKRLMKLFGAMISVAEMVADSSEAIGVVSKDLKEFLKSDSDQIPNSLKQISKIAK

Query:  SEEFSESLEKVTEAFTVGMMRGYKSVTMNEQNLEAGSANSSFSSGVVEKLFSTAGTGFASVVVGSFAKNLVMGYYSIPGSVDDASRNLQTGAYKSGSEFP
        SEEFSESLEK+TEAFT+GMMRGYKSV+ N+QN EAG ANSSF+S VVEKLFSTAGTGFASVVVGSFAKNLVMGYYSIPGSVDDASRN  + +YKSGSEFP
Subjt:  SEEFSESLEKVTEAFTVGMMRGYKSVTMNEQNLEAGSANSSFSSGVVEKLFSTAGTGFASVVVGSFAKNLVMGYYSIPGSVDDASRNLQTGAYKSGSEFP

Query:  DRPVPRWVTVASDEKCKNVIADCIQVFVSTAVAVYLDKTMDVNVYNDLFSGLTNPTHQDKVKDMLVSVCNGAVETLVKTSHQVLTSSRTTSNLSPVPSSN
        D  VPRWVTVAS+EK KN+IADCIQVFVSTAVAVYLDKTMDVNVYNDLFSGLTNPTHQDKVKDMLVSVCNGAVETLVKTSHQVLTSSR+TS LSPV S N
Subjt:  DRPVPRWVTVASDEKCKNVIADCIQVFVSTAVAVYLDKTMDVNVYNDLFSGLTNPTHQDKVKDMLVSVCNGAVETLVKTSHQVLTSSRTTSNLSPVPSSN

Query:  ELSKIGDDPFSEEASPKKMAVASSIESSQNGWVGTVSSTLAVPRNRKFVLDLTGRVTFETTRSVVEFLMWKLMDGLKRSLDIFHDEVVGRGLEVIRYFSA
         LSKIG++PFSE A PKKMA+AS  E SQNGWVG VSSTLAVPRNRKFVLDLTGRVTFETTRS VEFL+WKLMDGLKRSLDI HDEVVGRGL+VIRYFSA
Subjt:  ELSKIGDDPFSEEASPKKMAVASSIESSQNGWVGTVSSTLAVPRNRKFVLDLTGRVTFETTRSVVEFLMWKLMDGLKRSLDIFHDEVVGRGLEVIRYFSA

Query:  KSSVIVTICLALYLHIFGGTRLLLPA
        KSSV+VTICLALYLH+FGGTRLLLPA
Subjt:  KSSVIVTICLALYLHIFGGTRLLLPA

A0A6J1IIL3 protein PHLOEM PROTEIN 2-LIKE A10-like | 3.4e-202 | 88.97
Query:  MDFELVRRALGFSQRRKKWLILLAVMGFSGYGAYRVYHLPSVERKRKRLMKLFGAMISVAEMVADSSEAIGVVSKDLKEFLKSDSDQIPNSLKQISKIAK
        MD ELVRR LGFSQRRKKWLILL VMG SGYGAY+VYHLPSVER+RK+LMKL GAMISVAEMVADSSEAIGVVSKDLKEFLKSDSDQIPNSLKQISKIAK
Subjt:  MDFELVRRALGFSQRRKKWLILLAVMGFSGYGAYRVYHLPSVERKRKRLMKLFGAMISVAEMVADSSEAIGVVSKDLKEFLKSDSDQIPNSLKQISKIAK

Query:  SEEFSESLEKVTEAFTVGMMRGYKSVTMNEQNLEAGSANSSFSSGVVEKLFSTAGTGFASVVVGSFAKNLVMGYYSIPGSVDDASRNLQTGAYKSGSEFP
        SEEFSESLEK+TEAFT+GMMRGYKS++ NEQN EAG ANSSF+S VV+KLFSTAGTGFASVVVGSFAKNLV+GYYSIPGSVDDASRN  + +YKSGSEFP
Subjt:  SEEFSESLEKVTEAFTVGMMRGYKSVTMNEQNLEAGSANSSFSSGVVEKLFSTAGTGFASVVVGSFAKNLVMGYYSIPGSVDDASRNLQTGAYKSGSEFP

Query:  DRPVPRWVTVASDEKCKNVIADCIQVFVSTAVAVYLDKTMDVNVYNDLFSGLTNPTHQDKVKDMLVSVCNGAVETLVKTSHQVLTSSRTTSNLSPVPSSN
        D  VPRWVTVAS+EK KN+IADCIQVFVSTAVAVYLDKTMDVNVYNDLFSGLTNPTHQDKVKDMLVSVCNGAVETLVKTSHQVLTSSR+TS LSPV S N
Subjt:  DRPVPRWVTVASDEKCKNVIADCIQVFVSTAVAVYLDKTMDVNVYNDLFSGLTNPTHQDKVKDMLVSVCNGAVETLVKTSHQVLTSSRTTSNLSPVPSSN

Query:  ELSKIGDDPFSEEASPKKMAVASSIESSQNGWVGTVSSTLAVPRNRKFVLDLTGRVTFETTRSVVEFLMWKLMDGLKRSLDIFHDEVVGRGLEVIRYFSA
         LSKIGD+PFSE A PKKMA+ASS E SQNGWVG VSSTLAVPRNRKFVLDLTGRVTFETTRS VEFL+WKLMDGLKRSLDI HDEVVGRGL+VIRYFSA
Subjt:  ELSKIGDDPFSEEASPKKMAVASSIESSQNGWVGTVSSTLAVPRNRKFVLDLTGRVTFETTRSVVEFLMWKLMDGLKRSLDIFHDEVVGRGLEVIRYFSA

Query:  KSSVIVTICLALYLHIFGGTRLLLPA
        KSSV+VTICLALYLH+FGGTRLLLPA
Subjt:  KSSVIVTICLALYLHIFGGTRLLLPA

SwissProt top hits | e value | % identity
Q9SY57 Protein PHLOEM PROTEIN 2-LIKE A10 | 7.6e-119 | 53.12
Query:  LVRRALGFSQRRKKWLILLAVMGFSGYGAYRVYHLPSVERKRKRLMKLFGAMISVAEMVADSSEAIGVVSKDLKEFLKSDSDQIPNSLKQISKIAKSEEF
        L  + +  SQRR+KWLI +A+ G SGYGAY+VYHLPSV RKRKRL KLFGA++SVAE+++DS+E + +VS+D+K+FL SDSD+IPNSLKQI+KI  S EF
Subjt:  LVRRALGFSQRRKKWLILLAVMGFSGYGAYRVYHLPSVERKRKRLMKLFGAMISVAEMVADSSEAIGVVSKDLKEFLKSDSDQIPNSLKQISKIAKSEEF

Query:  SESLEKVTEAFTVGMMRGYKSVTMNEQNLEAGSANSSFSSGVVEKLFSTAGTGFASVVVGSFAKNLVMGYYSIPGSVDDASRNLQTGAYKSGSEFPDRPV
        ++SL +V++A T+G  RGYKS +    +    S++SS    V++K+FS AGTGF SVVVGSFAKNLV+G+Y         S  +++G    GS+  +   
Subjt:  SESLEKVTEAFTVGMMRGYKSVTMNEQNLEAGSANSSFSSGVVEKLFSTAGTGFASVVVGSFAKNLVMGYYSIPGSVDDASRNLQTGAYKSGSEFPDRPV

Query:  PRWVTVASDEKCKNVIADCIQVFVSTAVAVYLDKTMDVNVYNDLFSGLTNPTHQDKVKDMLVSVCNGAVETLVKTSHQVLTSSRTTSNLSPVPSSNELSK
        PRWVT+  D+KC+ ++ADCI+ F STA+ VYLDKTMD+N Y+ +F GLTNP HQD VKD+LVSVCNGA+ET+V+TSH V TSSR         S N + +
Subjt:  PRWVTVASDEKCKNVIADCIQVFVSTAVAVYLDKTMDVNVYNDLFSGLTNPTHQDKVKDMLVSVCNGAVETLVKTSHQVLTSSRTTSNLSPVPSSNELSK

Query:  IGDDPFSEEASPKKMAVASSIES-SQNGWVGTVSSTLAVPRNRKFVLDLTGRVTFETTRSVVEFLMWKLMDGLKRSLDIFHDEVVGRGLEVIRYFSAKSS
        I DD F    S +   V+ S +    NGW   +++TLAVP NR+F+ D+TGRVT ETTRS++ F+M K   G ++S+++ H+EV  RG + + Y  AKSS
Subjt:  IGDDPFSEEASPKKMAVASSIES-SQNGWVGTVSSTLAVPRNRKFVLDLTGRVTFETTRSVVEFLMWKLMDGLKRSLDIFHDEVVGRGLEVIRYFSAKSS

Query:  VIVTICLALYLHIFGG
        VI+T+CLALYLHI  G
Subjt:  VIVTICLALYLHIFGG

Arabidopsis top hits | e value | % identity
AT1G10150.1 Carbohydrate-binding protein | 5.4e-120 | 53.12
Query:  LVRRALGFSQRRKKWLILLAVMGFSGYGAYRVYHLPSVERKRKRLMKLFGAMISVAEMVADSSEAIGVVSKDLKEFLKSDSDQIPNSLKQISKIAKSEEF
        L  + +  SQRR+KWLI +A+ G SGYGAY+VYHLPSV RKRKRL KLFGA++SVAE+++DS+E + +VS+D+K+FL SDSD+IPNSLKQI+KI  S EF
Subjt:  LVRRALGFSQRRKKWLILLAVMGFSGYGAYRVYHLPSVERKRKRLMKLFGAMISVAEMVADSSEAIGVVSKDLKEFLKSDSDQIPNSLKQISKIAKSEEF

Query:  SESLEKVTEAFTVGMMRGYKSVTMNEQNLEAGSANSSFSSGVVEKLFSTAGTGFASVVVGSFAKNLVMGYYSIPGSVDDASRNLQTGAYKSGSEFPDRPV
        ++SL +V++A T+G  RGYKS +    +    S++SS    V++K+FS AGTGF SVVVGSFAKNLV+G+Y         S  +++G    GS+  +   
Subjt:  SESLEKVTEAFTVGMMRGYKSVTMNEQNLEAGSANSSFSSGVVEKLFSTAGTGFASVVVGSFAKNLVMGYYSIPGSVDDASRNLQTGAYKSGSEFPDRPV

Query:  PRWVTVASDEKCKNVIADCIQVFVSTAVAVYLDKTMDVNVYNDLFSGLTNPTHQDKVKDMLVSVCNGAVETLVKTSHQVLTSSRTTSNLSPVPSSNELSK
        PRWVT+  D+KC+ ++ADCI+ F STA+ VYLDKTMD+N Y+ +F GLTNP HQD VKD+LVSVCNGA+ET+V+TSH V TSSR         S N + +
Subjt:  PRWVTVASDEKCKNVIADCIQVFVSTAVAVYLDKTMDVNVYNDLFSGLTNPTHQDKVKDMLVSVCNGAVETLVKTSHQVLTSSRTTSNLSPVPSSNELSK

Query:  IGDDPFSEEASPKKMAVASSIES-SQNGWVGTVSSTLAVPRNRKFVLDLTGRVTFETTRSVVEFLMWKLMDGLKRSLDIFHDEVVGRGLEVIRYFSAKSS
        I DD F    S +   V+ S +    NGW   +++TLAVP NR+F+ D+TGRVT ETTRS++ F+M K   G ++S+++ H+EV  RG + + Y  AKSS
Subjt:  IGDDPFSEEASPKKMAVASSIES-SQNGWVGTVSSTLAVPRNRKFVLDLTGRVTFETTRSVVEFLMWKLMDGLKRSLDIFHDEVVGRGLEVIRYFSAKSS

Query:  VIVTICLALYLHIFGG
        VI+T+CLALYLHI  G
Subjt:  VIVTICLALYLHIFGG

AT1G59510.1 Carbohydrate-binding protein | 4.6e-103 | 50.5
Query:  QRRKKWLILLAVMGFSGYGAYRVYHLPSVERKRKRLMKLFGAMISVAEMVADSSEAIGVVSKDLKEFLKSDSDQIPNSLKQISKIAKSEEFSESLEKVTE
        QRR+KWLILLAV G SGYG YRVY+   + +K KRLMKLF  ++S AEMV DS+E I +VS+DLKEFL+S+S +IPNSLKQ+SKI KS+EF++SL +V+E
Subjt:  QRRKKWLILLAVMGFSGYGAYRVYHLPSVERKRKRLMKLFGAMISVAEMVADSSEAIGVVSKDLKEFLKSDSDQIPNSLKQISKIAKSEEFSESLEKVTE

Query:  AFTVGMMRGYKSVTMNEQNLEAGSANSSFSSGVVEKLFSTAGTGFASVVVGSFAKNLVMGYYSIPGSVDDASRNLQTGAYKSGSEFPDRPVPRWVTVASD
        A  +G+ RGY S    + N+E  S     +  VV+++FS  G GF SVVVGSFAKNLV+G+YS              G  + GS+  D   PRW+ + SD
Subjt:  AFTVGMMRGYKSVTMNEQNLEAGSANSSFSSGVVEKLFSTAGTGFASVVVGSFAKNLVMGYYSIPGSVDDASRNLQTGAYKSGSEFPDRPVPRWVTVASD

Query:  EKCKNVIADCIQVFVSTAVAVYLDKTMDVNVYNDLFSGLTNPTHQDKVKDMLVSVCNGAVETLVKTSHQVLTSSRTTSNLSPVPSSNELSKIGDDPFSEE
        +KC+ ++ADCI+ F S+AV+VY+DKT+ VN Y+ +F+GLTNP H+D  +D+LVSVCNGA+ET ++TSH V TSS   ++ S   S N             
Subjt:  EKCKNVIADCIQVFVSTAVAVYLDKTMDVNVYNDLFSGLTNPTHQDKVKDMLVSVCNGAVETLVKTSHQVLTSSRTTSNLSPVPSSNELSKIGDDPFSEE

Query:  ASPKKMAVASSIESSQNGWVGTVSSTLAVPRNRKFVLDLTGRVTFETTRSVVEFLMWKLMDGLKRSLDIFHDEVVGRGLEVIRYFSAKSSVIVTICLALY
                       +NGW   +S+TLAVP NRKF+ D+TGRVT ET RS++EF++ K     KRSLD+ H+EV  RG +V+ Y  AKSSVI+T+CLA+Y
Subjt:  ASPKKMAVASSIESSQNGWVGTVSSTLAVPRNRKFVLDLTGRVTFETTRSVVEFLMWKLMDGLKRSLDIFHDEVVGRGLEVIRYFSAKSSVIVTICLALY

Query:  LHIF
         HIF
Subjt:  LHIF

AT3G49790.1 Carbohydrate-binding protein | 1.8e-78 | 44.44
Query:  FSQRRKKWLILLAVMGFSGYGAYRVYHLPSVERKRKRLMKLFGAMISVAEMVADSSEAIGVVSKDLKEFLKSDSDQIPNSLKQISKIAKSEEFSESLEKV
        F+ + KKW ILLAV   SGYGA+RVYH PS+ +KRKR+ KLF  ++++ E  +DS+E + V+SKDL EFL+SDSDQIPNSLKQISKIAKS+E + SL + 
Subjt:  FSQRRKKWLILLAVMGFSGYGAYRVYHLPSVERKRKRLMKLFGAMISVAEMVADSSEAIGVVSKDLKEFLKSDSDQIPNSLKQISKIAKSEEFSESLEKV

Query:  TEAFTVGMMRGYKSVTMNEQNLEAGSANSSFSSGVVEKLFSTAGTGFASVVVGSFAKNLVMGYYSIPGSVDDASRNLQTGAYKSGSEFPDRPVPRWVTVA
        T+A TVG++RG          ++ GS  S F+  V++KLF+ +G+GFAS +VGSFA+NLV+  YS  G               S S+  D        V 
Subjt:  TEAFTVGMMRGYKSVTMNEQNLEAGSANSSFSSGVVEKLFSTAGTGFASVVVGSFAKNLVMGYYSIPGSVDDASRNLQTGAYKSGSEFPDRPVPRWVTVA

Query:  SDEKCKNVIADCIQVFVSTAVAVYLDKTMDVNVYNDLFSGLTNPTHQDKVKDMLVSVCNGAVETLVKTSHQVLTSSRTTSNLSPVPSSNELSKIGDDPFS
        SD+  + +I DC+Q FVSTAV+VYLDKT DVNV++DLF+GLTNP H+ KVK  LV++CN AVET V+ S + +  +R++S       S++   +G     
Subjt:  SDEKCKNVIADCIQVFVSTAVAVYLDKTMDVNVYNDLFSGLTNPTHQDKVKDMLVSVCNGAVETLVKTSHQVLTSSRTTSNLSPVPSSNELSKIGDDPFS

Query:  EEASPKKMAVASSIESSQNGWVGTVSSTLAVPRNRKFVLDLTGRVTFETTRSVVEFLMWKLMDGLKRSLDIFHDEVVGRGLEVIRYFSAKSSVIVTICLA
                       + Q  W+  VSS+L+VP NRK+V+DLTGRVTFET RS++E     L++     ++ + ++V  RG E  R+   K+S++ ++CL+
Subjt:  EEASPKKMAVASSIESSQNGWVGTVSSTLAVPRNRKFVLDLTGRVTFETTRSVVEFLMWKLMDGLKRSLDIFHDEVVGRGLEVIRYFSAKSSVIVTICLA

Query:  LYLHIFGGTRLLLP
        L L I     +L P
Subjt:  LYLHIFGGTRLLLP


Sequences
CDS sequence
ATGGATTTTGAACTGGTTAGAAGGGCCTTGGGATTCTCTCAGAGGAGAAAGAAGTGGCTTATTCTATTGGCTGTCATGGGGTTCTCTGGTTATGGTGCTTACAGGGTCTA
TCATTTGCCCTCTGTCGAGAGGAAGAGAAAGAGGCTGATGAAGCTCTTCGGCGCTATGATTTCCGTCGCTGAAATGGTTGCGGATTCTTCTGAAGCTATTGGAGTAGTTT
CTAAGGACTTGAAGGAGTTTCTCAAATCTGATTCCGACCAAATTCCCAACAGCTTGAAGCAAATTTCCAAAATTGCAAAGTCTGAGGAATTTTCGGAGTCTCTGGAAAAG
GTCACCGAGGCATTTACGGTTGGAATGATGAGGGGGTATAAATCTGTGACAATGAATGAACAAAATTTGGAGGCTGGTTCGGCGAATTCAAGCTTTTCTTCTGGTGTCGT
TGAGAAGCTTTTCTCAACAGCTGGGACTGGTTTTGCTTCTGTTGTGGTCGGAAGTTTTGCAAAGAATTTGGTTATGGGGTATTACTCGATCCCTGGATCAGTTGATGATG
CAAGTCGAAATTTGCAGACAGGTGCTTACAAATCTGGATCTGAATTTCCAGACAGGCCCGTGCCCAGATGGGTAACGGTCGCTTCCGATGAGAAATGCAAAAATGTCATA
GCAGATTGCATACAAGTTTTTGTTAGCACTGCAGTTGCTGTATATCTTGATAAAACAATGGATGTTAATGTGTACAATGATCTCTTTAGTGGATTGACCAACCCCACTCA
TCAGGACAAGGTGAAGGACATGCTTGTTTCTGTTTGTAATGGCGCTGTGGAAACTCTGGTAAAAACATCTCACCAGGTATTGACAAGCTCGCGAACGACTTCAAATTTGA
GTCCAGTTCCATCCTCTAATGAACTGTCGAAGATAGGAGACGACCCTTTCTCAGAAGAAGCATCTCCAAAGAAGATGGCTGTGGCCAGCTCAATTGAGAGTAGCCAAAAT
GGGTGGGTTGGCACAGTTTCATCTACTTTGGCAGTTCCTAGAAATCGAAAGTTTGTGCTTGATTTAACTGGTAGAGTGACCTTTGAGACAACAAGATCTGTTGTGGAATT
TTTGATGTGGAAGCTAATGGATGGTCTGAAGAGAAGTCTCGATATATTTCATGATGAAGTTGTGGGTAGGGGATTGGAAGTCATCAGATACTTCAGTGCGAAGTCTTCTG
TTATTGTTACAATTTGTCTAGCATTATATCTACATATTTTTGGTGGCACCAGACTTCTTTTACCTGCATAA
mRNA sequence
CTGATTGACCACCAATTTTTATAGGATTTTATTTATACAAGCCACAAAGTTCGCGTGGTCCGTTGAAACTTGAAATCATTGTGATTGGTTATCTCATATGCGATTTTGGT
ATTTCCGTGTACGAATGGTAAAACCGATCTGAATACACAACTCCTTAACCTTTTTCAATTCATTCTGAATCTGATCTTTATTTATTTTTCTTCCGTTCAATTTTATTTTC
TTCTTTTTAATATTTGTGTAATCAGATTCCATTCTATCGATTCCTTTCCGAACCCCAACGCTGATTTCAAGCCAATCACCTGCACTCACCCTCCCTTCCCTGTTCTCTAG
CTGCTCCCACTTGTCGCCAATTAAATTTCTCAGAAATTTTGCTTGAAAAATCTTCCATGTTTCACCCCTCAATCTTCATTGACTTCGTACATTAACCCCAAGCAGGTTTG
CGATTCCGTTTTATTCATCGGATTTCTTCTTTTTTCCATTCTCGTTCTTTTTCTTCACTTTCTGTCGCCCCATTTTCAATCTCGAATCTGGGTTTCTGTTTGATTTGGGT
TTTAGGGTTTCTTTGAGTCTCGCTCCTCATGGATTTTGAACTGGTTAGAAGGGCCTTGGGATTCTCTCAGAGGAGAAAGAAGTGGCTTATTCTATTGGCTGTCATGGGGT
TCTCTGGTTATGGTGCTTACAGGGTCTATCATTTGCCCTCTGTCGAGAGGAAGAGAAAGAGGCTGATGAAGCTCTTCGGCGCTATGATTTCCGTCGCTGAAATGGTTGCG
GATTCTTCTGAAGCTATTGGAGTAGTTTCTAAGGACTTGAAGGAGTTTCTCAAATCTGATTCCGACCAAATTCCCAACAGCTTGAAGCAAATTTCCAAAATTGCAAAGTC
TGAGGAATTTTCGGAGTCTCTGGAAAAGGTCACCGAGGCATTTACGGTTGGAATGATGAGGGGGTATAAATCTGTGACAATGAATGAACAAAATTTGGAGGCTGGTTCGG
CGAATTCAAGCTTTTCTTCTGGTGTCGTTGAGAAGCTTTTCTCAACAGCTGGGACTGGTTTTGCTTCTGTTGTGGTCGGAAGTTTTGCAAAGAATTTGGTTATGGGGTAT
TACTCGATCCCTGGATCAGTTGATGATGCAAGTCGAAATTTGCAGACAGGTGCTTACAAATCTGGATCTGAATTTCCAGACAGGCCCGTGCCCAGATGGGTAACGGTCGC
TTCCGATGAGAAATGCAAAAATGTCATAGCAGATTGCATACAAGTTTTTGTTAGCACTGCAGTTGCTGTATATCTTGATAAAACAATGGATGTTAATGTGTACAATGATC
TCTTTAGTGGATTGACCAACCCCACTCATCAGGACAAGGTGAAGGACATGCTTGTTTCTGTTTGTAATGGCGCTGTGGAAACTCTGGTAAAAACATCTCACCAGGTATTG
ACAAGCTCGCGAACGACTTCAAATTTGAGTCCAGTTCCATCCTCTAATGAACTGTCGAAGATAGGAGACGACCCTTTCTCAGAAGAAGCATCTCCAAAGAAGATGGCTGT
GGCCAGCTCAATTGAGAGTAGCCAAAATGGGTGGGTTGGCACAGTTTCATCTACTTTGGCAGTTCCTAGAAATCGAAAGTTTGTGCTTGATTTAACTGGTAGAGTGACCT
TTGAGACAACAAGATCTGTTGTGGAATTTTTGATGTGGAAGCTAATGGATGGTCTGAAGAGAAGTCTCGATATATTTCATGATGAAGTTGTGGGTAGGGGATTGGAAGTC
ATCAGATACTTCAGTGCGAAGTCTTCTGTTATTGTTACAATTTGTCTAGCATTATATCTACATATTTTTGGTGGCACCAGACTTCTTTTACCTGCATAATCTGAAGTGGG
AAGTAGTTATAGTGCTTAGATTAGAAAGTATATACCATTGTTGCATTCAGGGAAGGTTTTGTACTTTCTCCATATCATTCAAATTTTAATGATAATGATACTTCTAGAAC
AAATGCATATATTGATTCATAAGTAACTTCCACTATCTTCTCTCCCTCTCTCTCTACCTTGTTTGTGCCTGAATGCACTATGTGTCTTAAGATTCTTCTGGGGTCTGTCT
CTCTATATAATATCTGCAATGAGTGTGTATCAACTGCCACCCTTCTTATTGGAGTAGTGGGAATCGAAATACTCAGAAGGAAATCACTTGTATCTTTTCTGTCGATAATT
TATTTACTTTGAACGGTTTTGAAGCTCATGCCTGTTATGGAGGCAATGTGACCAAAAAGACACTATCTCTTAGCTAGTTTGCTTTACTGCAATAATATAAAG
Protein sequence
MDFELVRRALGFSQRRKKWLILLAVMGFSGYGAYRVYHLPSVERKRKRLMKLFGAMISVAEMVADSSEAIGVVSKDLKEFLKSDSDQIPNSLKQISKIAKSEEFSESLEK
VTEAFTVGMMRGYKSVTMNEQNLEAGSANSSFSSGVVEKLFSTAGTGFASVVVGSFAKNLVMGYYSIPGSVDDASRNLQTGAYKSGSEFPDRPVPRWVTVASDEKCKNVI
ADCIQVFVSTAVAVYLDKTMDVNVYNDLFSGLTNPTHQDKVKDMLVSVCNGAVETLVKTSHQVLTSSRTTSNLSPVPSSNELSKIGDDPFSEEASPKKMAVASSIESSQN
GWVGTVSSTLAVPRNRKFVLDLTGRVTFETTRSVVEFLMWKLMDGLKRSLDIFHDEVVGRGLEVIRYFSAKSSVIVTICLALYLHIFGGTRLLLPA