; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr026123 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr026123
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionMFS domain-containing protein
Genome locationtig00153031:1986019..1988738
RNA-Seq ExpressionSgr026123
SyntenySgr026123
Gene Ontology termsGO:0055085 - transmembrane transport (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0022857 - transmembrane transporter activity (molecular function)
InterPro domainsIPR011701 - Major facilitator superfamily
IPR020846 - Major facilitator superfamily domain
IPR036259 - MFS transporter superfamily
IPR044770 - Protein spinster-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7034191.1 norA, partial [Cucurbita argyrosperma subsp. argyrosperma]1.7e-22078.96Show/hide
Query:  MKSEAVTLILVNLAGIMERADESLLPGVYKEVGAALHTDPTGLGSLTLFRSIVQCSCYPLAAYLAVHHNRAHVIALGAFLWAAATFLVALSSTFLQVAIS
        MKSEAVTLILVNLA IMERADESLLPGVYKEVGAALHTDPTGLGSLTLFRS+VQ SCYPLAAYLA+HHNRAHVIALGAFLWAAATFLVALSSTFLQVAIS
Subjt:  MKSEAVTLILVNLAGIMERADESLLPGVYKEVGAALHTDPTGLGSLTLFRSIVQCSCYPLAAYLAVHHNRAHVIALGAFLWAAATFLVALSSTFLQVAIS

Query:  RGLNGIGLAIVIPAIQSLVADSTDESNRGLAFGWLQLTGNLGSIIGGLCSVLIASTSFMGIPGWRISFHLVGLISVVVGILVWLFANDPRFSEIDERVKG
        RGLNGIGLAIV+P+IQSLVADSTD+SNRGLAFGWLQLTGNLGSIIGGL SVL+ASTSFMGIPGWRI+FHLVGLIS++VG+LVWLFA+DPRFSEID   K 
Subjt:  RGLNGIGLAIVIPAIQSLVADSTDESNRGLAFGWLQLTGNLGSIIGGLCSVLIASTSFMGIPGWRISFHLVGLISVVVGILVWLFANDPRFSEIDERVKG

Query:  QSRKSFW-----------------------------------------------LFSQETAFLWTLFVVAGSLGGLFGGRMGDILAKRLPNSGRIILSQI
        Q RK FW                                                  ++T FLWTLF++AGSLGGLFGGRMGDIL+KR PNSGRI+LSQI
Subjt:  QSRKSFW-----------------------------------------------LFSQETAFLWTLFVVAGSLGGLFGGRMGDILAKRLPNSGRIILSQI

Query:  SSGSAIPLAAILLLVLPDDPSTAFLHGLVLFIMGLSISWNAPATNNPIFAEIVPEKSRTSIYALDRSFESILSSFAPPVVGVLAQHVYGYKPIARGSSDS
        SSGSAIPLAAILLLVLPDDPSTAFLHGLVLFIMG S+SWN+PATNNPIFAEIVPEKSRTSIYALDRSFESILSSFAPPVVGVLAQHVYGYKP+ARGSSDS
Subjt:  SSGSAIPLAAILLLVLPDDPSTAFLHGLVLFIMGLSISWNAPATNNPIFAEIVPEKSRTSIYALDRSFESILSSFAPPVVGVLAQHVYGYKPIARGSSDS

Query:  VQIETDRENAKSLARALYTAIGIPMSLCCFIYSFLYCSYPRDRERARMHALIQSEMLQLEASSSPFRERDSEFQISEAKELDDKDRTEIDLIYEVEDSLD
         QIE DRENAKSLARALYTAIGIPMSLCCFIYSFLYCSYPRDRERARMHALI SEMLQLE++++          IS AK++DDKD+TEIDLIYE+ED LD
Subjt:  VQIETDRENAKSLARALYTAIGIPMSLCCFIYSFLYCSYPRDRERARMHALIQSEMLQLEASSSPFRERDSEFQISEAKELDDKDRTEIDLIYEVEDSLD

Query:  FNDNDERHLLQHQLKVSD
        FND+DE HLL HQL VSD
Subjt:  FNDNDERHLLQHQLKVSD

XP_022132543.1 uncharacterized protein LOC111005375 [Momordica charantia]5.1e-23083.08Show/hide
Query:  MKSEAVTLILVNLAGIMERADESLLPGVYKEVGAALHTDPTGLGSLTLFRSIVQCSCYPLAAYLAVHHNRAHVIALGAFLWAAATFLVALSSTFLQVAIS
        MKSEA TLILVNLA IMERADESLLPGVYKE+GAALHTDPTGLGSLTLFRS+VQ SCYPLAAYLAVHHNRAHVIALGAFLWAAATFLVALSSTFLQVAIS
Subjt:  MKSEAVTLILVNLAGIMERADESLLPGVYKEVGAALHTDPTGLGSLTLFRSIVQCSCYPLAAYLAVHHNRAHVIALGAFLWAAATFLVALSSTFLQVAIS

Query:  RGLNGIGLAIVIPAIQSLVADSTDESNRGLAFGWLQLTGNLGSIIGGLCSVLIASTSFMGIPGWRISFHLVGLISVVVGILVWLFANDPRFSEIDERVKG
        RGLNGIGLAIVIPAIQSLVADSTDESNRGLAFGWLQLTGNLGSIIGGL SVLIASTSFMGIPGWRI+FHLVGLISV+VG+LVWLFANDPRFSEID RVK 
Subjt:  RGLNGIGLAIVIPAIQSLVADSTDESNRGLAFGWLQLTGNLGSIIGGLCSVLIASTSFMGIPGWRISFHLVGLISVVVGILVWLFANDPRFSEIDERVKG

Query:  QSRKSFWL----------------------------------------------FSQE-TAFLWTLFVVAGSLGGLFGGRMGDILAKRLPNSGRIILSQI
        QSRK FW                                               FS E TAFLWTLF+VA SLGGLFGGRMGDI AKR PNSGRI+LSQI
Subjt:  QSRKSFWL----------------------------------------------FSQE-TAFLWTLFVVAGSLGGLFGGRMGDILAKRLPNSGRIILSQI

Query:  SSGSAIPLAAILLLVLPDDPSTAFLHGLVLFIMGLSISWNAPATNNPIFAEIVPEKSRTSIYALDRSFESILSSFAPPVVGVLAQHVYGYKPIARGSSDS
        SSGSAIPLAAILLLVLPDDPSTAFLHGLVLFIMGLSISWN PATNNPIFAEIVPEKSRTSIYALDRSFES+LSSFAPPVVGVLAQHVYGYKP+ RGSSDS
Subjt:  SSGSAIPLAAILLLVLPDDPSTAFLHGLVLFIMGLSISWNAPATNNPIFAEIVPEKSRTSIYALDRSFESILSSFAPPVVGVLAQHVYGYKPIARGSSDS

Query:  VQIETDRENAKSLARALYTAIGIPMSLCCFIYSFLYCSYPRDRERARMHALIQSEMLQLEASSSPFRERDSEFQISEAKELDDKDRTEIDLIYEVEDSLD
        VQIETDRENAKSLARALYTAIGIPMSLCCFIYSFLY SYPRDRERARMHALI+SEMLQLEAS+ PF E DS F ISEAK+L++KD+TEIDL+Y VEDSLD
Subjt:  VQIETDRENAKSLARALYTAIGIPMSLCCFIYSFLYCSYPRDRERARMHALIQSEMLQLEASSSPFRERDSEFQISEAKELDDKDRTEIDLIYEVEDSLD

Query:  FNDNDERHLLQHQLKVSDLK
         NDNDE+HLLQHQL  SDLK
Subjt:  FNDNDERHLLQHQLKVSDLK

XP_022950940.1 uncharacterized protein LOC111453885 [Cucurbita moschata]1.4e-21978.96Show/hide
Query:  MKSEAVTLILVNLAGIMERADESLLPGVYKEVGAALHTDPTGLGSLTLFRSIVQCSCYPLAAYLAVHHNRAHVIALGAFLWAAATFLVALSSTFLQVAIS
        MKSEAVTLILVNLA IMERADESLLPGVYKEVGAALHTDPTGLGSLTLFRS+VQ SCYPLAAYLA+HHNRAHVIALGAFLWAAATFLVALSSTFLQVAIS
Subjt:  MKSEAVTLILVNLAGIMERADESLLPGVYKEVGAALHTDPTGLGSLTLFRSIVQCSCYPLAAYLAVHHNRAHVIALGAFLWAAATFLVALSSTFLQVAIS

Query:  RGLNGIGLAIVIPAIQSLVADSTDESNRGLAFGWLQLTGNLGSIIGGLCSVLIASTSFMGIPGWRISFHLVGLISVVVGILVWLFANDPRFSEIDERVKG
        RGLNGIGLAIV+P+IQSLVADSTD+SNRGLAFGWLQLTGNLGSIIGGL SVL+ASTSFMGIPGWRI+FHLVGLISV+VG+LVWLFA+DPRFSEID   K 
Subjt:  RGLNGIGLAIVIPAIQSLVADSTDESNRGLAFGWLQLTGNLGSIIGGLCSVLIASTSFMGIPGWRISFHLVGLISVVVGILVWLFANDPRFSEIDERVKG

Query:  QSRKSFW-----------------------------------------------LFSQETAFLWTLFVVAGSLGGLFGGRMGDILAKRLPNSGRIILSQI
        Q RK FW                                                  ++T FLWTLF++AGSLGGLFGGRMGDIL+KR PNSGRI+LSQI
Subjt:  QSRKSFW-----------------------------------------------LFSQETAFLWTLFVVAGSLGGLFGGRMGDILAKRLPNSGRIILSQI

Query:  SSGSAIPLAAILLLVLPDDPSTAFLHGLVLFIMGLSISWNAPATNNPIFAEIVPEKSRTSIYALDRSFESILSSFAPPVVGVLAQHVYGYKPIARGSSDS
        SSGSAIPLAAILLLVLPDDPSTAFLHGLVLFIMG S+SWN+PATNNPIFAEIVPEKSRTSIYALDRSFESILSSFAPPVVGVLAQHVYGYKP+ARGSSDS
Subjt:  SSGSAIPLAAILLLVLPDDPSTAFLHGLVLFIMGLSISWNAPATNNPIFAEIVPEKSRTSIYALDRSFESILSSFAPPVVGVLAQHVYGYKPIARGSSDS

Query:  VQIETDRENAKSLARALYTAIGIPMSLCCFIYSFLYCSYPRDRERARMHALIQSEMLQLEASSSPFRERDSEFQISEAKELDDKDRTEIDLIYEVEDSLD
         QIE DRENAKSLARALYTAIGIPMSLCCFIYSFLYCSYPRDRERARMHALI SEMLQLE++++          IS AK++D KD+TEIDLIYE+ED LD
Subjt:  VQIETDRENAKSLARALYTAIGIPMSLCCFIYSFLYCSYPRDRERARMHALIQSEMLQLEASSSPFRERDSEFQISEAKELDDKDRTEIDLIYEVEDSLD

Query:  FNDNDERHLLQHQLKVSD
        FND+DE HLL HQL VSD
Subjt:  FNDNDERHLLQHQLKVSD

XP_023544539.1 uncharacterized protein LOC111804088 [Cucurbita pepo subsp. pepo]1.5e-21879.15Show/hide
Query:  MKSEAVTLILVNLAGIMERADESLLPGVYKEVGAALHTDPTGLGSLTLFRSIVQCSCYPLAAYLAVHHNRAHVIALGAFLWAAATFLVALSSTFLQVAIS
        MKSEA+TLILVNLA IMERADESLLPGVYKEVGAALHTDPTGLGSLTLFRS+VQ SCYPLAAYLA+HHNRAHVIALGAFLWAAATFLVALSSTFLQVAIS
Subjt:  MKSEAVTLILVNLAGIMERADESLLPGVYKEVGAALHTDPTGLGSLTLFRSIVQCSCYPLAAYLAVHHNRAHVIALGAFLWAAATFLVALSSTFLQVAIS

Query:  RGLNGIGLAIVIPAIQSLVADSTDESNRGLAFGWLQLTGNLGSIIGGLCSVLIASTSFMGIPGWRISFHLVGLISVVVGILVWLFANDPRFSEIDERVKG
        RGLNGIGLAIV+P+IQSLVADSTD+SNRGLAFGWLQLTGNLGSIIGGL SVL+ASTSFMGIPGWRI+FHLVGLISV+VG+LVWLFA+DPRFSEID   K 
Subjt:  RGLNGIGLAIVIPAIQSLVADSTDESNRGLAFGWLQLTGNLGSIIGGLCSVLIASTSFMGIPGWRISFHLVGLISVVVGILVWLFANDPRFSEIDERVKG

Query:  QSRKSFWL----------------------------------------------FSQE-TAFLWTLFVVAGSLGGLFGGRMGDILAKRLPNSGRIILSQI
        Q RK FW                                               FS E T FLWTLF++AGSLGGLFGGRMGDIL+KR PNSGRI+LSQI
Subjt:  QSRKSFWL----------------------------------------------FSQE-TAFLWTLFVVAGSLGGLFGGRMGDILAKRLPNSGRIILSQI

Query:  SSGSAIPLAAILLLVLPDDPSTAFLHGLVLFIMGLSISWNAPATNNPIFAEIVPEKSRTSIYALDRSFESILSSFAPPVVGVLAQHVYGYKPIARGSSDS
        SSGSAIPLAAILLLVLPD PSTAFLHGLVLFIMG S+SWN+PATNNPIFAEIVPEKSRTSIYALDRSFESILSSFAPPVVGVLAQHVYGYKP+ARGSSDS
Subjt:  SSGSAIPLAAILLLVLPDDPSTAFLHGLVLFIMGLSISWNAPATNNPIFAEIVPEKSRTSIYALDRSFESILSSFAPPVVGVLAQHVYGYKPIARGSSDS

Query:  VQIETDRENAKSLARALYTAIGIPMSLCCFIYSFLYCSYPRDRERARMHALIQSEMLQLEASSSPFRERDSEFQISEAKELDDKDRTEIDLIYEVEDSLD
         QIE DRENAKSLARALYTAIGIPMSLCCFIYSFLYCSYPRDRERARMHALI SEMLQLE+++       +   IS AK++DDKD+ EIDLIYE+E SLD
Subjt:  VQIETDRENAKSLARALYTAIGIPMSLCCFIYSFLYCSYPRDRERARMHALIQSEMLQLEASSSPFRERDSEFQISEAKELDDKDRTEIDLIYEVEDSLD

Query:  FNDNDERHLLQHQLKVSD
        FND+DE HLL HQL VSD
Subjt:  FNDNDERHLLQHQLKVSD

XP_038880865.1 uncharacterized protein LOC120072548 [Benincasa hispida]2.7e-22379.15Show/hide
Query:  MKSEAVTLILVNLAGIMERADESLLPGVYKEVGAALHTDPTGLGSLTLFRSIVQCSCYPLAAYLAVHHNRAHVIALGAFLWAAATFLVALSSTFLQVAIS
        MKSEAVTLILVNLA IMERADESLLPGVYKEVGAALHTDPTGLGSLTLFRS+VQ SCYPLAAYLAVHHNRAHVIALGAFLWA ATFLVALSSTF QVAIS
Subjt:  MKSEAVTLILVNLAGIMERADESLLPGVYKEVGAALHTDPTGLGSLTLFRSIVQCSCYPLAAYLAVHHNRAHVIALGAFLWAAATFLVALSSTFLQVAIS

Query:  RGLNGIGLAIVIPAIQSLVADSTDESNRGLAFGWLQLTGNLGSIIGGLCSVLIASTSFMGIPGWRISFHLVGLISVVVGILVWLFANDPRFSEIDERVKG
        RGLNGIGLAIVIPAIQSL+ADSTD+SNRGLAFGWLQLTGNLGSIIGGLCSVL+ASTSFMGIPGWRI+FHLVGLISV+VG+L+W FANDP FSEI+ R K 
Subjt:  RGLNGIGLAIVIPAIQSLVADSTDESNRGLAFGWLQLTGNLGSIIGGLCSVLIASTSFMGIPGWRISFHLVGLISVVVGILVWLFANDPRFSEIDERVKG

Query:  QSRKSFW---------------------LFSQ--------------------------ETAFLWTLFVVAGSLGGLFGGRMGDILAKRLPNSGRIILSQI
        Q RK FW                     + SQ                          +T FLWTLF++A SLGGLFGGR+GDIL+K  PNSGRI+LSQI
Subjt:  QSRKSFW---------------------LFSQ--------------------------ETAFLWTLFVVAGSLGGLFGGRMGDILAKRLPNSGRIILSQI

Query:  SSGSAIPLAAILLLVLPDDPSTAFLHGLVLFIMGLSISWNAPATNNPIFAEIVPEKSRTSIYALDRSFESILSSFAPPVVGVLAQHVYGYKPIARGSSDS
        SSGSA+PLAAILLLVLPD+PST FLHGL+LFIMG  +SWN+PATNNPIFAEIVPEKSRTSIYALDRSFESILSSFAPPVVGVLAQHVYGYKP ARGSSDS
Subjt:  SSGSAIPLAAILLLVLPDDPSTAFLHGLVLFIMGLSISWNAPATNNPIFAEIVPEKSRTSIYALDRSFESILSSFAPPVVGVLAQHVYGYKPIARGSSDS

Query:  VQIETDRENAKSLARALYTAIGIPMSLCCFIYSFLYCSYPRDRERARMHALIQSEMLQLEASSSPFRERDSEFQISEAKELDDKDRTEIDLIYEVEDSLD
         QIETDRENAKSLARALY AIG PMSLCCFIYSFLYCSYPRDRERARMHALI+SEMLQLE+++SP  ERDS+FQIS AK++D  D+TEIDLIYE+EDSLD
Subjt:  VQIETDRENAKSLARALYTAIGIPMSLCCFIYSFLYCSYPRDRERARMHALIQSEMLQLEASSSPFRERDSEFQISEAKELDDKDRTEIDLIYEVEDSLD

Query:  FNDNDERHLLQHQLKVSD
        FNDNDE+HLL HQL VSD
Subjt:  FNDNDERHLLQHQLKVSD

TrEMBL top hitse value%identityAlignment
A0A1S3B1J2 LOW QUALITY PROTEIN: uncharacterized protein LOC1034849775.9e-21677.24Show/hide
Query:  MKSEAVTLILVNLAGIMERADESLLPGVYKEVGAALHTDPTGLGSLTLFRSIVQCSCYPLAAYLAVHHNRAHVIALGAFLWAAATFLVALSSTFLQVAIS
        MK EAVTLILVNLA IMER D SLLPGVYKEVGAALH DPTGLGSLTLFRS+VQ SCYPLAAYLAVHHNRAHVIA+GAFLWAAATFLVA+SSTF QVAIS
Subjt:  MKSEAVTLILVNLAGIMERADESLLPGVYKEVGAALHTDPTGLGSLTLFRSIVQCSCYPLAAYLAVHHNRAHVIALGAFLWAAATFLVALSSTFLQVAIS

Query:  RGLNGIGLAIVIPAIQSLVADSTDESNRGLAFGWLQLTGNLGSIIGGLCSVLIASTSFMGIPGWRISFHLVGLISVVVGILVWLFANDPRFSEIDERVKG
        R LNGIGLAIVIPAIQSLVADSTD+SNRGLAFGWLQLTGNLGSIIGGLCS+L+ASTSFMGIPGWRISFHLVGLISV+VG+LVW+FANDP FSE + R K 
Subjt:  RGLNGIGLAIVIPAIQSLVADSTDESNRGLAFGWLQLTGNLGSIIGGLCSVLIASTSFMGIPGWRISFHLVGLISVVVGILVWLFANDPRFSEIDERVKG

Query:  QSRKSFWL----------------------------------------------FSQE-TAFLWTLFVVAGSLGGLFGGRMGDILAKRLPNSGRIILSQI
          RK  W                                               FS E T FLWTLF++A SLGG+FGGRMGDIL+KR PNSGRI+LSQI
Subjt:  QSRKSFWL----------------------------------------------FSQE-TAFLWTLFVVAGSLGGLFGGRMGDILAKRLPNSGRIILSQI

Query:  SSGSAIPLAAILLLVLPDDPSTAFLHGLVLFIMGLSISWNAPATNNPIFAEIVPEKSRTSIYALDRSFESILSSFAPPVVGVLAQHVYGYKPIARGSSDS
        SS SA+PLAAILLLVLPD+PST FLHGLVLFIMG S+SWNAPATNNPIFAEIVP+KSRTSIYALDRSFESILSSFAPPVVGVLAQHVYGYKP A+GS+DS
Subjt:  SSGSAIPLAAILLLVLPDDPSTAFLHGLVLFIMGLSISWNAPATNNPIFAEIVPEKSRTSIYALDRSFESILSSFAPPVVGVLAQHVYGYKPIARGSSDS

Query:  VQIETDRENAKSLARALYTAIGIPMSLCCFIYSFLYCSYPRDRERARMHALIQSEMLQLEASSSPFRERDSEFQISEAKELDDKDRTEIDLIYEVEDSLD
         QIETDRENAKSLARALY AIG PMSLCCFIY+FLYCSYPRDRERARMHALI+SEML LE+S+SP  E+D +F ISEAK+ DDKD+TE+DL YE+EDSLD
Subjt:  VQIETDRENAKSLARALYTAIGIPMSLCCFIYSFLYCSYPRDRERARMHALIQSEMLQLEASSSPFRERDSEFQISEAKELDDKDRTEIDLIYEVEDSLD

Query:  FNDNDERHLLQHQL
        F DNDE+HLL HQL
Subjt:  FNDNDERHLLQHQL

A0A5D3CMN2 Protein spinster1.2e-21677.43Show/hide
Query:  MKSEAVTLILVNLAGIMERADESLLPGVYKEVGAALHTDPTGLGSLTLFRSIVQCSCYPLAAYLAVHHNRAHVIALGAFLWAAATFLVALSSTFLQVAIS
        MK EAVTLILVNLA IMER DESLLPGVYKEVGAALH DPTGLGSLTLFRS+VQ SCYPLAAYLAVHHNRAHVIA+GAFLWAAATFLVA+SSTF QVAIS
Subjt:  MKSEAVTLILVNLAGIMERADESLLPGVYKEVGAALHTDPTGLGSLTLFRSIVQCSCYPLAAYLAVHHNRAHVIALGAFLWAAATFLVALSSTFLQVAIS

Query:  RGLNGIGLAIVIPAIQSLVADSTDESNRGLAFGWLQLTGNLGSIIGGLCSVLIASTSFMGIPGWRISFHLVGLISVVVGILVWLFANDPRFSEIDERVKG
        R LNGIGLAIVIPAIQSLVADSTD+SNRGLAFGWLQLTGNLGSIIGGLCS+L+ASTSFMGIPGWRISFHLVGLISV+VG+LVW+FANDP FSE + R K 
Subjt:  RGLNGIGLAIVIPAIQSLVADSTDESNRGLAFGWLQLTGNLGSIIGGLCSVLIASTSFMGIPGWRISFHLVGLISVVVGILVWLFANDPRFSEIDERVKG

Query:  QSRKSFWL----------------------------------------------FSQE-TAFLWTLFVVAGSLGGLFGGRMGDILAKRLPNSGRIILSQI
          RK  W                                               FS E T FLWTLF++A SLGG+FGGRMGDIL+KR PNSGRI+LSQI
Subjt:  QSRKSFWL----------------------------------------------FSQE-TAFLWTLFVVAGSLGGLFGGRMGDILAKRLPNSGRIILSQI

Query:  SSGSAIPLAAILLLVLPDDPSTAFLHGLVLFIMGLSISWNAPATNNPIFAEIVPEKSRTSIYALDRSFESILSSFAPPVVGVLAQHVYGYKPIARGSSDS
        SS SA+PLAAILLLVLPD+PST FLHGLVLFIMG S+SWNAPATNNPIFAEIVP+KSRTSIYALDRSFESILSSFAPPVVGVLAQHVYGYKP A+GS+DS
Subjt:  SSGSAIPLAAILLLVLPDDPSTAFLHGLVLFIMGLSISWNAPATNNPIFAEIVPEKSRTSIYALDRSFESILSSFAPPVVGVLAQHVYGYKPIARGSSDS

Query:  VQIETDRENAKSLARALYTAIGIPMSLCCFIYSFLYCSYPRDRERARMHALIQSEMLQLEASSSPFRERDSEFQISEAKELDDKDRTEIDLIYEVEDSLD
         QIETDRENAKSLARALY AIG PMSLCCFIY+FLYCSYPRDRERARMHALI+SEML LE+S+SP  E+D +F ISEAK+ DDKD+TE+DL YE+EDSLD
Subjt:  VQIETDRENAKSLARALYTAIGIPMSLCCFIYSFLYCSYPRDRERARMHALIQSEMLQLEASSSPFRERDSEFQISEAKELDDKDRTEIDLIYEVEDSLD

Query:  FNDNDERHLLQHQL
        F DNDE+HLL HQL
Subjt:  FNDNDERHLLQHQL

A0A6J1BWJ4 uncharacterized protein LOC1110053752.5e-23083.08Show/hide
Query:  MKSEAVTLILVNLAGIMERADESLLPGVYKEVGAALHTDPTGLGSLTLFRSIVQCSCYPLAAYLAVHHNRAHVIALGAFLWAAATFLVALSSTFLQVAIS
        MKSEA TLILVNLA IMERADESLLPGVYKE+GAALHTDPTGLGSLTLFRS+VQ SCYPLAAYLAVHHNRAHVIALGAFLWAAATFLVALSSTFLQVAIS
Subjt:  MKSEAVTLILVNLAGIMERADESLLPGVYKEVGAALHTDPTGLGSLTLFRSIVQCSCYPLAAYLAVHHNRAHVIALGAFLWAAATFLVALSSTFLQVAIS

Query:  RGLNGIGLAIVIPAIQSLVADSTDESNRGLAFGWLQLTGNLGSIIGGLCSVLIASTSFMGIPGWRISFHLVGLISVVVGILVWLFANDPRFSEIDERVKG
        RGLNGIGLAIVIPAIQSLVADSTDESNRGLAFGWLQLTGNLGSIIGGL SVLIASTSFMGIPGWRI+FHLVGLISV+VG+LVWLFANDPRFSEID RVK 
Subjt:  RGLNGIGLAIVIPAIQSLVADSTDESNRGLAFGWLQLTGNLGSIIGGLCSVLIASTSFMGIPGWRISFHLVGLISVVVGILVWLFANDPRFSEIDERVKG

Query:  QSRKSFWL----------------------------------------------FSQE-TAFLWTLFVVAGSLGGLFGGRMGDILAKRLPNSGRIILSQI
        QSRK FW                                               FS E TAFLWTLF+VA SLGGLFGGRMGDI AKR PNSGRI+LSQI
Subjt:  QSRKSFWL----------------------------------------------FSQE-TAFLWTLFVVAGSLGGLFGGRMGDILAKRLPNSGRIILSQI

Query:  SSGSAIPLAAILLLVLPDDPSTAFLHGLVLFIMGLSISWNAPATNNPIFAEIVPEKSRTSIYALDRSFESILSSFAPPVVGVLAQHVYGYKPIARGSSDS
        SSGSAIPLAAILLLVLPDDPSTAFLHGLVLFIMGLSISWN PATNNPIFAEIVPEKSRTSIYALDRSFES+LSSFAPPVVGVLAQHVYGYKP+ RGSSDS
Subjt:  SSGSAIPLAAILLLVLPDDPSTAFLHGLVLFIMGLSISWNAPATNNPIFAEIVPEKSRTSIYALDRSFESILSSFAPPVVGVLAQHVYGYKPIARGSSDS

Query:  VQIETDRENAKSLARALYTAIGIPMSLCCFIYSFLYCSYPRDRERARMHALIQSEMLQLEASSSPFRERDSEFQISEAKELDDKDRTEIDLIYEVEDSLD
        VQIETDRENAKSLARALYTAIGIPMSLCCFIYSFLY SYPRDRERARMHALI+SEMLQLEAS+ PF E DS F ISEAK+L++KD+TEIDL+Y VEDSLD
Subjt:  VQIETDRENAKSLARALYTAIGIPMSLCCFIYSFLYCSYPRDRERARMHALIQSEMLQLEASSSPFRERDSEFQISEAKELDDKDRTEIDLIYEVEDSLD

Query:  FNDNDERHLLQHQLKVSDLK
         NDNDE+HLLQHQL  SDLK
Subjt:  FNDNDERHLLQHQLKVSDLK

A0A6J1GH66 uncharacterized protein LOC1114538856.8e-22078.96Show/hide
Query:  MKSEAVTLILVNLAGIMERADESLLPGVYKEVGAALHTDPTGLGSLTLFRSIVQCSCYPLAAYLAVHHNRAHVIALGAFLWAAATFLVALSSTFLQVAIS
        MKSEAVTLILVNLA IMERADESLLPGVYKEVGAALHTDPTGLGSLTLFRS+VQ SCYPLAAYLA+HHNRAHVIALGAFLWAAATFLVALSSTFLQVAIS
Subjt:  MKSEAVTLILVNLAGIMERADESLLPGVYKEVGAALHTDPTGLGSLTLFRSIVQCSCYPLAAYLAVHHNRAHVIALGAFLWAAATFLVALSSTFLQVAIS

Query:  RGLNGIGLAIVIPAIQSLVADSTDESNRGLAFGWLQLTGNLGSIIGGLCSVLIASTSFMGIPGWRISFHLVGLISVVVGILVWLFANDPRFSEIDERVKG
        RGLNGIGLAIV+P+IQSLVADSTD+SNRGLAFGWLQLTGNLGSIIGGL SVL+ASTSFMGIPGWRI+FHLVGLISV+VG+LVWLFA+DPRFSEID   K 
Subjt:  RGLNGIGLAIVIPAIQSLVADSTDESNRGLAFGWLQLTGNLGSIIGGLCSVLIASTSFMGIPGWRISFHLVGLISVVVGILVWLFANDPRFSEIDERVKG

Query:  QSRKSFW-----------------------------------------------LFSQETAFLWTLFVVAGSLGGLFGGRMGDILAKRLPNSGRIILSQI
        Q RK FW                                                  ++T FLWTLF++AGSLGGLFGGRMGDIL+KR PNSGRI+LSQI
Subjt:  QSRKSFW-----------------------------------------------LFSQETAFLWTLFVVAGSLGGLFGGRMGDILAKRLPNSGRIILSQI

Query:  SSGSAIPLAAILLLVLPDDPSTAFLHGLVLFIMGLSISWNAPATNNPIFAEIVPEKSRTSIYALDRSFESILSSFAPPVVGVLAQHVYGYKPIARGSSDS
        SSGSAIPLAAILLLVLPDDPSTAFLHGLVLFIMG S+SWN+PATNNPIFAEIVPEKSRTSIYALDRSFESILSSFAPPVVGVLAQHVYGYKP+ARGSSDS
Subjt:  SSGSAIPLAAILLLVLPDDPSTAFLHGLVLFIMGLSISWNAPATNNPIFAEIVPEKSRTSIYALDRSFESILSSFAPPVVGVLAQHVYGYKPIARGSSDS

Query:  VQIETDRENAKSLARALYTAIGIPMSLCCFIYSFLYCSYPRDRERARMHALIQSEMLQLEASSSPFRERDSEFQISEAKELDDKDRTEIDLIYEVEDSLD
         QIE DRENAKSLARALYTAIGIPMSLCCFIYSFLYCSYPRDRERARMHALI SEMLQLE++++          IS AK++D KD+TEIDLIYE+ED LD
Subjt:  VQIETDRENAKSLARALYTAIGIPMSLCCFIYSFLYCSYPRDRERARMHALIQSEMLQLEASSSPFRERDSEFQISEAKELDDKDRTEIDLIYEVEDSLD

Query:  FNDNDERHLLQHQLKVSD
        FND+DE HLL HQL VSD
Subjt:  FNDNDERHLLQHQLKVSD

A0A6J1IQ70 uncharacterized protein LOC1114784403.7e-21878.38Show/hide
Query:  MKSEAVTLILVNLAGIMERADESLLPGVYKEVGAALHTDPTGLGSLTLFRSIVQCSCYPLAAYLAVHHNRAHVIALGAFLWAAATFLVALSSTFLQVAIS
        MKSEAVTLILVNLA IMERADESLLPGVYKEVGAALHTDPTGLGSLTLFRS+VQ SCYPLAAYLA+HHNRAHVIALGAFLWAAATFLVALSSTFLQVAIS
Subjt:  MKSEAVTLILVNLAGIMERADESLLPGVYKEVGAALHTDPTGLGSLTLFRSIVQCSCYPLAAYLAVHHNRAHVIALGAFLWAAATFLVALSSTFLQVAIS

Query:  RGLNGIGLAIVIPAIQSLVADSTDESNRGLAFGWLQLTGNLGSIIGGLCSVLIASTSFMGIPGWRISFHLVGLISVVVGILVWLFANDPRFSEIDERVKG
        RGLNGIGLAIV+P+IQSLVADSTD+SNRGLAFGWLQLTGNLGSIIGGL SVL+ASTS MGIPGWRI+FHLVGLISV+VG+LVWLFA+DPRFSEID   K 
Subjt:  RGLNGIGLAIVIPAIQSLVADSTDESNRGLAFGWLQLTGNLGSIIGGLCSVLIASTSFMGIPGWRISFHLVGLISVVVGILVWLFANDPRFSEIDERVKG

Query:  QSRKSFW-----------------------------------------------LFSQETAFLWTLFVVAGSLGGLFGGRMGDILAKRLPNSGRIILSQI
        + RK FW                                                  +ET FLWTLF++AGSLGGLFGGRMGDIL+KR PNSGRI+LSQI
Subjt:  QSRKSFW-----------------------------------------------LFSQETAFLWTLFVVAGSLGGLFGGRMGDILAKRLPNSGRIILSQI

Query:  SSGSAIPLAAILLLVLPDDPSTAFLHGLVLFIMGLSISWNAPATNNPIFAEIVPEKSRTSIYALDRSFESILSSFAPPVVGVLAQHVYGYKPIARGSSDS
        SSGSAIPLAAILLLVLPDDPSTAFLHGLVLFIMG S+SWN+PATNNPIFAEIVPEKSRTSIYALDRSFESILSSFAPPVVGVLAQHVYGYKP+ARGSSDS
Subjt:  SSGSAIPLAAILLLVLPDDPSTAFLHGLVLFIMGLSISWNAPATNNPIFAEIVPEKSRTSIYALDRSFESILSSFAPPVVGVLAQHVYGYKPIARGSSDS

Query:  VQIETDRENAKSLARALYTAIGIPMSLCCFIYSFLYCSYPRDRERARMHALIQSEMLQLEASSSPFRERDSEFQISEAKELDDKDRTEIDLIYEVEDSLD
         +IE DRENAKSLARALYTAIGIPMSLCCFIYSFLYCSYPRDRERARMHALI SEMLQLE++++      +   IS A ++DDKD+TEIDLIYE+E SLD
Subjt:  VQIETDRENAKSLARALYTAIGIPMSLCCFIYSFLYCSYPRDRERARMHALIQSEMLQLEASSSPFRERDSEFQISEAKELDDKDRTEIDLIYEVEDSLD

Query:  FNDNDERHLLQHQLKVSD
        FND+DE H L HQL VSD
Subjt:  FNDNDERHLLQHQLKVSD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G78130.1 Major facilitator superfamily protein3.8e-17568.68Show/hide
Query:  MKSEAVTLILVNLAGIMERADESLLPGVYKEVGAALHTDPTGLGSLTLFRSIVQCSCYPLAAYLAVHHNRAHVIALGAFLWAAATFLVALSSTFLQVAIS
        MK+E +TL+LVNLAGIMERADESLLPGVYKEVG ALHTDPTGLGSLTL RS+VQ +CYPLAAY+A+ HNRAHVIALGAFLW+AATFLVA SSTF QVA+S
Subjt:  MKSEAVTLILVNLAGIMERADESLLPGVYKEVGAALHTDPTGLGSLTLFRSIVQCSCYPLAAYLAVHHNRAHVIALGAFLWAAATFLVALSSTFLQVAIS

Query:  RGLNGIGLAIVIPAIQSLVADSTDESNRGLAFGWLQLTGNLGSIIGGLCSVLIASTSFMGIPGWRISFHLVGLISVVVGILVWLFANDPRFSEIDERVKG
        R LNGIGLA+V PAIQSLVADSTD++NRG AFGWLQLT N+GSI+GGLCSVLIA  +FMGIPGWR++FH+VG+ISV+VG+LV +FANDP F +    V  
Subjt:  RGLNGIGLAIVIPAIQSLVADSTDESNRGLAFGWLQLTGNLGSIIGGLCSVLIASTSFMGIPGWRISFHLVGLISVVVGILVWLFANDPRFSEIDERVKG

Query:  Q--SRKSF------------------------------------------WL----FSQ-ETAFLWTLFVVAGSLGGLFGGRMGDILAKRLPNSGRIILS
        Q  SRK F                                          WL    FS  +TAFL  LFV A SLGGLFGG+MGD L+ RLPNSGRIIL+
Subjt:  Q--SRKSF------------------------------------------WL----FSQ-ETAFLWTLFVVAGSLGGLFGGRMGDILAKRLPNSGRIILS

Query:  QISSGSAIPLAAILLLVLPDDPSTAFLHGLVLFIMGLSISWNAPATNNPIFAEIVPEKSRTSIYALDRSFESILSSFAPPVVGVLAQHVYGYKPIARGSS
        QISS SAIPLAAILLLVLPDDPSTA +HGL+L ++GL +SWNAPATNNPIFAEIVPEKSRTS+YALD+SFESILSSFAPP+VG+LAQHVYGYKPI  GSS
Subjt:  QISSGSAIPLAAILLLVLPDDPSTAFLHGLVLFIMGLSISWNAPATNNPIFAEIVPEKSRTSIYALDRSFESILSSFAPPVVGVLAQHVYGYKPIARGSS

Query:  DSVQIETDRENAKSLARALYTAIGIPMSLCCFIYSFLYCSYPRDRERARMHALIQSEMLQLEASSSPFRERDSEFQISE
         S +I TDRENA SLA+ALYT+IG+PM+ CCFIYSFLY SYP DR+RARM A I SEM +L   SS    RD EF   E
Subjt:  DSVQIETDRENAKSLARALYTAIGIPMSLCCFIYSFLYCSYPRDRERARMHALIQSEMLQLEASSSPFRERDSEFQISE

AT2G18590.1 Major facilitator superfamily protein6.5e-6632.62Show/hide
Query:  AVTLILVNLAGIMERADESLLPGVYKEVGAALHTDPTGLGSLTLFRSIVQCSCYPLAAYLAVHHNRAHVIALGAFLWAAATFLVALSSTFLQVAISRGLN
        +++LI++NLA +M+RADE L+P   KE+  A H   + +G L+  R+IVQ    PLA   A+ ++R  V A G+F W ++T    +S  F+QV +    N
Subjt:  AVTLILVNLAGIMERADESLLPGVYKEVGAALHTDPTGLGSLTLFRSIVQCSCYPLAAYLAVHHNRAHVIALGAFLWAAATFLVALSSTFLQVAISRGLN

Query:  GIGLAIVIPAIQSLVADSTDESNRGLAFGWLQLTGNLGSIIGGLCSVLIASTSFMGIPGWRISFHLVGLISVVVGILVWLFANDPRFS------------
        G+G AIV P +QS++ADS  ES+RG  FG   L G +G I G +   ++A   F GI GWR +F L   +S +VGILV+ F +DPR              
Subjt:  GIGLAIVIPAIQSLVADSTDESNRGLAFGWLQLTGNLGSIIGGLCSVLIASTSFMGIPGWRISFHLVGLISVVVGILVWLFANDPRFS------------

Query:  -EIDERVKGQSRKS----------------------------------------FWLF--------SQETAFLWTLFVVAGSLGGLFGGRMGDILAKRLP
         E DE   G   +S                                        FW            + A L  +F    ++G L GG + D +++  P
Subjt:  -EIDERVKGQSRKS----------------------------------------FWLF--------SQETAFLWTLFVVAGSLGGLFGGRMGDILAKRLP

Query:  NSGRIILSQISSGSAIPLAAILLLVLPDDPSTAFLHGLVLFIMGLSISWNAPATNNPIFAEIVPEKSRTSIYALDRSFESILSSFAPPVVGVLAQHVYGY
        NSGR+I +Q S       + +LL ++P   ++ ++  + LF+MGL+I+W  PA N+PI AEIVP K RT +YA DR+ E   SSF  P+VG++++ ++G+
Subjt:  NSGRIILSQISSGSAIPLAAILLLVLPDDPSTAFLHGLVLFIMGLSISWNAPATNNPIFAEIVPEKSRTSIYALDRSFESILSSFAPPVVGVLAQHVYGY

Query:  KPIARGSSDSVQIETDRENAKSLARALYTAIGIPMSLCCFIYSFLYCSYPRDRERARMHALIQSEM
           A+G      +      A++L + +   + +P  LCC  Y+ L+  + +DR+  R  +  + EM
Subjt:  KPIARGSSDSVQIETDRENAKSLARALYTAIGIPMSLCCFIYSFLYCSYPRDRERARMHALIQSEM

AT4G36790.1 Major facilitator superfamily protein2.6e-8339.16Show/hide
Query:  AVTLILVNLAGIMERADESLLPGVYKEVGAALHTDPTGLGSLTLFRSIVQCSCYPLAAYLAVHHNRAHVIALGAFLWAAATFLVALSSTFLQVAISRGLN
        +++LIL+NLA IMERADE+LLP VYKEV  A +  P+ LG LT  R+ VQ    PLA  L + ++R  V+A+G F WA +T  V  SS F+QVA+ R +N
Subjt:  AVTLILVNLAGIMERADESLLPGVYKEVGAALHTDPTGLGSLTLFRSIVQCSCYPLAAYLAVHHNRAHVIALGAFLWAAATFLVALSSTFLQVAISRGLN

Query:  GIGLAIVIPAIQSLVADSTDESNRGLAFGWLQLTGNLGSIIGGLCSVLIASTSFMGIPGWRISFHLVGLISVVVGILVWLFANDPRFSEIDERVKGQSRK
        G GLAIVIPA+QS +ADS  +  RG  FG L L G +G I GG+ + ++A + F GIPGWR +F ++  +S V+G+LV+LF  DPR +   E +      
Subjt:  GIGLAIVIPAIQSLVADSTDESNRGLAFGWLQLTGNLGSIIGGLCSVLIASTSFMGIPGWRISFHLVGLISVVVGILVWLFANDPRFSEIDERVKGQSRK

Query:  SFWLFSQETA----------------------FLWT------------------------LFVVAGSLGGLFGGRMGDILAKRLPNSGRIILSQISSGSA
        S  +++   A                      F WT                        +F   G++G L GG + D +++  PNSGR++ +Q S+   
Subjt:  SFWLFSQETA----------------------FLWT------------------------LFVVAGSLGGLFGGRMGDILAKRLPNSGRIILSQISSGSA

Query:  IPLAAILLLVLPDDPSTAFLHGLVLFIMGLSISWNAPATNNPIFAEIVPEKSRTSIYALDRSFESILSSFAPPVVGVLAQHVYGYKPIARGSSDSVQIET
        IP + ILL V+P   S+  +  + LF+MGL+I+W   A N P+FAE+VP + RT IYA DR+FE   SSFA P+VG+L++ ++GY   +RG  D ++  +
Subjt:  IPLAAILLLVLPDDPSTAFLHGLVLFIMGLSISWNAPATNNPIFAEIVPEKSRTSIYALDRSFESILSSFAPPVVGVLAQHVYGYKPIARGSSDSVQIET

Query:  DRENAKSLARALYTAIGIPMSLCCFIYSFLYCSYPRDRERARMHALIQSEML
         RE A +L++ L + + +P  LCC  Y+ L+  + +DRE A++ +  ++EM+
Subjt:  DRENAKSLARALYTAIGIPMSLCCFIYSFLYCSYPRDRERARMHALIQSEML

AT5G10190.1 Major facilitator superfamily protein7.5e-16365.15Show/hide
Query:  MKSEAVTLILVNLAGIMERADESLLPGVYKEVGAALHTDPTGLGSLTLFRSIVQCSCYPLAAYLAVHHNRAHVIALGAFLWAAATFLVALSSTFLQVAIS
        MKSE +TL+LV LAGIMERADESLLPGVYKEVG ALH DPT LG+LTLFRSIVQ SCYPLAAYL+  HNRAHVIALGAFLWA ATFLVA+S+TF QVA+S
Subjt:  MKSEAVTLILVNLAGIMERADESLLPGVYKEVGAALHTDPTGLGSLTLFRSIVQCSCYPLAAYLAVHHNRAHVIALGAFLWAAATFLVALSSTFLQVAIS

Query:  RGLNGIGLAIVIPAIQSLVADSTDESNRGLAFGWLQLTGNLGSIIGGLCSVLIASTSFMGIPGWRISFHLVGLISVVVGILVWLFANDPRFSE--IDERV
        RGLNGIGLAIV PAIQSLVADSTD+ NRG+AFGWL  T N+GSI+G +CS+L AS SF G+ GWRI+F LV ++SV+VGILV LFA DP +S+  I + V
Subjt:  RGLNGIGLAIVIPAIQSLVADSTDESNRGLAFGWLQLTGNLGSIIGGLCSVLIASTSFMGIPGWRISFHLVGLISVVVGILVWLFANDPRFSE--IDERV

Query:  KGQSRKS---------------------------------------FWL----FSQE-TAFLWTLFVVAGSLGGLFGGRMGDILAKRLPNSGRIILSQIS
        K +   S                                        WL    FS + TA L TLF ++ SLGGLFGG MGD LAK+ PN GRI LSQ+S
Subjt:  KGQSRKS---------------------------------------FWL----FSQE-TAFLWTLFVVAGSLGGLFGGRMGDILAKRLPNSGRIILSQIS

Query:  SGSAIPLAAILLLVLPDDPSTAFLHGLVLFIMGLSISWNAPATNNPIFAEIVPEKSRTSIYALDRSFESILSSFAPPVVGVLAQHVYGYKPIARGSSDSV
        SGSAIPLAAILL+ LPDDPSTAF HGLVL IMGL ISWN  ATN PIFAEIVPE++RTSIYALDRSFESIL+SFAPP+VG+LAQ++YGYKPI  GS+ SV
Subjt:  SGSAIPLAAILLLVLPDDPSTAFLHGLVLFIMGLSISWNAPATNNPIFAEIVPEKSRTSIYALDRSFESILSSFAPPVVGVLAQHVYGYKPIARGSSDSV

Query:  QIETDRENAKSLARALYTAIGIPMSLCCFIYSFLYCSYPRDRERARMHALIQSEMLQLEASSSPFRERDSEFQISEAKELDD
        +I+TDR NA SLA+ALYT+IGIPM +CC IYSFLYC+YPRDR+RA+M ALI+SEM QL        E + E +   A+E D+
Subjt:  QIETDRENAKSLARALYTAIGIPMSLCCFIYSFLYCSYPRDRERARMHALIQSEMLQLEASSSPFRERDSEFQISEAKELDD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAATCTGAAGCAGTGACGTTGATATTGGTGAATTTGGCTGGTATAATGGAGAGGGCCGATGAGTCCTTGTTGCCTGGAGTCTACAAGGAGGTTGGGGCCGCTCTGCA
CACCGATCCAACTGGCTTGGGTTCCCTTACTCTCTTCAGATCTATAGTGCAGTGTTCGTGTTACCCCTTAGCTGCTTACTTAGCTGTGCATCACAACCGCGCCCACGTCA
TTGCTCTTGGTGCTTTTCTCTGGGCCGCCGCCACTTTCCTCGTCGCCCTTTCCTCCACATTCTTACAGGTGGCAATTTCCAGAGGTTTAAATGGTATTGGGCTTGCCATA
GTGATACCTGCCATTCAGTCCCTCGTTGCTGACTCAACCGATGAAAGCAACCGTGGCTTGGCTTTTGGATGGCTACAACTAACAGGAAATCTTGGTTCCATCATTGGTGG
GCTTTGTTCTGTATTAATAGCCTCCACATCTTTCATGGGAATCCCGGGATGGAGAATCTCCTTCCATCTAGTTGGATTGATAAGTGTCGTAGTTGGTATACTAGTATGGC
TTTTTGCTAATGATCCACGCTTTTCTGAGATTGACGAAAGAGTTAAGGGTCAATCACGCAAGTCATTCTGGCTTTTCTCACAAGAAACAGCATTCCTTTGGACTCTCTTT
GTAGTTGCTGGTTCACTTGGTGGTCTTTTTGGAGGAAGGATGGGGGATATCTTAGCAAAACGCCTTCCTAATTCAGGAAGAATAATTCTGTCTCAGATAAGTTCTGGTTC
TGCAATTCCTCTAGCTGCAATTTTGCTGCTGGTTTTGCCTGACGATCCATCCACAGCATTCTTGCATGGACTGGTCTTGTTCATAATGGGTTTGAGCATATCATGGAATG
CACCAGCAACTAACAATCCAATATTTGCAGAGATAGTCCCAGAGAAGTCCCGCACAAGCATCTACGCTTTGGATCGATCATTTGAGTCCATACTGTCCTCCTTTGCTCCT
CCTGTTGTTGGAGTTCTGGCTCAGCATGTTTATGGATATAAACCAATTGCAAGAGGATCCTCAGACTCTGTCCAGATTGAAACCGATAGAGAGAATGCAAAATCATTAGC
CAGGGCACTCTACACAGCCATTGGCATTCCGATGTCCCTGTGTTGCTTCATCTACTCTTTCCTATATTGCTCATATCCAAGAGACCGAGAGCGAGCAAGAATGCACGCCC
TGATACAGTCTGAAATGCTGCAGCTGGAGGCAAGCAGTTCACCTTTCCGTGAGCGGGACAGTGAGTTTCAGATTTCAGAGGCAAAAGAACTTGATGATAAGGATCGAACA
GAGATTGACCTGATCTATGAAGTTGAGGACAGTCTTGATTTCAATGACAATGATGAAAGACATCTTCTTCAACACCAGCTAAAAGTTTCTGATTTGAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGAAATCTGAAGCAGTGACGTTGATATTGGTGAATTTGGCTGGTATAATGGAGAGGGCCGATGAGTCCTTGTTGCCTGGAGTCTACAAGGAGGTTGGGGCCGCTCTGCA
CACCGATCCAACTGGCTTGGGTTCCCTTACTCTCTTCAGATCTATAGTGCAGTGTTCGTGTTACCCCTTAGCTGCTTACTTAGCTGTGCATCACAACCGCGCCCACGTCA
TTGCTCTTGGTGCTTTTCTCTGGGCCGCCGCCACTTTCCTCGTCGCCCTTTCCTCCACATTCTTACAGGTGGCAATTTCCAGAGGTTTAAATGGTATTGGGCTTGCCATA
GTGATACCTGCCATTCAGTCCCTCGTTGCTGACTCAACCGATGAAAGCAACCGTGGCTTGGCTTTTGGATGGCTACAACTAACAGGAAATCTTGGTTCCATCATTGGTGG
GCTTTGTTCTGTATTAATAGCCTCCACATCTTTCATGGGAATCCCGGGATGGAGAATCTCCTTCCATCTAGTTGGATTGATAAGTGTCGTAGTTGGTATACTAGTATGGC
TTTTTGCTAATGATCCACGCTTTTCTGAGATTGACGAAAGAGTTAAGGGTCAATCACGCAAGTCATTCTGGCTTTTCTCACAAGAAACAGCATTCCTTTGGACTCTCTTT
GTAGTTGCTGGTTCACTTGGTGGTCTTTTTGGAGGAAGGATGGGGGATATCTTAGCAAAACGCCTTCCTAATTCAGGAAGAATAATTCTGTCTCAGATAAGTTCTGGTTC
TGCAATTCCTCTAGCTGCAATTTTGCTGCTGGTTTTGCCTGACGATCCATCCACAGCATTCTTGCATGGACTGGTCTTGTTCATAATGGGTTTGAGCATATCATGGAATG
CACCAGCAACTAACAATCCAATATTTGCAGAGATAGTCCCAGAGAAGTCCCGCACAAGCATCTACGCTTTGGATCGATCATTTGAGTCCATACTGTCCTCCTTTGCTCCT
CCTGTTGTTGGAGTTCTGGCTCAGCATGTTTATGGATATAAACCAATTGCAAGAGGATCCTCAGACTCTGTCCAGATTGAAACCGATAGAGAGAATGCAAAATCATTAGC
CAGGGCACTCTACACAGCCATTGGCATTCCGATGTCCCTGTGTTGCTTCATCTACTCTTTCCTATATTGCTCATATCCAAGAGACCGAGAGCGAGCAAGAATGCACGCCC
TGATACAGTCTGAAATGCTGCAGCTGGAGGCAAGCAGTTCACCTTTCCGTGAGCGGGACAGTGAGTTTCAGATTTCAGAGGCAAAAGAACTTGATGATAAGGATCGAACA
GAGATTGACCTGATCTATGAAGTTGAGGACAGTCTTGATTTCAATGACAATGATGAAAGACATCTTCTTCAACACCAGCTAAAAGTTTCTGATTTGAAATGA
Protein sequenceShow/hide protein sequence
MKSEAVTLILVNLAGIMERADESLLPGVYKEVGAALHTDPTGLGSLTLFRSIVQCSCYPLAAYLAVHHNRAHVIALGAFLWAAATFLVALSSTFLQVAISRGLNGIGLAI
VIPAIQSLVADSTDESNRGLAFGWLQLTGNLGSIIGGLCSVLIASTSFMGIPGWRISFHLVGLISVVVGILVWLFANDPRFSEIDERVKGQSRKSFWLFSQETAFLWTLF
VVAGSLGGLFGGRMGDILAKRLPNSGRIILSQISSGSAIPLAAILLLVLPDDPSTAFLHGLVLFIMGLSISWNAPATNNPIFAEIVPEKSRTSIYALDRSFESILSSFAP
PVVGVLAQHVYGYKPIARGSSDSVQIETDRENAKSLARALYTAIGIPMSLCCFIYSFLYCSYPRDRERARMHALIQSEMLQLEASSSPFRERDSEFQISEAKELDDKDRT
EIDLIYEVEDSLDFNDNDERHLLQHQLKVSDLK