; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr016933 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr016933
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionTy3-gypsy retrotransposon protein
Genome locationtig00153016:720257..732615
RNA-Seq ExpressionSgr016933
SyntenySgr016933
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6579038.1 hypothetical protein SDJN03_23486, partial [Cucurbita argyrosperma subsp. sororia]1.0e-24978.24Show/hide
Query:  MNHSISSDELDHLSLEVRQKMLLEKTQRFLWAIFRLCNDLLFKKEDECCDLQGVSSVISACDARNLGDQQLEAQINDTSGHLEDCYSEGARLNIEKQISC
        MN  ISSDELDHLSL VR+KML E     L    +  +  + KKE+ECCDLQG S++ISACDARNLGDQQLEAQINDT+GH  D YSEGARLN E     
Subjt:  MNHSISSDELDHLSLEVRQKMLLEKTQRFLWAIFRLCNDLLFKKEDECCDLQGVSSVISACDARNLGDQQLEAQINDTSGHLEDCYSEGARLNIEKQISC

Query:  TMAFENPTPPEVPDWVRVESTGILPGFLADGVDNFASAGVAVTKVKNETFDDFDEDLDHVVLIERLRMLLSRRALGFMNHHVEGSSGVSSGNLIQCFLKQ
           FENPTPPEV D VRVEST IL G LA GVDNFA AGVAVTKVKNE FDDF+EDLDHV+LIERLRMLLSRRALG MN HVEG SGV SG+L+QCFLKQ
Subjt:  TMAFENPTPPEVPDWVRVESTGILPGFLADGVDNFASAGVAVTKVKNETFDDFDEDLDHVVLIERLRMLLSRRALGFMNHHVEGSSGVSSGNLIQCFLKQ

Query:  KEKFMFANGELMGIGNALHDKIGSDAPRLCIPSVICSPNTAFSGSCFSSDHSLNKSTESGNDMELKEDDKICSSEKVATELGPRLLTDHVPEVNLFNSTK
        K K MFA+ E M IGN LHDK GS APR C PSV+CSPN   SGS FSS+HSLNKSTESGNDMELKE DKICSSEKVATELG R LT+HVP+ NL +STK
Subjt:  KEKFMFANGELMGIGNALHDKIGSDAPRLCIPSVICSPNTAFSGSCFSSDHSLNKSTESGNDMELKEDDKICSSEKVATELGPRLLTDHVPEVNLFNSTK

Query:  VKDEPYDHVDGCNLYDKDTKNICSRILSIKSETIMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEPSSLMNIKHRRKRKK
        VKDEPYDH +GC++Y KD  N+ S  LSIKSET MPDEPYENKVDDM LQDRMKFFSSRK  G TS DYEHPKPSDPGCS LVSEP +  N K RRK+KK
Subjt:  VKDEPYDHVDGCNLYDKDTKNICSRILSIKSETIMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEPSSLMNIKHRRKRKK

Query:  TATNSVETALEEDAPGLLQILVGKGVEIDEIKLYGEMESDDDLHESFSEDSFGELEAVISRLFSQRHSFLKFPS-IRCTKASRASYCLACLVSLIEQTRY
        TATNS+ETALEEDAPGLLQILV KG+++DEIKLYGE ESDDDL ES SEDSF ELE VI+RLF QRHSFLKFPS IRCTKASRASYCLACLVSLIEQTRY
Subjt:  TATNSVETALEEDAPGLLQILVGKGVEIDEIKLYGEMESDDDLHESFSEDSFGELEAVISRLFSQRHSFLKFPS-IRCTKASRASYCLACLVSLIEQTRY

Query:  LHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDSLPIDWQIKRLVIALKLTSCSRISLIENKPLL
        LHFR+WPVEWGWCRDLQSFIFVFERHKRIV+ERPEYGYATYFFELV+SLPI WQIKRLVIA+KLT+CSRISL+EN+PLL
Subjt:  LHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDSLPIDWQIKRLVIALKLTSCSRISLIENKPLL

XP_022141523.1 uncharacterized protein LOC111011878 isoform X1 [Momordica charantia]4.7e-26680.55Show/hide
Query:  MNHSISSDELDHLSLEVRQKMLLEKTQRFL----WAIFRLCNDLLFKKEDECCDLQGVSSVISACDARNLGDQQLEAQINDTSGHLEDCYSEGARLNIEK
        MNH  S DELDHLSL  RQKMLLE     L      I  LCND + K+EDECCD+QGVSS+IS  D  NLG QQLE Q+ DTSGHLED Y+E ARLN EK
Subjt:  MNHSISSDELDHLSLEVRQKMLLEKTQRFL----WAIFRLCNDLLFKKEDECCDLQGVSSVISACDARNLGDQQLEAQINDTSGHLEDCYSEGARLNIEK

Query:  QISCTMAFENPTPPEVPDWVRVESTGILPGFLADGVDNFASAGVAVTKVKNETFDDFDEDLDHVVLIERLRMLLSRRALGFMNHHVEGSSGVSSGNLIQC
        QISCTM FENPTPPEVPDWVRVESTGIL G L DGVDNF SAGVAVTKVKNE FDDF+EDLDHVV IERLRMLLSR+ALG MN HVEG SG SSG+ +QC
Subjt:  QISCTMAFENPTPPEVPDWVRVESTGILPGFLADGVDNFASAGVAVTKVKNETFDDFDEDLDHVVLIERLRMLLSRRALGFMNHHVEGSSGVSSGNLIQC

Query:  FLKQKEKFMFANGELMGIGNALHDKIGSDAPRLCIPSVICSPNTAFSGSCFSSDHSLNKSTESGNDMELKEDDKICSSEKVATELGPRLLTDHVPEVNLF
        FLKQK K MF+N EL G  N LHD+ G DAP L  PSV+CSP    SGS FSS+ SLNK TESGNDMELKEDD+IC SEKV TELG RLLT+H PE NLF
Subjt:  FLKQKEKFMFANGELMGIGNALHDKIGSDAPRLCIPSVICSPNTAFSGSCFSSDHSLNKSTESGNDMELKEDDKICSSEKVATELGPRLLTDHVPEVNLF

Query:  NSTKVKDEPYDHVDGCNLYDKDTKNICSRILSIKSETIMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEPSSLMNIKHRR
         STKVKDEPYDHVDGCNL+ KD  N+CSRILS+KSET MPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEP+SLMN+K RR
Subjt:  NSTKVKDEPYDHVDGCNLYDKDTKNICSRILSIKSETIMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEPSSLMNIKHRR

Query:  KRKKTATNSVETALEEDAPGLL----QILVGKGVEIDEIKLYGEMESDDDLHESFSEDSFGELEAVISRLFSQRHSFLKFPSIRCTKASRASYCLACLVS
        K K+TATNS+ETALEEDAPGLL    QILV KGV +DEIKLYGEMESDDDL ESFSE+SFGELEAVISRLFSQR SFLKFP IRCTKASR+SYCLACLVS
Subjt:  KRKKTATNSVETALEEDAPGLL----QILVGKGVEIDEIKLYGEMESDDDLHESFSEDSFGELEAVISRLFSQRHSFLKFPSIRCTKASRASYCLACLVS

Query:  LIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDSLPIDWQIKRLVIALKLTSCSRISLIENKPLL
        LIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELV+ LPI WQIKRLVIALKLT+CSRISL+EN+PLL
Subjt:  LIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDSLPIDWQIKRLVIALKLTSCSRISLIENKPLL

XP_022141525.1 uncharacterized protein LOC111011878 isoform X2 [Momordica charantia]8.5e-26881.1Show/hide
Query:  MNHSISSDELDHLSLEVRQKMLLEKTQRFL----WAIFRLCNDLLFKKEDECCDLQGVSSVISACDARNLGDQQLEAQINDTSGHLEDCYSEGARLNIEK
        MNH  S DELDHLSL  RQKMLLE     L      I  LCND + K+EDECCD+QGVSS+IS  D  NLG QQLE Q+ DTSGHLED Y+E ARLN EK
Subjt:  MNHSISSDELDHLSLEVRQKMLLEKTQRFL----WAIFRLCNDLLFKKEDECCDLQGVSSVISACDARNLGDQQLEAQINDTSGHLEDCYSEGARLNIEK

Query:  QISCTMAFENPTPPEVPDWVRVESTGILPGFLADGVDNFASAGVAVTKVKNETFDDFDEDLDHVVLIERLRMLLSRRALGFMNHHVEGSSGVSSGNLIQC
        QISCTM FENPTPPEVPDWVRVESTGIL G L DGVDNF SAGVAVTKVKNE FDDF+EDLDHVV IERLRMLLSR+ALG MN HVEG SG SSG+ +QC
Subjt:  QISCTMAFENPTPPEVPDWVRVESTGILPGFLADGVDNFASAGVAVTKVKNETFDDFDEDLDHVVLIERLRMLLSRRALGFMNHHVEGSSGVSSGNLIQC

Query:  FLKQKEKFMFANGELMGIGNALHDKIGSDAPRLCIPSVICSPNTAFSGSCFSSDHSLNKSTESGNDMELKEDDKICSSEKVATELGPRLLTDHVPEVNLF
        FLKQK K MF+N EL G  N LHD+ G DAP L  PSV+CSP    SGS FSS+ SLNK TESGNDMELKEDD+IC SEKV TELG RLLT+H PE NLF
Subjt:  FLKQKEKFMFANGELMGIGNALHDKIGSDAPRLCIPSVICSPNTAFSGSCFSSDHSLNKSTESGNDMELKEDDKICSSEKVATELGPRLLTDHVPEVNLF

Query:  NSTKVKDEPYDHVDGCNLYDKDTKNICSRILSIKSETIMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEPSSLMNIKHRR
         STKVKDEPYDHVDGCNL+ KD  N+CSRILS+KSET MPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEP+SLMN+K RR
Subjt:  NSTKVKDEPYDHVDGCNLYDKDTKNICSRILSIKSETIMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEPSSLMNIKHRR

Query:  KRKKTATNSVETALEEDAPGLLQILVGKGVEIDEIKLYGEMESDDDLHESFSEDSFGELEAVISRLFSQRHSFLKFPSIRCTKASRASYCLACLVSLIEQ
        K K+TATNS+ETALEEDAPGLLQILV KGV +DEIKLYGEMESDDDL ESFSE+SFGELEAVISRLFSQR SFLKFP IRCTKASR+SYCLACLVSLIEQ
Subjt:  KRKKTATNSVETALEEDAPGLLQILVGKGVEIDEIKLYGEMESDDDLHESFSEDSFGELEAVISRLFSQRHSFLKFPSIRCTKASRASYCLACLVSLIEQ

Query:  TRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDSLPIDWQIKRLVIALKLTSCSRISLIENKPLL
        TRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELV+ LPI WQIKRLVIALKLT+CSRISL+EN+PLL
Subjt:  TRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDSLPIDWQIKRLVIALKLTSCSRISLIENKPLL

XP_022141526.1 uncharacterized protein LOC111011878 isoform X3 [Momordica charantia]8.3e-24782.7Show/hide
Query:  VISACDARNLGDQQLEAQINDTSGHLEDCYSEGARLNIEKQISCTMAFENPTPPEVPDWVRVESTGILPGFLADGVDNFASAGVAVTKVKNETFDDFDED
        +IS  D  NLG QQLE Q+ DTSGHLED Y+E ARLN EKQISCTM FENPTPPEVPDWVRVESTGIL G L DGVDNF SAGVAVTKVKNE FDDF+ED
Subjt:  VISACDARNLGDQQLEAQINDTSGHLEDCYSEGARLNIEKQISCTMAFENPTPPEVPDWVRVESTGILPGFLADGVDNFASAGVAVTKVKNETFDDFDED

Query:  LDHVVLIERLRMLLSRRALGFMNHHVEGSSGVSSGNLIQCFLKQKEKFMFANGELMGIGNALHDKIGSDAPRLCIPSVICSPNTAFSGSCFSSDHSLNKS
        LDHVV IERLRMLLSR+ALG MN HVEG SG SSG+ +QCFLKQK K MF+N EL G  N LHD+ G DAP L  PSV+CSP    SGS FSS+ SLNK 
Subjt:  LDHVVLIERLRMLLSRRALGFMNHHVEGSSGVSSGNLIQCFLKQKEKFMFANGELMGIGNALHDKIGSDAPRLCIPSVICSPNTAFSGSCFSSDHSLNKS

Query:  TESGNDMELKEDDKICSSEKVATELGPRLLTDHVPEVNLFNSTKVKDEPYDHVDGCNLYDKDTKNICSRILSIKSETIMPDEPYENKVDDMRLQDRMKFF
        TESGNDMELKEDD+IC SEKV TELG RLLT+H PE NLF STKVKDEPYDHVDGCNL+ KD  N+CSRILS+KSET MPDEPYENKVDDMRLQDRMKFF
Subjt:  TESGNDMELKEDDKICSSEKVATELGPRLLTDHVPEVNLFNSTKVKDEPYDHVDGCNLYDKDTKNICSRILSIKSETIMPDEPYENKVDDMRLQDRMKFF

Query:  SSRKVFGSTSRDYEHPKPSDPGCSSLVSEPSSLMNIKHRRKRKKTATNSVETALEEDAPGLL----QILVGKGVEIDEIKLYGEMESDDDLHESFSEDSF
        SSRKVFGSTSRDYEHPKPSDPGCSSLVSEP+SLMN+K RRK K+TATNS+ETALEEDAPGLL    QILV KGV +DEIKLYGEMESDDDL ESFSE+SF
Subjt:  SSRKVFGSTSRDYEHPKPSDPGCSSLVSEPSSLMNIKHRRKRKKTATNSVETALEEDAPGLL----QILVGKGVEIDEIKLYGEMESDDDLHESFSEDSF

Query:  GELEAVISRLFSQRHSFLKFPSIRCTKASRASYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDSLPIDW
        GELEAVISRLFSQR SFLKFP IRCTKASR+SYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELV+ LPI W
Subjt:  GELEAVISRLFSQRHSFLKFPSIRCTKASRASYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDSLPIDW

Query:  QIKRLVIALKLTSCSRISLIENKPLL
        QIKRLVIALKLT+CSRISL+EN+PLL
Subjt:  QIKRLVIALKLTSCSRISLIENKPLL

XP_022939493.1 uncharacterized protein LOC111445382 isoform X1 [Cucurbita moschata]8.8e-24978.07Show/hide
Query:  MNHSISSDELDHLSLEVRQKMLLEKTQRFLWAIFRLCNDLLFKKEDECCDLQGVSSVISACDARNLGDQQLEAQINDTSGHLEDCYSEGARLNIEKQISC
        MN  ISSDELDHLSL VR+KML E     L    +  +  + KKE+ECCDLQG S++ISACDARNLGDQQLEAQINDT+GH  D YSEGARLN E     
Subjt:  MNHSISSDELDHLSLEVRQKMLLEKTQRFLWAIFRLCNDLLFKKEDECCDLQGVSSVISACDARNLGDQQLEAQINDTSGHLEDCYSEGARLNIEKQISC

Query:  TMAFENPTPPEVPDWVRVESTGILPGFLADGVDNFASAGVAVTKVKNETFDDFDEDLDHVVLIERLRMLLSRRALGFMNHHVEGSSGVSSGNLIQCFLKQ
           FENPTPPEV D VRVEST IL G L  GVDNFA AGVAVTKVKNE FDDFDEDLDHV+LIERLRMLLSRRALG MN HVEG SGV SG+L+QCFLKQ
Subjt:  TMAFENPTPPEVPDWVRVESTGILPGFLADGVDNFASAGVAVTKVKNETFDDFDEDLDHVVLIERLRMLLSRRALGFMNHHVEGSSGVSSGNLIQCFLKQ

Query:  KEKFMFANGELMGIGNALHDKIGSDAPRLCIPSVICSPNTAFSGSCFSSDHSLNKSTESGNDMELKEDDKICSSEKVATELGPRLLTDHVPEVNLFNSTK
        K K MFA+ E M IGN LHDK GS APR C PSV+CSPN   SGS FSS+HSLNKSTESGNDMELKE DKICSSEKVATELG R LT+HVP+ NL +STK
Subjt:  KEKFMFANGELMGIGNALHDKIGSDAPRLCIPSVICSPNTAFSGSCFSSDHSLNKSTESGNDMELKEDDKICSSEKVATELGPRLLTDHVPEVNLFNSTK

Query:  VKDEPYDHVDGCNLYDKDTKNICSRILSIKSETIMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEPSSLMNIKHRRKRKK
        VKDEPYDH +GC++Y KD  N+ S  LSIKSET MPDEPYENKVDDM LQDRMKFFSSRK  G TS DYEHPKPSDPGCS LVSEP +  N K RRK+KK
Subjt:  VKDEPYDHVDGCNLYDKDTKNICSRILSIKSETIMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEPSSLMNIKHRRKRKK

Query:  TATNSVETALEEDAPGLLQILVGKGVEIDEIKLYGEMESDDDLHESFSEDSFGELEAVISRLFSQRHSFLKFPS-IRCTKASRASYCLACLVSLIEQTRY
        TATNS+ETALEEDAPGLLQILV KG+++DEIKLYGE ESDDDL ES SEDSF ELE VI+RLF QRHSFLKFPS IRC KASRASYCLACLVSLIEQTRY
Subjt:  TATNSVETALEEDAPGLLQILVGKGVEIDEIKLYGEMESDDDLHESFSEDSFGELEAVISRLFSQRHSFLKFPS-IRCTKASRASYCLACLVSLIEQTRY

Query:  LHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDSLPIDWQIKRLVIALKLTSCSRISLIENKPLL
        LHFR+WPVEWGWCRDLQSFIFVFERHKRIV+ERPEYGYATYFFELV+SLPI WQIKRLVIA+KLT+CSRISL+EN+PLL
Subjt:  LHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDSLPIDWQIKRLVIALKLTSCSRISLIENKPLL

TrEMBL top hitse value%identityAlignment
A0A6J1CIB2 uncharacterized protein LOC111011878 isoform X24.1e-26881.1Show/hide
Query:  MNHSISSDELDHLSLEVRQKMLLEKTQRFL----WAIFRLCNDLLFKKEDECCDLQGVSSVISACDARNLGDQQLEAQINDTSGHLEDCYSEGARLNIEK
        MNH  S DELDHLSL  RQKMLLE     L      I  LCND + K+EDECCD+QGVSS+IS  D  NLG QQLE Q+ DTSGHLED Y+E ARLN EK
Subjt:  MNHSISSDELDHLSLEVRQKMLLEKTQRFL----WAIFRLCNDLLFKKEDECCDLQGVSSVISACDARNLGDQQLEAQINDTSGHLEDCYSEGARLNIEK

Query:  QISCTMAFENPTPPEVPDWVRVESTGILPGFLADGVDNFASAGVAVTKVKNETFDDFDEDLDHVVLIERLRMLLSRRALGFMNHHVEGSSGVSSGNLIQC
        QISCTM FENPTPPEVPDWVRVESTGIL G L DGVDNF SAGVAVTKVKNE FDDF+EDLDHVV IERLRMLLSR+ALG MN HVEG SG SSG+ +QC
Subjt:  QISCTMAFENPTPPEVPDWVRVESTGILPGFLADGVDNFASAGVAVTKVKNETFDDFDEDLDHVVLIERLRMLLSRRALGFMNHHVEGSSGVSSGNLIQC

Query:  FLKQKEKFMFANGELMGIGNALHDKIGSDAPRLCIPSVICSPNTAFSGSCFSSDHSLNKSTESGNDMELKEDDKICSSEKVATELGPRLLTDHVPEVNLF
        FLKQK K MF+N EL G  N LHD+ G DAP L  PSV+CSP    SGS FSS+ SLNK TESGNDMELKEDD+IC SEKV TELG RLLT+H PE NLF
Subjt:  FLKQKEKFMFANGELMGIGNALHDKIGSDAPRLCIPSVICSPNTAFSGSCFSSDHSLNKSTESGNDMELKEDDKICSSEKVATELGPRLLTDHVPEVNLF

Query:  NSTKVKDEPYDHVDGCNLYDKDTKNICSRILSIKSETIMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEPSSLMNIKHRR
         STKVKDEPYDHVDGCNL+ KD  N+CSRILS+KSET MPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEP+SLMN+K RR
Subjt:  NSTKVKDEPYDHVDGCNLYDKDTKNICSRILSIKSETIMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEPSSLMNIKHRR

Query:  KRKKTATNSVETALEEDAPGLLQILVGKGVEIDEIKLYGEMESDDDLHESFSEDSFGELEAVISRLFSQRHSFLKFPSIRCTKASRASYCLACLVSLIEQ
        K K+TATNS+ETALEEDAPGLLQILV KGV +DEIKLYGEMESDDDL ESFSE+SFGELEAVISRLFSQR SFLKFP IRCTKASR+SYCLACLVSLIEQ
Subjt:  KRKKTATNSVETALEEDAPGLLQILVGKGVEIDEIKLYGEMESDDDLHESFSEDSFGELEAVISRLFSQRHSFLKFPSIRCTKASRASYCLACLVSLIEQ

Query:  TRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDSLPIDWQIKRLVIALKLTSCSRISLIENKPLL
        TRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELV+ LPI WQIKRLVIALKLT+CSRISL+EN+PLL
Subjt:  TRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDSLPIDWQIKRLVIALKLTSCSRISLIENKPLL

A0A6J1CJF8 uncharacterized protein LOC111011878 isoform X34.0e-24782.7Show/hide
Query:  VISACDARNLGDQQLEAQINDTSGHLEDCYSEGARLNIEKQISCTMAFENPTPPEVPDWVRVESTGILPGFLADGVDNFASAGVAVTKVKNETFDDFDED
        +IS  D  NLG QQLE Q+ DTSGHLED Y+E ARLN EKQISCTM FENPTPPEVPDWVRVESTGIL G L DGVDNF SAGVAVTKVKNE FDDF+ED
Subjt:  VISACDARNLGDQQLEAQINDTSGHLEDCYSEGARLNIEKQISCTMAFENPTPPEVPDWVRVESTGILPGFLADGVDNFASAGVAVTKVKNETFDDFDED

Query:  LDHVVLIERLRMLLSRRALGFMNHHVEGSSGVSSGNLIQCFLKQKEKFMFANGELMGIGNALHDKIGSDAPRLCIPSVICSPNTAFSGSCFSSDHSLNKS
        LDHVV IERLRMLLSR+ALG MN HVEG SG SSG+ +QCFLKQK K MF+N EL G  N LHD+ G DAP L  PSV+CSP    SGS FSS+ SLNK 
Subjt:  LDHVVLIERLRMLLSRRALGFMNHHVEGSSGVSSGNLIQCFLKQKEKFMFANGELMGIGNALHDKIGSDAPRLCIPSVICSPNTAFSGSCFSSDHSLNKS

Query:  TESGNDMELKEDDKICSSEKVATELGPRLLTDHVPEVNLFNSTKVKDEPYDHVDGCNLYDKDTKNICSRILSIKSETIMPDEPYENKVDDMRLQDRMKFF
        TESGNDMELKEDD+IC SEKV TELG RLLT+H PE NLF STKVKDEPYDHVDGCNL+ KD  N+CSRILS+KSET MPDEPYENKVDDMRLQDRMKFF
Subjt:  TESGNDMELKEDDKICSSEKVATELGPRLLTDHVPEVNLFNSTKVKDEPYDHVDGCNLYDKDTKNICSRILSIKSETIMPDEPYENKVDDMRLQDRMKFF

Query:  SSRKVFGSTSRDYEHPKPSDPGCSSLVSEPSSLMNIKHRRKRKKTATNSVETALEEDAPGLL----QILVGKGVEIDEIKLYGEMESDDDLHESFSEDSF
        SSRKVFGSTSRDYEHPKPSDPGCSSLVSEP+SLMN+K RRK K+TATNS+ETALEEDAPGLL    QILV KGV +DEIKLYGEMESDDDL ESFSE+SF
Subjt:  SSRKVFGSTSRDYEHPKPSDPGCSSLVSEPSSLMNIKHRRKRKKTATNSVETALEEDAPGLL----QILVGKGVEIDEIKLYGEMESDDDLHESFSEDSF

Query:  GELEAVISRLFSQRHSFLKFPSIRCTKASRASYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDSLPIDW
        GELEAVISRLFSQR SFLKFP IRCTKASR+SYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELV+ LPI W
Subjt:  GELEAVISRLFSQRHSFLKFPSIRCTKASRASYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDSLPIDW

Query:  QIKRLVIALKLTSCSRISLIENKPLL
        QIKRLVIALKLT+CSRISL+EN+PLL
Subjt:  QIKRLVIALKLTSCSRISLIENKPLL

A0A6J1CKR0 uncharacterized protein LOC111011878 isoform X12.3e-26680.55Show/hide
Query:  MNHSISSDELDHLSLEVRQKMLLEKTQRFL----WAIFRLCNDLLFKKEDECCDLQGVSSVISACDARNLGDQQLEAQINDTSGHLEDCYSEGARLNIEK
        MNH  S DELDHLSL  RQKMLLE     L      I  LCND + K+EDECCD+QGVSS+IS  D  NLG QQLE Q+ DTSGHLED Y+E ARLN EK
Subjt:  MNHSISSDELDHLSLEVRQKMLLEKTQRFL----WAIFRLCNDLLFKKEDECCDLQGVSSVISACDARNLGDQQLEAQINDTSGHLEDCYSEGARLNIEK

Query:  QISCTMAFENPTPPEVPDWVRVESTGILPGFLADGVDNFASAGVAVTKVKNETFDDFDEDLDHVVLIERLRMLLSRRALGFMNHHVEGSSGVSSGNLIQC
        QISCTM FENPTPPEVPDWVRVESTGIL G L DGVDNF SAGVAVTKVKNE FDDF+EDLDHVV IERLRMLLSR+ALG MN HVEG SG SSG+ +QC
Subjt:  QISCTMAFENPTPPEVPDWVRVESTGILPGFLADGVDNFASAGVAVTKVKNETFDDFDEDLDHVVLIERLRMLLSRRALGFMNHHVEGSSGVSSGNLIQC

Query:  FLKQKEKFMFANGELMGIGNALHDKIGSDAPRLCIPSVICSPNTAFSGSCFSSDHSLNKSTESGNDMELKEDDKICSSEKVATELGPRLLTDHVPEVNLF
        FLKQK K MF+N EL G  N LHD+ G DAP L  PSV+CSP    SGS FSS+ SLNK TESGNDMELKEDD+IC SEKV TELG RLLT+H PE NLF
Subjt:  FLKQKEKFMFANGELMGIGNALHDKIGSDAPRLCIPSVICSPNTAFSGSCFSSDHSLNKSTESGNDMELKEDDKICSSEKVATELGPRLLTDHVPEVNLF

Query:  NSTKVKDEPYDHVDGCNLYDKDTKNICSRILSIKSETIMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEPSSLMNIKHRR
         STKVKDEPYDHVDGCNL+ KD  N+CSRILS+KSET MPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEP+SLMN+K RR
Subjt:  NSTKVKDEPYDHVDGCNLYDKDTKNICSRILSIKSETIMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEPSSLMNIKHRR

Query:  KRKKTATNSVETALEEDAPGLL----QILVGKGVEIDEIKLYGEMESDDDLHESFSEDSFGELEAVISRLFSQRHSFLKFPSIRCTKASRASYCLACLVS
        K K+TATNS+ETALEEDAPGLL    QILV KGV +DEIKLYGEMESDDDL ESFSE+SFGELEAVISRLFSQR SFLKFP IRCTKASR+SYCLACLVS
Subjt:  KRKKTATNSVETALEEDAPGLL----QILVGKGVEIDEIKLYGEMESDDDLHESFSEDSFGELEAVISRLFSQRHSFLKFPSIRCTKASRASYCLACLVS

Query:  LIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDSLPIDWQIKRLVIALKLTSCSRISLIENKPLL
        LIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELV+ LPI WQIKRLVIALKLT+CSRISL+EN+PLL
Subjt:  LIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDSLPIDWQIKRLVIALKLTSCSRISLIENKPLL

A0A6J1FLT1 uncharacterized protein LOC111445382 isoform X14.3e-24978.07Show/hide
Query:  MNHSISSDELDHLSLEVRQKMLLEKTQRFLWAIFRLCNDLLFKKEDECCDLQGVSSVISACDARNLGDQQLEAQINDTSGHLEDCYSEGARLNIEKQISC
        MN  ISSDELDHLSL VR+KML E     L    +  +  + KKE+ECCDLQG S++ISACDARNLGDQQLEAQINDT+GH  D YSEGARLN E     
Subjt:  MNHSISSDELDHLSLEVRQKMLLEKTQRFLWAIFRLCNDLLFKKEDECCDLQGVSSVISACDARNLGDQQLEAQINDTSGHLEDCYSEGARLNIEKQISC

Query:  TMAFENPTPPEVPDWVRVESTGILPGFLADGVDNFASAGVAVTKVKNETFDDFDEDLDHVVLIERLRMLLSRRALGFMNHHVEGSSGVSSGNLIQCFLKQ
           FENPTPPEV D VRVEST IL G L  GVDNFA AGVAVTKVKNE FDDFDEDLDHV+LIERLRMLLSRRALG MN HVEG SGV SG+L+QCFLKQ
Subjt:  TMAFENPTPPEVPDWVRVESTGILPGFLADGVDNFASAGVAVTKVKNETFDDFDEDLDHVVLIERLRMLLSRRALGFMNHHVEGSSGVSSGNLIQCFLKQ

Query:  KEKFMFANGELMGIGNALHDKIGSDAPRLCIPSVICSPNTAFSGSCFSSDHSLNKSTESGNDMELKEDDKICSSEKVATELGPRLLTDHVPEVNLFNSTK
        K K MFA+ E M IGN LHDK GS APR C PSV+CSPN   SGS FSS+HSLNKSTESGNDMELKE DKICSSEKVATELG R LT+HVP+ NL +STK
Subjt:  KEKFMFANGELMGIGNALHDKIGSDAPRLCIPSVICSPNTAFSGSCFSSDHSLNKSTESGNDMELKEDDKICSSEKVATELGPRLLTDHVPEVNLFNSTK

Query:  VKDEPYDHVDGCNLYDKDTKNICSRILSIKSETIMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEPSSLMNIKHRRKRKK
        VKDEPYDH +GC++Y KD  N+ S  LSIKSET MPDEPYENKVDDM LQDRMKFFSSRK  G TS DYEHPKPSDPGCS LVSEP +  N K RRK+KK
Subjt:  VKDEPYDHVDGCNLYDKDTKNICSRILSIKSETIMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEPSSLMNIKHRRKRKK

Query:  TATNSVETALEEDAPGLLQILVGKGVEIDEIKLYGEMESDDDLHESFSEDSFGELEAVISRLFSQRHSFLKFPS-IRCTKASRASYCLACLVSLIEQTRY
        TATNS+ETALEEDAPGLLQILV KG+++DEIKLYGE ESDDDL ES SEDSF ELE VI+RLF QRHSFLKFPS IRC KASRASYCLACLVSLIEQTRY
Subjt:  TATNSVETALEEDAPGLLQILVGKGVEIDEIKLYGEMESDDDLHESFSEDSFGELEAVISRLFSQRHSFLKFPS-IRCTKASRASYCLACLVSLIEQTRY

Query:  LHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDSLPIDWQIKRLVIALKLTSCSRISLIENKPLL
        LHFR+WPVEWGWCRDLQSFIFVFERHKRIV+ERPEYGYATYFFELV+SLPI WQIKRLVIA+KLT+CSRISL+EN+PLL
Subjt:  LHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDSLPIDWQIKRLVIALKLTSCSRISLIENKPLL

A0A6J1JZL8 uncharacterized protein LOC111489311 isoform X11.9e-24176.17Show/hide
Query:  MNHSISSDELDHLSLEVRQKMLLEKTQRFLWAIFRLCNDLLFKKEDECCDLQGVSSVISACDARNLGDQQLEAQINDTSGHLEDCYSEGARLNIEKQISC
        MN  ISSDELDHLSL VR+KML E     L    +  +  + KKE+ECCDLQG S++IS    RNLGDQQLEA+INDT+GHL D YSEGARLN E     
Subjt:  MNHSISSDELDHLSLEVRQKMLLEKTQRFLWAIFRLCNDLLFKKEDECCDLQGVSSVISACDARNLGDQQLEAQINDTSGHLEDCYSEGARLNIEKQISC

Query:  TMAFENPTPPEVPDWVRVESTGILPGFLADGVDNFASAGVAVTKVKNETFDDFDEDLDHVVLIERLRMLLSRRALGFMNHHVEGSSGVSSGNLIQCFLKQ
           FENPTPPEV D VRVEST IL G LA  VDNFA AGVAVTKVKNE FDDFDEDLDHV+LIERLRMLLSRR+LG MN HVEG SGV SG+L+QCFLKQ
Subjt:  TMAFENPTPPEVPDWVRVESTGILPGFLADGVDNFASAGVAVTKVKNETFDDFDEDLDHVVLIERLRMLLSRRALGFMNHHVEGSSGVSSGNLIQCFLKQ

Query:  KEKFMFANGELMGIGNALHDKIGSDAPRLCIPSVICSPNTAFSGSCFSSDHSLNKSTESGNDMELKEDDKICSSEKVATELGPRLLTDHVPEVNLFNSTK
        K K MFA+ E M IGN LHDK  S APR C PSV+CSPN   SGS FSS+HSLNKSTESGNDMELKE DKI SS+KVATELG R LT+HVP+ NL +STK
Subjt:  KEKFMFANGELMGIGNALHDKIGSDAPRLCIPSVICSPNTAFSGSCFSSDHSLNKSTESGNDMELKEDDKICSSEKVATELGPRLLTDHVPEVNLFNSTK

Query:  VKDEPYDHVDGCNLYDKDTKNICSRILSIKSETIMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEPSSLMNIKHRRKRKK
        VKDEPYDH +GC++Y KD  N+    LS+KSET MPDEP+ENKVDDM LQDRMKFFSSRK FG TS DYEHPKPSDPGCS LVSEP +  N K RRK KK
Subjt:  VKDEPYDHVDGCNLYDKDTKNICSRILSIKSETIMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEPSSLMNIKHRRKRKK

Query:  TATNSVETALEEDAPGLLQILVGKGVEIDEIKLYGEMESDDDLHESFSEDSFGELEAVISRLFSQRHSFLKFPS-IRCTKASRASYCLACLVSLIEQTRY
        TATNS+ETALEEDAPGLLQILV KG+++DEIKLYGE ESDDDL ES SEDSF ELE VI+RLF QRHSFLKFPS IRC KASRASYCLACLVSLIEQTRY
Subjt:  TATNSVETALEEDAPGLLQILVGKGVEIDEIKLYGEMESDDDLHESFSEDSFGELEAVISRLFSQRHSFLKFPS-IRCTKASRASYCLACLVSLIEQTRY

Query:  LHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDSLPIDWQIKRLVIALKLTSCSRISLIENKPLL
        LHFR+WPVEWGWCRDLQSFIFVF+RHKRIV+ERPEYGYATYFFELV+SLPI WQIKRLVIA+KLT+CSRISL+EN+PLL
Subjt:  LHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDSLPIDWQIKRLVIALKLTSCSRISLIENKPLL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G16610.1 unknown protein7.4e-6855.78Show/hide
Query:  IKSETIMPDEPYENKVDDMRLQDRMKFFSSRKVFGS-TSRDYEHPKPSDPGCSSLVSEPSSLMNIKHRRKRKKTATNSVETALEEDAPGLLQILVGKGVE
        +K+E     E  E+ +D M+L DR+K    R   GS    D   P      C+S   E      +    KRKKTAT+S+ETALEEDAPGLLQ+L+ +GV 
Subjt:  IKSETIMPDEPYENKVDDMRLQDRMKFFSSRKVFGS-TSRDYEHPKPSDPGCSSLVSEPSSLMNIKHRRKRKKTATNSVETALEEDAPGLLQILVGKGVE

Query:  IDEIKLYGEMESDDDLHESFSEDSFGELEAVISRLFSQRHSFLKFPSIRCTKASRASYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKR
        +DE++LYG    D    +S   +SF ELE VIS+LF +R +  K  +   +KASR SYCL CL SLIEQ RYL FR WPVEWGWCRDLQSFIFVFERH R
Subjt:  IDEIKLYGEMESDDDLHESFSEDSFGELEAVISRLFSQRHSFLKFPSIRCTKASRASYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKR

Query:  IVLERPEYGYATYFFELVDSLPIDWQIKRLVIALKLTSCSRISLIENKPLL
        IV+ERPEYGYATYFFEL ++  I WQ+KRLV+A+KL SC R  LIENKPLL
Subjt:  IVLERPEYGYATYFFELVDSLPIDWQIKRLVIALKLTSCSRISLIENKPLL

AT5G16610.2 unknown protein3.1e-7436.01Show/hide
Query:  ELDHLSLEVRQKMLL--EKTQRFLWAIFRLCN-DLLFKKEDECCDLQGVSSVISACDARNLGDQQLEAQINDTSGHLEDCYSEGARLNIEKQISCTMAFE
        E DHL L  R+ +LL  E+  + + A     N D + K+E+E C  +    V+S CDA      ++   +N                     I C+   +
Subjt:  ELDHLSLEVRQKMLL--EKTQRFLWAIFRLCN-DLLFKKEDECCDLQGVSSVISACDARNLGDQQLEAQINDTSGHLEDCYSEGARLNIEKQISCTMAFE

Query:  NPTPPEVPDWVRVESTGILPGFLADGVDNFASAGVAVTKVKNETFDDFDEDLDHVVLIERLRMLLSRRALGFMNHHVEGSSGVSSGNLIQCFLKQKEKFM
                D   +     + G  ++ V+NF   G        ET  +  +DL+H+ L ER +MLL R A+     +VE ++       +    K K +  
Subjt:  NPTPPEVPDWVRVESTGILPGFLADGVDNFASAGVAVTKVKNETFDDFDEDLDHVVLIERLRMLLSRRALGFMNHHVEGSSGVSSGNLIQCFLKQKEKFM

Query:  FANGELMGIGNALHDKIGSDAPRLCIPSVICSPNTAFSGSCFSSDHSLN--KSTESGNDMELKEDDKICSSEKVATELGPRLLTDHVP-EVNLFNSTKVK
          NG     G      +      LC    ICS + +  G    SD  ++  +S +   +  L E   + SS K   +   R+  + +P   N   ST+VK
Subjt:  FANGELMGIGNALHDKIGSDAPRLCIPSVICSPNTAFSGSCFSSDHSLN--KSTESGNDMELKEDDKICSSEKVATELGPRLLTDHVP-EVNLFNSTKVK

Query:  DEPYDHVDGCNLYDKDTKN-ICSRILSIKSETIMPDEPY-ENKVDDMRLQDRMKF-------FSSRKVFGSTSRD-----YEHPKPSD---------PGC
         +P   +  C + D D KN + S+ + +K E     E   EN++D ++L  R+         F   K    T+ +      +H K  D          G 
Subjt:  DEPYDHVDGCNLYDKDTKN-ICSRILSIKSETIMPDEPY-ENKVDDMRLQDRMKF-------FSSRKVFGSTSRD-----YEHPKPSD---------PGC

Query:  SSLVSEPSSLMN-------IKHRR-----KRKKTATNSVETALEEDAPGLLQILVGKGVEIDEIKLYGEMESDDDLHESFSEDSFGELEAVISRLFSQRH
           ++ PSS  +       +K  R     KRKKTAT+S+ETALEEDAPGLLQ+L+ +GV +DE++LYG    D    +S   +SF ELE VIS+LF +R 
Subjt:  SSLVSEPSSLMN-------IKHRR-----KRKKTATNSVETALEEDAPGLLQILVGKGVEIDEIKLYGEMESDDDLHESFSEDSFGELEAVISRLFSQRH

Query:  SFLKFPSIRCTKASRASYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDSLPIDWQIKRLVIALKLTSCS
        +  K  +   +KASR SYCL CL SLIEQ RYL FR WPVEWGWCRDLQSFIFVFERH RIV+ERPEYGYATYFFEL ++  I WQ+KRLV+A+KL SC 
Subjt:  SFLKFPSIRCTKASRASYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDSLPIDWQIKRLVIALKLTSCS

Query:  RISLIENKPLL
        R  LIENKPLL
Subjt:  RISLIENKPLL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGAGAGTTCGCATAAAAAAAATAATAATAATAATAATTAGCACCTGCAAGCGTCCGGAGGCCGGTGGTCGGCCTGGGATTCTTCAGCGCAGGGCGCTGTCCAGGAG
TCAGCTCGTCACGCCAGGGAAGAACCTTGCTCCTCGGTCTGAAATTGGATTGAAGACTCTCCTTCGTGGTTGGAGACACAGAGCACGCACTGGCTCGGACGTCGGAATGA
ATCATAGTATCAGTTCTGATGAGCTCGACCATCTTTCGTTGGAGGTGAGGCAGAAAATGCTACTGGAAAAGACTCAACGTTTTCTTTGGGCCATATTCAGGCTTTGCAAC
GATTTGCTTTTCAAGAAAGAAGATGAATGCTGTGATTTGCAGGGTGTTTCCTCTGTGATATCTGCTTGTGATGCCAGAAATCTGGGAGATCAACAACTGGAAGCTCAAAT
TAACGATACTAGTGGTCATCTTGAGGACTGCTACAGTGAGGGGGCTCGATTAAATATAGAGAAGCAGATATCATGCACTATGGCATTTGAGAATCCAACACCACCTGAGG
TTCCTGATTGGGTAAGGGTGGAATCTACAGGCATCTTACCAGGTTTCTTGGCAGATGGTGTAGATAACTTTGCTTCCGCTGGTGTGGCTGTAACTAAGGTTAAAAATGAG
ACGTTTGATGACTTCGATGAAGATCTTGATCATGTTGTATTGATAGAGCGACTAAGGATGCTGCTATCAAGGAGAGCGTTGGGTTTCATGAATCACCATGTGGAGGGTAG
TTCTGGTGTGTCATCAGGAAATCTTATACAATGCTTTTTGAAACAGAAGGAAAAGTTTATGTTTGCTAATGGGGAGCTGATGGGAATTGGCAATGCGTTGCATGATAAAA
TTGGAAGTGATGCTCCTCGTCTTTGCATCCCTTCAGTAATTTGTTCACCTAATACAGCCTTTTCTGGATCCTGTTTCTCAAGCGATCATTCTTTAAATAAATCAACTGAA
TCAGGCAATGACATGGAACTTAAAGAAGATGATAAGATCTGTTCATCTGAGAAGGTGGCCACAGAATTAGGCCCACGGCTTTTGACTGATCATGTCCCTGAAGTAAATTT
ATTTAATTCCACAAAAGTGAAGGATGAACCTTATGATCATGTTGACGGCTGCAACTTATATGATAAGGATACGAAGAACATCTGCAGCAGAATTTTGTCAATAAAGAGTG
AAACAATCATGCCCGATGAACCTTATGAAAACAAGGTAGATGATATGCGACTGCAAGATCGAATGAAGTTTTTCTCATCTCGAAAGGTTTTTGGTTCTACGTCAAGGGAT
TACGAGCATCCAAAACCTTCTGACCCTGGATGTAGTTCTCTTGTTTCAGAACCTTCTAGTTTAATGAACATTAAACATCGACGCAAGCGGAAAAAAACTGCCACGAATTC
AGTTGAAACAGCACTCGAGGAAGATGCCCCTGGCCTTCTGCAGATACTAGTTGGCAAAGGTGTAGAAATTGATGAAATTAAGCTTTATGGAGAGATGGAAAGTGATGATG
ATCTGCATGAGTCATTTAGTGAAGACAGCTTTGGTGAGCTTGAAGCTGTGATATCGAGGCTGTTTTCTCAACGCCATTCCTTTTTGAAGTTTCCTTCTATAAGATGCACA
AAAGCTTCTAGAGCAAGCTATTGCTTAGCTTGTCTAGTTTCACTTATTGAGCAGACAAGATATCTTCATTTCCGGAGTTGGCCTGTCGAATGGGGGTGGTGCCGTGATCT
TCAGTCTTTTATATTTGTATTCGAGAGACATAAAAGAATAGTGCTGGAACGTCCCGAGTATGGCTATGCTACATATTTTTTTGAGCTTGTCGATTCCTTACCTATCGACT
GGCAGATAAAGCGGTTGGTGATTGCTTTGAAGCTTACTAGTTGTAGCAGAATTTCACTAATTGAGAACAAACCATTGTTG
mRNA sequenceShow/hide mRNA sequence
ATGCAGAGAGTTCGCATAAAAAAAATAATAATAATAATAATTAGCACCTGCAAGCGTCCGGAGGCCGGTGGTCGGCCTGGGATTCTTCAGCGCAGGGCGCTGTCCAGGAG
TCAGCTCGTCACGCCAGGGAAGAACCTTGCTCCTCGGTCTGAAATTGGATTGAAGACTCTCCTTCGTGGTTGGAGACACAGAGCACGCACTGGCTCGGACGTCGGAATGA
ATCATAGTATCAGTTCTGATGAGCTCGACCATCTTTCGTTGGAGGTGAGGCAGAAAATGCTACTGGAAAAGACTCAACGTTTTCTTTGGGCCATATTCAGGCTTTGCAAC
GATTTGCTTTTCAAGAAAGAAGATGAATGCTGTGATTTGCAGGGTGTTTCCTCTGTGATATCTGCTTGTGATGCCAGAAATCTGGGAGATCAACAACTGGAAGCTCAAAT
TAACGATACTAGTGGTCATCTTGAGGACTGCTACAGTGAGGGGGCTCGATTAAATATAGAGAAGCAGATATCATGCACTATGGCATTTGAGAATCCAACACCACCTGAGG
TTCCTGATTGGGTAAGGGTGGAATCTACAGGCATCTTACCAGGTTTCTTGGCAGATGGTGTAGATAACTTTGCTTCCGCTGGTGTGGCTGTAACTAAGGTTAAAAATGAG
ACGTTTGATGACTTCGATGAAGATCTTGATCATGTTGTATTGATAGAGCGACTAAGGATGCTGCTATCAAGGAGAGCGTTGGGTTTCATGAATCACCATGTGGAGGGTAG
TTCTGGTGTGTCATCAGGAAATCTTATACAATGCTTTTTGAAACAGAAGGAAAAGTTTATGTTTGCTAATGGGGAGCTGATGGGAATTGGCAATGCGTTGCATGATAAAA
TTGGAAGTGATGCTCCTCGTCTTTGCATCCCTTCAGTAATTTGTTCACCTAATACAGCCTTTTCTGGATCCTGTTTCTCAAGCGATCATTCTTTAAATAAATCAACTGAA
TCAGGCAATGACATGGAACTTAAAGAAGATGATAAGATCTGTTCATCTGAGAAGGTGGCCACAGAATTAGGCCCACGGCTTTTGACTGATCATGTCCCTGAAGTAAATTT
ATTTAATTCCACAAAAGTGAAGGATGAACCTTATGATCATGTTGACGGCTGCAACTTATATGATAAGGATACGAAGAACATCTGCAGCAGAATTTTGTCAATAAAGAGTG
AAACAATCATGCCCGATGAACCTTATGAAAACAAGGTAGATGATATGCGACTGCAAGATCGAATGAAGTTTTTCTCATCTCGAAAGGTTTTTGGTTCTACGTCAAGGGAT
TACGAGCATCCAAAACCTTCTGACCCTGGATGTAGTTCTCTTGTTTCAGAACCTTCTAGTTTAATGAACATTAAACATCGACGCAAGCGGAAAAAAACTGCCACGAATTC
AGTTGAAACAGCACTCGAGGAAGATGCCCCTGGCCTTCTGCAGATACTAGTTGGCAAAGGTGTAGAAATTGATGAAATTAAGCTTTATGGAGAGATGGAAAGTGATGATG
ATCTGCATGAGTCATTTAGTGAAGACAGCTTTGGTGAGCTTGAAGCTGTGATATCGAGGCTGTTTTCTCAACGCCATTCCTTTTTGAAGTTTCCTTCTATAAGATGCACA
AAAGCTTCTAGAGCAAGCTATTGCTTAGCTTGTCTAGTTTCACTTATTGAGCAGACAAGATATCTTCATTTCCGGAGTTGGCCTGTCGAATGGGGGTGGTGCCGTGATCT
TCAGTCTTTTATATTTGTATTCGAGAGACATAAAAGAATAGTGCTGGAACGTCCCGAGTATGGCTATGCTACATATTTTTTTGAGCTTGTCGATTCCTTACCTATCGACT
GGCAGATAAAGCGGTTGGTGATTGCTTTGAAGCTTACTAGTTGTAGCAGAATTTCACTAATTGAGAACAAACCATTGTTG
Protein sequenceShow/hide protein sequence
MQRVRIKKIIIIIISTCKRPEAGGRPGILQRRALSRSQLVTPGKNLAPRSEIGLKTLLRGWRHRARTGSDVGMNHSISSDELDHLSLEVRQKMLLEKTQRFLWAIFRLCN
DLLFKKEDECCDLQGVSSVISACDARNLGDQQLEAQINDTSGHLEDCYSEGARLNIEKQISCTMAFENPTPPEVPDWVRVESTGILPGFLADGVDNFASAGVAVTKVKNE
TFDDFDEDLDHVVLIERLRMLLSRRALGFMNHHVEGSSGVSSGNLIQCFLKQKEKFMFANGELMGIGNALHDKIGSDAPRLCIPSVICSPNTAFSGSCFSSDHSLNKSTE
SGNDMELKEDDKICSSEKVATELGPRLLTDHVPEVNLFNSTKVKDEPYDHVDGCNLYDKDTKNICSRILSIKSETIMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRD
YEHPKPSDPGCSSLVSEPSSLMNIKHRRKRKKTATNSVETALEEDAPGLLQILVGKGVEIDEIKLYGEMESDDDLHESFSEDSFGELEAVISRLFSQRHSFLKFPSIRCT
KASRASYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDSLPIDWQIKRLVIALKLTSCSRISLIENKPLL