; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS013803 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS013803
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionTransmembrane protein
Genome locationscaffold541:101272..102789
RNA-Seq ExpressionMS013803
SyntenyMS013803
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0045443.1 uncharacterized protein E6C27_scaffold294G00460 [Cucumis melo var. makuwa]1.8e-21275.95Show/hide
Query:  MGLPLTGKSKSTAGENWGMGFLLVFFSEDSPSAIADQKKLF---------------------SSSSVRRSNYNLLTRAQSVISVCALLVFVSLLLFTLST
        MGL LTGKSKSTAGENWGMG LLVFFSEDSPS IAD   LF                     SS+S+RRSNYNLLT+AQS ISVCALLVF+SLLLFTLST
Subjt:  MGLPLTGKSKSTAGENWGMGFLLVFFSEDSPSAIADQKKLF---------------------SSSSVRRSNYNLLTRAQSVISVCALLVFVSLLLFTLST

Query:  FEPAIKMNLTPPRRFLSQKSAPIEVRTPSGKYVYRWNWLGKMWKQKPATAKRTDA-AAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRL
        FEP IKMNLTPPRR L+QKS PI+VR P G    RWNW GKMWKQKPA  K T   A   VALQ MGTLYMRGTRAM DL VVHV+EDVGEED RLFLRL
Subjt:  FEPAIKMNLTPPRRFLSQKSAPIEVRTPSGKYVYRWNWLGKMWKQKPATAKRTDA-AAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRL

Query:  FHRSGVTAKSDSVFIFPSAAFSSRFGSVIREENESFLKLVRRYRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTRLSY
        FHRSGVTAKSDSVF+FPS AFS RFG +IREEN+SFLKL+ RYRNLN TASR AA GFDVT+  KSKEKKETEEP+WGKRVKR +N S+GGEDELTRLSY
Subjt:  FHRSGVTAKSDSVFIFPSAAFSSRFGSVIREENESFLKLVRRYRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTRLSY

Query:  GSVVSFDAAEIDPENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHIMLIDAKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKS
        GSVVSFDA EIDPENSLSGFSDHIPMSLRRW+CYPMLLGRVRRNFKH+MLIDAKSSLLLGDPL RVR+K TESVI FT            KHGKKNS+KS
Subjt:  GSVVSFDAAEIDPENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHIMLIDAKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKS

Query:  NSRRHLVNPAVVIGGARGVRRLSNAALVEIARTLMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVITAAESIPEASSLAGFDSDSVGSLSAPENAIFQRGN
        NS  H+VNP++VIGGARG+RR+SNAA+VEI R LMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVI A ESIPEASS  G + +SVG LSAPE  +F +GN
Subjt:  NSRRHLVNPAVVIGGARGVRRLSNAALVEIARTLMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVITAAESIPEASSLAGFDSDSVGSLSAPENAIFQRGN

Query:  NGNFREINSIIMKKICSSEMDSSVYSGC
        NGN  EINS+IMKKICSSE+DSSVY+ C
Subjt:  NGNFREINSIIMKKICSSEMDSSVYSGC

TYK19793.1 uncharacterized protein E5676_scaffold307G00200 [Cucumis melo var. makuwa]7.2e-21477.26Show/hide
Query:  MGLPLTGKSKSTAGENWGMGFLLVFFSEDSPSAIADQKKLF------------SSSSVRRSNYNLLTRAQSVISVCALLVFVSLLLFTLSTFEPAIKMNL
        MGL LTGKSKSTAGENWGMG LLVFFSEDSPS IAD   LF            SS+S+RRSNYNLLT+AQS ISVCALLVF+SLLLFTLSTFEP IKMNL
Subjt:  MGLPLTGKSKSTAGENWGMGFLLVFFSEDSPSAIADQKKLF------------SSSSVRRSNYNLLTRAQSVISVCALLVFVSLLLFTLSTFEPAIKMNL

Query:  TPPRRFLSQKSAPIEVRTPSGKYVYRWNWLGKMWKQKPATAKRTDA-AAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRLFHRSGVTAK
        TPPRR L+QKS PI+VR P G    RWNW GKMWKQKPA  K T   A   VALQ MGTLYMRGTRAM DL VVHV+EDVGEED RLFLRLFHRSGVTAK
Subjt:  TPPRRFLSQKSAPIEVRTPSGKYVYRWNWLGKMWKQKPATAKRTDA-AAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRLFHRSGVTAK

Query:  SDSVFIFPSAAFSSRFGSVIREENESFLKLVRRYRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTRLSYGSVVSFDAA
        SDSVF+FPS AFS RFG +IREEN+SFLKL+ RYRNLN TASR AA GFDVT+  KSKEKKETEEP+WGKRVKR +N S+GGEDELTRLSYGSVVSFDA 
Subjt:  SDSVFIFPSAAFSSRFGSVIREENESFLKLVRRYRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTRLSYGSVVSFDAA

Query:  EIDPENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHIMLIDAKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKSNSRRHLVNP
        EIDPENSLSGFSDHIPMSLRRW+CYPMLLGRVRRNFKH+MLIDAKSSLLLGDPL RVR+K TESVI FT            KHGKKNS+KSNS  H+VNP
Subjt:  EIDPENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHIMLIDAKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKSNSRRHLVNP

Query:  AVVIGGARGVRRLSNAALVEIARTLMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVITAAESIPEASSLAGFDSDSVGSLSAPENAIFQRGNNGNFREINS
        ++VIGGARG+RR+SNAA+VEI R LMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVI A+ESIPEASS  G + +SVG LSAPE  +F +GNNGN  EINS
Subjt:  AVVIGGARGVRRLSNAALVEIARTLMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVITAAESIPEASSLAGFDSDSVGSLSAPENAIFQRGNNGNFREINS

Query:  IIMKKICSSEMDSSVYSGC
        +IMKKICSSE+DSSVY+ C
Subjt:  IIMKKICSSEMDSSVYSGC

XP_004151150.1 uncharacterized protein LOC101208268 [Cucumis sativus]8.8e-21277.82Show/hide
Query:  MGLPLTGKSKSTAGENWGMGFLLVFFSEDSPSAIADQKKLF-------SSSSVRRSNYNLLTRAQSVISVCALLVFVSLLLFTLSTFEPAIKMNLTPPRR
        MGL LTGKSKSTAG+NWGMG LLVFFSEDSPS IAD K LF       SS+S RRSNYNLLT+AQS ISVCALLVF+SLLLFTLSTFEP IKMNLTPPRR
Subjt:  MGLPLTGKSKSTAGENWGMGFLLVFFSEDSPSAIADQKKLF-------SSSSVRRSNYNLLTRAQSVISVCALLVFVSLLLFTLSTFEPAIKMNLTPPRR

Query:  FLSQKSAPIEVRTPSGKYVYRWNWLGKMWKQKPATAKRTDA-AAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRLFHRSGVTAKSDSVF
         L+QKS PIE+R P G    RWNW  +MWKQKPA  K T   A   VALQ MGTLYMRGTRAM DL VVHV+ED+GEED RLFLRLFHRSGVTAKSDSVF
Subjt:  FLSQKSAPIEVRTPSGKYVYRWNWLGKMWKQKPATAKRTDA-AAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRLFHRSGVTAKSDSVF

Query:  IFPSAAFSSRFGSVIREENESFLKLVRRYRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTRLSYGSVVSFDAAEIDPE
        +FPS AFS RFG +IR+ENESFLKL+ RYRNLN T SR AA GFDVTQ  KSKEKKETEEP+WGKRVKR+ N S+GGEDELTRLSYGSVVSFDA EIDPE
Subjt:  IFPSAAFSSRFGSVIREENESFLKLVRRYRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTRLSYGSVVSFDAAEIDPE

Query:  NSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHIMLIDAKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKSNSRRHLVNPAVVIG
        NSLSGFSDHIPMSLRRW+CYPMLLGRVRRNFKH+MLIDAKSSLLLGDPL RVR+K TESVI FT            KH KKNS+KSNS  HLVNP++VIG
Subjt:  NSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHIMLIDAKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKSNSRRHLVNPAVVIG

Query:  GARGVRRLSNAALVEIARTLMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVITAAESIPEASSLAGFDSDSVGSLSAPENAIFQRGNNGNFREINSIIMKK
        GARG+RRLSNAA VEI R LMQHKKKNSVSDSGVLS LVNSEFLLKNVKVI A+ESIPEASSL G + +SVGSLSAPE  +F +GNNGN  EINS+IMKK
Subjt:  GARGVRRLSNAALVEIARTLMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVITAAESIPEASSLAGFDSDSVGSLSAPENAIFQRGNNGNFREINSIIMKK

Query:  ICSSEMDSSVYSGC
        ICSSE+DSSVY+ C
Subjt:  ICSSEMDSSVYSGC

XP_008460778.1 PREDICTED: uncharacterized protein LOC103499540 [Cucumis melo]6.1e-21376.09Show/hide
Query:  MGLPLTGKSKSTAGENWGMGFLLVFFSEDSPSAIADQKKLF--------------------SSSSVRRSNYNLLTRAQSVISVCALLVFVSLLLFTLSTF
        MGL LTGKSKSTAGENWGMG LLVFFSEDSPS IAD   LF                    SS+S+RRSNYNLLT+AQS ISVCALLVF+SLLLFTLSTF
Subjt:  MGLPLTGKSKSTAGENWGMGFLLVFFSEDSPSAIADQKKLF--------------------SSSSVRRSNYNLLTRAQSVISVCALLVFVSLLLFTLSTF

Query:  EPAIKMNLTPPRRFLSQKSAPIEVRTPSGKYVYRWNWLGKMWKQKPATAKRTDA-AAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRLF
        EP IKMNLTPPRR L+QKS PI+VR P G    RWNW GKMWKQKPA  K T   A   VALQ MGTLYMRGTRAM DL VVHV+EDVGEED RLFLRLF
Subjt:  EPAIKMNLTPPRRFLSQKSAPIEVRTPSGKYVYRWNWLGKMWKQKPATAKRTDA-AAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRLF

Query:  HRSGVTAKSDSVFIFPSAAFSSRFGSVIREENESFLKLVRRYRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTRLSYG
        HRSGVTAKSDSVF+FPS AFS RFG +IREEN+SFLKL+ RYRNLN TASR AA GFDVT+  KSKEKKETEEP+WGKRVKR +N S+GGEDELTRLSYG
Subjt:  HRSGVTAKSDSVFIFPSAAFSSRFGSVIREENESFLKLVRRYRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTRLSYG

Query:  SVVSFDAAEIDPENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHIMLIDAKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKSN
        SVVSFDA EIDPENSLSGFSDHIPMSLRRW+CYPMLLGRVRRNFKH+MLIDAKSSLLLGDPL RVR+K TESVI FT            KHGKKNS+KSN
Subjt:  SVVSFDAAEIDPENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHIMLIDAKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKSN

Query:  SRRHLVNPAVVIGGARGVRRLSNAALVEIARTLMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVITAAESIPEASSLAGFDSDSVGSLSAPENAIFQRGNN
        S  H+VNP++VIGGARG+RR+SNAA+VEI R LMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVI A+ESIPEASS  G + +SVG LSAPE  +F +GNN
Subjt:  SRRHLVNPAVVIGGARGVRRLSNAALVEIARTLMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVITAAESIPEASSLAGFDSDSVGSLSAPENAIFQRGNN

Query:  GNFREINSIIMKKICSSEMDSSVYSGC
        GN  EINS+IMKKICSSE+DSSVY+ C
Subjt:  GNFREINSIIMKKICSSEMDSSVYSGC

XP_038883664.1 uncharacterized protein LOC120074578 [Benincasa hispida]4.7e-22179.77Show/hide
Query:  MGLPLTGKSKSTAGENWGMGFLLVFFSEDSPSAIADQKKLFSSSSV--------RRSNYNLLTRAQSVISVCALLVFVSLLLFTLSTFEPAIKMNLTPPR
        MGL LTGKSKS+AGENWGMG LLVFFSEDS SAIADQKKLFSSSS         RRSNYNLL +AQS ISVCALLVFVSLLLFTLSTFEPAIKMNLTPPR
Subjt:  MGLPLTGKSKSTAGENWGMGFLLVFFSEDSPSAIADQKKLFSSSSV--------RRSNYNLLTRAQSVISVCALLVFVSLLLFTLSTFEPAIKMNLTPPR

Query:  RFLSQKSAPIEVRTPSGKYVYRWNWLGKMWKQKPATAKRTDAAAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRLFHRSGVTAKSDSVF
        R LSQKS PIEVRTPS     +WNW GKMWKQKPA  K T+ A    ALQ MGTLYMRGTRAM DL VVHV+EDVGEEDLRLFLRLFHRSGVTAKSDSVF
Subjt:  RFLSQKSAPIEVRTPSGKYVYRWNWLGKMWKQKPATAKRTDAAAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRLFHRSGVTAKSDSVF

Query:  IFPSAAFSSRFGSVIREENESFLKLVRRYRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTRLSYGSVVSFDAAEIDPE
        +FPS   S RFG +IREENESFLKL+ +YRNLN TASR AA GFDVTQFVK+KEKKETEEP+WGKRVKR++N+S+G  DELTRLSYGSVV FDAAEIDPE
Subjt:  IFPSAAFSSRFGSVIREENESFLKLVRRYRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTRLSYGSVVSFDAAEIDPE

Query:  NSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHIMLIDAKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKSNSRRHLVNPAVVIG
        NSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKH+ML+DAK+SL+LGDPL RVR+K TESVILFT            KH KKNS++SN+  HLVNPA+V+G
Subjt:  NSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHIMLIDAKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKSNSRRHLVNPAVVIG

Query:  GARGVRRLSNAALVEIARTLMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVITAAESIPEASSLAGFDSDSVGSLSAPENAIFQRGNNGNFREINSIIMKK
        GARG+RRLSNAA+VEIAR LMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVIT+ ESIPE SSLAG + DSVGS SAPE  +FQRGNNGN REINS+IMKK
Subjt:  GARGVRRLSNAALVEIARTLMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVITAAESIPEASSLAGFDSDSVGSLSAPENAIFQRGNNGNFREINSIIMKK

Query:  ICSSEMDSSVYSGC
        ICSSE+DSSVYS C
Subjt:  ICSSEMDSSVYSGC

TrEMBL top hitse value%identityAlignment
A0A0A0KWA6 Uncharacterized protein4.3e-21277.82Show/hide
Query:  MGLPLTGKSKSTAGENWGMGFLLVFFSEDSPSAIADQKKLF-------SSSSVRRSNYNLLTRAQSVISVCALLVFVSLLLFTLSTFEPAIKMNLTPPRR
        MGL LTGKSKSTAG+NWGMG LLVFFSEDSPS IAD K LF       SS+S RRSNYNLLT+AQS ISVCALLVF+SLLLFTLSTFEP IKMNLTPPRR
Subjt:  MGLPLTGKSKSTAGENWGMGFLLVFFSEDSPSAIADQKKLF-------SSSSVRRSNYNLLTRAQSVISVCALLVFVSLLLFTLSTFEPAIKMNLTPPRR

Query:  FLSQKSAPIEVRTPSGKYVYRWNWLGKMWKQKPATAKRTDA-AAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRLFHRSGVTAKSDSVF
         L+QKS PIE+R P G    RWNW  +MWKQKPA  K T   A   VALQ MGTLYMRGTRAM DL VVHV+ED+GEED RLFLRLFHRSGVTAKSDSVF
Subjt:  FLSQKSAPIEVRTPSGKYVYRWNWLGKMWKQKPATAKRTDA-AAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRLFHRSGVTAKSDSVF

Query:  IFPSAAFSSRFGSVIREENESFLKLVRRYRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTRLSYGSVVSFDAAEIDPE
        +FPS AFS RFG +IR+ENESFLKL+ RYRNLN T SR AA GFDVTQ  KSKEKKETEEP+WGKRVKR+ N S+GGEDELTRLSYGSVVSFDA EIDPE
Subjt:  IFPSAAFSSRFGSVIREENESFLKLVRRYRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTRLSYGSVVSFDAAEIDPE

Query:  NSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHIMLIDAKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKSNSRRHLVNPAVVIG
        NSLSGFSDHIPMSLRRW+CYPMLLGRVRRNFKH+MLIDAKSSLLLGDPL RVR+K TESVI FT            KH KKNS+KSNS  HLVNP++VIG
Subjt:  NSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHIMLIDAKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKSNSRRHLVNPAVVIG

Query:  GARGVRRLSNAALVEIARTLMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVITAAESIPEASSLAGFDSDSVGSLSAPENAIFQRGNNGNFREINSIIMKK
        GARG+RRLSNAA VEI R LMQHKKKNSVSDSGVLS LVNSEFLLKNVKVI A+ESIPEASSL G + +SVGSLSAPE  +F +GNNGN  EINS+IMKK
Subjt:  GARGVRRLSNAALVEIARTLMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVITAAESIPEASSLAGFDSDSVGSLSAPENAIFQRGNNGNFREINSIIMKK

Query:  ICSSEMDSSVYSGC
        ICSSE+DSSVY+ C
Subjt:  ICSSEMDSSVYSGC

A0A1S3CD81 uncharacterized protein LOC1034995402.9e-21376.09Show/hide
Query:  MGLPLTGKSKSTAGENWGMGFLLVFFSEDSPSAIADQKKLF--------------------SSSSVRRSNYNLLTRAQSVISVCALLVFVSLLLFTLSTF
        MGL LTGKSKSTAGENWGMG LLVFFSEDSPS IAD   LF                    SS+S+RRSNYNLLT+AQS ISVCALLVF+SLLLFTLSTF
Subjt:  MGLPLTGKSKSTAGENWGMGFLLVFFSEDSPSAIADQKKLF--------------------SSSSVRRSNYNLLTRAQSVISVCALLVFVSLLLFTLSTF

Query:  EPAIKMNLTPPRRFLSQKSAPIEVRTPSGKYVYRWNWLGKMWKQKPATAKRTDA-AAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRLF
        EP IKMNLTPPRR L+QKS PI+VR P G    RWNW GKMWKQKPA  K T   A   VALQ MGTLYMRGTRAM DL VVHV+EDVGEED RLFLRLF
Subjt:  EPAIKMNLTPPRRFLSQKSAPIEVRTPSGKYVYRWNWLGKMWKQKPATAKRTDA-AAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRLF

Query:  HRSGVTAKSDSVFIFPSAAFSSRFGSVIREENESFLKLVRRYRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTRLSYG
        HRSGVTAKSDSVF+FPS AFS RFG +IREEN+SFLKL+ RYRNLN TASR AA GFDVT+  KSKEKKETEEP+WGKRVKR +N S+GGEDELTRLSYG
Subjt:  HRSGVTAKSDSVFIFPSAAFSSRFGSVIREENESFLKLVRRYRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTRLSYG

Query:  SVVSFDAAEIDPENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHIMLIDAKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKSN
        SVVSFDA EIDPENSLSGFSDHIPMSLRRW+CYPMLLGRVRRNFKH+MLIDAKSSLLLGDPL RVR+K TESVI FT            KHGKKNS+KSN
Subjt:  SVVSFDAAEIDPENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHIMLIDAKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKSN

Query:  SRRHLVNPAVVIGGARGVRRLSNAALVEIARTLMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVITAAESIPEASSLAGFDSDSVGSLSAPENAIFQRGNN
        S  H+VNP++VIGGARG+RR+SNAA+VEI R LMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVI A+ESIPEASS  G + +SVG LSAPE  +F +GNN
Subjt:  SRRHLVNPAVVIGGARGVRRLSNAALVEIARTLMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVITAAESIPEASSLAGFDSDSVGSLSAPENAIFQRGNN

Query:  GNFREINSIIMKKICSSEMDSSVYSGC
        GN  EINS+IMKKICSSE+DSSVY+ C
Subjt:  GNFREINSIIMKKICSSEMDSSVYSGC

A0A5A7TPI4 Uncharacterized protein8.6e-21375.95Show/hide
Query:  MGLPLTGKSKSTAGENWGMGFLLVFFSEDSPSAIADQKKLF---------------------SSSSVRRSNYNLLTRAQSVISVCALLVFVSLLLFTLST
        MGL LTGKSKSTAGENWGMG LLVFFSEDSPS IAD   LF                     SS+S+RRSNYNLLT+AQS ISVCALLVF+SLLLFTLST
Subjt:  MGLPLTGKSKSTAGENWGMGFLLVFFSEDSPSAIADQKKLF---------------------SSSSVRRSNYNLLTRAQSVISVCALLVFVSLLLFTLST

Query:  FEPAIKMNLTPPRRFLSQKSAPIEVRTPSGKYVYRWNWLGKMWKQKPATAKRTDA-AAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRL
        FEP IKMNLTPPRR L+QKS PI+VR P G    RWNW GKMWKQKPA  K T   A   VALQ MGTLYMRGTRAM DL VVHV+EDVGEED RLFLRL
Subjt:  FEPAIKMNLTPPRRFLSQKSAPIEVRTPSGKYVYRWNWLGKMWKQKPATAKRTDA-AAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRL

Query:  FHRSGVTAKSDSVFIFPSAAFSSRFGSVIREENESFLKLVRRYRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTRLSY
        FHRSGVTAKSDSVF+FPS AFS RFG +IREEN+SFLKL+ RYRNLN TASR AA GFDVT+  KSKEKKETEEP+WGKRVKR +N S+GGEDELTRLSY
Subjt:  FHRSGVTAKSDSVFIFPSAAFSSRFGSVIREENESFLKLVRRYRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTRLSY

Query:  GSVVSFDAAEIDPENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHIMLIDAKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKS
        GSVVSFDA EIDPENSLSGFSDHIPMSLRRW+CYPMLLGRVRRNFKH+MLIDAKSSLLLGDPL RVR+K TESVI FT            KHGKKNS+KS
Subjt:  GSVVSFDAAEIDPENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHIMLIDAKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKS

Query:  NSRRHLVNPAVVIGGARGVRRLSNAALVEIARTLMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVITAAESIPEASSLAGFDSDSVGSLSAPENAIFQRGN
        NS  H+VNP++VIGGARG+RR+SNAA+VEI R LMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVI A ESIPEASS  G + +SVG LSAPE  +F +GN
Subjt:  NSRRHLVNPAVVIGGARGVRRLSNAALVEIARTLMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVITAAESIPEASSLAGFDSDSVGSLSAPENAIFQRGN

Query:  NGNFREINSIIMKKICSSEMDSSVYSGC
        NGN  EINS+IMKKICSSE+DSSVY+ C
Subjt:  NGNFREINSIIMKKICSSEMDSSVYSGC

A0A5D3D8F3 Uncharacterized protein3.5e-21477.26Show/hide
Query:  MGLPLTGKSKSTAGENWGMGFLLVFFSEDSPSAIADQKKLF------------SSSSVRRSNYNLLTRAQSVISVCALLVFVSLLLFTLSTFEPAIKMNL
        MGL LTGKSKSTAGENWGMG LLVFFSEDSPS IAD   LF            SS+S+RRSNYNLLT+AQS ISVCALLVF+SLLLFTLSTFEP IKMNL
Subjt:  MGLPLTGKSKSTAGENWGMGFLLVFFSEDSPSAIADQKKLF------------SSSSVRRSNYNLLTRAQSVISVCALLVFVSLLLFTLSTFEPAIKMNL

Query:  TPPRRFLSQKSAPIEVRTPSGKYVYRWNWLGKMWKQKPATAKRTDA-AAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRLFHRSGVTAK
        TPPRR L+QKS PI+VR P G    RWNW GKMWKQKPA  K T   A   VALQ MGTLYMRGTRAM DL VVHV+EDVGEED RLFLRLFHRSGVTAK
Subjt:  TPPRRFLSQKSAPIEVRTPSGKYVYRWNWLGKMWKQKPATAKRTDA-AAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRLFHRSGVTAK

Query:  SDSVFIFPSAAFSSRFGSVIREENESFLKLVRRYRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTRLSYGSVVSFDAA
        SDSVF+FPS AFS RFG +IREEN+SFLKL+ RYRNLN TASR AA GFDVT+  KSKEKKETEEP+WGKRVKR +N S+GGEDELTRLSYGSVVSFDA 
Subjt:  SDSVFIFPSAAFSSRFGSVIREENESFLKLVRRYRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTRLSYGSVVSFDAA

Query:  EIDPENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHIMLIDAKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKSNSRRHLVNP
        EIDPENSLSGFSDHIPMSLRRW+CYPMLLGRVRRNFKH+MLIDAKSSLLLGDPL RVR+K TESVI FT            KHGKKNS+KSNS  H+VNP
Subjt:  EIDPENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHIMLIDAKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKSNSRRHLVNP

Query:  AVVIGGARGVRRLSNAALVEIARTLMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVITAAESIPEASSLAGFDSDSVGSLSAPENAIFQRGNNGNFREINS
        ++VIGGARG+RR+SNAA+VEI R LMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVI A+ESIPEASS  G + +SVG LSAPE  +F +GNNGN  EINS
Subjt:  AVVIGGARGVRRLSNAALVEIARTLMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVITAAESIPEASSLAGFDSDSVGSLSAPENAIFQRGNNGNFREINS

Query:  IIMKKICSSEMDSSVYSGC
        +IMKKICSSE+DSSVY+ C
Subjt:  IIMKKICSSEMDSSVYSGC

A0A6J1EAJ1 uncharacterized protein LOC1114323124.7e-20376.32Show/hide
Query:  MGLPLTGKSKSTAGENWGMGFLLVFFSEDSPSAIADQKKLF----SSSSVRRSNYNLLTRAQSVISVCALLVFVSLLLFTLSTFEPAIKMNLTPPRRFLS
        MGL +TGKSKSTA ENWGMG  LVFFSEDSPSAIAD  KLF    SSSS RRSNYNLL++AQS ISVCALLVFVSLLLFTLSTFEPAIKMNLTPPRR LS
Subjt:  MGLPLTGKSKSTAGENWGMGFLLVFFSEDSPSAIADQKKLF----SSSSVRRSNYNLLTRAQSVISVCALLVFVSLLLFTLSTFEPAIKMNLTPPRRFLS

Query:  QKSAPIEVRTPSGKYVYRWNWLGKMWKQKPATAKRTDAAAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRLFHRSGVTAKSDSVFIFPS
        +KS PIE+RTPS   V RWNW  KMWKQKPA       +    ALQ MGTLY+RGTRAMAD+ VVHV EDV E+D RLFLRLFHRSGVTAKSDSVFIF S
Subjt:  QKSAPIEVRTPSGKYVYRWNWLGKMWKQKPATAKRTDAAAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRLFHRSGVTAKSDSVFIFPS

Query:  AAFSSRFGSVIREENESFLKLVRRYRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTRLSYGSVVSFDAAEIDPENSLS
          FS +FG +IREENESFLKL+ R RN N TA+RRA  GFDV QFVK KEKKE EEP+WGK+ KR +N+S GGEDELTRLSYGSVVSFDAAEIDPENSLS
Subjt:  AAFSSRFGSVIREENESFLKLVRRYRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTRLSYGSVVSFDAAEIDPENSLS

Query:  GFSDHIPMSLRRWACYPMLLGRVRRNFKHIMLIDAKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKSNSRRHLVNPAVVIGGARG
        GFSDHIPMSLRRWACYPMLLGRVRRNFKH+ML+DAK+S+L+GDPLGR+R+K TESVILFT            KH KKNS+KS+   +LVNPAVVIGGARG
Subjt:  GFSDHIPMSLRRWACYPMLLGRVRRNFKHIMLIDAKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKSNSRRHLVNPAVVIGGARG

Query:  VRRLSNAALVEIARTLMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVITAAESIPEASSLAGFDSDSVGSLSAPE-NAIFQRGNNGNFREINSIIMKKICS
        VRRLSNA +VEIAR LMQH KKNSVSDS VLSHLVNSEFLLKNVKVI A ESIP+AS LAG +S SVGSLSAPE   I +R N GN REINS+I+KKICS
Subjt:  VRRLSNAALVEIARTLMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVITAAESIPEASSLAGFDSDSVGSLSAPE-NAIFQRGNNGNFREINSIIMKKICS

Query:  SEMDSSVYSGC
        SE+DSSVYS C
Subjt:  SEMDSSVYSGC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G57400.1 unknown protein4.8e-10746.65Show/hide
Query:  SKSTAGENWGMGFLLVFF------SEDSPSAIADQ---KKLFSSSSVRRSNYNLLTRAQSVISVCALLVFVSLLLFTLSTFEPA----IKMNLTPPRRFL
        +K T     GMG LLVFF      ++DSPS+ +       LF S    RS+  LL++AQS IS+C LL+F++L LFTLSTFEP+       +  P RRFL
Subjt:  SKSTAGENWGMGFLLVFF------SEDSPSAIADQ---KKLFSSSSVRRSNYNLLTRAQSVISVCALLVFVSLLLFTLSTFEPA----IKMNLTPPRRFL

Query:  SQK--SAPIEVRTPSGKYVYRWNWLGKMWKQKPATAKRTDAAAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRLFHRSGVTAKSDSVFI
          +  SA  E R    ++                            ALQGMGTL++RGT++M DL VVH++ D  E+DLRLF+RL HRSGVT+KSD V +
Subjt:  SQK--SAPIEVRTPSGKYVYRWNWLGKMWKQKPATAKRTDAAAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRLFHRSGVTAKSDSVFI

Query:  FPSAAFSSRFGSVIREENESFLKLVRRYRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTR----LSYGSVVSFDAAEI
        F S    +RF  +I EEN+SFLKLV  +R  NS+    +  GF++T+F+K + K  + EP+WGK+  R +       +  T     L++GSVV FD  E+
Subjt:  FPSAAFSSRFGSVIREENESFLKLVRRYRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTR----LSYGSVVSFDAAEI

Query:  DPENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHIMLIDAKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKSNSRRHLVNPAV
        DPENSLSGF DH+P+SLRRWACYPMLLGRVRRNFKH+ML+DAK+SL LGDPL R+R++S ESV+ F+K       SSS+   KK+S+        VNPA+
Subjt:  DPENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHIMLIDAKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKSNSRRHLVNPAV

Query:  VIGGARGVRRLSNAALVEIARTLM--QHKKKNSVSDSGVLSHLVNSEFLLKNVKVITAAESIPEASSLAGFDSDSVGSLSAPENAIFQRG----NNGNFR
        +IGGA+G+RRLS++   EI R  +  QHKKKNSV++S VLS LV +  + KN +V+T+   +PEASSLA   + +  + S   + I QRG    N+ +  
Subjt:  VIGGARGVRRLSNAALVEIARTLM--QHKKKNSVSDSGVLSHLVNSEFLLKNVKVITAAESIPEASSLAGFDSDSVGSLSAPENAIFQRG----NNGNFR

Query:  EINSIIMKKICSSEMDSSVYSGC
        +I +IIMK+ICS E+DSSVY+ C
Subjt:  EINSIIMKKICSSEMDSSVYSGC

AT5G52500.1 unknown protein3.4e-7640.45Show/hide
Query:  MGFLLVFF----SEDSPSAIADQKKLFSSSSVRRSNYNLLTRAQSVISVCALLVFVSLLLFTLSTFEPAIKMNLTPPRRFLSQKSAPIEVRTPSGKYVYR
        MG  LV F    + DS S  +  K  F S   + S+  LL++A+S IS C +L+F++L LFTLSTFE        P  RF +       + +P  +++  
Subjt:  MGFLLVFF----SEDSPSAIADQKKLFSSSSVRRSNYNLLTRAQSVISVCALLVFVSLLLFTLSTFEPAIKMNLTPPRRFLSQKSAPIEVRTPSGKYVYR

Query:  WNWLGKMWKQKPATAKRTDAAAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRLFHRSGVTAKSDSVFIFPSAAFSSRFGSVIREENESF
         N + + +                 ALQGMGTL++RGT++M DL + H+A    E DLRLF+RL HRSGVT+KSD V +F S + ++RF  +I +EN SF
Subjt:  WNWLGKMWKQKPATAKRTDAAAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRLFHRSGVTAKSDSVFIFPSAAFSSRFGSVIREENESF

Query:  LKLVRRYRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTRLSYGSVVSFDAAEIDPENSLSGFSDHIPMSLRRWACYPM
        LKLV  +RN + T+S                    +E  +WGK+ +  +N +         L++GS+V FD  E+DPENSLSGF D +P+SLRRWACYPM
Subjt:  LKLVRRYRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTRLSYGSVVSFDAAEIDPENSLSGFSDHIPMSLRRWACYPM

Query:  LLGRVRRNFKHIMLIDAKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKSNSRRHLVNPAVVIGGARGVRRLSNAALVEIAR-TLM
        LLGRVRR+FKH+ML+DAK+S  +GDP  R+R++S +SV+ F            +KH  KN+ +       VNP ++IGGA+G+RRLS++   EI R T+M
Subjt:  LLGRVRRNFKHIMLIDAKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKSNSRRHLVNPAVVIGGARGVRRLSNAALVEIAR-TLM

Query:  QHKKKNSVSDSGVLSHLVNSEFLLKNVKVITAAESIPEAS
        +      V++S VLS LV +  + KN +V+ +   +PEA+
Subjt:  QHKKKNSVSDSGVLSHLVNSEFLLKNVKVITAAESIPEAS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTCTCCCTCTCACCGGAAAATCCAAATCCACCGCCGGCGAGAACTGGGGCATGGGCTTTCTTCTCGTCTTCTTCTCGGAGGACTCACCCTCCGCCATTGCCGACCA
GAAGAAGCTCTTCTCTTCTTCCTCTGTTCGTCGGAGTAATTACAATCTTCTCACCAGAGCTCAGTCCGTTATCTCCGTCTGCGCTCTCCTCGTCTTCGTCTCTCTCCTTC
TCTTCACTCTCTCGACCTTCGAGCCCGCCATTAAAATGAACCTCACTCCGCCTCGGAGGTTTTTATCTCAGAAATCGGCGCCGATTGAAGTCCGTACTCCGTCGGGGAAG
TATGTGTATCGGTGGAATTGGTTGGGGAAAATGTGGAAGCAGAAACCGGCAACGGCGAAGAGGACCGATGCGGCTGCTCCGGCGGTGGCGCTGCAAGGAATGGGGACTCT
GTACATGCGAGGTACTCGAGCCATGGCGGACCTCGCGGTGGTTCACGTGGCGGAGGATGTCGGAGAAGAAGACCTCCGCCTCTTTCTCCGACTGTTTCATCGGTCCGGTG
TCACCGCGAAATCCGATTCCGTGTTCATCTTCCCTTCGGCGGCGTTCTCCTCGAGATTCGGTTCAGTGATTCGGGAGGAAAACGAGTCGTTCTTGAAACTCGTTCGTCGG
TACCGGAATTTGAACAGCACAGCTAGCCGGAGAGCGGCGGTGGGTTTCGACGTGACTCAGTTTGTAAAGAGTAAAGAGAAGAAGGAAACGGAGGAGCCGCTGTGGGGGAA
GAGAGTGAAACGAGTCTCCAACGAGTCGGACGGCGGCGAGGACGAGTTGACTCGGCTGAGTTACGGCTCGGTAGTGAGTTTCGACGCGGCGGAGATAGATCCGGAGAATT
CGCTCTCCGGATTCTCAGATCACATTCCGATGAGTCTACGGCGGTGGGCGTGTTATCCGATGCTCCTCGGCCGAGTCCGCCGGAATTTCAAGCACATAATGCTCATCGAC
GCCAAAAGCTCGCTCCTACTCGGCGATCCACTCGGCCGAGTCAGAAGCAAAAGCACCGAGTCAGTAATTCTCTTCACAAAACCAGAACAAATCCAATTCCCATCCTCATC
AACCAAGCACGGCAAAAAGAACTCGGACAAATCCAATTCCCGCCGCCACCTGGTAAATCCGGCCGTCGTGATCGGCGGGGCTCGCGGCGTCCGGCGGCTATCGAACGCCG
CACTGGTGGAGATAGCCCGAACCCTAATGCAGCACAAGAAGAAGAACTCGGTCTCCGACTCGGGAGTACTGAGTCACCTCGTTAACAGCGAGTTCTTGTTGAAGAATGTG
AAGGTGATCACGGCGGCTGAGTCGATACCCGAAGCGAGTTCACTCGCCGGGTTTGACTCGGATTCAGTCGGTTCCTTGTCGGCGCCGGAGAATGCGATATTCCAGCGGGG
TAATAATGGTAATTTTCGGGAAATTAATTCTATAATTATGAAGAAAATATGTTCGTCGGAAATGGATTCTTCTGTCTATAGTGGCTGC
mRNA sequenceShow/hide mRNA sequence
ATGGGTCTCCCTCTCACCGGAAAATCCAAATCCACCGCCGGCGAGAACTGGGGCATGGGCTTTCTTCTCGTCTTCTTCTCGGAGGACTCACCCTCCGCCATTGCCGACCA
GAAGAAGCTCTTCTCTTCTTCCTCTGTTCGTCGGAGTAATTACAATCTTCTCACCAGAGCTCAGTCCGTTATCTCCGTCTGCGCTCTCCTCGTCTTCGTCTCTCTCCTTC
TCTTCACTCTCTCGACCTTCGAGCCCGCCATTAAAATGAACCTCACTCCGCCTCGGAGGTTTTTATCTCAGAAATCGGCGCCGATTGAAGTCCGTACTCCGTCGGGGAAG
TATGTGTATCGGTGGAATTGGTTGGGGAAAATGTGGAAGCAGAAACCGGCAACGGCGAAGAGGACCGATGCGGCTGCTCCGGCGGTGGCGCTGCAAGGAATGGGGACTCT
GTACATGCGAGGTACTCGAGCCATGGCGGACCTCGCGGTGGTTCACGTGGCGGAGGATGTCGGAGAAGAAGACCTCCGCCTCTTTCTCCGACTGTTTCATCGGTCCGGTG
TCACCGCGAAATCCGATTCCGTGTTCATCTTCCCTTCGGCGGCGTTCTCCTCGAGATTCGGTTCAGTGATTCGGGAGGAAAACGAGTCGTTCTTGAAACTCGTTCGTCGG
TACCGGAATTTGAACAGCACAGCTAGCCGGAGAGCGGCGGTGGGTTTCGACGTGACTCAGTTTGTAAAGAGTAAAGAGAAGAAGGAAACGGAGGAGCCGCTGTGGGGGAA
GAGAGTGAAACGAGTCTCCAACGAGTCGGACGGCGGCGAGGACGAGTTGACTCGGCTGAGTTACGGCTCGGTAGTGAGTTTCGACGCGGCGGAGATAGATCCGGAGAATT
CGCTCTCCGGATTCTCAGATCACATTCCGATGAGTCTACGGCGGTGGGCGTGTTATCCGATGCTCCTCGGCCGAGTCCGCCGGAATTTCAAGCACATAATGCTCATCGAC
GCCAAAAGCTCGCTCCTACTCGGCGATCCACTCGGCCGAGTCAGAAGCAAAAGCACCGAGTCAGTAATTCTCTTCACAAAACCAGAACAAATCCAATTCCCATCCTCATC
AACCAAGCACGGCAAAAAGAACTCGGACAAATCCAATTCCCGCCGCCACCTGGTAAATCCGGCCGTCGTGATCGGCGGGGCTCGCGGCGTCCGGCGGCTATCGAACGCCG
CACTGGTGGAGATAGCCCGAACCCTAATGCAGCACAAGAAGAAGAACTCGGTCTCCGACTCGGGAGTACTGAGTCACCTCGTTAACAGCGAGTTCTTGTTGAAGAATGTG
AAGGTGATCACGGCGGCTGAGTCGATACCCGAAGCGAGTTCACTCGCCGGGTTTGACTCGGATTCAGTCGGTTCCTTGTCGGCGCCGGAGAATGCGATATTCCAGCGGGG
TAATAATGGTAATTTTCGGGAAATTAATTCTATAATTATGAAGAAAATATGTTCGTCGGAAATGGATTCTTCTGTCTATAGTGGCTGC
Protein sequenceShow/hide protein sequence
MGLPLTGKSKSTAGENWGMGFLLVFFSEDSPSAIADQKKLFSSSSVRRSNYNLLTRAQSVISVCALLVFVSLLLFTLSTFEPAIKMNLTPPRRFLSQKSAPIEVRTPSGK
YVYRWNWLGKMWKQKPATAKRTDAAAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRLFHRSGVTAKSDSVFIFPSAAFSSRFGSVIREENESFLKLVRR
YRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTRLSYGSVVSFDAAEIDPENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHIMLID
AKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKSNSRRHLVNPAVVIGGARGVRRLSNAALVEIARTLMQHKKKNSVSDSGVLSHLVNSEFLLKNV
KVITAAESIPEASSLAGFDSDSVGSLSAPENAIFQRGNNGNFREINSIIMKKICSSEMDSSVYSGC