; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10003201 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10003201
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionArmadillo-type fold containing protein
Genome locationChr11:18666060..18668084
RNA-Seq ExpressionHG10003201
SyntenyHG10003201
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008448939.1 PREDICTED: uncharacterized protein LOC103490955 isoform X1 [Cucumis melo]1.9e-26681.92Show/hide
Query:  MAKQASSVFLEEWLKSIGGIRTSLYSKPTSSSAREIIQAWAELRSSLEHQSFDDHHIQSLKTLVNSQ---------------------------------
        MAKQ SSVFLEEWLKSI GI     SKPTSSSAREIIQAWAELRSSLEHQ FDD HIQSLK LVNSQ                                 
Subjt:  MAKQASSVFLEEWLKSIGGIRTSLYSKPTSSSAREIIQAWAELRSSLEHQSFDDHHIQSLKTLVNSQ---------------------------------

Query:  --------KSLRPSLVLVDSSVEVLSQIFSSKIELRKNPLFFSEGVLVLGAISYLLSASEKSKLCSLELLCRVLEEEYLLAGSVGGIIPEFLAGVGYALS
                KSLRPSLVL+DSSVEVLSQIFSSKIELRK PLF SEGVLVLGAISY LSASEKSKLC LELLCRVLEE+YLL   VGGI+PEFLAG+GYALS
Subjt:  --------KSLRPSLVLVDSSVEVLSQIFSSKIELRKNPLFFSEGVLVLGAISYLLSASEKSKLCSLELLCRVLEEEYLLAGSVGGIIPEFLAGVGYALS

Query:  SSVNAHVIRLLDSLLGIWGKVGGPTGTISSGLMILHMIEWVTSGLISLHSFEKLDVFSHATLVSSKESYASFAVVMAAAGILRAFNTYKTLLSSSERETI
        SSVNAHV+RLLDSLLGIW KV GP  T+SSGLMILHMIEWVTSGLI+LHSFEKLDVFSHATLVSSKESYASFAVVMAAAGILR FNTYK LL+SSERETI
Subjt:  SSVNAHVIRLLDSLLGIWGKVGGPTGTISSGLMILHMIEWVTSGLISLHSFEKLDVFSHATLVSSKESYASFAVVMAAAGILRAFNTYKTLLSSSERETI

Query:  SRIRISAQDCLESIARNFISTMEGSSITSNDHKRSVLLLCISLAIARCGPVSSRPPVLVCVVYALLTEIFPLQRLYAKIIEFSFAEMGVLGLTLVKEHLG
        SRIRI+AQDCLESIARNFISTME SSIT NDH+RSVLLLCISLAIARCGPVS+RPPVL+ VVY LLTEIFPLQRLYAKI EFSFAE+GVLGLTLVKEHLG
Subjt:  SRIRISAQDCLESIARNFISTMEGSSITSNDHKRSVLLLCISLAIARCGPVSSRPPVLVCVVYALLTEIFPLQRLYAKIIEFSFAEMGVLGLTLVKEHLG

Query:  SIPFKEAGAIVGVLCSQYALLEEEDKNFVENLVWYYCQDVYSKHRQVGLVLRDREDELLENIEKIAESAFLMFVVFALAVTKEKLDSKYTLESQIDVSVR
        SIPFKEAGAI GVLCSQYA L EE+++ VENLVW YC+DVYS+HR VGLVLR REDELLENIEKIAESAFLM VVFALAVTKEKLDSKYTLESQ DVSVR
Subjt:  SIPFKEAGAIVGVLCSQYALLEEEDKNFVENLVWYYCQDVYSKHRQVGLVLRDREDELLENIEKIAESAFLMFVVFALAVTKEKLDSKYTLESQIDVSVR

Query:  ILILFSCMEYFRRIRLPEYMDTIRGVVASIQGNESACVSFIESMPTYQDQTNGPDNSIGQKIKYSWIKDEVQTARMLFYVRVIPTCIERVRTQVYGKVVA
        IL+ FSCMEYFRRIRL EYM+TIRGVVASIQGNESACVSFIESMPTYQDQTNGPDNSIGQKIKYSW+KDEVQTARMLFY+RVIPTC+E V TQVYGKVVA
Subjt:  ILILFSCMEYFRRIRLPEYMDTIRGVVASIQGNESACVSFIESMPTYQDQTNGPDNSIGQKIKYSWIKDEVQTARMLFYVRVIPTCIERVRTQVYGKVVA

Query:  PTMFLYPVLRNSYV
        PTMFLY   +N+ V
Subjt:  PTMFLYPVLRNSYV

XP_008448940.1 PREDICTED: uncharacterized protein LOC103490955 isoform X2 [Cucumis melo]1.9e-26681.92Show/hide
Query:  MAKQASSVFLEEWLKSIGGIRTSLYSKPTSSSAREIIQAWAELRSSLEHQSFDDHHIQSLKTLVNSQ---------------------------------
        MAKQ SSVFLEEWLKSI GI     SKPTSSSAREIIQAWAELRSSLEHQ FDD HIQSLK LVNSQ                                 
Subjt:  MAKQASSVFLEEWLKSIGGIRTSLYSKPTSSSAREIIQAWAELRSSLEHQSFDDHHIQSLKTLVNSQ---------------------------------

Query:  --------KSLRPSLVLVDSSVEVLSQIFSSKIELRKNPLFFSEGVLVLGAISYLLSASEKSKLCSLELLCRVLEEEYLLAGSVGGIIPEFLAGVGYALS
                KSLRPSLVL+DSSVEVLSQIFSSKIELRK PLF SEGVLVLGAISY LSASEKSKLC LELLCRVLEE+YLL   VGGI+PEFLAG+GYALS
Subjt:  --------KSLRPSLVLVDSSVEVLSQIFSSKIELRKNPLFFSEGVLVLGAISYLLSASEKSKLCSLELLCRVLEEEYLLAGSVGGIIPEFLAGVGYALS

Query:  SSVNAHVIRLLDSLLGIWGKVGGPTGTISSGLMILHMIEWVTSGLISLHSFEKLDVFSHATLVSSKESYASFAVVMAAAGILRAFNTYKTLLSSSERETI
        SSVNAHV+RLLDSLLGIW KV GP  T+SSGLMILHMIEWVTSGLI+LHSFEKLDVFSHATLVSSKESYASFAVVMAAAGILR FNTYK LL+SSERETI
Subjt:  SSVNAHVIRLLDSLLGIWGKVGGPTGTISSGLMILHMIEWVTSGLISLHSFEKLDVFSHATLVSSKESYASFAVVMAAAGILRAFNTYKTLLSSSERETI

Query:  SRIRISAQDCLESIARNFISTMEGSSITSNDHKRSVLLLCISLAIARCGPVSSRPPVLVCVVYALLTEIFPLQRLYAKIIEFSFAEMGVLGLTLVKEHLG
        SRIRI+AQDCLESIARNFISTME SSIT NDH+RSVLLLCISLAIARCGPVS+RPPVL+ VVY LLTEIFPLQRLYAKI EFSFAE+GVLGLTLVKEHLG
Subjt:  SRIRISAQDCLESIARNFISTMEGSSITSNDHKRSVLLLCISLAIARCGPVSSRPPVLVCVVYALLTEIFPLQRLYAKIIEFSFAEMGVLGLTLVKEHLG

Query:  SIPFKEAGAIVGVLCSQYALLEEEDKNFVENLVWYYCQDVYSKHRQVGLVLRDREDELLENIEKIAESAFLMFVVFALAVTKEKLDSKYTLESQIDVSVR
        SIPFKEAGAI GVLCSQYA L EE+++ VENLVW YC+DVYS+HR VGLVLR REDELLENIEKIAESAFLM VVFALAVTKEKLDSKYTLESQ DVSVR
Subjt:  SIPFKEAGAIVGVLCSQYALLEEEDKNFVENLVWYYCQDVYSKHRQVGLVLRDREDELLENIEKIAESAFLMFVVFALAVTKEKLDSKYTLESQIDVSVR

Query:  ILILFSCMEYFRRIRLPEYMDTIRGVVASIQGNESACVSFIESMPTYQDQTNGPDNSIGQKIKYSWIKDEVQTARMLFYVRVIPTCIERVRTQVYGKVVA
        IL+ FSCMEYFRRIRL EYM+TIRGVVASIQGNESACVSFIESMPTYQDQTNGPDNSIGQKIKYSW+KDEVQTARMLFY+RVIPTC+E V TQVYGKVVA
Subjt:  ILILFSCMEYFRRIRLPEYMDTIRGVVASIQGNESACVSFIESMPTYQDQTNGPDNSIGQKIKYSWIKDEVQTARMLFYVRVIPTCIERVRTQVYGKVVA

Query:  PTMFLYPVLRNSYV
        PTMFLY   +N+ V
Subjt:  PTMFLYPVLRNSYV

XP_038903921.1 uncharacterized protein LOC120090375 isoform X1 [Benincasa hispida]6.1e-27384.65Show/hide
Query:  MAKQASSVFLEEWLKSIGGIRTSLYSKPTSSSAREIIQAWAELRSSLEHQSFDDHHIQSLKTLVNSQ---------------------------------
        MAKQ+SS+FLEEWLKSIGG  T+L SK TSSSAREIIQAWAELRSSLEHQSFDD HIQSLK LVNSQ                                 
Subjt:  MAKQASSVFLEEWLKSIGGIRTSLYSKPTSSSAREIIQAWAELRSSLEHQSFDDHHIQSLKTLVNSQ---------------------------------

Query:  --------KSLRPSLVLVDSSVEVLSQIFSSKIELRKNPLFFSEGVLVLGAISYLLSASEKSKLCSLELLCRVLEEEYLLAGSVGGIIPEFLAGVGYALS
                KSLRPSLVLVDSSVEVLS IFSSKIELRKNPLFFSEGVLVLGAISYLLSASEKSKLC LELLCRVLEEEYLL GSVG IIPEFLAG+GYALS
Subjt:  --------KSLRPSLVLVDSSVEVLSQIFSSKIELRKNPLFFSEGVLVLGAISYLLSASEKSKLCSLELLCRVLEEEYLLAGSVGGIIPEFLAGVGYALS

Query:  SSVNAHVIRLLDSLLGIWGKVGGPTGTISSGLMILHMIEWVTSGLISLHSFEKLDVFSHATLVSSKESYASFAVVMAAAGILRAFNTYKTLLSSSERETI
        SSVNAHV+RLLDSLLGIWG +GGP  T+SSGLMILHMIEWVTSG+ISLHSFEKLDVFS A LVSSKESYASFAVVMAAAGILRAFNT K LLSSSERETI
Subjt:  SSVNAHVIRLLDSLLGIWGKVGGPTGTISSGLMILHMIEWVTSGLISLHSFEKLDVFSHATLVSSKESYASFAVVMAAAGILRAFNTYKTLLSSSERETI

Query:  SRIRISAQDCLESIARNFISTMEGSSITSNDHKRSVLLLCISLAIARCGPVSSRPPVLVCVVYALLTEIFPLQRLYAKIIEFSFAEMGVLGLTLVKEHLG
        SRIRISAQDCLESIARNFISTMEGSSIT NDH+RSVLLLCISLAIARCGPVSS PPVL+CVVYALLTEIFPLQRLYAKI EFSFAE+G LGLTLV EHLG
Subjt:  SRIRISAQDCLESIARNFISTMEGSSITSNDHKRSVLLLCISLAIARCGPVSSRPPVLVCVVYALLTEIFPLQRLYAKIIEFSFAEMGVLGLTLVKEHLG

Query:  SIPFKEAGAIVGVLCSQYALLEEEDKNFVENLVWYYCQDVYSKHRQVGLVLRDREDELLENIEKIAESAFLMFVVFALAVTKEKLDSKYTLESQIDVSVR
        SIPFKEAGAI GV CSQYA LEEEDK+FVENLVW YCQDVYS+HR  GLVLR REDELLENIEKIAESAFLM VVFALAVTKEKLDSKYTLESQ D+SVR
Subjt:  SIPFKEAGAIVGVLCSQYALLEEEDKNFVENLVWYYCQDVYSKHRQVGLVLRDREDELLENIEKIAESAFLMFVVFALAVTKEKLDSKYTLESQIDVSVR

Query:  ILILFSCMEYFRRIRLPEYMDTIRGVVASIQGNESACVSFIESMPTYQDQTNGPDNSIGQKIKYSWIKDEVQTARMLFYVRVIPTCIERVRTQVYGKVVA
        IL+ FSCMEYFRRIRLPEYMDTIRGVVASIQGNESACVSFIESMPTYQDQTNGPDNSIG+  KYSW KDEVQTARMLFYVRVIPTCIERV TQVYGKVVA
Subjt:  ILILFSCMEYFRRIRLPEYMDTIRGVVASIQGNESACVSFIESMPTYQDQTNGPDNSIGQKIKYSWIKDEVQTARMLFYVRVIPTCIERVRTQVYGKVVA

Query:  PTMFLY
        PTMFLY
Subjt:  PTMFLY

XP_038903923.1 uncharacterized protein LOC120090375 isoform X2 [Benincasa hispida]6.1e-27384.65Show/hide
Query:  MAKQASSVFLEEWLKSIGGIRTSLYSKPTSSSAREIIQAWAELRSSLEHQSFDDHHIQSLKTLVNSQ---------------------------------
        MAKQ+SS+FLEEWLKSIGG  T+L SK TSSSAREIIQAWAELRSSLEHQSFDD HIQSLK LVNSQ                                 
Subjt:  MAKQASSVFLEEWLKSIGGIRTSLYSKPTSSSAREIIQAWAELRSSLEHQSFDDHHIQSLKTLVNSQ---------------------------------

Query:  --------KSLRPSLVLVDSSVEVLSQIFSSKIELRKNPLFFSEGVLVLGAISYLLSASEKSKLCSLELLCRVLEEEYLLAGSVGGIIPEFLAGVGYALS
                KSLRPSLVLVDSSVEVLS IFSSKIELRKNPLFFSEGVLVLGAISYLLSASEKSKLC LELLCRVLEEEYLL GSVG IIPEFLAG+GYALS
Subjt:  --------KSLRPSLVLVDSSVEVLSQIFSSKIELRKNPLFFSEGVLVLGAISYLLSASEKSKLCSLELLCRVLEEEYLLAGSVGGIIPEFLAGVGYALS

Query:  SSVNAHVIRLLDSLLGIWGKVGGPTGTISSGLMILHMIEWVTSGLISLHSFEKLDVFSHATLVSSKESYASFAVVMAAAGILRAFNTYKTLLSSSERETI
        SSVNAHV+RLLDSLLGIWG +GGP  T+SSGLMILHMIEWVTSG+ISLHSFEKLDVFS A LVSSKESYASFAVVMAAAGILRAFNT K LLSSSERETI
Subjt:  SSVNAHVIRLLDSLLGIWGKVGGPTGTISSGLMILHMIEWVTSGLISLHSFEKLDVFSHATLVSSKESYASFAVVMAAAGILRAFNTYKTLLSSSERETI

Query:  SRIRISAQDCLESIARNFISTMEGSSITSNDHKRSVLLLCISLAIARCGPVSSRPPVLVCVVYALLTEIFPLQRLYAKIIEFSFAEMGVLGLTLVKEHLG
        SRIRISAQDCLESIARNFISTMEGSSIT NDH+RSVLLLCISLAIARCGPVSS PPVL+CVVYALLTEIFPLQRLYAKI EFSFAE+G LGLTLV EHLG
Subjt:  SRIRISAQDCLESIARNFISTMEGSSITSNDHKRSVLLLCISLAIARCGPVSSRPPVLVCVVYALLTEIFPLQRLYAKIIEFSFAEMGVLGLTLVKEHLG

Query:  SIPFKEAGAIVGVLCSQYALLEEEDKNFVENLVWYYCQDVYSKHRQVGLVLRDREDELLENIEKIAESAFLMFVVFALAVTKEKLDSKYTLESQIDVSVR
        SIPFKEAGAI GV CSQYA LEEEDK+FVENLVW YCQDVYS+HR  GLVLR REDELLENIEKIAESAFLM VVFALAVTKEKLDSKYTLESQ D+SVR
Subjt:  SIPFKEAGAIVGVLCSQYALLEEEDKNFVENLVWYYCQDVYSKHRQVGLVLRDREDELLENIEKIAESAFLMFVVFALAVTKEKLDSKYTLESQIDVSVR

Query:  ILILFSCMEYFRRIRLPEYMDTIRGVVASIQGNESACVSFIESMPTYQDQTNGPDNSIGQKIKYSWIKDEVQTARMLFYVRVIPTCIERVRTQVYGKVVA
        IL+ FSCMEYFRRIRLPEYMDTIRGVVASIQGNESACVSFIESMPTYQDQTNGPDNSIG+  KYSW KDEVQTARMLFYVRVIPTCIERV TQVYGKVVA
Subjt:  ILILFSCMEYFRRIRLPEYMDTIRGVVASIQGNESACVSFIESMPTYQDQTNGPDNSIGQKIKYSWIKDEVQTARMLFYVRVIPTCIERVRTQVYGKVVA

Query:  PTMFLY
        PTMFLY
Subjt:  PTMFLY

XP_038903924.1 uncharacterized protein LOC120090375 isoform X3 [Benincasa hispida]8.8e-27284.6Show/hide
Query:  MAKQASSVFLEEWLKSIGGIRTSLYSKPTSSSAREIIQAWAELRSSLEHQSFDDHHIQSLKTLVNSQ---------------------------------
        MAKQ+SS+FLEEWLKSIGG  T+L SK TSSSAREIIQAWAELRSSLEHQSFDD HIQSLK LVNSQ                                 
Subjt:  MAKQASSVFLEEWLKSIGGIRTSLYSKPTSSSAREIIQAWAELRSSLEHQSFDDHHIQSLKTLVNSQ---------------------------------

Query:  --------KSLRPSLVLVDSSVEVLSQIFSSKIELRKNPLFFSEGVLVLGAISYLLSASEKSKLCSLELLCRVLEEEYLLAGSVGGIIPEFLAGVGYALS
                KSLRPSLVLVDSSVEVLS IFSSKIELRKNPLFFSEGVLVLGAISYLLSASEKSKLC LELLCRVLEEEYLL GSVG IIPEFLAG+GYALS
Subjt:  --------KSLRPSLVLVDSSVEVLSQIFSSKIELRKNPLFFSEGVLVLGAISYLLSASEKSKLCSLELLCRVLEEEYLLAGSVGGIIPEFLAGVGYALS

Query:  SSVNAHVIRLLDSLLGIWGKVGGPTGTISSGLMILHMIEWVTSGLISLHSFEKLDVFSHATLVSSKESYASFAVVMAAAGILRAFNTYKTLLSSSERETI
        SSVNAHV+RLLDSLLGIWG +GGP  T+SSGLMILHMIEWVTSG+ISLHSFEKLDVFS A LVSSKESYASFAVVMAAAGILRAFNT K LLSSSERETI
Subjt:  SSVNAHVIRLLDSLLGIWGKVGGPTGTISSGLMILHMIEWVTSGLISLHSFEKLDVFSHATLVSSKESYASFAVVMAAAGILRAFNTYKTLLSSSERETI

Query:  SRIRISAQDCLESIARNFISTMEGSSITSNDHKRSVLLLCISLAIARCGPVSSRPPVLVCVVYALLTEIFPLQRLYAKIIEFSFAEMGVLGLTLVKEHLG
        SRIRISAQDCLESIARNFISTMEGSSIT NDH+RSVLLLCISLAIARCGPVSS PPVL+CVVYALLTEIFPLQRLYAKI EFSFAE+G LGLTLV EHLG
Subjt:  SRIRISAQDCLESIARNFISTMEGSSITSNDHKRSVLLLCISLAIARCGPVSSRPPVLVCVVYALLTEIFPLQRLYAKIIEFSFAEMGVLGLTLVKEHLG

Query:  SIPFKEAGAIVGVLCSQYALLEEEDKNFVENLVWYYCQDVYSKHRQVGLVLRDREDELLENIEKIAESAFLMFVVFALAVTKEKLDSKYTLESQIDVSVR
        SIPFKEAGAI GV CSQYA LEEEDK+FVENLVW YCQDVYS+HR  GLVLR REDELLENIEKIAESAFLM VVFALAVTKEKLDSKYTLESQ D+SVR
Subjt:  SIPFKEAGAIVGVLCSQYALLEEEDKNFVENLVWYYCQDVYSKHRQVGLVLRDREDELLENIEKIAESAFLMFVVFALAVTKEKLDSKYTLESQIDVSVR

Query:  ILILFSCMEYFRRIRLPEYMDTIRGVVASIQGNESACVSFIESMPTYQDQTNGPDNSIGQKIKYSWIKDEVQTARMLFYVRVIPTCIERVRTQVYGKVVA
        IL+ FSCMEYFRRIRLPEYMDTIRGVVASIQGNESACVSFIESMPTYQDQTNGPDNSIG+  KYSW KDEVQTARMLFYVRVIPTCIERV TQVYGKVVA
Subjt:  ILILFSCMEYFRRIRLPEYMDTIRGVVASIQGNESACVSFIESMPTYQDQTNGPDNSIGQKIKYSWIKDEVQTARMLFYVRVIPTCIERVRTQVYGKVVA

Query:  PTMF
        PTMF
Subjt:  PTMF

TrEMBL top hitse value%identityAlignment
A0A0A0L5R8 Uncharacterized protein6.6e-26581.76Show/hide
Query:  MAKQASSVFLEEWLKSIGGIRTSLYSKPTSSSAREIIQAWAELRSSLEHQSFDDHHIQSLKTLVNSQ---------------------------------
        MAKQ SSVFLE+WLKSIGGI     SKPTSSSAREIIQAWAELRSSLEHQ FDD HIQSLK LVNSQ                                 
Subjt:  MAKQASSVFLEEWLKSIGGIRTSLYSKPTSSSAREIIQAWAELRSSLEHQSFDDHHIQSLKTLVNSQ---------------------------------

Query:  --------KSLRPSLVLVDSSVEVLSQIFSSKIELRKNPLFFSEGVLVLGAISYLLSASEKSKLCSLELLCRVLEEEYLLAGSVGGIIPEFLAGVGYALS
                KSLRPSLVLVDSSVEVLSQIFSSKIELRKNPLF SEGVLVLGAISYL SASEKSKLC LELLCRVLEE+YLL   VGGI+PEFLAG+GYA S
Subjt:  --------KSLRPSLVLVDSSVEVLSQIFSSKIELRKNPLFFSEGVLVLGAISYLLSASEKSKLCSLELLCRVLEEEYLLAGSVGGIIPEFLAGVGYALS

Query:  SSVNAHVIRLLDSLLGIWGKVGGPTGTISSGLMILHMIEWVTSGLISLHSFEKLDVFSHATLVSSKESYASFAVVMAAAGILRAFNTYKTLLSSSERETI
        SSVNAHV+RLLDSLLGIW KV GP  T+SSGLMILHMI WVTSGLI+LHSFEKLDVFSHATLVSSKESYASFAVVMAAAGILRAFNTYK LLSSSERETI
Subjt:  SSVNAHVIRLLDSLLGIWGKVGGPTGTISSGLMILHMIEWVTSGLISLHSFEKLDVFSHATLVSSKESYASFAVVMAAAGILRAFNTYKTLLSSSERETI

Query:  SRIRISAQDCLESIARNFISTMEGSSITSNDHKRSVLLLCISLAIARCGPVSSRPPVLVCVVYALLTEIFPLQRLYAKIIEFSFAEMGVLGLTLVKEHLG
        SRIRISAQDCLESIARNFISTMEGSSIT NDH+RSVLLLCISLAIARCGPVS+RPPVL+ VVYALLTEIFPLQRLYAKI EFSF+E+ VLGLTLVKEHLG
Subjt:  SRIRISAQDCLESIARNFISTMEGSSITSNDHKRSVLLLCISLAIARCGPVSSRPPVLVCVVYALLTEIFPLQRLYAKIIEFSFAEMGVLGLTLVKEHLG

Query:  SIPFKEAGAIVGVLCSQYALLEEEDKNFVENLVWYYCQDVYSKHRQVGLVLRDREDELLENIEKIAESAFLMFVVFALAVTKEKLDSKYTLESQIDVSVR
        SIPFKEAGAI GVLCSQYA L EE+K+ VENLVW YC+DVYS+HR V LVL  REDELLE+IEKIAESAFLM VVFALAVTKEKL SKYTLESQ DVSV+
Subjt:  SIPFKEAGAIVGVLCSQYALLEEEDKNFVENLVWYYCQDVYSKHRQVGLVLRDREDELLENIEKIAESAFLMFVVFALAVTKEKLDSKYTLESQIDVSVR

Query:  ILILFSCMEYFRRIRLPEYMDTIRGVVASIQGNESACVSFIESMPTYQDQTNGPDNSIGQKIKYSWIKDEVQTARMLFYVRVIPTCIERVRTQVYGKVVA
        IL+ FSCMEYFRRIRLPEYMDTIRGVV SIQGNESACV FIESMPTYQDQTNGPDNSIGQKI+YSW KDEVQTARMLFY+RV+PTCIE V TQVYGKVVA
Subjt:  ILILFSCMEYFRRIRLPEYMDTIRGVVASIQGNESACVSFIESMPTYQDQTNGPDNSIGQKIKYSWIKDEVQTARMLFYVRVIPTCIERVRTQVYGKVVA

Query:  PTMFLYPVLRNSYV
        PTMFLY    NS V
Subjt:  PTMFLYPVLRNSYV

A0A1S3BKA7 uncharacterized protein LOC103490955 isoform X29.2e-26781.92Show/hide
Query:  MAKQASSVFLEEWLKSIGGIRTSLYSKPTSSSAREIIQAWAELRSSLEHQSFDDHHIQSLKTLVNSQ---------------------------------
        MAKQ SSVFLEEWLKSI GI     SKPTSSSAREIIQAWAELRSSLEHQ FDD HIQSLK LVNSQ                                 
Subjt:  MAKQASSVFLEEWLKSIGGIRTSLYSKPTSSSAREIIQAWAELRSSLEHQSFDDHHIQSLKTLVNSQ---------------------------------

Query:  --------KSLRPSLVLVDSSVEVLSQIFSSKIELRKNPLFFSEGVLVLGAISYLLSASEKSKLCSLELLCRVLEEEYLLAGSVGGIIPEFLAGVGYALS
                KSLRPSLVL+DSSVEVLSQIFSSKIELRK PLF SEGVLVLGAISY LSASEKSKLC LELLCRVLEE+YLL   VGGI+PEFLAG+GYALS
Subjt:  --------KSLRPSLVLVDSSVEVLSQIFSSKIELRKNPLFFSEGVLVLGAISYLLSASEKSKLCSLELLCRVLEEEYLLAGSVGGIIPEFLAGVGYALS

Query:  SSVNAHVIRLLDSLLGIWGKVGGPTGTISSGLMILHMIEWVTSGLISLHSFEKLDVFSHATLVSSKESYASFAVVMAAAGILRAFNTYKTLLSSSERETI
        SSVNAHV+RLLDSLLGIW KV GP  T+SSGLMILHMIEWVTSGLI+LHSFEKLDVFSHATLVSSKESYASFAVVMAAAGILR FNTYK LL+SSERETI
Subjt:  SSVNAHVIRLLDSLLGIWGKVGGPTGTISSGLMILHMIEWVTSGLISLHSFEKLDVFSHATLVSSKESYASFAVVMAAAGILRAFNTYKTLLSSSERETI

Query:  SRIRISAQDCLESIARNFISTMEGSSITSNDHKRSVLLLCISLAIARCGPVSSRPPVLVCVVYALLTEIFPLQRLYAKIIEFSFAEMGVLGLTLVKEHLG
        SRIRI+AQDCLESIARNFISTME SSIT NDH+RSVLLLCISLAIARCGPVS+RPPVL+ VVY LLTEIFPLQRLYAKI EFSFAE+GVLGLTLVKEHLG
Subjt:  SRIRISAQDCLESIARNFISTMEGSSITSNDHKRSVLLLCISLAIARCGPVSSRPPVLVCVVYALLTEIFPLQRLYAKIIEFSFAEMGVLGLTLVKEHLG

Query:  SIPFKEAGAIVGVLCSQYALLEEEDKNFVENLVWYYCQDVYSKHRQVGLVLRDREDELLENIEKIAESAFLMFVVFALAVTKEKLDSKYTLESQIDVSVR
        SIPFKEAGAI GVLCSQYA L EE+++ VENLVW YC+DVYS+HR VGLVLR REDELLENIEKIAESAFLM VVFALAVTKEKLDSKYTLESQ DVSVR
Subjt:  SIPFKEAGAIVGVLCSQYALLEEEDKNFVENLVWYYCQDVYSKHRQVGLVLRDREDELLENIEKIAESAFLMFVVFALAVTKEKLDSKYTLESQIDVSVR

Query:  ILILFSCMEYFRRIRLPEYMDTIRGVVASIQGNESACVSFIESMPTYQDQTNGPDNSIGQKIKYSWIKDEVQTARMLFYVRVIPTCIERVRTQVYGKVVA
        IL+ FSCMEYFRRIRL EYM+TIRGVVASIQGNESACVSFIESMPTYQDQTNGPDNSIGQKIKYSW+KDEVQTARMLFY+RVIPTC+E V TQVYGKVVA
Subjt:  ILILFSCMEYFRRIRLPEYMDTIRGVVASIQGNESACVSFIESMPTYQDQTNGPDNSIGQKIKYSWIKDEVQTARMLFYVRVIPTCIERVRTQVYGKVVA

Query:  PTMFLYPVLRNSYV
        PTMFLY   +N+ V
Subjt:  PTMFLYPVLRNSYV

A0A1S3BLT3 uncharacterized protein LOC103490955 isoform X19.2e-26781.92Show/hide
Query:  MAKQASSVFLEEWLKSIGGIRTSLYSKPTSSSAREIIQAWAELRSSLEHQSFDDHHIQSLKTLVNSQ---------------------------------
        MAKQ SSVFLEEWLKSI GI     SKPTSSSAREIIQAWAELRSSLEHQ FDD HIQSLK LVNSQ                                 
Subjt:  MAKQASSVFLEEWLKSIGGIRTSLYSKPTSSSAREIIQAWAELRSSLEHQSFDDHHIQSLKTLVNSQ---------------------------------

Query:  --------KSLRPSLVLVDSSVEVLSQIFSSKIELRKNPLFFSEGVLVLGAISYLLSASEKSKLCSLELLCRVLEEEYLLAGSVGGIIPEFLAGVGYALS
                KSLRPSLVL+DSSVEVLSQIFSSKIELRK PLF SEGVLVLGAISY LSASEKSKLC LELLCRVLEE+YLL   VGGI+PEFLAG+GYALS
Subjt:  --------KSLRPSLVLVDSSVEVLSQIFSSKIELRKNPLFFSEGVLVLGAISYLLSASEKSKLCSLELLCRVLEEEYLLAGSVGGIIPEFLAGVGYALS

Query:  SSVNAHVIRLLDSLLGIWGKVGGPTGTISSGLMILHMIEWVTSGLISLHSFEKLDVFSHATLVSSKESYASFAVVMAAAGILRAFNTYKTLLSSSERETI
        SSVNAHV+RLLDSLLGIW KV GP  T+SSGLMILHMIEWVTSGLI+LHSFEKLDVFSHATLVSSKESYASFAVVMAAAGILR FNTYK LL+SSERETI
Subjt:  SSVNAHVIRLLDSLLGIWGKVGGPTGTISSGLMILHMIEWVTSGLISLHSFEKLDVFSHATLVSSKESYASFAVVMAAAGILRAFNTYKTLLSSSERETI

Query:  SRIRISAQDCLESIARNFISTMEGSSITSNDHKRSVLLLCISLAIARCGPVSSRPPVLVCVVYALLTEIFPLQRLYAKIIEFSFAEMGVLGLTLVKEHLG
        SRIRI+AQDCLESIARNFISTME SSIT NDH+RSVLLLCISLAIARCGPVS+RPPVL+ VVY LLTEIFPLQRLYAKI EFSFAE+GVLGLTLVKEHLG
Subjt:  SRIRISAQDCLESIARNFISTMEGSSITSNDHKRSVLLLCISLAIARCGPVSSRPPVLVCVVYALLTEIFPLQRLYAKIIEFSFAEMGVLGLTLVKEHLG

Query:  SIPFKEAGAIVGVLCSQYALLEEEDKNFVENLVWYYCQDVYSKHRQVGLVLRDREDELLENIEKIAESAFLMFVVFALAVTKEKLDSKYTLESQIDVSVR
        SIPFKEAGAI GVLCSQYA L EE+++ VENLVW YC+DVYS+HR VGLVLR REDELLENIEKIAESAFLM VVFALAVTKEKLDSKYTLESQ DVSVR
Subjt:  SIPFKEAGAIVGVLCSQYALLEEEDKNFVENLVWYYCQDVYSKHRQVGLVLRDREDELLENIEKIAESAFLMFVVFALAVTKEKLDSKYTLESQIDVSVR

Query:  ILILFSCMEYFRRIRLPEYMDTIRGVVASIQGNESACVSFIESMPTYQDQTNGPDNSIGQKIKYSWIKDEVQTARMLFYVRVIPTCIERVRTQVYGKVVA
        IL+ FSCMEYFRRIRL EYM+TIRGVVASIQGNESACVSFIESMPTYQDQTNGPDNSIGQKIKYSW+KDEVQTARMLFY+RVIPTC+E V TQVYGKVVA
Subjt:  ILILFSCMEYFRRIRLPEYMDTIRGVVASIQGNESACVSFIESMPTYQDQTNGPDNSIGQKIKYSWIKDEVQTARMLFYVRVIPTCIERVRTQVYGKVVA

Query:  PTMFLYPVLRNSYV
        PTMFLY   +N+ V
Subjt:  PTMFLYPVLRNSYV

A0A5D3D7C1 Uncharacterized protein2.7e-26682.67Show/hide
Query:  MAKQASSVFLEEWLKSIGGIRTSLYSKPTSSSAREIIQAWAELRSSLEHQSFDDHHIQSLKTLVNSQ---------------------------------
        MAKQ SSVFLEEWLKSI GI     SKPTSSSAREIIQAWAELRSSLEHQ FDD HIQSLK LVNSQ                                 
Subjt:  MAKQASSVFLEEWLKSIGGIRTSLYSKPTSSSAREIIQAWAELRSSLEHQSFDDHHIQSLKTLVNSQ---------------------------------

Query:  --------KSLRPSLVLVDSSVEVLSQIFSSKIELRKNPLFFSEGVLVLGAISYLLSASEKSKLCSLELLCRVLEEEYLLAGSVGGIIPEFLAGVGYALS
                KSLRPSLVL+DSSVEVLSQIFSSKIELRK PLF SEGVLVLGAISY LSASEKSKLC LELLCRVLEE+YLL   VGGI+PEFLAG+GYALS
Subjt:  --------KSLRPSLVLVDSSVEVLSQIFSSKIELRKNPLFFSEGVLVLGAISYLLSASEKSKLCSLELLCRVLEEEYLLAGSVGGIIPEFLAGVGYALS

Query:  SSVNAHVIRLLDSLLGIWGKVGGPTGTISSGLMILHMIEWVTSGLISLHSFEKLDVFSHATLVSSKESYASFAVVMAAAGILRAFNTYKTLLSSSERETI
        SSVNAHV+RLLDSLLGIW KV GP  T+SSGLMILHMIEWVTSGLI+LHSFEKLDVFSHAT VSSKESYASFAVVMAAAGILR FNTYK LL+SSERETI
Subjt:  SSVNAHVIRLLDSLLGIWGKVGGPTGTISSGLMILHMIEWVTSGLISLHSFEKLDVFSHATLVSSKESYASFAVVMAAAGILRAFNTYKTLLSSSERETI

Query:  SRIRISAQDCLESIARNFISTMEGSSITSNDHKRSVLLLCISLAIARCGPVSSRPPVLVCVVYALLTEIFPLQRLYAKIIEFSFAEMGVLGLTLVKEHLG
        SRIRI+AQDCLESIARNFISTME SSIT NDH+RSVLLLCISLAIARCGPVS+RPPVL+ VVY LLTEIFPLQRLYAKI EFSFAE+GVLGLTLVKEHLG
Subjt:  SRIRISAQDCLESIARNFISTMEGSSITSNDHKRSVLLLCISLAIARCGPVSSRPPVLVCVVYALLTEIFPLQRLYAKIIEFSFAEMGVLGLTLVKEHLG

Query:  SIPFKEAGAIVGVLCSQYALLEEEDKNFVENLVWYYCQDVYSKHRQVGLVLRDREDELLENIEKIAESAFLMFVVFALAVTKEKLDSKYTLESQIDVSVR
        SIPFKEAGAI GVLCSQYA L EE+++ VENLVW YC+DVYS+HR VGLVLR REDELLENIEKIAESAFLM VVFALAVTKEKLDSKYTLESQ DVSVR
Subjt:  SIPFKEAGAIVGVLCSQYALLEEEDKNFVENLVWYYCQDVYSKHRQVGLVLRDREDELLENIEKIAESAFLMFVVFALAVTKEKLDSKYTLESQIDVSVR

Query:  ILILFSCMEYFRRIRLPEYMDTIRGVVASIQGNESACVSFIESMPTYQDQTNGPDNSIGQKIKYSWIKDEVQTARMLFYVRVIPTCIERVRTQVYGKVVA
        IL+ FSCMEYFRRIRL EYM+TIRGVVASIQGNESACVSFIESMPTYQDQTNGPDNSIGQKIKYSW+KDEVQTARMLFY+RVIPTCIE V TQVYGKVVA
Subjt:  ILILFSCMEYFRRIRLPEYMDTIRGVVASIQGNESACVSFIESMPTYQDQTNGPDNSIGQKIKYSWIKDEVQTARMLFYVRVIPTCIERVRTQVYGKVVA

Query:  PTMFLY
        PTMFLY
Subjt:  PTMFLY

A0A6J1KX18 uncharacterized protein LOC111498339 isoform X11.1e-25677.76Show/hide
Query:  MAKQASSVFLEEWLKSIGGIRTSLYSKPTSSSAREIIQAWAELRSSLEHQSFDDHHIQSLKTLVNSQ---------------------------------
        MAKQA+SVFLEEWLKSI GI +   SK +SSSAREIIQAWAELRSSLEHQ FDD HIQSLKTLVNSQ                                 
Subjt:  MAKQASSVFLEEWLKSIGGIRTSLYSKPTSSSAREIIQAWAELRSSLEHQSFDDHHIQSLKTLVNSQ---------------------------------

Query:  --------KSLRPSLVLVDSSVEVLSQIFSSKIELRKNPLFFSEGVLVLGAISYLLSASEKSKLCSLELLCRVL-EEEYLLAGSVGGIIPEFLAGVGYAL
                KSLRPSLVLVDSSVE+LSQIFSSKI LRKNPLF SEGVL+LGAISY++SASEK KLC LELLCR+L EEE+LL GSVGG +PEF AG+GYAL
Subjt:  --------KSLRPSLVLVDSSVEVLSQIFSSKIELRKNPLFFSEGVLVLGAISYLLSASEKSKLCSLELLCRVL-EEEYLLAGSVGGIIPEFLAGVGYAL

Query:  SSSVNAHVIRLLDSLLGIWGKVGGPTGTISSGLMILHMIEWVTSGLISLHSFEKLDVFSHATLVSSKESYASFAVVMAAAGILRAFNTYKTLLSSSERET
        SSSVNAHV+RLLDSLLGIWGK+G PTG +S+GLMILH+IEWVTSGLISLHSF+KLD  S A L SSKESYASFAVVMAAAGILRAFN+YK LLSSSERET
Subjt:  SSSVNAHVIRLLDSLLGIWGKVGGPTGTISSGLMILHMIEWVTSGLISLHSFEKLDVFSHATLVSSKESYASFAVVMAAAGILRAFNTYKTLLSSSERET

Query:  ISRIRISAQDCLESIARNFISTMEGSSITSN-DHKRSVLLLCISLAIARCGPVSSRPPVLVCVVYALLTEIFPLQRLYAKIIEFSFAEMGVLGLTLVKEH
        ISRIRISAQDCLESIA+NFISTMEGSSIT N DH RS+LLLCISLA+ARCGPV+SRPPVL+CV YALLTEIFPLQRLYAK++EFSF E GVLGL+LVKEH
Subjt:  ISRIRISAQDCLESIARNFISTMEGSSITSN-DHKRSVLLLCISLAIARCGPVSSRPPVLVCVVYALLTEIFPLQRLYAKIIEFSFAEMGVLGLTLVKEH

Query:  LGSIPFKEAGAIVGVLCSQYALLEEEDKNFVENLVWYYCQDVYSKHRQVGLVLRDREDELLENIEKIAESAFLMFVVFALAVTKEKLDSKYTLESQIDVS
        L SIPFKEAG I GVLCSQYA ++E+DK  VENLVW YCQD+YS+HR+VGLVLR REDELLENIEKIAESAFLM VVFALAVTKEKL+SKYTLE+Q DVS
Subjt:  LGSIPFKEAGAIVGVLCSQYALLEEEDKNFVENLVWYYCQDVYSKHRQVGLVLRDREDELLENIEKIAESAFLMFVVFALAVTKEKLDSKYTLESQIDVS

Query:  VRILILFSCMEYFRRIRLPEYMDTIRGVVASIQGNESACVSFIESMPTYQDQTNGPDNSIGQKIKYSWIKDEVQTARMLFYVRVIPTCIERVRTQVYGKV
        VRIL  FSCMEYFRRIR+PEYMDTIRGVVAS+Q NESACVSFIESMP+YQDQT+GPD+SIGQK++Y W +DEVQTARMLFY+RVIPTCIE V TQVY KV
Subjt:  VRILILFSCMEYFRRIRLPEYMDTIRGVVASIQGNESACVSFIESMPTYQDQTNGPDNSIGQKIKYSWIKDEVQTARMLFYVRVIPTCIERVRTQVYGKV

Query:  VAPTMFLYPVLRNSYV
        VAPTMFLY    NS V
Subjt:  VAPTMFLYPVLRNSYV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G73970.1 unknown protein8.6e-13244.59Show/hide
Query:  MAKQA-SSVFLEEWLKSIGGIRTS--LYSKPTSSSAREIIQAWAELRSSLEHQSFDDHHIQSLKTLVNSQ------------------------------
        MA++A +S FLEEWL+++ G   S  L  + ++ SAR IIQAW+E+R SL++Q+FD  ++Q+L+ LV+S+                              
Subjt:  MAKQA-SSVFLEEWLKSIGGIRTS--LYSKPTSSSAREIIQAWAELRSSLEHQSFDDHHIQSLKTLVNSQ------------------------------

Query:  -----------KSLRPSLVLVDSSVEVLSQIFSSKIELRKNPLFFSEGVLVLGAISYLLSASEKSKLCSLELLCRVLEEEYLLAGSVGGIIPEFLAGVGY
                   K+ RPS  LV  +V+ +  +   +  L+  P   ++ VLV GA + + S S   K+  LELLCR+LEEEY L GS   ++P  LAG+GY
Subjt:  -----------KSLRPSLVLVDSSVEVLSQIFSSKIELRKNPLFFSEGVLVLGAISYLLSASEKSKLCSLELLCRVLEEEYLLAGSVGGIIPEFLAGVGY

Query:  ALSSSVNAHVIRLLDSLLGIWGKVGGPTGTISSGLMILHMIEWVTSGLISLHSFEKLDVFSHATLVSSKESYASFAVVMAAAGILRAFNTYKTLLSSSER
        ALSSS++ H +RLLD L GIW K  GP GT++ GLMILH+IEWV SG +  +S  K+ +F++  L +SKE YA FAV MAAAG++RA  +     S ++ 
Subjt:  ALSSSVNAHVIRLLDSLLGIWGKVGGPTGTISSGLMILHMIEWVTSGLISLHSFEKLDVFSHATLVSSKESYASFAVVMAAAGILRAFNTYKTLLSSSER

Query:  ETISRIRISAQDCLESIARNFISTMEGSSIT-SNDHKRSVLLLCISLAIARCGPVSSRPPVLVCVVYALLTEIFPLQRLYAKIIEFSFAEMGVLGLTLVK
          IS++R SA+  +E +A+  +S   G+ +T     +   LL C ++A+ARCG VSS  P+L+C+  ALLT++FPL ++Y         E     L  V+
Subjt:  ETISRIRISAQDCLESIARNFISTMEGSSIT-SNDHKRSVLLLCISLAIARCGPVSSRPPVLVCVVYALLTEIFPLQRLYAKIIEFSFAEMGVLGLTLVK

Query:  EHLGSIPFKEAGAIVGVLCSQYALLEEEDKNFVENLVWYYCQDVYSKHRQVGLVLRDREDELLENIEKIAESAFLMFVVFALAVTKEKLDSKYTLESQID
        EHL  + FKE+GAI G  C+QY+   EE+K  VEN++W +CQ++Y +HRQ+ ++L   ED LL +IEKIAES+FLM VVFALAVTK+ L    + E ++ 
Subjt:  EHLGSIPFKEAGAIVGVLCSQYALLEEEDKNFVENLVWYYCQDVYSKHRQVGLVLRDREDELLENIEKIAESAFLMFVVFALAVTKEKLDSKYTLESQID

Query:  VSVRILILFSCMEYFRRIRLPEYMDTIRGVVASIQGNESACVSFIESMPTYQDQTNGPDNSIGQKIKYSWIKDEVQTARMLFYVRVIPTCIERVRTQVYG
         SV+IL+ FSC+EYFR IRLPEYM+TIR V++ +Q N++ CVSF+ES+P Y   TN P +   Q+IKY W +D+VQT+R+LFY+RVIPTCI R+    + 
Subjt:  VSVRILILFSCMEYFRRIRLPEYMDTIRGVVASIQGNESACVSFIESMPTYQDQTNGPDNSIGQKIKYSWIKDEVQTARMLFYVRVIPTCIERVRTQVYG

Query:  KVVAPTMFLY
         VVA TMFLY
Subjt:  KVVAPTMFLY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAAGCAGGCAAGTTCTGTTTTCCTCGAAGAATGGTTGAAGAGCATCGGCGGTATAAGAACTTCTCTTTACTCTAAACCCACTTCCTCTTCTGCTCGAGAAATTAT
CCAAGCATGGGCTGAGCTTAGAAGCTCTTTGGAGCATCAATCGTTTGATGATCACCACATTCAATCACTCAAAACTCTCGTTAACTCTCAAAAATCTCTCCGGCCTTCTT
TAGTTCTTGTCGATTCATCCGTTGAGGTTCTCTCTCAGATTTTTTCTTCCAAAATTGAATTGAGGAAGAACCCATTGTTTTTCTCCGAAGGAGTTTTAGTTTTGGGTGCA
ATTTCGTATCTGCTTTCAGCTTCAGAAAAATCAAAATTATGCTCTTTGGAGTTGCTTTGCAGGGTTTTGGAAGAAGAATACCTACTTGCTGGATCAGTGGGAGGGATAAT
TCCAGAATTTCTTGCTGGGGTTGGTTATGCTTTATCTTCATCAGTGAATGCTCATGTTATTAGACTGTTAGATTCTTTGTTAGGAATTTGGGGTAAGGTAGGTGGCCCTA
CTGGTACAATTTCTAGTGGGTTAATGATTCTGCACATGATTGAATGGGTGACCTCTGGTTTGATTAGTCTTCATTCTTTTGAGAAATTAGATGTTTTTAGCCATGCTACT
TTAGTGTCTTCAAAGGAAAGCTATGCTTCATTTGCTGTTGTAATGGCTGCAGCTGGAATATTGAGGGCTTTTAATACTTACAAAACCTTGTTGAGTAGTTCAGAAAGAGA
AACAATATCTAGAATAAGGATTTCAGCCCAGGATTGCTTAGAATCTATAGCCAGGAATTTTATTTCTACTATGGAAGGGTCTTCAATCACAAGCAATGACCATAAAAGGA
GTGTGCTTCTATTGTGTATTTCATTGGCAATAGCACGCTGTGGCCCGGTGTCATCTCGCCCACCTGTCCTCGTTTGCGTTGTTTATGCTTTGTTGACTGAAATATTTCCT
TTGCAGCGTTTATATGCCAAGATTATTGAATTCTCTTTTGCTGAGATGGGTGTTTTGGGGCTTACTCTAGTGAAAGAGCATCTGGGTAGTATTCCTTTTAAGGAAGCAGG
GGCCATCGTCGGTGTTCTTTGCAGTCAGTATGCTTTACTTGAGGAAGAGGACAAAAATTTTGTAGAGAATCTTGTATGGTATTACTGTCAAGATGTCTACTCAAAGCACA
GACAAGTTGGTTTGGTGCTTCGTGACAGAGAGGATGAATTACTAGAGAATATAGAGAAAATTGCAGAGTCTGCTTTTCTCATGTTTGTAGTTTTTGCATTAGCTGTCACA
AAAGAAAAGTTAGATTCCAAATATACACTGGAAAGTCAGATTGATGTTTCTGTAAGAATACTTATTTTATTCTCTTGTATGGAATACTTTAGGCGTATTCGCTTGCCAGA
ATATATGGATACTATCCGAGGGGTTGTTGCAAGCATTCAGGGGAATGAGTCTGCTTGTGTATCTTTCATTGAATCAATGCCTACATACCAAGATCAAACAAATGGGCCTG
ATAACTCTATTGGGCAGAAAATAAAATATTCATGGATCAAGGACGAAGTGCAAACTGCCCGTATGTTGTTTTATGTACGAGTCATTCCAACTTGCATTGAGCGTGTTCGT
ACCCAAGTGTATGGGAAGGTGGTAGCCCCAACAATGTTTTTGTATCCTGTTTTAAGGAATTCGTATGTGCAAAGTCTGCTTTGTTTCTTTTTTCTTCTTCTTCTTCATTT
TTTTCCCTCGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCAAAGCAGGCAAGTTCTGTTTTCCTCGAAGAATGGTTGAAGAGCATCGGCGGTATAAGAACTTCTCTTTACTCTAAACCCACTTCCTCTTCTGCTCGAGAAATTAT
CCAAGCATGGGCTGAGCTTAGAAGCTCTTTGGAGCATCAATCGTTTGATGATCACCACATTCAATCACTCAAAACTCTCGTTAACTCTCAAAAATCTCTCCGGCCTTCTT
TAGTTCTTGTCGATTCATCCGTTGAGGTTCTCTCTCAGATTTTTTCTTCCAAAATTGAATTGAGGAAGAACCCATTGTTTTTCTCCGAAGGAGTTTTAGTTTTGGGTGCA
ATTTCGTATCTGCTTTCAGCTTCAGAAAAATCAAAATTATGCTCTTTGGAGTTGCTTTGCAGGGTTTTGGAAGAAGAATACCTACTTGCTGGATCAGTGGGAGGGATAAT
TCCAGAATTTCTTGCTGGGGTTGGTTATGCTTTATCTTCATCAGTGAATGCTCATGTTATTAGACTGTTAGATTCTTTGTTAGGAATTTGGGGTAAGGTAGGTGGCCCTA
CTGGTACAATTTCTAGTGGGTTAATGATTCTGCACATGATTGAATGGGTGACCTCTGGTTTGATTAGTCTTCATTCTTTTGAGAAATTAGATGTTTTTAGCCATGCTACT
TTAGTGTCTTCAAAGGAAAGCTATGCTTCATTTGCTGTTGTAATGGCTGCAGCTGGAATATTGAGGGCTTTTAATACTTACAAAACCTTGTTGAGTAGTTCAGAAAGAGA
AACAATATCTAGAATAAGGATTTCAGCCCAGGATTGCTTAGAATCTATAGCCAGGAATTTTATTTCTACTATGGAAGGGTCTTCAATCACAAGCAATGACCATAAAAGGA
GTGTGCTTCTATTGTGTATTTCATTGGCAATAGCACGCTGTGGCCCGGTGTCATCTCGCCCACCTGTCCTCGTTTGCGTTGTTTATGCTTTGTTGACTGAAATATTTCCT
TTGCAGCGTTTATATGCCAAGATTATTGAATTCTCTTTTGCTGAGATGGGTGTTTTGGGGCTTACTCTAGTGAAAGAGCATCTGGGTAGTATTCCTTTTAAGGAAGCAGG
GGCCATCGTCGGTGTTCTTTGCAGTCAGTATGCTTTACTTGAGGAAGAGGACAAAAATTTTGTAGAGAATCTTGTATGGTATTACTGTCAAGATGTCTACTCAAAGCACA
GACAAGTTGGTTTGGTGCTTCGTGACAGAGAGGATGAATTACTAGAGAATATAGAGAAAATTGCAGAGTCTGCTTTTCTCATGTTTGTAGTTTTTGCATTAGCTGTCACA
AAAGAAAAGTTAGATTCCAAATATACACTGGAAAGTCAGATTGATGTTTCTGTAAGAATACTTATTTTATTCTCTTGTATGGAATACTTTAGGCGTATTCGCTTGCCAGA
ATATATGGATACTATCCGAGGGGTTGTTGCAAGCATTCAGGGGAATGAGTCTGCTTGTGTATCTTTCATTGAATCAATGCCTACATACCAAGATCAAACAAATGGGCCTG
ATAACTCTATTGGGCAGAAAATAAAATATTCATGGATCAAGGACGAAGTGCAAACTGCCCGTATGTTGTTTTATGTACGAGTCATTCCAACTTGCATTGAGCGTGTTCGT
ACCCAAGTGTATGGGAAGGTGGTAGCCCCAACAATGTTTTTGTATCCTGTTTTAAGGAATTCGTATGTGCAAAGTCTGCTTTGTTTCTTTTTTCTTCTTCTTCTTCATTT
TTTTCCCTCGTGA
Protein sequenceShow/hide protein sequence
MAKQASSVFLEEWLKSIGGIRTSLYSKPTSSSAREIIQAWAELRSSLEHQSFDDHHIQSLKTLVNSQKSLRPSLVLVDSSVEVLSQIFSSKIELRKNPLFFSEGVLVLGA
ISYLLSASEKSKLCSLELLCRVLEEEYLLAGSVGGIIPEFLAGVGYALSSSVNAHVIRLLDSLLGIWGKVGGPTGTISSGLMILHMIEWVTSGLISLHSFEKLDVFSHAT
LVSSKESYASFAVVMAAAGILRAFNTYKTLLSSSERETISRIRISAQDCLESIARNFISTMEGSSITSNDHKRSVLLLCISLAIARCGPVSSRPPVLVCVVYALLTEIFP
LQRLYAKIIEFSFAEMGVLGLTLVKEHLGSIPFKEAGAIVGVLCSQYALLEEEDKNFVENLVWYYCQDVYSKHRQVGLVLRDREDELLENIEKIAESAFLMFVVFALAVT
KEKLDSKYTLESQIDVSVRILILFSCMEYFRRIRLPEYMDTIRGVVASIQGNESACVSFIESMPTYQDQTNGPDNSIGQKIKYSWIKDEVQTARMLFYVRVIPTCIERVR
TQVYGKVVAPTMFLYPVLRNSYVQSLLCFFFLLLLHFFPS