; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0033002 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0033002
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionglycine-rich protein
Genome locationchr11:39817960..39825114
RNA-Seq ExpressionLag0033002
SyntenyLag0033002
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6571753.1 hypothetical protein SDJN03_28481, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0086.65Show/hide
Query:  GTNGRGTGVGELANERTRKRW-------ERSPV------IGSSGG--GLAGEWIDGGRLAG-----GIRVLTARECPVGTYKDVDGSDANLCIPCSLDLL
        G NG   G G     R    W       E +PV      I SSGG     G +   G + G     G+      ECPVGTYKDV+GSD NLC+PCSLDLL
Subjt:  GTNGRGTGVGELANERTRKRW-------ERSPV------IGSSGG--GLAGEWIDGGRLAG-----GIRVLTARECPVGTYKDVDGSDANLCIPCSLDLL

Query:  PNRANFIHSRGGVDRPFCPYKCISDKYKMPNCFTPLEELMYTFGGPWPFAVILSCFLVLLALLLSTLRVKFVGYGSYRDADSIEPHSHHHFPHLLSLSEV
        PNRANFI++RGGV +PFCPYKCISDKY+MPNCFTPLEELMYTFGGPWPF+VILSCFLVLLALLLSTLRVKFVGYGSYRD+DSIEPH+H HFPHLLSLSEV
Subjt:  PNRANFIHSRGGVDRPFCPYKCISDKYKMPNCFTPLEELMYTFGGPWPFAVILSCFLVLLALLLSTLRVKFVGYGSYRDADSIEPHSHHHFPHLLSLSEV

Query:  RGTRAEETQSHVYRMYFMGPNTFREPWHLPYSPPNAIIEIVYEDAFNRFIDEINSVAAYHWWEGSVHSILSILAYPCAWTWKQWRRRHKIHRLQDYVKSE
        RGTRAEETQSHVYRMYFMGPNTFREPWHLPYSPPNAIIEIVYEDAFNRFIDEINSVAAYHWWEGSVHSILSILAYPCAWTWKQWRRRHKIH LQDYVKSE
Subjt:  RGTRAEETQSHVYRMYFMGPNTFREPWHLPYSPPNAIIEIVYEDAFNRFIDEINSVAAYHWWEGSVHSILSILAYPCAWTWKQWRRRHKIHRLQDYVKSE

Query:  YDHSCLRSCRSRALYKGMKVGATPDLMVAYIDFFLGGDEKRLDIVSIIEKRYPMCIIFGGDGSYMTPYNLHSDALLTNLIGQHVPATVWNRLVAGLNAQL
        YDHSCLRSCRSRALYKGMKVGATPDLMVAYIDFFLGGDEKR+DIVSIIEKRYPMCIIFGGDGSYMTPYNLHSDALLTNLIGQHVPATVWNRLVAG+NAQL
Subjt:  YDHSCLRSCRSRALYKGMKVGATPDLMVAYIDFFLGGDEKRLDIVSIIEKRYPMCIIFGGDGSYMTPYNLHSDALLTNLIGQHVPATVWNRLVAGLNAQL

Query:  RIVRNRSIRSSLIPVIDWINSHANPQLEFHGVKIEVGWFQATASGYYQLGVLVVAFGDYSSHQLERSDVLLQHTNEPSRKDAASATESLMQFDQSWQALS
        RIVRNRSIRSSL+ V+DWINSHANPQLEFHGVKIEVGWFQATASGYYQLGVLVVA GDYSSHQ   SDVL QHT+EPSRK+AASATE  MQFDQ+WQ+LS
Subjt:  RIVRNRSIRSSLIPVIDWINSHANPQLEFHGVKIEVGWFQATASGYYQLGVLVVAFGDYSSHQLERSDVLLQHTNEPSRKDAASATESLMQFDQSWQALS

Query:  LKRITGVVNGGLINKANVGFLHYKWDFLYPLSLLLRNTKPIGHLDTVQLLITMILLADISITLLMLLQFYWISLVAFLLVLLVLPLSLLSPFPAGLNALF
        LKRITGVVNGGLINKANVGFLHYKWDFLYP SLLLRNTKP+GHLDTVQLLIT++LLADISITLLMLLQFYWISL+AFLL+LLVLPLSLLSPFPAGLNALF
Subjt:  LKRITGVVNGGLINKANVGFLHYKWDFLYPLSLLLRNTKPIGHLDTVQLLITMILLADISITLLMLLQFYWISLVAFLLVLLVLPLSLLSPFPAGLNALF

Query:  SKEPRRASLARIYALWNATSFANIGVAFIYGIFLYGFSTFQPPDETNTWSNRRETDKWWIMPVILLLFKSTQAQFVDWHIANLEIKDISLFCPDPDAFWA
        SKEPRRASLARIYALWNATSF NIGVAFIYGIFLYGFSTFQPP ETNTWSNRRE+DKWWIMPVILLLFKSTQAQ VDWHIANLEIKDISLFCPDPDAFWA
Subjt:  SKEPRRASLARIYALWNATSFANIGVAFIYGIFLYGFSTFQPPDETNTWSNRRETDKWWIMPVILLLFKSTQAQFVDWHIANLEIKDISLFCPDPDAFWA

Query:  AEFA
        AEFA
Subjt:  AEFA

XP_022157179.1 uncharacterized protein LOC111023954 isoform X1 [Momordica charantia]0.0e+0086.79Show/hide
Query:  GTNGRGTGVGELANERTRKRW-------ERSPV--IGSSGGGLAGEWIDGGRLAG-----------GIRVLTARECPVGTYKDVDGSDANLCIPCSLDLL
        G NG   G G     R    W       E  PV  IG S     G    GG   G           G+      ECPVGTYKDVDGSDANLCIPCSLDLL
Subjt:  GTNGRGTGVGELANERTRKRW-------ERSPV--IGSSGGGLAGEWIDGGRLAG-----------GIRVLTARECPVGTYKDVDGSDANLCIPCSLDLL

Query:  PNRANFIHSRGGVDRPFCPYKCISDKYKMPNCFTPLEELMYTFGGPWPFAVILSCFLVLLALLLSTLRVKFVGYGSYRDADSIEPHSHHHFPHLLSLSEV
        PNRANFI+SRGGVDRPFCPY+CISDKY+MPNCFTPLEELMYTFGGPWPF+VILSCFLVLLALLLSTLRVKFVGYGSYRDADSIEPHSH HFPHLLSLSEV
Subjt:  PNRANFIHSRGGVDRPFCPYKCISDKYKMPNCFTPLEELMYTFGGPWPFAVILSCFLVLLALLLSTLRVKFVGYGSYRDADSIEPHSHHHFPHLLSLSEV

Query:  RGTRAEETQSHVYRMYFMGPNTFREPWHLPYSPPNAIIEIVYEDAFNRFIDEINSVAAYHWWEGSVHSILSILAYPCAWTWKQWRRRHKIHRLQDYVKSE
        RGTRAEETQSHV+RMYFMGPNTFREPWHLPYSPPNAIIEIVYEDAFNRFIDEINSVAAYHWWEGSVH ILSILAYPCAW+WKQWRRRHKIH LQDYVKSE
Subjt:  RGTRAEETQSHVYRMYFMGPNTFREPWHLPYSPPNAIIEIVYEDAFNRFIDEINSVAAYHWWEGSVHSILSILAYPCAWTWKQWRRRHKIHRLQDYVKSE

Query:  YDHSCLRSCRSRALYKGMKVGATPDLMVAYIDFFLGGDEKRLDIVSIIEKRYPMCIIFGGDGSYMTPYNLHSDALLTNLIGQHVPATVWNRLVAGLNAQL
        YDHSCLRSCRSRALYKGMKVGATPDLMVAYIDFFLGGDEKRLDIVSIIEKRYPMCIIFGGDGSYMTPYNLHSDALLTNLIGQHVPATVWNRLVAGLNAQL
Subjt:  YDHSCLRSCRSRALYKGMKVGATPDLMVAYIDFFLGGDEKRLDIVSIIEKRYPMCIIFGGDGSYMTPYNLHSDALLTNLIGQHVPATVWNRLVAGLNAQL

Query:  RIVRNRSIRSSLIPVIDWINSHANPQLEFHGVKIEVGWFQATASGYYQLGVLVVAFGDYSSHQLERSDVLLQHTNEPSRKDAASATESLMQFDQSWQALS
        R+VR+RSIRSSLIPVIDW+NSHANPQLEFHGVKIEVGWFQATASGYYQLGVLVV FGDYSS Q ERSDVL QHT E  R+DA SATESLMQF+QSWQALS
Subjt:  RIVRNRSIRSSLIPVIDWINSHANPQLEFHGVKIEVGWFQATASGYYQLGVLVVAFGDYSSHQLERSDVLLQHTNEPSRKDAASATESLMQFDQSWQALS

Query:  LKRITGVVNGGLINKANVGFLHYKWDFLYPLSLLLRNTKPIGHLDTVQLLITMILLADISITLLMLLQFYWISLVAFLLVLLVLPLSLLSPFPAGLNALF
        LKRITGVVNGGLINKANVGFLHYKWDFLYPLSLLLRNTKPIGHLDTVQLLIT +LLADISITLL+LLQFYWISLV FLLVLLVLPLSLLSPFPAGLNALF
Subjt:  LKRITGVVNGGLINKANVGFLHYKWDFLYPLSLLLRNTKPIGHLDTVQLLITMILLADISITLLMLLQFYWISLVAFLLVLLVLPLSLLSPFPAGLNALF

Query:  SKEPRRASLARIYALWNATSFANIGVAFIYGIFLYGFSTFQPPDETNTWSNRRETDKWWIMPVILLLFKSTQAQFVDWHIANLEIKDISLFCPDPDAFWA
        SKEPRRASLARIYALWNATSF NIGVAF++GI  YG STFQPPDE NTWSNRRETDKWWI+PVILLLFKS+QAQFV+WHIANLEIKDISLFCPDPD FWA
Subjt:  SKEPRRASLARIYALWNATSFANIGVAFIYGIFLYGFSTFQPPDETNTWSNRRETDKWWIMPVILLLFKSTQAQFVDWHIANLEIKDISLFCPDPDAFWA

Query:  AEFA
        AEFA
Subjt:  AEFA

XP_022958188.1 uncharacterized protein LOC111459491 [Cucurbita moschata]0.0e+0086.36Show/hide
Query:  GTNGRGTGVGELANERTRKRW-------ERSPVIGSSGG-GLAGEWIDGGRLAGGIRVLTAR------------ECPVGTYKDVDGSDANLCIPCSLDLL
        G NG   G G     R    W       E +PV   SG   ++G   + G   GG   +T +            ECPVGTYKDV+GSDANLC+PCSLDLL
Subjt:  GTNGRGTGVGELANERTRKRW-------ERSPVIGSSGG-GLAGEWIDGGRLAGGIRVLTAR------------ECPVGTYKDVDGSDANLCIPCSLDLL

Query:  PNRANFIHSRGGVDRPFCPYKCISDKYKMPNCFTPLEELMYTFGGPWPFAVILSCFLVLLALLLSTLRVKFVGYGSYRDADSIEPHSHHHFPHLLSLSEV
        PNRANFI++RGGVD PFCPYKCISDKY+MPNCFTPLEELMYTFGGP PF+VILSC L+LLALLLSTLRVKFVGYGSYRDADSIEPH HHHFPHLLSLSEV
Subjt:  PNRANFIHSRGGVDRPFCPYKCISDKYKMPNCFTPLEELMYTFGGPWPFAVILSCFLVLLALLLSTLRVKFVGYGSYRDADSIEPHSHHHFPHLLSLSEV

Query:  RGTRAEETQSHVYRMYFMGPNTFREPWHLPYSPPNAIIEIVYEDAFNRFIDEINSVAAYHWWEGSVHSILSILAYPCAWTWKQWRRRHKIHRLQDYVKSE
        RGTRA+E QSHVYRMYFMGPNTFREPWHLPYSPPNA+IEIVYEDAFNRFID+INSVAAY WWEGSVHSILSILAYPCAWTWKQWRRRHKIHRLQDYVKSE
Subjt:  RGTRAEETQSHVYRMYFMGPNTFREPWHLPYSPPNAIIEIVYEDAFNRFIDEINSVAAYHWWEGSVHSILSILAYPCAWTWKQWRRRHKIHRLQDYVKSE

Query:  YDHSCLRSCRSRALYKGMKVGATPDLMVAYIDFFLGGDEKRLDIVSIIEKRYPMCIIFGGDGSYMTPYNLHSDALLTNLIGQHVPATVWNRLVAGLNAQL
        YDHSCLRSCRSRALYKGMKVGATPDLMVA+IDFFLGGDEKRLDIVSIIEKRYPMCIIFGGDGSYMTPYNLHSDALLTNLIGQHVPATVWNRLVAGLNAQL
Subjt:  YDHSCLRSCRSRALYKGMKVGATPDLMVAYIDFFLGGDEKRLDIVSIIEKRYPMCIIFGGDGSYMTPYNLHSDALLTNLIGQHVPATVWNRLVAGLNAQL

Query:  RIVRNRSIRSSLIPVIDWINSHANPQLEFHGVKIEVGWFQATASGYYQLGVLVVAFGDYSSHQLERSDVLLQHTNEPSRKDAASATESLMQFDQSWQALS
        RIVRNRSIRSSLIPV+DWINSHANPQLEFHGVKIEVGWFQ TASGYYQLGVLVVA GDYS HQLERSD+L QHTNEPSRKDAASATESLMQF+QSWQALS
Subjt:  RIVRNRSIRSSLIPVIDWINSHANPQLEFHGVKIEVGWFQATASGYYQLGVLVVAFGDYSSHQLERSDVLLQHTNEPSRKDAASATESLMQFDQSWQALS

Query:  LKRITGVVNGGLINKANVGFLHYKWDFLYPLSLLLRNTKPIGHLDTVQLLITMILLADISITLLMLLQFYWISLVAFLLVLLVLPLSLLSPFPAGLNALF
        LKRITGVVNGGLINKANVGFLHYKWDFLYP SLLLRNTKPIGHLDTVQLLIT ++LADIS+TLLMLLQFYWISLVAFLLVLLVLPLSLLSPFPAGLNALF
Subjt:  LKRITGVVNGGLINKANVGFLHYKWDFLYPLSLLLRNTKPIGHLDTVQLLITMILLADISITLLMLLQFYWISLVAFLLVLLVLPLSLLSPFPAGLNALF

Query:  SKEPRRASLARIYALWNATSFANIGVAFIYGIFLYGFSTFQPPDETNTWSNRRETDKWWIMPVILLLFKSTQAQFVDWHIANLEIKDISLFCPDPDAFWA
        SKEPR+ASL+RIYALWNATSFANIGVA IYG FLYGFSTFQPPDETNTWSNRRE DKWWI+PVILLLFKS QA+ VD HIANLEIKDISLFCPDPDAFWA
Subjt:  SKEPRRASLARIYALWNATSFANIGVAFIYGIFLYGFSTFQPPDETNTWSNRRETDKWWIMPVILLLFKSTQAQFVDWHIANLEIKDISLFCPDPDAFWA

Query:  AEFA
        AEFA
Subjt:  AEFA

XP_022995674.1 uncharacterized protein LOC111491138 [Cucurbita maxima]0.0e+0086.36Show/hide
Query:  GTNGRGTGVGELANERTRKRW-------ERSPVIGSSGG-GLAGEWIDGGRLAGGIRVLTAR------------ECPVGTYKDVDGSDANLCIPCSLDLL
        G NG   G G     R    W       E +PV   SG   ++G   + G   GG   +T +            ECPVGTYKD++GSDANLC+PCSLDLL
Subjt:  GTNGRGTGVGELANERTRKRW-------ERSPVIGSSGG-GLAGEWIDGGRLAGGIRVLTAR------------ECPVGTYKDVDGSDANLCIPCSLDLL

Query:  PNRANFIHSRGGVDRPFCPYKCISDKYKMPNCFTPLEELMYTFGGPWPFAVILSCFLVLLALLLSTLRVKFVGYGSYRDADSIEPHSHHHFPHLLSLSEV
        PNRANFI++RGGVD PFCPYKCISDKY+MPNCFTPLEELMYTFGGP PF+VILSC L+LLALLLSTLRVKFVGYGSYRDADSIEPH HHHFPHLLSLSEV
Subjt:  PNRANFIHSRGGVDRPFCPYKCISDKYKMPNCFTPLEELMYTFGGPWPFAVILSCFLVLLALLLSTLRVKFVGYGSYRDADSIEPHSHHHFPHLLSLSEV

Query:  RGTRAEETQSHVYRMYFMGPNTFREPWHLPYSPPNAIIEIVYEDAFNRFIDEINSVAAYHWWEGSVHSILSILAYPCAWTWKQWRRRHKIHRLQDYVKSE
        RGTRA+ETQSHVYRMYFMGPNTFREPWHLPYSPPNA+IEIVYEDAFNRFIDEINSVAAYHWWEGSVHSILSILAYPCAWTWKQWRRRHKIHRLQDYVKSE
Subjt:  RGTRAEETQSHVYRMYFMGPNTFREPWHLPYSPPNAIIEIVYEDAFNRFIDEINSVAAYHWWEGSVHSILSILAYPCAWTWKQWRRRHKIHRLQDYVKSE

Query:  YDHSCLRSCRSRALYKGMKVGATPDLMVAYIDFFLGGDEKRLDIVSIIEKRYPMCIIFGGDGSYMTPYNLHSDALLTNLIGQHVPATVWNRLVAGLNAQL
        YDHSCLRSCRSRALYKGMKVGATPDLMVA+IDFFLGGDEKRLDIVSIIEKRYPMCIIFGGDGSYMTPYN+HSDALLTNLIGQHVPATVWNRLVAGLNAQL
Subjt:  YDHSCLRSCRSRALYKGMKVGATPDLMVAYIDFFLGGDEKRLDIVSIIEKRYPMCIIFGGDGSYMTPYNLHSDALLTNLIGQHVPATVWNRLVAGLNAQL

Query:  RIVRNRSIRSSLIPVIDWINSHANPQLEFHGVKIEVGWFQATASGYYQLGVLVVAFGDYSSHQLERSDVLLQHTNEPSRKDAASATESLMQFDQSWQALS
        RIVRNRSIRSSLIPV+DWINSHANPQLEFHGVKIEVGWFQ TASGYYQLGVLVVA GDYS HQLERSD+L QHTNEPSRKDAASATES MQF+QSWQALS
Subjt:  RIVRNRSIRSSLIPVIDWINSHANPQLEFHGVKIEVGWFQATASGYYQLGVLVVAFGDYSSHQLERSDVLLQHTNEPSRKDAASATESLMQFDQSWQALS

Query:  LKRITGVVNGGLINKANVGFLHYKWDFLYPLSLLLRNTKPIGHLDTVQLLITMILLADISITLLMLLQFYWISLVAFLLVLLVLPLSLLSPFPAGLNALF
        LKRITGVVNGGLINKANVGFLHYKWDFLYP SLLLRNTKPIGHLDTVQLLIT +LLADIS+TLLMLLQFYWISLVAFLLVLLVLPLSLLSPFPAGL+ALF
Subjt:  LKRITGVVNGGLINKANVGFLHYKWDFLYPLSLLLRNTKPIGHLDTVQLLITMILLADISITLLMLLQFYWISLVAFLLVLLVLPLSLLSPFPAGLNALF

Query:  SKEPRRASLARIYALWNATSFANIGVAFIYGIFLYGFSTFQPPDETNTWSNRRETDKWWIMPVILLLFKSTQAQFVDWHIANLEIKDISLFCPDPDAFWA
        SKEPR+ASL+RIYALWNATSFANIGVA IYG FLYGFSTFQPPDETNTWSNRRE DKWWI+PVILLLFKS QA+ VD HIANLEIKDISLFCPDPDAFWA
Subjt:  SKEPRRASLARIYALWNATSFANIGVAFIYGIFLYGFSTFQPPDETNTWSNRRETDKWWIMPVILLLFKSTQAQFVDWHIANLEIKDISLFCPDPDAFWA

Query:  AEFA
        AEFA
Subjt:  AEFA

XP_038888230.1 uncharacterized protein LOC120078090 isoform X1 [Benincasa hispida]0.0e+0086.79Show/hide
Query:  GTNGRGTGVGELANERTRKRWER-------SPV------IGSSGG--GLAGEWIDGGRLAG-----GIRVLTARECPVGTYKDVDGSDANLCIPCSLDLL
        G NG   G G     R    W         +PV      I SSGG     G +   G + G     G+      ECPVGTYKDV+GSDANLC PCSLDLL
Subjt:  GTNGRGTGVGELANERTRKRWER-------SPV------IGSSGG--GLAGEWIDGGRLAG-----GIRVLTARECPVGTYKDVDGSDANLCIPCSLDLL

Query:  PNRANFIHSRGGVDRPFCPYKCISDKYKMPNCFTPLEELMYTFGGPWPFAVILSCFLVLLALLLSTLRVKFVGYGSYRDADSIEPHSHHHFPHLLSLSEV
        PNRANFI++RGGVD+PFCPYKCIS+KY+MPNCFTPLEELMYTFGGPWPF+VILSCFLVLLALLLSTLRVKFVGYGSYRDAD+IEPHSH HFPHLLSLSEV
Subjt:  PNRANFIHSRGGVDRPFCPYKCISDKYKMPNCFTPLEELMYTFGGPWPFAVILSCFLVLLALLLSTLRVKFVGYGSYRDADSIEPHSHHHFPHLLSLSEV

Query:  RGTRAEETQSHVYRMYFMGPNTFREPWHLPYSPPNAIIEIVYEDAFNRFIDEINSVAAYHWWEGSVHSILSILAYPCAWTWKQWRRRHKIHRLQDYVKSE
        RGTRAEETQSHVYRMYFMGPNTFREPWHLPYSPPNAIIEIVYEDAFNRFIDEINSVAAYHWWEGSVHSILSILAYPCAWTWKQWRRRHKIHRLQDYVKSE
Subjt:  RGTRAEETQSHVYRMYFMGPNTFREPWHLPYSPPNAIIEIVYEDAFNRFIDEINSVAAYHWWEGSVHSILSILAYPCAWTWKQWRRRHKIHRLQDYVKSE

Query:  YDHSCLRSCRSRALYKGMKVGATPDLMVAYIDFFLGGDEKRLDIVSIIEKRYPMCIIFGGDGSYMTPYNLHSDALLTNLIGQHVPATVWNRLVAGLNAQL
        YDHSCLRSCRSRALYKGMKVGATPDLMVAYIDFFLGGDEKRLDIVSIIEKRYPMCIIFGGDGSYMTPYNLHSDALLTNLIGQHVPATVWNRLVAG+NAQL
Subjt:  YDHSCLRSCRSRALYKGMKVGATPDLMVAYIDFFLGGDEKRLDIVSIIEKRYPMCIIFGGDGSYMTPYNLHSDALLTNLIGQHVPATVWNRLVAGLNAQL

Query:  RIVRNRSIRSSLIPVIDWINSHANPQLEFHGVKIEVGWFQATASGYYQLGVLVVAFGDYSSHQLERSDVLLQHTNEPSRKDAASATESLMQFDQSWQALS
        RIVRNRSI SSLIPVIDWINSHANPQLEFHGVKIEVGWFQATASGYYQLGVLVVAFGDYSSHQLERSD+L QHTNEPSR+DAAS+TESL QFDQ+WQ+LS
Subjt:  RIVRNRSIRSSLIPVIDWINSHANPQLEFHGVKIEVGWFQATASGYYQLGVLVVAFGDYSSHQLERSDVLLQHTNEPSRKDAASATESLMQFDQSWQALS

Query:  LKRITGVVNGGLINKANVGFLHYKWDFLYPLSLLLRNTKPIGHLDTVQLLITMILLADISITLLMLLQFYWISLVAFLLVLLVLPLSLLSPFPAGLNALF
        LK+ITGVVNGGLINKANVG LHY+WDFLYP SLLLRN KPIGHLDTVQLLIT++LLADISITLLMLLQFYWISLVAFLLVLLVLPLSLL PFPAGLNALF
Subjt:  LKRITGVVNGGLINKANVGFLHYKWDFLYPLSLLLRNTKPIGHLDTVQLLITMILLADISITLLMLLQFYWISLVAFLLVLLVLPLSLLSPFPAGLNALF

Query:  SKEPRRASLARIYALWNATSFANIGVAFIYGIFLYGFSTFQPPDETNTWSNRRETDKWWIMPVILLLFKSTQAQFVDWHIANLEIKDISLFCPDPDAFWA
        SKEPRRASLARIYALWNATS ANIGVAFIY +FLY FSTFQPP+ETNTWSNRRET+KWWIMPVILL+FKS QAQ VDWHIANLE+KDISLFCPDPDAFWA
Subjt:  SKEPRRASLARIYALWNATSFANIGVAFIYGIFLYGFSTFQPPDETNTWSNRRETDKWWIMPVILLLFKSTQAQFVDWHIANLEIKDISLFCPDPDAFWA

Query:  AEFA
        AEFA
Subjt:  AEFA

TrEMBL top hitse value%identityAlignment
A0A6J1DVS2 uncharacterized protein LOC111023954 isoform X10.0e+0086.79Show/hide
Query:  GTNGRGTGVGELANERTRKRW-------ERSPV--IGSSGGGLAGEWIDGGRLAG-----------GIRVLTARECPVGTYKDVDGSDANLCIPCSLDLL
        G NG   G G     R    W       E  PV  IG S     G    GG   G           G+      ECPVGTYKDVDGSDANLCIPCSLDLL
Subjt:  GTNGRGTGVGELANERTRKRW-------ERSPV--IGSSGGGLAGEWIDGGRLAG-----------GIRVLTARECPVGTYKDVDGSDANLCIPCSLDLL

Query:  PNRANFIHSRGGVDRPFCPYKCISDKYKMPNCFTPLEELMYTFGGPWPFAVILSCFLVLLALLLSTLRVKFVGYGSYRDADSIEPHSHHHFPHLLSLSEV
        PNRANFI+SRGGVDRPFCPY+CISDKY+MPNCFTPLEELMYTFGGPWPF+VILSCFLVLLALLLSTLRVKFVGYGSYRDADSIEPHSH HFPHLLSLSEV
Subjt:  PNRANFIHSRGGVDRPFCPYKCISDKYKMPNCFTPLEELMYTFGGPWPFAVILSCFLVLLALLLSTLRVKFVGYGSYRDADSIEPHSHHHFPHLLSLSEV

Query:  RGTRAEETQSHVYRMYFMGPNTFREPWHLPYSPPNAIIEIVYEDAFNRFIDEINSVAAYHWWEGSVHSILSILAYPCAWTWKQWRRRHKIHRLQDYVKSE
        RGTRAEETQSHV+RMYFMGPNTFREPWHLPYSPPNAIIEIVYEDAFNRFIDEINSVAAYHWWEGSVH ILSILAYPCAW+WKQWRRRHKIH LQDYVKSE
Subjt:  RGTRAEETQSHVYRMYFMGPNTFREPWHLPYSPPNAIIEIVYEDAFNRFIDEINSVAAYHWWEGSVHSILSILAYPCAWTWKQWRRRHKIHRLQDYVKSE

Query:  YDHSCLRSCRSRALYKGMKVGATPDLMVAYIDFFLGGDEKRLDIVSIIEKRYPMCIIFGGDGSYMTPYNLHSDALLTNLIGQHVPATVWNRLVAGLNAQL
        YDHSCLRSCRSRALYKGMKVGATPDLMVAYIDFFLGGDEKRLDIVSIIEKRYPMCIIFGGDGSYMTPYNLHSDALLTNLIGQHVPATVWNRLVAGLNAQL
Subjt:  YDHSCLRSCRSRALYKGMKVGATPDLMVAYIDFFLGGDEKRLDIVSIIEKRYPMCIIFGGDGSYMTPYNLHSDALLTNLIGQHVPATVWNRLVAGLNAQL

Query:  RIVRNRSIRSSLIPVIDWINSHANPQLEFHGVKIEVGWFQATASGYYQLGVLVVAFGDYSSHQLERSDVLLQHTNEPSRKDAASATESLMQFDQSWQALS
        R+VR+RSIRSSLIPVIDW+NSHANPQLEFHGVKIEVGWFQATASGYYQLGVLVV FGDYSS Q ERSDVL QHT E  R+DA SATESLMQF+QSWQALS
Subjt:  RIVRNRSIRSSLIPVIDWINSHANPQLEFHGVKIEVGWFQATASGYYQLGVLVVAFGDYSSHQLERSDVLLQHTNEPSRKDAASATESLMQFDQSWQALS

Query:  LKRITGVVNGGLINKANVGFLHYKWDFLYPLSLLLRNTKPIGHLDTVQLLITMILLADISITLLMLLQFYWISLVAFLLVLLVLPLSLLSPFPAGLNALF
        LKRITGVVNGGLINKANVGFLHYKWDFLYPLSLLLRNTKPIGHLDTVQLLIT +LLADISITLL+LLQFYWISLV FLLVLLVLPLSLLSPFPAGLNALF
Subjt:  LKRITGVVNGGLINKANVGFLHYKWDFLYPLSLLLRNTKPIGHLDTVQLLITMILLADISITLLMLLQFYWISLVAFLLVLLVLPLSLLSPFPAGLNALF

Query:  SKEPRRASLARIYALWNATSFANIGVAFIYGIFLYGFSTFQPPDETNTWSNRRETDKWWIMPVILLLFKSTQAQFVDWHIANLEIKDISLFCPDPDAFWA
        SKEPRRASLARIYALWNATSF NIGVAF++GI  YG STFQPPDE NTWSNRRETDKWWI+PVILLLFKS+QAQFV+WHIANLEIKDISLFCPDPD FWA
Subjt:  SKEPRRASLARIYALWNATSFANIGVAFIYGIFLYGFSTFQPPDETNTWSNRRETDKWWIMPVILLLFKSTQAQFVDWHIANLEIKDISLFCPDPDAFWA

Query:  AEFA
        AEFA
Subjt:  AEFA

A0A6J1EYB2 uncharacterized protein LOC111439602 isoform X20.0e+0086.65Show/hide
Query:  GTNGRGTGVGELANERTRKRW-------ERSPV------IGSSGG--GLAGEWIDGGRLAG-----GIRVLTARECPVGTYKDVDGSDANLCIPCSLDLL
        G NG   G G     R    W       E +PV      I SSGG     G +   G + G     G+      ECPVGTYKDV+GSD NLC+PCSLDLL
Subjt:  GTNGRGTGVGELANERTRKRW-------ERSPV------IGSSGG--GLAGEWIDGGRLAG-----GIRVLTARECPVGTYKDVDGSDANLCIPCSLDLL

Query:  PNRANFIHSRGGVDRPFCPYKCISDKYKMPNCFTPLEELMYTFGGPWPFAVILSCFLVLLALLLSTLRVKFVGYGSYRDADSIEPHSHHHFPHLLSLSEV
        PNRANFI+ RGGV +PFCPYKCISDKY+MPNCFTPLEELMYTFGGPWPF+VILSCFLVLLALLLSTLRVKFVGYGSYRD+DSIEPH+H HFPHLLSLSEV
Subjt:  PNRANFIHSRGGVDRPFCPYKCISDKYKMPNCFTPLEELMYTFGGPWPFAVILSCFLVLLALLLSTLRVKFVGYGSYRDADSIEPHSHHHFPHLLSLSEV

Query:  RGTRAEETQSHVYRMYFMGPNTFREPWHLPYSPPNAIIEIVYEDAFNRFIDEINSVAAYHWWEGSVHSILSILAYPCAWTWKQWRRRHKIHRLQDYVKSE
        RGTRAEETQSHVYRMYFMGPNTFREPWHLPYSPPNAIIEIVYEDAFNRFIDEINSVAAYHWWEGSVHSILSILAYPCAWTWKQWRRRHKIH LQDYVKSE
Subjt:  RGTRAEETQSHVYRMYFMGPNTFREPWHLPYSPPNAIIEIVYEDAFNRFIDEINSVAAYHWWEGSVHSILSILAYPCAWTWKQWRRRHKIHRLQDYVKSE

Query:  YDHSCLRSCRSRALYKGMKVGATPDLMVAYIDFFLGGDEKRLDIVSIIEKRYPMCIIFGGDGSYMTPYNLHSDALLTNLIGQHVPATVWNRLVAGLNAQL
        YDHSCLRSCRSRALYKGMKVGATPDLMVAYIDFFLGGDEKR+DIVSIIEKRYPMCIIFGGDGSYMTPYNLHSDALLTNLIGQHVPATVWNRLVAG+NAQL
Subjt:  YDHSCLRSCRSRALYKGMKVGATPDLMVAYIDFFLGGDEKRLDIVSIIEKRYPMCIIFGGDGSYMTPYNLHSDALLTNLIGQHVPATVWNRLVAGLNAQL

Query:  RIVRNRSIRSSLIPVIDWINSHANPQLEFHGVKIEVGWFQATASGYYQLGVLVVAFGDYSSHQLERSDVLLQHTNEPSRKDAASATESLMQFDQSWQALS
        RIVRNRSIRSSL+ V+DWINSHANPQLEFHGVKIEVGWFQATASGYYQLGVLVVA GDYSSHQ   SDVL QHT+EPSRK+AASATE  MQFDQ+WQ+LS
Subjt:  RIVRNRSIRSSLIPVIDWINSHANPQLEFHGVKIEVGWFQATASGYYQLGVLVVAFGDYSSHQLERSDVLLQHTNEPSRKDAASATESLMQFDQSWQALS

Query:  LKRITGVVNGGLINKANVGFLHYKWDFLYPLSLLLRNTKPIGHLDTVQLLITMILLADISITLLMLLQFYWISLVAFLLVLLVLPLSLLSPFPAGLNALF
        LKRITGVVNGGLINKANVGFLHYKWDFLYP SLLLRNTKP+GHLDTVQLLIT++LLADISITLLMLLQFYWISL+AFLL+LLVLPLSLLSPFPAGLNALF
Subjt:  LKRITGVVNGGLINKANVGFLHYKWDFLYPLSLLLRNTKPIGHLDTVQLLITMILLADISITLLMLLQFYWISLVAFLLVLLVLPLSLLSPFPAGLNALF

Query:  SKEPRRASLARIYALWNATSFANIGVAFIYGIFLYGFSTFQPPDETNTWSNRRETDKWWIMPVILLLFKSTQAQFVDWHIANLEIKDISLFCPDPDAFWA
        SKEPRRASLARIYALWNATSF NIGVAFIYGIFLYGFSTFQPP ETNTWSNRRE+DKWWIMPVILLLFKSTQAQ VDWHIANLEIKDISLFCPDPDAFWA
Subjt:  SKEPRRASLARIYALWNATSFANIGVAFIYGIFLYGFSTFQPPDETNTWSNRRETDKWWIMPVILLLFKSTQAQFVDWHIANLEIKDISLFCPDPDAFWA

Query:  AEFA
        AEFA
Subjt:  AEFA

A0A6J1F3N5 uncharacterized protein LOC111439602 isoform X10.0e+0086.65Show/hide
Query:  GTNGRGTGVGELANERTRKRW-------ERSPV------IGSSGG--GLAGEWIDGGRLAG-----GIRVLTARECPVGTYKDVDGSDANLCIPCSLDLL
        G NG   G G     R    W       E +PV      I SSGG     G +   G + G     G+      ECPVGTYKDV+GSD NLC+PCSLDLL
Subjt:  GTNGRGTGVGELANERTRKRW-------ERSPV------IGSSGG--GLAGEWIDGGRLAG-----GIRVLTARECPVGTYKDVDGSDANLCIPCSLDLL

Query:  PNRANFIHSRGGVDRPFCPYKCISDKYKMPNCFTPLEELMYTFGGPWPFAVILSCFLVLLALLLSTLRVKFVGYGSYRDADSIEPHSHHHFPHLLSLSEV
        PNRANFI+ RGGV +PFCPYKCISDKY+MPNCFTPLEELMYTFGGPWPF+VILSCFLVLLALLLSTLRVKFVGYGSYRD+DSIEPH+H HFPHLLSLSEV
Subjt:  PNRANFIHSRGGVDRPFCPYKCISDKYKMPNCFTPLEELMYTFGGPWPFAVILSCFLVLLALLLSTLRVKFVGYGSYRDADSIEPHSHHHFPHLLSLSEV

Query:  RGTRAEETQSHVYRMYFMGPNTFREPWHLPYSPPNAIIEIVYEDAFNRFIDEINSVAAYHWWEGSVHSILSILAYPCAWTWKQWRRRHKIHRLQDYVKSE
        RGTRAEETQSHVYRMYFMGPNTFREPWHLPYSPPNAIIEIVYEDAFNRFIDEINSVAAYHWWEGSVHSILSILAYPCAWTWKQWRRRHKIH LQDYVKSE
Subjt:  RGTRAEETQSHVYRMYFMGPNTFREPWHLPYSPPNAIIEIVYEDAFNRFIDEINSVAAYHWWEGSVHSILSILAYPCAWTWKQWRRRHKIHRLQDYVKSE

Query:  YDHSCLRSCRSRALYKGMKVGATPDLMVAYIDFFLGGDEKRLDIVSIIEKRYPMCIIFGGDGSYMTPYNLHSDALLTNLIGQHVPATVWNRLVAGLNAQL
        YDHSCLRSCRSRALYKGMKVGATPDLMVAYIDFFLGGDEKR+DIVSIIEKRYPMCIIFGGDGSYMTPYNLHSDALLTNLIGQHVPATVWNRLVAG+NAQL
Subjt:  YDHSCLRSCRSRALYKGMKVGATPDLMVAYIDFFLGGDEKRLDIVSIIEKRYPMCIIFGGDGSYMTPYNLHSDALLTNLIGQHVPATVWNRLVAGLNAQL

Query:  RIVRNRSIRSSLIPVIDWINSHANPQLEFHGVKIEVGWFQATASGYYQLGVLVVAFGDYSSHQLERSDVLLQHTNEPSRKDAASATESLMQFDQSWQALS
        RIVRNRSIRSSL+ V+DWINSHANPQLEFHGVKIEVGWFQATASGYYQLGVLVVA GDYSSHQ   SDVL QHT+EPSRK+AASATE  MQFDQ+WQ+LS
Subjt:  RIVRNRSIRSSLIPVIDWINSHANPQLEFHGVKIEVGWFQATASGYYQLGVLVVAFGDYSSHQLERSDVLLQHTNEPSRKDAASATESLMQFDQSWQALS

Query:  LKRITGVVNGGLINKANVGFLHYKWDFLYPLSLLLRNTKPIGHLDTVQLLITMILLADISITLLMLLQFYWISLVAFLLVLLVLPLSLLSPFPAGLNALF
        LKRITGVVNGGLINKANVGFLHYKWDFLYP SLLLRNTKP+GHLDTVQLLIT++LLADISITLLMLLQFYWISL+AFLL+LLVLPLSLLSPFPAGLNALF
Subjt:  LKRITGVVNGGLINKANVGFLHYKWDFLYPLSLLLRNTKPIGHLDTVQLLITMILLADISITLLMLLQFYWISLVAFLLVLLVLPLSLLSPFPAGLNALF

Query:  SKEPRRASLARIYALWNATSFANIGVAFIYGIFLYGFSTFQPPDETNTWSNRRETDKWWIMPVILLLFKSTQAQFVDWHIANLEIKDISLFCPDPDAFWA
        SKEPRRASLARIYALWNATSF NIGVAFIYGIFLYGFSTFQPP ETNTWSNRRE+DKWWIMPVILLLFKSTQAQ VDWHIANLEIKDISLFCPDPDAFWA
Subjt:  SKEPRRASLARIYALWNATSFANIGVAFIYGIFLYGFSTFQPPDETNTWSNRRETDKWWIMPVILLLFKSTQAQFVDWHIANLEIKDISLFCPDPDAFWA

Query:  AEFA
        AEFA
Subjt:  AEFA

A0A6J1H2F9 uncharacterized protein LOC1114594910.0e+0086.36Show/hide
Query:  GTNGRGTGVGELANERTRKRW-------ERSPVIGSSGG-GLAGEWIDGGRLAGGIRVLTAR------------ECPVGTYKDVDGSDANLCIPCSLDLL
        G NG   G G     R    W       E +PV   SG   ++G   + G   GG   +T +            ECPVGTYKDV+GSDANLC+PCSLDLL
Subjt:  GTNGRGTGVGELANERTRKRW-------ERSPVIGSSGG-GLAGEWIDGGRLAGGIRVLTAR------------ECPVGTYKDVDGSDANLCIPCSLDLL

Query:  PNRANFIHSRGGVDRPFCPYKCISDKYKMPNCFTPLEELMYTFGGPWPFAVILSCFLVLLALLLSTLRVKFVGYGSYRDADSIEPHSHHHFPHLLSLSEV
        PNRANFI++RGGVD PFCPYKCISDKY+MPNCFTPLEELMYTFGGP PF+VILSC L+LLALLLSTLRVKFVGYGSYRDADSIEPH HHHFPHLLSLSEV
Subjt:  PNRANFIHSRGGVDRPFCPYKCISDKYKMPNCFTPLEELMYTFGGPWPFAVILSCFLVLLALLLSTLRVKFVGYGSYRDADSIEPHSHHHFPHLLSLSEV

Query:  RGTRAEETQSHVYRMYFMGPNTFREPWHLPYSPPNAIIEIVYEDAFNRFIDEINSVAAYHWWEGSVHSILSILAYPCAWTWKQWRRRHKIHRLQDYVKSE
        RGTRA+E QSHVYRMYFMGPNTFREPWHLPYSPPNA+IEIVYEDAFNRFID+INSVAAY WWEGSVHSILSILAYPCAWTWKQWRRRHKIHRLQDYVKSE
Subjt:  RGTRAEETQSHVYRMYFMGPNTFREPWHLPYSPPNAIIEIVYEDAFNRFIDEINSVAAYHWWEGSVHSILSILAYPCAWTWKQWRRRHKIHRLQDYVKSE

Query:  YDHSCLRSCRSRALYKGMKVGATPDLMVAYIDFFLGGDEKRLDIVSIIEKRYPMCIIFGGDGSYMTPYNLHSDALLTNLIGQHVPATVWNRLVAGLNAQL
        YDHSCLRSCRSRALYKGMKVGATPDLMVA+IDFFLGGDEKRLDIVSIIEKRYPMCIIFGGDGSYMTPYNLHSDALLTNLIGQHVPATVWNRLVAGLNAQL
Subjt:  YDHSCLRSCRSRALYKGMKVGATPDLMVAYIDFFLGGDEKRLDIVSIIEKRYPMCIIFGGDGSYMTPYNLHSDALLTNLIGQHVPATVWNRLVAGLNAQL

Query:  RIVRNRSIRSSLIPVIDWINSHANPQLEFHGVKIEVGWFQATASGYYQLGVLVVAFGDYSSHQLERSDVLLQHTNEPSRKDAASATESLMQFDQSWQALS
        RIVRNRSIRSSLIPV+DWINSHANPQLEFHGVKIEVGWFQ TASGYYQLGVLVVA GDYS HQLERSD+L QHTNEPSRKDAASATESLMQF+QSWQALS
Subjt:  RIVRNRSIRSSLIPVIDWINSHANPQLEFHGVKIEVGWFQATASGYYQLGVLVVAFGDYSSHQLERSDVLLQHTNEPSRKDAASATESLMQFDQSWQALS

Query:  LKRITGVVNGGLINKANVGFLHYKWDFLYPLSLLLRNTKPIGHLDTVQLLITMILLADISITLLMLLQFYWISLVAFLLVLLVLPLSLLSPFPAGLNALF
        LKRITGVVNGGLINKANVGFLHYKWDFLYP SLLLRNTKPIGHLDTVQLLIT ++LADIS+TLLMLLQFYWISLVAFLLVLLVLPLSLLSPFPAGLNALF
Subjt:  LKRITGVVNGGLINKANVGFLHYKWDFLYPLSLLLRNTKPIGHLDTVQLLITMILLADISITLLMLLQFYWISLVAFLLVLLVLPLSLLSPFPAGLNALF

Query:  SKEPRRASLARIYALWNATSFANIGVAFIYGIFLYGFSTFQPPDETNTWSNRRETDKWWIMPVILLLFKSTQAQFVDWHIANLEIKDISLFCPDPDAFWA
        SKEPR+ASL+RIYALWNATSFANIGVA IYG FLYGFSTFQPPDETNTWSNRRE DKWWI+PVILLLFKS QA+ VD HIANLEIKDISLFCPDPDAFWA
Subjt:  SKEPRRASLARIYALWNATSFANIGVAFIYGIFLYGFSTFQPPDETNTWSNRRETDKWWIMPVILLLFKSTQAQFVDWHIANLEIKDISLFCPDPDAFWA

Query:  AEFA
        AEFA
Subjt:  AEFA

A0A6J1K6L2 uncharacterized protein LOC1114911380.0e+0086.36Show/hide
Query:  GTNGRGTGVGELANERTRKRW-------ERSPVIGSSGG-GLAGEWIDGGRLAGGIRVLTAR------------ECPVGTYKDVDGSDANLCIPCSLDLL
        G NG   G G     R    W       E +PV   SG   ++G   + G   GG   +T +            ECPVGTYKD++GSDANLC+PCSLDLL
Subjt:  GTNGRGTGVGELANERTRKRW-------ERSPVIGSSGG-GLAGEWIDGGRLAGGIRVLTAR------------ECPVGTYKDVDGSDANLCIPCSLDLL

Query:  PNRANFIHSRGGVDRPFCPYKCISDKYKMPNCFTPLEELMYTFGGPWPFAVILSCFLVLLALLLSTLRVKFVGYGSYRDADSIEPHSHHHFPHLLSLSEV
        PNRANFI++RGGVD PFCPYKCISDKY+MPNCFTPLEELMYTFGGP PF+VILSC L+LLALLLSTLRVKFVGYGSYRDADSIEPH HHHFPHLLSLSEV
Subjt:  PNRANFIHSRGGVDRPFCPYKCISDKYKMPNCFTPLEELMYTFGGPWPFAVILSCFLVLLALLLSTLRVKFVGYGSYRDADSIEPHSHHHFPHLLSLSEV

Query:  RGTRAEETQSHVYRMYFMGPNTFREPWHLPYSPPNAIIEIVYEDAFNRFIDEINSVAAYHWWEGSVHSILSILAYPCAWTWKQWRRRHKIHRLQDYVKSE
        RGTRA+ETQSHVYRMYFMGPNTFREPWHLPYSPPNA+IEIVYEDAFNRFIDEINSVAAYHWWEGSVHSILSILAYPCAWTWKQWRRRHKIHRLQDYVKSE
Subjt:  RGTRAEETQSHVYRMYFMGPNTFREPWHLPYSPPNAIIEIVYEDAFNRFIDEINSVAAYHWWEGSVHSILSILAYPCAWTWKQWRRRHKIHRLQDYVKSE

Query:  YDHSCLRSCRSRALYKGMKVGATPDLMVAYIDFFLGGDEKRLDIVSIIEKRYPMCIIFGGDGSYMTPYNLHSDALLTNLIGQHVPATVWNRLVAGLNAQL
        YDHSCLRSCRSRALYKGMKVGATPDLMVA+IDFFLGGDEKRLDIVSIIEKRYPMCIIFGGDGSYMTPYN+HSDALLTNLIGQHVPATVWNRLVAGLNAQL
Subjt:  YDHSCLRSCRSRALYKGMKVGATPDLMVAYIDFFLGGDEKRLDIVSIIEKRYPMCIIFGGDGSYMTPYNLHSDALLTNLIGQHVPATVWNRLVAGLNAQL

Query:  RIVRNRSIRSSLIPVIDWINSHANPQLEFHGVKIEVGWFQATASGYYQLGVLVVAFGDYSSHQLERSDVLLQHTNEPSRKDAASATESLMQFDQSWQALS
        RIVRNRSIRSSLIPV+DWINSHANPQLEFHGVKIEVGWFQ TASGYYQLGVLVVA GDYS HQLERSD+L QHTNEPSRKDAASATES MQF+QSWQALS
Subjt:  RIVRNRSIRSSLIPVIDWINSHANPQLEFHGVKIEVGWFQATASGYYQLGVLVVAFGDYSSHQLERSDVLLQHTNEPSRKDAASATESLMQFDQSWQALS

Query:  LKRITGVVNGGLINKANVGFLHYKWDFLYPLSLLLRNTKPIGHLDTVQLLITMILLADISITLLMLLQFYWISLVAFLLVLLVLPLSLLSPFPAGLNALF
        LKRITGVVNGGLINKANVGFLHYKWDFLYP SLLLRNTKPIGHLDTVQLLIT +LLADIS+TLLMLLQFYWISLVAFLLVLLVLPLSLLSPFPAGL+ALF
Subjt:  LKRITGVVNGGLINKANVGFLHYKWDFLYPLSLLLRNTKPIGHLDTVQLLITMILLADISITLLMLLQFYWISLVAFLLVLLVLPLSLLSPFPAGLNALF

Query:  SKEPRRASLARIYALWNATSFANIGVAFIYGIFLYGFSTFQPPDETNTWSNRRETDKWWIMPVILLLFKSTQAQFVDWHIANLEIKDISLFCPDPDAFWA
        SKEPR+ASL+RIYALWNATSFANIGVA IYG FLYGFSTFQPPDETNTWSNRRE DKWWI+PVILLLFKS QA+ VD HIANLEIKDISLFCPDPDAFWA
Subjt:  SKEPRRASLARIYALWNATSFANIGVAFIYGIFLYGFSTFQPPDETNTWSNRRETDKWWIMPVILLLFKSTQAQFVDWHIANLEIKDISLFCPDPDAFWA

Query:  AEFA
        AEFA
Subjt:  AEFA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G32920.1 glycine-rich protein1.5e-17548.64Show/hide
Query:  GGGLAGEWIDG-GRLAG-----GIRVLTARECPVGTYKDVDGSDANLCIPCSLDLLPNRANFIHSRGGVDRPFCPYKCISDKYKMPNCFTPLEELMYTFG
        G G+  + I G G L G     G+  L   ECP GTYK+V GSD  LC  C  + +P+RA ++  RGGV    CPYKCISD+Y MP+C+T LEEL+YTFG
Subjt:  GGGLAGEWIDG-GRLAG-----GIRVLTARECPVGTYKDVDGSDANLCIPCSLDLLPNRANFIHSRGGVDRPFCPYKCISDKYKMPNCFTPLEELMYTFG

Query:  GPWPFAVILSCFLVLLALLLSTLRVKFVGYGSYRDADSIEPHSH--HHFPHLLSLSEVRGT-RAEETQSHVYRMYFMGPNTFREPWHLPYSPPNAIIEIV
        GPW F V+L   L+LLAL+ S  R+KFV       +   +  S   H FP L SL+EV  T R EE+Q H++R+YF+GPNTF EPWHL ++PP  I EIV
Subjt:  GPWPFAVILSCFLVLLALLLSTLRVKFVGYGSYRDADSIEPHSH--HHFPHLLSLSEVRGT-RAEETQSHVYRMYFMGPNTFREPWHLPYSPPNAIIEIV

Query:  YEDAFNRFIDEINSVAAYHWWEGSVHSILSILAYPCAWTWKQWRRRHKIHRLQDYVKSEYDHSCLRSCRSRALYKGMKVGATPDLMVAYIDFFLGGDEKR
        YE AFN F+DE+N +AAY WWEG+++ +LS+L YP AW+W+Q RRR K  +L+D+V+SEYDHSCLRSCRSRALY+G+KV ATPDLM+A++DFFLGGDEKR
Subjt:  YEDAFNRFIDEINSVAAYHWWEGSVHSILSILAYPCAWTWKQWRRRHKIHRLQDYVKSEYDHSCLRSCRSRALYKGMKVGATPDLMVAYIDFFLGGDEKR

Query:  LDIVSIIEKRYPMCIIFGGDGSYMTPYNLHSDALLTNLIGQHVPATVWNRLVAGLNAQLRIVRNRSIRSSLIPVIDWINSHANPQLEFHGVKIEVGWFQA
         D+   + +R PM +IFGGDGSYM  Y+L SD +LT+L+ Q VP T W R VAGLNAQLR+V+   +RS+   V+ WI +H NP L+ HGV++++  FQA
Subjt:  LDIVSIIEKRYPMCIIFGGDGSYMTPYNLHSDALLTNLIGQHVPATVWNRLVAGLNAQLRIVRNRSIRSSLIPVIDWINSHANPQLEFHGVKIEVGWFQA

Query:  TASGYYQLGVLVVAFGDYSSHQLERSDVLLQHTNEPSRKDAASATESLMQFDQSWQAL--SLKRITGVVNGGLINKANVGFLHYKWDFLYPLSLLLRNTK
         +S   Q G+LV    D  +      +   QH      ++ +        F +++Q L   +  +     G +I+  ++ FL  + D L  +S L+ NTK
Subjt:  TASGYYQLGVLVVAFGDYSSHQLERSDVLLQHTNEPSRKDAASATESLMQFDQSWQAL--SLKRITGVVNGGLINKANVGFLHYKWDFLYPLSLLLRNTK

Query:  PIGHLDTVQLLITMILLADISITLLMLLQFYWISLVAFLLVLLVLPLSLLSPFPAGLNALFSKEPRR-ASLARIYALWNATSFANIGVAFIYG-IFLYGF
        P+GH D V L+I+++LL D+++TLL LLQ Y ISL+   L + +LPLS++ PFPAG++ALFS  PRR AS  R+YALWN TS  N+ VAF+ G +  +G 
Subjt:  PIGHLDTVQLLITMILLADISITLLMLLQFYWISLVAFLLVLLVLPLSLLSPFPAGLNALFSKEPRR-ASLARIYALWNATSFANIGVAFIYG-IFLYGF

Query:  STFQPPDETNTWSNRRETDKWWIMPVILLLFKSTQAQFVDWHIANLEIKDISLFCPDPDAFW
        S+ +       W+   + ++WWI PV L L K  Q+Q V+WH+ANLEI+D SL+  D + FW
Subjt:  STFQPPDETNTWSNRRETDKWWIMPVILLLFKSTQAQFVDWHIANLEIKDISLFCPDPDAFW

AT4G32920.2 glycine-rich protein1.5e-17548.64Show/hide
Query:  GGGLAGEWIDG-GRLAG-----GIRVLTARECPVGTYKDVDGSDANLCIPCSLDLLPNRANFIHSRGGVDRPFCPYKCISDKYKMPNCFTPLEELMYTFG
        G G+  + I G G L G     G+  L   ECP GTYK+V GSD  LC  C  + +P+RA ++  RGGV    CPYKCISD+Y MP+C+T LEEL+YTFG
Subjt:  GGGLAGEWIDG-GRLAG-----GIRVLTARECPVGTYKDVDGSDANLCIPCSLDLLPNRANFIHSRGGVDRPFCPYKCISDKYKMPNCFTPLEELMYTFG

Query:  GPWPFAVILSCFLVLLALLLSTLRVKFVGYGSYRDADSIEPHSH--HHFPHLLSLSEVRGT-RAEETQSHVYRMYFMGPNTFREPWHLPYSPPNAIIEIV
        GPW F V+L   L+LLAL+ S  R+KFV       +   +  S   H FP L SL+EV  T R EE+Q H++R+YF+GPNTF EPWHL ++PP  I EIV
Subjt:  GPWPFAVILSCFLVLLALLLSTLRVKFVGYGSYRDADSIEPHSH--HHFPHLLSLSEVRGT-RAEETQSHVYRMYFMGPNTFREPWHLPYSPPNAIIEIV

Query:  YEDAFNRFIDEINSVAAYHWWEGSVHSILSILAYPCAWTWKQWRRRHKIHRLQDYVKSEYDHSCLRSCRSRALYKGMKVGATPDLMVAYIDFFLGGDEKR
        YE AFN F+DE+N +AAY WWEG+++ +LS+L YP AW+W+Q RRR K  +L+D+V+SEYDHSCLRSCRSRALY+G+KV ATPDLM+A++DFFLGGDEKR
Subjt:  YEDAFNRFIDEINSVAAYHWWEGSVHSILSILAYPCAWTWKQWRRRHKIHRLQDYVKSEYDHSCLRSCRSRALYKGMKVGATPDLMVAYIDFFLGGDEKR

Query:  LDIVSIIEKRYPMCIIFGGDGSYMTPYNLHSDALLTNLIGQHVPATVWNRLVAGLNAQLRIVRNRSIRSSLIPVIDWINSHANPQLEFHGVKIEVGWFQA
         D+   + +R PM +IFGGDGSYM  Y+L SD +LT+L+ Q VP T W R VAGLNAQLR+V+   +RS+   V+ WI +H NP L+ HGV++++  FQA
Subjt:  LDIVSIIEKRYPMCIIFGGDGSYMTPYNLHSDALLTNLIGQHVPATVWNRLVAGLNAQLRIVRNRSIRSSLIPVIDWINSHANPQLEFHGVKIEVGWFQA

Query:  TASGYYQLGVLVVAFGDYSSHQLERSDVLLQHTNEPSRKDAASATESLMQFDQSWQAL--SLKRITGVVNGGLINKANVGFLHYKWDFLYPLSLLLRNTK
         +S   Q G+LV    D  +      +   QH      ++ +        F +++Q L   +  +     G +I+  ++ FL  + D L  +S L+ NTK
Subjt:  TASGYYQLGVLVVAFGDYSSHQLERSDVLLQHTNEPSRKDAASATESLMQFDQSWQAL--SLKRITGVVNGGLINKANVGFLHYKWDFLYPLSLLLRNTK

Query:  PIGHLDTVQLLITMILLADISITLLMLLQFYWISLVAFLLVLLVLPLSLLSPFPAGLNALFSKEPRR-ASLARIYALWNATSFANIGVAFIYG-IFLYGF
        P+GH D V L+I+++LL D+++TLL LLQ Y ISL+   L + +LPLS++ PFPAG++ALFS  PRR AS  R+YALWN TS  N+ VAF+ G +  +G 
Subjt:  PIGHLDTVQLLITMILLADISITLLMLLQFYWISLVAFLLVLLVLPLSLLSPFPAGLNALFSKEPRR-ASLARIYALWNATSFANIGVAFIYG-IFLYGF

Query:  STFQPPDETNTWSNRRETDKWWIMPVILLLFKSTQAQFVDWHIANLEIKDISLFCPDPDAFW
        S+ +       W+   + ++WWI PV L L K  Q+Q V+WH+ANLEI+D SL+  D + FW
Subjt:  STFQPPDETNTWSNRRETDKWWIMPVILLLFKSTQAQFVDWHIANLEIKDISLFCPDPDAFW

AT5G11700.1 LOCATED IN: vacuole9.8e-18348.31Show/hide
Query:  GGAGTNGRGTGVGELANERTRKRWERSP-------------VIGSSGGGLAGEWIDG--GRLAG-----GIRVLTARECPVGTYKDVDGSDANLCIPCSL
        GG+G+ G G G G     R    W   P             +I + GG  A +   G  G + G     G+  +  +ECP GT+K+V GSD +LC PC +
Subjt:  GGAGTNGRGTGVGELANERTRKRWERSP-------------VIGSSGGGLAGEWIDG--GRLAG-----GIRVLTARECPVGTYKDVDGSDANLCIPCSL

Query:  DLLPNRANFIHSRGGVDRPFCPYKCISDKYKMPNCFTPLEELMYTFGGPWPFAVILSCFLVLLALLLSTLRVKFVGYGSYRDADSIEPHSH-----HHFP
        D LP RA ++  RGGV    CPY+CIS++Y MP+C+T LEEL+YTFGGPW F ++L   L+LLAL+LS  R+KFVG     D     P  H     H FP
Subjt:  DLLPNRANFIHSRGGVDRPFCPYKCISDKYKMPNCFTPLEELMYTFGGPWPFAVILSCFLVLLALLLSTLRVKFVGYGSYRDADSIEPHSH-----HHFP

Query:  HLLSLSEVRGT-RAEETQSHVYRMYFMGPNTFREPWHLPYSPPNAIIEIVYEDAFNRFIDEINSVAAYHWWEGSVHSILSILAYPCAWTWKQWRRRHKIH
         L SL+EV  T RAE++QSHV+RMYFMGPNTF EPWHL + PP  I EIVYE AFN F+DEINS+AAY WWEG+++SILS++AYP AW+W+QWRR+ K+ 
Subjt:  HLLSLSEVRGT-RAEETQSHVYRMYFMGPNTFREPWHLPYSPPNAIIEIVYEDAFNRFIDEINSVAAYHWWEGSVHSILSILAYPCAWTWKQWRRRHKIH

Query:  RLQDYVKSEYDHSCLRSCRSRALYKGMKVGATPDLMVAYIDFFLGGDEKRLDIVSIIEKRYPMCIIFGGDGSYMTPYNLHSDALLTNLIGQHVPATVWNR
        +L+++V+SEYDHSCLRSCRSRALY+G+KV AT DLM+AY+DFFLGGDEKR D+   + +R+PM I+FGGDGSYM P++L +D +LT+L+ Q    T W R
Subjt:  RLQDYVKSEYDHSCLRSCRSRALYKGMKVGATPDLMVAYIDFFLGGDEKRLDIVSIIEKRYPMCIIFGGDGSYMTPYNLHSDALLTNLIGQHVPATVWNR

Query:  LVAGLNAQLRIVRNRSIRSSLIPVIDWINSHANPQLEFHGVKIEVGWFQATASGYYQLGVLVVAFGDYSSHQLERSDVLLQHTNEPSRKDAASATESLMQ
        LVAG+NAQLR+VR   +RS+   V+ W+ +HANP LE HG+++++ WFQ TA GY Q G+L+        H +E          EP+     S T     
Subjt:  LVAGLNAQLRIVRNRSIRSSLIPVIDWINSHANPQLEFHGVKIEVGWFQATASGYYQLGVLVVAFGDYSSHQLERSDVLLQHTNEPSRKDAASATESLMQ

Query:  FDQSWQALSLKRITGVVNGGLINKANVGFLHYKWDFLYPLSLLLRNTKPIGHLDTVQLLITMILLADISITLLMLLQFYWISLVAFLLVLLVLPLSLLSP
           +W  +  ++      GG+I+  ++  L  K D  + LS L+ NTKP+GH D V L+I+M+LL D S+ LL LLQ Y ISL+  LL L +LPL LL P
Subjt:  FDQSWQALSLKRITGVVNGGLINKANVGFLHYKWDFLYPLSLLLRNTKPIGHLDTVQLLITMILLADISITLLMLLQFYWISLVAFLLVLLVLPLSLLSP

Query:  FPAGLNALFSKEPRR-ASLARIYALWNATSFANIGVAFIYG-IFLYGFSTFQPPDETNTWSNRRETDKWWIMPVILLLFKSTQAQFVDWHIANLEIKDIS
        FPAG+NALFS  PRR A LAR+YALWN  S  N+ VAF+ G +  +  S+         W+      +WWI P  L++ K  Q+Q ++ H+ANLEI+D S
Subjt:  FPAGLNALFSKEPRR-ASLARIYALWNATSFANIGVAFIYG-IFLYGFSTFQPPDETNTWSNRRETDKWWIMPVILLLFKSTQAQFVDWHIANLEIKDIS

Query:  LFCPDPDAFW
        L+  D + FW
Subjt:  LFCPDPDAFW

AT5G11700.2 BEST Arabidopsis thaliana protein match is: glycine-rich protein (TAIR:AT4G32920.3)3.5e-18046.82Show/hide
Query:  GGAGTNGRGTGVGELANERTRKRWERSP-------------VIGSSGGGLAGEWIDG--GRLAG-----GIRVLTARECPVGTYKDVDGSDANLCIPCSL
        GG+G+ G G G G     R    W   P             +I + GG  A +   G  G + G     G+  +  +ECP GT+K+V GSD +LC PC +
Subjt:  GGAGTNGRGTGVGELANERTRKRWERSP-------------VIGSSGGGLAGEWIDG--GRLAG-----GIRVLTARECPVGTYKDVDGSDANLCIPCSL

Query:  DLLPNRANFIHSRGGVDRPFCPYKCISDKYKMPNCFTPLEELMYTFGGPWPFAVILSCFLVLLALLLSTLRVKFVGYGSYRDADSIEPHSH-----HHFP
        D LP RA ++  RGGV    CPY+CIS++Y MP+C+T LEEL+YTFGGPW F ++L   L+LLAL+LS  R+KFVG     D     P  H     H FP
Subjt:  DLLPNRANFIHSRGGVDRPFCPYKCISDKYKMPNCFTPLEELMYTFGGPWPFAVILSCFLVLLALLLSTLRVKFVGYGSYRDADSIEPHSH-----HHFP

Query:  HLLSLSEVRGT-RAEETQSHVYRMYFMGPNTFREPWHLPYSPPNAIIEIVYEDAFNRFIDEINSVAAYHWWEGSVHSILSILAYPCAWTWKQWRRRHKIH
         L SL+EV  T RAE++QSHV+RMYFMGPNTF EPWHL + PP  I EIVYE AFN F+DEINS+AAY WWEG+++SILS++AYP AW+W+QWRR+ K+ 
Subjt:  HLLSLSEVRGT-RAEETQSHVYRMYFMGPNTFREPWHLPYSPPNAIIEIVYEDAFNRFIDEINSVAAYHWWEGSVHSILSILAYPCAWTWKQWRRRHKIH

Query:  RLQDYVKSEYDHSCLRSCRSRALYKGMKVGATPDLMVAYIDFFLGGDEKRLDIVSIIEKRYPMCIIFGGDGSYMTPYNLHSDALLTNLIGQHVPATVWNR
        +L+++V+SEYDHSCLRSCRSRALY+G+KV AT DLM+AY+DFFLGGDEKR D+   + +R+PM I+FGGDGSYM P++L +D +LT+L+ Q    T W R
Subjt:  RLQDYVKSEYDHSCLRSCRSRALYKGMKVGATPDLMVAYIDFFLGGDEKRLDIVSIIEKRYPMCIIFGGDGSYMTPYNLHSDALLTNLIGQHVPATVWNR

Query:  LVAGLNAQLRIVRNRSIRSSLIPVIDWINSHANPQLEFHGVKIEVGWFQATASGYYQLGVLVVAFGDYSS------HQLERSDVLLQHTNEPSRKDAASA
        LVAG+NAQLR+VR   +RS+   V+ W+ +HANP LE HG+++++ WFQ TA GY Q G+L+    D          +   +++  +H    S K   S 
Subjt:  LVAGLNAQLRIVRNRSIRSSLIPVIDWINSHANPQLEFHGVKIEVGWFQATASGYYQLGVLVVAFGDYSS------HQLERSDVLLQHTNEPSRKDAASA

Query:  TESLMQFDQSWQALSLK-------RITGVVN----------------GGLINKANVGFLHYKWDFLYPLSLLLRNTKPIGHLDTVQLLITMILLADISIT
          SL      +   + K       R + + N                GG+I+  ++  L  K D  + LS L+ NTKP+GH D V L+I+M+LL D S+ 
Subjt:  TESLMQFDQSWQALSLK-------RITGVVN----------------GGLINKANVGFLHYKWDFLYPLSLLLRNTKPIGHLDTVQLLITMILLADISIT

Query:  LLMLLQFYWISLVAFLLVLLVLPLSLLSPFPAGLNALFSKEPRR-ASLARIYALWNATSFANIGVAFIYG-IFLYGFSTFQPPDETNTWSNRRETDKWWI
        LL LLQ Y ISL+  LL L +LPL LL PFPAG+NALFS  PRR A LAR+YALWN  S  N+ VAF+ G +  +  S+         W+      +WWI
Subjt:  LLMLLQFYWISLVAFLLVLLVLPLSLLSPFPAGLNALFSKEPRR-ASLARIYALWNATSFANIGVAFIYG-IFLYGFSTFQPPDETNTWSNRRETDKWWI

Query:  MPVILLLFKSTQAQFVDWHIANLEIKDISLFCPDPDAFW
         P  L++ K  Q+Q ++ H+ANLEI+D SL+  D + FW
Subjt:  MPVILLLFKSTQAQFVDWHIANLEIKDISLFCPDPDAFW

AT5G47020.1 unknown protein1.2e-25765Show/hide
Query:  GSSGGGLAGE-WIDGGRLAGGIRVLTARECPVGTYKDVDGSDANLCIPCSLDLLPNRANFIHSRGGVDRPFCPYKCISDKYKMPNCFTPLEELMYTFGGP
        G +GG    E  + G +   G+      ECP+GTYK+V+GSD  LC PC  + LP+RA F++ RGGV  P CPYKC+S+KY++PNC+TPLEEL+YTFGGP
Subjt:  GSSGGGLAGE-WIDGGRLAGGIRVLTARECPVGTYKDVDGSDANLCIPCSLDLLPNRANFIHSRGGVDRPFCPYKCISDKYKMPNCFTPLEELMYTFGGP

Query:  WPFAVILSCFLVLLALLLSTLRVKFVGYGSYRDADSIEPHSHHHFPHLLSLSEVRGTRAEETQSHVYRMYFMGPNTFREPWHLPYSPPNAIIEIVYEDAF
         PF ++LSC +V+L LLLSTL +K +   S+  A+SIE  S H  PHLLSLSEVRG ++E+TQ+H YRMYFMGPN FREPWHLPYSPP AIIEIVYEDAF
Subjt:  WPFAVILSCFLVLLALLLSTLRVKFVGYGSYRDADSIEPHSHHHFPHLLSLSEVRGTRAEETQSHVYRMYFMGPNTFREPWHLPYSPPNAIIEIVYEDAF

Query:  NRFIDEINSVAAYHWWEGSVHSILSILAYPCAWTWKQWRRRHKIHRLQDYVKSEYDHSCLRSCRSRALYKGMKVGATPDLMVAYIDFFLGGDEKRLDIVS
        N FI+EINS AAY WWEGSVHSILS+LA PCAW+WKQWRRR KIHRLQ+YVKS+YDHSCLRSCRSRALYKGMKVGATPDLMVAYIDFFLGGDEKR+D+VS
Subjt:  NRFIDEINSVAAYHWWEGSVHSILSILAYPCAWTWKQWRRRHKIHRLQDYVKSEYDHSCLRSCRSRALYKGMKVGATPDLMVAYIDFFLGGDEKRLDIVS

Query:  IIEKRYPMCIIFGGDGSYMTPYNLHSDALLTNLIGQHVPATVWNRLVAGLNAQLRIVRNRSIRSSLIPVIDWINSHANPQLEFHGVKIEVGWFQATASGY
        II+KR+PMCI+FGGDGSYM+PY+LHSD LLTNL+GQH+P +VW+R VAGLNAQLR VR+ SIRS+L+PVI WINSH NPQLEFHGV+IE+GWFQATASGY
Subjt:  IIEKRYPMCIIFGGDGSYMTPYNLHSDALLTNLIGQHVPATVWNRLVAGLNAQLRIVRNRSIRSSLIPVIDWINSHANPQLEFHGVKIEVGWFQATASGY

Query:  YQLGVLVVAFGDYSSHQLERSDVLLQHTNEPSRKDAASATESLMQFDQS----WQALSLKRITGVVNGGLINKANVGFLHYKWDFLYPLSLLLRNTKPIG
        YQLG+LV   GD+  + + RS    +  +E    ++A  ++SL++  Q+       LS KRI G +NGGLIN+ ++  L Y+ D L+P SLLL NT+P+G
Subjt:  YQLGVLVVAFGDYSSHQLERSDVLLQHTNEPSRKDAASATESLMQFDQS----WQALSLKRITGVVNGGLINKANVGFLHYKWDFLYPLSLLLRNTKPIG

Query:  HLDTVQLLITMILLADISITLLMLLQFYWISLVAFLLVLLVLPLSLLSPFPAGLNALFSKEPRRASLARIYALWNATSFANIGVAFIYGIFLYGFSTFQP
          DT+   I+++LLAD+S+TLL LLQFYW++L AFL +LL+LPLSLL PFPAGLNAL SKE RRASL RIY LWNATS  N+ VAFI G+   GF T + 
Subjt:  HLDTVQLLITMILLADISITLLMLLQFYWISLVAFLLVLLVLPLSLLSPFPAGLNALFSKEPRRASLARIYALWNATSFANIGVAFIYGIFLYGFSTFQP

Query:  PDETNTWSNRRETDKWWIMPVILLLFKSTQAQFVDWHIANLEIKDISLFCPDPDAFWAAE
        P   N W+  R+ DKWW++P  LLL KS QA+F+DWH+ANLE+ D SL CPDPD FWA E
Subjt:  PDETNTWSNRRETDKWWIMPVILLLFKSTQAQFVDWHIANLEIKDISLFCPDPDAFWAAE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACGAAGTCAGACGCGCGATTGATGAACAGTCGCGAATGGCTGAGACGAACCGACGAAGAACCGGCGGAGCTGGCACGAACGGACGAGGTACCGGCGTGGGAGAACT
CGCGAACGAACGAACCAGGAAACGGTGGGAGAGGTCGCCGGTGATCGGGTCGTCGGGAGGTGGGCTCGCCGGTGAATGGATCGACGGAGGACGACTTGCCGGAGGAATTA
GGGTTTTGACGGCTAGGGAATGCCCTGTTGGCACTTACAAAGATGTGGATGGGTCTGATGCAAATCTTTGCATTCCTTGTTCTCTTGATCTTCTGCCCAATCGTGCAAAT
TTCATACATTCACGGGGAGGAGTTGATCGGCCATTTTGTCCGTATAAATGCATAAGTGACAAATATAAGATGCCAAATTGCTTCACACCTCTTGAGGAGCTGATGTACAC
TTTTGGGGGTCCCTGGCCCTTTGCTGTTATATTGTCTTGCTTTTTGGTTCTTTTAGCACTTCTGTTAAGCACGCTGAGAGTAAAATTTGTTGGATATGGTTCTTATCGTG
ATGCTGATTCGATTGAACCCCACAGTCATCATCATTTCCCGCATCTTCTTTCCTTATCAGAGGTACGTGGAACTAGAGCTGAGGAAACTCAAAGTCATGTTTACAGAATG
TACTTTATGGGTCCCAATACATTCAGAGAGCCCTGGCATCTTCCTTACTCTCCTCCCAATGCAATTATAGAAATTGTGTACGAAGATGCTTTTAATAGATTCATTGATGA
GATAAATTCGGTGGCTGCATATCATTGGTGGGAAGGATCAGTGCACAGCATACTTTCTATTCTTGCATATCCTTGTGCATGGACCTGGAAACAGTGGCGACGTAGGCATA
AAATCCACCGCCTTCAGGACTATGTGAAATCTGAATATGACCATTCATGTCTGCGATCCTGCAGATCTCGTGCTTTATACAAAGGGATGAAGGTTGGAGCAACACCTGAT
TTAATGGTTGCTTACATCGATTTTTTTCTTGGTGGTGATGAGAAGCGTTTAGACATAGTCTCAATTATTGAAAAGAGATATCCAATGTGCATCATTTTTGGTGGTGATGG
AAGCTATATGACTCCTTATAATCTTCACAGTGATGCATTGTTGACCAATCTTATTGGACAGCATGTCCCTGCGACGGTTTGGAATCGCTTGGTTGCTGGCCTCAATGCTC
AATTGAGGATAGTGAGGAATAGATCCATTCGTTCTTCCTTAATTCCTGTCATAGATTGGATAAATAGTCATGCAAACCCTCAACTTGAATTCCATGGGGTTAAGATTGAG
GTTGGATGGTTCCAAGCTACTGCCTCTGGTTACTATCAGCTTGGTGTTTTGGTTGTAGCATTTGGTGATTATTCTTCTCATCAGCTAGAAAGATCGGATGTGTTGTTGCA
ACATACTAATGAACCATCAAGGAAAGATGCTGCAAGTGCAACAGAATCTCTCATGCAGTTTGATCAGAGCTGGCAAGCATTATCTCTAAAAAGGATTACAGGAGTAGTTA
ATGGAGGGCTTATAAACAAGGCCAACGTGGGATTTTTGCATTATAAATGGGACTTTCTCTATCCTCTCTCTCTCTTACTACGCAACACAAAACCTATTGGTCATCTGGAT
ACTGTACAGCTGCTTATCACCATGATACTTCTAGCAGATATTTCCATTACCCTGCTTATGCTGCTGCAGTTCTATTGGATTTCACTGGTTGCTTTTCTTCTTGTTCTTCT
TGTTCTTCCCTTGTCTCTACTCTCTCCTTTCCCAGCTGGCTTGAATGCACTTTTCAGCAAAGAGCCTAGAAGAGCATCACTTGCTCGAATATATGCTTTGTGGAATGCAA
CCTCTTTTGCTAATATTGGTGTGGCCTTCATTTATGGCATATTTCTTTATGGATTTTCAACATTCCAGCCACCCGACGAGACAAATACGTGGAGCAACAGAAGGGAGACT
GATAAGTGGTGGATCATGCCCGTAATCCTTCTGCTATTCAAATCAACACAAGCACAATTTGTTGATTGGCATATTGCCAATTTGGAAATCAAAGATATCAGTCTATTTTG
TCCCGATCCTGATGCTTTTTGGGCCGCTGAGTTTGCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAACGAAGTCAGACGCGCGATTGATGAACAGTCGCGAATGGCTGAGACGAACCGACGAAGAACCGGCGGAGCTGGCACGAACGGACGAGGTACCGGCGTGGGAGAACT
CGCGAACGAACGAACCAGGAAACGGTGGGAGAGGTCGCCGGTGATCGGGTCGTCGGGAGGTGGGCTCGCCGGTGAATGGATCGACGGAGGACGACTTGCCGGAGGAATTA
GGGTTTTGACGGCTAGGGAATGCCCTGTTGGCACTTACAAAGATGTGGATGGGTCTGATGCAAATCTTTGCATTCCTTGTTCTCTTGATCTTCTGCCCAATCGTGCAAAT
TTCATACATTCACGGGGAGGAGTTGATCGGCCATTTTGTCCGTATAAATGCATAAGTGACAAATATAAGATGCCAAATTGCTTCACACCTCTTGAGGAGCTGATGTACAC
TTTTGGGGGTCCCTGGCCCTTTGCTGTTATATTGTCTTGCTTTTTGGTTCTTTTAGCACTTCTGTTAAGCACGCTGAGAGTAAAATTTGTTGGATATGGTTCTTATCGTG
ATGCTGATTCGATTGAACCCCACAGTCATCATCATTTCCCGCATCTTCTTTCCTTATCAGAGGTACGTGGAACTAGAGCTGAGGAAACTCAAAGTCATGTTTACAGAATG
TACTTTATGGGTCCCAATACATTCAGAGAGCCCTGGCATCTTCCTTACTCTCCTCCCAATGCAATTATAGAAATTGTGTACGAAGATGCTTTTAATAGATTCATTGATGA
GATAAATTCGGTGGCTGCATATCATTGGTGGGAAGGATCAGTGCACAGCATACTTTCTATTCTTGCATATCCTTGTGCATGGACCTGGAAACAGTGGCGACGTAGGCATA
AAATCCACCGCCTTCAGGACTATGTGAAATCTGAATATGACCATTCATGTCTGCGATCCTGCAGATCTCGTGCTTTATACAAAGGGATGAAGGTTGGAGCAACACCTGAT
TTAATGGTTGCTTACATCGATTTTTTTCTTGGTGGTGATGAGAAGCGTTTAGACATAGTCTCAATTATTGAAAAGAGATATCCAATGTGCATCATTTTTGGTGGTGATGG
AAGCTATATGACTCCTTATAATCTTCACAGTGATGCATTGTTGACCAATCTTATTGGACAGCATGTCCCTGCGACGGTTTGGAATCGCTTGGTTGCTGGCCTCAATGCTC
AATTGAGGATAGTGAGGAATAGATCCATTCGTTCTTCCTTAATTCCTGTCATAGATTGGATAAATAGTCATGCAAACCCTCAACTTGAATTCCATGGGGTTAAGATTGAG
GTTGGATGGTTCCAAGCTACTGCCTCTGGTTACTATCAGCTTGGTGTTTTGGTTGTAGCATTTGGTGATTATTCTTCTCATCAGCTAGAAAGATCGGATGTGTTGTTGCA
ACATACTAATGAACCATCAAGGAAAGATGCTGCAAGTGCAACAGAATCTCTCATGCAGTTTGATCAGAGCTGGCAAGCATTATCTCTAAAAAGGATTACAGGAGTAGTTA
ATGGAGGGCTTATAAACAAGGCCAACGTGGGATTTTTGCATTATAAATGGGACTTTCTCTATCCTCTCTCTCTCTTACTACGCAACACAAAACCTATTGGTCATCTGGAT
ACTGTACAGCTGCTTATCACCATGATACTTCTAGCAGATATTTCCATTACCCTGCTTATGCTGCTGCAGTTCTATTGGATTTCACTGGTTGCTTTTCTTCTTGTTCTTCT
TGTTCTTCCCTTGTCTCTACTCTCTCCTTTCCCAGCTGGCTTGAATGCACTTTTCAGCAAAGAGCCTAGAAGAGCATCACTTGCTCGAATATATGCTTTGTGGAATGCAA
CCTCTTTTGCTAATATTGGTGTGGCCTTCATTTATGGCATATTTCTTTATGGATTTTCAACATTCCAGCCACCCGACGAGACAAATACGTGGAGCAACAGAAGGGAGACT
GATAAGTGGTGGATCATGCCCGTAATCCTTCTGCTATTCAAATCAACACAAGCACAATTTGTTGATTGGCATATTGCCAATTTGGAAATCAAAGATATCAGTCTATTTTG
TCCCGATCCTGATGCTTTTTGGGCCGCTGAGTTTGCTTAG
Protein sequenceShow/hide protein sequence
MNEVRRAIDEQSRMAETNRRRTGGAGTNGRGTGVGELANERTRKRWERSPVIGSSGGGLAGEWIDGGRLAGGIRVLTARECPVGTYKDVDGSDANLCIPCSLDLLPNRAN
FIHSRGGVDRPFCPYKCISDKYKMPNCFTPLEELMYTFGGPWPFAVILSCFLVLLALLLSTLRVKFVGYGSYRDADSIEPHSHHHFPHLLSLSEVRGTRAEETQSHVYRM
YFMGPNTFREPWHLPYSPPNAIIEIVYEDAFNRFIDEINSVAAYHWWEGSVHSILSILAYPCAWTWKQWRRRHKIHRLQDYVKSEYDHSCLRSCRSRALYKGMKVGATPD
LMVAYIDFFLGGDEKRLDIVSIIEKRYPMCIIFGGDGSYMTPYNLHSDALLTNLIGQHVPATVWNRLVAGLNAQLRIVRNRSIRSSLIPVIDWINSHANPQLEFHGVKIE
VGWFQATASGYYQLGVLVVAFGDYSSHQLERSDVLLQHTNEPSRKDAASATESLMQFDQSWQALSLKRITGVVNGGLINKANVGFLHYKWDFLYPLSLLLRNTKPIGHLD
TVQLLITMILLADISITLLMLLQFYWISLVAFLLVLLVLPLSLLSPFPAGLNALFSKEPRRASLARIYALWNATSFANIGVAFIYGIFLYGFSTFQPPDETNTWSNRRET
DKWWIMPVILLLFKSTQAQFVDWHIANLEIKDISLFCPDPDAFWAAEFA