; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0004707 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0004707
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionProtein of unknown function (DUF3537)
Genome locationchr6:6260513..6262860
RNA-Seq ExpressionLag0004707
SyntenyLag0004707
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR021924 - Protein of unknown function DUF3537


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6598276.1 hypothetical protein SDJN03_08054, partial [Cucurbita argyrosperma subsp. sororia]8.2e-21488.99Show/hide
Query:  MAELRDQVDIEDPQAHFPLLESNQNQSSQPTGLEDD-EEAHLDSALKVLDVLLGFLGFHQSSVLSCAVSWSGFVLVGIVLPVVVLELTDCAACEKYQIKD
        MA+ RD VDI+DP+AH PLL+SNQNQSSQ TG E+D EEAHLDSALK+ D LLGFLGFHQSSVLSC +SWSGFVLVGIVLPVV+L+LTDCAAC++YQIKD
Subjt:  MAELRDQVDIEDPQAHFPLLESNQNQSSQPTGLEDD-EEAHLDSALKVLDVLLGFLGFHQSSVLSCAVSWSGFVLVGIVLPVVVLELTDCAACEKYQIKD

Query:  FELDIVASQACLAAVSLLCLSHNLRKYGLKRFLTVDRQTSSLARFRKEYVKKILGSMRLLLLWAFPCFILKTAREVIRILYAERVSWGLSVAILLAMTIS
        FELDIVASQACLAAVSLLCLSHNLRKYG+KRFL+VDRQ SSLA FRK+YVKKI GS+RLL+ WA PCFILK AREVIRILYAERVSWGLSVAILLAMTIS
Subjt:  FELDIVASQACLAAVSLLCLSHNLRKYGLKRFLTVDRQTSSLARFRKEYVKKILGSMRLLLLWAFPCFILKTAREVIRILYAERVSWGLSVAILLAMTIS

Query:  WTYLSLISLSAAIVFHLMCNLQVIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLEFLVVTASQFMTLFQTTGYSAMVTLINGGDFAVS
        WTYLSLISLSAAIVFHLMCNLQVIHFD+YAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLL+FLVVTASQFMTLFQTT YSAMVTLIN GDFAVS
Subjt:  WTYLSLISLSAAIVFHLMCNLQVIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLEFLVVTASQFMTLFQTTGYSAMVTLINGGDFAVS

Query:  AIVQVVGVILCLHGATKISHRAQGIASVASRWHALVTCGPSDLSQPRYPNGNGNSETPSRLNSMTSAYSESDLESLDIVTMPTTTQLASYMSSYHKRQAF
        AIVQVVGVILCLHGATKISHRAQGIASVASRWHALVTCGP D+SQ R+ NG+GNSE+P RLNSMT AYSESDLESLDIVTMPTTTQLASYMSSYHKR+AF
Subjt:  AIVQVVGVILCLHGATKISHRAQGIASVASRWHALVTCGPSDLSQPRYPNGNGNSETPSRLNSMTSAYSESDLESLDIVTMPTTTQLASYMSSYHKRQAF

Query:  VMYLQMNPGGITIFGWTVDRALLNTIFFIELTLVTFVLGKTIIFS
        VMYLQMNPGGITIFGWTV+RALLNTIFFIELTLVTFVLGKT++F+
Subjt:  VMYLQMNPGGITIFGWTVDRALLNTIFFIELTLVTFVLGKTIIFS

XP_022131491.1 uncharacterized protein LOC111004677 isoform X2 [Momordica charantia]1.0e-21689.44Show/hide
Query:  MAELRDQVDIEDPQAHFPLLESNQNQSSQPTGLEDDEEAHLDSALKVLDVLLGFLGFHQSSVLSCAVSWSGFVLVGIVLPVVVLELTDCAACEKYQIKDF
        MAE RDQVDIEDP A  PLLES Q+Q+S+PTG EDDEEAHLDSAL++LDVLL FLGFHQSSVLSCA+SW+GFVLVGIVLPVVVL+LTDCA CEKYQIKDF
Subjt:  MAELRDQVDIEDPQAHFPLLESNQNQSSQPTGLEDDEEAHLDSALKVLDVLLGFLGFHQSSVLSCAVSWSGFVLVGIVLPVVVLELTDCAACEKYQIKDF

Query:  ELDIVASQACLAAVSLLCLSHNLRKYGLKRFLTVDRQTSSLARFRKEYVKKILGSMRLLLLWAFPCFILKTAREVIRILYAERVSWGLSVAILLAMTISW
        ELDIVASQACLAAVSLLCLSHNLRKYG+KRFLTVDRQTSSLARFRK+YV+KILGS+RLL+ WA  CFILK  REVIRILYAERVSWG+S+AILLAMTISW
Subjt:  ELDIVASQACLAAVSLLCLSHNLRKYGLKRFLTVDRQTSSLARFRKEYVKKILGSMRLLLLWAFPCFILKTAREVIRILYAERVSWGLSVAILLAMTISW

Query:  TYLSLISLSAAIVFHLMCNLQVIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLEFLVVTASQFMTLFQTTGYSAMVTLINGGDFAVSA
        TY+SLIS+S AIVFHLMCNLQ+IHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLL+FLVVTASQFMTLFQTTGYSAM+TLINGGDFAVSA
Subjt:  TYLSLISLSAAIVFHLMCNLQVIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLEFLVVTASQFMTLFQTTGYSAMVTLINGGDFAVSA

Query:  IVQVVGVILCLHGATKISHRAQGIASVASRWHALVTCGPSDLSQPRYPNGNGNSETP-SRLNSMTSAYSESDLESLDIVTMPTTTQLASYMSSYHKRQAF
        IVQVVGVILCLHGATKISHRAQG ASVASRWHALVTCGP D+SQPR+ NGNGNS+ P SRLNSMTS YSESDLESLDI+TMPTTTQLASYM+SYHKRQAF
Subjt:  IVQVVGVILCLHGATKISHRAQGIASVASRWHALVTCGPSDLSQPRYPNGNGNSETP-SRLNSMTSAYSESDLESLDIVTMPTTTQLASYMSSYHKRQAF

Query:  VMYLQMNPGGITIFGWTVDRALLNTIFFIELTLVTFVLGKTIIFS
        VMYLQMNPGGITIFGWTVDRAL+NTIFFIELTLVTFVLGKTI+FS
Subjt:  VMYLQMNPGGITIFGWTVDRALLNTIFFIELTLVTFVLGKTIIFS

XP_022962411.1 uncharacterized protein LOC111462859 [Cucurbita moschata]7.0e-21388.54Show/hide
Query:  MAELRDQVDIEDPQAHFPLLESNQNQSSQPTGLEDD-EEAHLDSALKVLDVLLGFLGFHQSSVLSCAVSWSGFVLVGIVLPVVVLELTDCAACEKYQIKD
        MA+ RD VDI+DP+AH PLL+SNQNQSSQ T  E+D EEAHLDSALK+ D LLGFLGFHQSSVLSC +SWSGFVLVGIVLPVV+L+LTDCAAC+++QIKD
Subjt:  MAELRDQVDIEDPQAHFPLLESNQNQSSQPTGLEDD-EEAHLDSALKVLDVLLGFLGFHQSSVLSCAVSWSGFVLVGIVLPVVVLELTDCAACEKYQIKD

Query:  FELDIVASQACLAAVSLLCLSHNLRKYGLKRFLTVDRQTSSLARFRKEYVKKILGSMRLLLLWAFPCFILKTAREVIRILYAERVSWGLSVAILLAMTIS
        FELDIVASQACLAAVSLLCLSHNLRKYG+KRFL+VDRQ SSLA FRK+YVKKI GS+RLL+ WA PCFILK AREVIRILYAERVSWG+SVAILLAMTIS
Subjt:  FELDIVASQACLAAVSLLCLSHNLRKYGLKRFLTVDRQTSSLARFRKEYVKKILGSMRLLLLWAFPCFILKTAREVIRILYAERVSWGLSVAILLAMTIS

Query:  WTYLSLISLSAAIVFHLMCNLQVIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLEFLVVTASQFMTLFQTTGYSAMVTLINGGDFAVS
        WTYLSLISLSAAIVFHLMCNLQVIHFD+YAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLL+FLVVTASQFMTLFQTT YSAMVTLIN GDFAVS
Subjt:  WTYLSLISLSAAIVFHLMCNLQVIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLEFLVVTASQFMTLFQTTGYSAMVTLINGGDFAVS

Query:  AIVQVVGVILCLHGATKISHRAQGIASVASRWHALVTCGPSDLSQPRYPNGNGNSETPSRLNSMTSAYSESDLESLDIVTMPTTTQLASYMSSYHKRQAF
        AIVQVVGVILCLHGATKISHRAQGIASVASRWHALVTCGP D+SQ R+PNG+GNSE+P RLNSMT AYSESDLESLDIVTMPTTTQLASYMSSYHKR+AF
Subjt:  AIVQVVGVILCLHGATKISHRAQGIASVASRWHALVTCGPSDLSQPRYPNGNGNSETPSRLNSMTSAYSESDLESLDIVTMPTTTQLASYMSSYHKRQAF

Query:  VMYLQMNPGGITIFGWTVDRALLNTIFFIELTLVTFVLGKTIIFS
        VMYLQMNPGGITIFGWTV+RALLNTIFFIELTLVTFVLGKT++F+
Subjt:  VMYLQMNPGGITIFGWTVDRALLNTIFFIELTLVTFVLGKTIIFS

XP_022996746.1 uncharacterized protein LOC111491891 [Cucurbita maxima]1.8e-21388.54Show/hide
Query:  MAELRDQVDIEDPQAHFPLLESNQNQSSQPTGLEDD-EEAHLDSALKVLDVLLGFLGFHQSSVLSCAVSWSGFVLVGIVLPVVVLELTDCAACEKYQIKD
        MA+ RD VDI+DP+AH PLL+SNQNQSSQ TG E+D EEAHLDSALK+ D LLGFLGFHQSSVLSC +SWSGFVLVGIVLPVV+L+LTDCAAC++YQIKD
Subjt:  MAELRDQVDIEDPQAHFPLLESNQNQSSQPTGLEDD-EEAHLDSALKVLDVLLGFLGFHQSSVLSCAVSWSGFVLVGIVLPVVVLELTDCAACEKYQIKD

Query:  FELDIVASQACLAAVSLLCLSHNLRKYGLKRFLTVDRQTSSLARFRKEYVKKILGSMRLLLLWAFPCFILKTAREVIRILYAERVSWGLSVAILLAMTIS
        FELDIVASQACLAAVSLLCLSHNLRKYG+KRFL+VDRQ SSLA FRK+YVKKI GS+RLL+ WA PCFILK AREVIRILY ERV+WGLSVAILLAMTIS
Subjt:  FELDIVASQACLAAVSLLCLSHNLRKYGLKRFLTVDRQTSSLARFRKEYVKKILGSMRLLLLWAFPCFILKTAREVIRILYAERVSWGLSVAILLAMTIS

Query:  WTYLSLISLSAAIVFHLMCNLQVIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLEFLVVTASQFMTLFQTTGYSAMVTLINGGDFAVS
        WTYLSLISLSAAIVFHLMCNLQVIHFD+YAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLL+FLVVTASQFMTLFQTT YSAMVTLIN GDFAVS
Subjt:  WTYLSLISLSAAIVFHLMCNLQVIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLEFLVVTASQFMTLFQTTGYSAMVTLINGGDFAVS

Query:  AIVQVVGVILCLHGATKISHRAQGIASVASRWHALVTCGPSDLSQPRYPNGNGNSETPSRLNSMTSAYSESDLESLDIVTMPTTTQLASYMSSYHKRQAF
        AIVQVVGVILCLHGATKISHRAQGIASVASRWHALVTCGP D+SQ R+PNG+GNSE+P RLNSM  AYSESDLESLDIVTMPTTTQLASYMSSYHKR+AF
Subjt:  AIVQVVGVILCLHGATKISHRAQGIASVASRWHALVTCGPSDLSQPRYPNGNGNSETPSRLNSMTSAYSESDLESLDIVTMPTTTQLASYMSSYHKRQAF

Query:  VMYLQMNPGGITIFGWTVDRALLNTIFFIELTLVTFVLGKTIIFS
        VMYLQMNPGGITIFGWTV+RALLNTIFFIELTLVTFVLGKT++F+
Subjt:  VMYLQMNPGGITIFGWTVDRALLNTIFFIELTLVTFVLGKTIIFS

XP_023547131.1 uncharacterized protein LOC111806032 [Cucurbita pepo subsp. pepo]2.8e-21488.99Show/hide
Query:  MAELRDQVDIEDPQAHFPLLESNQNQSSQPTGLEDD-EEAHLDSALKVLDVLLGFLGFHQSSVLSCAVSWSGFVLVGIVLPVVVLELTDCAACEKYQIKD
        MA+ RD VDI+DP+AH PLL+SNQNQSSQ TG E+D EEAHLDSALK+ D LLGFLGFHQSSVLSC +SWSGFVLVGIVLPVV+L+LTDCAAC++YQIKD
Subjt:  MAELRDQVDIEDPQAHFPLLESNQNQSSQPTGLEDD-EEAHLDSALKVLDVLLGFLGFHQSSVLSCAVSWSGFVLVGIVLPVVVLELTDCAACEKYQIKD

Query:  FELDIVASQACLAAVSLLCLSHNLRKYGLKRFLTVDRQTSSLARFRKEYVKKILGSMRLLLLWAFPCFILKTAREVIRILYAERVSWGLSVAILLAMTIS
        FELDIVASQACLAAVSLLCLSHNLRKYG+KRFL+VDRQ SSLA FRK+YVKKI GS+RLL+ WA PCFILK AREVIRILYAERVSWGLSVAILLAMTIS
Subjt:  FELDIVASQACLAAVSLLCLSHNLRKYGLKRFLTVDRQTSSLARFRKEYVKKILGSMRLLLLWAFPCFILKTAREVIRILYAERVSWGLSVAILLAMTIS

Query:  WTYLSLISLSAAIVFHLMCNLQVIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLEFLVVTASQFMTLFQTTGYSAMVTLINGGDFAVS
        WTYLSLISLSAAIVFHLMCNLQVIHFD+YAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLL+FLVVTASQFMTLFQTT YSA+VTLIN GDFAVS
Subjt:  WTYLSLISLSAAIVFHLMCNLQVIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLEFLVVTASQFMTLFQTTGYSAMVTLINGGDFAVS

Query:  AIVQVVGVILCLHGATKISHRAQGIASVASRWHALVTCGPSDLSQPRYPNGNGNSETPSRLNSMTSAYSESDLESLDIVTMPTTTQLASYMSSYHKRQAF
        AIVQVVGVILCLHGATKISHRAQGIASVASRWHALVTCGP D+SQ R+PNG+GNSE+P RLNSMT AYSESDLESLDIVTMPTTTQLASYMSSYHKR+AF
Subjt:  AIVQVVGVILCLHGATKISHRAQGIASVASRWHALVTCGPSDLSQPRYPNGNGNSETPSRLNSMTSAYSESDLESLDIVTMPTTTQLASYMSSYHKRQAF

Query:  VMYLQMNPGGITIFGWTVDRALLNTIFFIELTLVTFVLGKTIIFS
        VMYLQMNPGGITIFGWTV+RALLNTIFFIELTLVTFVLGKT++F+
Subjt:  VMYLQMNPGGITIFGWTVDRALLNTIFFIELTLVTFVLGKTIIFS

TrEMBL top hitse value%identityAlignment
A0A0A0LQU6 Uncharacterized protein4.1e-21186.94Show/hide
Query:  MAELRDQVDIEDPQAHFPLLESNQNQSSQPTGLEDDEEAHLDSALKVLDVLLGFLGFHQSSVLSCAVSWSGFVLVGIVLPVVVLELTDCAACEKYQIKDF
        MAELRDQVDIEDPQAH PLL+SN NQSSQPT  EDDEEAHLDSA K+ D LLG LGFHQSSV SC +SWS FVLVGIVLPVVVL+L+DCAA EKYQIK F
Subjt:  MAELRDQVDIEDPQAHFPLLESNQNQSSQPTGLEDDEEAHLDSALKVLDVLLGFLGFHQSSVLSCAVSWSGFVLVGIVLPVVVLELTDCAACEKYQIKDF

Query:  ELDIVASQACLAAVSLLCLSHNLRKYGLKRFLTVDRQTSSLARFRKEYVKKILGSMRLLLLWAFPCFILKTAREVIRILYAERVSWGLSVAILLAMTISW
        ELD+VASQACLAAVSLLCLSHNLRKYG+KRFL+VDRQ +SLARFRKEYVKKI GS+RLL+ WA PCF+LKTAREVIRILYAERVSWGLSVA LLAM ISW
Subjt:  ELDIVASQACLAAVSLLCLSHNLRKYGLKRFLTVDRQTSSLARFRKEYVKKILGSMRLLLLWAFPCFILKTAREVIRILYAERVSWGLSVAILLAMTISW

Query:  TYLSLISLSAAIVFHLMCNLQVIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLEFLVVTASQFMTLFQTTGYSAMVTLINGGDFAVSA
        TYL+LISLSAAIVFHLMCNLQV HFD+YAKLLQTESEVL+LIE+HIFLRYHLSKISHRFRIFLLL+F VV+ASQFMTLFQTT Y+  VTLINGGDFAVSA
Subjt:  TYLSLISLSAAIVFHLMCNLQVIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLEFLVVTASQFMTLFQTTGYSAMVTLINGGDFAVSA

Query:  IVQVVGVILCLHGATKISHRAQGIASVASRWHALVTCGPSDLSQPRYPNGNGNSETPSRLNSMTSAYSESDLESLDIVTMPTTTQLASYMSSYHKRQAFV
        IVQVVGVILCLHGATKISHRA+GIASVASRWHALVTCGP ++SQPRYPNGNGNSE+P RL SMT  YSESDLESLDIVTMPTTTQLASYMSSYHKR+AFV
Subjt:  IVQVVGVILCLHGATKISHRAQGIASVASRWHALVTCGPSDLSQPRYPNGNGNSETPSRLNSMTSAYSESDLESLDIVTMPTTTQLASYMSSYHKRQAFV

Query:  MYLQMNPGGITIFGWTVDRALLNTIFFIELTLVTFVLGKTIIFS
        MYLQMNPGGITIFGWTV+RALLNTIFF+ELTLVTFVLGKT++F+
Subjt:  MYLQMNPGGITIFGWTVDRALLNTIFFIELTLVTFVLGKTIIFS

A0A5A7V5N0 Uncharacterized protein1.6e-21085.81Show/hide
Query:  MAELRDQVDIEDPQAHFPLLESNQNQSSQPTGLEDDEEAHLDSALKVLDVLLGFLGFHQSSVLSCAVSWSGFVLVGIVLPVVVLELTDCAACEKYQIKDF
        MAELRDQVD+EDPQAH PLLESN NQSSQPT  EDDEEAHLDSA K+ D LLG LGFHQSSV SC +SWS FVLVG+VLPVVVL+L+DCAA EKYQIKDF
Subjt:  MAELRDQVDIEDPQAHFPLLESNQNQSSQPTGLEDDEEAHLDSALKVLDVLLGFLGFHQSSVLSCAVSWSGFVLVGIVLPVVVLELTDCAACEKYQIKDF

Query:  ELDIVASQACLAAVSLLCLSHNLRKYGLKRFLTVDRQTSSLARFRKEYVKKILGSMRLLLLWAFPCFILKTAREVIRILYAERVSWGLSVAILLAMTISW
        ELD+VAS ACLAAVSLLCLSHNLRKYG+ RFL+VDRQ +SLARFRKEYVKKI GS+RLL+ WA PCF+LKTAREVIRILYAER+SWGLSVA +LAM ISW
Subjt:  ELDIVASQACLAAVSLLCLSHNLRKYGLKRFLTVDRQTSSLARFRKEYVKKILGSMRLLLLWAFPCFILKTAREVIRILYAERVSWGLSVAILLAMTISW

Query:  TYLSLISLSAAIVFHLMCNLQVIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLEFLVVTASQFMTLFQTTGYSAMVTLINGGDFAVSA
        TYL+LISLSAAIVFHLMCNLQVIHFD+YAKLLQTESEVL+LIE+HIFLRYHLSKISHRFRIFLLL+FLVV+A+QFMTLFQTT Y+  VTL+NGGDFAVSA
Subjt:  TYLSLISLSAAIVFHLMCNLQVIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLEFLVVTASQFMTLFQTTGYSAMVTLINGGDFAVSA

Query:  IVQVVGVILCLHGATKISHRAQGIASVASRWHALVTCGPSDLSQPRYPNGNGNSETPSRLNSMTSAYSESDLESLDIVTMPTTTQLASYMSSYHKRQAFV
        IVQVVGVILCLHGATKISHRA+GIASVASRWHALVTCGP ++SQPRYPNGNGNS++P RL SMT  YSESDLESLDIVTMPTTTQLASYMSSYHKR+AFV
Subjt:  IVQVVGVILCLHGATKISHRAQGIASVASRWHALVTCGPSDLSQPRYPNGNGNSETPSRLNSMTSAYSESDLESLDIVTMPTTTQLASYMSSYHKRQAFV

Query:  MYLQMNPGGITIFGWTVDRALLNTIFFIELTLVTFVLGKTIIFS
        MYLQMNPGGITIFGWTV+RALLNTIFF+ELTLVTFVLGKT++F+
Subjt:  MYLQMNPGGITIFGWTVDRALLNTIFFIELTLVTFVLGKTIIFS

A0A6J1BPM9 uncharacterized protein LOC111004677 isoform X25.0e-21789.44Show/hide
Query:  MAELRDQVDIEDPQAHFPLLESNQNQSSQPTGLEDDEEAHLDSALKVLDVLLGFLGFHQSSVLSCAVSWSGFVLVGIVLPVVVLELTDCAACEKYQIKDF
        MAE RDQVDIEDP A  PLLES Q+Q+S+PTG EDDEEAHLDSAL++LDVLL FLGFHQSSVLSCA+SW+GFVLVGIVLPVVVL+LTDCA CEKYQIKDF
Subjt:  MAELRDQVDIEDPQAHFPLLESNQNQSSQPTGLEDDEEAHLDSALKVLDVLLGFLGFHQSSVLSCAVSWSGFVLVGIVLPVVVLELTDCAACEKYQIKDF

Query:  ELDIVASQACLAAVSLLCLSHNLRKYGLKRFLTVDRQTSSLARFRKEYVKKILGSMRLLLLWAFPCFILKTAREVIRILYAERVSWGLSVAILLAMTISW
        ELDIVASQACLAAVSLLCLSHNLRKYG+KRFLTVDRQTSSLARFRK+YV+KILGS+RLL+ WA  CFILK  REVIRILYAERVSWG+S+AILLAMTISW
Subjt:  ELDIVASQACLAAVSLLCLSHNLRKYGLKRFLTVDRQTSSLARFRKEYVKKILGSMRLLLLWAFPCFILKTAREVIRILYAERVSWGLSVAILLAMTISW

Query:  TYLSLISLSAAIVFHLMCNLQVIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLEFLVVTASQFMTLFQTTGYSAMVTLINGGDFAVSA
        TY+SLIS+S AIVFHLMCNLQ+IHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLL+FLVVTASQFMTLFQTTGYSAM+TLINGGDFAVSA
Subjt:  TYLSLISLSAAIVFHLMCNLQVIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLEFLVVTASQFMTLFQTTGYSAMVTLINGGDFAVSA

Query:  IVQVVGVILCLHGATKISHRAQGIASVASRWHALVTCGPSDLSQPRYPNGNGNSETP-SRLNSMTSAYSESDLESLDIVTMPTTTQLASYMSSYHKRQAF
        IVQVVGVILCLHGATKISHRAQG ASVASRWHALVTCGP D+SQPR+ NGNGNS+ P SRLNSMTS YSESDLESLDI+TMPTTTQLASYM+SYHKRQAF
Subjt:  IVQVVGVILCLHGATKISHRAQGIASVASRWHALVTCGPSDLSQPRYPNGNGNSETP-SRLNSMTSAYSESDLESLDIVTMPTTTQLASYMSSYHKRQAF

Query:  VMYLQMNPGGITIFGWTVDRALLNTIFFIELTLVTFVLGKTIIFS
        VMYLQMNPGGITIFGWTVDRAL+NTIFFIELTLVTFVLGKTI+FS
Subjt:  VMYLQMNPGGITIFGWTVDRALLNTIFFIELTLVTFVLGKTIIFS

A0A6J1HD22 uncharacterized protein LOC1114628593.4e-21388.54Show/hide
Query:  MAELRDQVDIEDPQAHFPLLESNQNQSSQPTGLEDD-EEAHLDSALKVLDVLLGFLGFHQSSVLSCAVSWSGFVLVGIVLPVVVLELTDCAACEKYQIKD
        MA+ RD VDI+DP+AH PLL+SNQNQSSQ T  E+D EEAHLDSALK+ D LLGFLGFHQSSVLSC +SWSGFVLVGIVLPVV+L+LTDCAAC+++QIKD
Subjt:  MAELRDQVDIEDPQAHFPLLESNQNQSSQPTGLEDD-EEAHLDSALKVLDVLLGFLGFHQSSVLSCAVSWSGFVLVGIVLPVVVLELTDCAACEKYQIKD

Query:  FELDIVASQACLAAVSLLCLSHNLRKYGLKRFLTVDRQTSSLARFRKEYVKKILGSMRLLLLWAFPCFILKTAREVIRILYAERVSWGLSVAILLAMTIS
        FELDIVASQACLAAVSLLCLSHNLRKYG+KRFL+VDRQ SSLA FRK+YVKKI GS+RLL+ WA PCFILK AREVIRILYAERVSWG+SVAILLAMTIS
Subjt:  FELDIVASQACLAAVSLLCLSHNLRKYGLKRFLTVDRQTSSLARFRKEYVKKILGSMRLLLLWAFPCFILKTAREVIRILYAERVSWGLSVAILLAMTIS

Query:  WTYLSLISLSAAIVFHLMCNLQVIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLEFLVVTASQFMTLFQTTGYSAMVTLINGGDFAVS
        WTYLSLISLSAAIVFHLMCNLQVIHFD+YAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLL+FLVVTASQFMTLFQTT YSAMVTLIN GDFAVS
Subjt:  WTYLSLISLSAAIVFHLMCNLQVIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLEFLVVTASQFMTLFQTTGYSAMVTLINGGDFAVS

Query:  AIVQVVGVILCLHGATKISHRAQGIASVASRWHALVTCGPSDLSQPRYPNGNGNSETPSRLNSMTSAYSESDLESLDIVTMPTTTQLASYMSSYHKRQAF
        AIVQVVGVILCLHGATKISHRAQGIASVASRWHALVTCGP D+SQ R+PNG+GNSE+P RLNSMT AYSESDLESLDIVTMPTTTQLASYMSSYHKR+AF
Subjt:  AIVQVVGVILCLHGATKISHRAQGIASVASRWHALVTCGPSDLSQPRYPNGNGNSETPSRLNSMTSAYSESDLESLDIVTMPTTTQLASYMSSYHKRQAF

Query:  VMYLQMNPGGITIFGWTVDRALLNTIFFIELTLVTFVLGKTIIFS
        VMYLQMNPGGITIFGWTV+RALLNTIFFIELTLVTFVLGKT++F+
Subjt:  VMYLQMNPGGITIFGWTVDRALLNTIFFIELTLVTFVLGKTIIFS

A0A6J1K5M7 uncharacterized protein LOC1114918918.9e-21488.54Show/hide
Query:  MAELRDQVDIEDPQAHFPLLESNQNQSSQPTGLEDD-EEAHLDSALKVLDVLLGFLGFHQSSVLSCAVSWSGFVLVGIVLPVVVLELTDCAACEKYQIKD
        MA+ RD VDI+DP+AH PLL+SNQNQSSQ TG E+D EEAHLDSALK+ D LLGFLGFHQSSVLSC +SWSGFVLVGIVLPVV+L+LTDCAAC++YQIKD
Subjt:  MAELRDQVDIEDPQAHFPLLESNQNQSSQPTGLEDD-EEAHLDSALKVLDVLLGFLGFHQSSVLSCAVSWSGFVLVGIVLPVVVLELTDCAACEKYQIKD

Query:  FELDIVASQACLAAVSLLCLSHNLRKYGLKRFLTVDRQTSSLARFRKEYVKKILGSMRLLLLWAFPCFILKTAREVIRILYAERVSWGLSVAILLAMTIS
        FELDIVASQACLAAVSLLCLSHNLRKYG+KRFL+VDRQ SSLA FRK+YVKKI GS+RLL+ WA PCFILK AREVIRILY ERV+WGLSVAILLAMTIS
Subjt:  FELDIVASQACLAAVSLLCLSHNLRKYGLKRFLTVDRQTSSLARFRKEYVKKILGSMRLLLLWAFPCFILKTAREVIRILYAERVSWGLSVAILLAMTIS

Query:  WTYLSLISLSAAIVFHLMCNLQVIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLEFLVVTASQFMTLFQTTGYSAMVTLINGGDFAVS
        WTYLSLISLSAAIVFHLMCNLQVIHFD+YAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLL+FLVVTASQFMTLFQTT YSAMVTLIN GDFAVS
Subjt:  WTYLSLISLSAAIVFHLMCNLQVIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLEFLVVTASQFMTLFQTTGYSAMVTLINGGDFAVS

Query:  AIVQVVGVILCLHGATKISHRAQGIASVASRWHALVTCGPSDLSQPRYPNGNGNSETPSRLNSMTSAYSESDLESLDIVTMPTTTQLASYMSSYHKRQAF
        AIVQVVGVILCLHGATKISHRAQGIASVASRWHALVTCGP D+SQ R+PNG+GNSE+P RLNSM  AYSESDLESLDIVTMPTTTQLASYMSSYHKR+AF
Subjt:  AIVQVVGVILCLHGATKISHRAQGIASVASRWHALVTCGPSDLSQPRYPNGNGNSETPSRLNSMTSAYSESDLESLDIVTMPTTTQLASYMSSYHKRQAF

Query:  VMYLQMNPGGITIFGWTVDRALLNTIFFIELTLVTFVLGKTIIFS
        VMYLQMNPGGITIFGWTV+RALLNTIFFIELTLVTFVLGKT++F+
Subjt:  VMYLQMNPGGITIFGWTVDRALLNTIFFIELTLVTFVLGKTIIFS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G50630.1 Protein of unknown function (DUF3537)1.8e-4930.75Show/hide
Query:  AHLDSALKVLDVLLGFLGFHQSSVLSCAVSWSGFVLVGIVLPVVVLELTDCAACEKYQIKDFELDIVASQACLAAVSLLCLSHNLRKYGLKRFLTVDRQT
        +H    L      L ++    SS  +  +SW+ F++  +V+P +   L  CA C+ Y  + ++  +  S + +A VS LCL+  + KYGL+RFL  D+  
Subjt:  AHLDSALKVLDVLLGFLGFHQSSVLSCAVSWSGFVLVGIVLPVVVLELTDCAACEKYQIKDFELDIVASQACLAAVSLLCLSHNLRKYGLKRFLTVDRQT

Query:  SSLARFRKEYVKKILGSMRLLLLWAFPCFILKTAREV-------IRILYAERVSWGLSVAILLAMTISWTYLSLISLSAAIVFHLMCNLQVIHFDDYAKL
              R+ Y  ++  S+ ++  +  PCF   +A ++        RI +        +VA ++ +  SW Y + +     ++F L+C+LQ++   D+AKL
Subjt:  SSLARFRKEYVKKILGSMRLLLLWAFPCFILKTAREV-------IRILYAERVSWGLSVAILLAMTISWTYLSLISLSAAIVFHLMCNLQVIHFDDYAKL

Query:  LQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLEFLVVTASQFMTLFQTTGYSAMVTLINGGDFAVSAIVQVVGVILCLHGATKISHRAQGIASVASRW
         Q +S+V  ++ EH+ +R HL  ISHR+R F+L   ++VT SQF +L  TT     V +   G+ A+ ++  V  +++ L  A+KI+H+AQ +  +A++W
Subjt:  LQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLEFLVVTASQFMTLFQTTGYSAMVTLINGGDFAVSAIVQVVGVILCLHGATKISHRAQGIASVASRW

Query:  HALVTCGPSDLSQPRYP-----------NGNGNSE-----TPSRLNSMTSAYSESDLESLDIVTMPTTTQLASYMSSYHKRQAFVMYLQMNPGGITIFGW
        H   T    D +   +            N N N+      T +  +S      E DL++ DI+ +      A    S+ KRQA V Y + N  GIT++G+
Subjt:  HALVTCGPSDLSQPRYP-----------NGNGNSE-----TPSRLNSMTSAYSESDLESLDIVTMPTTTQLASYMSSYHKRQAFVMYLQMNPGGITIFGW

Query:  TVDRALLNTIFFIELTLVTFVLGKTI
        T+DR  L+TIF +EL+LV ++LGKTI
Subjt:  TVDRALLNTIFFIELTLVTFVLGKTI

AT1G67570.1 Protein of unknown function (DUF3537)6.9e-13459.42Show/hide
Query:  EDPQAHFP-LLESNQNQSSQPT-------------GLEDDEEAHLDSALKVLDVLLGFLGFHQSSVLSCAVSWSGFVLVGIVLPVVVLELTDCAACEKYQ
        + P    P LL SNQ+   +P+             G+ DD    LD  L+ L+  L  LGF+QSS  S  +SW  F+ +G+VLPV VLEL  C  CE+YQ
Subjt:  EDPQAHFP-LLESNQNQSSQPT-------------GLEDDEEAHLDSALKVLDVLLGFLGFHQSSVLSCAVSWSGFVLVGIVLPVVVLELTDCAACEKYQ

Query:  IKDFELDIVASQACLAAVSLLCLSHNLRKYGLKRFLTVDRQTSSLARFRKEYVKKILGSMRLLLLWAFPCFILKTAREVIRILYAERVSWGLSVAILLAM
         K FEL+IV SQA LA VSLLC+SHNLRK+G+++FL VD+ +  + R + +Y+++IL S+RLL +W+ PCF LK  RE+IR+ Y       LSVAILL+M
Subjt:  IKDFELDIVASQACLAAVSLLCLSHNLRKYGLKRFLTVDRQTSSLARFRKEYVKKILGSMRLLLLWAFPCFILKTAREVIRILYAERVSWGLSVAILLAM

Query:  TISWTYLSLISLSAAIVFHLMCNLQVIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLEFLVVTASQFMTLFQTTGYSAMVTLINGGDF
         +SWTYLS I L+A+ +FHL+CNLQVIHF+DYAKLL+ ESE+ L I EH+ LR++LSKISHRFRIFLLL+FLVVTASQF TLFQTT YS  +T INGGDF
Subjt:  TISWTYLSLISLSAAIVFHLMCNLQVIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLEFLVVTASQFMTLFQTTGYSAMVTLINGGDF

Query:  AVSAIVQVVGVILCLHGATKISHRAQGIASVASRWHALVTCGPSDLSQPR-YPNGNGNSETPSRLNSMTSAYSESDLESLD-IVTMPTTTQLASY--MSS
        AVSA+VQVVG+ILCLH ATKISHRAQ IASVASRWHA+++C  +D +Q R  P+G     T +   S   + S+SD+ES+D  + MP T Q  SY  MSS
Subjt:  AVSAIVQVVGVILCLHGATKISHRAQGIASVASRWHALVTCGPSDLSQPR-YPNGNGNSETPSRLNSMTSAYSESDLESLD-IVTMPTTTQLASY--MSS

Query:  YHKRQAFVMYLQMNPGGITIFGWTVDRALLNTIFFIELTLVTFVLGKTIIF
        YHKRQAFV+YLQMNPGGITIFGWTVDR L+NTIFFIEL+LVTFVLGKT++F
Subjt:  YHKRQAFVMYLQMNPGGITIFGWTVDRALLNTIFFIELTLVTFVLGKTIIF

AT3G20300.1 Protein of unknown function (DUF3537)3.1e-4930.97Show/hide
Query:  AHLDSALKVLDVLLGFLGFHQSSVLSCAVSWSGFVLVGIVLPVVVLELTDCAACEKYQIKDFELDIVASQACLAAVSLLCLSHNLRKYGLKRFLTVDRQT
        +H    L      L ++   QSS  +  +SWS FV+  +V+P     +  C+ C+ +  + ++  +  S +  AA+S LCLS  + KYGL+RFL  D+  
Subjt:  AHLDSALKVLDVLLGFLGFHQSSVLSCAVSWSGFVLVGIVLPVVVLELTDCAACEKYQIKDFELDIVASQACLAAVSLLCLSHNLRKYGLKRFLTVDRQT

Query:  SSLARFRKEYVKKILGSMRLLLLWAFPCFILKTAREV-------IRILYAERVSWGLSVAILLAMTISWTYLSLISLSAAIVFHLMCNLQVIHFDDYAKL
              R  Y  ++  S+++L  +  PCF+  ++ ++        +I +   V    +VA L+ +  SW Y + +     ++F L+C+LQ++   D+A++
Subjt:  SSLARFRKEYVKKILGSMRLLLLWAFPCFILKTAREV-------IRILYAERVSWGLSVAILLAMTISWTYLSLISLSAAIVFHLMCNLQVIHFDDYAKL

Query:  LQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLEFLVVTASQFMTLFQTTGYSAMVTLINGGDFAVSAIVQVVGVILCLHGATKISHRAQGIASVASRW
         Q +S+V  ++ EH+ +R HL  ISHR+R F+LL  ++VT SQF +L  TT   A + +   G+ A+ ++  V  +++ L  A+KI+H+AQ +  +A++W
Subjt:  LQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLEFLVVTASQFMTLFQTTGYSAMVTLINGGDFAVSAIVQVVGVILCLHGATKISHRAQGIASVASRW

Query:  HALVTC---GPSDLSQPR----------YPNGNGNSETPSRLNSMTSAYSESDLESLDIVTMPTTTQLASYMSSYHKRQAFVMYLQMNPGGITIFGWTVD
        H   T       D   PR          YP  + N E+    +S      E D ++ +++     + +     S+ KRQA V Y + N  GIT+FG+T+D
Subjt:  HALVTC---GPSDLSQPR----------YPNGNGNSETPSRLNSMTSAYSESDLESLDIVTMPTTTQLASYMSSYHKRQAFVMYLQMNPGGITIFGWTVD

Query:  RALLNTIFFIELTLVTFVLGKTI
        R+ L+TIF IE++LV ++LGKTI
Subjt:  RALLNTIFFIELTLVTFVLGKTI

AT4G03820.1 Protein of unknown function (DUF3537)1.2e-4030.67Show/hide
Query:  FLGFHQSSVLSCAVSWSGFVLVGIVLPVVVLELTDCAACEKYQIKDFELDIVASQACLAAVSLLCLSHNLRKYGLKRFLTVDRQTSSLARFRKEYVKKIL
        FL F QS+ +   +SWS F L+ +++P++   +  CA C+    + ++  +  S +  A +S + LS   +KYG++RFL  D+      + R  Y  KI 
Subjt:  FLGFHQSSVLSCAVSWSGFVLVGIVLPVVVLELTDCAACEKYQIKDFELDIVASQACLAAVSLLCLSHNLRKYGLKRFLTVDRQTSSLARFRKEYVKKIL

Query:  GSMRLLLLWAFPCFILKT-------AREVIRILYAERVSWGLSVAILLAMTISWTYLSLISLSAAIVFHLMCNLQVIHFDDYAKLLQTE-SEVLLLIEEH
         SM+LL ++  P   L+        A    +I Y    +    +A  L ++ SW Y + + + A I++  +C+LQV+  D++A+   +E  +   ++ EH
Subjt:  GSMRLLLLWAFPCFILKT-------AREVIRILYAERVSWGLSVAILLAMTISWTYLSLISLSAAIVFHLMCNLQVIHFDDYAKLLQTE-SEVLLLIEEH

Query:  IFLRYHLSKISHRFRIFLLLEFLVVTASQFMTLFQTTGYSAMVTLINGGDFAVSAIVQVVGVILCLHGATKISHRAQGIASVASRWHALVTCGPSDL---
        + +R  L  +SHRFR F+LL    VTA+QFM L  T   S    +   G+ A+ +   V G+ +CL  AT+++H+AQ + S+A++W+   +    D+   
Subjt:  IFLRYHLSKISHRFRIFLLLEFLVVTASQFMTLFQTTGYSAMVTLINGGDFAVSAIVQVVGVILCLHGATKISHRAQGIASVASRWHALVTCGPSDL---

Query:  -SQPRYPNGNGNSETPSRLNSMTSAYSESDLESLDIVTMPTTTQLASYMSSYHKRQAFVMYLQMNPGGITIFGWTVDRALLNTIFFIELTLVTFVLGKTI
           P+ P    +S+  SR  ++  +  + +    D   +      A  +SS  KRQA V YL+ N  GIT++G+ VD+  L  IF IEL L+ ++L KTI
Subjt:  -SQPRYPNGNGNSETPSRLNSMTSAYSESDLESLDIVTMPTTTQLASYMSSYHKRQAFVMYLQMNPGGITIFGWTVDRALLNTIFFIELTLVTFVLGKTI

Query:  I
        +
Subjt:  I

AT4G22270.1 Protein of unknown function (DUF3537)1.9e-4632.07Show/hide
Query:  LGFHQSSVLSCAVSWSGFVLVGIVLPVVVLELTDCAACEKYQIKDFELDIVASQACLAAVSLLCLSHNLRKYGLKRFLTVDRQTSSLARFRKEYVKKILG
        L F QS+  +  +SWS F L+ +++P++   L  C+ C+ +  + +++ +  S +  A +S + LS   RK+G++RFL +D+      + R EY  +I  
Subjt:  LGFHQSSVLSCAVSWSGFVLVGIVLPVVVLELTDCAACEKYQIKDFELDIVASQACLAAVSLLCLSHNLRKYGLKRFLTVDRQTSSLARFRKEYVKKILG

Query:  SMRLLLLWAFPCFILKTAREV-------IRILYAERVSWGLSVAILLAMTISWTYLSLISLSAAIVFHLMCNLQVIHFDDYAKLLQTE-SEVLLLIEEHI
        S++ L+++  P   L+    +        +I Y         VA  L ++ SW Y + + +   I++ + C+LQ +  DD+A+   +E ++V   + EH 
Subjt:  SMRLLLLWAFPCFILKTAREV-------IRILYAERVSWGLSVAILLAMTISWTYLSLISLSAAIVFHLMCNLQVIHFDDYAKLLQTE-SEVLLLIEEHI

Query:  FLRYHLSKISHRFRIFLLLEFLVVTASQFMTLFQTTGYSAMVTLINGGDFAVSAIVQVVGVILCLHGATKISHRAQGIASVASRWHALVTCGPSDLSQPR
         +R +L  +SHRFR F+LL  ++VTA+QFM L  TT  S  V +   G+ A+ ++  V GV +CL  ATKI+H+AQ + S+A++W+   T    D     
Subjt:  FLRYHLSKISHRFRIFLLLEFLVVTASQFMTLFQTTGYSAMVTLINGGDFAVSAIVQVVGVILCLHGATKISHRAQGIASVASRWHALVTCGPSDLSQPR

Query:  YPNGNGNSETPSRLNSMTSAYSESDLESLDIVTMPTTTQLASYMSSYHKRQAFVMYLQMNPGGITIFGWTVDRALLNTIFFIELTLVTFVLGKTII
         P G+      S   +      + + E  D +       + +   SY KRQA V YL+ N  GIT++G+ VDR+ LNTIF IEL L+ ++L KTI+
Subjt:  YPNGNGNSETPSRLNSMTSAYSESDLESLDIVTMPTTTQLASYMSSYHKRQAFVMYLQMNPGGITIFGWTVDRALLNTIFFIELTLVTFVLGKTII


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGAGCTGCGTGATCAAGTGGACATTGAAGATCCTCAAGCTCATTTTCCACTTCTTGAGTCAAATCAGAATCAGAGCTCACAACCGACAGGGCTGGAAGACGACGA
AGAAGCCCATTTGGACAGTGCCCTCAAAGTGCTCGACGTATTGCTTGGCTTTTTGGGGTTCCATCAATCTTCTGTTTTGAGCTGTGCTGTGTCTTGGTCTGGTTTTGTTC
TTGTCGGTATTGTTCTGCCTGTTGTGGTGCTGGAACTCACTGATTGCGCCGCCTGTGAGAAGTATCAGATTAAGGATTTTGAGCTTGACATCGTGGCCTCGCAAGCTTGT
CTTGCAGCTGTGTCTCTGCTCTGTCTTTCTCATAATCTCAGAAAATATGGCCTAAAGAGGTTCCTTACTGTTGATAGGCAGACCAGCTCCTTGGCTCGCTTTCGTAAAGA
ATATGTCAAGAAGATTCTGGGTTCGATGCGTTTGCTTCTCTTGTGGGCATTTCCATGTTTCATTCTGAAGACAGCTCGAGAGGTAATTCGAATATTATATGCGGAACGTG
TATCGTGGGGATTATCTGTTGCTATCTTACTGGCTATGACCATATCCTGGACTTATCTGAGCTTGATCTCTCTATCAGCTGCCATTGTGTTTCATTTGATGTGCAATTTG
CAAGTCATCCACTTTGATGACTATGCGAAGCTGCTGCAAACAGAGTCTGAAGTTTTGTTATTAATAGAGGAGCATATTTTCCTACGCTATCATTTGTCCAAGATAAGCCA
CAGATTCCGAATCTTTCTTCTTCTAGAGTTCTTGGTTGTAACTGCAAGCCAATTTATGACTCTGTTCCAGACAACGGGATACAGTGCAATGGTCACTCTCATAAATGGTG
GTGATTTTGCAGTCTCTGCAATTGTTCAAGTGGTTGGAGTCATTCTTTGCTTGCATGGAGCAACAAAGATTTCCCACAGAGCCCAAGGAATTGCATCAGTGGCTAGTAGA
TGGCATGCTTTAGTCACTTGCGGTCCAAGCGATCTATCTCAACCTCGATATCCAAACGGTAACGGGAACTCGGAAACTCCCAGCAGACTGAACTCGATGACTAGCGCTTA
CTCTGAAAGTGATTTGGAGTCTCTGGACATTGTCACAATGCCTACAACTACACAGCTAGCTTCTTATATGTCCTCCTATCATAAAAGACAAGCATTTGTTATGTATTTGC
AGATGAATCCTGGAGGCATTACCATTTTTGGGTGGACAGTTGATAGAGCTCTGCTAAACACCATCTTCTTTATTGAACTCACACTGGTCACCTTCGTGCTTGGGAAGACG
ATAATTTTCTCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTGAGCTGCGTGATCAAGTGGACATTGAAGATCCTCAAGCTCATTTTCCACTTCTTGAGTCAAATCAGAATCAGAGCTCACAACCGACAGGGCTGGAAGACGACGA
AGAAGCCCATTTGGACAGTGCCCTCAAAGTGCTCGACGTATTGCTTGGCTTTTTGGGGTTCCATCAATCTTCTGTTTTGAGCTGTGCTGTGTCTTGGTCTGGTTTTGTTC
TTGTCGGTATTGTTCTGCCTGTTGTGGTGCTGGAACTCACTGATTGCGCCGCCTGTGAGAAGTATCAGATTAAGGATTTTGAGCTTGACATCGTGGCCTCGCAAGCTTGT
CTTGCAGCTGTGTCTCTGCTCTGTCTTTCTCATAATCTCAGAAAATATGGCCTAAAGAGGTTCCTTACTGTTGATAGGCAGACCAGCTCCTTGGCTCGCTTTCGTAAAGA
ATATGTCAAGAAGATTCTGGGTTCGATGCGTTTGCTTCTCTTGTGGGCATTTCCATGTTTCATTCTGAAGACAGCTCGAGAGGTAATTCGAATATTATATGCGGAACGTG
TATCGTGGGGATTATCTGTTGCTATCTTACTGGCTATGACCATATCCTGGACTTATCTGAGCTTGATCTCTCTATCAGCTGCCATTGTGTTTCATTTGATGTGCAATTTG
CAAGTCATCCACTTTGATGACTATGCGAAGCTGCTGCAAACAGAGTCTGAAGTTTTGTTATTAATAGAGGAGCATATTTTCCTACGCTATCATTTGTCCAAGATAAGCCA
CAGATTCCGAATCTTTCTTCTTCTAGAGTTCTTGGTTGTAACTGCAAGCCAATTTATGACTCTGTTCCAGACAACGGGATACAGTGCAATGGTCACTCTCATAAATGGTG
GTGATTTTGCAGTCTCTGCAATTGTTCAAGTGGTTGGAGTCATTCTTTGCTTGCATGGAGCAACAAAGATTTCCCACAGAGCCCAAGGAATTGCATCAGTGGCTAGTAGA
TGGCATGCTTTAGTCACTTGCGGTCCAAGCGATCTATCTCAACCTCGATATCCAAACGGTAACGGGAACTCGGAAACTCCCAGCAGACTGAACTCGATGACTAGCGCTTA
CTCTGAAAGTGATTTGGAGTCTCTGGACATTGTCACAATGCCTACAACTACACAGCTAGCTTCTTATATGTCCTCCTATCATAAAAGACAAGCATTTGTTATGTATTTGC
AGATGAATCCTGGAGGCATTACCATTTTTGGGTGGACAGTTGATAGAGCTCTGCTAAACACCATCTTCTTTATTGAACTCACACTGGTCACCTTCGTGCTTGGGAAGACG
ATAATTTTCTCCTGA
Protein sequenceShow/hide protein sequence
MAELRDQVDIEDPQAHFPLLESNQNQSSQPTGLEDDEEAHLDSALKVLDVLLGFLGFHQSSVLSCAVSWSGFVLVGIVLPVVVLELTDCAACEKYQIKDFELDIVASQAC
LAAVSLLCLSHNLRKYGLKRFLTVDRQTSSLARFRKEYVKKILGSMRLLLLWAFPCFILKTAREVIRILYAERVSWGLSVAILLAMTISWTYLSLISLSAAIVFHLMCNL
QVIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLEFLVVTASQFMTLFQTTGYSAMVTLINGGDFAVSAIVQVVGVILCLHGATKISHRAQGIASVASR
WHALVTCGPSDLSQPRYPNGNGNSETPSRLNSMTSAYSESDLESLDIVTMPTTTQLASYMSSYHKRQAFVMYLQMNPGGITIFGWTVDRALLNTIFFIELTLVTFVLGKT
IIFS