; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr022924 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr022924
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionProtein of unknown function (DUF3537)
Genome locationtig00000729:1064564..1066994
RNA-Seq ExpressionSgr022924
SyntenySgr022924
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR021924 - Protein of unknown function DUF3537


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7020327.1 hypothetical protein SDJN02_17011, partial [Cucurbita argyrosperma subsp. argyrosperma]1.9e-18682.58Show/hide
Query:  NLPLLLQPSYGRSKSLTFDEFRNFRISLKWCALDHSSVSGKFISYFTFLCLTVLVPVVTVVAFQVPADDPISFNKLVQLPESGLAAIGFFSLSRFFRRYG
        NLPLLLQ SY RSKSLT DE RNFRISLKW ALDHSS +GKF+SYF F+ LTV+VP+VT+V+ + PADDPIS NKLVQ+PESGLAAIGFFSL RFFRRYG
Subjt:  NLPLLLQPSYGRSKSLTFDEFRNFRISLKWCALDHSSVSGKFISYFTFLCLTVLVPVVTVVAFQVPADDPISFNKLVQLPESGLAAIGFFSLSRFFRRYG

Query:  LRQLLFLDGLQDDSLYVQRGYARELDKSFRYLAYILFPSFFVELAHKIIFFTTVKVSFPYVDSGGFPVNSILFVLVLAAWVYRTGVFLLVCVLFRLTCEL
        LRQLLFLDGLQ+DSLYVQRGYARELDK+FRYLAYILFPSFFVELAHKIIFF+TVK+SFP    GGFP+NSILF+ VLA+WVYRTGVFLLVCVLFRLTCEL
Subjt:  LRQLLFLDGLQDDSLYVQRGYARELDKSFRYLAYILFPSFFVELAHKIIFFTTVKVSFPYVDSGGFPVNSILFVLVLAAWVYRTGVFLLVCVLFRLTCEL

Query:  QILRLQGVHKLFEGSGGSGSDAVVIFEEHVRIRKQLWVTSHRYRFFIIGCLFVITVSQFVALLLVLASKTEKNFFNSGDVVVCSAVQLCGFFLCLFGAAR
        QILRLQGV+KLFE S GSGSDA+VIF EH+RIRKQLWVTSHRYRFFII CLF+ITVSQFVALLLVL SKT+KNFFNSGD+VVCS VQLCGFFLC+ GAAR
Subjt:  QILRLQGVHKLFEGSGGSGSDAVVIFEEHVRIRKQLWVTSHRYRFFIIGCLFVITVSQFVALLLVLASKTEKNFFNSGDVVVCSAVQLCGFFLCLFGAAR

Query:  ITHRAQGIVSIATRWHMIVTSASAGSNQSKAPTP-PPDDEQGGGDSDSDSDSSDVLISVSTQELPSSFQTRQALVSYLRHNNGGITVYGFALDRGLLHTL
        ITHRAQ I S+ATRWHM+VTSA+AG N +K P P PP  +  G D+DSDS SS+VLIS+STQE PSSFQTRQALVSYL+HNNGGIT++GFALDRGLLHTL
Subjt:  ITHRAQGIVSIATRWHMIVTSASAGSNQSKAPTP-PPDDEQGGGDSDSDSDSSDVLISVSTQELPSSFQTRQALVSYLRHNNGGITVYGFALDRGLLHTL

Query:  FAFEFSLVMWILSKVVVLS
        FAFEFSLVMWI+SKVVVLS
Subjt:  FAFEFSLVMWILSKVVVLS

XP_022132051.1 uncharacterized protein LOC111005015 [Momordica charantia]4.8e-19082.38Show/hide
Query:  MTSSD-TNLDVSTHHPVNLPLLLQPSYGRSKSLTFDEFRNFRISLKWCALDHSSVSGKFISYFTFLCLTVLVPVVTVVAFQVPADDPISFNKLVQLPESG
        MTS+D +N+ VS         LL P Y RSKSLT DE RNFRISLKWCALDHSS +GKF+SYF F+   V+VP V +VA  VP DDPISFNKLVQ+PESG
Subjt:  MTSSD-TNLDVSTHHPVNLPLLLQPSYGRSKSLTFDEFRNFRISLKWCALDHSSVSGKFISYFTFLCLTVLVPVVTVVAFQVPADDPISFNKLVQLPESG

Query:  LAAIGFFSLSRFFRRYGLRQLLFLDGLQDDSLYVQRGYARELDKSFRYLAYILFPSFFVELAHKIIFFTTVKVSFPYVDSGGFPVNSILFVLVLAAWVYR
        LAAIGFF+LSRFFRRYGLRQLLFLDGLQ+DS+YVQRGYARELDKSFRYLAYILFPSFFVELAHKI FFTTVK+SFP    GGFP+NSI+FVLVL AWVYR
Subjt:  LAAIGFFSLSRFFRRYGLRQLLFLDGLQDDSLYVQRGYARELDKSFRYLAYILFPSFFVELAHKIIFFTTVKVSFPYVDSGGFPVNSILFVLVLAAWVYR

Query:  TGVFLLVCVLFRLTCELQILRLQGVHKLFEGSGGSGSDAVVIFEEHVRIRKQLWVTSHRYRFFIIGCLFVITVSQFVALLLVLASKTEKNFFNSGDVVVC
        TGVFLLVC+LFRLTCELQILRLQGV KLFEGS GSGS+A VIF+EHVRI+KQLWVTSHRYRFFIIGCL VITVSQFVALLLVLASKT+KNFFNSGDVVVC
Subjt:  TGVFLLVCVLFRLTCELQILRLQGVHKLFEGSGGSGSDAVVIFEEHVRIRKQLWVTSHRYRFFIIGCLFVITVSQFVALLLVLASKTEKNFFNSGDVVVC

Query:  SAVQLCGFFLCLFGAARITHRAQGIVSIATRWHMIVTS--ASAGSNQSKAPTPPPDDEQGGGDSDSDSDSSDVLISVSTQELPSSFQTRQALVSYLRHNN
        SAVQLCGFFLCLFGAARITHRAQGIVSIA+RWHM+VTS  ++AGS+Q+K PTP PD   G GD D+DSDSS VLISVS  + PSSFQTRQALVSYLRHNN
Subjt:  SAVQLCGFFLCLFGAARITHRAQGIVSIATRWHMIVTS--ASAGSNQSKAPTPPPDDEQGGGDSDSDSDSSDVLISVSTQELPSSFQTRQALVSYLRHNN

Query:  GGITVYGFALDRGLLHTLFAFEFSLVMWILSKVVVLS
        GGITVYGFALDRGLLHTLFAFEFSLVMWILSKVVVLS
Subjt:  GGITVYGFALDRGLLHTLFAFEFSLVMWILSKVVVLS

XP_022951427.1 uncharacterized protein LOC111454247 [Cucurbita moschata]4.2e-18682.34Show/hide
Query:  NLPLLLQPSYGRSKSLTFDEFRNFRISLKWCALDHSSVSGKFISYFTFLCLTVLVPVVTVVAFQVPADDPISFNKLVQLPESGLAAIGFFSLSRFFRRYG
        NLPLLLQ SY RSKSLT DE RNFRISLKW ALDHSS +GKF+SYF F+ LTV+VP+VT+V+ + PADDPIS NKLVQ+PESGLAAIGFFSL RFFRRYG
Subjt:  NLPLLLQPSYGRSKSLTFDEFRNFRISLKWCALDHSSVSGKFISYFTFLCLTVLVPVVTVVAFQVPADDPISFNKLVQLPESGLAAIGFFSLSRFFRRYG

Query:  LRQLLFLDGLQDDSLYVQRGYARELDKSFRYLAYILFPSFFVELAHKIIFFTTVKVSFPYVDSGGFPVNSILFVLVLAAWVYRTGVFLLVCVLFRLTCEL
        LRQLLFLDGLQ+DSLYVQRGYARELDK+FRYLAYILFPSFFVELAHKIIFF+TVK+SFP    GGFP+NSILF+ VLA+WVYRTGVFLLVCVLFRLTCEL
Subjt:  LRQLLFLDGLQDDSLYVQRGYARELDKSFRYLAYILFPSFFVELAHKIIFFTTVKVSFPYVDSGGFPVNSILFVLVLAAWVYRTGVFLLVCVLFRLTCEL

Query:  QILRLQGVHKLFEGSGGSGSDAVVIFEEHVRIRKQLWVTSHRYRFFIIGCLFVITVSQFVALLLVLASKTEKNFFNSGDVVVCSAVQLCGFFLCLFGAAR
        QILRLQGV+KLFE S GSGSDA+VIF EH+RIRKQLWVTSHRYRFFII CLF+ITVSQFVALLL L SKT+KNFFNSGD+VVCS VQLCGFFLC+ GAAR
Subjt:  QILRLQGVHKLFEGSGGSGSDAVVIFEEHVRIRKQLWVTSHRYRFFIIGCLFVITVSQFVALLLVLASKTEKNFFNSGDVVVCSAVQLCGFFLCLFGAAR

Query:  ITHRAQGIVSIATRWHMIVTSASAGSNQSKAPTP-PPDDEQGGGDSDSDSDSSDVLISVSTQELPSSFQTRQALVSYLRHNNGGITVYGFALDRGLLHTL
        ITHRAQ I S+ATRWHM+VTSA+AG N +K P P PP  +  G D+DSDS SS+VLIS+STQE PSSFQTRQALVSYL+HNNGGIT++GFALDRGLLHTL
Subjt:  ITHRAQGIVSIATRWHMIVTSASAGSNQSKAPTP-PPDDEQGGGDSDSDSDSSDVLISVSTQELPSSFQTRQALVSYLRHNNGGITVYGFALDRGLLHTL

Query:  FAFEFSLVMWILSKVVVLS
        FAFEFSLVMWI+SKVVVLS
Subjt:  FAFEFSLVMWILSKVVVLS

XP_023002362.1 uncharacterized protein LOC111496225 [Cucurbita maxima]4.9e-18782.34Show/hide
Query:  NLPLLLQPSYGRSKSLTFDEFRNFRISLKWCALDHSSVSGKFISYFTFLCLTVLVPVVTVVAFQVPADDPISFNKLVQLPESGLAAIGFFSLSRFFRRYG
        NLPLLLQ SY RSKSLT DE RNFRISLKWCALDHSS +GKF+SYF F+ LTV+VP+VT+V+ + PADDPIS NKLVQ+PESGLAAIGFFSL RFFRRYG
Subjt:  NLPLLLQPSYGRSKSLTFDEFRNFRISLKWCALDHSSVSGKFISYFTFLCLTVLVPVVTVVAFQVPADDPISFNKLVQLPESGLAAIGFFSLSRFFRRYG

Query:  LRQLLFLDGLQDDSLYVQRGYARELDKSFRYLAYILFPSFFVELAHKIIFFTTVKVSFPYVDSGGFPVNSILFVLVLAAWVYRTGVFLLVCVLFRLTCEL
        LRQLLFLDGLQ+DSLYVQRGYARELDK+FRYLAY+LFPSFF ELAHKIIFF+TVK+SFP    GGFP+NSILF+ VL +WVYRTGVFLLVCVLFRLTCEL
Subjt:  LRQLLFLDGLQDDSLYVQRGYARELDKSFRYLAYILFPSFFVELAHKIIFFTTVKVSFPYVDSGGFPVNSILFVLVLAAWVYRTGVFLLVCVLFRLTCEL

Query:  QILRLQGVHKLFEGSGGSGSDAVVIFEEHVRIRKQLWVTSHRYRFFIIGCLFVITVSQFVALLLVLASKTEKNFFNSGDVVVCSAVQLCGFFLCLFGAAR
        QILRLQGV+KLFE S GSGSDA+VIF EH+RIRKQLWVTSHRYRFFII CLF+ITVSQFVALLLVL SKTEKNFFNSGD+VVCS VQLCGFFLC+ GAAR
Subjt:  QILRLQGVHKLFEGSGGSGSDAVVIFEEHVRIRKQLWVTSHRYRFFIIGCLFVITVSQFVALLLVLASKTEKNFFNSGDVVVCSAVQLCGFFLCLFGAAR

Query:  ITHRAQGIVSIATRWHMIVTSASAGSNQSKAPTP-PPDDEQGGGDSDSDSDSSDVLISVSTQELPSSFQTRQALVSYLRHNNGGITVYGFALDRGLLHTL
        ITHRAQ I S+ATRWHM+VTSA+AG N +K P P PP  +  G D+DSDS SS+VLIS+STQE PSSFQTRQALVSYL+HNNGGIT++GFALDRGLLHTL
Subjt:  ITHRAQGIVSIATRWHMIVTSASAGSNQSKAPTP-PPDDEQGGGDSDSDSDSSDVLISVSTQELPSSFQTRQALVSYLRHNNGGITVYGFALDRGLLHTL

Query:  FAFEFSLVMWILSKVVVLS
        FAFEFSLVMWI+SKVVVLS
Subjt:  FAFEFSLVMWILSKVVVLS

XP_023537461.1 uncharacterized protein LOC111798502 [Cucurbita pepo subsp. pepo]9.3e-18682.3Show/hide
Query:  NLPLLLQPSYGRSKSLTFDEFRNFRISLKWCALDHSSVSGKFISYFTFLCLTVLVPVVTVVAFQVPADDPISFNKLVQLPESGLAAIGFFSLSRFFRRYG
        NLPLLLQ SY RSKSLT DE RNFRISLKW ALDHSS +GKF+SYF F+ LTV+VP+VT+V+ + PADDPIS NKLVQ+PESGLAAIGFFSL RFFRRYG
Subjt:  NLPLLLQPSYGRSKSLTFDEFRNFRISLKWCALDHSSVSGKFISYFTFLCLTVLVPVVTVVAFQVPADDPISFNKLVQLPESGLAAIGFFSLSRFFRRYG

Query:  LRQLLFLDGLQDDSLYVQRGYARELDKSFRYLAYILFPSFFVELAHKIIFFTTVKVSFPYVDSGGFPVNSILFVLVLAAWVYRTGVFLLVCVLFRLTCEL
        LRQLLFLDGLQ+DSLYVQRGYARELDK+FRYLAYILFPSFFVELAHKIIFF+TVK+SFP    GGFP+NSILF+ VLA+WVYRTGVFLLVCVLFRLTCEL
Subjt:  LRQLLFLDGLQDDSLYVQRGYARELDKSFRYLAYILFPSFFVELAHKIIFFTTVKVSFPYVDSGGFPVNSILFVLVLAAWVYRTGVFLLVCVLFRLTCEL

Query:  QILRLQGVHKLFEGSGGSGSDAVVIFEEHVRIRKQLWVTSHRYRFFIIGCLFVITVSQFVALLLVLASKTEKNFFNSGDVVVCSAVQLCGFFLCLFGAAR
        QILRLQGV+KLFE S GSGSDA+VIF EH+RIRKQLWVTSHRYRFFII CLF+ITVSQFVALLLVL SKT+KNFFNSGD+VVCS V+LCGFFLC+ GAAR
Subjt:  QILRLQGVHKLFEGSGGSGSDAVVIFEEHVRIRKQLWVTSHRYRFFIIGCLFVITVSQFVALLLVLASKTEKNFFNSGDVVVCSAVQLCGFFLCLFGAAR

Query:  ITHRAQGIVSIATRWHMIVTSASAGSNQSKAPTP-PPDDEQGGGDSDSDSDSSDVLISVSTQELPSSFQTRQALVSYLRHNNGGITVYGFALDRGLLHTL
        ITHRAQ I S+ATRWHM+VTSA+AG N +K P P PP  +  G D+DSDS SS+VLIS+STQE PSSFQTRQALVSYL+HNNGGIT++GFALDRGLLHTL
Subjt:  ITHRAQGIVSIATRWHMIVTSASAGSNQSKAPTP-PPDDEQGGGDSDSDSDSSDVLISVSTQELPSSFQTRQALVSYLRHNNGGITVYGFALDRGLLHTL

Query:  FAFEFSLVMWILSKVVVL
        FAFEFSLVMWI+SKVVVL
Subjt:  FAFEFSLVMWILSKVVVL

TrEMBL top hitse value%identityAlignment
A0A0A0LS65 Uncharacterized protein2.2e-17276.57Show/hide
Query:  NLPLLLQPSYGRSKSLTFDEFRNFRISLKWCALDHSSVSGKFISYFTFLCLTVLVPVVTVVAFQVPADDPISFNKLVQLPESGLAAIGFFSLSRFFRRYG
        NLPLLLQ SY RSKS+T D+ RN+RISLKWCALDHSS +GKF SY  F  LT++VP VT++A Q P DDPISFNKLVQ+PESGLAA+GF SL RFFRRYG
Subjt:  NLPLLLQPSYGRSKSLTFDEFRNFRISLKWCALDHSSVSGKFISYFTFLCLTVLVPVVTVVAFQVPADDPISFNKLVQLPESGLAAIGFFSLSRFFRRYG

Query:  LRQLLFLDGLQDDSLYVQRGYARELDKSFRYLAYILFPSFFVELAHKIIFFTTVKVSFPYVDSGGFPVNSILFVLVLAAWVYRTGVFLLVCVLFRLTCEL
        LRQLLFL+GLQ+DS+YVQ+GYAREL+K FR +AYILFPSFFVELAHKIIFF+TVK+ FP     GFPVNSILF+LVL +WVYRTGVFLLVCVLFRLTCEL
Subjt:  LRQLLFLDGLQDDSLYVQRGYARELDKSFRYLAYILFPSFFVELAHKIIFFTTVKVSFPYVDSGGFPVNSILFVLVLAAWVYRTGVFLLVCVLFRLTCEL

Query:  QILRLQGVHKLFEGSGGSGSDAVVIFEEHVRIRKQLWVTSHRYRFFIIGCLFVITVSQFVALLLVLASKTEKNFFNSGDVVVCSAVQLCGFFLCLFGAAR
        QILRLQGVHKLFE S GS SD+V+IF EH+RIRKQL +TSHRYRFFII CL VITVSQFVALLLVLASKT+KNFFNSGDVVVCS VQLCGFFLCLFGAAR
Subjt:  QILRLQGVHKLFEGSGGSGSDAVVIFEEHVRIRKQLWVTSHRYRFFIIGCLFVITVSQFVALLLVLASKTEKNFFNSGDVVVCSAVQLCGFFLCLFGAAR

Query:  ITHRAQGIVSIATRWHMIVTSASAGSNQSKAPTPPP-------------DDEQGGGDSDSDSD-SSDVLISVSTQELPSSFQTRQALVSYLRHNNGGITV
        ITHRAQGI S+ATRWHM+VTSA++GSN +K   PPP                 G  + D+DSD SSDVLISV  +E  SSFQTRQALVSYL+HNNGGIT+
Subjt:  ITHRAQGIVSIATRWHMIVTSASAGSNQSKAPTPPP-------------DDEQGGGDSDSDSD-SSDVLISVSTQELPSSFQTRQALVSYLRHNNGGITV

Query:  YGFALDRGLLHTLFAFEFSLVMWILSKVVVL
        +G+ALDRGLLHTLFAFEFSLVM ILSKVVVL
Subjt:  YGFALDRGLLHTLFAFEFSLVMWILSKVVVL

A0A5A7VCT9 Uncharacterized protein8.0e-17577.57Show/hide
Query:  NLPLLLQPSYGRSKSLTFDEFRNFRISLKWCALDHSSVSGKFISYFTFLCLTVLVPVVTVVAFQVPADDPISFNKLVQLPESGLAAIGFFSLSRFFRRYG
        NLPLLLQ SY RSKS+T D+ RNFRISLKWCALDHSS  GKF SY  F  LT++VP VTV+A Q P DDPISFNKLVQ+PESGLAA+ F SL RFFRRYG
Subjt:  NLPLLLQPSYGRSKSLTFDEFRNFRISLKWCALDHSSVSGKFISYFTFLCLTVLVPVVTVVAFQVPADDPISFNKLVQLPESGLAAIGFFSLSRFFRRYG

Query:  LRQLLFLDGLQDDSLYVQRGYARELDKSFRYLAYILFPSFFVELAHKIIFFTTVKVSFPYVDSGGFPVNSILFVLVLAAWVYRTGVFLLVCVLFRLTCEL
        LRQLLFL+GLQ+DS+YVQ+GYAREL+K FR +AYILFPSFFVELAHKIIFF+TVK+SFP     GFP+NSILF+ VL +WVYRTGVFLLVCVLFRLTCEL
Subjt:  LRQLLFLDGLQDDSLYVQRGYARELDKSFRYLAYILFPSFFVELAHKIIFFTTVKVSFPYVDSGGFPVNSILFVLVLAAWVYRTGVFLLVCVLFRLTCEL

Query:  QILRLQGVHKLFEGSGGSGSDAVVIFEEHVRIRKQLWVTSHRYRFFIIGCLFVITVSQFVALLLVLASKTEKNFFNSGDVVVCSAVQLCGFFLCLFGAAR
        QILRLQGVHKLFE S GS SD+V+IF EH+RIRKQL +TSHRYRFFII CL VITVSQFVALLLVLASK +KNFFNSGDVVVCS VQLCGFFLCLFGAAR
Subjt:  QILRLQGVHKLFEGSGGSGSDAVVIFEEHVRIRKQLWVTSHRYRFFIIGCLFVITVSQFVALLLVLASKTEKNFFNSGDVVVCSAVQLCGFFLCLFGAAR

Query:  ITHRAQGIVSIATRWHMIVTSASAGSNQSKAPTPPP---------DDEQGGGDSDSDSD-SSDVLISVSTQELPSSFQTRQALVSYLRHNNGGITVYGFA
        ITHRAQGI SIATRWHM+VTSA++GSN +K+P   P         +   G  + D+DSD SSDVLISV  QE  SSFQTRQALVSYL+HNNGGIT++G+A
Subjt:  ITHRAQGIVSIATRWHMIVTSASAGSNQSKAPTPPP---------DDEQGGGDSDSDSD-SSDVLISVSTQELPSSFQTRQALVSYLRHNNGGITVYGFA

Query:  LDRGLLHTLFAFEFSLVMWILSKVVVLS
        LDRGLLHTLFAFEFSLVMWILSKVVVLS
Subjt:  LDRGLLHTLFAFEFSLVMWILSKVVVLS

A0A6J1BR58 uncharacterized protein LOC1110050152.3e-19082.38Show/hide
Query:  MTSSD-TNLDVSTHHPVNLPLLLQPSYGRSKSLTFDEFRNFRISLKWCALDHSSVSGKFISYFTFLCLTVLVPVVTVVAFQVPADDPISFNKLVQLPESG
        MTS+D +N+ VS         LL P Y RSKSLT DE RNFRISLKWCALDHSS +GKF+SYF F+   V+VP V +VA  VP DDPISFNKLVQ+PESG
Subjt:  MTSSD-TNLDVSTHHPVNLPLLLQPSYGRSKSLTFDEFRNFRISLKWCALDHSSVSGKFISYFTFLCLTVLVPVVTVVAFQVPADDPISFNKLVQLPESG

Query:  LAAIGFFSLSRFFRRYGLRQLLFLDGLQDDSLYVQRGYARELDKSFRYLAYILFPSFFVELAHKIIFFTTVKVSFPYVDSGGFPVNSILFVLVLAAWVYR
        LAAIGFF+LSRFFRRYGLRQLLFLDGLQ+DS+YVQRGYARELDKSFRYLAYILFPSFFVELAHKI FFTTVK+SFP    GGFP+NSI+FVLVL AWVYR
Subjt:  LAAIGFFSLSRFFRRYGLRQLLFLDGLQDDSLYVQRGYARELDKSFRYLAYILFPSFFVELAHKIIFFTTVKVSFPYVDSGGFPVNSILFVLVLAAWVYR

Query:  TGVFLLVCVLFRLTCELQILRLQGVHKLFEGSGGSGSDAVVIFEEHVRIRKQLWVTSHRYRFFIIGCLFVITVSQFVALLLVLASKTEKNFFNSGDVVVC
        TGVFLLVC+LFRLTCELQILRLQGV KLFEGS GSGS+A VIF+EHVRI+KQLWVTSHRYRFFIIGCL VITVSQFVALLLVLASKT+KNFFNSGDVVVC
Subjt:  TGVFLLVCVLFRLTCELQILRLQGVHKLFEGSGGSGSDAVVIFEEHVRIRKQLWVTSHRYRFFIIGCLFVITVSQFVALLLVLASKTEKNFFNSGDVVVC

Query:  SAVQLCGFFLCLFGAARITHRAQGIVSIATRWHMIVTS--ASAGSNQSKAPTPPPDDEQGGGDSDSDSDSSDVLISVSTQELPSSFQTRQALVSYLRHNN
        SAVQLCGFFLCLFGAARITHRAQGIVSIA+RWHM+VTS  ++AGS+Q+K PTP PD   G GD D+DSDSS VLISVS  + PSSFQTRQALVSYLRHNN
Subjt:  SAVQLCGFFLCLFGAARITHRAQGIVSIATRWHMIVTS--ASAGSNQSKAPTPPPDDEQGGGDSDSDSDSSDVLISVSTQELPSSFQTRQALVSYLRHNN

Query:  GGITVYGFALDRGLLHTLFAFEFSLVMWILSKVVVLS
        GGITVYGFALDRGLLHTLFAFEFSLVMWILSKVVVLS
Subjt:  GGITVYGFALDRGLLHTLFAFEFSLVMWILSKVVVLS

A0A6J1GHK9 uncharacterized protein LOC1114542472.0e-18682.34Show/hide
Query:  NLPLLLQPSYGRSKSLTFDEFRNFRISLKWCALDHSSVSGKFISYFTFLCLTVLVPVVTVVAFQVPADDPISFNKLVQLPESGLAAIGFFSLSRFFRRYG
        NLPLLLQ SY RSKSLT DE RNFRISLKW ALDHSS +GKF+SYF F+ LTV+VP+VT+V+ + PADDPIS NKLVQ+PESGLAAIGFFSL RFFRRYG
Subjt:  NLPLLLQPSYGRSKSLTFDEFRNFRISLKWCALDHSSVSGKFISYFTFLCLTVLVPVVTVVAFQVPADDPISFNKLVQLPESGLAAIGFFSLSRFFRRYG

Query:  LRQLLFLDGLQDDSLYVQRGYARELDKSFRYLAYILFPSFFVELAHKIIFFTTVKVSFPYVDSGGFPVNSILFVLVLAAWVYRTGVFLLVCVLFRLTCEL
        LRQLLFLDGLQ+DSLYVQRGYARELDK+FRYLAYILFPSFFVELAHKIIFF+TVK+SFP    GGFP+NSILF+ VLA+WVYRTGVFLLVCVLFRLTCEL
Subjt:  LRQLLFLDGLQDDSLYVQRGYARELDKSFRYLAYILFPSFFVELAHKIIFFTTVKVSFPYVDSGGFPVNSILFVLVLAAWVYRTGVFLLVCVLFRLTCEL

Query:  QILRLQGVHKLFEGSGGSGSDAVVIFEEHVRIRKQLWVTSHRYRFFIIGCLFVITVSQFVALLLVLASKTEKNFFNSGDVVVCSAVQLCGFFLCLFGAAR
        QILRLQGV+KLFE S GSGSDA+VIF EH+RIRKQLWVTSHRYRFFII CLF+ITVSQFVALLL L SKT+KNFFNSGD+VVCS VQLCGFFLC+ GAAR
Subjt:  QILRLQGVHKLFEGSGGSGSDAVVIFEEHVRIRKQLWVTSHRYRFFIIGCLFVITVSQFVALLLVLASKTEKNFFNSGDVVVCSAVQLCGFFLCLFGAAR

Query:  ITHRAQGIVSIATRWHMIVTSASAGSNQSKAPTP-PPDDEQGGGDSDSDSDSSDVLISVSTQELPSSFQTRQALVSYLRHNNGGITVYGFALDRGLLHTL
        ITHRAQ I S+ATRWHM+VTSA+AG N +K P P PP  +  G D+DSDS SS+VLIS+STQE PSSFQTRQALVSYL+HNNGGIT++GFALDRGLLHTL
Subjt:  ITHRAQGIVSIATRWHMIVTSASAGSNQSKAPTP-PPDDEQGGGDSDSDSDSSDVLISVSTQELPSSFQTRQALVSYLRHNNGGITVYGFALDRGLLHTL

Query:  FAFEFSLVMWILSKVVVLS
        FAFEFSLVMWI+SKVVVLS
Subjt:  FAFEFSLVMWILSKVVVLS

A0A6J1KJB4 uncharacterized protein LOC1114962252.4e-18782.34Show/hide
Query:  NLPLLLQPSYGRSKSLTFDEFRNFRISLKWCALDHSSVSGKFISYFTFLCLTVLVPVVTVVAFQVPADDPISFNKLVQLPESGLAAIGFFSLSRFFRRYG
        NLPLLLQ SY RSKSLT DE RNFRISLKWCALDHSS +GKF+SYF F+ LTV+VP+VT+V+ + PADDPIS NKLVQ+PESGLAAIGFFSL RFFRRYG
Subjt:  NLPLLLQPSYGRSKSLTFDEFRNFRISLKWCALDHSSVSGKFISYFTFLCLTVLVPVVTVVAFQVPADDPISFNKLVQLPESGLAAIGFFSLSRFFRRYG

Query:  LRQLLFLDGLQDDSLYVQRGYARELDKSFRYLAYILFPSFFVELAHKIIFFTTVKVSFPYVDSGGFPVNSILFVLVLAAWVYRTGVFLLVCVLFRLTCEL
        LRQLLFLDGLQ+DSLYVQRGYARELDK+FRYLAY+LFPSFF ELAHKIIFF+TVK+SFP    GGFP+NSILF+ VL +WVYRTGVFLLVCVLFRLTCEL
Subjt:  LRQLLFLDGLQDDSLYVQRGYARELDKSFRYLAYILFPSFFVELAHKIIFFTTVKVSFPYVDSGGFPVNSILFVLVLAAWVYRTGVFLLVCVLFRLTCEL

Query:  QILRLQGVHKLFEGSGGSGSDAVVIFEEHVRIRKQLWVTSHRYRFFIIGCLFVITVSQFVALLLVLASKTEKNFFNSGDVVVCSAVQLCGFFLCLFGAAR
        QILRLQGV+KLFE S GSGSDA+VIF EH+RIRKQLWVTSHRYRFFII CLF+ITVSQFVALLLVL SKTEKNFFNSGD+VVCS VQLCGFFLC+ GAAR
Subjt:  QILRLQGVHKLFEGSGGSGSDAVVIFEEHVRIRKQLWVTSHRYRFFIIGCLFVITVSQFVALLLVLASKTEKNFFNSGDVVVCSAVQLCGFFLCLFGAAR

Query:  ITHRAQGIVSIATRWHMIVTSASAGSNQSKAPTP-PPDDEQGGGDSDSDSDSSDVLISVSTQELPSSFQTRQALVSYLRHNNGGITVYGFALDRGLLHTL
        ITHRAQ I S+ATRWHM+VTSA+AG N +K P P PP  +  G D+DSDS SS+VLIS+STQE PSSFQTRQALVSYL+HNNGGIT++GFALDRGLLHTL
Subjt:  ITHRAQGIVSIATRWHMIVTSASAGSNQSKAPTP-PPDDEQGGGDSDSDSDSSDVLISVSTQELPSSFQTRQALVSYLRHNNGGITVYGFALDRGLLHTL

Query:  FAFEFSLVMWILSKVVVLS
        FAFEFSLVMWI+SKVVVLS
Subjt:  FAFEFSLVMWILSKVVVLS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G50630.1 Protein of unknown function (DUF3537)8.4e-6836.09Show/hide
Query:  YGRSKSLTFDEFRNFRISLKWCALDHSSVSGKFISYFTFLCLTVLVPVVTVVAFQVPADDPI---SFNKLVQLPESGLAAIGFFSLSRFFRRYGLRQLLF
        + R  S   DE  +FR  L+W  +DHSS     +S+  F+  T++VP ++         D      ++ +VQL  S +A + F  L+RF  +YGLR+ LF
Subjt:  YGRSKSLTFDEFRNFRISLKWCALDHSSVSGKFISYFTFLCLTVLVPVVTVVAFQVPADDPI---SFNKLVQLPESGLAAIGFFSLSRFFRRYGLRQLLF

Query:  LDGLQDDSLYVQRGYARELDKSFRYLAYILFPSFFVELAHKIIFFTTVKVSFPYVDSGGFPVNSILFVLVLAAWVYRTGVFLLVCVLFRLTCELQILRLQ
         D L D+S  V+R Y  +L+ S   ++Y + P F    A+KI ++ +     P++ +     +++  ++ L +W+YRT V  LVCVLFRL C LQILRLQ
Subjt:  LDGLQDDSLYVQRGYARELDKSFRYLAYILFPSFFVELAHKIIFFTTVKVSFPYVDSGGFPVNSILFVLVLAAWVYRTGVFLLVCVLFRLTCELQILRLQ

Query:  GVHKLFEGSGGSGSDAVVIFEEHVRIRKQLWVTSHRYRFFIIGCLFVITVSQFVALLLVLASKTEKNFFNSGDVVVCSAVQLCGFFLCLFGAARITHRAQ
           KLF+     GS    I  EH+RIR+ L + SHRYR FI+  L ++T SQF +LL+   + TE N + +G++ +CS   +    + L  A++ITH+AQ
Subjt:  GVHKLFEGSGGSGSDAVVIFEEHVRIRKQLWVTSHRYRFFIIGCLFVITVSQFVALLLVLASKTEKNFFNSGDVVVCSAVQLCGFFLCLFGAARITHRAQ

Query:  GIVSIATRWHMIVTSAS-----------------------AGSNQSKAPTPPPDDEQGGGDSDSDSDSSDVLISVSTQELPSSFQTRQALVSYLRHNNGG
         +  +A +WH+  T  S                         +N     T    D    GD + D D++D++   +   +  SFQ RQALVSY  +N+ G
Subjt:  GIVSIATRWHMIVTSAS-----------------------AGSNQSKAPTPPPDDEQGGGDSDSDSDSSDVLISVSTQELPSSFQTRQALVSYLRHNNGG

Query:  ITVYGFALDRGLLHTLFAFEFSLVMWILSKVVVLS
        ITVYGF LDRG LHT+F  E SLV+W+L K + +S
Subjt:  ITVYGFALDRGLLHTLFAFEFSLVMWILSKVVVLS

AT1G50630.2 Protein of unknown function (DUF3537)7.1e-5133.33Show/hide
Query:  YGRSKSLTFDEFRNFRISLKWCALDHSSVSGKFISYFTFLCLTVLVPVVTVVAFQVPADDPI---SFNKLVQLPESGLAAIGFFSLSRFFRRYGLRQLLF
        + R  S   DE  +FR  L+W  +DHSS     +S+  F+  T++VP ++         D      ++ +VQL  S +A + F  L+RF  +YGLR+ LF
Subjt:  YGRSKSLTFDEFRNFRISLKWCALDHSSVSGKFISYFTFLCLTVLVPVVTVVAFQVPADDPI---SFNKLVQLPESGLAAIGFFSLSRFFRRYGLRQLLF

Query:  LDGLQDDSLYVQRGYARELDKSFRYLAYILFPSFFVELAHKIIFFTTVKVSFPYVDSGGFPVNSILFVLVLAAWVYRTGVFLLVCVLFRLTCELQILRLQ
         D L D+S  V+R Y  +L+ S   ++Y + P F    A+KI ++ +     P++ +     +++  ++ L +W+YRT V  LVCVLFRL C LQILRLQ
Subjt:  LDGLQDDSLYVQRGYARELDKSFRYLAYILFPSFFVELAHKIIFFTTVKVSFPYVDSGGFPVNSILFVLVLAAWVYRTGVFLLVCVLFRLTCELQILRLQ

Query:  GVHKLFEGSGGSGSDAVVIFEEHVRIRKQLWVTSHRYRFFIIGCLFVITVSQFVALLLVLASKTEKNFFNSGDVVVCSAVQLCGFFLCLFGAARITHRAQ
           KLF+     GS    I  EH+RIR+ L + SHRYR FI+  L ++T SQF +LL+   + TE N + +G++ +CS   +    + L  A++ITH+AQ
Subjt:  GVHKLFEGSGGSGSDAVVIFEEHVRIRKQLWVTSHRYRFFIIGCLFVITVSQFVALLLVLASKTEKNFFNSGDVVVCSAVQLCGFFLCLFGAARITHRAQ

Query:  GIVSIATRWHMIVTSAS-----------------------AGSNQSKAPTPPPDDEQGGGDSDSDSDSSDVLISVSTQELPSSFQTRQAL
         +  +A +WH+  T  S                         +N     T    D    GD + D D++D++   +   +  SFQ RQAL
Subjt:  GIVSIATRWHMIVTSAS-----------------------AGSNQSKAPTPPPDDEQGGGDSDSDSDSSDVLISVSTQELPSSFQTRQAL

AT2G21080.1 unknown protein1.0e-12961.68Show/hide
Query:  VNLPLLLQPSYGRSKSLTFDEFRNFRISLKWCALDHSSVSGKFISYFTFLCLTVLVPVVTVVAFQVPADDPI------SFNKLVQLPESGLAAIGFFSLS
        ++LP L       SKSL  D+ RNFR+ LKWCALDHSS  GK +SY  F+  T+LVP+++ +  + P + P       SFN LVQ PESGLA IGF +L 
Subjt:  VNLPLLLQPSYGRSKSLTFDEFRNFRISLKWCALDHSSVSGKFISYFTFLCLTVLVPVVTVVAFQVPADDPI------SFNKLVQLPESGLAAIGFFSLS

Query:  RFFRRYGLRQLLFLDGLQDDSLYVQRGYARELDKSFRYLAYILFPSFFVELAHKIIFFTTVKVSFPYVDSGGFPVNSILFVLVLAAWVYRTGVFLLVCVL
         FFR Y L +LLFL    DDS  V+ GY+RELDK+ RYLAYIL PSF VEL HK IFF + +VSFP++ S    +N ++F LVL +WVYRTGVFLLVC+L
Subjt:  RFFRRYGLRQLLFLDGLQDDSLYVQRGYARELDKSFRYLAYILFPSFFVELAHKIIFFTTVKVSFPYVDSGGFPVNSILFVLVLAAWVYRTGVFLLVCVL

Query:  FRLTCELQILRLQGVHKLFEGSGGSGSDAVV-IFEEHVRIRKQLWVTSHRYRFFIIGCLFVITVSQFVALLLVLASKTEKNFFNSGDVVVCSAVQLCGFF
        FRLTCELQILR +G+HKLF+     GSD +  + +EHVRI+KQL  TSHRYRFFII    VI+ SQFVALLLVLASK+EK+F +SGD+VVCSAVQL GFF
Subjt:  FRLTCELQILRLQGVHKLFEGSGGSGSDAVV-IFEEHVRIRKQLWVTSHRYRFFIIGCLFVITVSQFVALLLVLASKTEKNFFNSGDVVVCSAVQLCGFF

Query:  LCLFGAARITHRAQGIVSIATRWHMIVTSASAGSNQSKAPTPPPDDEQGGGDSDSDSDSSD-VLISVS-TQELPSSFQTRQALVSYLRHNNGGITVYGFA
        LCL GAARITHRAQG+V IATRWHM +T A      S+A +P           +SD+DSSD + I+VS + +L S FQ RQALV YLRHNN GIT+YG+A
Subjt:  LCLFGAARITHRAQGIVSIATRWHMIVTSASAGSNQSKAPTPPPDDEQGGGDSDSDSDSSD-VLISVS-TQELPSSFQTRQALVSYLRHNNGGITVYGFA

Query:  LDRGLLHTLFAFEFSLVMWILSKVVVLS
        LDRGLLHTLFAFEFSLVMWILSKVVVLS
Subjt:  LDRGLLHTLFAFEFSLVMWILSKVVVLS

AT3G20300.1 Protein of unknown function (DUF3537)2.1e-6636.28Show/hide
Query:  YGRSKSLTFDEFRNFRISLKWCALDHSSVSGKFISYFTFLCLTVLVPVVTVVAFQVPADDPIS-----FNKLVQLPESGLAAIGFFSLSRFFRRYGLRQL
        + RS S   DE  +FR  L+W  +D SS     +S+  F+  T++VP  +   F +   D  S     ++ +VQL  S  AA+ F  LSRF  +YGLR+ 
Subjt:  YGRSKSLTFDEFRNFRISLKWCALDHSSVSGKFISYFTFLCLTVLVPVVTVVAFQVPADDPIS-----FNKLVQLPESGLAAIGFFSLSRFFRRYGLRQL

Query:  LFLDGLQDDSLYVQRGYARELDKSFRYLAYILFPSFFVELAHKIIFFTTVKVSFPYVDSGGFPVNSILFVLVLAAWVYRTGVFLLVCVLFRLTCELQILR
        LF D L D+S  V+ GY  +L++S + L+Y + P F    ++KI ++ +     P++       +++  ++ L +W+YRT V  LVCVLFRL C LQILR
Subjt:  LFLDGLQDDSLYVQRGYARELDKSFRYLAYILFPSFFVELAHKIIFFTTVKVSFPYVDSGGFPVNSILFVLVLAAWVYRTGVFLLVCVLFRLTCELQILR

Query:  LQGVHKLFEGSGGSGSDAVVIFEEHVRIRKQLWVTSHRYRFFIIGCLFVITVSQFVALLLVLASKTEKNFFNSGDVVVCSAVQLCGFFLCLFGAARITHR
        LQ   ++F+     GS    I  EH+RIR+ L + SHRYR FI+  L ++T SQF +LL+   +  E N + +G++ +CS   +    + L  A++ITH+
Subjt:  LQGVHKLFEGSGGSGSDAVVIFEEHVRIRKQLWVTSHRYRFFIIGCLFVITVSQFVALLLVLASKTEKNFFNSGDVVVCSAVQLCGFFLCLFGAARITHR

Query:  AQGIVSIATRWHMIVTSASAGSNQSKAPT----------PPPDDEQGG------GDSDSDSDSSDVLISVSTQELPSSFQTRQALVSYLRHNNGGITVYG
        AQ +  +A +WH+  T  S  +   + P            P DD+ G       GD + D D+++++ + +   +  SFQ RQALV+Y  +N  GITV+G
Subjt:  AQGIVSIATRWHMIVTSASAGSNQSKAPT----------PPPDDEQGG------GDSDSDSDSSDVLISVSTQELPSSFQTRQALVSYLRHNNGGITVYG

Query:  FALDRGLLHTLFAFEFSLVMWILSKVVVLS
        F LDR  LHT+F  E SLV+W+L K + +S
Subjt:  FALDRGLLHTLFAFEFSLVMWILSKVVVLS

AT4G22270.1 Protein of unknown function (DUF3537)5.3e-5433.5Show/hide
Query:  DHSSVSGKFISYFTFLCLTVLVPVVT---VVAFQVPADDPISFNKLVQLPESGLAAIGFFSLSRFFRRYGLRQLLFLDGLQDDSLYVQRGYARELDKSFR
        D S+     +S+  F  L V+VP+++   +V           ++ +VQL  S  A I F SLS + R++G+R+ LFLD L D S  V+  Y  E+ +S +
Subjt:  DHSSVSGKFISYFTFLCLTVLVPVVT---VVAFQVPADDPISFNKLVQLPESGLAAIGFFSLSRFFRRYGLRQLLFLDGLQDDSLYVQRGYARELDKSFR

Query:  YLAYILFPSFFVELAHKIIFFTTVKVSFPYVDSGGFPVNS--ILFVLVLAAWVYRTGVFLLVCVLFRLTCELQILRLQGVHKLFEGSGGSGSDAVVIFEE
         L   + PS  +E  ++I ++ +     PY+ +   P+ S  +   L L++W+YR  +F++VC+L+++TC LQ LRL    + F       +D      E
Subjt:  YLAYILFPSFFVELAHKIIFFTTVKVSFPYVDSGGFPVNS--ILFVLVLAAWVYRTGVFLLVCVLFRLTCELQILRLQGVHKLFEGSGGSGSDAVVIFEE

Query:  HVRIRKQLWVTSHRYRFFIIGCLFVITVSQFVALLLVLASKTEKNFFNSGDVVVCSAVQLCGFFLCLFGAARITHRAQGIVSIATRWHMIVTSASAGSNQ
        H +IR+ L + SHR+R FI+  L ++T +QF+ALL    +    N +  G++ +CS   + G F+CL  A +ITH+AQ + S+A +W++  T  S     
Subjt:  HVRIRKQLWVTSHRYRFFIIGCLFVITVSQFVALLLVLASKTEKNFFNSGDVVVCSAVQLCGFFLCLFGAARITHRAQGIVSIATRWHMIVTSASAGSNQ

Query:  SKAPT---------------PPPDDEQGGGDSDSDSDSSDVLISVSTQELPSSFQTRQALVSYLRHNNGGITVYGFALDRGLLHTLFAFEFSLVMWILSK
         + PT                  DDE+G GD D D+     + + +      S+Q RQALV+YL +N  GITVYGF +DR  L+T+F  E +L++W+L+K
Subjt:  SKAPT---------------PPPDDEQGGGDSDSDSDSSDVLISVSTQELPSSFQTRQALVSYLRHNNGGITVYGFALDRGLLHTLFAFEFSLVMWILSK

Query:  VVV
         +V
Subjt:  VVV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCTCCTCCGATACAAACCTCGACGTCTCCACTCACCACCCCGTCAACCTCCCTCTCCTTCTCCAACCTTCCTATGGAAGATCCAAATCCCTAACCTTCGACGAGTT
TCGTAACTTCCGCATCAGCCTCAAATGGTGCGCCCTCGACCACTCCTCCGTCTCCGGAAAGTTCATTTCCTACTTCACCTTCCTTTGCCTCACTGTTCTCGTCCCCGTCG
TAACCGTCGTAGCCTTCCAGGTCCCCGCCGACGATCCCATCTCCTTCAACAAACTCGTCCAGCTCCCCGAGTCCGGCCTCGCCGCGATCGGATTCTTCTCGCTGTCTCGT
TTCTTCCGCCGCTACGGCCTCCGCCAGCTCCTCTTCCTCGACGGCTTGCAAGACGACTCTCTCTACGTGCAGCGCGGTTACGCTCGCGAGCTCGACAAGTCGTTCCGCTA
CCTCGCTTACATTCTCTTCCCGTCGTTCTTCGTGGAGCTGGCTCACAAGATCATCTTCTTCACCACGGTGAAGGTTTCGTTTCCGTACGTGGACAGCGGCGGTTTTCCGG
TGAACTCGATACTGTTCGTGTTGGTGTTGGCGGCGTGGGTGTACCGGACGGGGGTGTTCTTGCTGGTGTGCGTGCTGTTTCGGCTGACGTGCGAGCTGCAGATTCTCCGG
CTGCAAGGGGTGCACAAGTTGTTCGAAGGGAGCGGCGGATCTGGATCGGACGCGGTGGTGATCTTCGAGGAACATGTGAGGATAAGAAAGCAGCTGTGGGTTACGAGCCA
CCGGTATCGGTTCTTCATTATTGGTTGCTTGTTCGTCATCACCGTCAGCCAGTTCGTGGCGCTGCTGCTGGTTTTGGCCTCCAAAACTGAGAAAAATTTCTTCAACTCCG
GCGACGTCGTGGTTTGCTCGGCGGTGCAGTTATGCGGATTCTTCCTGTGCCTGTTCGGAGCGGCGAGAATAACTCACAGAGCGCAAGGAATCGTGTCGATCGCAACCAGA
TGGCACATGATCGTCACTTCCGCCTCCGCCGGATCCAATCAGAGCAAAGCTCCAACGCCACCACCGGACGACGAGCAAGGCGGCGGCGACAGTGACAGCGATTCCGACTC
GTCGGACGTCTTAATTTCAGTGAGCACGCAGGAATTACCTTCCTCCTTCCAGACCAGACAAGCGTTAGTGTCGTATTTGAGGCACAACAATGGCGGAATCACGGTGTACG
GATTTGCATTGGATCGGGGATTGCTTCATACGCTCTTCGCGTTCGAATTCTCTCTGGTGATGTGGATTCTGAGCAAGGTCGTCGTATTATCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGACCTCCTCCGATACAAACCTCGACGTCTCCACTCACCACCCCGTCAACCTCCCTCTCCTTCTCCAACCTTCCTATGGAAGATCCAAATCCCTAACCTTCGACGAGTT
TCGTAACTTCCGCATCAGCCTCAAATGGTGCGCCCTCGACCACTCCTCCGTCTCCGGAAAGTTCATTTCCTACTTCACCTTCCTTTGCCTCACTGTTCTCGTCCCCGTCG
TAACCGTCGTAGCCTTCCAGGTCCCCGCCGACGATCCCATCTCCTTCAACAAACTCGTCCAGCTCCCCGAGTCCGGCCTCGCCGCGATCGGATTCTTCTCGCTGTCTCGT
TTCTTCCGCCGCTACGGCCTCCGCCAGCTCCTCTTCCTCGACGGCTTGCAAGACGACTCTCTCTACGTGCAGCGCGGTTACGCTCGCGAGCTCGACAAGTCGTTCCGCTA
CCTCGCTTACATTCTCTTCCCGTCGTTCTTCGTGGAGCTGGCTCACAAGATCATCTTCTTCACCACGGTGAAGGTTTCGTTTCCGTACGTGGACAGCGGCGGTTTTCCGG
TGAACTCGATACTGTTCGTGTTGGTGTTGGCGGCGTGGGTGTACCGGACGGGGGTGTTCTTGCTGGTGTGCGTGCTGTTTCGGCTGACGTGCGAGCTGCAGATTCTCCGG
CTGCAAGGGGTGCACAAGTTGTTCGAAGGGAGCGGCGGATCTGGATCGGACGCGGTGGTGATCTTCGAGGAACATGTGAGGATAAGAAAGCAGCTGTGGGTTACGAGCCA
CCGGTATCGGTTCTTCATTATTGGTTGCTTGTTCGTCATCACCGTCAGCCAGTTCGTGGCGCTGCTGCTGGTTTTGGCCTCCAAAACTGAGAAAAATTTCTTCAACTCCG
GCGACGTCGTGGTTTGCTCGGCGGTGCAGTTATGCGGATTCTTCCTGTGCCTGTTCGGAGCGGCGAGAATAACTCACAGAGCGCAAGGAATCGTGTCGATCGCAACCAGA
TGGCACATGATCGTCACTTCCGCCTCCGCCGGATCCAATCAGAGCAAAGCTCCAACGCCACCACCGGACGACGAGCAAGGCGGCGGCGACAGTGACAGCGATTCCGACTC
GTCGGACGTCTTAATTTCAGTGAGCACGCAGGAATTACCTTCCTCCTTCCAGACCAGACAAGCGTTAGTGTCGTATTTGAGGCACAACAATGGCGGAATCACGGTGTACG
GATTTGCATTGGATCGGGGATTGCTTCATACGCTCTTCGCGTTCGAATTCTCTCTGGTGATGTGGATTCTGAGCAAGGTCGTCGTATTATCTTAG
Protein sequenceShow/hide protein sequence
MTSSDTNLDVSTHHPVNLPLLLQPSYGRSKSLTFDEFRNFRISLKWCALDHSSVSGKFISYFTFLCLTVLVPVVTVVAFQVPADDPISFNKLVQLPESGLAAIGFFSLSR
FFRRYGLRQLLFLDGLQDDSLYVQRGYARELDKSFRYLAYILFPSFFVELAHKIIFFTTVKVSFPYVDSGGFPVNSILFVLVLAAWVYRTGVFLLVCVLFRLTCELQILR
LQGVHKLFEGSGGSGSDAVVIFEEHVRIRKQLWVTSHRYRFFIIGCLFVITVSQFVALLLVLASKTEKNFFNSGDVVVCSAVQLCGFFLCLFGAARITHRAQGIVSIATR
WHMIVTSASAGSNQSKAPTPPPDDEQGGGDSDSDSDSSDVLISVSTQELPSSFQTRQALVSYLRHNNGGITVYGFALDRGLLHTLFAFEFSLVMWILSKVVVLS