; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh05G007590 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh05G007590
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionFPL domain-containing protein
Genome locationCmo_Chr05:4112346..4121021
RNA-Seq ExpressionCmoCh05G007590
SyntenyCmoCh05G007590
Gene Ontology termsGO:0008333 - endosome to lysosome transport (biological process)
GO:0016197 - endosomal transport (biological process)
GO:1901096 - regulation of autophagosome maturation (biological process)
GO:0005770 - late endosome (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0036020 - endolysosome membrane (cellular component)
InterPro domainsIPR039272 - CLEC16A/TT9


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6598900.1 Protein TRANSPARENT TESTA 9, partial [Cucurbita argyrosperma subsp. sororia]2.0e-24080.24Show/hide
Query:  GIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANRSQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKN
        GIQIGAVTSLYLLCCILRIVKIKDLANTIS AFFCPLDSFSPHCEGRLIENMNWLSCANRSQSSGSDSIVRQPLDAESLRK VLYSSTPKTELEGAS KN
Subjt:  GIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANRSQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKN

Query:  GFRGSCLDLREALLSHITTGDDVEVLGALSVLATLLQTEGQINAVIQLPSLRYSFLIMHPISVELDESMLDALAILPQRKQHKKLLLEALVGEDSGEQQL
        G RGS LDLREALLSHITTGDDVEVLGALSVLATLLQTE                        ELDESMLDAL ILPQRKQHKKLLLEALVGEDSGEQQL
Subjt:  GFRGSCLDLREALLSHITTGDDVEVLGALSVLATLLQTEGQINAVIQLPSLRYSFLIMHPISVELDESMLDALAILPQRKQHKKLLLEALVGEDSGEQQL

Query:  FSSENASSKGGVNVELDGYLKKLKDYGISYFLKVGASPRALRIHTSTGLLS----------------------------------------SYR------
        FSSENASSKGGVNVELDGYLKKLKDYGISYFLKVGASPRALR      L+S                                        SY+      
Subjt:  FSSENASSKGGVNVELDGYLKKLKDYGISYFLKVGASPRALRIHTSTGLLS----------------------------------------SYR------

Query:  --------------------KLEEFAIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAAGQRMSELVKVFVLLHQLQSFSLGKALSEQPFMDPPSEASE
                            K  + AIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAAGQRMSELVKVFVLLHQL SFSLGKALSEQPFMDPPSEASE
Subjt:  --------------------KLEEFAIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAAGQRMSELVKVFVLLHQLQSFSLGKALSEQPFMDPPSEASE

Query:  CSRAKVAGLDASGPKPGAELRLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRMDEKHPRWLHLRIRPSTLPF
        CSRAKVAGLDAS PKPGAELRLDGSVPCRISFERGKER+FYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPR+DEKHPRWLHLRIRPSTLPF
Subjt:  CSRAKVAGLDASGPKPGAELRLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRMDEKHPRWLHLRIRPSTLPF

Query:  LDHPAKYGTPLNLKTKPFVDGRWILAFQDDNTCKSALSMVLEEINLQSQEVERRLKPLIDLERAVDSSKMHR
        LDHPAKYGTPLNLKTKPFVDGRWILAFQDD+TCKSALSMVLEEINLQSQEVERRLKPLIDLERAVDSSKMHR
Subjt:  LDHPAKYGTPLNLKTKPFVDGRWILAFQDDNTCKSALSMVLEEINLQSQEVERRLKPLIDLERAVDSSKMHR

KAG7029854.1 hypothetical protein SDJN02_08197, partial [Cucurbita argyrosperma subsp. argyrosperma]1.4e-25784.44Show/hide
Query:  GIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANRSQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKN
        GIQIGAVTSLYLLCCILRIVKIKDLANTIS AFFCPLDSFSPHCEGRLIENMNWLSCANRSQSSGSDSIVRQPLDAESLRK VLYSSTPKTELEGAS KN
Subjt:  GIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANRSQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKN

Query:  GFRGSCLDLREALLSHITTGDDVEVLGALSVLATLLQTEGQINAVIQLPSLRYSFLIMHPISVELDESMLDALAILPQRKQHKKLLLEALVGEDSGEQQL
        G RGS LDLREALLSHITTGDDVEVLGALSVLATLLQTEGQINAVIQLPSLRYSFLIMHPISVELDESMLDAL ILPQRKQHKKLLLEALVGEDSGEQQL
Subjt:  GFRGSCLDLREALLSHITTGDDVEVLGALSVLATLLQTEGQINAVIQLPSLRYSFLIMHPISVELDESMLDALAILPQRKQHKKLLLEALVGEDSGEQQL

Query:  FSSENASSKGGVNVELDGYLKKLKDYGISYFLKVGASPRALRIHTSTGLLS----------------------------------------SYR------
        FSSENASSKGGVNVELDGYLKKLKDYGISYFLKVGASPRALR      L+S                                        SY+      
Subjt:  FSSENASSKGGVNVELDGYLKKLKDYGISYFLKVGASPRALRIHTSTGLLS----------------------------------------SYR------

Query:  --------------------KLEEFAIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAAGQRMSELVKVFVLLHQLQSFSLGKALSEQPFMDPPSEASE
                            K  + AIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAAGQRMSELVKVFVLLHQL SFSLGKALSEQPFMDPPSEASE
Subjt:  --------------------KLEEFAIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAAGQRMSELVKVFVLLHQLQSFSLGKALSEQPFMDPPSEASE

Query:  CSRAKVAGLDASGPKPGAELRLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRMDEKHPRWLHLRIRPSTLPF
        CSRAKVAGLDAS PKPGAELRLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPR+DEKHPRWLHLRIRPSTLPF
Subjt:  CSRAKVAGLDASGPKPGAELRLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRMDEKHPRWLHLRIRPSTLPF

Query:  LDHPAKYGTPLNLKTKPFVDGRWILAFQDDNTCKSALSMVLEEINLQSQEVERRLKPLIDLERAVDSSKMHR
        LDHPAKYGTPLNLKTK FVDGRWILAFQDD+TCKSALSMVLEEINLQSQEVERRLKPLIDLERAVDSSKMHR
Subjt:  LDHPAKYGTPLNLKTKPFVDGRWILAFQDDNTCKSALSMVLEEINLQSQEVERRLKPLIDLERAVDSSKMHR

XP_022929612.1 uncharacterized protein LOC111436147 isoform X1 [Cucurbita moschata]1.4e-24982.17Show/hide
Query:  GIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANRSQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKN
        GIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANRSQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKN
Subjt:  GIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANRSQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKN

Query:  GFRGSCLDLREALLSHITTGDDVEVLGALSVLATLLQTEGQINAVIQLPSLRYSFLIMHPISVELDESMLDALAILPQRKQHKKLLLEALVGEDSGEQQL
        GFRGSCLDLREALLSHITTGDDVEVLGALSVLATLLQTE                        ELDESMLDALAILPQRKQHKKLLLEALVGEDSGEQQL
Subjt:  GFRGSCLDLREALLSHITTGDDVEVLGALSVLATLLQTEGQINAVIQLPSLRYSFLIMHPISVELDESMLDALAILPQRKQHKKLLLEALVGEDSGEQQL

Query:  FSSENASSKGGVNVELDGYLKKLKDYGISYFLKVGASPRALRIHTSTGLLS----------------------------------------SYR------
        FSSENASSKGGVNVELDGYLKKLKDYGISYFLKVGASPRALR      L+S                                        SY+      
Subjt:  FSSENASSKGGVNVELDGYLKKLKDYGISYFLKVGASPRALRIHTSTGLLS----------------------------------------SYR------

Query:  --------------------KLEEFAIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAAGQRMSELVKVFVLLHQLQSFSLGKALSEQPFMDPPSEASE
                            K  + AIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAAGQRMSELVKVFVLLHQLQSFSLGKALSEQPFMDPPSEASE
Subjt:  --------------------KLEEFAIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAAGQRMSELVKVFVLLHQLQSFSLGKALSEQPFMDPPSEASE

Query:  CSRAKVAGLDASGPKPGAELRLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRMDEKHPRWLHLRIRPSTLPF
        CSRAKVAGLDASGPKPGAELRLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRMDEKHPRWLHLRIRPSTLPF
Subjt:  CSRAKVAGLDASGPKPGAELRLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRMDEKHPRWLHLRIRPSTLPF

Query:  LDHPAKYGTPLNLKTKPFVDGRWILAFQDDNTCKSALSMVLEEINLQSQEVERRLKPLIDLERAVDSSKMHR
        LDHPAKYGTPLNLKTKPFVDGRWILAFQDDNTCKSALSMVLEEINLQSQEVERRLKPLIDLERAVDSSKMHR
Subjt:  LDHPAKYGTPLNLKTKPFVDGRWILAFQDDNTCKSALSMVLEEINLQSQEVERRLKPLIDLERAVDSSKMHR

XP_022929613.1 uncharacterized protein LOC111436147 isoform X2 [Cucurbita moschata]2.4e-24182.79Show/hide
Query:  GIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANRSQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKN
        GIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANRSQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKN
Subjt:  GIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANRSQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKN

Query:  GFRGSCLDLREALLSHITTGDDVEVLGALSVLATLLQTEGQINAVIQLPSLRYSFLIMHPISVELDESMLDALAILPQRKQHKKLLLEALVGEDSGEQQL
        GFRGSCLDLREALLSHITTGDDVEVLGALSVLATLLQTE                        ELDESMLDALAILPQRKQHKKLLLEALVGEDSGEQQL
Subjt:  GFRGSCLDLREALLSHITTGDDVEVLGALSVLATLLQTEGQINAVIQLPSLRYSFLIMHPISVELDESMLDALAILPQRKQHKKLLLEALVGEDSGEQQL

Query:  FSSENASSKGGVNVELDGYLKKLK--DYGISYFLKVGASPRAL------------------RIHTSTGLLSSYR--------------------------
        FSSENASSKGGVNVELDGYLKKLK  D  +S F +   S   L                    H    L  SY+                          
Subjt:  FSSENASSKGGVNVELDGYLKKLK--DYGISYFLKVGASPRAL------------------RIHTSTGLLSSYR--------------------------

Query:  KLEEFAIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAAGQRMSELVKVFVLLHQLQSFSLGKALSEQPFMDPPSEASECSRAKVAGLDASGPKPGAEL
        K  + AIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAAGQRMSELVKVFVLLHQLQSFSLGKALSEQPFMDPPSEASECSRAKVAGLDASGPKPGAEL
Subjt:  KLEEFAIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAAGQRMSELVKVFVLLHQLQSFSLGKALSEQPFMDPPSEASECSRAKVAGLDASGPKPGAEL

Query:  RLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRMDEKHPRWLHLRIRPSTLPFLDHPAKYGTPLNLKTKPFVD
        RLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRMDEKHPRWLHLRIRPSTLPFLDHPAKYGTPLNLKTKPFVD
Subjt:  RLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRMDEKHPRWLHLRIRPSTLPFLDHPAKYGTPLNLKTKPFVD

Query:  GRWILAFQDDNTCKSALSMVLEEINLQSQEVERRLKPLIDLERAVDSSKMHR
        GRWILAFQDDNTCKSALSMVLEEINLQSQEVERRLKPLIDLERAVDSSKMHR
Subjt:  GRWILAFQDDNTCKSALSMVLEEINLQSQEVERRLKPLIDLERAVDSSKMHR

XP_022929615.1 uncharacterized protein LOC111436147 isoform X4 [Cucurbita moschata]1.4e-24982.17Show/hide
Query:  GIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANRSQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKN
        GIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANRSQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKN
Subjt:  GIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANRSQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKN

Query:  GFRGSCLDLREALLSHITTGDDVEVLGALSVLATLLQTEGQINAVIQLPSLRYSFLIMHPISVELDESMLDALAILPQRKQHKKLLLEALVGEDSGEQQL
        GFRGSCLDLREALLSHITTGDDVEVLGALSVLATLLQTE                        ELDESMLDALAILPQRKQHKKLLLEALVGEDSGEQQL
Subjt:  GFRGSCLDLREALLSHITTGDDVEVLGALSVLATLLQTEGQINAVIQLPSLRYSFLIMHPISVELDESMLDALAILPQRKQHKKLLLEALVGEDSGEQQL

Query:  FSSENASSKGGVNVELDGYLKKLKDYGISYFLKVGASPRALRIHTSTGLLS----------------------------------------SYR------
        FSSENASSKGGVNVELDGYLKKLKDYGISYFLKVGASPRALR      L+S                                        SY+      
Subjt:  FSSENASSKGGVNVELDGYLKKLKDYGISYFLKVGASPRALRIHTSTGLLS----------------------------------------SYR------

Query:  --------------------KLEEFAIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAAGQRMSELVKVFVLLHQLQSFSLGKALSEQPFMDPPSEASE
                            K  + AIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAAGQRMSELVKVFVLLHQLQSFSLGKALSEQPFMDPPSEASE
Subjt:  --------------------KLEEFAIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAAGQRMSELVKVFVLLHQLQSFSLGKALSEQPFMDPPSEASE

Query:  CSRAKVAGLDASGPKPGAELRLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRMDEKHPRWLHLRIRPSTLPF
        CSRAKVAGLDASGPKPGAELRLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRMDEKHPRWLHLRIRPSTLPF
Subjt:  CSRAKVAGLDASGPKPGAELRLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRMDEKHPRWLHLRIRPSTLPF

Query:  LDHPAKYGTPLNLKTKPFVDGRWILAFQDDNTCKSALSMVLEEINLQSQEVERRLKPLIDLERAVDSSKMHR
        LDHPAKYGTPLNLKTKPFVDGRWILAFQDDNTCKSALSMVLEEINLQSQEVERRLKPLIDLERAVDSSKMHR
Subjt:  LDHPAKYGTPLNLKTKPFVDGRWILAFQDDNTCKSALSMVLEEINLQSQEVERRLKPLIDLERAVDSSKMHR

TrEMBL top hitse value%identityAlignment
A0A6J1EN85 uncharacterized protein LOC111436147 isoform X16.7e-25082.17Show/hide
Query:  GIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANRSQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKN
        GIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANRSQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKN
Subjt:  GIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANRSQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKN

Query:  GFRGSCLDLREALLSHITTGDDVEVLGALSVLATLLQTEGQINAVIQLPSLRYSFLIMHPISVELDESMLDALAILPQRKQHKKLLLEALVGEDSGEQQL
        GFRGSCLDLREALLSHITTGDDVEVLGALSVLATLLQTE                        ELDESMLDALAILPQRKQHKKLLLEALVGEDSGEQQL
Subjt:  GFRGSCLDLREALLSHITTGDDVEVLGALSVLATLLQTEGQINAVIQLPSLRYSFLIMHPISVELDESMLDALAILPQRKQHKKLLLEALVGEDSGEQQL

Query:  FSSENASSKGGVNVELDGYLKKLKDYGISYFLKVGASPRALRIHTSTGLLS----------------------------------------SYR------
        FSSENASSKGGVNVELDGYLKKLKDYGISYFLKVGASPRALR      L+S                                        SY+      
Subjt:  FSSENASSKGGVNVELDGYLKKLKDYGISYFLKVGASPRALRIHTSTGLLS----------------------------------------SYR------

Query:  --------------------KLEEFAIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAAGQRMSELVKVFVLLHQLQSFSLGKALSEQPFMDPPSEASE
                            K  + AIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAAGQRMSELVKVFVLLHQLQSFSLGKALSEQPFMDPPSEASE
Subjt:  --------------------KLEEFAIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAAGQRMSELVKVFVLLHQLQSFSLGKALSEQPFMDPPSEASE

Query:  CSRAKVAGLDASGPKPGAELRLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRMDEKHPRWLHLRIRPSTLPF
        CSRAKVAGLDASGPKPGAELRLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRMDEKHPRWLHLRIRPSTLPF
Subjt:  CSRAKVAGLDASGPKPGAELRLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRMDEKHPRWLHLRIRPSTLPF

Query:  LDHPAKYGTPLNLKTKPFVDGRWILAFQDDNTCKSALSMVLEEINLQSQEVERRLKPLIDLERAVDSSKMHR
        LDHPAKYGTPLNLKTKPFVDGRWILAFQDDNTCKSALSMVLEEINLQSQEVERRLKPLIDLERAVDSSKMHR
Subjt:  LDHPAKYGTPLNLKTKPFVDGRWILAFQDDNTCKSALSMVLEEINLQSQEVERRLKPLIDLERAVDSSKMHR

A0A6J1ENM5 uncharacterized protein LOC111436147 isoform X46.7e-25082.17Show/hide
Query:  GIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANRSQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKN
        GIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANRSQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKN
Subjt:  GIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANRSQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKN

Query:  GFRGSCLDLREALLSHITTGDDVEVLGALSVLATLLQTEGQINAVIQLPSLRYSFLIMHPISVELDESMLDALAILPQRKQHKKLLLEALVGEDSGEQQL
        GFRGSCLDLREALLSHITTGDDVEVLGALSVLATLLQTE                        ELDESMLDALAILPQRKQHKKLLLEALVGEDSGEQQL
Subjt:  GFRGSCLDLREALLSHITTGDDVEVLGALSVLATLLQTEGQINAVIQLPSLRYSFLIMHPISVELDESMLDALAILPQRKQHKKLLLEALVGEDSGEQQL

Query:  FSSENASSKGGVNVELDGYLKKLKDYGISYFLKVGASPRALRIHTSTGLLS----------------------------------------SYR------
        FSSENASSKGGVNVELDGYLKKLKDYGISYFLKVGASPRALR      L+S                                        SY+      
Subjt:  FSSENASSKGGVNVELDGYLKKLKDYGISYFLKVGASPRALRIHTSTGLLS----------------------------------------SYR------

Query:  --------------------KLEEFAIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAAGQRMSELVKVFVLLHQLQSFSLGKALSEQPFMDPPSEASE
                            K  + AIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAAGQRMSELVKVFVLLHQLQSFSLGKALSEQPFMDPPSEASE
Subjt:  --------------------KLEEFAIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAAGQRMSELVKVFVLLHQLQSFSLGKALSEQPFMDPPSEASE

Query:  CSRAKVAGLDASGPKPGAELRLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRMDEKHPRWLHLRIRPSTLPF
        CSRAKVAGLDASGPKPGAELRLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRMDEKHPRWLHLRIRPSTLPF
Subjt:  CSRAKVAGLDASGPKPGAELRLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRMDEKHPRWLHLRIRPSTLPF

Query:  LDHPAKYGTPLNLKTKPFVDGRWILAFQDDNTCKSALSMVLEEINLQSQEVERRLKPLIDLERAVDSSKMHR
        LDHPAKYGTPLNLKTKPFVDGRWILAFQDDNTCKSALSMVLEEINLQSQEVERRLKPLIDLERAVDSSKMHR
Subjt:  LDHPAKYGTPLNLKTKPFVDGRWILAFQDDNTCKSALSMVLEEINLQSQEVERRLKPLIDLERAVDSSKMHR

A0A6J1ESN2 uncharacterized protein LOC111436147 isoform X21.2e-24182.79Show/hide
Query:  GIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANRSQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKN
        GIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANRSQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKN
Subjt:  GIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANRSQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKN

Query:  GFRGSCLDLREALLSHITTGDDVEVLGALSVLATLLQTEGQINAVIQLPSLRYSFLIMHPISVELDESMLDALAILPQRKQHKKLLLEALVGEDSGEQQL
        GFRGSCLDLREALLSHITTGDDVEVLGALSVLATLLQTE                        ELDESMLDALAILPQRKQHKKLLLEALVGEDSGEQQL
Subjt:  GFRGSCLDLREALLSHITTGDDVEVLGALSVLATLLQTEGQINAVIQLPSLRYSFLIMHPISVELDESMLDALAILPQRKQHKKLLLEALVGEDSGEQQL

Query:  FSSENASSKGGVNVELDGYLKKLK--DYGISYFLKVGASPRAL------------------RIHTSTGLLSSYR--------------------------
        FSSENASSKGGVNVELDGYLKKLK  D  +S F +   S   L                    H    L  SY+                          
Subjt:  FSSENASSKGGVNVELDGYLKKLK--DYGISYFLKVGASPRAL------------------RIHTSTGLLSSYR--------------------------

Query:  KLEEFAIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAAGQRMSELVKVFVLLHQLQSFSLGKALSEQPFMDPPSEASECSRAKVAGLDASGPKPGAEL
        K  + AIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAAGQRMSELVKVFVLLHQLQSFSLGKALSEQPFMDPPSEASECSRAKVAGLDASGPKPGAEL
Subjt:  KLEEFAIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAAGQRMSELVKVFVLLHQLQSFSLGKALSEQPFMDPPSEASECSRAKVAGLDASGPKPGAEL

Query:  RLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRMDEKHPRWLHLRIRPSTLPFLDHPAKYGTPLNLKTKPFVD
        RLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRMDEKHPRWLHLRIRPSTLPFLDHPAKYGTPLNLKTKPFVD
Subjt:  RLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRMDEKHPRWLHLRIRPSTLPFLDHPAKYGTPLNLKTKPFVD

Query:  GRWILAFQDDNTCKSALSMVLEEINLQSQEVERRLKPLIDLERAVDSSKMHR
        GRWILAFQDDNTCKSALSMVLEEINLQSQEVERRLKPLIDLERAVDSSKMHR
Subjt:  GRWILAFQDDNTCKSALSMVLEEINLQSQEVERRLKPLIDLERAVDSSKMHR

A0A6J1K559 uncharacterized protein LOC111491772 isoform X23.7e-22477.68Show/hide
Query:  GIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANRSQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKN
        GIQIGAVTSLYLLCCILRIVKIKDLANTIS AFFCPLDSFSPHCEGRLI NMNWL CANRSQSSGSDSIVRQPLD ESLRKEVLYSS PKTELEGAS KN
Subjt:  GIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANRSQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKN

Query:  GFRGSCLDLREALLSHITTGDDVEVLGALSVLATLLQTEGQINAVIQLPSLRYSFLIMHPISVELDESMLDALAILPQRKQHKKLLLEALVGEDSGEQQL
        G RGS LDLREALLSHITTGDDVEVLGALSVLATLLQTE                        ELDESMLDAL ILPQRKQHKKLLLEALV EDSGEQQL
Subjt:  GFRGSCLDLREALLSHITTGDDVEVLGALSVLATLLQTEGQINAVIQLPSLRYSFLIMHPISVELDESMLDALAILPQRKQHKKLLLEALVGEDSGEQQL

Query:  FSSENASSKGGVNVELDGYLKKLK--DYGISYFLKVGASPRAL------------------RIHTSTGLLSSYR--------------------------
        FSSENASSKGG++VE+DGYLKKLK  D  +S F +   S   L                    H    L  SY+                          
Subjt:  FSSENASSKGGVNVELDGYLKKLK--DYGISYFLKVGASPRAL------------------RIHTSTGLLSSYR--------------------------

Query:  KLEEFAIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAAGQRMSELVKVFVLLHQLQSFSLGKALSEQPFMDPPSEASECSRAKVAGLDASGPKPGAEL
        K  + AIEAPSP KEPKC+LLYSAKASVVDAVPPESSLAAGQRMSELVKVFVLLHQLQSFSLGKAL EQPFMDPPSE SECSRAKVAGLDASGPKPGAEL
Subjt:  KLEEFAIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAAGQRMSELVKVFVLLHQLQSFSLGKALSEQPFMDPPSEASECSRAKVAGLDASGPKPGAEL

Query:  RLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRMDEKHPRWLHLRIRPSTLPFLDHPAKYGTPLNLKTKPFVD
        RLDGSVPCRISFERGKERHFYF+GTSMGTSGWIILA+ELPSKLN GIIRVAAPLAGSNPR+DEKHPRWLHLRIRPSTLPFLDHPAKYGT LNLKT+PFVD
Subjt:  RLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRMDEKHPRWLHLRIRPSTLPFLDHPAKYGTPLNLKTKPFVD

Query:  GRWILAFQDDNTCKSALSMVLEEINLQSQEVERRLKPLIDLERAVDSSKMH
        GRWILAFQDD+TCKSALSMVLEEINLQS EVERRLKPL+DLERAVD S+MH
Subjt:  GRWILAFQDDNTCKSALSMVLEEINLQSQEVERRLKPLIDLERAVDSSKMH

A0A6J1K759 uncharacterized protein LOC111491772 isoform X19.8e-23377.23Show/hide
Query:  GIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANRSQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKN
        GIQIGAVTSLYLLCCILRIVKIKDLANTIS AFFCPLDSFSPHCEGRLI NMNWL CANRSQSSGSDSIVRQPLD ESLRKEVLYSS PKTELEGAS KN
Subjt:  GIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANRSQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKN

Query:  GFRGSCLDLREALLSHITTGDDVEVLGALSVLATLLQTEGQINAVIQLPSLRYSFLIMHPISVELDESMLDALAILPQRKQHKKLLLEALVGEDSGEQQL
        G RGS LDLREALLSHITTGDDVEVLGALSVLATLLQTE                        ELDESMLDAL ILPQRKQHKKLLLEALV EDSGEQQL
Subjt:  GFRGSCLDLREALLSHITTGDDVEVLGALSVLATLLQTEGQINAVIQLPSLRYSFLIMHPISVELDESMLDALAILPQRKQHKKLLLEALVGEDSGEQQL

Query:  FSSENASSKGGVNVELDGYLKKLKDYGISYFLKVGASPRALRIHTSTGLLS----------------------------------------SYR------
        FSSENASSKGG++VE+DGYLKKLKDYGISYFLKVGASPRALR      L+S                                        SY+      
Subjt:  FSSENASSKGGVNVELDGYLKKLKDYGISYFLKVGASPRALRIHTSTGLLS----------------------------------------SYR------

Query:  --------------------KLEEFAIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAAGQRMSELVKVFVLLHQLQSFSLGKALSEQPFMDPPSEASE
                            K  + AIEAPSP KEPKC+LLYSAKASVVDAVPPESSLAAGQRMSELVKVFVLLHQLQSFSLGKAL EQPFMDPPSE SE
Subjt:  --------------------KLEEFAIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAAGQRMSELVKVFVLLHQLQSFSLGKALSEQPFMDPPSEASE

Query:  CSRAKVAGLDASGPKPGAELRLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRMDEKHPRWLHLRIRPSTLPF
        CSRAKVAGLDASGPKPGAELRLDGSVPCRISFERGKERHFYF+GTSMGTSGWIILA+ELPSKLN GIIRVAAPLAGSNPR+DEKHPRWLHLRIRPSTLPF
Subjt:  CSRAKVAGLDASGPKPGAELRLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRMDEKHPRWLHLRIRPSTLPF

Query:  LDHPAKYGTPLNLKTKPFVDGRWILAFQDDNTCKSALSMVLEEINLQSQEVERRLKPLIDLERAVDSSKMH
        LDHPAKYGT LNLKT+PFVDGRWILAFQDD+TCKSALSMVLEEINLQS EVERRLKPL+DLERAVD S+MH
Subjt:  LDHPAKYGTPLNLKTKPFVDGRWILAFQDDNTCKSALSMVLEEINLQSQEVERRLKPLIDLERAVDSSKMH

SwissProt top hitse value%identityAlignment
Q8W4P9 Protein TRANSPARENT TESTA 91.3e-10946.29Show/hide
Query:  IQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANRSQSSG-SDSIVRQPLDAESLRKEVLYSSTPKTEL-EGASMK
        I +  VTSLYLL CILRIVKIKDLAN  +   FCP+ +F       L++  + L+    +  +G  D  V +  + +        S    + L    + K
Subjt:  IQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANRSQSSG-SDSIVRQPLDAESLRKEVLYSSTPKTEL-EGASMK

Query:  NGFRGSCLDLREALLSHITTGDDVEVLGALSVLATLLQTEGQINAVIQLPSLRYSFLIMHPISVELDESMLDALAILPQRKQHKKLLLEALVGEDSGEQQ
        + F  S +  RE LL +I+ GDDV+  G+L VLATLLQT+                        EL+ESMLDA  ILPQRKQHKKLLL++LVGED+GE+Q
Subjt:  NGFRGSCLDLREALLSHITTGDDVEVLGALSVLATLLQTEGQINAVIQLPSLRYSFLIMHPISVELDESMLDALAILPQRKQHKKLLLEALVGEDSGEQQ

Query:  LFSSENASSKGGVNVELDGYLKKLKD-YGISYFLKVGA-SPRALR---IHTSTGLLS-------------------------------------SYRK--
        LFS  N S + G++ ELD YL++L++ +G+   L   A  PR  R   + T   LL                                      SY K  
Subjt:  LFSSENASSKGGVNVELDGYLKKLKD-YGISYFLKVGA-SPRALR---IHTSTGLLS-------------------------------------SYRK--

Query:  -------------------LEEF-----AIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAAGQRMSELVKVFVLLHQLQSFSLGKALSEQPFMDPPSE
                           L+E+      IEAPSP+KEPK +LL   ++S  D    ESS  AG+RM E+VKVFVLLHQLQ FSLG++L EQP + PP++
Subjt:  -------------------LEEF-----AIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAAGQRMSELVKVFVLLHQLQSFSLGKALSEQPFMDPPSE

Query:  ASECSRAKVAGLDASGPKPGAELRLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRMDEKHPRWLHLRIRPST
         SE SRA  AGLD S PKPG EL+L  +VPCRI+FERGKER F FL  S G SGWI+LA+      + GI+RV APLAG  PR+DEKHPRWLHLRIRPST
Subjt:  ASECSRAKVAGLDASGPKPGAELRLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRMDEKHPRWLHLRIRPST

Query:  LPFLDHPAKYGTPLNLKTKPFVDGRWILAFQDDNTCKSALSMVLEEINLQSQEVERRLKPLIDLER
        LP LD P K G    LK+K  VDGRWILAF+DD +C SA SMV  EI+LQ  EVERRL+PL DLER
Subjt:  LPFLDHPAKYGTPLNLKTKPFVDGRWILAFQDDNTCKSALSMVLEEINLQSQEVERRLKPLIDLER

Arabidopsis top hitse value%identityAlignment
AT3G28430.1 unknown protein9.4e-11146.29Show/hide
Query:  IQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANRSQSSG-SDSIVRQPLDAESLRKEVLYSSTPKTEL-EGASMK
        I +  VTSLYLL CILRIVKIKDLAN  +   FCP+ +F       L++  + L+    +  +G  D  V +  + +        S    + L    + K
Subjt:  IQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANRSQSSG-SDSIVRQPLDAESLRKEVLYSSTPKTEL-EGASMK

Query:  NGFRGSCLDLREALLSHITTGDDVEVLGALSVLATLLQTEGQINAVIQLPSLRYSFLIMHPISVELDESMLDALAILPQRKQHKKLLLEALVGEDSGEQQ
        + F  S +  RE LL +I+ GDDV+  G+L VLATLLQT+                        EL+ESMLDA  ILPQRKQHKKLLL++LVGED+GE+Q
Subjt:  NGFRGSCLDLREALLSHITTGDDVEVLGALSVLATLLQTEGQINAVIQLPSLRYSFLIMHPISVELDESMLDALAILPQRKQHKKLLLEALVGEDSGEQQ

Query:  LFSSENASSKGGVNVELDGYLKKLKD-YGISYFLKVGA-SPRALR---IHTSTGLLS-------------------------------------SYRK--
        LFS  N S + G++ ELD YL++L++ +G+   L   A  PR  R   + T   LL                                      SY K  
Subjt:  LFSSENASSKGGVNVELDGYLKKLKD-YGISYFLKVGA-SPRALR---IHTSTGLLS-------------------------------------SYRK--

Query:  -------------------LEEF-----AIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAAGQRMSELVKVFVLLHQLQSFSLGKALSEQPFMDPPSE
                           L+E+      IEAPSP+KEPK +LL   ++S  D    ESS  AG+RM E+VKVFVLLHQLQ FSLG++L EQP + PP++
Subjt:  -------------------LEEF-----AIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAAGQRMSELVKVFVLLHQLQSFSLGKALSEQPFMDPPSE

Query:  ASECSRAKVAGLDASGPKPGAELRLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRMDEKHPRWLHLRIRPST
         SE SRA  AGLD S PKPG EL+L  +VPCRI+FERGKER F FL  S G SGWI+LA+      + GI+RV APLAG  PR+DEKHPRWLHLRIRPST
Subjt:  ASECSRAKVAGLDASGPKPGAELRLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRMDEKHPRWLHLRIRPST

Query:  LPFLDHPAKYGTPLNLKTKPFVDGRWILAFQDDNTCKSALSMVLEEINLQSQEVERRLKPLIDLER
        LP LD P K G    LK+K  VDGRWILAF+DD +C SA SMV  EI+LQ  EVERRL+PL DLER
Subjt:  LPFLDHPAKYGTPLNLKTKPFVDGRWILAFQDDNTCKSALSMVLEEINLQSQEVERRLKPLIDLER


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAATTCAAATTGGAGCTGTCACTTCTCTATATTTACTTTGTTGCATTTTGCGCATAGTTAAAATAAAAGATCTGGCAAACACCATCTCTACTGCCTTTTTTTGTCC
ATTGGACTCTTTCTCCCCACATTGTGAAGGCAGACTGATCGAAAATATGAATTGGTTATCTTGTGCAAATAGAAGCCAGTCATCAGGAAGTGATAGCATTGTAAGGCAGC
CCTTGGATGCCGAGTCTTTAAGAAAAGAAGTATTATATTCTTCTACTCCTAAAACTGAGTTAGAAGGTGCGTCTATGAAAAATGGTTTTCGAGGCTCCTGCTTGGATTTG
AGGGAAGCTTTGCTTTCTCATATAACAACTGGGGACGATGTAGAAGTCTTGGGTGCTCTAAGTGTTCTGGCTACACTATTGCAGACTGAAGGTCAGATCAATGCAGTTAT
TCAACTTCCTTCACTCCGTTATTCTTTTCTTATCATGCATCCCATCTCTGTAGAGCTGGACGAATCAATGCTGGATGCGCTTGCAATCCTTCCTCAAAGAAAACAACATA
AGAAATTGTTATTGGAAGCCTTAGTTGGTGAGGATTCTGGCGAACAACAACTCTTTTCTTCAGAAAACGCCTCATCGAAAGGTGGCGTCAATGTTGAACTTGATGGTTAC
CTAAAGAAGCTTAAGGATTATGGCATTTCATATTTTCTTAAAGTAGGTGCAAGCCCTCGTGCCCTTAGGATTCATACAAGTACTGGGCTACTGAGCTCTTACAGGAAGCT
AGAGGAATTTGCAATTGAAGCCCCATCACCAAGGAAAGAACCAAAGTGCATGCTCTTGTATTCTGCAAAGGCTTCTGTCGTAGATGCTGTTCCACCCGAATCATCGCTCG
CTGCTGGTCAAAGAATGTCCGAGTTGGTAAAGGTATTTGTTCTTCTACACCAACTTCAGTCATTTTCCCTTGGCAAGGCTTTGTCAGAACAACCCTTTATGGACCCTCCC
TCAGAAGCTTCTGAATGCTCCCGTGCAAAGGTTGCTGGGCTCGATGCTTCGGGACCTAAACCGGGTGCGGAGTTGAGACTTGATGGATCTGTGCCTTGTAGAATTTCATT
TGAGAGAGGCAAAGAGCGCCATTTTTACTTTCTTGGAACTTCCATGGGAACTTCCGGATGGATAATTCTTGCTGAAGAACTGCCATCAAAACTGAATTGTGGAATTATTC
GAGTTGCTGCACCTCTTGCTGGATCAAATCCTAGAATGGATGAAAAGCATCCAAGATGGCTGCATTTGAGGATTCGTCCATCAACTTTACCCTTTTTGGATCATCCTGCT
AAATATGGTACCCCCTTAAACTTAAAGACAAAGCCTTTTGTGGATGGGAGATGGATCCTGGCATTCCAGGACGACAATACTTGCAAATCCGCTTTATCTATGGTTTTGGA
GGAGATTAATCTGCAAAGCCAGGAGGTCGAGAGACGACTTAAACCATTGATTGACCTCGAAAGAGCTGTAGATTCGTCCAAGATGCATCGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGAATTCAAATTGGAGCTGTCACTTCTCTATATTTACTTTGTTGCATTTTGCGCATAGTTAAAATAAAAGATCTGGCAAACACCATCTCTACTGCCTTTTTTTGTCC
ATTGGACTCTTTCTCCCCACATTGTGAAGGCAGACTGATCGAAAATATGAATTGGTTATCTTGTGCAAATAGAAGCCAGTCATCAGGAAGTGATAGCATTGTAAGGCAGC
CCTTGGATGCCGAGTCTTTAAGAAAAGAAGTATTATATTCTTCTACTCCTAAAACTGAGTTAGAAGGTGCGTCTATGAAAAATGGTTTTCGAGGCTCCTGCTTGGATTTG
AGGGAAGCTTTGCTTTCTCATATAACAACTGGGGACGATGTAGAAGTCTTGGGTGCTCTAAGTGTTCTGGCTACACTATTGCAGACTGAAGGTCAGATCAATGCAGTTAT
TCAACTTCCTTCACTCCGTTATTCTTTTCTTATCATGCATCCCATCTCTGTAGAGCTGGACGAATCAATGCTGGATGCGCTTGCAATCCTTCCTCAAAGAAAACAACATA
AGAAATTGTTATTGGAAGCCTTAGTTGGTGAGGATTCTGGCGAACAACAACTCTTTTCTTCAGAAAACGCCTCATCGAAAGGTGGCGTCAATGTTGAACTTGATGGTTAC
CTAAAGAAGCTTAAGGATTATGGCATTTCATATTTTCTTAAAGTAGGTGCAAGCCCTCGTGCCCTTAGGATTCATACAAGTACTGGGCTACTGAGCTCTTACAGGAAGCT
AGAGGAATTTGCAATTGAAGCCCCATCACCAAGGAAAGAACCAAAGTGCATGCTCTTGTATTCTGCAAAGGCTTCTGTCGTAGATGCTGTTCCACCCGAATCATCGCTCG
CTGCTGGTCAAAGAATGTCCGAGTTGGTAAAGGTATTTGTTCTTCTACACCAACTTCAGTCATTTTCCCTTGGCAAGGCTTTGTCAGAACAACCCTTTATGGACCCTCCC
TCAGAAGCTTCTGAATGCTCCCGTGCAAAGGTTGCTGGGCTCGATGCTTCGGGACCTAAACCGGGTGCGGAGTTGAGACTTGATGGATCTGTGCCTTGTAGAATTTCATT
TGAGAGAGGCAAAGAGCGCCATTTTTACTTTCTTGGAACTTCCATGGGAACTTCCGGATGGATAATTCTTGCTGAAGAACTGCCATCAAAACTGAATTGTGGAATTATTC
GAGTTGCTGCACCTCTTGCTGGATCAAATCCTAGAATGGATGAAAAGCATCCAAGATGGCTGCATTTGAGGATTCGTCCATCAACTTTACCCTTTTTGGATCATCCTGCT
AAATATGGTACCCCCTTAAACTTAAAGACAAAGCCTTTTGTGGATGGGAGATGGATCCTGGCATTCCAGGACGACAATACTTGCAAATCCGCTTTATCTATGGTTTTGGA
GGAGATTAATCTGCAAAGCCAGGAGGTCGAGAGACGACTTAAACCATTGATTGACCTCGAAAGAGCTGTAGATTCGTCCAAGATGCATCGTTAAGTTCTACTAAGTAAGT
CATTGACTTGTGTGTGTGAATGTATTAGCTCAGAAATGCAAAGGGAAAAGGAAAGTAGTTTTAGGATGTGAGAGTTTGTATCATTGTTTGTTTGTATAGTAATAGTAAGT
ATTCATTCATTTGTTGATCTGTAACCATCGGCTTCTGCTCTCTCCCTCATTGTGTAATCAGTATAATAAACATTAGTATATAATTGATTGATTGAACTTTAAGGTTTTCT
TTCG
Protein sequenceShow/hide protein sequence
MGIQIGAVTSLYLLCCILRIVKIKDLANTISTAFFCPLDSFSPHCEGRLIENMNWLSCANRSQSSGSDSIVRQPLDAESLRKEVLYSSTPKTELEGASMKNGFRGSCLDL
REALLSHITTGDDVEVLGALSVLATLLQTEGQINAVIQLPSLRYSFLIMHPISVELDESMLDALAILPQRKQHKKLLLEALVGEDSGEQQLFSSENASSKGGVNVELDGY
LKKLKDYGISYFLKVGASPRALRIHTSTGLLSSYRKLEEFAIEAPSPRKEPKCMLLYSAKASVVDAVPPESSLAAGQRMSELVKVFVLLHQLQSFSLGKALSEQPFMDPP
SEASECSRAKVAGLDASGPKPGAELRLDGSVPCRISFERGKERHFYFLGTSMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRMDEKHPRWLHLRIRPSTLPFLDHPA
KYGTPLNLKTKPFVDGRWILAFQDDNTCKSALSMVLEEINLQSQEVERRLKPLIDLERAVDSSKMHR