; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0023073 (gene) of Chayote v1 genome

Gene IDSed0023073
OrganismSechium edule (Chayote v1)
DescriptionProtein of unknown function (DUF789)
Genome locationLG12:4894499..4903442
RNA-Seq ExpressionSed0023073
SyntenySed0023073
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR008507 - Protein of unknown function DUF789


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6597557.1 hypothetical protein SDJN03_10737, partial [Cucurbita argyrosperma subsp. sororia]5.1e-19785.04Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRL---QQQQQQQQQQQQQQQKQSALDSKEAVVVVASAARIDELEKRSEFDDCRSWSTRSDCSVSDRGVADSTNL
        MSVSGGVSIARIRGENRFYHPPAMRRRL   QQQQQQQQQQQQQQQKQ+ALD KE  V  A+ ARIDELEK SE D+CRSWSTRSDCSVSDRGVADSTNL
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRL---QQQQQQQQQQQQQQQKQSALDSKEAVVVVASAARIDELEKRSEFDDCRSWSTRSDCSVSDRGVADSTNL

Query:  DRFLEHTTPVVPALRNPKASQRGWRNREVLEAPPYFLLGDLWESFKEWSAYGAGTPLLLNGSDSVVQYYVPYLSGIQLYVNPSKSSALSRRPGA-SDADS
        DRFLE+TTPVVPA    K S +GWRNREV EAPPYF+LGDLWESFKEWSAYGAG PLLLNGSDSVVQYYVPYLSGIQLY++PSKSSALSRR GA SDA+S
Subjt:  DRFLEHTTPVVPALRNPKASQRGWRNREVLEAPPYFLLGDLWESFKEWSAYGAGTPLLLNGSDSVVQYYVPYLSGIQLYVNPSKSSALSRRPGA-SDADS

Query:  SKETSSDGSSSCGAEKQSRTAIRDEWIQDSSVLGSQRALQMNVPSSESSSDESDSCYRQGQLVFEYLEREPPFGREPLSDKITILASRFPELMTFRSCDL
        SKET+SDGSS+CG EK++ TA++DEWIQDSSV GS+RA QMNVPS+ESSSDESDSCYRQGQLVFEY+E +PPF REPL+DKITILASRFPEL T+RSCDL
Subjt:  SKETSSDGSSSCGAEKQSRTAIRDEWIQDSSVLGSQRALQMNVPSSESSSDESDSCYRQGQLVFEYLEREPPFGREPLSDKITILASRFPELMTFRSCDL

Query:  SPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTTIQGNGADGLQFHWPRFREVNTADCPLKLQLLTFGLASYKFKISFWNSTGAEECPKANTLLQG
        SPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLST  QG G DGLQFHWPR REV+TA+ PLKLQL TFGLASYKFK SFWNSTG EECPKANTL Q 
Subjt:  SPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTTIQGNGADGLQFHWPRFREVNTADCPLKLQLLTFGLASYKFKISFWNSTGAEECPKANTLLQG

Query:  ADNWLRSLNVNHPDYRFFSSH
        ADNWLRSLNVNHPDYRFF+SH
Subjt:  ADNWLRSLNVNHPDYRFFSSH

XP_008438916.1 PREDICTED: uncharacterized protein LOC103483873 isoform X1 [Cucumis melo]1.9e-19684.29Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQ--KQSALDSKEAVVVVASAARIDELEKRSEFDDCRSWSTRSDCSVSDRGVADSTNLD
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQ  KQSALDSK+  VV A+ + ID+LEKRSEFD+CRSWSTRSDCSVSDRG+ DSTNLD
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQ--KQSALDSKEAVVVVASAARIDELEKRSEFDDCRSWSTRSDCSVSDRGVADSTNLD

Query:  RFLEHTTPVVPALRNPKASQRGWRNREVLEAPPYFLLGDLWESFKEWSAYGAGTPLLLNGSDSVVQYYVPYLSGIQLYVNPSKSSALSRRPGA-SDADSS
        RFLEHTTP+VPA   PK S RGWRNREV EA PYF+LGDLWESFKEWSAYGAG PLLLNGSDSVVQYYVPYLSGIQLYV+PSKS ALSRR GA SDA+SS
Subjt:  RFLEHTTPVVPALRNPKASQRGWRNREVLEAPPYFLLGDLWESFKEWSAYGAGTPLLLNGSDSVVQYYVPYLSGIQLYVNPSKSSALSRRPGA-SDADSS

Query:  KETSSDGSSSCGAEKQSRTAIRDEWIQDSSVLGSQRALQMNVPSSESSSDESDSCYRQGQLVFEYLEREPPFGREPLSDKITILASRFPELMTFRSCDLS
        KETSSDGSS+ GAEK+++TA+++EWIQD + LGSQRALQMNVPSSESSSDESDSCYR GQLVFEYLER+PPF REPL+DKIT+LASRFPEL T+RSCDLS
Subjt:  KETSSDGSSSCGAEKQSRTAIRDEWIQDSSVLGSQRALQMNVPSSESSSDESDSCYRQGQLVFEYLEREPPFGREPLSDKITILASRFPELMTFRSCDLS

Query:  PSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTTIQGNGADGLQFHWPRFREVNTADCPLKLQLLTFGLASYKFKISFWNSTGAEECPKANTLLQGA
        PSSWISVAWYPIYRIPTGPTLQSLDACFLTFH+LST +QG   DGLQFHWPR REV TADCPLKLQL  FGLASYKFKI FWNSTGAEEC KA++L Q A
Subjt:  PSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTTIQGNGADGLQFHWPRFREVNTADCPLKLQLLTFGLASYKFKISFWNSTGAEECPKANTLLQGA

Query:  DNWLRSLNVNHPDYRFFSSH
        D+WLR LNVNHPDYRFF+SH
Subjt:  DNWLRSLNVNHPDYRFFSSH

XP_008438917.1 PREDICTED: uncharacterized protein LOC103483873 isoform X2 [Cucumis melo]7.3e-19683.77Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQ--KQSALDSKEAVVVVASAARIDELEKRSEFDDCRSWSTRSDCSVSDRGVADSTNLD
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQ  KQSALDSK+  VV A+ + ID+LEKRSEFD+CRSWSTRSDCSVSDRG+ DSTNLD
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQ--KQSALDSKEAVVVVASAARIDELEKRSEFDDCRSWSTRSDCSVSDRGVADSTNLD

Query:  RFLEHTTPVVPALRNPKASQRGWRNREVLEAPPYFLLGDLWESFKEWSAYGAGTPLLLNGSDSVVQYYVPYLSGIQLYVNPSKSSALSRRPGASDADSSK
        RFLEHTTP+VPA   PK S RGWRNREV EA PYF+LGDLWESFKEWSAYGAG PLLLNGSDSVVQYYVPYLSGIQLYV+PSKS AL RR   SDA+SSK
Subjt:  RFLEHTTPVVPALRNPKASQRGWRNREVLEAPPYFLLGDLWESFKEWSAYGAGTPLLLNGSDSVVQYYVPYLSGIQLYVNPSKSSALSRRPGASDADSSK

Query:  ETSSDGSSSCGAEKQSRTAIRDEWIQDSSVLGSQRALQMNVPSSESSSDESDSCYRQGQLVFEYLEREPPFGREPLSDKITILASRFPELMTFRSCDLSP
        ETSSDGSS+ GAEK+++TA+++EWIQD + LGSQRALQMNVPSSESSSDESDSCYR GQLVFEYLER+PPF REPL+DKIT+LASRFPEL T+RSCDLSP
Subjt:  ETSSDGSSSCGAEKQSRTAIRDEWIQDSSVLGSQRALQMNVPSSESSSDESDSCYRQGQLVFEYLEREPPFGREPLSDKITILASRFPELMTFRSCDLSP

Query:  SSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTTIQGNGADGLQFHWPRFREVNTADCPLKLQLLTFGLASYKFKISFWNSTGAEECPKANTLLQGAD
        SSWISVAWYPIYRIPTGPTLQSLDACFLTFH+LST +QG   DGLQFHWPR REV TADCPLKLQL  FGLASYKFKI FWNSTGAEEC KA++L Q AD
Subjt:  SSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTTIQGNGADGLQFHWPRFREVNTADCPLKLQLLTFGLASYKFKISFWNSTGAEECPKANTLLQGAD

Query:  NWLRSLNVNHPDYRFFSSH
        +WLR LNVNHPDYRFF+SH
Subjt:  NWLRSLNVNHPDYRFFSSH

XP_022946432.1 uncharacterized protein LOC111450487 isoform X1 [Cucurbita moschata]4.3e-19684.04Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRL--------QQQQQQQQQQQQQQQKQSALDSKEAVVVVASAARIDELEKRSEFDDCRSWSTRSDCSVSDRGVA
        MSVSGGVSIARIRGENRFYHPPAMRRRL        QQQQQQQQQQQQQQQKQ+ALD KE  V  A+ ARIDELEK SE D+CRSWSTRSDCSVSDRGVA
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRL--------QQQQQQQQQQQQQQQKQSALDSKEAVVVVASAARIDELEKRSEFDDCRSWSTRSDCSVSDRGVA

Query:  DSTNLDRFLEHTTPVVPALRNPKASQRGWRNREVLEAPPYFLLGDLWESFKEWSAYGAGTPLLLNGSDSVVQYYVPYLSGIQLYVNPSKSSALSRRPGA-
        DSTNLDRFLE+TTPVVPA    K S +GWRNREV EAPPYF+LGDLWESFKEWSAYGAG PLLLNGSDSVVQYYVPYLSGIQLY++PSKSSALSRR GA 
Subjt:  DSTNLDRFLEHTTPVVPALRNPKASQRGWRNREVLEAPPYFLLGDLWESFKEWSAYGAGTPLLLNGSDSVVQYYVPYLSGIQLYVNPSKSSALSRRPGA-

Query:  SDADSSKETSSDGSSSCGAEKQSRTAIRDEWIQDSSVLGSQRALQMNVPSSESSSDESDSCYRQGQLVFEYLEREPPFGREPLSDKITILASRFPELMTF
        SDA+SSKET+SDGSS+CG  K++ TA++DEWIQDSSV GS+RALQMNVPS+ESSSDESDSCYRQGQLVFEY+E +PPF REPL+DKITILASRFPEL T+
Subjt:  SDADSSKETSSDGSSSCGAEKQSRTAIRDEWIQDSSVLGSQRALQMNVPSSESSSDESDSCYRQGQLVFEYLEREPPFGREPLSDKITILASRFPELMTF

Query:  RSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTTIQGNGADGLQFHWPRFREVNTADCPLKLQLLTFGLASYKFKISFWNSTGAEECPKAN
        RSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLST  QG G DGLQFHWPR REV+TA+ PLKLQL TFGLASYKFK SFWNSTG EECPKAN
Subjt:  RSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTTIQGNGADGLQFHWPRFREVNTADCPLKLQLLTFGLASYKFKISFWNSTGAEECPKAN

Query:  TLLQGADNWLRSLNVNHPDYRFFSSH
        TL Q ADNWLRSLNVNHPDYRFF+SH
Subjt:  TLLQGADNWLRSLNVNHPDYRFFSSH

XP_038877692.1 uncharacterized protein LOC120069924 [Benincasa hispida]1.0e-19785.11Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQ-----KQSALDSKEAVVVVASAARIDELEKRSEFDDCRSWSTRSDCSVSDRGVADST
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQ     KQS LDSK+  V+VAS A ID+LEKRSEFD+CRSWSTRSDCSVSDRG+ADST
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQ-----KQSALDSKEAVVVVASAARIDELEKRSEFDDCRSWSTRSDCSVSDRGVADST

Query:  NLDRFLEHTTPVVPALRNPKASQRGWRNREVLEAPPYFLLGDLWESFKEWSAYGAGTPLLLNGSDSVVQYYVPYLSGIQLYVNPSKSSALSRRPGA-SDA
        NLDRFLEHTTP+VPA   PK S RGWRNREVLEA PYF+LGDLWESFKEWSAYGAG PLLLNGSDSVVQYYVPYLSGIQLYV+PSKSS+LSRR G  SDA
Subjt:  NLDRFLEHTTPVVPALRNPKASQRGWRNREVLEAPPYFLLGDLWESFKEWSAYGAGTPLLLNGSDSVVQYYVPYLSGIQLYVNPSKSSALSRRPGA-SDA

Query:  DSSKETSSDGSSSCGAEKQSRTAIRDEWIQDSSVLGSQRALQMNVPSSESSSDESDSCYRQGQLVFEYLEREPPFGREPLSDKITILASRFPELMTFRSC
         SSKETSSDGSS+ GAEK+++TA++DEWIQD SV GSQRALQMNVPSSESSSDESDSCYR GQLVFEYLER+PPF REPL+DKITILASRFPEL T+RSC
Subjt:  DSSKETSSDGSSSCGAEKQSRTAIRDEWIQDSSVLGSQRALQMNVPSSESSSDESDSCYRQGQLVFEYLEREPPFGREPLSDKITILASRFPELMTFRSC

Query:  DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTTIQGNGADGLQFHWPRFREVNTADCPLKLQLLTFGLASYKFKISFWNSTGAEECPKANTLL
        DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFH+LST  QG   DGLQFHWPR REV TADCPLKLQL  FGLASYKFKI FWNSTGAEEC KA++L 
Subjt:  DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTTIQGNGADGLQFHWPRFREVNTADCPLKLQLLTFGLASYKFKISFWNSTGAEECPKANTLL

Query:  QGADNWLRSLNVNHPDYRFFSSH
        Q ADNWLR LNVNHPDYRFF+SH
Subjt:  QGADNWLRSLNVNHPDYRFFSSH

TrEMBL top hitse value%identityAlignment
A0A0A0L5V4 Uncharacterized protein3.3e-19483.97Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQKQSALDSKEAVVVVASAARIDELEKRSEFDDCRSWSTRSDCSVSDRGVADSTNLDRF
        MSVSGGVSIARIRGENRFYHPPAMRRRL   QQQQQQQQQQQ KQSALDSK+  VV A+ + ID+LEKRSEFD+CRSWSTRSDCSVSDRG+ADSTNLDRF
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQKQSALDSKEAVVVVASAARIDELEKRSEFDDCRSWSTRSDCSVSDRGVADSTNLDRF

Query:  LEHTTPVVPALRNPKASQRGWRNREVLEAPPYFLLGDLWESFKEWSAYGAGTPLLLNGSDSVVQYYVPYLSGIQLYVNPSKSSALSRRPGA-SDADSSKE
        LEHTTP+VPA   PK S RGWRNREV EA PYF+LGDLWESFKEWSAYGAG PLLLNGSDSVVQYYVPYLSGIQLYV+PSKSSALSRR GA SDA+SSKE
Subjt:  LEHTTPVVPALRNPKASQRGWRNREVLEAPPYFLLGDLWESFKEWSAYGAGTPLLLNGSDSVVQYYVPYLSGIQLYVNPSKSSALSRRPGA-SDADSSKE

Query:  TSSDGSSSCGAEKQSRTAIRDEWIQDSSVLGSQRALQMNVPSSESSSDESDSCYRQGQLVFEYLEREPPFGREPLSDKITILASRFPELMTFRSCDLSPS
        TSSDGSS+ GAEK+++TA+++EWIQD +V GSQRALQMNVPSSESSSDESDSCYR GQLVFEYLER+PPF REPL+DKIT+LASRF EL T+RSCDLSPS
Subjt:  TSSDGSSSCGAEKQSRTAIRDEWIQDSSVLGSQRALQMNVPSSESSSDESDSCYRQGQLVFEYLEREPPFGREPLSDKITILASRFPELMTFRSCDLSPS

Query:  SWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTTIQGNGADGLQFHWPRFREVNTADCPLKLQLLTFGLASYKFKISFWNSTGAEECPKANTLLQGADN
        SWISVAWYPIYRIPTGPTLQSLDACFLTFH+LST  QG   DGLQFHWPR REV TADCPLKLQL  FGLASYKFKI FWNSTGAEEC KA++L Q AD+
Subjt:  SWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTTIQGNGADGLQFHWPRFREVNTADCPLKLQLLTFGLASYKFKISFWNSTGAEECPKANTLLQGADN

Query:  WLRSLNVNHPDYRFFSSH
        WLR LNVNHPDYRFF+SH
Subjt:  WLRSLNVNHPDYRFFSSH

A0A1S3AY60 uncharacterized protein LOC103483873 isoform X19.3e-19784.29Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQ--KQSALDSKEAVVVVASAARIDELEKRSEFDDCRSWSTRSDCSVSDRGVADSTNLD
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQ  KQSALDSK+  VV A+ + ID+LEKRSEFD+CRSWSTRSDCSVSDRG+ DSTNLD
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQ--KQSALDSKEAVVVVASAARIDELEKRSEFDDCRSWSTRSDCSVSDRGVADSTNLD

Query:  RFLEHTTPVVPALRNPKASQRGWRNREVLEAPPYFLLGDLWESFKEWSAYGAGTPLLLNGSDSVVQYYVPYLSGIQLYVNPSKSSALSRRPGA-SDADSS
        RFLEHTTP+VPA   PK S RGWRNREV EA PYF+LGDLWESFKEWSAYGAG PLLLNGSDSVVQYYVPYLSGIQLYV+PSKS ALSRR GA SDA+SS
Subjt:  RFLEHTTPVVPALRNPKASQRGWRNREVLEAPPYFLLGDLWESFKEWSAYGAGTPLLLNGSDSVVQYYVPYLSGIQLYVNPSKSSALSRRPGA-SDADSS

Query:  KETSSDGSSSCGAEKQSRTAIRDEWIQDSSVLGSQRALQMNVPSSESSSDESDSCYRQGQLVFEYLEREPPFGREPLSDKITILASRFPELMTFRSCDLS
        KETSSDGSS+ GAEK+++TA+++EWIQD + LGSQRALQMNVPSSESSSDESDSCYR GQLVFEYLER+PPF REPL+DKIT+LASRFPEL T+RSCDLS
Subjt:  KETSSDGSSSCGAEKQSRTAIRDEWIQDSSVLGSQRALQMNVPSSESSSDESDSCYRQGQLVFEYLEREPPFGREPLSDKITILASRFPELMTFRSCDLS

Query:  PSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTTIQGNGADGLQFHWPRFREVNTADCPLKLQLLTFGLASYKFKISFWNSTGAEECPKANTLLQGA
        PSSWISVAWYPIYRIPTGPTLQSLDACFLTFH+LST +QG   DGLQFHWPR REV TADCPLKLQL  FGLASYKFKI FWNSTGAEEC KA++L Q A
Subjt:  PSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTTIQGNGADGLQFHWPRFREVNTADCPLKLQLLTFGLASYKFKISFWNSTGAEECPKANTLLQGA

Query:  DNWLRSLNVNHPDYRFFSSH
        D+WLR LNVNHPDYRFF+SH
Subjt:  DNWLRSLNVNHPDYRFFSSH

A0A1S3AY77 uncharacterized protein LOC103483873 isoform X23.6e-19683.77Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQ--KQSALDSKEAVVVVASAARIDELEKRSEFDDCRSWSTRSDCSVSDRGVADSTNLD
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQ  KQSALDSK+  VV A+ + ID+LEKRSEFD+CRSWSTRSDCSVSDRG+ DSTNLD
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQ--KQSALDSKEAVVVVASAARIDELEKRSEFDDCRSWSTRSDCSVSDRGVADSTNLD

Query:  RFLEHTTPVVPALRNPKASQRGWRNREVLEAPPYFLLGDLWESFKEWSAYGAGTPLLLNGSDSVVQYYVPYLSGIQLYVNPSKSSALSRRPGASDADSSK
        RFLEHTTP+VPA   PK S RGWRNREV EA PYF+LGDLWESFKEWSAYGAG PLLLNGSDSVVQYYVPYLSGIQLYV+PSKS AL RR   SDA+SSK
Subjt:  RFLEHTTPVVPALRNPKASQRGWRNREVLEAPPYFLLGDLWESFKEWSAYGAGTPLLLNGSDSVVQYYVPYLSGIQLYVNPSKSSALSRRPGASDADSSK

Query:  ETSSDGSSSCGAEKQSRTAIRDEWIQDSSVLGSQRALQMNVPSSESSSDESDSCYRQGQLVFEYLEREPPFGREPLSDKITILASRFPELMTFRSCDLSP
        ETSSDGSS+ GAEK+++TA+++EWIQD + LGSQRALQMNVPSSESSSDESDSCYR GQLVFEYLER+PPF REPL+DKIT+LASRFPEL T+RSCDLSP
Subjt:  ETSSDGSSSCGAEKQSRTAIRDEWIQDSSVLGSQRALQMNVPSSESSSDESDSCYRQGQLVFEYLEREPPFGREPLSDKITILASRFPELMTFRSCDLSP

Query:  SSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTTIQGNGADGLQFHWPRFREVNTADCPLKLQLLTFGLASYKFKISFWNSTGAEECPKANTLLQGAD
        SSWISVAWYPIYRIPTGPTLQSLDACFLTFH+LST +QG   DGLQFHWPR REV TADCPLKLQL  FGLASYKFKI FWNSTGAEEC KA++L Q AD
Subjt:  SSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTTIQGNGADGLQFHWPRFREVNTADCPLKLQLLTFGLASYKFKISFWNSTGAEECPKANTLLQGAD

Query:  NWLRSLNVNHPDYRFFSSH
        +WLR LNVNHPDYRFF+SH
Subjt:  NWLRSLNVNHPDYRFFSSH

A0A6J1G3Q7 uncharacterized protein LOC111450487 isoform X12.1e-19684.04Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRL--------QQQQQQQQQQQQQQQKQSALDSKEAVVVVASAARIDELEKRSEFDDCRSWSTRSDCSVSDRGVA
        MSVSGGVSIARIRGENRFYHPPAMRRRL        QQQQQQQQQQQQQQQKQ+ALD KE  V  A+ ARIDELEK SE D+CRSWSTRSDCSVSDRGVA
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRL--------QQQQQQQQQQQQQQQKQSALDSKEAVVVVASAARIDELEKRSEFDDCRSWSTRSDCSVSDRGVA

Query:  DSTNLDRFLEHTTPVVPALRNPKASQRGWRNREVLEAPPYFLLGDLWESFKEWSAYGAGTPLLLNGSDSVVQYYVPYLSGIQLYVNPSKSSALSRRPGA-
        DSTNLDRFLE+TTPVVPA    K S +GWRNREV EAPPYF+LGDLWESFKEWSAYGAG PLLLNGSDSVVQYYVPYLSGIQLY++PSKSSALSRR GA 
Subjt:  DSTNLDRFLEHTTPVVPALRNPKASQRGWRNREVLEAPPYFLLGDLWESFKEWSAYGAGTPLLLNGSDSVVQYYVPYLSGIQLYVNPSKSSALSRRPGA-

Query:  SDADSSKETSSDGSSSCGAEKQSRTAIRDEWIQDSSVLGSQRALQMNVPSSESSSDESDSCYRQGQLVFEYLEREPPFGREPLSDKITILASRFPELMTF
        SDA+SSKET+SDGSS+CG  K++ TA++DEWIQDSSV GS+RALQMNVPS+ESSSDESDSCYRQGQLVFEY+E +PPF REPL+DKITILASRFPEL T+
Subjt:  SDADSSKETSSDGSSSCGAEKQSRTAIRDEWIQDSSVLGSQRALQMNVPSSESSSDESDSCYRQGQLVFEYLEREPPFGREPLSDKITILASRFPELMTF

Query:  RSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTTIQGNGADGLQFHWPRFREVNTADCPLKLQLLTFGLASYKFKISFWNSTGAEECPKAN
        RSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLST  QG G DGLQFHWPR REV+TA+ PLKLQL TFGLASYKFK SFWNSTG EECPKAN
Subjt:  RSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTTIQGNGADGLQFHWPRFREVNTADCPLKLQLLTFGLASYKFKISFWNSTGAEECPKAN

Query:  TLLQGADNWLRSLNVNHPDYRFFSSH
        TL Q ADNWLRSLNVNHPDYRFF+SH
Subjt:  TLLQGADNWLRSLNVNHPDYRFFSSH

A0A6J1G3S2 uncharacterized protein LOC111450487 isoform X27.9e-19683.53Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRL--------QQQQQQQQQQQQQQQKQSALDSKEAVVVVASAARIDELEKRSEFDDCRSWSTRSDCSVSDRGVA
        MSVSGGVSIARIRGENRFYHPPAMRRRL        QQQQQQQQQQQQQQQKQ+ALD KE  V  A+ ARIDELEK SE D+CRSWSTRSDCSVSDRGVA
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRL--------QQQQQQQQQQQQQQQKQSALDSKEAVVVVASAARIDELEKRSEFDDCRSWSTRSDCSVSDRGVA

Query:  DSTNLDRFLEHTTPVVPALRNPKASQRGWRNREVLEAPPYFLLGDLWESFKEWSAYGAGTPLLLNGSDSVVQYYVPYLSGIQLYVNPSKSSALSRRPGAS
        DSTNLDRFLE+TTPVVPA    K S +GWRNREV EAPPYF+LGDLWESFKEWSAYGAG PLLLNGSDSVVQYYVPYLSGIQLY++PSKSSAL RR   S
Subjt:  DSTNLDRFLEHTTPVVPALRNPKASQRGWRNREVLEAPPYFLLGDLWESFKEWSAYGAGTPLLLNGSDSVVQYYVPYLSGIQLYVNPSKSSALSRRPGAS

Query:  DADSSKETSSDGSSSCGAEKQSRTAIRDEWIQDSSVLGSQRALQMNVPSSESSSDESDSCYRQGQLVFEYLEREPPFGREPLSDKITILASRFPELMTFR
        DA+SSKET+SDGSS+CG  K++ TA++DEWIQDSSV GS+RALQMNVPS+ESSSDESDSCYRQGQLVFEY+E +PPF REPL+DKITILASRFPEL T+R
Subjt:  DADSSKETSSDGSSSCGAEKQSRTAIRDEWIQDSSVLGSQRALQMNVPSSESSSDESDSCYRQGQLVFEYLEREPPFGREPLSDKITILASRFPELMTFR

Query:  SCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTTIQGNGADGLQFHWPRFREVNTADCPLKLQLLTFGLASYKFKISFWNSTGAEECPKANT
        SCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLST  QG G DGLQFHWPR REV+TA+ PLKLQL TFGLASYKFK SFWNSTG EECPKANT
Subjt:  SCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTTIQGNGADGLQFHWPRFREVNTADCPLKLQLLTFGLASYKFKISFWNSTGAEECPKANT

Query:  LLQGADNWLRSLNVNHPDYRFFSSH
        L Q ADNWLRSLNVNHPDYRFF+SH
Subjt:  LLQGADNWLRSLNVNHPDYRFFSSH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G15030.1 Protein of unknown function (DUF789)1.5e-7750.31Show/hide
Query:  ADSTNLDRFLEHTTPVVPALRNPKASQRGWRNREVLEAPPYFLLGDLWESFKEWSAYGAGTPLLLNGS-DSVVQYYVPYLSGIQLY--VNPSKSSALSRR
        A S+N++RFL+  TP VPA    K   R     +V    PYFLLGD+WESF EWSAYG G PL LN + D V QYYVP LSGIQ+Y  V+   SS  +RR
Subjt:  ADSTNLDRFLEHTTPVVPALRNPKASQRGWRNREVLEAPPYFLLGDLWESFKEWSAYGAGTPLLLNGS-DSVVQYYVPYLSGIQLY--VNPSKSSALSRR

Query:  PGASDADSSKETSSDGSSSCGAEKQSRTAIRDEWIQDSSVLGSQRALQMNVPSSESSSDESDSCYRQGQLVFEYLEREPPFGREPLSDKITILASRFPEL
         G       +++SS+GSSS   E +       E I       S R         +SSSD+ +    QG+L+FEYLER+ P+ REP +DK++ LASRFPEL
Subjt:  PGASDADSSKETSSDGSSSCGAEKQSRTAIRDEWIQDSSVLGSQRALQMNVPSSESSSDESDSCYRQGQLVFEYLEREPPFGREPLSDKITILASRFPEL

Query:  MTFRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTTIQGNGADGLQFHWPRFREVNTADCPLKLQLLTFGLASYKFKISFWNSTGAEECP
         T RSCDL PSSW SVAWYPIY+IPTGPTL+ LDACFLT+HSL T  QG G      H      V   +   K++L  FGLASYK + S W S G     
Subjt:  MTFRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTTIQGNGADGLQFHWPRFREVNTADCPLKLQLLTFGLASYKFKISFWNSTGAEECP

Query:  KANTLLQGADNWLRSLNVNHPDYRFF
         AN+L Q ADNWLR   VNHPD+ FF
Subjt:  KANTLLQGADNWLRSLNVNHPDYRFF

AT2G01260.1 Protein of unknown function (DUF789)3.6e-7646.01Show/hide
Query:  RIDELEK-RSEFDDCRSWSTRSDCSVSDRGVADSTNLDRFLEHTTPVVPALRNPKASQRGWR-NREVLEAPPYFLLGDLWESFKEWSAYGAGTPLLLNGS
        RID+L + +S+  +  S +        +     S+NLDRFLE  TP VPA    K   R  R + +  +  PYF+LGD+W+SF EWSAYG G PL+LN +
Subjt:  RIDELEK-RSEFDDCRSWSTRSDCSVSDRGVADSTNLDRFLEHTTPVVPALRNPKASQRGWR-NREVLEAPPYFLLGDLWESFKEWSAYGAGTPLLLNGS

Query:  -DSVVQYYVPYLSGIQLYVNPS--KSSALSRRPGASDADSSKETSSDGSSSCGAEKQSR----TAIRDEWIQDSSVLGSQRALQMNVPSSESSSDESDSC
         D V+QYYVP LS IQ+Y +     SS  SRRPG S     +++SSD SS   +E+ S      ++RD+  +D                  SSSD+ +  
Subjt:  -DSVVQYYVPYLSGIQLYVNPS--KSSALSRRPGASDADSSKETSSDGSSSCGAEKQSR----TAIRDEWIQDSSVLGSQRALQMNVPSSESSSDESDSC

Query:  YRQGQLVFEYLEREPPFGREPLSDKITILASRFPELMTFRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTTIQGNGAD-GLQFHWPRFR
          QG+L+FEYLER+ P+ REP +DK+  LA++FPELMT RSCDL  SSW SVAWYPIYRIPTGPTL+ LDACFLT+HSL T+  G G++  +    PR  
Subjt:  YRQGQLVFEYLEREPPFGREPLSDKITILASRFPELMTFRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTTIQGNGAD-GLQFHWPRFR

Query:  EVNTADCPLKLQLLTFGLASYKFKISFWNSTGAEECPKANTLLQGADNWLRSLNVNHPDYRFF
        E        K+ L  FGLASYKF+ S W   G  E    N+L Q AD WL S +V+HPD+ FF
Subjt:  EVNTADCPLKLQLLTFGLASYKFKISFWNSTGAEECPKANTLLQGADNWLRSLNVNHPDYRFF

AT2G01260.2 Protein of unknown function (DUF789)4.0e-5947.37Show/hide
Query:  RIDELEK-RSEFDDCRSWSTRSDCSVSDRGVADSTNLDRFLEHTTPVVPALRNPKASQRGWR-NREVLEAPPYFLLGDLWESFKEWSAYGAGTPLLLNGS
        RID+L + +S+  +  S +        +     S+NLDRFLE  TP VPA    K   R  R + +  +  PYF+LGD+W+SF EWSAYG G PL+LN +
Subjt:  RIDELEK-RSEFDDCRSWSTRSDCSVSDRGVADSTNLDRFLEHTTPVVPALRNPKASQRGWR-NREVLEAPPYFLLGDLWESFKEWSAYGAGTPLLLNGS

Query:  -DSVVQYYVPYLSGIQLYVNPS--KSSALSRRPGASDADSSKETSSDGSSSCGAEKQSR----TAIRDEWIQDSSVLGSQRALQMNVPSSESSSDESDSC
         D V+QYYVP LS IQ+Y +     SS  SRRPG S     +++SSD SS   +E+ S      ++RD+  +D                  SSSD+ +  
Subjt:  -DSVVQYYVPYLSGIQLYVNPS--KSSALSRRPGASDADSSKETSSDGSSSCGAEKQSR----TAIRDEWIQDSSVLGSQRALQMNVPSSESSSDESDSC

Query:  YRQGQLVFEYLEREPPFGREPLSDKITILASRFPELMTFRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTTIQG
          QG+L+FEYLER+ P+ REP +DK+  LA++FPELMT RSCDL  SSW SVAWYPIYRIPTGPTL+ LDACFLT+HSL T+  G
Subjt:  YRQGQLVFEYLEREPPFGREPLSDKITILASRFPELMTFRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTTIQG

AT4G16100.1 Protein of unknown function (DUF789)6.2e-9249.4Show/hide
Query:  RIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQKQSALDSKEAVVVVASAARIDELEKRSEFDDCRSWSTRSDCSVSDRGVADST-------NLDRFLEH
        RIRGENRFY+PP M R+LQQ++++++ + ++ +K    + K+A  ++    +++E E +   ++C    + SDCSV  R  + +T       NL RFL+ 
Subjt:  RIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQKQSALDSKEAVVVVASAARIDELEKRSEFDDCRSWSTRSDCSVSDRGVADST-------NLDRFLEH

Query:  TTPVVPALRNPKASQRGWRNREVLEAPPYFLLGDLWESFKEWSAYGAGTPLLLNGSDSVVQYYVPYLSGIQLYVNPSKSSALSRRPG-ASDADSSKETSS
        TTP+V     P  S +GWR RE  E  PYFLL DLW+SF+EWSAYG G PLLLNG DSVVQYYVPYLSGIQLY +PS++    RR G  SD DS ++ SS
Subjt:  TTPVVPALRNPKASQRGWRNREVLEAPPYFLLGDLWESFKEWSAYGAGTPLLLNGSDSVVQYYVPYLSGIQLYVNPSKSSALSRRPG-ASDADSSKETSS

Query:  DGSSSCGAEKQSRTAIRDEWIQDSSVLGSQRALQMNVPSSESSSDESD-SCYRQGQLVFEYLEREPPFGREPLSDKITILASRFPELMTFRSCDLSPSSW
        DGS+ C    Q+                  RA     P   SSSDES+ S    G+LVFEYLE   PFGREPL+DKI+ L+S+FP L T+RSCDLSPSSW
Subjt:  DGSSSCGAEKQSRTAIRDEWIQDSSVLGSQRALQMNVPSSESSSDESD-SCYRQGQLVFEYLEREPPFGREPLSDKITILASRFPELMTFRSCDLSPSSW

Query:  ISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTTIQGNGADGLQFHWPRFREVNTADCPLKLQLLTFGLASYKFKISFWN-STGAEECPKANTLLQGADNW
        +SVAWYPIYRIP G +LQ+LDACFLTFHSLST  +G   +  Q         + +    KL L TFGLASYKFK+S W+  +  +E  +  TLL+ A+ W
Subjt:  ISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTTIQGNGADGLQFHWPRFREVNTADCPLKLQLLTFGLASYKFKISFWN-STGAEECPKANTLLQGADNW

Query:  LRSLNVNHPDYRFFSSH
        LR L V  PD+R F SH
Subjt:  LRSLNVNHPDYRFFSSH

AT5G49220.1 Protein of unknown function (DUF789)1.8e-9149.66Show/hide
Query:  MSVSGGVSIAR--IRGENRFYHPPAMRRRLQQQQQQQQ--QQQQQQQKQSALDSKE---AVVVVASAAR----IDELEKRSEFDDCRSWSTRSDCSV-SD
        MS SGGVSIAR  IRGENRFY+PP MRR  Q+ Q QQQ  ++Q++  +   L  KE   A  V     R    + E + R         +  SD S  S 
Subjt:  MSVSGGVSIAR--IRGENRFYHPPAMRRRLQQQQQQQQ--QQQQQQQKQSALDSKE---AVVVVASAAR----IDELEKRSEFDDCRSWSTRSDCSV-SD

Query:  RGVADSTNLDRFLEHTTPVVPALRNPKASQRGWRNREVLEAPPYFLLGDLWESFKEWSAYGAGT-----PLLLNGSDSVVQYYVPYLSGIQLYVNPSKSS
        R ++D +NLDRFLEHTTPVVPA   P  S+   + RE  +   YF+L DLWESF EWSAYGAG      PL ++G+DS VQYYVPYLSGIQLYV+P K  
Subjt:  RGVADSTNLDRFLEHTTPVVPALRNPKASQRGWRNREVLEAPPYFLLGDLWESFKEWSAYGAGT-----PLLLNGSDSVVQYYVPYLSGIQLYVNPSKSS

Query:  ALSRRPGASDADSSKETSSDGSS---SCGAEKQSRTAIRDEWIQDSSVLGSQRALQMNVPSSESSSDESDSCYRQGQLVFEYLEREPPFGREPLSDKITI
            R    D + S E SS+  +        + +R +++D+     S+ GS             SS E++    QG+L+FEYLE EPPFGREPL++KI+ 
Subjt:  ALSRRPGASDADSSKETSSDGSS---SCGAEKQSRTAIRDEWIQDSSVLGSQRALQMNVPSSESSSDESDSCYRQGQLVFEYLEREPPFGREPLSDKITI

Query:  LASRFPELMTFRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTTIQGNGADGLQFHWPRFREVNTADCPLKLQLLTFGLASYKFKISFWN
        LASR PELMT+RSCDL PSSW+SV+WYPIYRIP GPTLQ+LDACFLTFHSLST      A G     P            KL L TFGLASYK K+S WN
Subjt:  LASRFPELMTFRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTTIQGNGADGLQFHWPRFREVNTADCPLKLQLLTFGLASYKFKISFWN

Query:  STGAEECPKANTLLQGADNWLRSLNVNHPDYRFFSSH
            +E  K  +LLQ AD WL+ L V+HPDYRFF+S+
Subjt:  STGAEECPKANTLLQGADNWLRSLNVNHPDYRFFSSH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAGTCTCCGGTGGGGTTTCGATTGCGAGAATCCGAGGCGAGAATCGGTTCTATCATCCGCCTGCGATGCGGCGACGTTTGCAGCAGCAGCAACAGCAACAGCAACA
ACAGCAACAGCAGCAGCAGAAGCAGAGCGCGTTGGATTCGAAAGAGGCTGTTGTTGTTGTTGCTTCTGCTGCCAGGATCGATGAGTTGGAGAAGCGGAGTGAGTTTGATG
ACTGCCGCTCTTGGTCCACTCGCTCTGATTGCTCTGTTTCGGATCGGGGAGTGGCTGATTCTACTAATTTGGATCGCTTCTTGGAGCATACTACTCCCGTTGTTCCGGCT
CTACGAAATCCTAAGGCGAGCCAGAGGGGATGGAGAAATCGTGAAGTCTTAGAGGCACCTCCTTATTTTTTGCTTGGTGATCTCTGGGAATCTTTCAAGGAATGGAGTGC
ATATGGAGCTGGTACCCCTCTATTGTTAAATGGTAGCGACTCTGTTGTCCAGTACTACGTTCCGTATCTGTCTGGCATTCAACTCTATGTAAATCCATCAAAGTCTTCTG
CCCTAAGTAGAAGGCCTGGTGCAAGTGATGCTGATTCCTCTAAGGAAACAAGCAGTGATGGAAGCAGTAGTTGTGGGGCAGAAAAACAATCAAGGACTGCTATTCGGGAT
GAATGGATCCAGGACTCCAGTGTTCTGGGGTCACAACGAGCTCTTCAAATGAATGTACCTTCTTCCGAGTCATCAAGTGATGAAAGTGACTCTTGCTACCGTCAAGGTCA
GCTTGTGTTTGAATACTTGGAGCGTGAGCCACCATTTGGTCGTGAACCATTATCTGATAAGATCACTATTCTGGCATCCCGGTTTCCTGAATTAATGACATTTAGAAGCT
GTGATTTGTCTCCCTCTAGTTGGATATCTGTGGCATGGTATCCAATTTATAGGATTCCCACTGGTCCAACTTTACAAAGTCTAGATGCTTGTTTTCTGACCTTCCATTCT
CTGTCAACAACAATTCAAGGCAACGGCGCAGATGGGTTGCAATTCCACTGGCCAAGATTTAGAGAGGTGAACACTGCAGATTGCCCTCTCAAACTACAGTTGCTGACATT
CGGACTTGCTTCGTACAAGTTCAAAATTTCTTTTTGGAATTCAACTGGTGCTGAGGAATGTCCAAAGGCTAACACCTTGTTGCAAGGTGCTGACAACTGGCTCAGGTCAT
TAAACGTAAACCATCCTGATTACAGATTTTTCTCATCTCATATTCATACTTGA
mRNA sequenceShow/hide mRNA sequence
TCTGGTCCAAAAATCTCCCCCTCAAAATCCAATCCCCCAATTTCTGTAATTGCAAAAAAATAGGGCTAAATTTTCAAAAAAAAAAAAAAAACATTTCTGTTGTTCCATCT
TCCCCCACTAGATTTTACAAAACCCTAGTTTTCTCCATCTTCATCTTCCTGTTCAACAATCAAACCCTAGCTCCTCTCTTGTTTGTGCCGTTGCCACAGCCTTGGTGTTT
TCTTTTTGCAATGTCAGTCTCCGGTGGGGTTTCGATTGCGAGAATCCGAGGCGAGAATCGGTTCTATCATCCGCCTGCGATGCGGCGACGTTTGCAGCAGCAGCAACAGC
AACAGCAACAACAGCAACAGCAGCAGCAGAAGCAGAGCGCGTTGGATTCGAAAGAGGCTGTTGTTGTTGTTGCTTCTGCTGCCAGGATCGATGAGTTGGAGAAGCGGAGT
GAGTTTGATGACTGCCGCTCTTGGTCCACTCGCTCTGATTGCTCTGTTTCGGATCGGGGAGTGGCTGATTCTACTAATTTGGATCGCTTCTTGGAGCATACTACTCCCGT
TGTTCCGGCTCTACGAAATCCTAAGGCGAGCCAGAGGGGATGGAGAAATCGTGAAGTCTTAGAGGCACCTCCTTATTTTTTGCTTGGTGATCTCTGGGAATCTTTCAAGG
AATGGAGTGCATATGGAGCTGGTACCCCTCTATTGTTAAATGGTAGCGACTCTGTTGTCCAGTACTACGTTCCGTATCTGTCTGGCATTCAACTCTATGTAAATCCATCA
AAGTCTTCTGCCCTAAGTAGAAGGCCTGGTGCAAGTGATGCTGATTCCTCTAAGGAAACAAGCAGTGATGGAAGCAGTAGTTGTGGGGCAGAAAAACAATCAAGGACTGC
TATTCGGGATGAATGGATCCAGGACTCCAGTGTTCTGGGGTCACAACGAGCTCTTCAAATGAATGTACCTTCTTCCGAGTCATCAAGTGATGAAAGTGACTCTTGCTACC
GTCAAGGTCAGCTTGTGTTTGAATACTTGGAGCGTGAGCCACCATTTGGTCGTGAACCATTATCTGATAAGATCACTATTCTGGCATCCCGGTTTCCTGAATTAATGACA
TTTAGAAGCTGTGATTTGTCTCCCTCTAGTTGGATATCTGTGGCATGGTATCCAATTTATAGGATTCCCACTGGTCCAACTTTACAAAGTCTAGATGCTTGTTTTCTGAC
CTTCCATTCTCTGTCAACAACAATTCAAGGCAACGGCGCAGATGGGTTGCAATTCCACTGGCCAAGATTTAGAGAGGTGAACACTGCAGATTGCCCTCTCAAACTACAGT
TGCTGACATTCGGACTTGCTTCGTACAAGTTCAAAATTTCTTTTTGGAATTCAACTGGTGCTGAGGAATGTCCAAAGGCTAACACCTTGTTGCAAGGTGCTGACAACTGG
CTCAGGTCATTAAACGTAAACCATCCTGATTACAGATTTTTCTCATCTCATATTCATACTTGAAAATGATAAGGAAAAGGATATTATGCATGATGCCTTAAATATGGGAT
TGCAGTGTCGTAAGTCCAAAGAAACTCGCTTCTTTTGAATGTCGTGAAAATTTATATGGCATCTTTTGCTTGTTTTTTGGAGTTTCTATTTTGATTGACCATATAAAGGG
TTCTGTATAGGAAGAATGATGTTGATACTGTAACATTACGGTTGCGAAATGGGGATTGGCCTTAGGATAGTTATTATTCTGGGAAATGTCAATCCTCATATGGTATTATT
AGTCTATTCTTCCATGTGTTGAGATTATGAACGATGATGGGAATACGGCGGATGTCGACTGCAGAGAAACCTGCTGTCAATAACACCACCACCCCTTTTTTCTATTTTAT
TTTTTTGTACTTTAAATTGTTTGATAGATTTGTAACTTTCACTGGCACCACGATTAATGATGAAAAGTATATCTAGTATGTTAACTACAATCTTTCTGTATTAACATTTG
ATTTGATAATGTAATTGTTTCTCGGTTAATTTTGTCAATTATATCATTTGGGTGATGATTCGAATATTTGACCTTTTGGTTGAGGTTATATGGAACAAATTTGAACTTTT
TTTTCTTCTGTTTGCAGCATGTTTATCTCATATTGTCATATTATTTCACTTTTTCTGTCTCTAGTTATAATTCTGATTCCC
Protein sequenceShow/hide protein sequence
MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQKQSALDSKEAVVVVASAARIDELEKRSEFDDCRSWSTRSDCSVSDRGVADSTNLDRFLEHTTPVVPA
LRNPKASQRGWRNREVLEAPPYFLLGDLWESFKEWSAYGAGTPLLLNGSDSVVQYYVPYLSGIQLYVNPSKSSALSRRPGASDADSSKETSSDGSSSCGAEKQSRTAIRD
EWIQDSSVLGSQRALQMNVPSSESSSDESDSCYRQGQLVFEYLEREPPFGREPLSDKITILASRFPELMTFRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHS
LSTTIQGNGADGLQFHWPRFREVNTADCPLKLQLLTFGLASYKFKISFWNSTGAEECPKANTLLQGADNWLRSLNVNHPDYRFFSSHIHT