; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0008550 (gene) of Snake gourd v1 genome

Gene IDTan0008550
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein of unknown function (DUF789)
Genome locationLG10:9375430..9379133
RNA-Seq ExpressionTan0008550
SyntenyTan0008550
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR008507 - Protein of unknown function DUF789


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004134231.3 uncharacterized protein LOC101208769 isoform X1 [Cucumis sativus]3.3e-22090.65Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQ---------KQSALDSKEVV-ATSARIDELEKRSEFDECRSWSTRSDCSVSDRGVADST
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQ         KQSALDSK+VV A ++ ID+LEKRSEFDECRSWSTRSDCSVSDRG+ADST
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQ---------KQSALDSKEVV-ATSARIDELEKRSEFDECRSWSTRSDCSVSDRGVADST

Query:  NLDRFLEHTTPLVPAQCIPKTSLRGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDA
        NLDRFLEHTTPLVPA CIPKTSLRGWRNREVSEA PYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDA
Subjt:  NLDRFLEHTTPLVPAQCIPKTSLRGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDA

Query:  ESSKETSSDGSSNCGAEKKTKTALQDEWIQDSSVLGSQRALQMSVPSAESSSDESDSCYRQGQLVFEYLERDPPFCREPLTDKITILSSRFPELKTYRSC
        ESSKETSSDGSSN GAEKKTKTALQ+EWIQD +V GSQRALQM+VPS+ESSSDESDSCYR GQLVFEYLERDPPFCREPLTDKIT+L+SRF ELKTYRSC
Subjt:  ESSKETSSDGSSNCGAEKKTKTALQDEWIQDSSVLGSQRALQMSVPSAESSSDESDSCYRQGQLVFEYLERDPPFCREPLTDKITILSSRFPELKTYRSC

Query:  DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFLGISTDGLQFHWPRVREVYTADCPLKLQLPTFGLASYKFKISFWNSTGAEECPKANTLW
        DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFH+LSTAF GISTDGLQFHWPRVREVYTADCPLKLQLP FGLASYKFKI FWNSTGAEEC KA++LW
Subjt:  DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFLGISTDGLQFHWPRVREVYTADCPLKLQLPTFGLASYKFKISFWNSTGAEECPKANTLW

Query:  QDADNWLRSLNVNHPDYKFFASHNSFWR
        QDAD+WLR LNVNHPDY+FFASHNSFWR
Subjt:  QDADNWLRSLNVNHPDYKFFASHNSFWR

XP_008438916.1 PREDICTED: uncharacterized protein LOC103483873 isoform X1 [Cucumis melo]9.5e-22091.02Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQ----KQSALDSKEVV-ATSARIDELEKRSEFDECRSWSTRSDCSVSDRGVADSTNLDRF
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQ    KQSALDSK+VV A ++ ID+LEKRSEFDECRSWSTRSDCSVSDRG+ DSTNLDRF
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQ----KQSALDSKEVV-ATSARIDELEKRSEFDECRSWSTRSDCSVSDRGVADSTNLDRF

Query:  LEHTTPLVPAQCIPKTSLRGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSKE
        LEHTTPLVPA CIPKTSLRGWRNREVSEA PYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKS ALSRRRGADSDAESSKE
Subjt:  LEHTTPLVPAQCIPKTSLRGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSKE

Query:  TSSDGSSNCGAEKKTKTALQDEWIQDSSVLGSQRALQMSVPSAESSSDESDSCYRQGQLVFEYLERDPPFCREPLTDKITILSSRFPELKTYRSCDLSPS
        TSSDGSSN GAEKKTKTALQ+EWIQD + LGSQRALQM+VPS+ESSSDESDSCYR GQLVFEYLERDPPFCREPLTDKIT+L+SRFPELKTYRSCDLSPS
Subjt:  TSSDGSSNCGAEKKTKTALQDEWIQDSSVLGSQRALQMSVPSAESSSDESDSCYRQGQLVFEYLERDPPFCREPLTDKITILSSRFPELKTYRSCDLSPS

Query:  SWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFLGISTDGLQFHWPRVREVYTADCPLKLQLPTFGLASYKFKISFWNSTGAEECPKANTLWQDADN
        SWISVAWYPIYRIPTGPTLQSLDACFLTFH+LSTA  G STDGLQFHWPRVREVYTADCPLKLQLP FGLASYKFKI FWNSTGAEEC KA++LWQDAD+
Subjt:  SWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFLGISTDGLQFHWPRVREVYTADCPLKLQLPTFGLASYKFKISFWNSTGAEECPKANTLWQDADN

Query:  WLRSLNVNHPDYKFFASHNSFWR
        WLR LNVNHPDY+FFASHNSFWR
Subjt:  WLRSLNVNHPDYKFFASHNSFWR

XP_008438917.1 PREDICTED: uncharacterized protein LOC103483873 isoform X2 [Cucumis melo]5.2e-21890.78Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQ----KQSALDSKEVV-ATSARIDELEKRSEFDECRSWSTRSDCSVSDRGVADSTNLDRF
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQ    KQSALDSK+VV A ++ ID+LEKRSEFDECRSWSTRSDCSVSDRG+ DSTNLDRF
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQ----KQSALDSKEVV-ATSARIDELEKRSEFDECRSWSTRSDCSVSDRGVADSTNLDRF

Query:  LEHTTPLVPAQCIPKTSLRGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSKE
        LEHTTPLVPA CIPKTSLRGWRNREVSEA PYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKS AL RRRGADSDAESSKE
Subjt:  LEHTTPLVPAQCIPKTSLRGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSKE

Query:  TSSDGSSNCGAEKKTKTALQDEWIQDSSVLGSQRALQMSVPSAESSSDESDSCYRQGQLVFEYLERDPPFCREPLTDKITILSSRFPELKTYRSCDLSPS
        TSSDGSSN GAEKKTKTALQ+EWIQD + LGSQRALQM+VPS+ESSSDESDSCYR GQLVFEYLERDPPFCREPLTDKIT+L+SRFPELKTYRSCDLSPS
Subjt:  TSSDGSSNCGAEKKTKTALQDEWIQDSSVLGSQRALQMSVPSAESSSDESDSCYRQGQLVFEYLERDPPFCREPLTDKITILSSRFPELKTYRSCDLSPS

Query:  SWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFLGISTDGLQFHWPRVREVYTADCPLKLQLPTFGLASYKFKISFWNSTGAEECPKANTLWQDADN
        SWISVAWYPIYRIPTGPTLQSLDACFLTFH+LSTA  G STDGLQFHWPRVREVYTADCPLKLQLP FGLASYKFKI FWNSTGAEEC KA++LWQDAD+
Subjt:  SWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFLGISTDGLQFHWPRVREVYTADCPLKLQLPTFGLASYKFKISFWNSTGAEECPKANTLWQDADN

Query:  WLRSLNVNHPDYKFFASHNSFWR
        WLR LNVNHPDY+FFASHNSFWR
Subjt:  WLRSLNVNHPDYKFFASHNSFWR

XP_011651067.2 uncharacterized protein LOC101208769 isoform X2 [Cucumis sativus]1.8e-21890.42Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQ---------KQSALDSKEVV-ATSARIDELEKRSEFDECRSWSTRSDCSVSDRGVADST
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQ         KQSALDSK+VV A ++ ID+LEKRSEFDECRSWSTRSDCSVSDRG+ADST
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQ---------KQSALDSKEVV-ATSARIDELEKRSEFDECRSWSTRSDCSVSDRGVADST

Query:  NLDRFLEHTTPLVPAQCIPKTSLRGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDA
        NLDRFLEHTTPLVPA CIPKTSLRGWRNREVSEA PYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSAL RRRGADSDA
Subjt:  NLDRFLEHTTPLVPAQCIPKTSLRGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDA

Query:  ESSKETSSDGSSNCGAEKKTKTALQDEWIQDSSVLGSQRALQMSVPSAESSSDESDSCYRQGQLVFEYLERDPPFCREPLTDKITILSSRFPELKTYRSC
        ESSKETSSDGSSN GAEKKTKTALQ+EWIQD +V GSQRALQM+VPS+ESSSDESDSCYR GQLVFEYLERDPPFCREPLTDKIT+L+SRF ELKTYRSC
Subjt:  ESSKETSSDGSSNCGAEKKTKTALQDEWIQDSSVLGSQRALQMSVPSAESSSDESDSCYRQGQLVFEYLERDPPFCREPLTDKITILSSRFPELKTYRSC

Query:  DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFLGISTDGLQFHWPRVREVYTADCPLKLQLPTFGLASYKFKISFWNSTGAEECPKANTLW
        DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFH+LSTAF GISTDGLQFHWPRVREVYTADCPLKLQLP FGLASYKFKI FWNSTGAEEC KA++LW
Subjt:  DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFLGISTDGLQFHWPRVREVYTADCPLKLQLPTFGLASYKFKISFWNSTGAEECPKANTLW

Query:  QDADNWLRSLNVNHPDYKFFASHNSFWR
        QDAD+WLR LNVNHPDY+FFASHNSFWR
Subjt:  QDADNWLRSLNVNHPDYKFFASHNSFWR

XP_038877692.1 uncharacterized protein LOC120069924 [Benincasa hispida]1.1e-22091.31Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQ-------KQSALDSKEV-VATSARIDELEKRSEFDECRSWSTRSDCSVSDRGVADSTNL
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQ       KQS LDSK+V VA++A ID+LEKRSEFDECRSWSTRSDCSVSDRG+ADSTNL
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQ-------KQSALDSKEV-VATSARIDELEKRSEFDECRSWSTRSDCSVSDRGVADSTNL

Query:  DRFLEHTTPLVPAQCIPKTSLRGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAES
        DRFLEHTTPLVPA CIPKTSLRGWRNREV EA PYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSS+LSRRRG DSDA S
Subjt:  DRFLEHTTPLVPAQCIPKTSLRGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAES

Query:  SKETSSDGSSNCGAEKKTKTALQDEWIQDSSVLGSQRALQMSVPSAESSSDESDSCYRQGQLVFEYLERDPPFCREPLTDKITILSSRFPELKTYRSCDL
        SKETSSDGSSN GAEKKTKTALQDEWIQD SV GSQRALQM+VPS+ESSSDESDSCYR GQLVFEYLERDPPFCREPLTDKITIL+SRFPELKTYRSCDL
Subjt:  SKETSSDGSSNCGAEKKTKTALQDEWIQDSSVLGSQRALQMSVPSAESSSDESDSCYRQGQLVFEYLERDPPFCREPLTDKITILSSRFPELKTYRSCDL

Query:  SPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFLGISTDGLQFHWPRVREVYTADCPLKLQLPTFGLASYKFKISFWNSTGAEECPKANTLWQD
        SPSSWISVAWYPIYRIPTGPTLQSLDACFLTFH+LSTAF GISTDGLQFHWPRVREVYTADCPLKLQLP FGLASYKFKI FWNSTGAEEC KA++LWQD
Subjt:  SPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFLGISTDGLQFHWPRVREVYTADCPLKLQLPTFGLASYKFKISFWNSTGAEECPKANTLWQD

Query:  ADNWLRSLNVNHPDYKFFASHNSFWR
        ADNWLR LNVNHPDY+FFASHNSFWR
Subjt:  ADNWLRSLNVNHPDYKFFASHNSFWR

TrEMBL top hitse value%identityAlignment
A0A0A0L5V4 Uncharacterized protein2.1e-22092.12Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQKQSALDSKEVV-ATSARIDELEKRSEFDECRSWSTRSDCSVSDRGVADSTNLDRFLEHT
        MSVSGGVSIARIRGENRFYHPPAMRRRL QQQQQQQQQQQ KQSALDSK+VV A ++ ID+LEKRSEFDECRSWSTRSDCSVSDRG+ADSTNLDRFLEHT
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQKQSALDSKEVV-ATSARIDELEKRSEFDECRSWSTRSDCSVSDRGVADSTNLDRFLEHT

Query:  TPLVPAQCIPKTSLRGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSKETSSD
        TPLVPA CIPKTSLRGWRNREVSEA PYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSKETSSD
Subjt:  TPLVPAQCIPKTSLRGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSKETSSD

Query:  GSSNCGAEKKTKTALQDEWIQDSSVLGSQRALQMSVPSAESSSDESDSCYRQGQLVFEYLERDPPFCREPLTDKITILSSRFPELKTYRSCDLSPSSWIS
        GSSN GAEKKTKTALQ+EWIQD +V GSQRALQM+VPS+ESSSDESDSCYR GQLVFEYLERDPPFCREPLTDKIT+L+SRF ELKTYRSCDLSPSSWIS
Subjt:  GSSNCGAEKKTKTALQDEWIQDSSVLGSQRALQMSVPSAESSSDESDSCYRQGQLVFEYLERDPPFCREPLTDKITILSSRFPELKTYRSCDLSPSSWIS

Query:  VAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFLGISTDGLQFHWPRVREVYTADCPLKLQLPTFGLASYKFKISFWNSTGAEECPKANTLWQDADNWLRS
        VAWYPIYRIPTGPTLQSLDACFLTFH+LSTAF GISTDGLQFHWPRVREVYTADCPLKLQLP FGLASYKFKI FWNSTGAEEC KA++LWQDAD+WLR 
Subjt:  VAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFLGISTDGLQFHWPRVREVYTADCPLKLQLPTFGLASYKFKISFWNSTGAEECPKANTLWQDADNWLRS

Query:  LNVNHPDYKFFASHNSFWR
        LNVNHPDY+FFASHNSFWR
Subjt:  LNVNHPDYKFFASHNSFWR

A0A1S3AY60 uncharacterized protein LOC103483873 isoform X14.6e-22091.02Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQ----KQSALDSKEVV-ATSARIDELEKRSEFDECRSWSTRSDCSVSDRGVADSTNLDRF
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQ    KQSALDSK+VV A ++ ID+LEKRSEFDECRSWSTRSDCSVSDRG+ DSTNLDRF
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQ----KQSALDSKEVV-ATSARIDELEKRSEFDECRSWSTRSDCSVSDRGVADSTNLDRF

Query:  LEHTTPLVPAQCIPKTSLRGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSKE
        LEHTTPLVPA CIPKTSLRGWRNREVSEA PYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKS ALSRRRGADSDAESSKE
Subjt:  LEHTTPLVPAQCIPKTSLRGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSKE

Query:  TSSDGSSNCGAEKKTKTALQDEWIQDSSVLGSQRALQMSVPSAESSSDESDSCYRQGQLVFEYLERDPPFCREPLTDKITILSSRFPELKTYRSCDLSPS
        TSSDGSSN GAEKKTKTALQ+EWIQD + LGSQRALQM+VPS+ESSSDESDSCYR GQLVFEYLERDPPFCREPLTDKIT+L+SRFPELKTYRSCDLSPS
Subjt:  TSSDGSSNCGAEKKTKTALQDEWIQDSSVLGSQRALQMSVPSAESSSDESDSCYRQGQLVFEYLERDPPFCREPLTDKITILSSRFPELKTYRSCDLSPS

Query:  SWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFLGISTDGLQFHWPRVREVYTADCPLKLQLPTFGLASYKFKISFWNSTGAEECPKANTLWQDADN
        SWISVAWYPIYRIPTGPTLQSLDACFLTFH+LSTA  G STDGLQFHWPRVREVYTADCPLKLQLP FGLASYKFKI FWNSTGAEEC KA++LWQDAD+
Subjt:  SWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFLGISTDGLQFHWPRVREVYTADCPLKLQLPTFGLASYKFKISFWNSTGAEECPKANTLWQDADN

Query:  WLRSLNVNHPDYKFFASHNSFWR
        WLR LNVNHPDY+FFASHNSFWR
Subjt:  WLRSLNVNHPDYKFFASHNSFWR

A0A1S3AY77 uncharacterized protein LOC103483873 isoform X22.5e-21890.78Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQ----KQSALDSKEVV-ATSARIDELEKRSEFDECRSWSTRSDCSVSDRGVADSTNLDRF
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQ    KQSALDSK+VV A ++ ID+LEKRSEFDECRSWSTRSDCSVSDRG+ DSTNLDRF
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQ----KQSALDSKEVV-ATSARIDELEKRSEFDECRSWSTRSDCSVSDRGVADSTNLDRF

Query:  LEHTTPLVPAQCIPKTSLRGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSKE
        LEHTTPLVPA CIPKTSLRGWRNREVSEA PYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKS AL RRRGADSDAESSKE
Subjt:  LEHTTPLVPAQCIPKTSLRGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSKE

Query:  TSSDGSSNCGAEKKTKTALQDEWIQDSSVLGSQRALQMSVPSAESSSDESDSCYRQGQLVFEYLERDPPFCREPLTDKITILSSRFPELKTYRSCDLSPS
        TSSDGSSN GAEKKTKTALQ+EWIQD + LGSQRALQM+VPS+ESSSDESDSCYR GQLVFEYLERDPPFCREPLTDKIT+L+SRFPELKTYRSCDLSPS
Subjt:  TSSDGSSNCGAEKKTKTALQDEWIQDSSVLGSQRALQMSVPSAESSSDESDSCYRQGQLVFEYLERDPPFCREPLTDKITILSSRFPELKTYRSCDLSPS

Query:  SWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFLGISTDGLQFHWPRVREVYTADCPLKLQLPTFGLASYKFKISFWNSTGAEECPKANTLWQDADN
        SWISVAWYPIYRIPTGPTLQSLDACFLTFH+LSTA  G STDGLQFHWPRVREVYTADCPLKLQLP FGLASYKFKI FWNSTGAEEC KA++LWQDAD+
Subjt:  SWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFLGISTDGLQFHWPRVREVYTADCPLKLQLPTFGLASYKFKISFWNSTGAEECPKANTLWQDADN

Query:  WLRSLNVNHPDYKFFASHNSFWR
        WLR LNVNHPDY+FFASHNSFWR
Subjt:  WLRSLNVNHPDYKFFASHNSFWR

A0A6J1G3Q7 uncharacterized protein LOC111450487 isoform X11.1e-21689.74Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRL----------QQQQQQQQQQQQQKQSALDSKEV-VATSARIDELEKRSEFDECRSWSTRSDCSVSDRGVADS
        MSVSGGVSIARIRGENRFYHPPAMRRRL          QQQQQQQQQQQQQKQ+ALD KEV  AT+ARIDELEK SE DECRSWSTRSDCSVSDRGVADS
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRL----------QQQQQQQQQQQQQKQSALDSKEV-VATSARIDELEKRSEFDECRSWSTRSDCSVSDRGVADS

Query:  TNLDRFLEHTTPLVPAQCIPKTSLRGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSD
        TNLDRFLE+TTP+VPAQC  KTSL+GWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLY+DPSKSSALSRRRGADSD
Subjt:  TNLDRFLEHTTPLVPAQCIPKTSLRGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSD

Query:  AESSKETSSDGSSNCGAEKKTKTALQDEWIQDSSVLGSQRALQMSVPSAESSSDESDSCYRQGQLVFEYLERDPPFCREPLTDKITILSSRFPELKTYRS
        AESSKET+SDGSSNCG  KKT TALQDEWIQDSSV GS+RALQM+VPSAESSSDESDSCYRQGQLVFEY+E DPPFCREPLTDKITIL+SRFPELKTYRS
Subjt:  AESSKETSSDGSSNCGAEKKTKTALQDEWIQDSSVLGSQRALQMSVPSAESSSDESDSCYRQGQLVFEYLERDPPFCREPLTDKITILSSRFPELKTYRS

Query:  CDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFLGISTDGLQFHWPRVREVYTADCPLKLQLPTFGLASYKFKISFWNSTGAEECPKANTL
        CDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAF GI TDGLQFHWPRVREV+TA+ PLKLQLPTFGLASYKFK SFWNSTG EECPKANTL
Subjt:  CDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFLGISTDGLQFHWPRVREVYTADCPLKLQLPTFGLASYKFKISFWNSTGAEECPKANTL

Query:  WQDADNWLRSLNVNHPDYKFFASHNSFWR
        WQDADNWLRSLNVNHPDY+FFASH S  R
Subjt:  WQDADNWLRSLNVNHPDYKFFASHNSFWR

A0A6J1G3S2 uncharacterized protein LOC111450487 isoform X27.6e-21589.51Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRL----------QQQQQQQQQQQQQKQSALDSKEV-VATSARIDELEKRSEFDECRSWSTRSDCSVSDRGVADS
        MSVSGGVSIARIRGENRFYHPPAMRRRL          QQQQQQQQQQQQQKQ+ALD KEV  AT+ARIDELEK SE DECRSWSTRSDCSVSDRGVADS
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRL----------QQQQQQQQQQQQQKQSALDSKEV-VATSARIDELEKRSEFDECRSWSTRSDCSVSDRGVADS

Query:  TNLDRFLEHTTPLVPAQCIPKTSLRGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSD
        TNLDRFLE+TTP+VPAQC  KTSL+GWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLY+DPSKSSAL RRRGADSD
Subjt:  TNLDRFLEHTTPLVPAQCIPKTSLRGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSD

Query:  AESSKETSSDGSSNCGAEKKTKTALQDEWIQDSSVLGSQRALQMSVPSAESSSDESDSCYRQGQLVFEYLERDPPFCREPLTDKITILSSRFPELKTYRS
        AESSKET+SDGSSNCG  KKT TALQDEWIQDSSV GS+RALQM+VPSAESSSDESDSCYRQGQLVFEY+E DPPFCREPLTDKITIL+SRFPELKTYRS
Subjt:  AESSKETSSDGSSNCGAEKKTKTALQDEWIQDSSVLGSQRALQMSVPSAESSSDESDSCYRQGQLVFEYLERDPPFCREPLTDKITILSSRFPELKTYRS

Query:  CDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFLGISTDGLQFHWPRVREVYTADCPLKLQLPTFGLASYKFKISFWNSTGAEECPKANTL
        CDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAF GI TDGLQFHWPRVREV+TA+ PLKLQLPTFGLASYKFK SFWNSTG EECPKANTL
Subjt:  CDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFLGISTDGLQFHWPRVREVYTADCPLKLQLPTFGLASYKFKISFWNSTGAEECPKANTL

Query:  WQDADNWLRSLNVNHPDYKFFASHNSFWR
        WQDADNWLRSLNVNHPDY+FFASH S  R
Subjt:  WQDADNWLRSLNVNHPDYKFFASHNSFWR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G15030.1 Protein of unknown function (DUF789)9.8e-8250.91Show/hide
Query:  ADSTNLDRFLEHTTPLVPAQCIPKTSLRGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGS-DSVVQYYVPYLSGIQLY--VDPSKSSALSRR
        A S+N++RFL+  TP VPA  + KT +R     +V    PYF+LGD+WESF EWSAYG G+PL LN + D V QYYVP LSGIQ+Y  VD   SS  +RR
Subjt:  ADSTNLDRFLEHTTPLVPAQCIPKTSLRGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGS-DSVVQYYVPYLSGIQLY--VDPSKSSALSRR

Query:  RGADSDAESSKETSSDGSSNCGAEKKTKTALQDEWIQDSSVLGSQRALQMSV---PSAESSSDESDSCYRQGQLVFEYLERDPPFCREPLTDKITILSSR
        +G +S+++  +++SS+GSS   +E +       E I       S R  ++S+      +SSSD+ +    QG+L+FEYLERD P+ REP  DK++ L+SR
Subjt:  RGADSDAESSKETSSDGSSNCGAEKKTKTALQDEWIQDSSVLGSQRALQMSV---PSAESSSDESDSCYRQGQLVFEYLERDPPFCREPLTDKITILSSR

Query:  FPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFLGISTDGLQFHWPRVREVYTADCPLKLQLPTFGLASYKFKISFWNSTGA
        FPELKT RSCDL PSSW SVAWYPIY+IPTGPTL+ LDACFLT+HSL T F G        H  + RE        K++LP FGLASYK + S W S G 
Subjt:  FPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFLGISTDGLQFHWPRVREVYTADCPLKLQLPTFGLASYKFKISFWNSTGA

Query:  EECPKANTLWQDADNWLRSLNVNHPDYKFF
             AN+L+Q ADNWLR   VNHPD+ FF
Subjt:  EECPKANTLWQDADNWLRSLNVNHPDYKFF

AT2G01260.1 Protein of unknown function (DUF789)3.9e-7847.08Show/hide
Query:  RIDELEK-RSEFDECRSWSTRSDCSVSDRGVADSTNLDRFLEHTTPLVPAQCIPKTSLRGWR-NREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGS
        RID+L + +S+     S +        +     S+NLDRFLE  TP VPAQ + KT LR  R + + ++  PYFVLGD+W+SF EWSAYG G+PL+LN +
Subjt:  RIDELEK-RSEFDECRSWSTRSDCSVSDRGVADSTNLDRFLEHTTPLVPAQCIPKTSLRGWR-NREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGS

Query:  -DSVVQYYVPYLSGIQLYV-DPSKSSALSRRRGADSDAESSKETSSDGSSNCGAEKKTKTALQDEWIQDSSVLGSQRALQMSVPSAESSSDESDSCYRQG
         D V+QYYVP LS IQ+Y    +  S+L  RR  DS     +++SSD SS+  +E+ +          D   L  Q          +SSSD+ +    QG
Subjt:  -DSVVQYYVPYLSGIQLYV-DPSKSSALSRRRGADSDAESSKETSSDGSSNCGAEKKTKTALQDEWIQDSSVLGSQRALQMSVPSAESSSDESDSCYRQG

Query:  QLVFEYLERDPPFCREPLTDKITILSSRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFLGI-STDGLQFHWPRVREVYT
        +L+FEYLERD P+ REP  DK+  L+++FPEL T RSCDL  SSW SVAWYPIYRIPTGPTL+ LDACFLT+HSL T+F G  S   +    PR  E   
Subjt:  QLVFEYLERDPPFCREPLTDKITILSSRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFLGI-STDGLQFHWPRVREVYT

Query:  ADCPLKLQLPTFGLASYKFKISFWNSTGAEECPKANTLWQDADNWLRSLNVNHPDYKFF
             K+ LP FGLASYKF+ S W   G  E    N+L+Q AD WL S +V+HPD+ FF
Subjt:  ADCPLKLQLPTFGLASYKFKISFWNSTGAEECPKANTLWQDADNWLRSLNVNHPDYKFF

AT2G01260.2 Protein of unknown function (DUF789)4.3e-6148.4Show/hide
Query:  RIDELEK-RSEFDECRSWSTRSDCSVSDRGVADSTNLDRFLEHTTPLVPAQCIPKTSLRGWR-NREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGS
        RID+L + +S+     S +        +     S+NLDRFLE  TP VPAQ + KT LR  R + + ++  PYFVLGD+W+SF EWSAYG G+PL+LN +
Subjt:  RIDELEK-RSEFDECRSWSTRSDCSVSDRGVADSTNLDRFLEHTTPLVPAQCIPKTSLRGWR-NREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGS

Query:  -DSVVQYYVPYLSGIQLYV-DPSKSSALSRRRGADSDAESSKETSSDGSSNCGAEKKTKTALQDEWIQDSSVLGSQRALQMSVPSAESSSDESDSCYRQG
         D V+QYYVP LS IQ+Y    +  S+L  RR  DS     +++SSD SS+  +E+ +          D   L  Q          +SSSD+ +    QG
Subjt:  -DSVVQYYVPYLSGIQLYV-DPSKSSALSRRRGADSDAESSKETSSDGSSNCGAEKKTKTALQDEWIQDSSVLGSQRALQMSVPSAESSSDESDSCYRQG

Query:  QLVFEYLERDPPFCREPLTDKITILSSRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFLG
        +L+FEYLERD P+ REP  DK+  L+++FPEL T RSCDL  SSW SVAWYPIYRIPTGPTL+ LDACFLT+HSL T+F G
Subjt:  QLVFEYLERDPPFCREPLTDKITILSSRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFLG

AT4G16100.1 Protein of unknown function (DUF789)2.6e-9849.76Show/hide
Query:  RIRGENRFYHPPAMRRRLQQQQQQQQQQQQQKQSALDSKEVVATSARIDELEKRSEFDECRSWSTRSDCSVSDRGVADST-------NLDRFLEHTTPLV
        RIRGENRFY+PP MR+  Q++++++ + ++ ++    +KE++    +++E E +   +EC    + SDCSV  R  + +T       NL RFL+ TTP+V
Subjt:  RIRGENRFYHPPAMRRRLQQQQQQQQQQQQQKQSALDSKEVVATSARIDELEKRSEFDECRSWSTRSDCSVSDRGVADST-------NLDRFLEHTTPLV

Query:  PAQCIPKTSLRGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSKETSSDGSSN
          Q +P TS +GWR RE  E  PYF+L DLW+SF+EWSAYG G+PLLLNG DSVVQYYVPYLSGIQLY DPS++    RR G +SD +S ++ SSDGS++
Subjt:  PAQCIPKTSLRGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSKETSSDGSSN

Query:  CG--AEKKTKTALQDEWIQDSSVLGSQRALQMSVPSAESSSDESD-SCYRQGQLVFEYLERDPPFCREPLTDKITILSSRFPELKTYRSCDLSPSSWISV
        C   ++   + +L+++                  P   SSSDES+ S    G+LVFEYLE   PF REPLTDKI+ LSS+FP L+TYRSCDLSPSSW+SV
Subjt:  CG--AEKKTKTALQDEWIQDSSVLGSQRALQMSVPSAESSSDESD-SCYRQGQLVFEYLERDPPFCREPLTDKITILSSRFPELKTYRSCDLSPSSWISV

Query:  AWYPIYRIPTGPTLQSLDACFLTFHSLSTAFLGISTDGLQFHWPRVREVYTADCPLKLQLPTFGLASYKFKISFWN-STGAEECPKANTLWQDADNWLRS
        AWYPIYRIP G +LQ+LDACFLTFHSLST   G S +  Q     V          KL LPTFGLASYKFK+S W+  +  +E  +  TL + A+ WLR 
Subjt:  AWYPIYRIPTGPTLQSLDACFLTFHSLSTAFLGISTDGLQFHWPRVREVYTADCPLKLQLPTFGLASYKFKISFWN-STGAEECPKANTLWQDADNWLRS

Query:  LNVNHPDYKFFASHN-SFWR
        L V  PD++ F SH+ S WR
Subjt:  LNVNHPDYKFFASHN-SFWR

AT5G49220.1 Protein of unknown function (DUF789)1.6e-9248.99Show/hide
Query:  MSVSGGVSIAR--IRGENRFYHPPAMRRRLQQQQQQQQQQQQQKQSALDSKEVVATSAR----------------IDELEKR---SEFDECRSWSTRSDC
        MS SGGVSIAR  IRGENRFY+PP MRR   QQ+ Q QQQ ++KQ   D  EV+    R                + E + R   S  + C   S  S  
Subjt:  MSVSGGVSIAR--IRGENRFYHPPAMRRRLQQQQQQQQQQQQQKQSALDSKEVVATSAR----------------IDELEKR---SEFDECRSWSTRSDC

Query:  SVSDRGVADSTNLDRFLEHTTPLVPAQCIPKTSLRGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGI-----PLLLNGSDSVVQYYVPYLSGIQLYVDP
        S S R ++D +NLDRFLEHTTP+VPA+  P  S    + RE S+   YFVL DLWESF EWSAYGAG+     PL ++G+DS VQYYVPYLSGIQLYVDP
Subjt:  SVSDRGVADSTNLDRFLEHTTPLVPAQCIPKTSLRGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGI-----PLLLNGSDSVVQYYVPYLSGIQLYVDP

Query:  SKSSALSRRRGADSDAESSKETSSDGSS---NCGAEKKTKTALQDEWIQDSSVLGSQRALQMSVPSAESSSDESDSCYRQGQLVFEYLERDPPFCREPLT
             L + R    D E S E SS+  +   +    +  + +L+D+     S+ GS             SS E++    QG+L+FEYLE +PPF REPL 
Subjt:  SKSSALSRRRGADSDAESSKETSSDGSS---NCGAEKKTKTALQDEWIQDSSVLGSQRALQMSVPSAESSSDESDSCYRQGQLVFEYLERDPPFCREPLT

Query:  DKITILSSRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFLGISTDGLQFHWPRVREVYTADC--PLKLQLPTFGLASYK
        +KI+ L+SR PEL TYRSCDL PSSW+SV+WYPIYRIP GPTLQ+LDACFLTFHSLSTA             P    +  +D     KL LPTFGLASYK
Subjt:  DKITILSSRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFLGISTDGLQFHWPRVREVYTADC--PLKLQLPTFGLASYK

Query:  FKISFWNSTGAEECPKANTLWQDADNWLRSLNVNHPDYKFFASHN
         K+S WN    +E  K  +L Q AD WL+ L V+HPDY+FF S++
Subjt:  FKISFWNSTGAEECPKANTLWQDADNWLRSLNVNHPDYKFFASHN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAGTCTCCGGTGGGGTTTCGATTGCCCGAATCCGTGGCGAGAATCGGTTCTATCATCCACCTGCGATGCGGCGTCGTTTGCAGCAGCAGCAGCAGCAACAACAACA
GCAACAACAGCAGAAGCAGAGCGCCTTGGATTCTAAGGAGGTTGTTGCTACTTCTGCGAGGATCGATGAGTTGGAGAAGAGGAGTGAGTTCGATGAGTGTCGTTCTTGGT
CCACTCGCTCTGATTGCTCTGTTTCGGATCGTGGAGTTGCTGATTCTACTAATTTGGATCGGTTCTTGGAGCACACTACTCCCCTTGTTCCGGCTCAATGTATTCCTAAG
ACGAGCCTGAGGGGATGGAGAAATCGTGAAGTCTCAGAGGCACCTCCTTATTTTGTGCTTGGCGATCTCTGGGAATCTTTCAAGGAATGGAGTGCATATGGAGCCGGTAT
CCCTCTATTGTTAAATGGTAGCGACTCTGTAGTACAGTACTACGTTCCATATCTGTCTGGCATTCAACTCTATGTAGATCCATCAAAGTCGTCTGCCCTAAGTAGAAGGC
GTGGTGCAGATAGTGATGCTGAGTCCTCAAAGGAAACAAGCAGTGATGGAAGCAGTAATTGTGGGGCAGAAAAAAAAACGAAGACTGCTCTTCAGGATGAGTGGATCCAG
GACTCCAGTGTTCTGGGGTCACAAAGAGCTCTTCAAATGAGTGTACCTTCTGCCGAGTCATCAAGTGATGAAAGTGACTCTTGCTACCGTCAAGGTCAGCTTGTGTTTGA
ATACTTGGAGCGCGATCCACCATTTTGCCGTGAACCATTAACTGATAAGATCACTATTCTTTCATCTCGTTTTCCTGAATTAAAGACATATAGAAGCTGTGATTTATCTC
CTTCCAGTTGGATATCTGTGGCCTGGTATCCAATTTATAGGATTCCCACGGGTCCAACTCTACAAAGTCTAGATGCTTGTTTTTTGACCTTCCATTCTCTGTCAACAGCA
TTTCTAGGCATCAGCACCGATGGGTTGCAATTCCATTGGCCGAGAGTTAGAGAGGTGTACACTGCGGATTGCCCTCTCAAACTACAGTTGCCAACATTTGGACTTGCTTC
CTACAAGTTCAAAATATCTTTTTGGAATTCAACTGGTGCGGAGGAATGTCCCAAGGCTAATACTTTGTGGCAAGATGCTGACAACTGGCTCAGGTCATTAAACGTGAACC
ATCCTGATTACAAATTTTTCGCATCTCATAATTCATTCTGGAGATGA
mRNA sequenceShow/hide mRNA sequence
GCAATGTCAGTCTCCGGTGGGGTTTCGATTGCCCGAATCCGTGGCGAGAATCGGTTCTATCATCCACCTGCGATGCGGCGTCGTTTGCAGCAGCAGCAGCAGCAACAACA
ACAGCAACAACAGCAGAAGCAGAGCGCCTTGGATTCTAAGGAGGTTGTTGCTACTTCTGCGAGGATCGATGAGTTGGAGAAGAGGAGTGAGTTCGATGAGTGTCGTTCTT
GGTCCACTCGCTCTGATTGCTCTGTTTCGGATCGTGGAGTTGCTGATTCTACTAATTTGGATCGGTTCTTGGAGCACACTACTCCCCTTGTTCCGGCTCAATGTATTCCT
AAGACGAGCCTGAGGGGATGGAGAAATCGTGAAGTCTCAGAGGCACCTCCTTATTTTGTGCTTGGCGATCTCTGGGAATCTTTCAAGGAATGGAGTGCATATGGAGCCGG
TATCCCTCTATTGTTAAATGGTAGCGACTCTGTAGTACAGTACTACGTTCCATATCTGTCTGGCATTCAACTCTATGTAGATCCATCAAAGTCGTCTGCCCTAAGTAGAA
GGCGTGGTGCAGATAGTGATGCTGAGTCCTCAAAGGAAACAAGCAGTGATGGAAGCAGTAATTGTGGGGCAGAAAAAAAAACGAAGACTGCTCTTCAGGATGAGTGGATC
CAGGACTCCAGTGTTCTGGGGTCACAAAGAGCTCTTCAAATGAGTGTACCTTCTGCCGAGTCATCAAGTGATGAAAGTGACTCTTGCTACCGTCAAGGTCAGCTTGTGTT
TGAATACTTGGAGCGCGATCCACCATTTTGCCGTGAACCATTAACTGATAAGATCACTATTCTTTCATCTCGTTTTCCTGAATTAAAGACATATAGAAGCTGTGATTTAT
CTCCTTCCAGTTGGATATCTGTGGCCTGGTATCCAATTTATAGGATTCCCACGGGTCCAACTCTACAAAGTCTAGATGCTTGTTTTTTGACCTTCCATTCTCTGTCAACA
GCATTTCTAGGCATCAGCACCGATGGGTTGCAATTCCATTGGCCGAGAGTTAGAGAGGTGTACACTGCGGATTGCCCTCTCAAACTACAGTTGCCAACATTTGGACTTGC
TTCCTACAAGTTCAAAATATCTTTTTGGAATTCAACTGGTGCGGAGGAATGTCCCAAGGCTAATACTTTGTGGCAAGATGCTGACAACTGGCTCAGGTCATTAAACGTGA
ACCATCCTGATTACAAATTTTTCGCATCTCATAATTCATTCTGGAGATGATAAGGATATTATGAATGTGGGATTACAGTGTCTTAAGTAAGGGGACTTAAGTCCAAAGAA
ACTCGCTTCTTCTGAATGTCGTGGAAAATTTATGTGGCATCTTTGGGTTTTTTTTGAATTTCTATTTTGACCATATAAAGGGTTTTGTACAGGGAGAATGATGTTGATAG
TAACATTACGGTTGAAATGGGGATTGGCTTTAGGATAGCTATTATTCTGGGATCGTCAATCCCCTATGGTATCTAGTCTATTCTTCCATGTGTTGAGATATGAGTGAGAT
TATGGGAATATAGTGGTTGTGGTCAGCAGAGAAACTGCTGTCAATAACAACACCGCCCCCTTCCCTTTTTTTTTTTTTTTCAAATTGTGGGTTAGATTTGTAACTTTCAC
CGGCACCAAGATAATGATGAAAAGTATATCTAGTATGTTAACTACAATCTCGATGTATTAACAGAGTATTTAACAGTTCATGTGATAAGGTGTAATTGCTGCTCGGTTTT
AAGTGTATGTTGAATATGTC
Protein sequenceShow/hide protein sequence
MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQKQSALDSKEVVATSARIDELEKRSEFDECRSWSTRSDCSVSDRGVADSTNLDRFLEHTTPLVPAQCIPK
TSLRGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSKETSSDGSSNCGAEKKTKTALQDEWIQ
DSSVLGSQRALQMSVPSAESSSDESDSCYRQGQLVFEYLERDPPFCREPLTDKITILSSRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTA
FLGISTDGLQFHWPRVREVYTADCPLKLQLPTFGLASYKFKISFWNSTGAEECPKANTLWQDADNWLRSLNVNHPDYKFFASHNSFWR