; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0020722 (gene) of Snake gourd v1 genome

Gene IDTan0020722
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionSmall multi-drug export protein
Genome locationLG04:10366906..10369525
RNA-Seq ExpressionTan0020722
SyntenyTan0020722
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR009577 - Putative small multi-drug export


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7013172.1 hypothetical protein SDJN02_25928 [Cucurbita argyrosperma subsp. argyrosperma]5.9e-15091.05Show/hide
Query:  MAISSALTSPLMSAFSARKTLISLNLNRPSISQSKQSLRWSSPCVNVRHFNCLNPVFSTSRMSCTVARSSSNEFLEDDDILPSYEEKPVKVLLLVLFWAS
        MA S  LTSPLMSAFSARKTLISLNLNRPSISQ KQSL  SS C+N+RHFNC NPVFSTSRMSCTV R SSNEFLE DDILPS+EEKPVKVLLLVLFWAS
Subjt:  MAISSALTSPLMSAFSARKTLISLNLNRPSISQSKQSLRWSSPCVNVRHFNCLNPVFSTSRMSCTVARSSSNEFLEDDDILPSYEEKPVKVLLLVLFWAS

Query:  LSFAWFAASGDAKAAVDSIRASNFGLKIASTLQRSGWPAEAVVFALATLPVIELRGAIPVGYWMQLNPVALTVLSVLGNMVPVPFIILYLKKIATFLAGR
        LS AWFAASGDAKAAVDSIRASNFGLKIAS LQ SGWPAEA+VFALATLPV+ELRGAIPVGYWMQL PVALTVLSVLGNMVPVPFIILYLKK ATFLAGR
Subjt:  LSFAWFAASGDAKAAVDSIRASNFGLKIASTLQRSGWPAEAVVFALATLPVIELRGAIPVGYWMQLNPVALTVLSVLGNMVPVPFIILYLKKIATFLAGR

Query:  NATASRFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFIGVVLAGLLVNLLVNLGLKEAIVTGVILFIISTFM
        NATAS+FLDMLFKRAK KAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSG+SANF GVVLAGLLVNLLVNLGLKEA+ TGVILFIISTFM
Subjt:  NATASRFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFIGVVLAGLLVNLLVNLGLKEAIVTGVILFIISTFM

Query:  WSILRLIRKAFRK
        WSILRLIRKAF K
Subjt:  WSILRLIRKAFRK

XP_004135199.1 uncharacterized protein LOC101204187 [Cucumis sativus]1.7e-14187.22Show/hide
Query:  MAISSALTSPLMSAFSARKTLISLNLNRPSISQSKQSLRWSSPCVNVRHFNCLNPVFSTSRMSCTVARSSSNEFLEDDDILPSYEEKPVKVLLLVLFWAS
        M  S  LTSPL SAFS RKTL SL LNRPSI++S QSL +SSP VNV HFNC +PV  TSR+  TV RSSSN FLEDD+I+PS+EEKPVKVLLLVLFWAS
Subjt:  MAISSALTSPLMSAFSARKTLISLNLNRPSISQSKQSLRWSSPCVNVRHFNCLNPVFSTSRMSCTVARSSSNEFLEDDDILPSYEEKPVKVLLLVLFWAS

Query:  LSFAWFAASGDAKAAVDSIRASNFGLKIASTLQRSGWPAEAVVFALATLPVIELRGAIPVGYWMQLNPVALTVLSVLGNMVPVPFIILYLKKIATFLAGR
        LS AWFAASGDAKAAVDSIRASNFGLKIAS LQ SGWPAEAVVFALATLPVIELRGAIPVGYWMQL PVALTVLSVLGNMVPVPFIILYLKK ATFLAGR
Subjt:  LSFAWFAASGDAKAAVDSIRASNFGLKIASTLQRSGWPAEAVVFALATLPVIELRGAIPVGYWMQLNPVALTVLSVLGNMVPVPFIILYLKKIATFLAGR

Query:  NATASRFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFIGVVLAGLLVNLLVNLGLKEAIVTGVILFIISTFM
        NA+AS+FLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANF GVV+AGLLVNLLVNLGLKEAIVTGVILFIISTFM
Subjt:  NATASRFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFIGVVLAGLLVNLLVNLGLKEAIVTGVILFIISTFM

Query:  WSILRLIRKAFRK
        WSILR+I+K+F K
Subjt:  WSILRLIRKAFRK

XP_022944972.1 uncharacterized protein LOC111449349 [Cucurbita moschata]8.5e-14990.1Show/hide
Query:  MAISSALTSPLMSAFSARKTLISLNLNRPSISQSKQSLRWSSPCVNVRHFNCLNPVFSTSRMSCTVARSSSNEFLEDDDILPSYEEKPVKVLLLVLFWAS
        MA S  LTSPLMSAFSARKTLISLNLNRPSISQ  QSL  SS C+N+RHFNC NPVFSTSR+SCTV R SSNEFLE DDILPS+EEKPVKVLLLVLFWAS
Subjt:  MAISSALTSPLMSAFSARKTLISLNLNRPSISQSKQSLRWSSPCVNVRHFNCLNPVFSTSRMSCTVARSSSNEFLEDDDILPSYEEKPVKVLLLVLFWAS

Query:  LSFAWFAASGDAKAAVDSIRASNFGLKIASTLQRSGWPAEAVVFALATLPVIELRGAIPVGYWMQLNPVALTVLSVLGNMVPVPFIILYLKKIATFLAGR
        LS AWFAASGDAKAAVDSIRASNFGLKIAS LQ SGWPAEA+VFALATLPV+ELRGAIPVGYWMQL PVALTVLSVLGNMVPVPFIILYLKK+ATFLAGR
Subjt:  LSFAWFAASGDAKAAVDSIRASNFGLKIASTLQRSGWPAEAVVFALATLPVIELRGAIPVGYWMQLNPVALTVLSVLGNMVPVPFIILYLKKIATFLAGR

Query:  NATASRFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFIGVVLAGLLVNLLVNLGLKEAIVTGVILFIISTFM
        NATAS+FLDMLFKRAK KAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSG+SANF GVVLAGLLVNLLVNLGLKEA+ TGVILFIISTFM
Subjt:  NATASRFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFIGVVLAGLLVNLLVNLGLKEAIVTGVILFIISTFM

Query:  WSILRLIRKAFRK
        WSILRLI+KAF K
Subjt:  WSILRLIRKAFRK

XP_022968257.1 uncharacterized protein LOC111467549 [Cucurbita maxima]1.0e-14689.49Show/hide
Query:  MAISSALTSPLMSAFSARKTLISLNLNRPSISQSKQSLRWSSPCVNVRHFNCLNPVFSTSRMSCTVARSSSNEFLEDDDILPSYEEKPVKV-LLLVLFWA
        MA S  LTSPLMSAFSARKTLISLNLNRPSI+  KQ L  SS C+N+RHFNC NPVFSTSRMSCTV R SSNEFLE DDILPS+EEKPVKV +LLVLFWA
Subjt:  MAISSALTSPLMSAFSARKTLISLNLNRPSISQSKQSLRWSSPCVNVRHFNCLNPVFSTSRMSCTVARSSSNEFLEDDDILPSYEEKPVKV-LLLVLFWA

Query:  SLSFAWFAASGDAKAAVDSIRASNFGLKIASTLQRSGWPAEAVVFALATLPVIELRGAIPVGYWMQLNPVALTVLSVLGNMVPVPFIILYLKKIATFLAG
        SLS AWFAASGDAKAAVDSIRASNFGLKIAS LQ SGWPAEA+VFALATLPV+ELRGAIPVGYWMQL PVALTVLSVLGNMVPVPFIILYLKK ATFLAG
Subjt:  SLSFAWFAASGDAKAAVDSIRASNFGLKIASTLQRSGWPAEAVVFALATLPVIELRGAIPVGYWMQLNPVALTVLSVLGNMVPVPFIILYLKKIATFLAG

Query:  RNATASRFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFIGVVLAGLLVNLLVNLGLKEAIVTGVILFIISTF
        RNATAS+FLDMLFKRAK KAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSG+SANF GVVLAGLLVNLLVNLGLKEA+ TGVILFIISTF
Subjt:  RNATASRFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFIGVVLAGLLVNLLVNLGLKEAIVTGVILFIISTF

Query:  MWSILRLIRKAFRK
        MWSILRLIRKAF K
Subjt:  MWSILRLIRKAFRK

XP_023542277.1 uncharacterized protein LOC111802220 [Cucurbita pepo subsp. pepo]8.5e-14990.42Show/hide
Query:  MAISSALTSPLMSAFSARKTLISLNLNRPSISQSKQSLRWSSPCVNVRHFNCLNPVFSTSRMSCTVARSSSNEFLEDDDILPSYEEKPVKVLLLVLFWAS
        MA S  LTSPLMSAFSARKTLISLNLNRPSISQ KQ L  SS C+N+RHFNC NPVFSTSRMSCTV R SSNEFLE DDILPS+EEKPVKVLLLVLFWAS
Subjt:  MAISSALTSPLMSAFSARKTLISLNLNRPSISQSKQSLRWSSPCVNVRHFNCLNPVFSTSRMSCTVARSSSNEFLEDDDILPSYEEKPVKVLLLVLFWAS

Query:  LSFAWFAASGDAKAAVDSIRASNFGLKIASTLQRSGWPAEAVVFALATLPVIELRGAIPVGYWMQLNPVALTVLSVLGNMVPVPFIILYLKKIATFLAGR
        LS AWFAASGDAKAAVDSIRASNFGLKIAS LQ SGWPAEA+VFALATLPV+ELRGAIPVGYWMQL PVALTVLSVLGNMVPVPFIILYLKK ATFLAGR
Subjt:  LSFAWFAASGDAKAAVDSIRASNFGLKIASTLQRSGWPAEAVVFALATLPVIELRGAIPVGYWMQLNPVALTVLSVLGNMVPVPFIILYLKKIATFLAGR

Query:  NATASRFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFIGVVLAGLLVNLLVNLGLKEAIVTGVILFIISTFM
        NATAS+FLDMLFKRAK KAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSG+SANF GVVLAGLLVNLLVNLGLKEA+ TGVILFIISTFM
Subjt:  NATASRFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFIGVVLAGLLVNLLVNLGLKEAIVTGVILFIISTFM

Query:  WSILRLIRKAFRK
        WSILRLI+KAF K
Subjt:  WSILRLIRKAFRK

TrEMBL top hitse value%identityAlignment
A0A0A0KQH3 Uncharacterized protein8.4e-14287.22Show/hide
Query:  MAISSALTSPLMSAFSARKTLISLNLNRPSISQSKQSLRWSSPCVNVRHFNCLNPVFSTSRMSCTVARSSSNEFLEDDDILPSYEEKPVKVLLLVLFWAS
        M  S  LTSPL SAFS RKTL SL LNRPSI++S QSL +SSP VNV HFNC +PV  TSR+  TV RSSSN FLEDD+I+PS+EEKPVKVLLLVLFWAS
Subjt:  MAISSALTSPLMSAFSARKTLISLNLNRPSISQSKQSLRWSSPCVNVRHFNCLNPVFSTSRMSCTVARSSSNEFLEDDDILPSYEEKPVKVLLLVLFWAS

Query:  LSFAWFAASGDAKAAVDSIRASNFGLKIASTLQRSGWPAEAVVFALATLPVIELRGAIPVGYWMQLNPVALTVLSVLGNMVPVPFIILYLKKIATFLAGR
        LS AWFAASGDAKAAVDSIRASNFGLKIAS LQ SGWPAEAVVFALATLPVIELRGAIPVGYWMQL PVALTVLSVLGNMVPVPFIILYLKK ATFLAGR
Subjt:  LSFAWFAASGDAKAAVDSIRASNFGLKIASTLQRSGWPAEAVVFALATLPVIELRGAIPVGYWMQLNPVALTVLSVLGNMVPVPFIILYLKKIATFLAGR

Query:  NATASRFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFIGVVLAGLLVNLLVNLGLKEAIVTGVILFIISTFM
        NA+AS+FLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANF GVV+AGLLVNLLVNLGLKEAIVTGVILFIISTFM
Subjt:  NATASRFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFIGVVLAGLLVNLLVNLGLKEAIVTGVILFIISTFM

Query:  WSILRLIRKAFRK
        WSILR+I+K+F K
Subjt:  WSILRLIRKAFRK

A0A5A7SV67 Sm_multidrug_ex domain-containing protein1.0e-13985.94Show/hide
Query:  MAISSALTSPLMSAFSARKTLISLNLNRPSISQSKQSLRWSSPCVNVRHFNCLNPVFSTSRMSCTVARSSSNEFLEDDDILPSYEEKPVKVLLLVLFWAS
        M  S  LTSPL SAFS RKTL SL LNRPSI+QS  SL +SSP VNV H NC +PV STSR+  TV RSSSN FLEDD+I+PS+EEKP+KVL+LVLFWAS
Subjt:  MAISSALTSPLMSAFSARKTLISLNLNRPSISQSKQSLRWSSPCVNVRHFNCLNPVFSTSRMSCTVARSSSNEFLEDDDILPSYEEKPVKVLLLVLFWAS

Query:  LSFAWFAASGDAKAAVDSIRASNFGLKIASTLQRSGWPAEAVVFALATLPVIELRGAIPVGYWMQLNPVALTVLSVLGNMVPVPFIILYLKKIATFLAGR
        LS AWFAASGDAKAAVDSIRASNFGLKIAS LQ SGWPAEAVVFALATLPVIELRGAIPVGYWMQL PV LTVLSVLGNMVPVPFIILYLKK ATFLAGR
Subjt:  LSFAWFAASGDAKAAVDSIRASNFGLKIASTLQRSGWPAEAVVFALATLPVIELRGAIPVGYWMQLNPVALTVLSVLGNMVPVPFIILYLKKIATFLAGR

Query:  NATASRFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFIGVVLAGLLVNLLVNLGLKEAIVTGVILFIISTFM
        NA+AS+FLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANF GVV+AGLLVNLLVNLGLKEAIVTG ILFIISTFM
Subjt:  NATASRFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFIGVVLAGLLVNLLVNLGLKEAIVTGVILFIISTFM

Query:  WSILRLIRKAFRK
        WSILR+I+K+F K
Subjt:  WSILRLIRKAFRK

A0A6J1DB35 uncharacterized protein LOC1110190666.0e-14085.3Show/hide
Query:  MAISSALTSPLMSAFSARKTLISLNLNRPSISQSKQSLRWSSPCVNVRHFNCLNPVFSTSRMSCTVARSSSNEFLEDDDILPSYEEKPVKVLLLVLFWAS
        M  S A T P+MSAFS RKT I L LNRPS+SQSKQSL  SSPC+NVRHFN  +P+F+TSR+  TV R+ SN F+E+DDI+PS+EEKPVK+LLLVLFWAS
Subjt:  MAISSALTSPLMSAFSARKTLISLNLNRPSISQSKQSLRWSSPCVNVRHFNCLNPVFSTSRMSCTVARSSSNEFLEDDDILPSYEEKPVKVLLLVLFWAS

Query:  LSFAWFAASGDAKAAVDSIRASNFGLKIASTLQRSGWPAEAVVFALATLPVIELRGAIPVGYWMQLNPVALTVLSVLGNMVPVPFIILYLKKIATFLAGR
        LS +WFAASGDAKAA DSIRASNFGLKIA+TL+ SGWPAEAVVFALATLPVIELRGAIPVGYWMQL PVALTVLSVLGNMVPVP IILYLKK ATFLAGR
Subjt:  LSFAWFAASGDAKAAVDSIRASNFGLKIASTLQRSGWPAEAVVFALATLPVIELRGAIPVGYWMQLNPVALTVLSVLGNMVPVPFIILYLKKIATFLAGR

Query:  NATASRFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFIGVVLAGLLVNLLVNLGLKEAIVTGVILFIISTFM
        NA+ASRFLDMLFKR KEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANF GVVLAGLLVNLLVNLGLKEAIVTGV LFI+STFM
Subjt:  NATASRFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFIGVVLAGLLVNLLVNLGLKEAIVTGVILFIISTFM

Query:  WSILRLIRKAFRK
        WSILRLI KAFRK
Subjt:  WSILRLIRKAFRK

A0A6J1FZI4 uncharacterized protein LOC1114493494.1e-14990.1Show/hide
Query:  MAISSALTSPLMSAFSARKTLISLNLNRPSISQSKQSLRWSSPCVNVRHFNCLNPVFSTSRMSCTVARSSSNEFLEDDDILPSYEEKPVKVLLLVLFWAS
        MA S  LTSPLMSAFSARKTLISLNLNRPSISQ  QSL  SS C+N+RHFNC NPVFSTSR+SCTV R SSNEFLE DDILPS+EEKPVKVLLLVLFWAS
Subjt:  MAISSALTSPLMSAFSARKTLISLNLNRPSISQSKQSLRWSSPCVNVRHFNCLNPVFSTSRMSCTVARSSSNEFLEDDDILPSYEEKPVKVLLLVLFWAS

Query:  LSFAWFAASGDAKAAVDSIRASNFGLKIASTLQRSGWPAEAVVFALATLPVIELRGAIPVGYWMQLNPVALTVLSVLGNMVPVPFIILYLKKIATFLAGR
        LS AWFAASGDAKAAVDSIRASNFGLKIAS LQ SGWPAEA+VFALATLPV+ELRGAIPVGYWMQL PVALTVLSVLGNMVPVPFIILYLKK+ATFLAGR
Subjt:  LSFAWFAASGDAKAAVDSIRASNFGLKIASTLQRSGWPAEAVVFALATLPVIELRGAIPVGYWMQLNPVALTVLSVLGNMVPVPFIILYLKKIATFLAGR

Query:  NATASRFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFIGVVLAGLLVNLLVNLGLKEAIVTGVILFIISTFM
        NATAS+FLDMLFKRAK KAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSG+SANF GVVLAGLLVNLLVNLGLKEA+ TGVILFIISTFM
Subjt:  NATASRFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFIGVVLAGLLVNLLVNLGLKEAIVTGVILFIISTFM

Query:  WSILRLIRKAFRK
        WSILRLI+KAF K
Subjt:  WSILRLIRKAFRK

A0A6J1HZ51 uncharacterized protein LOC1114675495.1e-14789.49Show/hide
Query:  MAISSALTSPLMSAFSARKTLISLNLNRPSISQSKQSLRWSSPCVNVRHFNCLNPVFSTSRMSCTVARSSSNEFLEDDDILPSYEEKPVKV-LLLVLFWA
        MA S  LTSPLMSAFSARKTLISLNLNRPSI+  KQ L  SS C+N+RHFNC NPVFSTSRMSCTV R SSNEFLE DDILPS+EEKPVKV +LLVLFWA
Subjt:  MAISSALTSPLMSAFSARKTLISLNLNRPSISQSKQSLRWSSPCVNVRHFNCLNPVFSTSRMSCTVARSSSNEFLEDDDILPSYEEKPVKV-LLLVLFWA

Query:  SLSFAWFAASGDAKAAVDSIRASNFGLKIASTLQRSGWPAEAVVFALATLPVIELRGAIPVGYWMQLNPVALTVLSVLGNMVPVPFIILYLKKIATFLAG
        SLS AWFAASGDAKAAVDSIRASNFGLKIAS LQ SGWPAEA+VFALATLPV+ELRGAIPVGYWMQL PVALTVLSVLGNMVPVPFIILYLKK ATFLAG
Subjt:  SLSFAWFAASGDAKAAVDSIRASNFGLKIASTLQRSGWPAEAVVFALATLPVIELRGAIPVGYWMQLNPVALTVLSVLGNMVPVPFIILYLKKIATFLAG

Query:  RNATASRFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFIGVVLAGLLVNLLVNLGLKEAIVTGVILFIISTF
        RNATAS+FLDMLFKRAK KAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSG+SANF GVVLAGLLVNLLVNLGLKEA+ TGVILFIISTF
Subjt:  RNATASRFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFIGVVLAGLLVNLLVNLGLKEAIVTGVILFIISTF

Query:  MWSILRLIRKAFRK
        MWSILRLIRKAF K
Subjt:  MWSILRLIRKAFRK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G02590.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Putative small multi-drug export (InterPro:IPR009577); Has 405 Blast hits to 405 proteins in 185 species: Archae - 65; Bacteria - 295; Metazoa - 0; Fungi - 0; Plants - 23; Viruses - 0; Other Eukaryotes - 22 (source: NCBI BLink).6.9e-9661.73Show/hide
Query:  MAISS--ALTSPLMSAFSARKTLISLNLNRPSISQSKQSLRWSSPCVNVRHFNC-LNPVFSTSRMSCTVARSSSNEFL---EDDD------ILPSYEEKP
        MAIS+  +++S  +S+ S+ KT +  ++    +     S +W      +R      + +F   R + T   SS + FL   +DD+       LPS    P
Subjt:  MAISS--ALTSPLMSAFSARKTLISLNLNRPSISQSKQSLRWSSPCVNVRHFNC-LNPVFSTSRMSCTVARSSSNEFL---EDDD------ILPSYEEKP

Query:  VKVLLLVLFWASLSFAWFAASGDAKAAVDSIRASNFGLKIASTLQRSGWPAEAVVFALATLPVIELRGAIPVGYWMQLNPVALTVLSVLGNMVPVPFIIL
        VK  + V+ WAS S  WFA SGDAKAA DSI++S+FGL+IASTL+R GWP EAVVFALATLPVIELRGAIPVGYWMQL PV LT  SVLGNMVPVPFI+L
Subjt:  VKVLLLVLFWASLSFAWFAASGDAKAAVDSIRASNFGLKIASTLQRSGWPAEAVVFALATLPVIELRGAIPVGYWMQLNPVALTVLSVLGNMVPVPFIIL

Query:  YLKKIATFLAGRNATASRFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFIGVVLAGLLVNLLVNLGLKEAIV
        YLK  A+F+AG++ TAS+ LD+LFKRAKEKA PVEEF+WLGLMLFVAVPFPGTGAWTGAIIASILDMPFWS VS+NF GVVLAGLLVNLLVNLGLK+AIV
Subjt:  YLKKIATFLAGRNATASRFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFIGVVLAGLLVNLLVNLGLKEAIV

Query:  TGVILFIISTFMWSILRLIRKAFR
         G+ LF +STFMWS+LR IRK+ +
Subjt:  TGVILFIISTFMWSILRLIRKAFR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTATTTCTTCAGCATTGACTTCACCATTAATGTCGGCATTTTCGGCGAGAAAGACCCTTATCTCGCTCAATCTCAATCGACCCTCCATTAGTCAGAGTAAACAATC
TCTCCGTTGGTCGAGTCCATGTGTAAACGTTCGCCATTTCAACTGTTTGAATCCTGTTTTCTCGACTTCTCGGATGTCTTGTACTGTCGCTCGAAGTTCATCAAATGAGT
TTCTGGAAGATGATGACATTTTGCCCTCTTATGAGGAGAAGCCGGTTAAAGTTCTACTGTTGGTTCTGTTTTGGGCTTCTTTATCCTTTGCTTGGTTTGCTGCTTCTGGG
GATGCCAAAGCTGCTGTTGATTCTATCAGAGCTTCGAATTTTGGCCTAAAGATCGCCAGCACATTGCAGCGCTCAGGCTGGCCTGCCGAGGCTGTAGTATTTGCTCTCGC
TACGCTTCCGGTAATTGAGCTCCGTGGGGCGATCCCTGTTGGTTACTGGATGCAGCTTAATCCTGTAGCGCTAACCGTTCTATCCGTTCTTGGGAACATGGTTCCTGTGC
CTTTCATTATACTCTATTTAAAAAAAATAGCAACTTTCCTTGCGGGAAGGAATGCTACTGCCTCTCGATTCCTCGATATGTTATTCAAGAGGGCTAAAGAGAAAGCTGCA
CCTGTTGAAGAGTTTCAATGGCTCGGACTAATGCTATTCGTGGCTGTGCCTTTCCCTGGAACAGGAGCTTGGACTGGTGCCATAATAGCTTCCATCCTAGATATGCCATT
CTGGTCAGGTGTTTCTGCAAATTTCATTGGTGTTGTATTAGCAGGGCTTCTGGTGAACTTGTTGGTGAATCTTGGTCTTAAGGAGGCCATTGTCACTGGAGTAATTCTTT
TCATTATATCAACTTTCATGTGGAGCATTCTTCGACTGATTAGAAAAGCTTTCAGAAAATGA
mRNA sequenceShow/hide mRNA sequence
GTTGCTTTGCTTCCCCAAATATTTCTTGTGTTCTTTTCCGATGTTCATTACGCATTCTCTCTTTATCCAAACAAAACCTCGGTAAATCCTCCCTCTTGTTGCTTATGCCG
TTTTCCTAACTCTCCTTCTGTACCCAATTTCCAATACATTCCTACTTTTTATAGGGTAAAATCAGCAATGCCCTTTCAATTCGTTCCAATTTGCTCGGAGGCAAGAATTC
AGCTCCAACCCTTATGATTGGCATCGGAGAAATGTCCTTTAAACCCACTTCCATTATCATCCTCTGAATCAATTTCCAATGCCATAATTCCACTCAGTCAAAGTGACTCG
GCAGAATTCCGCTTGAGCATAAGAGGGATTTTCTCAAAGAACTCAACCCCAGACGCGTTCTCCAGCTTCCTGTAAGGAATGGCTATTTCTTCAGCATTGACTTCACCATT
AATGTCGGCATTTTCGGCGAGAAAGACCCTTATCTCGCTCAATCTCAATCGACCCTCCATTAGTCAGAGTAAACAATCTCTCCGTTGGTCGAGTCCATGTGTAAACGTTC
GCCATTTCAACTGTTTGAATCCTGTTTTCTCGACTTCTCGGATGTCTTGTACTGTCGCTCGAAGTTCATCAAATGAGTTTCTGGAAGATGATGACATTTTGCCCTCTTAT
GAGGAGAAGCCGGTTAAAGTTCTACTGTTGGTTCTGTTTTGGGCTTCTTTATCCTTTGCTTGGTTTGCTGCTTCTGGGGATGCCAAAGCTGCTGTTGATTCTATCAGAGC
TTCGAATTTTGGCCTAAAGATCGCCAGCACATTGCAGCGCTCAGGCTGGCCTGCCGAGGCTGTAGTATTTGCTCTCGCTACGCTTCCGGTAATTGAGCTCCGTGGGGCGA
TCCCTGTTGGTTACTGGATGCAGCTTAATCCTGTAGCGCTAACCGTTCTATCCGTTCTTGGGAACATGGTTCCTGTGCCTTTCATTATACTCTATTTAAAAAAAATAGCA
ACTTTCCTTGCGGGAAGGAATGCTACTGCCTCTCGATTCCTCGATATGTTATTCAAGAGGGCTAAAGAGAAAGCTGCACCTGTTGAAGAGTTTCAATGGCTCGGACTAAT
GCTATTCGTGGCTGTGCCTTTCCCTGGAACAGGAGCTTGGACTGGTGCCATAATAGCTTCCATCCTAGATATGCCATTCTGGTCAGGTGTTTCTGCAAATTTCATTGGTG
TTGTATTAGCAGGGCTTCTGGTGAACTTGTTGGTGAATCTTGGTCTTAAGGAGGCCATTGTCACTGGAGTAATTCTTTTCATTATATCAACTTTCATGTGGAGCATTCTT
CGACTGATTAGAAAAGCTTTCAGAAAATGAATTAAATTGAAAAGGTTAGGTATTTCTATGTCTATTGACAATGAACAAGGTCGTGTTTGATCTTAGTCTTATTCATCTTG
ACTTAGACTTGTTAGTTTAAGTTGGCAACTTTAGAGCTAAAGTTTGTCTTGTGAACACTGTTCAGAAGAGAAAATGTAGGCAAACAACTTTCATTTGTCATACTGAGCTG
AGAAATGCTGTGTAAATG
Protein sequenceShow/hide protein sequence
MAISSALTSPLMSAFSARKTLISLNLNRPSISQSKQSLRWSSPCVNVRHFNCLNPVFSTSRMSCTVARSSSNEFLEDDDILPSYEEKPVKVLLLVLFWASLSFAWFAASG
DAKAAVDSIRASNFGLKIASTLQRSGWPAEAVVFALATLPVIELRGAIPVGYWMQLNPVALTVLSVLGNMVPVPFIILYLKKIATFLAGRNATASRFLDMLFKRAKEKAA
PVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFIGVVLAGLLVNLLVNLGLKEAIVTGVILFIISTFMWSILRLIRKAFRK