CuGenDBv2

HG10009252 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10009252
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Protein of unknown function (DUF1442)
Genome location: Chr06:4022352..4023093
RNA-Seq Expression: HG10009252
Synteny: HG10009252
Gene Ontology terms: NA
InterPro domains: IPR009902 - Protein of unknown function DUF1442
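
The genome location field packs the chromosome, start and end into a single string. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state this), the field can be split and the span computed as follows. `parse_location` is a hypothetical helper written for illustration, not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split a string such as 'Chr06:4022352..4023093' into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

chrom, start, end = parse_location("Chr06:4022352..4023093")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 742 bp on Chr06.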


Homology
GenBank top hits (e-value, % identity, alignment)
XP_004147575.1 uncharacterized protein LOC101211926 isoform X2 [Cucumis sativus] | e-value 3.2e-110 | identity 95.45%
Query:  MKLVWSPETASKAYIDTVQSFCVKCDLHQESGVAELISAMAAGWSAQFIVETWSSGGAIATSIGLAVARRHVGGRHVCVVPDERSRGEYSRALERAGLSP
        MKLVWSPETASKAYIDTVQS    CDLHQESGVAELISAMAAGW+AQFIVETWS+GGAIATSIGLAVARRHVGGRHVCVVPDERSRGEYSRA+ERAGLSP
Subjt:  MKLVWSPETASKAYIDTVQSFCVKCDLHQESGVAELISAMAAGWSAQFIVETWSSGGAIATSIGLAVARRHVGGRHVCVVPDERSRGEYSRALERAGLSP

Query:  EVIVGEPEEVMEGLVGIDFLVVDSQRRNFNRVLKLANLSSRGAVLICKNANSRSDSSFRWKSVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGGG
        EVIVGEPEEVMEGLVGIDFLVVDSQRRNF+RVLKLANLSSRGAVLICKNANSRSDSSFRW SVT+NGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGGG
Subjt:  EVIVGEPEEVMEGLVGIDFLVVDSQRRNFNRVLKLANLSSRGAVLICKNANSRSDSSFRWKSVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGGG

Query:  KGKWIKHVDRRSGEEFVIRK
        KGKWIKHVDRRSGEEFVIRK
Subjt:  KGKWIKHVDRRSGEEFVIRK

XP_008437150.1 PREDICTED: uncharacterized protein LOC103482660 isoform X1 [Cucumis melo] | e-value 5.1e-116 | identity 98.18%
Query:  MKLVWSPETASKAYIDTVQSFCVKCDLHQESGVAELISAMAAGWSAQFIVETWSSGGAIATSIGLAVARRHVGGRHVCVVPDERSRGEYSRALERAGLSP
        MKLVWSPETASKAYIDTVQSFCVKCDLHQESGVAELISAMAAGW+AQFIVETWSSGGAIATSIGLAVARRHVGGRHVCVVPDERSRGEYSRA+ERAGLSP
Subjt:  MKLVWSPETASKAYIDTVQSFCVKCDLHQESGVAELISAMAAGWSAQFIVETWSSGGAIATSIGLAVARRHVGGRHVCVVPDERSRGEYSRALERAGLSP

Query:  EVIVGEPEEVMEGLVGIDFLVVDSQRRNFNRVLKLANLSSRGAVLICKNANSRSDSSFRWKSVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGGG
        EVIVGEPEEVMEGLVGIDFLVVDSQRRNF+RVLKLANLSSRGAVLICKNANSRSDSSFRWKSVT+NGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGGG
Subjt:  EVIVGEPEEVMEGLVGIDFLVVDSQRRNFNRVLKLANLSSRGAVLICKNANSRSDSSFRWKSVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGGG

Query:  KGKWIKHVDRRSGEEFVIRK
        KGKWIKHVDRRSGEEFVIRK
Subjt:  KGKWIKHVDRRSGEEFVIRK

XP_008437151.1 PREDICTED: uncharacterized protein LOC103482660 isoform X2 [Cucumis melo] | e-value 1.7e-111 | identity 96.36%
Query:  MKLVWSPETASKAYIDTVQSFCVKCDLHQESGVAELISAMAAGWSAQFIVETWSSGGAIATSIGLAVARRHVGGRHVCVVPDERSRGEYSRALERAGLSP
        MKLVWSPETASKAYIDTVQS    CDLHQESGVAELISAMAAGW+AQFIVETWSSGGAIATSIGLAVARRHVGGRHVCVVPDERSRGEYSRA+ERAGLSP
Subjt:  MKLVWSPETASKAYIDTVQSFCVKCDLHQESGVAELISAMAAGWSAQFIVETWSSGGAIATSIGLAVARRHVGGRHVCVVPDERSRGEYSRALERAGLSP

Query:  EVIVGEPEEVMEGLVGIDFLVVDSQRRNFNRVLKLANLSSRGAVLICKNANSRSDSSFRWKSVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGGG
        EVIVGEPEEVMEGLVGIDFLVVDSQRRNF+RVLKLANLSSRGAVLICKNANSRSDSSFRWKSVT+NGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGGG
Subjt:  EVIVGEPEEVMEGLVGIDFLVVDSQRRNFNRVLKLANLSSRGAVLICKNANSRSDSSFRWKSVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGGG

Query:  KGKWIKHVDRRSGEEFVIRK
        KGKWIKHVDRRSGEEFVIRK
Subjt:  KGKWIKHVDRRSGEEFVIRK

XP_011654785.1 uncharacterized protein LOC101211926 isoform X1 [Cucumis sativus] | e-value 9.7e-115 | identity 97.27%
Query:  MKLVWSPETASKAYIDTVQSFCVKCDLHQESGVAELISAMAAGWSAQFIVETWSSGGAIATSIGLAVARRHVGGRHVCVVPDERSRGEYSRALERAGLSP
        MKLVWSPETASKAYIDTVQSFCVKCDLHQESGVAELISAMAAGW+AQFIVETWS+GGAIATSIGLAVARRHVGGRHVCVVPDERSRGEYSRA+ERAGLSP
Subjt:  MKLVWSPETASKAYIDTVQSFCVKCDLHQESGVAELISAMAAGWSAQFIVETWSSGGAIATSIGLAVARRHVGGRHVCVVPDERSRGEYSRALERAGLSP

Query:  EVIVGEPEEVMEGLVGIDFLVVDSQRRNFNRVLKLANLSSRGAVLICKNANSRSDSSFRWKSVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGGG
        EVIVGEPEEVMEGLVGIDFLVVDSQRRNF+RVLKLANLSSRGAVLICKNANSRSDSSFRW SVT+NGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGGG
Subjt:  EVIVGEPEEVMEGLVGIDFLVVDSQRRNFNRVLKLANLSSRGAVLICKNANSRSDSSFRWKSVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGGG

Query:  KGKWIKHVDRRSGEEFVIRK
        KGKWIKHVDRRSGEEFVIRK
Subjt:  KGKWIKHVDRRSGEEFVIRK

XP_038875174.1 uncharacterized protein LOC120067703 isoform X1 [Benincasa hispida] | e-value 4.1e-113 | identity 95.91%
Query:  MKLVWSPETASKAYIDTVQSFCVKCDLHQESGVAELISAMAAGWSAQFIVETWSSGGAIATSIGLAVARRHVGGRHVCVVPDERSRGEYSRALERAGLSP
        MKLVWSPETASKAYI+TVQSF +KCDLHQESGVAELISAMAAGWSAQFI+ETWSSGGAIATSIGLAVARRHVGGRHVCVVPDERS GEYSRA+ERAGL P
Subjt:  MKLVWSPETASKAYIDTVQSFCVKCDLHQESGVAELISAMAAGWSAQFIVETWSSGGAIATSIGLAVARRHVGGRHVCVVPDERSRGEYSRALERAGLSP

Query:  EVIVGEPEEVMEGLVGIDFLVVDSQRRNFNRVLKLANLSSRGAVLICKNANSRSDSSFRWKSVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGGG
        EVIVGEPEEVMEGL+GIDFLVVDSQRRNF+RVLKLANLSSRGAVLICKNANSRSDSSFRWKSVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGGG
Subjt:  EVIVGEPEEVMEGLVGIDFLVVDSQRRNFNRVLKLANLSSRGAVLICKNANSRSDSSFRWKSVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGGG

Query:  KGKWIKHVDRRSGEEFVIRK
        KGKWIKHVDRRSGEEFVIRK
Subjt:  KGKWIKHVDRRSGEEFVIRK
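
The % identity values appear to follow from the middle row of each alignment, where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The sketch below assumes that convention and divides by the total aligned query length. `percent_identity` is a hypothetical helper, and the toy rows are illustrative rather than taken from the record.

```python
def percent_identity(query_rows, midline_rows):
    """Percent of aligned columns whose midline character is a letter (identical residue)."""
    aligned = sum(len(row) for row in query_rows)
    identical = sum(ch.isalpha() for row in midline_rows for ch in row)
    return round(100 * identical / aligned, 2)

# Toy alignment: 10 columns, one mismatch (space) and one conservative change (+).
toy_query = ["MKLVWSPETA"]
toy_midline = ["MKLV SPE+A"]
print(percent_identity(toy_query, toy_midline))  # 80.0
```

For XP_008437150.1, the midline shows four `+` positions over 220 aligned residues, and 216/220 rounds to the reported 98.18.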

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0KKR4 Uncharacterized protein | e-value 1.6e-110 | identity 95.45%
Query:  MKLVWSPETASKAYIDTVQSFCVKCDLHQESGVAELISAMAAGWSAQFIVETWSSGGAIATSIGLAVARRHVGGRHVCVVPDERSRGEYSRALERAGLSP
        MKLVWSPETASKAYIDTVQS    CDLHQESGVAELISAMAAGW+AQFIVETWS+GGAIATSIGLAVARRHVGGRHVCVVPDERSRGEYSRA+ERAGLSP
Subjt:  MKLVWSPETASKAYIDTVQSFCVKCDLHQESGVAELISAMAAGWSAQFIVETWSSGGAIATSIGLAVARRHVGGRHVCVVPDERSRGEYSRALERAGLSP

Query:  EVIVGEPEEVMEGLVGIDFLVVDSQRRNFNRVLKLANLSSRGAVLICKNANSRSDSSFRWKSVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGGG
        EVIVGEPEEVMEGLVGIDFLVVDSQRRNF+RVLKLANLSSRGAVLICKNANSRSDSSFRW SVT+NGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGGG
Subjt:  EVIVGEPEEVMEGLVGIDFLVVDSQRRNFNRVLKLANLSSRGAVLICKNANSRSDSSFRWKSVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGGG

Query:  KGKWIKHVDRRSGEEFVIRK
        KGKWIKHVDRRSGEEFVIRK
Subjt:  KGKWIKHVDRRSGEEFVIRK

A0A1S3ASZ1 uncharacterized protein LOC103482660 isoform X1 | e-value 2.5e-116 | identity 98.18%
Query:  MKLVWSPETASKAYIDTVQSFCVKCDLHQESGVAELISAMAAGWSAQFIVETWSSGGAIATSIGLAVARRHVGGRHVCVVPDERSRGEYSRALERAGLSP
        MKLVWSPETASKAYIDTVQSFCVKCDLHQESGVAELISAMAAGW+AQFIVETWSSGGAIATSIGLAVARRHVGGRHVCVVPDERSRGEYSRA+ERAGLSP
Subjt:  MKLVWSPETASKAYIDTVQSFCVKCDLHQESGVAELISAMAAGWSAQFIVETWSSGGAIATSIGLAVARRHVGGRHVCVVPDERSRGEYSRALERAGLSP

Query:  EVIVGEPEEVMEGLVGIDFLVVDSQRRNFNRVLKLANLSSRGAVLICKNANSRSDSSFRWKSVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGGG
        EVIVGEPEEVMEGLVGIDFLVVDSQRRNF+RVLKLANLSSRGAVLICKNANSRSDSSFRWKSVT+NGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGGG
Subjt:  EVIVGEPEEVMEGLVGIDFLVVDSQRRNFNRVLKLANLSSRGAVLICKNANSRSDSSFRWKSVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGGG

Query:  KGKWIKHVDRRSGEEFVIRK
        KGKWIKHVDRRSGEEFVIRK
Subjt:  KGKWIKHVDRRSGEEFVIRK

A0A1S3ATG0 uncharacterized protein LOC103482660 isoform X2 | e-value 8.3e-112 | identity 96.36%
Query:  MKLVWSPETASKAYIDTVQSFCVKCDLHQESGVAELISAMAAGWSAQFIVETWSSGGAIATSIGLAVARRHVGGRHVCVVPDERSRGEYSRALERAGLSP
        MKLVWSPETASKAYIDTVQS    CDLHQESGVAELISAMAAGW+AQFIVETWSSGGAIATSIGLAVARRHVGGRHVCVVPDERSRGEYSRA+ERAGLSP
Subjt:  MKLVWSPETASKAYIDTVQSFCVKCDLHQESGVAELISAMAAGWSAQFIVETWSSGGAIATSIGLAVARRHVGGRHVCVVPDERSRGEYSRALERAGLSP

Query:  EVIVGEPEEVMEGLVGIDFLVVDSQRRNFNRVLKLANLSSRGAVLICKNANSRSDSSFRWKSVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGGG
        EVIVGEPEEVMEGLVGIDFLVVDSQRRNF+RVLKLANLSSRGAVLICKNANSRSDSSFRWKSVT+NGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGGG
Subjt:  EVIVGEPEEVMEGLVGIDFLVVDSQRRNFNRVLKLANLSSRGAVLICKNANSRSDSSFRWKSVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGGG

Query:  KGKWIKHVDRRSGEEFVIRK
        KGKWIKHVDRRSGEEFVIRK
Subjt:  KGKWIKHVDRRSGEEFVIRK

A0A6J1E2X0 uncharacterized protein LOC111430333 isoform X1 | e-value 1.0e-109 | identity 94.09%
Query:  MKLVWSPETASKAYIDTVQSFCVKCDLHQESGVAELISAMAAGWSAQFIVETWSSGGAIATSIGLAVARRHVGGRHVCVVPDERSRGEYSRALERAGLSP
        MKLVWSPETASKAYIDTVQSF VKCDL QESGVAELISAMAAGWSAQFIVETWSSGGAIATSIGLAVARRHVGGRHVCVVPDERSRGEY RALERAG  P
Subjt:  MKLVWSPETASKAYIDTVQSFCVKCDLHQESGVAELISAMAAGWSAQFIVETWSSGGAIATSIGLAVARRHVGGRHVCVVPDERSRGEYSRALERAGLSP

Query:  EVIVGEPEEVMEGLVGIDFLVVDSQRRNFNRVLKLANLSSRGAVLICKNANSRSDSSFRWKSVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGGG
        EVIVGEPEEVM+GLVGIDF+VVDSQRRNF+RVLKLANLSSRGAVLICKNANSRSDSSFRWK+VTDNGTRRLVRS FLPVGKGLDIAHVAA G NSGSG G
Subjt:  EVIVGEPEEVMEGLVGIDFLVVDSQRRNFNRVLKLANLSSRGAVLICKNANSRSDSSFRWKSVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGGG

Query:  KGKWIKHVDRRSGEEFVIRK
        KGKWIKHVDRRSGEEFVIRK
Subjt:  KGKWIKHVDRRSGEEFVIRK

A0A6J1I360 uncharacterized protein LOC111469161 isoform X1 | e-value 1.7e-109 | identity 93.64%
Query:  MKLVWSPETASKAYIDTVQSFCVKCDLHQESGVAELISAMAAGWSAQFIVETWSSGGAIATSIGLAVARRHVGGRHVCVVPDERSRGEYSRALERAGLSP
        MKLVWSPETASKAYIDTVQSF VKCDL QESGVAELISAMAAGWSAQFIVETWSSGGAIATSIGLAVARRHVGGRHVCVVPDERSRGEY RA+ERAG  P
Subjt:  MKLVWSPETASKAYIDTVQSFCVKCDLHQESGVAELISAMAAGWSAQFIVETWSSGGAIATSIGLAVARRHVGGRHVCVVPDERSRGEYSRALERAGLSP

Query:  EVIVGEPEEVMEGLVGIDFLVVDSQRRNFNRVLKLANLSSRGAVLICKNANSRSDSSFRWKSVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGGG
        EVIVGEPEEVM+GLVGIDF+VVDSQRRNF+RVLKLANLSSRGAVLICKNANSRSDSSFRWK+VTDNGTRRLVRS FLPVGKGLDIAHVAA G NSGSG G
Subjt:  EVIVGEPEEVMEGLVGIDFLVVDSQRRNFNRVLKLANLSSRGAVLICKNANSRSDSSFRWKSVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGGG

Query:  KGKWIKHVDRRSGEEFVIRK
        KGKWIKHVDRRSGEEFVIRK
Subjt:  KGKWIKHVDRRSGEEFVIRK

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
AT1G12320.1 Protein of unknown function (DUF1442) | e-value 2.7e-46 | identity 49.78%
Query:  MKLVWSPETASKAYIDTVQSFCVKCDLHQESGVAELISAMAAGWSAQFIVETWSSGGAIATSIGLAVARRHVGGRHVCVVPDERSRGEYSRALERAGLS-
        MKLVWSPETASKAYIDTV+S    C+  +    AELI+AMAAGW+ + IVETWS G AIA+SIGL VA +H   +H+C+V + RS   Y +A++ +    
Subjt:  MKLVWSPETASKAYIDTVQSFCVKCDLHQESGVAELISAMAAGWSAQFIVETWSSGGAIATSIGLAVARRHVGGRHVCVVPDERSRGEYSRALERAGLS-

Query:  --PEVIVG-EPEEVMEGLVGIDFLVVDSQRRNF-NRVLKLANLSSRGAVLICKNANSRSDSSFRWKSVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNS
          PE IV  EP + M+ L G+DFLVVD + + F    LK A   +RGAV++C+N  S      R         R++VR+  LPV  G++IAHVAA   NS
Subjt:  --PEVIVG-EPEEVMEGLVGIDFLVVDSQRRNF-NRVLKLANLSSRGAVLICKNANSRSDSSFRWKSVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNS

Query:  G-SGGGKGKWIKHVDRRSGEEFV
        G SG  K +WI HVD+RSGEE V
Subjt:  G-SGGGKGKWIKHVDRRSGEEFV

AT1G62840.1 Protein of unknown function (DUF1442) | e-value 1.1e-52 | identity 52.38%
Query:  MKLVWSPETASKAYIDTVQSFCVKCDLHQESGVAELISAMAAGWSAQFIVETWSSGGAIATSIGLAVARRHVGGRHVCVVPDERSRGEYSRALERAGLS-
        MKL+WSPETASKAYIDTV+S    C+     G AEL++AMAAGW+A  IVETWS G  IA S+GL +A RH  GRH+C+VP+ RS+  Y +A+     S 
Subjt:  MKLVWSPETASKAYIDTVQSFCVKCDLHQESGVAELISAMAAGWSAQFIVETWSSGGAIATSIGLAVARRHVGGRHVCVVPDERSRGEYSRALERAGLS-

Query:  -PEVIV-----GEPEEVMEGLVGIDFLVVDSQRRNF-NRVLKLANLSSRGAVLICKNANSRSDSSFRW-KSVTDNGTRRLVRSAFLPVGKGLDIAHVAAA
         PE I+      E E  M+ L GIDFLVVD  +++F   VL+ A   SRGAV++C++   RS S F W K+ +D   R +VR+  LPV  GL+IAHVAAA
Subjt:  -PEVIV-----GEPEEVMEGLVGIDFLVVDSQRRNF-NRVLKLANLSSRGAVLICKNANSRSDSSFRW-KSVTDNGTRRLVRSAFLPVGKGLDIAHVAAA

Query:  --GGNSGSGGGKGKWIKHVDRRSGEEFVIRK
           G S +   K KWIKH D+RSGEE VIRK
Subjt:  --GGNSGSGGGKGKWIKHVDRRSGEEFVIRK

AT2G45360.1 Protein of unknown function (DUF1442) | e-value 1.0e-61 | identity 57.92%
Query:  MKLVWSPETASKAYIDTVQSFCVKCDLHQESGVAELISAMAAGWSAQFIVETWSSGGAIATSIGLAVARRHVGGRHVCVVPDERSRGEYSRALERAGLSP
        MKLVWSPETAS AYIDTV+S    C   +ESGVAE +SA AAGW+A+ IVETWS G  I TS+GLAVA  H GGRHVC+VPDE+S+ EY  A+     + 
Subjt:  MKLVWSPETASKAYIDTVQSFCVKCDLHQESGVAELISAMAAGWSAQFIVETWSSGGAIATSIGLAVARRHVGGRHVCVVPDERSRGEYSRALERAGLSP

Query:  EVIVGEP-EEVMEGLVGIDFLVVDSQRRNFNRVLKLANLSSRGAVLICKNANSRSDSSFRWKSVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGG
         V+VGE  E  ME   G+DFLVVDS+RR F R L+ A LS++GAVL+CKNA  R+ S F+W  V   GT R+VRS FLPVG GLDI HV A  G   S  
Subjt:  EVIVGEP-EEVMEGLVGIDFLVVDSQRRNFNRVLKLANLSSRGAVLICKNANSRSDSSFRWKSVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGG

Query:  GKGKWIKHVDRRSGEEFVIRK
         + +WI+HVD  SGEE + R+
Subjt:  GKGKWIKHVDRRSGEEFVIRK

AT3G60780.1 Protein of unknown function (DUF1442) | e-value 1.5e-57 | identity 52.02%
Query:  MKLVWSPETASKAYIDTVQSFCVKCDLHQESGVAELISAMAAGWSAQFIVETWSSGGAIATSIGLAVARRHVGGRHVCVVPDERSRGEYSRALERAGLSP
        M+LVWSPETAS AYI TV+S    C  +++S VAE +SA AAGW+ + IVETWS G  IATS+GLAVA  H  GRHVC+VPDE SR EY   +  A  S 
Subjt:  MKLVWSPETASKAYIDTVQSFCVKCDLHQESGVAELISAMAAGWSAQFIVETWSSGGAIATSIGLAVARRHVGGRHVCVVPDERSRGEYSRALERAGLSP

Query:  E---VIVGEPEEVMEGLVGIDFLVVDSQRRNFNRVLKLANLSSRGAVLICKNANSRSDSSFRWKSVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGS
            +++   E+V+E + G+DF+VVDS+R  F   L LA  S  GAVL+CKNA  +S   F+W+ +   GT R+VRS FLPVG+GL+I HV A+GG +G 
Subjt:  E---VIVGEPEEVMEGLVGIDFLVVDSQRRNFNRVLKLANLSSRGAVLICKNANSRSDSSFRWKSVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGS

Query:  GGGKGKWIKHVDRRSGEEFVIRK
             +WIKH+D RSGEE + ++
Subjt:  GGGKGKWIKHVDRRSGEEFVIRK

AT5G62280.1 Protein of unknown function (DUF1442) | e-value 4.9e-08 | identity 26.47%
Query:  WSPETASKAYIDTVQSFCVKCDLHQESGVAELISAMAAGWSAQFIVETWSSGGAIATSIGLAVARRHVGGRHVCVVPDERSRGEYSRALERAGLSP----
        WS E A+KAY+ T+++        +E  VAE ISA+AAG SA+ I    +        + L  A     G+ VCV+     RG     + +  L P    
Subjt:  WSPETASKAYIDTVQSFCVKCDLHQESGVAELISAMAAGWSAQFIVETWSSGGAIATSIGLAVARRHVGGRHVCVVPDERSRGEYSRALERAGLSP----

Query:  --EVIVGEPEE---VMEGLVGIDFLVVDSQRRNFNRVL-KLANLSSRGA-------VLICKNANSRSDSSFRWKSVTDNGTRRLVRSAFLPVGKGLDIAH
          + +VGE  +   +       DF++VD    N   ++ K+ N     A       V +    N+ S  S+R+            ++ FLP+G+GL +  
Subjt:  --EVIVGEPEE---VMEGLVGIDFLVVDSQRRNFNRVL-KLANLSSRGA-------VLICKNANSRSDSSFRWKSVTDNGTRRLVRSAFLPVGKGLDIAH

Query:  V------AAAGGNSGSGGGKGKWIKHVDRRSGEEFVIR
        V           +      K +W+  VD+ +GEE V R
Subjt:  V------AAAGGNSGSGGGKGKWIKHVDRRSGEEFVIR


Sequences
CDS sequence
ATGAAGCTCGTTTGGTCTCCAGAGACGGCTTCCAAAGCCTACATCGACACCGTTCAATCTTTTTGTGTGAAGTGTGATCTTCATCAAGAATCCGGCGTTGCCGAACTGAT
TTCAGCGATGGCGGCGGGATGGAGCGCGCAATTTATTGTGGAGACGTGGTCGAGCGGCGGAGCGATTGCGACGAGTATAGGTTTGGCGGTGGCTCGCCGTCACGTCGGAG
GGAGACACGTGTGTGTGGTTCCTGATGAGAGATCGAGGGGAGAATATTCGAGAGCGTTGGAGAGAGCGGGATTATCGCCGGAAGTGATCGTCGGAGAGCCAGAGGAGGTG
ATGGAAGGATTAGTAGGTATAGATTTTCTGGTGGTGGATAGCCAGAGGAGGAATTTCAATCGAGTTTTGAAACTCGCGAATCTGAGTTCTAGAGGCGCGGTTTTGATTTG
CAAGAACGCGAACTCGAGAAGCGATTCGAGTTTCAGATGGAAAAGTGTTACCGATAACGGAACACGGCGGCTGGTTCGATCGGCGTTTCTGCCGGTGGGAAAGGGTTTGG
ATATAGCGCACGTGGCGGCGGCCGGAGGGAATTCGGGTTCCGGCGGAGGGAAGGGAAAATGGATCAAGCATGTTGATCGACGGTCAGGGGAGGAGTTTGTAATTCGGAAG
TGA
mRNA sequence
ATGAAGCTCGTTTGGTCTCCAGAGACGGCTTCCAAAGCCTACATCGACACCGTTCAATCTTTTTGTGTGAAGTGTGATCTTCATCAAGAATCCGGCGTTGCCGAACTGAT
TTCAGCGATGGCGGCGGGATGGAGCGCGCAATTTATTGTGGAGACGTGGTCGAGCGGCGGAGCGATTGCGACGAGTATAGGTTTGGCGGTGGCTCGCCGTCACGTCGGAG
GGAGACACGTGTGTGTGGTTCCTGATGAGAGATCGAGGGGAGAATATTCGAGAGCGTTGGAGAGAGCGGGATTATCGCCGGAAGTGATCGTCGGAGAGCCAGAGGAGGTG
ATGGAAGGATTAGTAGGTATAGATTTTCTGGTGGTGGATAGCCAGAGGAGGAATTTCAATCGAGTTTTGAAACTCGCGAATCTGAGTTCTAGAGGCGCGGTTTTGATTTG
CAAGAACGCGAACTCGAGAAGCGATTCGAGTTTCAGATGGAAAAGTGTTACCGATAACGGAACACGGCGGCTGGTTCGATCGGCGTTTCTGCCGGTGGGAAAGGGTTTGG
ATATAGCGCACGTGGCGGCGGCCGGAGGGAATTCGGGTTCCGGCGGAGGGAAGGGAAAATGGATCAAGCATGTTGATCGACGGTCAGGGGAGGAGTTTGTAATTCGGAAG
TGA
Protein sequence
MKLVWSPETASKAYIDTVQSFCVKCDLHQESGVAELISAMAAGWSAQFIVETWSSGGAIATSIGLAVARRHVGGRHVCVVPDERSRGEYSRALERAGLSPEVIVGEPEEV
MEGLVGIDFLVVDSQRRNFNRVLKLANLSSRGAVLICKNANSRSDSSFRWKSVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGGGKGKWIKHVDRRSGEEFVIRK
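
The protein sequence can be checked against the CDS by translating it codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) and a CDS that starts in frame at position 1. `translate` is a hypothetical helper written for illustration. Only the first nine codons of the CDS are used here; the full sequence can be pasted in with its line breaks, since whitespace is stripped.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)

# First nine codons of the CDS sequence listed in this record.
print(translate("ATGAAGCTCGTTTGGTCTCCAGAGACG"))  # MKLVWSPET
```

The output matches the first nine residues of the protein sequence, and the final codons `ATT CGG AAG TGA` translate to `IRK` followed by a stop.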