CuGenDBv2

Lag0032857 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0032857
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Protein of unknown function (DUF1442)
Genome location: chr11:38250590..38251333
RNA-Seq Expression: Lag0032857
Synteny: Lag0032857
Gene Ontology terms: NA
InterPro domains: IPR009902 - Protein of unknown function DUF1442
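The genome location string can be split into its parts programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this); `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location format: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return chrom, start, end


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr11:38250590..38251333")
print(chrom, start, end, span_length(start, end))
```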


Homology
GenBank top hits | e value | % identity
XP_004147575.1 uncharacterized protein LOC101211926 isoform X2 [Cucumis sativus] | 3.0e-108 | 95.37
Query:  MKLVWSPETASKAYIDTVQSCDLHQESGVAELISAMAAGWNAQFIVETWSSGGAIATSIGLAVARRHAGGRHVCVVPDERSRGEYSRAMERAGSAPEVIV
        MKLVWSPETASKAYIDTVQSCDLHQESGVAELISAMAAGWNAQFIVETWS+GGAIATSIGLAVARRH GGRHVCVVPDERSRGEYSRAMERAG +PEVIV
Subjt:  MKLVWSPETASKAYIDTVQSCDLHQESGVAELISAMAAGWNAQFIVETWSSGGAIATSIGLAVARRHAGGRHVCVVPDERSRGEYSRAMERAGSAPEVIV

Query:  GEAEEVMGGLVGIDFLVVDSQRRNFSRVLKLANLSSRGAVLICKNANSRSDSSFRWKNVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGEGKGKW
        GE EEVM GLVGIDFLVVDSQRRNFSRVLKLANLSSRGAVLICKNANSRSDSSFRW +VT+NGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSG GKGKW
Subjt:  GEAEEVMGGLVGIDFLVVDSQRRNFSRVLKLANLSSRGAVLICKNANSRSDSSFRWKNVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGEGKGKW

Query:  IKHVDRRSGEEFVIRK
        IKHVDRRSGEEFVIRK
Subjt:  IKHVDRRSGEEFVIRK

XP_008437150.1 PREDICTED: uncharacterized protein LOC103482660 isoform X1 [Cucumis melo] | 8.6e-108 | 94.55
Query:  MKLVWSPETASKAYIDTVQS----CDLHQESGVAELISAMAAGWNAQFIVETWSSGGAIATSIGLAVARRHAGGRHVCVVPDERSRGEYSRAMERAGSAP
        MKLVWSPETASKAYIDTVQS    CDLHQESGVAELISAMAAGWNAQFIVETWSSGGAIATSIGLAVARRH GGRHVCVVPDERSRGEYSRAMERAG +P
Subjt:  MKLVWSPETASKAYIDTVQS----CDLHQESGVAELISAMAAGWNAQFIVETWSSGGAIATSIGLAVARRHAGGRHVCVVPDERSRGEYSRAMERAGSAP

Query:  EVIVGEAEEVMGGLVGIDFLVVDSQRRNFSRVLKLANLSSRGAVLICKNANSRSDSSFRWKNVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGEG
        EVIVGE EEVM GLVGIDFLVVDSQRRNFSRVLKLANLSSRGAVLICKNANSRSDSSFRWK+VT+NGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSG G
Subjt:  EVIVGEAEEVMGGLVGIDFLVVDSQRRNFSRVLKLANLSSRGAVLICKNANSRSDSSFRWKNVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGEG

Query:  KGKWIKHVDRRSGEEFVIRK
        KGKWIKHVDRRSGEEFVIRK
Subjt:  KGKWIKHVDRRSGEEFVIRK

XP_008437151.1 PREDICTED: uncharacterized protein LOC103482660 isoform X2 [Cucumis melo] | 1.6e-109 | 96.3
Query:  MKLVWSPETASKAYIDTVQSCDLHQESGVAELISAMAAGWNAQFIVETWSSGGAIATSIGLAVARRHAGGRHVCVVPDERSRGEYSRAMERAGSAPEVIV
        MKLVWSPETASKAYIDTVQSCDLHQESGVAELISAMAAGWNAQFIVETWSSGGAIATSIGLAVARRH GGRHVCVVPDERSRGEYSRAMERAG +PEVIV
Subjt:  MKLVWSPETASKAYIDTVQSCDLHQESGVAELISAMAAGWNAQFIVETWSSGGAIATSIGLAVARRHAGGRHVCVVPDERSRGEYSRAMERAGSAPEVIV

Query:  GEAEEVMGGLVGIDFLVVDSQRRNFSRVLKLANLSSRGAVLICKNANSRSDSSFRWKNVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGEGKGKW
        GE EEVM GLVGIDFLVVDSQRRNFSRVLKLANLSSRGAVLICKNANSRSDSSFRWK+VT+NGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSG GKGKW
Subjt:  GEAEEVMGGLVGIDFLVVDSQRRNFSRVLKLANLSSRGAVLICKNANSRSDSSFRWKNVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGEGKGKW

Query:  IKHVDRRSGEEFVIRK
        IKHVDRRSGEEFVIRK
Subjt:  IKHVDRRSGEEFVIRK

XP_022970194.1 uncharacterized protein LOC111469161 isoform X2 [Cucurbita maxima] | 3.9e-108 | 94.91
Query:  MKLVWSPETASKAYIDTVQSCDLHQESGVAELISAMAAGWNAQFIVETWSSGGAIATSIGLAVARRHAGGRHVCVVPDERSRGEYSRAMERAGSAPEVIV
        MKLVWSPETASKAYIDTVQSCDL QESGVAELISAMAAGW+AQFIVETWSSGGAIATSIGLAVARRH GGRHVCVVPDERSRGEY RAMERAGS PEVIV
Subjt:  MKLVWSPETASKAYIDTVQSCDLHQESGVAELISAMAAGWNAQFIVETWSSGGAIATSIGLAVARRHAGGRHVCVVPDERSRGEYSRAMERAGSAPEVIV

Query:  GEAEEVMGGLVGIDFLVVDSQRRNFSRVLKLANLSSRGAVLICKNANSRSDSSFRWKNVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGEGKGKW
        GE EEVM GLVGIDF+VVDSQRRNFSRVLKLANLSSRGAVLICKNANSRSDSSFRWKNVTDNGTRRLVRS FLPVGKGLDIAHVAA G NSGSGEGKGKW
Subjt:  GEAEEVMGGLVGIDFLVVDSQRRNFSRVLKLANLSSRGAVLICKNANSRSDSSFRWKNVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGEGKGKW

Query:  IKHVDRRSGEEFVIRK
        IKHVDRRSGEEFVIRK
Subjt:  IKHVDRRSGEEFVIRK

XP_038875175.1 uncharacterized protein LOC120067703 isoform X2 [Benincasa hispida] | 8.6e-108 | 94.44
Query:  MKLVWSPETASKAYIDTVQSCDLHQESGVAELISAMAAGWNAQFIVETWSSGGAIATSIGLAVARRHAGGRHVCVVPDERSRGEYSRAMERAGSAPEVIV
        MKLVWSPETASKAYI+TVQSCDLHQESGVAELISAMAAGW+AQFI+ETWSSGGAIATSIGLAVARRH GGRHVCVVPDERS GEYSRAMERAG  PEVIV
Subjt:  MKLVWSPETASKAYIDTVQSCDLHQESGVAELISAMAAGWNAQFIVETWSSGGAIATSIGLAVARRHAGGRHVCVVPDERSRGEYSRAMERAGSAPEVIV

Query:  GEAEEVMGGLVGIDFLVVDSQRRNFSRVLKLANLSSRGAVLICKNANSRSDSSFRWKNVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGEGKGKW
        GE EEVM GL+GIDFLVVDSQRRNFSRVLKLANLSSRGAVLICKNANSRSDSSFRWK+VTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSG GKGKW
Subjt:  GEAEEVMGGLVGIDFLVVDSQRRNFSRVLKLANLSSRGAVLICKNANSRSDSSFRWKNVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGEGKGKW

Query:  IKHVDRRSGEEFVIRK
        IKHVDRRSGEEFVIRK
Subjt:  IKHVDRRSGEEFVIRK

TrEMBL top hits | e value | % identity
A0A0A0KKR4 Uncharacterized protein | 1.4e-108 | 95.37
Query:  MKLVWSPETASKAYIDTVQSCDLHQESGVAELISAMAAGWNAQFIVETWSSGGAIATSIGLAVARRHAGGRHVCVVPDERSRGEYSRAMERAGSAPEVIV
        MKLVWSPETASKAYIDTVQSCDLHQESGVAELISAMAAGWNAQFIVETWS+GGAIATSIGLAVARRH GGRHVCVVPDERSRGEYSRAMERAG +PEVIV
Subjt:  MKLVWSPETASKAYIDTVQSCDLHQESGVAELISAMAAGWNAQFIVETWSSGGAIATSIGLAVARRHAGGRHVCVVPDERSRGEYSRAMERAGSAPEVIV

Query:  GEAEEVMGGLVGIDFLVVDSQRRNFSRVLKLANLSSRGAVLICKNANSRSDSSFRWKNVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGEGKGKW
        GE EEVM GLVGIDFLVVDSQRRNFSRVLKLANLSSRGAVLICKNANSRSDSSFRW +VT+NGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSG GKGKW
Subjt:  GEAEEVMGGLVGIDFLVVDSQRRNFSRVLKLANLSSRGAVLICKNANSRSDSSFRWKNVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGEGKGKW

Query:  IKHVDRRSGEEFVIRK
        IKHVDRRSGEEFVIRK
Subjt:  IKHVDRRSGEEFVIRK

A0A1S3ATG0 uncharacterized protein LOC103482660 isoform X2 | 7.6e-110 | 96.3
Query:  MKLVWSPETASKAYIDTVQSCDLHQESGVAELISAMAAGWNAQFIVETWSSGGAIATSIGLAVARRHAGGRHVCVVPDERSRGEYSRAMERAGSAPEVIV
        MKLVWSPETASKAYIDTVQSCDLHQESGVAELISAMAAGWNAQFIVETWSSGGAIATSIGLAVARRH GGRHVCVVPDERSRGEYSRAMERAG +PEVIV
Subjt:  MKLVWSPETASKAYIDTVQSCDLHQESGVAELISAMAAGWNAQFIVETWSSGGAIATSIGLAVARRHAGGRHVCVVPDERSRGEYSRAMERAGSAPEVIV

Query:  GEAEEVMGGLVGIDFLVVDSQRRNFSRVLKLANLSSRGAVLICKNANSRSDSSFRWKNVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGEGKGKW
        GE EEVM GLVGIDFLVVDSQRRNFSRVLKLANLSSRGAVLICKNANSRSDSSFRWK+VT+NGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSG GKGKW
Subjt:  GEAEEVMGGLVGIDFLVVDSQRRNFSRVLKLANLSSRGAVLICKNANSRSDSSFRWKNVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGEGKGKW

Query:  IKHVDRRSGEEFVIRK
        IKHVDRRSGEEFVIRK
Subjt:  IKHVDRRSGEEFVIRK

A0A6J1DZ50 uncharacterized protein LOC111025939 isoform X2 | 4.2e-108 | 94.44
Query:  MKLVWSPETASKAYIDTVQSCDLHQESGVAELISAMAAGWNAQFIVETWSSGGAIATSIGLAVARRHAGGRHVCVVPDERSRGEYSRAMERAGSAPEVIV
        MKLVWSPETASKAYIDTVQSCDLH+ESGVAELISAMAAGWNAQFIVETWSSGGAIATS+GLAVA RH GGRHVCVVPDERSRGEYS A+ERAGSAPEVIV
Subjt:  MKLVWSPETASKAYIDTVQSCDLHQESGVAELISAMAAGWNAQFIVETWSSGGAIATSIGLAVARRHAGGRHVCVVPDERSRGEYSRAMERAGSAPEVIV

Query:  GEAEEVMGGLVGIDFLVVDSQRRNFSRVLKLANLSSRGAVLICKNANSRSDSSFRWKNVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGEGKGKW
        GEAEEVM GLVGIDFLVVDSQRRNFS VLKLANLSSRGAVLICKNANSRSDSSFRWKNVT NGTRRLVRSAFLPVGKGLDIAHVAA+GGNSGSGEGKG+W
Subjt:  GEAEEVMGGLVGIDFLVVDSQRRNFSRVLKLANLSSRGAVLICKNANSRSDSSFRWKNVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGEGKGKW

Query:  IKHVDRRSGEEFVIRK
        IKH DRRSGEEFVIRK
Subjt:  IKHVDRRSGEEFVIRK

A0A6J1E684 uncharacterized protein LOC111430333 isoform X2 | 4.2e-108 | 94.44
Query:  MKLVWSPETASKAYIDTVQSCDLHQESGVAELISAMAAGWNAQFIVETWSSGGAIATSIGLAVARRHAGGRHVCVVPDERSRGEYSRAMERAGSAPEVIV
        MKLVWSPETASKAYIDTVQSCDL QESGVAELISAMAAGW+AQFIVETWSSGGAIATSIGLAVARRH GGRHVCVVPDERSRGEY RA+ERAGS PEVIV
Subjt:  MKLVWSPETASKAYIDTVQSCDLHQESGVAELISAMAAGWNAQFIVETWSSGGAIATSIGLAVARRHAGGRHVCVVPDERSRGEYSRAMERAGSAPEVIV

Query:  GEAEEVMGGLVGIDFLVVDSQRRNFSRVLKLANLSSRGAVLICKNANSRSDSSFRWKNVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGEGKGKW
        GE EEVM GLVGIDF+VVDSQRRNFSRVLKLANLSSRGAVLICKNANSRSDSSFRWKNVTDNGTRRLVRS FLPVGKGLDIAHVAA G NSGSGEGKGKW
Subjt:  GEAEEVMGGLVGIDFLVVDSQRRNFSRVLKLANLSSRGAVLICKNANSRSDSSFRWKNVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGEGKGKW

Query:  IKHVDRRSGEEFVIRK
        IKHVDRRSGEEFVIRK
Subjt:  IKHVDRRSGEEFVIRK

A0A6J1I4U3 uncharacterized protein LOC111469161 isoform X2 | 1.9e-108 | 94.91
Query:  MKLVWSPETASKAYIDTVQSCDLHQESGVAELISAMAAGWNAQFIVETWSSGGAIATSIGLAVARRHAGGRHVCVVPDERSRGEYSRAMERAGSAPEVIV
        MKLVWSPETASKAYIDTVQSCDL QESGVAELISAMAAGW+AQFIVETWSSGGAIATSIGLAVARRH GGRHVCVVPDERSRGEY RAMERAGS PEVIV
Subjt:  MKLVWSPETASKAYIDTVQSCDLHQESGVAELISAMAAGWNAQFIVETWSSGGAIATSIGLAVARRHAGGRHVCVVPDERSRGEYSRAMERAGSAPEVIV

Query:  GEAEEVMGGLVGIDFLVVDSQRRNFSRVLKLANLSSRGAVLICKNANSRSDSSFRWKNVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGEGKGKW
        GE EEVM GLVGIDF+VVDSQRRNFSRVLKLANLSSRGAVLICKNANSRSDSSFRWKNVTDNGTRRLVRS FLPVGKGLDIAHVAA G NSGSGEGKGKW
Subjt:  GEAEEVMGGLVGIDFLVVDSQRRNFSRVLKLANLSSRGAVLICKNANSRSDSSFRWKNVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGEGKGKW

Query:  IKHVDRRSGEEFVIRK
        IKHVDRRSGEEFVIRK
Subjt:  IKHVDRRSGEEFVIRK

SwissProt top hits | e value | % identity
No hits found
Arabidopsis top hits | e value | % identity
AT1G12320.1 Protein of unknown function (DUF1442) | 2.2e-48 | 51.6
Query:  MKLVWSPETASKAYIDTVQSCDLHQESGVAELISAMAAGWNAQFIVETWSSGGAIATSIGLAVARRHAGGRHVCVVPDERSRGEYSRAMERAGSA---PE
        MKLVWSPETASKAYIDTV+SC+  +    AELI+AMAAGWN + IVETWS G AIA+SIGL VA +HA  +H+C+V + RS   Y +A++ + S    PE
Subjt:  MKLVWSPETASKAYIDTVQSCDLHQESGVAELISAMAAGWNAQFIVETWSSGGAIATSIGLAVARRHAGGRHVCVVPDERSRGEYSRAMERAGSA---PE

Query:  VIVGEAE-EVMGGLVGIDFLVVDSQRRNF-SRVLKLANLSSRGAVLICKNANSRSDSSFRWKNVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSG-SG
         IV E   + M  L G+DFLVVD + + F +  LK A   +RGAV++C+N  S      R         R++VR+  LPV  G++IAHVAA   NSG SG
Subjt:  VIVGEAE-EVMGGLVGIDFLVVDSQRRNF-SRVLKLANLSSRGAVLICKNANSRSDSSFRWKNVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSG-SG

Query:  EGKGKWIKHVDRRSGEEFV
          K +WI HVD+RSGEE V
Subjt:  EGKGKWIKHVDRRSGEEFV

AT1G62840.1 Protein of unknown function (DUF1442) | 6.9e-55 | 53.1
Query:  MKLVWSPETASKAYIDTVQSCDLHQESGVAELISAMAAGWNAQFIVETWSSGGAIATSIGLAVARRHAGGRHVCVVPDERSRGEYSRAM--ERAGSAPEV
        MKL+WSPETASKAYIDTV+SC+     G AEL++AMAAGWNA  IVETWS G  IA S+GL +A RH  GRH+C+VP+ RS+  Y +AM  +   + PE 
Subjt:  MKLVWSPETASKAYIDTVQSCDLHQESGVAELISAMAAGWNAQFIVETWSSGGAIATSIGLAVARRHAGGRHVCVVPDERSRGEYSRAM--ERAGSAPEV

Query:  IV-----GEAEEVMGGLVGIDFLVVDSQRRNF-SRVLKLANLSSRGAVLICKNANSRSDSSFRWKNVTDNGTRRLVRSAFLPVGKGLDIAHVAAA--GGN
        I+      E E  M  L GIDFLVVD  +++F + VL+ A   SRGAV++C++   RS S F W     +  R +VR+  LPV  GL+IAHVAAA   G 
Subjt:  IV-----GEAEEVMGGLVGIDFLVVDSQRRNF-SRVLKLANLSSRGAVLICKNANSRSDSSFRWKNVTDNGTRRLVRSAFLPVGKGLDIAHVAAA--GGN

Query:  SGSGEGKGKWIKHVDRRSGEEFVIRK
        S +   K KWIKH D+RSGEE VIRK
Subjt:  SGSGEGKGKWIKHVDRRSGEEFVIRK

AT2G45360.1 Protein of unknown function (DUF1442) | 8.1e-64 | 59.45
Query:  MKLVWSPETASKAYIDTVQSCDLHQESGVAELISAMAAGWNAQFIVETWSSGGAIATSIGLAVARRHAGGRHVCVVPDERSRGEYSRAMERAGSAPEVIV
        MKLVWSPETAS AYIDTV+SC   +ESGVAE +SA AAGWNA+ IVETWS G  I TS+GLAVA  H GGRHVC+VPDE+S+ EY  AM    +   V+V
Subjt:  MKLVWSPETASKAYIDTVQSCDLHQESGVAELISAMAAGWNAQFIVETWSSGGAIATSIGLAVARRHAGGRHVCVVPDERSRGEYSRAMERAGSAPEVIV

Query:  GEA-EEVMGGLVGIDFLVVDSQRRNFSRVLKLANLSSRGAVLICKNANSRSDSSFRWKNVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGEGKGK
        GE+ E  M    G+DFLVVDS+RR F R L+ A LS++GAVL+CKNA  R+ S F+W +V   GT R+VRS FLPVG GLDI HV A  G   S   + +
Subjt:  GEA-EEVMGGLVGIDFLVVDSQRRNFSRVLKLANLSSRGAVLICKNANSRSDSSFRWKNVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGEGKGK

Query:  WIKHVDRRSGEEFVIRK
        WI+HVD  SGEE + R+
Subjt:  WIKHVDRRSGEEFVIRK

AT3G60780.1 Protein of unknown function (DUF1442) | 4.2e-60 | 53.88
Query:  MKLVWSPETASKAYIDTVQSCDLHQESGVAELISAMAAGWNAQFIVETWSSGGAIATSIGLAVARRHAGGRHVCVVPDERSRGEYSRAMERA---GSAPE
        M+LVWSPETAS AYI TV+SC  +++S VAE +SA AAGWN + IVETWS G  IATS+GLAVA  H  GRHVC+VPDE SR EY   M  A    S   
Subjt:  MKLVWSPETASKAYIDTVQSCDLHQESGVAELISAMAAGWNAQFIVETWSSGGAIATSIGLAVARRHAGGRHVCVVPDERSRGEYSRAMERA---GSAPE

Query:  VIVGEAEEVMGGLVGIDFLVVDSQRRNFSRVLKLANLSSRGAVLICKNANSRSDSSFRWKNVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGEGK
        +++  AE+V+  + G+DF+VVDS+R  F   L LA  S  GAVL+CKNA  +S   F+W+ +   GT R+VRS FLPVG+GL+I HV A+GG +G  +  
Subjt:  VIVGEAEEVMGGLVGIDFLVVDSQRRNFSRVLKLANLSSRGAVLICKNANSRSDSSFRWKNVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGEGK

Query:  GKWIKHVDRRSGEEFVIRK
         +WIKH+D RSGEE + ++
Subjt:  GKWIKHVDRRSGEEFVIRK

AT5G62280.1 Protein of unknown function (DUF1442) | 1.3e-08 | 26.07
Query:  WSPETASKAYIDTVQSCDLHQESGVAELISAMAAGWNAQFIVETWSSGGAIATSIGLAVARRHAGGRHVCVVPDERSRGEYSRAMERAGSAP------EV
        WS E A+KAY+ T+++    +E  VAE ISA+AAG +A+ I    +        + L  A     G+ VCV+     RG     + +    P      + 
Subjt:  WSPETASKAYIDTVQSCDLHQESGVAELISAMAAGWNAQFIVETWSSGGAIATSIGLAVARRHAGGRHVCVVPDERSRGEYSRAMERAGSAP------EV

Query:  IVGEAEE---VMGGLVGIDFLVVDSQRRNFSRVL-KLANLSSRGA-------VLICKNANSRSDSSFRWKNVTDNGTRRLVRSAFLPVGKGLDIAHV---
        +VGE+ +   +       DF++VD    N   ++ K+ N     A       V +    N+ S  S+R+ +          ++ FLP+G+GL +  V   
Subjt:  IVGEAEE---VMGGLVGIDFLVVDSQRRNFSRVL-KLANLSSRGA-------VLICKNANSRSDSSFRWKNVTDNGTRRLVRSAFLPVGKGLDIAHV---

Query:  ---AAAGGNSGSGEGKGKWIKHVDRRSGEEFVIR
                +      K +W+  VD+ +GEE V R
Subjt:  ---AAAGGNSGSGEGKGKWIKHVDRRSGEEFVIR
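
In the BLAST-style alignments above, the unlabeled middle line marks each column: a letter for an identical residue, "+" for a conservative substitution, and a blank for a mismatch or gap. A percent identity can be recomputed from that line as identical columns divided by alignment length. The sketch below assumes this convention and that the alignment length is the length of the Query segment (gap columns included); `percent_identity` is a hypothetical helper name, and the middle line is padded because trailing blanks are easily lost when text is copied.

```python
def percent_identity(query_segment: str, midline: str) -> float:
    """Percent of alignment columns whose midline character is a letter.

    query_segment: the aligned Query text (may contain '-' gap characters).
    midline: the match line printed between Query and Subjt.
    """
    length = len(query_segment)
    if length == 0:
        raise ValueError("empty alignment segment")
    # Restore trailing blanks that may have been stripped from the midline.
    padded = midline.ljust(length)[:length]
    identical = sum(1 for ch in padded if ch.isalpha())
    return 100.0 * identical / length


# A short fragment taken from the first GenBank hit (one '+' column).
print(round(percent_identity("FIVETWSSGGAIATS", "FIVETWS+GGAIATS"), 2))
```

For a multi-block alignment, concatenate the Query segments and the matching middle lines of all blocks before calling the function, so that the ratio is taken over the full alignment.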


Sequences
CDS sequence
ATGAAGCTCGTTTGGTCTCCCGAGACGGCTTCCAAGGCCTACATCGACACCGTTCAATCTTGTGATCTCCATCAAGAATCCGGCGTCGCGGAACTGATTTCGGCGATGGC
GGCGGGATGGAACGCGCAGTTCATCGTGGAGACGTGGTCGAGCGGCGGAGCGATTGCGACGAGTATCGGACTGGCGGTGGCTCGCCGTCACGCCGGAGGGAGACATGTGT
GTGTGGTTCCTGACGAGAGATCGAGAGGAGAATATTCGAGAGCGATGGAGAGGGCGGGATCGGCGCCGGAGGTGATCGTCGGAGAGGCGGAGGAGGTGATGGGAGGACTG
GTAGGGATAGATTTTCTGGTGGTGGATAGTCAGAGGAGGAATTTCAGTCGAGTTCTGAAACTGGCGAATCTGAGTTCCAGAGGAGCGGTTTTGATCTGCAAGAACGCGAA
CTCGAGAAGCGATTCGAGTTTCAGATGGAAAAATGTTACCGATAACGGAACGCGGCGGCTGGTCCGATCGGCGTTTTTGCCGGTGGGGAAGGGATTGGATATTGCCCACG
TGGCGGCGGCCGGAGGGAATTCAGGTTCCGGCGAAGGGAAGGGGAAATGGATCAAGCATGTTGATCGGCGATCAGGGGAGGAGTTTGTCATTCGGAAGTGA
mRNA sequence
ATGAAGCTCGTTTGGTCTCCCGAGACGGCTTCCAAGGCCTACATCGACACCGTTCAATCTTGTGATCTCCATCAAGAATCCGGCGTCGCGGAACTGATTTCGGCGATGGC
GGCGGGATGGAACGCGCAGTTCATCGTGGAGACGTGGTCGAGCGGCGGAGCGATTGCGACGAGTATCGGACTGGCGGTGGCTCGCCGTCACGCCGGAGGGAGACATGTGT
GTGTGGTTCCTGACGAGAGATCGAGAGGAGAATATTCGAGAGCGATGGAGAGGGCGGGATCGGCGCCGGAGGTGATCGTCGGAGAGGCGGAGGAGGTGATGGGAGGACTG
GTAGGGATAGATTTTCTGGTGGTGGATAGTCAGAGGAGGAATTTCAGTCGAGTTCTGAAACTGGCGAATCTGAGTTCCAGAGGAGCGGTTTTGATCTGCAAGAACGCGAA
CTCGAGAAGCGATTCGAGTTTCAGATGGAAAAATGTTACCGATAACGGAACGCGGCGGCTGGTCCGATCGGCGTTTTTGCCGGTGGGGAAGGGATTGGATATTGCCCACG
TGGCGGCGGCCGGAGGGAATTCAGGTTCCGGCGAAGGGAAGGGGAAATGGATCAAGCATGTTGATCGGCGATCAGGGGAGGAGTTTGTCATTCGGAAGTGA
Protein sequence
MKLVWSPETASKAYIDTVQSCDLHQESGVAELISAMAAGWNAQFIVETWSSGGAIATSIGLAVARRHAGGRHVCVVPDERSRGEYSRAMERAGSAPEVIVGEAEEVMGGL
VGIDFLVVDSQRRNFSRVLKLANLSSRGAVLICKNANSRSDSSFRWKNVTDNGTRRLVRSAFLPVGKGLDIAHVAAAGGNSGSGEGKGKWIKHVDRRSGEEFVIRK
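
The protein sequence can be checked against the CDS by translating the CDS codon by codon. The sketch below assumes the standard nuclear genetic code; `translate` is a hypothetical helper written for illustration, and only the first 30 and last 12 nucleotides of the CDS listed above are used to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nt and last 12 nt of the CDS shown above.
print(translate("ATGAAGCTCGTTTGGTCTCCCGAGACGGCT"))
print(translate("ATTCGGAAGTGA"))
```

Passing the full CDS (with its line breaks, which the function strips) should reproduce the full protein sequence if the record is internally consistent.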