; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0015169 (gene) of Snake gourd v1 genome

Gene IDTan0015169
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionHistone-lysine N-methyltransferase EZA1 isoform X3
Genome locationLG04:81170503..81172180
RNA-Seq ExpressionTan0015169
SyntenyTan0015169
Gene Ontology termsGO:0032259 - methylation (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0031519 - PcG protein complex (cellular component)
GO:0008168 - methyltransferase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6581302.1 hypothetical protein SDJN03_21304, partial [Cucurbita argyrosperma subsp. sororia]1.6e-10377.39Show/hide
Query:  MEVLKPSFLFHLFAINLLGLLLPLSFLLLARLSSALYLAAPSPVPSLLLSLILYVNSPLLYLLVAFVIVSTLLHSLTGKSALLTKLSGPVSQPRLYIAWI
        ME+L+  FLFHL A+NLL LLLPLS LLLARLSSALYLA P  +P L LSLILY+ SP+L LLV+FV++S LLHSLTGKSAL TKL  PVSQPRLY  WI
Subjt:  MEVLKPSFLFHLFAINLLGLLLPLSFLLLARLSSALYLAAPSPVPSLLLSLILYVNSPLLYLLVAFVIVSTLLHSLTGKSALLTKLSGPVSQPRLYIAWI

Query:  FLCTLQVCVGVGIEGSLSTGLNDVAAGRVEGRLWGRLLFFLGLHEAVVHWTTTVVKPVVDDTVIGESRKERWFETAATAVSFGGLWWWRLRDEAEALAVV
        FLCTLQVCVGVGIEGSLS+GLN+ AAG +EG LW RLLFF+GLHEAVVHWT  VVKPVVDDTV GESRKERWFETAATAVS GG+WWWRLRDEA+AL VV
Subjt:  FLCTLQVCVGVGIEGSLSTGLNDVAAGRVEGRLWGRLLFFLGLHEAVVHWTTTVVKPVVDDTVIGESRKERWFETAATAVSFGGLWWWRLRDEAEALAVV

Query:  AERKWLTAAELGPADFSGWCLYYITVAIGIAKIVKSVAWFGRIFVSKKQSKSSDEVVVVQD
        AE KWLT+AELGPA+ + WCLYYI V IGI KIV SVAW  RI VSKK SK SDEVVVV +
Subjt:  AERKWLTAAELGPADFSGWCLYYITVAIGIAKIVKSVAWFGRIFVSKKQSKSSDEVVVVQD

XP_008454283.1 PREDICTED: uncharacterized protein LOC103494729 [Cucumis melo]1.4e-10479.7Show/hide
Query:  MEVLKPSFLFHLFAINLLGLLLPLSFLLLARLSSALYLAAPSP-VPSLLLSLILYVNSPLLYLLVAFVIVSTLLHSLTGKSALLTKLSGPVSQPRLYIAW
        ME+L  SFLFHLFAINLLGLLLPLS LLLARLSSALYL A  P  PS LLSLILYVNSPLL+LLV+FVI+STLLHSLTGKS L TKL GPVSQPRLY AW
Subjt:  MEVLKPSFLFHLFAINLLGLLLPLSFLLLARLSSALYLAAPSP-VPSLLLSLILYVNSPLLYLLVAFVIVSTLLHSLTGKSALLTKLSGPVSQPRLYIAW

Query:  IFLCTLQVCVGVGIEGSLSTGLNDV-AAGRVEGRLWGRLLFFLGLHEAVVHWTTTVVKPVVDDTVIGESRKERWFETAATAVSFGGLWWWRLRDEAEALA
        IFLCTLQVCVGVGIEGSLS+GLND+ + G VEG +W RLLFF GLHEAVVHWT  VVKPVVDDT+ GESRKE+WFETAATAVS GGLWWWRLRDEAE L 
Subjt:  IFLCTLQVCVGVGIEGSLSTGLNDV-AAGRVEGRLWGRLLFFLGLHEAVVHWTTTVVKPVVDDTVIGESRKERWFETAATAVSFGGLWWWRLRDEAEALA

Query:  VVAERKWLTAAELGPADFSGWCLYYITVAIGIAKIVK-SVAWFGRIFVSKKQSKSSDEVVVVQDNV
        VVAE KWLT+ ELG AD SGWCLYYITV IGIAKIVK  + WFG IFVS+K SK+S+ +V V+DNV
Subjt:  VVAERKWLTAAELGPADFSGWCLYYITVAIGIAKIVK-SVAWFGRIFVSKKQSKSSDEVVVVQDNV

XP_022934389.1 uncharacterized protein LOC111441578 [Cucurbita moschata]9.2e-10477.78Show/hide
Query:  MEVLKPSFLFHLFAINLLGLLLPLSFLLLARLSSALYLAAPSPVPSLLLSLILYVNSPLLYLLVAFVIVSTLLHSLTGKSALLTKLSGPVSQPRLYIAWI
        ME+L+  FLFHL A+NLL LLLPLS LLLARLSSALYLA P  +P L LSLILY+ SPLL LLV+FV++S LLHSLTGKSAL TKL  PVSQPRLY  WI
Subjt:  MEVLKPSFLFHLFAINLLGLLLPLSFLLLARLSSALYLAAPSPVPSLLLSLILYVNSPLLYLLVAFVIVSTLLHSLTGKSALLTKLSGPVSQPRLYIAWI

Query:  FLCTLQVCVGVGIEGSLSTGLNDVAAGRVEGRLWGRLLFFLGLHEAVVHWTTTVVKPVVDDTVIGESRKERWFETAATAVSFGGLWWWRLRDEAEALAVV
        FLCTLQVCVGVGIEGSLS+GLN+ AAG +EG LW RLLFF+GLHEAVVHWT  VVKPVVDDTV GESRKERWFETAATAVS GG+WWWRLRDEA+AL VV
Subjt:  FLCTLQVCVGVGIEGSLSTGLNDVAAGRVEGRLWGRLLFFLGLHEAVVHWTTTVVKPVVDDTVIGESRKERWFETAATAVSFGGLWWWRLRDEAEALAVV

Query:  AERKWLTAAELGPADFSGWCLYYITVAIGIAKIVKSVAWFGRIFVSKKQSKSSDEVVVVQD
        AE KWLT+AELGPA+ + WCLYYI V IGI KIV SVAW  RI VSKK SK SDEVVVV +
Subjt:  AERKWLTAAELGPADFSGWCLYYITVAIGIAKIVKSVAWFGRIFVSKKQSKSSDEVVVVQD

XP_022983396.1 uncharacterized protein LOC111482002 [Cucurbita maxima]2.4e-10478.93Show/hide
Query:  MEVLKPSFLFHLFAINLLGLLLPLSFLLLARLSSALYLAAPSPVPSLLLSLILYVNSPLLYLLVAFVIVSTLLHSLTGKSALLTKLSGPVSQPRLYIAWI
        ME+L+P FLFHL A+NLL LLLPLS LLLARLSSALYLA    +P LLLSLILYV SPLL LLV+FV+VS LLHSLTGKSAL TKL  P+SQPRLY  WI
Subjt:  MEVLKPSFLFHLFAINLLGLLLPLSFLLLARLSSALYLAAPSPVPSLLLSLILYVNSPLLYLLVAFVIVSTLLHSLTGKSALLTKLSGPVSQPRLYIAWI

Query:  FLCTLQVCVGVGIEGSLSTGLNDVAAGRVEGRLWGRLLFFLGLHEAVVHWTTTVVKPVVDDTVIGESRKERWFETAATAVSFGGLWWWRLRDEAEALAVV
        FLCTLQVCVGVGIEGSLS+GLN  AAG VEG LW RLLFF GLHEAVVHWT  VVKPVVDDTV GESRKERWFETAATAVS GG+WWWRLRDEA+AL VV
Subjt:  FLCTLQVCVGVGIEGSLSTGLNDVAAGRVEGRLWGRLLFFLGLHEAVVHWTTTVVKPVVDDTVIGESRKERWFETAATAVSFGGLWWWRLRDEAEALAVV

Query:  AERKWLTAAELGPADFSGWCLYYITVAIGIAKIVKSVAWFGRIFVSKKQSKSSDEVVVVQD
        AE KWLT+ ELGPA+ + WCLYYI VAIGIAKIV SVAW  RI V KK SK SDEVVVV +
Subjt:  AERKWLTAAELGPADFSGWCLYYITVAIGIAKIVKSVAWFGRIFVSKKQSKSSDEVVVVQD

XP_038904328.1 uncharacterized protein LOC120090682 [Benincasa hispida]6.6e-11083.78Show/hide
Query:  MEVLKPSFLFHLFAINLLGLLLPLSFLLLARLSSALYLAAPSPVPS-LLLSLILYVNSPLLYLLVAFVIVSTLLHSLTGKSALLTKLSGPVSQPRLYIAW
        ME+L PSFLFHLFAINLLGLLLPLSFLLLARLSS LYL    P+ S LLLSLILYVNSPLL+LLV+FVIVSTL HSLTGKSAL TKL GPVSQPRLY AW
Subjt:  MEVLKPSFLFHLFAINLLGLLLPLSFLLLARLSSALYLAAPSPVPS-LLLSLILYVNSPLLYLLVAFVIVSTLLHSLTGKSALLTKLSGPVSQPRLYIAW

Query:  IFLCTLQVCVGVGIEGSLSTGLNDVAAGRVEGRLWGRLLFFLGLHEAVVHWTTTVVKPVVDDTVIGESRKERWFETAATAVSFGGLWWWRLRDEAEALAV
        IFLCTLQVCVGVGIEGSLS+GLN  AAG +EG LW RLLFF GLHEAVVHWT  VVKPVVDDTV GESRKE+WFETAATAVS GGLWWWRLRDEAEAL V
Subjt:  IFLCTLQVCVGVGIEGSLSTGLNDVAAGRVEGRLWGRLLFFLGLHEAVVHWTTTVVKPVVDDTVIGESRKERWFETAATAVSFGGLWWWRLRDEAEALAV

Query:  VAERKWLTAAELGPADFSGWCLYYITVAIGIAKIVKSVAWFGRIFVSKKQSKSSDEVVV
        VAE KWLT+AELGPAD SGWCLYYITVAIGIAKIV+ VAWFG IFVS K SK   EV V
Subjt:  VAERKWLTAAELGPADFSGWCLYYITVAIGIAKIVKSVAWFGRIFVSKKQSKSSDEVVV

TrEMBL top hitse value%identityAlignment
A0A0A0KWP5 Uncharacterized protein1.3e-10379.7Show/hide
Query:  MEVLKPSFLFHLFAINLLGLLLPLSFLLLARLSSALYLAAPSP-VPSLLLSLILYVNSPLLYLLVAFVIVSTLLHSLTGKSALLTKLSGPVSQPRLYIAW
        ME+L  SFLFHLFAINLLGLLLPLS LLLARLSSALYL A  P  PS LLSLILYVNSPLL+LLV+FVI+STLLHSLTGKS L TKL GPVSQPRLY AW
Subjt:  MEVLKPSFLFHLFAINLLGLLLPLSFLLLARLSSALYLAAPSP-VPSLLLSLILYVNSPLLYLLVAFVIVSTLLHSLTGKSALLTKLSGPVSQPRLYIAW

Query:  IFLCTLQVCVGVGIEGSLSTGLNDV-AAGRVEGRLWGRLLFFLGLHEAVVHWTTTVVKPVVDDTVIGESRKERWFETAATAVSFGGLWWWRLRDEAEALA
        IFLCTLQVCVGVGIEGSLS+GLND+ + G VEG LW RLLFFLGLHEAVVHWT  VVKPVVDDT+ GE R E+WFETAATAVS GGLWWWRLRDEAE L 
Subjt:  IFLCTLQVCVGVGIEGSLSTGLNDV-AAGRVEGRLWGRLLFFLGLHEAVVHWTTTVVKPVVDDTVIGESRKERWFETAATAVSFGGLWWWRLRDEAEALA

Query:  VVAERKWLTAAELGPADFSGWCLYYITVAIGIAKIVK-SVAWFGRIFVSKKQSKSSDEVVVVQDNV
        VVAE KWLT+AELG AD SGWCLYYITV IGIAKIVK  + WFG  FVSK  SK+S  +V V+DNV
Subjt:  VVAERKWLTAAELGPADFSGWCLYYITVAIGIAKIVK-SVAWFGRIFVSKKQSKSSDEVVVVQDNV

A0A1S3BZH0 uncharacterized protein LOC1034947296.9e-10579.7Show/hide
Query:  MEVLKPSFLFHLFAINLLGLLLPLSFLLLARLSSALYLAAPSP-VPSLLLSLILYVNSPLLYLLVAFVIVSTLLHSLTGKSALLTKLSGPVSQPRLYIAW
        ME+L  SFLFHLFAINLLGLLLPLS LLLARLSSALYL A  P  PS LLSLILYVNSPLL+LLV+FVI+STLLHSLTGKS L TKL GPVSQPRLY AW
Subjt:  MEVLKPSFLFHLFAINLLGLLLPLSFLLLARLSSALYLAAPSP-VPSLLLSLILYVNSPLLYLLVAFVIVSTLLHSLTGKSALLTKLSGPVSQPRLYIAW

Query:  IFLCTLQVCVGVGIEGSLSTGLNDV-AAGRVEGRLWGRLLFFLGLHEAVVHWTTTVVKPVVDDTVIGESRKERWFETAATAVSFGGLWWWRLRDEAEALA
        IFLCTLQVCVGVGIEGSLS+GLND+ + G VEG +W RLLFF GLHEAVVHWT  VVKPVVDDT+ GESRKE+WFETAATAVS GGLWWWRLRDEAE L 
Subjt:  IFLCTLQVCVGVGIEGSLSTGLNDV-AAGRVEGRLWGRLLFFLGLHEAVVHWTTTVVKPVVDDTVIGESRKERWFETAATAVSFGGLWWWRLRDEAEALA

Query:  VVAERKWLTAAELGPADFSGWCLYYITVAIGIAKIVK-SVAWFGRIFVSKKQSKSSDEVVVVQDNV
        VVAE KWLT+ ELG AD SGWCLYYITV IGIAKIVK  + WFG IFVS+K SK+S+ +V V+DNV
Subjt:  VVAERKWLTAAELGPADFSGWCLYYITVAIGIAKIVK-SVAWFGRIFVSKKQSKSSDEVVVVQDNV

A0A6J1F2F5 uncharacterized protein LOC1114415784.5e-10477.78Show/hide
Query:  MEVLKPSFLFHLFAINLLGLLLPLSFLLLARLSSALYLAAPSPVPSLLLSLILYVNSPLLYLLVAFVIVSTLLHSLTGKSALLTKLSGPVSQPRLYIAWI
        ME+L+  FLFHL A+NLL LLLPLS LLLARLSSALYLA P  +P L LSLILY+ SPLL LLV+FV++S LLHSLTGKSAL TKL  PVSQPRLY  WI
Subjt:  MEVLKPSFLFHLFAINLLGLLLPLSFLLLARLSSALYLAAPSPVPSLLLSLILYVNSPLLYLLVAFVIVSTLLHSLTGKSALLTKLSGPVSQPRLYIAWI

Query:  FLCTLQVCVGVGIEGSLSTGLNDVAAGRVEGRLWGRLLFFLGLHEAVVHWTTTVVKPVVDDTVIGESRKERWFETAATAVSFGGLWWWRLRDEAEALAVV
        FLCTLQVCVGVGIEGSLS+GLN+ AAG +EG LW RLLFF+GLHEAVVHWT  VVKPVVDDTV GESRKERWFETAATAVS GG+WWWRLRDEA+AL VV
Subjt:  FLCTLQVCVGVGIEGSLSTGLNDVAAGRVEGRLWGRLLFFLGLHEAVVHWTTTVVKPVVDDTVIGESRKERWFETAATAVSFGGLWWWRLRDEAEALAVV

Query:  AERKWLTAAELGPADFSGWCLYYITVAIGIAKIVKSVAWFGRIFVSKKQSKSSDEVVVVQD
        AE KWLT+AELGPA+ + WCLYYI V IGI KIV SVAW  RI VSKK SK SDEVVVV +
Subjt:  AERKWLTAAELGPADFSGWCLYYITVAIGIAKIVKSVAWFGRIFVSKKQSKSSDEVVVVQD

A0A6J1GAY8 uncharacterized protein LOC1114525023.0e-10076.6Show/hide
Query:  MEVLKPSFLFHLFAINLLGLLLPLSFLLLARLSSALYLAAPSPVPSLLLSLILYVNSPLLYLLVAFVIVSTLLHSLTGKSALLTKLSGPVSQPRLYIAWI
        ME  KPSFLFHLFA+NLLGLLLPLSFLLL RLSSALYL AP   P L LSLILY+NSPLL+LLV FVIVS LLHSLTGKS L T   G VSQPRLY AWI
Subjt:  MEVLKPSFLFHLFAINLLGLLLPLSFLLLARLSSALYLAAPSPVPSLLLSLILYVNSPLLYLLVAFVIVSTLLHSLTGKSALLTKLSGPVSQPRLYIAWI

Query:  FLCTLQVCVGVGIEGSLSTGLNDVAAGRVEGRLWGRLLFFLGLHEAVVHWTTTVVKPVVDDTVIGESRKERWFETAATAVSFGGLWWWRLRDEAEALAVV
         LCT QVCVGVGIEGSLS GL D   GRVEG LW R+LFFLGLHE+VVHWT TVVKPVVDDTV GESRKERWFET AT +SF GLWWWRLRDEAEAL VV
Subjt:  FLCTLQVCVGVGIEGSLSTGLNDVAAGRVEGRLWGRLLFFLGLHEAVVHWTTTVVKPVVDDTVIGESRKERWFETAATAVSFGGLWWWRLRDEAEALAVV

Query:  AERKWLTAA-ELGPADFSGWCLYYITVAIGIAKIVKSVAWF-GRIFVSKKQSKSSDEVVVVQDNV
        AERKWL AA ELGP D  GWCLYY+ VA+GIAK+VKSVA F  ++  S+KQSKS +  VV++DNV
Subjt:  AERKWLTAA-ELGPADFSGWCLYYITVAIGIAKIVKSVAWF-GRIFVSKKQSKSSDEVVVVQDNV

A0A6J1J235 uncharacterized protein LOC1114820021.2e-10478.93Show/hide
Query:  MEVLKPSFLFHLFAINLLGLLLPLSFLLLARLSSALYLAAPSPVPSLLLSLILYVNSPLLYLLVAFVIVSTLLHSLTGKSALLTKLSGPVSQPRLYIAWI
        ME+L+P FLFHL A+NLL LLLPLS LLLARLSSALYLA    +P LLLSLILYV SPLL LLV+FV+VS LLHSLTGKSAL TKL  P+SQPRLY  WI
Subjt:  MEVLKPSFLFHLFAINLLGLLLPLSFLLLARLSSALYLAAPSPVPSLLLSLILYVNSPLLYLLVAFVIVSTLLHSLTGKSALLTKLSGPVSQPRLYIAWI

Query:  FLCTLQVCVGVGIEGSLSTGLNDVAAGRVEGRLWGRLLFFLGLHEAVVHWTTTVVKPVVDDTVIGESRKERWFETAATAVSFGGLWWWRLRDEAEALAVV
        FLCTLQVCVGVGIEGSLS+GLN  AAG VEG LW RLLFF GLHEAVVHWT  VVKPVVDDTV GESRKERWFETAATAVS GG+WWWRLRDEA+AL VV
Subjt:  FLCTLQVCVGVGIEGSLSTGLNDVAAGRVEGRLWGRLLFFLGLHEAVVHWTTTVVKPVVDDTVIGESRKERWFETAATAVSFGGLWWWRLRDEAEALAVV

Query:  AERKWLTAAELGPADFSGWCLYYITVAIGIAKIVKSVAWFGRIFVSKKQSKSSDEVVVVQD
        AE KWLT+ ELGPA+ + WCLYYI VAIGIAKIV SVAW  RI V KK SK SDEVVVV +
Subjt:  AERKWLTAAELGPADFSGWCLYYITVAIGIAKIVKSVAWFGRIFVSKKQSKSSDEVVVVQD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G02570.1 unknown protein4.1e-4142.31Show/hide
Query:  FHLFAINLLGLLLPLSFLLLARLSSALYLAAPSPVP-SLLLSLILYVNSPLLYLLVAFVIVSTLLHSLTGKSALLTKLSGPVSQPRLYIAWIFLCTLQVC
        F + +I+LL LL+PLSFL L+RLS +   ++ +PV  S + SL+   +  +LY +++ +IVSTL+H L+GK          V    LYI WI L  +Q C
Subjt:  FHLFAINLLGLLLPLSFLLLARLSSALYLAAPSPVP-SLLLSLILYVNSPLLYLLVAFVIVSTLLHSLTGKSALLTKLSGPVSQPRLYIAWIFLCTLQVC

Query:  VGVGIEGSLSTGLN---DVAAGRVEGRLW--GRLLFFLGLHEAVVHWTTTVVKPVVDDTVIG-ESRKERWFETAATAVSFGGLWWWRLRDEAEALAVVAE
        V  GIEG++ST ++   D +        W   R++FFLGLHE ++ W   VVKPV+DDTV G    +ERW E A  AV+FG +WWWRLRDE E+L VVAE
Subjt:  VGVGIEGSLSTGLN---DVAAGRVEGRLW--GRLLFFLGLHEAVVHWTTTVVKPVVDDTVIG-ESRKERWFETAATAVSFGGLWWWRLRDEAEALAVVAE

Query:  RKWLTAAELGPADFSGWCLYYITVAIGIAKIVKSVAWF-GRIFVSKKQSKSSDEVVVVQD
         K      L   DF  W +YYI V IG+ KI K   +F   + ++  +S+   E  +V D
Subjt:  RKWLTAAELGPADFSGWCLYYITVAIGIAKIVKSVAWF-GRIFVSKKQSKSSDEVVVVQD

AT1G02575.1 unknown protein3.9e-3640.93Show/hide
Query:  FHLFAINLLGLLLPLSFLLLARLSSALYLAAPSPVPSLLLSLILYVNSPLLYLLVAFVIVSTLLHSLTGKSALLTKLSGPVSQPRLYIAWIFLCTLQVCV
        F + +I+ L LLLPLSFL L+RLS  LY ++     S + S+I   +  +LY ++  +IV TL+HSL+GK          V    LYI WI L   Q C 
Subjt:  FHLFAINLLGLLLPLSFLLLARLSSALYLAAPSPVPSLLLSLILYVNSPLLYLLVAFVIVSTLLHSLTGKSALLTKLSGPVSQPRLYIAWIFLCTLQVCV

Query:  GVGIEGSLSTGLN-----DVAAGRVEGRLWGRLLFFLGLHEAVVHWTTTVVKPVVDDTVIGESRKERWFETAATAVSFGGLWWWRLRDEAEALAVVAERK
          GI+ ++ST ++     ++     E  +  R+LFFLGLHE ++ W   VVKPVVD+T+ G   +ERW E A  AV+FG +WWWRLRDE E+L VV    
Subjt:  GVGIEGSLSTGLN-----DVAAGRVEGRLWGRLLFFLGLHEAVVHWTTTVVKPVVDDTVIGESRKERWFETAATAVSFGGLWWWRLRDEAEALAVVAERK

Query:  WLT-AAELGPADFSGWCLYYITVAIGIAKIVKSVAWF-GRIFVSKKQSKSSDEVVVVQD
         L     L   +F  WC+YYI V IG+ KI K    F   + +S K+S+   E  V  D
Subjt:  WLT-AAELGPADFSGWCLYYITVAIGIAKIVKSVAWF-GRIFVSKKQSKSSDEVVVVQD

AT2G47360.1 unknown protein4.6e-3739.85Show/hide
Query:  VLKPSFLFHLFAINLLGLLLPLSFLLLARLSSALYL-----AAPSPVPS-LLLSLILYVNSPLLYLLVAFVIVSTLLHSLTGKSALLTKLSGPVSQPRLY
        V+KP   F L    LL LLLPLSFLLL+RLSSA +L     + P    S  + SL L  N  ++Y +V+ + V TL+  LT K             P + 
Subjt:  VLKPSFLFHLFAINLLGLLLPLSFLLLARLSSALYL-----AAPSPVPS-LLLSLILYVNSPLLYLLVAFVIVSTLLHSLTGKSALLTKLSGPVSQPRLY

Query:  IAWIFLCTLQVCVGVGIEGSLSTGLNDVAAGRVEGRLWGRLLFFLGLHEAVVHWTTTVVKPVVDDTVI----GESRKERWFETAATAVSFGGLWWWRLRD
        IAW+ L  +Q+ VG+G+E ++S GL     G  E     RL+FF GLHE ++ W   +V+PVVD+T++    G+ R+E   E  A AVS G LWWW+LRD
Subjt:  IAWIFLCTLQVCVGVGIEGSLSTGLNDVAAGRVEGRLWGRLLFFLGLHEAVVHWTTTVVKPVVDDTVI----GESRKERWFETAATAVSFGGLWWWRLRD

Query:  EAEALAVVAERKWL------------TAAELGPADFSGWCLYYITVAIGIAKIVKSVAWFGRIFVSKKQSK
        E EAL  VAE K               + ++G  DF  W LYY+ V IG+ +I+K   WFG I + ++ S+
Subjt:  EAEALAVVAERKWL------------TAAELGPADFSGWCLYYITVAIGIAKIVKSVAWFGRIFVSKKQSK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGTTCTAAAACCTTCATTTTTGTTTCATCTCTTCGCCATCAACCTTCTTGGCCTCTTGCTTCCCCTCTCTTTCCTCCTCCTCGCTCGCCTCTCCTCCGCCCTATA
TCTCGCCGCCCCGTCACCGGTACCGTCCTTGTTGCTTTCTCTCATTCTCTATGTAAACTCTCCTCTTCTTTACCTTCTTGTTGCTTTCGTTATCGTCTCCACCCTCCTTC
ATTCCCTCACCGGAAAATCCGCTCTCCTGACCAAGCTCTCCGGTCCGGTCTCCCAACCGCGTCTCTACATTGCTTGGATCTTCCTTTGTACACTTCAGGTATGTGTCGGT
GTCGGGATCGAGGGAAGCCTATCAACCGGTCTCAACGACGTGGCGGCCGGCCGCGTCGAGGGCAGGCTGTGGGGCAGGCTGTTGTTCTTCTTGGGACTTCATGAGGCAGT
GGTGCACTGGACGACGACAGTGGTGAAGCCGGTGGTGGACGACACCGTAATTGGGGAATCTCGAAAAGAGAGGTGGTTTGAAACGGCGGCGACGGCGGTGAGCTTCGGCG
GCCTGTGGTGGTGGCGGCTGAGGGATGAGGCGGAGGCGCTGGCAGTTGTGGCGGAAAGAAAGTGGTTGACGGCGGCGGAATTGGGTCCGGCGGACTTTTCCGGTTGGTGC
TTGTATTATATCACCGTCGCCATTGGAATCGCTAAGATTGTTAAATCCGTTGCTTGGTTTGGTCGGATTTTTGTCTCCAAAAAACAATCTAAAAGCTCCGACGAGGTCGT
GGTTGTTCAGGACAATGTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGTTCTAAAACCTTCATTTTTGTTTCATCTCTTCGCCATCAACCTTCTTGGCCTCTTGCTTCCCCTCTCTTTCCTCCTCCTCGCTCGCCTCTCCTCCGCCCTATA
TCTCGCCGCCCCGTCACCGGTACCGTCCTTGTTGCTTTCTCTCATTCTCTATGTAAACTCTCCTCTTCTTTACCTTCTTGTTGCTTTCGTTATCGTCTCCACCCTCCTTC
ATTCCCTCACCGGAAAATCCGCTCTCCTGACCAAGCTCTCCGGTCCGGTCTCCCAACCGCGTCTCTACATTGCTTGGATCTTCCTTTGTACACTTCAGGTATGTGTCGGT
GTCGGGATCGAGGGAAGCCTATCAACCGGTCTCAACGACGTGGCGGCCGGCCGCGTCGAGGGCAGGCTGTGGGGCAGGCTGTTGTTCTTCTTGGGACTTCATGAGGCAGT
GGTGCACTGGACGACGACAGTGGTGAAGCCGGTGGTGGACGACACCGTAATTGGGGAATCTCGAAAAGAGAGGTGGTTTGAAACGGCGGCGACGGCGGTGAGCTTCGGCG
GCCTGTGGTGGTGGCGGCTGAGGGATGAGGCGGAGGCGCTGGCAGTTGTGGCGGAAAGAAAGTGGTTGACGGCGGCGGAATTGGGTCCGGCGGACTTTTCCGGTTGGTGC
TTGTATTATATCACCGTCGCCATTGGAATCGCTAAGATTGTTAAATCCGTTGCTTGGTTTGGTCGGATTTTTGTCTCCAAAAAACAATCTAAAAGCTCCGACGAGGTCGT
GGTTGTTCAGGACAATGTTTGA
Protein sequenceShow/hide protein sequence
MEVLKPSFLFHLFAINLLGLLLPLSFLLLARLSSALYLAAPSPVPSLLLSLILYVNSPLLYLLVAFVIVSTLLHSLTGKSALLTKLSGPVSQPRLYIAWIFLCTLQVCVG
VGIEGSLSTGLNDVAAGRVEGRLWGRLLFFLGLHEAVVHWTTTVVKPVVDDTVIGESRKERWFETAATAVSFGGLWWWRLRDEAEALAVVAERKWLTAAELGPADFSGWC
LYYITVAIGIAKIVKSVAWFGRIFVSKKQSKSSDEVVVVQDNV