CuGenDBv2

HG10018669 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10018669
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Transposase
Genome location: Chr04:6650042..6656927
RNA-Seq Expression: HG10018669
Synteny: HG10018669
Gene Ontology terms: GO:0008152 - metabolic process (biological process)
                     GO:0016787 - hydrolase activity (molecular function)
InterPro domains: IPR029480 - Transposase-associated domain
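
The genome location in this record is written as `chromosome:start..end`. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not say so), the string can be split into its parts and the span length computed. `Location` and `parse_location` are hypothetical names used for illustration only; they are not part of CuGenDB.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'Chr04:6650042..6656927'-style string into its parts."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return Location(chrom, start, end)


loc = parse_location("Chr04:6650042..6656927")
print(loc.chrom, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 6,886 bp on Chr04.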


Homology
GenBank top hits | e value | % identity | Alignment
TYK07544.1 putative serine/threonine-protein kinase nek2 [Cucumis melo var. makuwa] | 3.7e-50 | 43.25
Query:  EKRKTRGPTIMREITRCSSQGDRKVVQYNDYGQPIGVNAAKLRSYIGSCVHHHVPITYATWKHVPKAIKEKIFKLIQAGFVIDEKAKKSILQNAGNSFRQ
        E ++ RGPTIM ++TR  S GD+KVV+YN+ G PIG N AKL+S+IGS  H+HVPITY +WK VP  +K+KIF  ++A FVID +++K++LQ AG SFRQ
Subjt:  EKRKTRGPTIMREITRCSSQGDRKVVQYNDYGQPIGVNAAKLRSYIGSCVHHHVPITYATWKHVPKAIKEKIFKLIQAGFVIDEKAKKSILQNAGNSFRQ

Query:  FKSWLTTRYIIPFKNQPDRLENPPDTYSYIERHHWHQFVKSRLSEGFEKK-------------NSSIEKAAQMNIKTEMENIEAEEEDESSKDVDTKPKV
        FK+WLTT+YIIP K++P  L+ PP+ YS+IE++HW +FV+SRLSE F++K             N  I +    N+K EM        DE S + D+    
Subjt:  FKSWLTTRYIIPFKNQPDRLENPPDTYSYIERHHWHQFVKSRLSEGFEKK-------------NSSIEKAAQMNIKTEMENIEAEEEDESSKDVDTKPKV

Query:  AMNPSC-DYVSTRTSGKPECSDQPKGEGGSSNPSASIETTKSGPELERMQKNEIDKMV--KALHRRVEELEEKMRQQF-PPNSECGSCN
        ++N  C + V T+  G  E + + +G GG   P+    + K      +  K+E + +V  + L RRV ELE ++R     P S  GSC+
Subjt:  AMNPSC-DYVSTRTSGKPECSDQPKGEGGSSNPSASIETTKSGPELERMQKNEIDKMV--KALHRRVEELEEKMRQQF-PPNSECGSCN

TYK07775.1 putative serine/threonine-protein kinase nek2 [Cucumis melo var. makuwa] | 3.7e-50 | 43.25
Query:  EKRKTRGPTIMREITRCSSQGDRKVVQYNDYGQPIGVNAAKLRSYIGSCVHHHVPITYATWKHVPKAIKEKIFKLIQAGFVIDEKAKKSILQNAGNSFRQ
        E ++ RGPTIM ++TR  S GD+KVV+YN+ G PIG N AKL+S+IGS  H+HVPITY +WK VP  +K+KIF  ++A FVID +++K++LQ AG SFRQ
Subjt:  EKRKTRGPTIMREITRCSSQGDRKVVQYNDYGQPIGVNAAKLRSYIGSCVHHHVPITYATWKHVPKAIKEKIFKLIQAGFVIDEKAKKSILQNAGNSFRQ

Query:  FKSWLTTRYIIPFKNQPDRLENPPDTYSYIERHHWHQFVKSRLSEGFEKK-------------NSSIEKAAQMNIKTEMENIEAEEEDESSKDVDTKPKV
        FK+WLTT+YIIP K++P  L+ PP+ YS+IE++HW +FV+SRLSE F++K             N  I +    N+K EM        DE S + D+    
Subjt:  FKSWLTTRYIIPFKNQPDRLENPPDTYSYIERHHWHQFVKSRLSEGFEKK-------------NSSIEKAAQMNIKTEMENIEAEEEDESSKDVDTKPKV

Query:  AMNPSC-DYVSTRTSGKPECSDQPKGEGGSSNPSASIETTKSGPELERMQKNEIDKMV--KALHRRVEELEEKMRQQF-PPNSECGSCN
        ++N  C + V T+  G  E + + +G GG   P+    + K      +  K+E + +V  + L RRV ELE ++R     P S  GSC+
Subjt:  AMNPSC-DYVSTRTSGKPECSDQPKGEGGSSNPSASIETTKSGPELERMQKNEIDKMV--KALHRRVEELEEKMRQQF-PPNSECGSCN

TYK24336.1 putative serine/threonine-protein kinase nek2 [Cucumis melo var. makuwa] | 3.7e-50 | 43.25
Query:  EKRKTRGPTIMREITRCSSQGDRKVVQYNDYGQPIGVNAAKLRSYIGSCVHHHVPITYATWKHVPKAIKEKIFKLIQAGFVIDEKAKKSILQNAGNSFRQ
        E ++ RGPTIM ++TR  S GD+KVV+YN+ G PIG N AKL+S+IGS  H+HVPITY +WK VP  +K+KIF  ++A FVID +++K++LQ AG SFRQ
Subjt:  EKRKTRGPTIMREITRCSSQGDRKVVQYNDYGQPIGVNAAKLRSYIGSCVHHHVPITYATWKHVPKAIKEKIFKLIQAGFVIDEKAKKSILQNAGNSFRQ

Query:  FKSWLTTRYIIPFKNQPDRLENPPDTYSYIERHHWHQFVKSRLSEGFEKK-------------NSSIEKAAQMNIKTEMENIEAEEEDESSKDVDTKPKV
        FK+WLTT+YIIP K++P  L+ PP+ YS+IE++HW +FV+SRLSE F++K             N  I +    N+K EM        DE S + D+    
Subjt:  FKSWLTTRYIIPFKNQPDRLENPPDTYSYIERHHWHQFVKSRLSEGFEKK-------------NSSIEKAAQMNIKTEMENIEAEEEDESSKDVDTKPKV

Query:  AMNPSC-DYVSTRTSGKPECSDQPKGEGGSSNPSASIETTKSGPELERMQKNEIDKMV--KALHRRVEELEEKMRQQF-PPNSECGSCN
        ++N  C + V T+  G  E + + +G GG   P+    + K      +  K+E + +V  + L RRV ELE ++R     P S  GSC+
Subjt:  AMNPSC-DYVSTRTSGKPECSDQPKGEGGSSNPSASIETTKSGPELERMQKNEIDKMV--KALHRRVEELEEKMRQQF-PPNSECGSCN

TYK29003.1 putative serine/threonine-protein kinase nek2 [Cucumis melo var. makuwa] | 3.7e-50 | 43.25
Query:  EKRKTRGPTIMREITRCSSQGDRKVVQYNDYGQPIGVNAAKLRSYIGSCVHHHVPITYATWKHVPKAIKEKIFKLIQAGFVIDEKAKKSILQNAGNSFRQ
        E ++ RGPTIM ++TR  S GD+KVV+YN+ G PIG N AKL+S+IGS  H+HVPITY +WK VP  +K+KIF  ++A FVID +++K++LQ AG SFRQ
Subjt:  EKRKTRGPTIMREITRCSSQGDRKVVQYNDYGQPIGVNAAKLRSYIGSCVHHHVPITYATWKHVPKAIKEKIFKLIQAGFVIDEKAKKSILQNAGNSFRQ

Query:  FKSWLTTRYIIPFKNQPDRLENPPDTYSYIERHHWHQFVKSRLSEGFEKK-------------NSSIEKAAQMNIKTEMENIEAEEEDESSKDVDTKPKV
        FK+WLTT+YIIP K++P  L+ PP+ YS+IE++HW +FV+SRLSE F++K             N  I +    N+K EM        DE S + D+    
Subjt:  FKSWLTTRYIIPFKNQPDRLENPPDTYSYIERHHWHQFVKSRLSEGFEKK-------------NSSIEKAAQMNIKTEMENIEAEEEDESSKDVDTKPKV

Query:  AMNPSC-DYVSTRTSGKPECSDQPKGEGGSSNPSASIETTKSGPELERMQKNEIDKMV--KALHRRVEELEEKMRQQF-PPNSECGSCN
        ++N  C + V T+  G  E + + +G GG   P+    + K      +  K+E + +V  + L RRV ELE ++R     P S  GSC+
Subjt:  AMNPSC-DYVSTRTSGKPECSDQPKGEGGSSNPSASIETTKSGPELERMQKNEIDKMV--KALHRRVEELEEKMRQQF-PPNSECGSCN

XP_022148697.1 uncharacterized protein LOC111017298 [Momordica charantia] | 1.2e-144 | 56.87
Query:  MDKSWMGRSRLSKEYDLGVEMFIKFGERHANGSTGIRCPCLRCGNLKRHCSQEIRDHLYIYGIDQSYKTWFWHGEELSNGLMDEKVGDNKNRDYAKIVNV
        MDKSWMG+SRLSKEYDLGVEMFIKFGERHA GST IRCPCL+CGN     S++IRDHLYI+GIDQSYKTWFWHGEELS+ L  ++VG+N +         
Subjt:  MDKSWMGRSRLSKEYDLGVEMFIKFGERHANGSTGIRCPCLRCGNLKRHCSQEIRDHLYIYGIDQSYKTWFWHGEELSNGLMDEKVGDNKNRDYAKIVNV

Query:  IVVETLEPTCIFMVLIKVIQLGFGMVKNFQLNSKSKKKVASWKKSLDEQNKNHDLKDVMENDNAAHATNPLMNEYNLNISNTSHIKFYCEKRKTRGPTIM
                                                    SL+E+++  +L ++ +   AAHATNPLM+  N N   TS++K  C K++ RGPT M
Subjt:  IVVETLEPTCIFMVLIKVIQLGFGMVKNFQLNSKSKKKVASWKKSLDEQNKNHDLKDVMENDNAAHATNPLMNEYNLNISNTSHIKFYCEKRKTRGPTIM

Query:  REITRCSSQGDRKVVQYNDYGQPIGVNAAKLRSYIGSCVHHHVPITYATWKHVPKAIKEKIFKLIQAGFVIDEKAKKSILQNAGNSFRQFKSWLTTRYII
        REITRCSSQGDRKVV+YNDYGQPIGVN AKL+SYIGSCVH+HVPITY+TWK VP   KEKIFKLIQAG VID  +KKSILQ AGNSFRQFKS LTT +II
Subjt:  REITRCSSQGDRKVVQYNDYGQPIGVNAAKLRSYIGSCVHHHVPITYATWKHVPKAIKEKIFKLIQAGFVIDEKAKKSILQNAGNSFRQFKSWLTTRYII

Query:  PFKNQPDRLENPPDTYSYIERHHWHQFVKSRLSEGFE-------------------------------KKNSSIEKAAQMNIK-TEMENIEAEEEDESSK
        PFK+QP RLENPPDTYS+IE HHW QFVKSRLSE FE                               KK+S  +KAA+MN K  EMEN +  EEDES K
Subjt:  PFKNQPDRLENPPDTYSYIERHHWHQFVKSRLSEGFE-------------------------------KKNSSIEKAAQMNIK-TEMENIEAEEEDESSK

Query:  DVDTKPKVAM--NPSCDYVST-RTSGKPECSDQPKGEGGSSNPSASIETTKSGPELERMQKNEIDKMVKALHRRVEELEEKMR-QQFPPNSECGSCNEQQ
        DVD K KV M  NPSCDYVST R SG+ E SD+ KGEGG S PSA   TTK  P+ ER Q+ E ++ +K L+RR++ELEE++R +Q PP SE GSC+E Q
Subjt:  DVDTKPKVAM--NPSCDYVST-RTSGKPECSDQPKGEGGSSNPSASIETTKSGPELERMQKNEIDKMVKALHRRVEELEEKMR-QQFPPNSECGSCNEQQ

Query:  HSENEVSGPVKNWSLEK
        HS+ E+ GP  NW+ +K
Subjt:  HSENEVSGPVKNWSLEK

TrEMBL top hits | e value | % identity | Alignment
A0A5D3C8G6 Putative serine/threonine-protein kinase nek2 | 1.8e-50 | 43.25
Query:  EKRKTRGPTIMREITRCSSQGDRKVVQYNDYGQPIGVNAAKLRSYIGSCVHHHVPITYATWKHVPKAIKEKIFKLIQAGFVIDEKAKKSILQNAGNSFRQ
        E ++ RGPTIM ++TR  S GD+KVV+YN+ G PIG N AKL+S+IGS  H+HVPITY +WK VP  +K+KIF  ++A FVID +++K++LQ AG SFRQ
Subjt:  EKRKTRGPTIMREITRCSSQGDRKVVQYNDYGQPIGVNAAKLRSYIGSCVHHHVPITYATWKHVPKAIKEKIFKLIQAGFVIDEKAKKSILQNAGNSFRQ

Query:  FKSWLTTRYIIPFKNQPDRLENPPDTYSYIERHHWHQFVKSRLSEGFEKK-------------NSSIEKAAQMNIKTEMENIEAEEEDESSKDVDTKPKV
        FK+WLTT+YIIP K++P  L+ PP+ YS+IE++HW +FV+SRLSE F++K             N  I +    N+K EM        DE S + D+    
Subjt:  FKSWLTTRYIIPFKNQPDRLENPPDTYSYIERHHWHQFVKSRLSEGFEKK-------------NSSIEKAAQMNIKTEMENIEAEEEDESSKDVDTKPKV

Query:  AMNPSC-DYVSTRTSGKPECSDQPKGEGGSSNPSASIETTKSGPELERMQKNEIDKMV--KALHRRVEELEEKMRQQF-PPNSECGSCN
        ++N  C + V T+  G  E + + +G GG   P+    + K      +  K+E + +V  + L RRV ELE ++R     P S  GSC+
Subjt:  AMNPSC-DYVSTRTSGKPECSDQPKGEGGSSNPSASIETTKSGPELERMQKNEIDKMV--KALHRRVEELEEKMRQQF-PPNSECGSCN

A0A5D3C8G8 Putative serine/threonine-protein kinase nek2 | 1.8e-50 | 43.25
Query:  EKRKTRGPTIMREITRCSSQGDRKVVQYNDYGQPIGVNAAKLRSYIGSCVHHHVPITYATWKHVPKAIKEKIFKLIQAGFVIDEKAKKSILQNAGNSFRQ
        E ++ RGPTIM ++TR  S GD+KVV+YN+ G PIG N AKL+S+IGS  H+HVPITY +WK VP  +K+KIF  ++A FVID +++K++LQ AG SFRQ
Subjt:  EKRKTRGPTIMREITRCSSQGDRKVVQYNDYGQPIGVNAAKLRSYIGSCVHHHVPITYATWKHVPKAIKEKIFKLIQAGFVIDEKAKKSILQNAGNSFRQ

Query:  FKSWLTTRYIIPFKNQPDRLENPPDTYSYIERHHWHQFVKSRLSEGFEKK-------------NSSIEKAAQMNIKTEMENIEAEEEDESSKDVDTKPKV
        FK+WLTT+YIIP K++P  L+ PP+ YS+IE++HW +FV+SRLSE F++K             N  I +    N+K EM        DE S + D+    
Subjt:  FKSWLTTRYIIPFKNQPDRLENPPDTYSYIERHHWHQFVKSRLSEGFEKK-------------NSSIEKAAQMNIKTEMENIEAEEEDESSKDVDTKPKV

Query:  AMNPSC-DYVSTRTSGKPECSDQPKGEGGSSNPSASIETTKSGPELERMQKNEIDKMV--KALHRRVEELEEKMRQQF-PPNSECGSCN
        ++N  C + V T+  G  E + + +G GG   P+    + K      +  K+E + +V  + L RRV ELE ++R     P S  GSC+
Subjt:  AMNPSC-DYVSTRTSGKPECSDQPKGEGGSSNPSASIETTKSGPELERMQKNEIDKMV--KALHRRVEELEEKMRQQF-PPNSECGSCN

A0A5D3DME5 Putative serine/threonine-protein kinase nek2 | 1.8e-50 | 43.25
Query:  EKRKTRGPTIMREITRCSSQGDRKVVQYNDYGQPIGVNAAKLRSYIGSCVHHHVPITYATWKHVPKAIKEKIFKLIQAGFVIDEKAKKSILQNAGNSFRQ
        E ++ RGPTIM ++TR  S GD+KVV+YN+ G PIG N AKL+S+IGS  H+HVPITY +WK VP  +K+KIF  ++A FVID +++K++LQ AG SFRQ
Subjt:  EKRKTRGPTIMREITRCSSQGDRKVVQYNDYGQPIGVNAAKLRSYIGSCVHHHVPITYATWKHVPKAIKEKIFKLIQAGFVIDEKAKKSILQNAGNSFRQ

Query:  FKSWLTTRYIIPFKNQPDRLENPPDTYSYIERHHWHQFVKSRLSEGFEKK-------------NSSIEKAAQMNIKTEMENIEAEEEDESSKDVDTKPKV
        FK+WLTT+YIIP K++P  L+ PP+ YS+IE++HW +FV+SRLSE F++K             N  I +    N+K EM        DE S + D+    
Subjt:  FKSWLTTRYIIPFKNQPDRLENPPDTYSYIERHHWHQFVKSRLSEGFEKK-------------NSSIEKAAQMNIKTEMENIEAEEEDESSKDVDTKPKV

Query:  AMNPSC-DYVSTRTSGKPECSDQPKGEGGSSNPSASIETTKSGPELERMQKNEIDKMV--KALHRRVEELEEKMRQQF-PPNSECGSCN
        ++N  C + V T+  G  E + + +G GG   P+    + K      +  K+E + +V  + L RRV ELE ++R     P S  GSC+
Subjt:  AMNPSC-DYVSTRTSGKPECSDQPKGEGGSSNPSASIETTKSGPELERMQKNEIDKMV--KALHRRVEELEEKMRQQF-PPNSECGSCN

A0A5D3DZ21 Putative serine/threonine-protein kinase nek2 | 1.8e-50 | 43.25
Query:  EKRKTRGPTIMREITRCSSQGDRKVVQYNDYGQPIGVNAAKLRSYIGSCVHHHVPITYATWKHVPKAIKEKIFKLIQAGFVIDEKAKKSILQNAGNSFRQ
        E ++ RGPTIM ++TR  S GD+KVV+YN+ G PIG N AKL+S+IGS  H+HVPITY +WK VP  +K+KIF  ++A FVID +++K++LQ AG SFRQ
Subjt:  EKRKTRGPTIMREITRCSSQGDRKVVQYNDYGQPIGVNAAKLRSYIGSCVHHHVPITYATWKHVPKAIKEKIFKLIQAGFVIDEKAKKSILQNAGNSFRQ

Query:  FKSWLTTRYIIPFKNQPDRLENPPDTYSYIERHHWHQFVKSRLSEGFEKK-------------NSSIEKAAQMNIKTEMENIEAEEEDESSKDVDTKPKV
        FK+WLTT+YIIP K++P  L+ PP+ YS+IE++HW +FV+SRLSE F++K             N  I +    N+K EM        DE S + D+    
Subjt:  FKSWLTTRYIIPFKNQPDRLENPPDTYSYIERHHWHQFVKSRLSEGFEKK-------------NSSIEKAAQMNIKTEMENIEAEEEDESSKDVDTKPKV

Query:  AMNPSC-DYVSTRTSGKPECSDQPKGEGGSSNPSASIETTKSGPELERMQKNEIDKMV--KALHRRVEELEEKMRQQF-PPNSECGSCN
        ++N  C + V T+  G  E + + +G GG   P+    + K      +  K+E + +V  + L RRV ELE ++R     P S  GSC+
Subjt:  AMNPSC-DYVSTRTSGKPECSDQPKGEGGSSNPSASIETTKSGPELERMQKNEIDKMV--KALHRRVEELEEKMRQQF-PPNSECGSCN

A0A6J1D4R5 uncharacterized protein LOC111017298 | 5.6e-145 | 56.87
Query:  MDKSWMGRSRLSKEYDLGVEMFIKFGERHANGSTGIRCPCLRCGNLKRHCSQEIRDHLYIYGIDQSYKTWFWHGEELSNGLMDEKVGDNKNRDYAKIVNV
        MDKSWMG+SRLSKEYDLGVEMFIKFGERHA GST IRCPCL+CGN     S++IRDHLYI+GIDQSYKTWFWHGEELS+ L  ++VG+N +         
Subjt:  MDKSWMGRSRLSKEYDLGVEMFIKFGERHANGSTGIRCPCLRCGNLKRHCSQEIRDHLYIYGIDQSYKTWFWHGEELSNGLMDEKVGDNKNRDYAKIVNV

Query:  IVVETLEPTCIFMVLIKVIQLGFGMVKNFQLNSKSKKKVASWKKSLDEQNKNHDLKDVMENDNAAHATNPLMNEYNLNISNTSHIKFYCEKRKTRGPTIM
                                                    SL+E+++  +L ++ +   AAHATNPLM+  N N   TS++K  C K++ RGPT M
Subjt:  IVVETLEPTCIFMVLIKVIQLGFGMVKNFQLNSKSKKKVASWKKSLDEQNKNHDLKDVMENDNAAHATNPLMNEYNLNISNTSHIKFYCEKRKTRGPTIM

Query:  REITRCSSQGDRKVVQYNDYGQPIGVNAAKLRSYIGSCVHHHVPITYATWKHVPKAIKEKIFKLIQAGFVIDEKAKKSILQNAGNSFRQFKSWLTTRYII
        REITRCSSQGDRKVV+YNDYGQPIGVN AKL+SYIGSCVH+HVPITY+TWK VP   KEKIFKLIQAG VID  +KKSILQ AGNSFRQFKS LTT +II
Subjt:  REITRCSSQGDRKVVQYNDYGQPIGVNAAKLRSYIGSCVHHHVPITYATWKHVPKAIKEKIFKLIQAGFVIDEKAKKSILQNAGNSFRQFKSWLTTRYII

Query:  PFKNQPDRLENPPDTYSYIERHHWHQFVKSRLSEGFE-------------------------------KKNSSIEKAAQMNIK-TEMENIEAEEEDESSK
        PFK+QP RLENPPDTYS+IE HHW QFVKSRLSE FE                               KK+S  +KAA+MN K  EMEN +  EEDES K
Subjt:  PFKNQPDRLENPPDTYSYIERHHWHQFVKSRLSEGFE-------------------------------KKNSSIEKAAQMNIK-TEMENIEAEEEDESSK

Query:  DVDTKPKVAM--NPSCDYVST-RTSGKPECSDQPKGEGGSSNPSASIETTKSGPELERMQKNEIDKMVKALHRRVEELEEKMR-QQFPPNSECGSCNEQQ
        DVD K KV M  NPSCDYVST R SG+ E SD+ KGEGG S PSA   TTK  P+ ER Q+ E ++ +K L+RR++ELEE++R +Q PP SE GSC+E Q
Subjt:  DVDTKPKVAM--NPSCDYVST-RTSGKPECSDQPKGEGGSSNPSASIETTKSGPELERMQKNEIDKMVKALHRRVEELEEKMR-QQFPPNSECGSCNEQQ

Query:  HSENEVSGPVKNWSLEK
        HS+ E+ GP  NW+ +K
Subjt:  HSENEVSGPVKNWSLEK

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGGATAAATCATGGATGGGAAGAAGTAGATTATCAAAGGAGTATGACTTGGGAGTTGAAATGTTTATTAAATTTGGAGAACGTCATGCGAATGGGTCAACTGGCATTAG
ATGTCCTTGTTTGAGATGTGGAAATCTTAAACGGCATTGTAGTCAAGAGATTAGAGATCACTTGTACATTTATGGCATTGATCAGAGTTACAAGACATGGTTTTGGCATG
GTGAAGAACTTTCAAATGGTTTAATGGACGAAAAAGTAGGAGACAACAAGAACAGAGACTATGCAAAAATCGTAAATGTCATAGTAGTGGAGACATTAGAGCCCACTTGT
ATTTTCATGGTATTGATCAAAGTTATACAACTTGGTTTTGGCATGGTGAAAAACTTTCAATTGAACTCAAAGTCAAAGAAGAAGGTGGCATCATGGAAAAAATCATTGGA
TGAACAGAACAAAAATCATGATTTGAAGGATGTAATGGAAAATGATAACGCAGCTCATGCCACAAATCCTCTCATGAATGAATACAATCTAAATATTTCTAACACATCTC
ATATCAAGTTCTATTGCGAGAAGAGGAAAACACGTGGTCCAACAATTATGCGTGAAATTACTCGATGTAGTAGCCAGGGAGATAGAAAGGTAGTACAGTATAATGACTAT
GGACAACCAATTGGAGTGAATGCAGCAAAACTGAGGAGTTACATTGGCTCTTGTGTCCATCACCATGTCCCAATTACTTATGCTACTTGGAAACATGTACCTAAAGCGAT
TAAGGAAAAGATTTTTAAGTTGATTCAGGCTGGTTTTGTCATTGATGAAAAGGCAAAAAAGTCCATCTTGCAAAATGCTGGTAATTCATTCCGTCAATTTAAAAGCTGGT
TGACGACTCGTTATATAATTCCTTTCAAAAATCAGCCAGATCGATTGGAAAATCCTCCAGATACTTATTCATACATTGAGAGACATCATTGGCACCAATTTGTCAAATCA
CGATTGAGCGAAGGATTTGAGAAAAAGAATTCTTCTATTGAAAAAGCTGCTCAAATGAACATAAAAACAGAGATGGAAAACATAGAAGCAGAGGAGGAGGATGAGTCGTC
CAAGGATGTTGATACTAAGCCTAAGGTGGCCATGAATCCTAGTTGTGATTATGTCTCGACTCGAACATCGGGGAAACCGGAATGCTCTGATCAACCGAAGGGAGAAGGTG
GTTCGAGTAATCCTAGCGCAAGCATTGAGACAACAAAATCTGGTCCTGAACTAGAGAGAATGCAAAAGAATGAAATTGATAAGATGGTCAAAGCGTTACATCGACGTGTT
GAGGAGTTGGAAGAGAAAATGCGACAACAGTTTCCTCCCAATTCCGAGTGTGGCAGTTGTAATGAGCAACAACATTCAGAAAACGAGGTATCGGGACCTGTTAAGAATTG
GAGCTTGGAAAAGGGAGTTTTGGCGAGTTGA
mRNA sequence
ATGGATAAATCATGGATGGGAAGAAGTAGATTATCAAAGGAGTATGACTTGGGAGTTGAAATGTTTATTAAATTTGGAGAACGTCATGCGAATGGGTCAACTGGCATTAG
ATGTCCTTGTTTGAGATGTGGAAATCTTAAACGGCATTGTAGTCAAGAGATTAGAGATCACTTGTACATTTATGGCATTGATCAGAGTTACAAGACATGGTTTTGGCATG
GTGAAGAACTTTCAAATGGTTTAATGGACGAAAAAGTAGGAGACAACAAGAACAGAGACTATGCAAAAATCGTAAATGTCATAGTAGTGGAGACATTAGAGCCCACTTGT
ATTTTCATGGTATTGATCAAAGTTATACAACTTGGTTTTGGCATGGTGAAAAACTTTCAATTGAACTCAAAGTCAAAGAAGAAGGTGGCATCATGGAAAAAATCATTGGA
TGAACAGAACAAAAATCATGATTTGAAGGATGTAATGGAAAATGATAACGCAGCTCATGCCACAAATCCTCTCATGAATGAATACAATCTAAATATTTCTAACACATCTC
ATATCAAGTTCTATTGCGAGAAGAGGAAAACACGTGGTCCAACAATTATGCGTGAAATTACTCGATGTAGTAGCCAGGGAGATAGAAAGGTAGTACAGTATAATGACTAT
GGACAACCAATTGGAGTGAATGCAGCAAAACTGAGGAGTTACATTGGCTCTTGTGTCCATCACCATGTCCCAATTACTTATGCTACTTGGAAACATGTACCTAAAGCGAT
TAAGGAAAAGATTTTTAAGTTGATTCAGGCTGGTTTTGTCATTGATGAAAAGGCAAAAAAGTCCATCTTGCAAAATGCTGGTAATTCATTCCGTCAATTTAAAAGCTGGT
TGACGACTCGTTATATAATTCCTTTCAAAAATCAGCCAGATCGATTGGAAAATCCTCCAGATACTTATTCATACATTGAGAGACATCATTGGCACCAATTTGTCAAATCA
CGATTGAGCGAAGGATTTGAGAAAAAGAATTCTTCTATTGAAAAAGCTGCTCAAATGAACATAAAAACAGAGATGGAAAACATAGAAGCAGAGGAGGAGGATGAGTCGTC
CAAGGATGTTGATACTAAGCCTAAGGTGGCCATGAATCCTAGTTGTGATTATGTCTCGACTCGAACATCGGGGAAACCGGAATGCTCTGATCAACCGAAGGGAGAAGGTG
GTTCGAGTAATCCTAGCGCAAGCATTGAGACAACAAAATCTGGTCCTGAACTAGAGAGAATGCAAAAGAATGAAATTGATAAGATGGTCAAAGCGTTACATCGACGTGTT
GAGGAGTTGGAAGAGAAAATGCGACAACAGTTTCCTCCCAATTCCGAGTGTGGCAGTTGTAATGAGCAACAACATTCAGAAAACGAGGTATCGGGACCTGTTAAGAATTG
GAGCTTGGAAAAGGGAGTTTTGGCGAGTTGA
Protein sequence
MDKSWMGRSRLSKEYDLGVEMFIKFGERHANGSTGIRCPCLRCGNLKRHCSQEIRDHLYIYGIDQSYKTWFWHGEELSNGLMDEKVGDNKNRDYAKIVNVIVVETLEPTC
IFMVLIKVIQLGFGMVKNFQLNSKSKKKVASWKKSLDEQNKNHDLKDVMENDNAAHATNPLMNEYNLNISNTSHIKFYCEKRKTRGPTIMREITRCSSQGDRKVVQYNDY
GQPIGVNAAKLRSYIGSCVHHHVPITYATWKHVPKAIKEKIFKLIQAGFVIDEKAKKSILQNAGNSFRQFKSWLTTRYIIPFKNQPDRLENPPDTYSYIERHHWHQFVKS
RLSEGFEKKNSSIEKAAQMNIKTEMENIEAEEEDESSKDVDTKPKVAMNPSCDYVSTRTSGKPECSDQPKGEGGSSNPSASIETTKSGPELERMQKNEIDKMVKALHRRV
EELEEKMRQQFPPNSECGSCNEQQHSENEVSGPVKNWSLEKGVLAS
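
The CDS and protein sequences in this record can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1), which the page does not state explicitly. `translate` is a hypothetical helper written for illustration, and only the first and last codons of the CDS above are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G and the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 24 and last 18 nucleotides of the CDS shown in this record.
cds_start = "ATGGATAAATCATGGATGGGAAGA"
cds_end = "GGAGTTTTGGCGAGTTGA"

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to `MDKSWMGR` and `GVLAS`, which match the start and end of the protein sequence listed above. Passing the full CDS to `translate` would allow the same comparison over the whole sequence.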