CuGenDBv2

Moc02g15690 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc02g15690
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Integrase catalytic domain-containing protein
Genome location: chr2:11822409..11826137
RNA-Seq Expression: Moc02g15690
Synteny: Moc02g15690
Gene Ontology terms: NA
InterPro domains: NA
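The genome location above follows a `seqid:start..end` pattern. A minimal sketch of how such a string could be parsed is shown here; the names `Location` and `parse_location` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates are ASSUMED 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both endpoints are counted.
        return self.end - self.start + 1


# Matches strings such as "chr2:11822409..11826137".
_LOC_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location."""
    m = _LOC_RE.match(text.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(m["start"]), int(m["end"])
    if start > end:
        raise ValueError("start must not exceed end")
    return Location(m["seqid"], start, end)


loc = parse_location("chr2:11822409..11826137")
print(loc.seqid, loc.start, loc.end, loc.length)  # chr2 11822409 11826137 3729
```

Under the inclusive-coordinate assumption, the gene spans 3729 bp on chr2.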


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
KAA0057475.1 uncharacterized protein E6C27_scaffold280G003560 [Cucumis melo var. makuwa]; e value: 9.4e-71; % identity: 49.54
Query:  SSSSIATEAAINPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRKGSLKMTDFLRVMKSHADNLGQA
        SSS+      +NP YE WVT+D LLLG +YNSM P+VA Q+MG+  A DLW AIQ LFG++S+AEE +LR  FQ TR+G+ KM D+LR+MK +ADNLGQA
Subjt:  SSSSIATEAAINPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRKGSLKMTDFLRVMKSHADNLGQA

Query:  GSPVPTRSLISQVLLGLDEEYNPVVATIQGKRGISWPEMQAELLVFEK---------RNNTQSQGKNFNGDSNQGV--NNNSGQGTSYAFTATQNNNPFL
        GSPVP R LISQVLLGLDE YNPV A IQGK  ISW +MQ+ELL+FE           + T     +   + N+G   N N  Q    AF  TQ ++  L
Subjt:  GSPVPTRSLISQVLLGLDEEYNPVVATIQGKRGISWPEMQAELLVFEK---------RNNTQSQGKNFNGDSNQGV--NNNSGQGTSYAFTATQNNNPFL

Query:  ANPETVIDPNWYVDSGASNHVTADYNSMVQPTEYGGMERVTVGNDDKLKISHVGKSCLVSDGGLVMLENVLCVPNIAKNLVSVSKLAKDNNVYLEFHADS
        A PETV+D N YVDSGA+NHVT+D++++    +Y G E V VGN++KL+IS VG + L      + L+ +LCVP I KN      LAKD           
Subjt:  ANPETVIDPNWYVDSGASNHVTADYNSMVQPTEYGGMERVTVGNDDKLKISHVGKSCLVSDGGLVMLENVLCVPNIAKNLVSVSKLAKDNNVYLEFHADS

Query:  CLVKDIRSGKVVLKGALKDGLYRLNTVGV
               +G+V+LKG L DGLY L  V +
Subjt:  CLVKDIRSGKVVLKGALKDGLYRLNTVGV

KAA0060208.1 Integrase, catalytic core [Cucumis melo var. makuwa]; e value: 5.9e-65; % identity: 45.13
Query:  FVQQSIGNMETSQTNISAPSSSSIATEA-AINPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRKGS
        F+   I   ++     +   +SS +TE   +NP ++ WVT+D LLLGW+YNSMT EVA Q+MG+  A DL  AIQ+LFGVQS+ EED+LR  FQ TRKG+
Subjt:  FVQQSIGNMETSQTNISAPSSSSIATEA-AINPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRKGS

Query:  LKMTDFLRVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNPVVATIQGKRGISWPEMQAELLVFEKRNNTQSQGKN-FNGDSNQGVNNNSGQGTSYAF
         KM D+LR+MK++A+NLGQAGSP+P RSLISQVLLGLDE YNP      G                  +N T +  +N  N D+ Q     SG       
Subjt:  LKMTDFLRVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNPVVATIQGKRGISWPEMQAELLVFEKRNNTQSQGKN-FNGDSNQGVNNNSGQGTSYAF

Query:  TATQNNNPF-------LANPETVIDPNWY------------VDSGASNHVTADYNSMVQPTEYGGMERVTVGNDDKLKISHVGKSCLVSDGGLVMLENVL
          +    PF       +A  E +     Y            VDSGA+NHVT DY+++  P+EY G+E V VGN ++L+IS  G S L      + LENVL
Subjt:  TATQNNNPF-------LANPETVIDPNWY------------VDSGASNHVTADYNSMVQPTEYGGMERVTVGNDDKLKISHVGKSCLVSDGGLVMLENVL

Query:  CVPNIAKNLVSVSKLAKDNNVYLEFHADSCLVKDIRSGKVVLKGALKDGLYRLNTVGVV
         VP+I KNL+SVSKL +DNNV LEF+ D C VKD  +G+ +++G L+DGLY L   GV+
Subjt:  CVPNIAKNLVSVSKLAKDNNVYLEFHADSCLVKDIRSGKVVLKGALKDGLYRLNTVGVV

TYK05754.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]; e value: 2.2e-67; % identity: 42.27
Query:  VATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRKGSLKMTDFLRVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNPVVATIQGKRGISW
        +A Q+MG+ NA DLW A Q+LFGVQS+AEED+LRQ+FQ TRK      D+LR+MK+++D LGQAGSPVP R+ ISQ LLGLDE YNPV+A IQGK  ISW
Subjt:  VATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRKGSLKMTDFLRVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNPVVATIQGKRGISW

Query:  PEMQAELLVFEKR----------------------NNTQSQGKNFNGDSNQGVNNNSGQGTSYAFTATQN----------------------------NN
         +MQ+ELL FEKR                      N   S  + ++     G N N+ QG    F   +                             N 
Subjt:  PEMQAELLVFEKR----------------------NNTQSQGKNFNGDSNQGVNNNSGQGTSYAFTATQN----------------------------NN

Query:  PFL--------------------------------ANPETVIDPNWYVDSGASNHVTADYNSMVQPTEYGGMERVTVGNDDKLKISHVGKSCLVSDGGLV
         FL                                A  +TVI+ NWY+DSGA+NH+T +Y+++  P+EY G+E++ VGN D L IS++G + L      +
Subjt:  PFL--------------------------------ANPETVIDPNWYVDSGASNHVTADYNSMVQPTEYGGMERVTVGNDDKLKISHVGKSCLVSDGGLV

Query:  MLENVLCVPNIAKNLVSVSKLAKDNNVYLEFHADSCLVKDIRSGKVVLKGALKDGLYRLNTV
         L+NVLCVP+I KNLVSVSKLA+DNNVY+EFH   C +KD  +G+ +L   +KDGLY L+T+
Subjt:  MLENVLCVPNIAKNLVSVSKLAKDNNVYLEFHADSCLVKDIRSGKVVLKGALKDGLYRLNTV

XP_022148963.1 uncharacterized protein LOC111017501 [Momordica charantia]; e value: 3.3e-84; % identity: 88.14
Query:  MFVQQSIGNMETSQTNISAPSSSSIATEAAINPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRKGS
        MFVQQSIGNMETSQTNISAPSSSSIATEAAINPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRKGS
Subjt:  MFVQQSIGNMETSQTNISAPSSSSIATEAAINPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRKGS

Query:  LKMTDFLRVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNPVVATIQGKRGISWPEMQAELLVFEKRNNTQSQGKNFNGDSNQGVNNNSGQG
        LKMTDFLRVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNPVVATIQGKRGISWPEMQAE       N  Q+Q      +S    NNN G G
Subjt:  LKMTDFLRVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNPVVATIQGKRGISWPEMQAELLVFEKRNNTQSQGKNFNGDSNQGVNNNSGQG

XP_022151683.1 uncharacterized protein LOC111019598 [Momordica charantia]; e value: 1.1e-71; % identity: 44.24
Query:  TNISAPSSSSIATEAAINPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRKGSLKMTDFLRVMKSHA
        TNI   +SS   +   +NP YE+W+  D+LLLGWLYNSM  +VA QVMG+  + +LW A+QELFGVQS+AE DYL+QVFQQT KGSL+M ++L++MKSHA
Subjt:  TNISAPSSSSIATEAAINPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRKGSLKMTDFLRVMKSHA

Query:  DNLGQAGSPVPTRSLISQVLLGLDEEYNPVVATIQGKRGISWPEMQAELLVFEKR-------------NNTQSQGKNF-------------NGDSNQGVN
        DNL  AGS V  R L+SQVL GLDEEYNP+V  +QGK  +SW EM AELL +EKR             N TQ+   N+             NG+++ G N
Subjt:  DNLGQAGSPVPTRSLISQVLLGLDEEYNPVVATIQGKRGISWPEMQAELLVFEKR-------------NNTQSQGKNF-------------NGDSNQGVN

Query:  NNSGQG-----------------TSYAFTATQNNNP----------FLANPETVIDPNWYVDSGASNHVTADYNSMVQPTEYGGMERVTVGNDDKLKISH
         + G G                 T +      N+ P           +  PETVIDP+WY DSGA++HVTA+ N++ Q  +Y G E V V N +KL ISH
Subjt:  NNSGQG-----------------TSYAFTATQNNNP----------FLANPETVIDPNWYVDSGASNHVTADYNSMVQPTEYGGMERVTVGNDDKLKISH

Query:  VGKSCLVSDGGLVMLENVLCVPNIAKNLVSVSKLAKDNNVYLEFHADSCLVKDIRSGKVVLKGALKDGLYRLN
        +G + + + GG + L++VL VP+IAKNL                        D  SG+ +LKG LKD LYRL+
Subjt:  VGKSCLVSDGGLVMLENVLCVPNIAKNLVSVSKLAKDNNVYLEFHADSCLVKDIRSGKVVLKGALKDGLYRLN

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A5A7UY76 Integrase, catalytic core; e value: 2.9e-65; % identity: 45.13
Query:  FVQQSIGNMETSQTNISAPSSSSIATEA-AINPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRKGS
        F+   I   ++     +   +SS +TE   +NP ++ WVT+D LLLGW+YNSMT EVA Q+MG+  A DL  AIQ+LFGVQS+ EED+LR  FQ TRKG+
Subjt:  FVQQSIGNMETSQTNISAPSSSSIATEA-AINPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRKGS

Query:  LKMTDFLRVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNPVVATIQGKRGISWPEMQAELLVFEKRNNTQSQGKN-FNGDSNQGVNNNSGQGTSYAF
         KM D+LR+MK++A+NLGQAGSP+P RSLISQVLLGLDE YNP      G                  +N T +  +N  N D+ Q     SG       
Subjt:  LKMTDFLRVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNPVVATIQGKRGISWPEMQAELLVFEKRNNTQSQGKN-FNGDSNQGVNNNSGQGTSYAF

Query:  TATQNNNPF-------LANPETVIDPNWY------------VDSGASNHVTADYNSMVQPTEYGGMERVTVGNDDKLKISHVGKSCLVSDGGLVMLENVL
          +    PF       +A  E +     Y            VDSGA+NHVT DY+++  P+EY G+E V VGN ++L+IS  G S L      + LENVL
Subjt:  TATQNNNPF-------LANPETVIDPNWY------------VDSGASNHVTADYNSMVQPTEYGGMERVTVGNDDKLKISHVGKSCLVSDGGLVMLENVL

Query:  CVPNIAKNLVSVSKLAKDNNVYLEFHADSCLVKDIRSGKVVLKGALKDGLYRLNTVGVV
         VP+I KNL+SVSKL +DNNV LEF+ D C VKD  +G+ +++G L+DGLY L   GV+
Subjt:  CVPNIAKNLVSVSKLAKDNNVYLEFHADSCLVKDIRSGKVVLKGALKDGLYRLNTVGVV

A0A5D3C373 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value: 1.0e-67; % identity: 42.27
Query:  VATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRKGSLKMTDFLRVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNPVVATIQGKRGISW
        +A Q+MG+ NA DLW A Q+LFGVQS+AEED+LRQ+FQ TRK      D+LR+MK+++D LGQAGSPVP R+ ISQ LLGLDE YNPV+A IQGK  ISW
Subjt:  VATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRKGSLKMTDFLRVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNPVVATIQGKRGISW

Query:  PEMQAELLVFEKR----------------------NNTQSQGKNFNGDSNQGVNNNSGQGTSYAFTATQN----------------------------NN
         +MQ+ELL FEKR                      N   S  + ++     G N N+ QG    F   +                             N 
Subjt:  PEMQAELLVFEKR----------------------NNTQSQGKNFNGDSNQGVNNNSGQGTSYAFTATQN----------------------------NN

Query:  PFL--------------------------------ANPETVIDPNWYVDSGASNHVTADYNSMVQPTEYGGMERVTVGNDDKLKISHVGKSCLVSDGGLV
         FL                                A  +TVI+ NWY+DSGA+NH+T +Y+++  P+EY G+E++ VGN D L IS++G + L      +
Subjt:  PFL--------------------------------ANPETVIDPNWYVDSGASNHVTADYNSMVQPTEYGGMERVTVGNDDKLKISHVGKSCLVSDGGLV

Query:  MLENVLCVPNIAKNLVSVSKLAKDNNVYLEFHADSCLVKDIRSGKVVLKGALKDGLYRLNTV
         L+NVLCVP+I KNLVSVSKLA+DNNVY+EFH   C +KD  +G+ +L   +KDGLY L+T+
Subjt:  MLENVLCVPNIAKNLVSVSKLAKDNNVYLEFHADSCLVKDIRSGKVVLKGALKDGLYRLNTV

A0A5D3E3L7 Uncharacterized protein; e value: 4.5e-71; % identity: 49.54
Query:  SSSSIATEAAINPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRKGSLKMTDFLRVMKSHADNLGQA
        SSS+      +NP YE WVT+D LLLG +YNSM P+VA Q+MG+  A DLW AIQ LFG++S+AEE +LR  FQ TR+G+ KM D+LR+MK +ADNLGQA
Subjt:  SSSSIATEAAINPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRKGSLKMTDFLRVMKSHADNLGQA

Query:  GSPVPTRSLISQVLLGLDEEYNPVVATIQGKRGISWPEMQAELLVFEK---------RNNTQSQGKNFNGDSNQGV--NNNSGQGTSYAFTATQNNNPFL
        GSPVP R LISQVLLGLDE YNPV A IQGK  ISW +MQ+ELL+FE           + T     +   + N+G   N N  Q    AF  TQ ++  L
Subjt:  GSPVPTRSLISQVLLGLDEEYNPVVATIQGKRGISWPEMQAELLVFEK---------RNNTQSQGKNFNGDSNQGV--NNNSGQGTSYAFTATQNNNPFL

Query:  ANPETVIDPNWYVDSGASNHVTADYNSMVQPTEYGGMERVTVGNDDKLKISHVGKSCLVSDGGLVMLENVLCVPNIAKNLVSVSKLAKDNNVYLEFHADS
        A PETV+D N YVDSGA+NHVT+D++++    +Y G E V VGN++KL+IS VG + L      + L+ +LCVP I KN      LAKD           
Subjt:  ANPETVIDPNWYVDSGASNHVTADYNSMVQPTEYGGMERVTVGNDDKLKISHVGKSCLVSDGGLVMLENVLCVPNIAKNLVSVSKLAKDNNVYLEFHADS

Query:  CLVKDIRSGKVVLKGALKDGLYRLNTVGV
               +G+V+LKG L DGLY L  V +
Subjt:  CLVKDIRSGKVVLKGALKDGLYRLNTVGV

A0A6J1D5J0 uncharacterized protein LOC111017501; e value: 1.6e-84; % identity: 88.14
Query:  MFVQQSIGNMETSQTNISAPSSSSIATEAAINPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRKGS
        MFVQQSIGNMETSQTNISAPSSSSIATEAAINPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRKGS
Subjt:  MFVQQSIGNMETSQTNISAPSSSSIATEAAINPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRKGS

Query:  LKMTDFLRVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNPVVATIQGKRGISWPEMQAELLVFEKRNNTQSQGKNFNGDSNQGVNNNSGQG
        LKMTDFLRVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNPVVATIQGKRGISWPEMQAE       N  Q+Q      +S    NNN G G
Subjt:  LKMTDFLRVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNPVVATIQGKRGISWPEMQAELLVFEKRNNTQSQGKNFNGDSNQGVNNNSGQG

A0A6J1DCW4 uncharacterized protein LOC111019598; e value: 5.4e-72; % identity: 44.24
Query:  TNISAPSSSSIATEAAINPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRKGSLKMTDFLRVMKSHA
        TNI   +SS   +   +NP YE+W+  D+LLLGWLYNSM  +VA QVMG+  + +LW A+QELFGVQS+AE DYL+QVFQQT KGSL+M ++L++MKSHA
Subjt:  TNISAPSSSSIATEAAINPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRKGSLKMTDFLRVMKSHA

Query:  DNLGQAGSPVPTRSLISQVLLGLDEEYNPVVATIQGKRGISWPEMQAELLVFEKR-------------NNTQSQGKNF-------------NGDSNQGVN
        DNL  AGS V  R L+SQVL GLDEEYNP+V  +QGK  +SW EM AELL +EKR             N TQ+   N+             NG+++ G N
Subjt:  DNLGQAGSPVPTRSLISQVLLGLDEEYNPVVATIQGKRGISWPEMQAELLVFEKR-------------NNTQSQGKNF-------------NGDSNQGVN

Query:  NNSGQG-----------------TSYAFTATQNNNP----------FLANPETVIDPNWYVDSGASNHVTADYNSMVQPTEYGGMERVTVGNDDKLKISH
         + G G                 T +      N+ P           +  PETVIDP+WY DSGA++HVTA+ N++ Q  +Y G E V V N +KL ISH
Subjt:  NNSGQG-----------------TSYAFTATQNNNP----------FLANPETVIDPNWYVDSGASNHVTADYNSMVQPTEYGGMERVTVGNDDKLKISH

Query:  VGKSCLVSDGGLVMLENVLCVPNIAKNLVSVSKLAKDNNVYLEFHADSCLVKDIRSGKVVLKGALKDGLYRLN
        +G + + + GG + L++VL VP+IAKNL                        D  SG+ +LKG LKD LYRL+
Subjt:  VGKSCLVSDGGLVMLENVLCVPNIAKNLVSVSKLAKDNNVYLEFHADSCLVKDIRSGKVVLKGALKDGLYRLN

SwissProt top hits (columns: hit, e value, % identity, alignment)
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1; e value: 1.6e-17; % identity: 41.59
Query:  NWYVDSGASNHVTADYNSMVQPTEYGGMERVTVGNDDKLKISHVGKSCLVSDGGLVMLENVLCVPNIAKNLVSVSKLAKDNNVYLEFHADSCLVKDIRSG
        NW +DSGA++H+T+D+N++     Y G + V V +   + ISH G + L +    + L N+L VPNI KNL+SV +L   N V +EF   S  VKD+ +G
Subjt:  NWYVDSGASNHVTADYNSMVQPTEYGGMERVTVGNDDKLKISHVGKSCLVSDGGLVMLENVLCVPNIAKNLVSVSKLAKDNNVYLEFHADSCLVKDIRSG

Query:  KVVLKGALKDGLY
          +L+G  KD LY
Subjt:  KVVLKGALKDGLY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2; e value: 3.6e-17; % identity: 38.58
Query:  NNPFLANPETVIDPNWYVDSGASNHVTADYNSMVQPTEYGGMERVTVGNDDKLKISHVGKSCLVSDGGLVMLENVLCVPNIAKNLVSVSKLAKDNNVYLE
        N+P+ AN       NW +DSGA++H+T+D+N++     Y G + V + +   + I+H G + L +    + L  VL VPNI KNL+SV +L   N V +E
Subjt:  NNPFLANPETVIDPNWYVDSGASNHVTADYNSMVQPTEYGGMERVTVGNDDKLKISHVGKSCLVSDGGLVMLENVLCVPNIAKNLVSVSKLAKDNNVYLE

Query:  FHADSCLVKDIRSGKVVLKGALKDGLY
        F   S  VKD+ +G  +L+G  KD LY
Subjt:  FHADSCLVKDIRSGKVVLKGALKDGLY

Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); e value: 3.3e-05; % identity: 24.79
Query:  SWVTTDQLLLGWLYNSMTP-EVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRKGSLKMTDFLRVMKSHADNLGQAGSPVPTRSLISQVLL
        +W   D ++   LY ++TP +     +    + D+W  I+  F     A    L    +    G +++ D+ R MK  AD+L     PV  R+L+  VL 
Subjt:  SWVTTDQLLLGWLYNSMTP-EVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRKGSLKMTDFLRVMKSHADNLGQAGSPVPTRSLISQVLL

Query:  GLDEEYNPVVATIQGKR
        GL+ +++ ++  I+ ++
Subjt:  GLDEEYNPVVATIQGKR
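In the alignments above, the unlabeled line under each Query line is the match line: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The following sketch counts these for a single alignment block. The function name `midline_stats` is hypothetical, and the example uses only the final block of the Q94HW2 alignment, so its percentage is not expected to equal the whole-alignment % identity reported for that hit.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identities and positives in one block of a BLAST-style alignment.

    Both strings must already be trimmed of the 'Query:' prefix and its
    padding so that they have the same length and line up column by column.
    """
    if len(query) != len(midline):
        raise ValueError("query and match line must have equal length")
    # A column is an identity when the match line repeats the query residue.
    identities = sum(1 for q, m in zip(query, midline) if m == q and q != "-")
    # BLAST counts identities among the positives; '+' adds the similar ones.
    positives = identities + midline.count("+")
    return {
        "length": len(query),
        "identities": identities,
        "positives": positives,
        "pct_identity": round(100.0 * identities / len(query), 2),
    }


# Final block of the Q94HW2 alignment, with the 8-character prefix removed.
stats = midline_stats("KVVLKGALKDGLY", "  +L+G  KD LY")
print(stats)
# {'length': 13, 'identities': 6, 'positives': 8, 'pct_identity': 46.15}
```

Summing identities and lengths over every block of an alignment before dividing would give a figure comparable to the reported % identity, assuming the page computes it over the aligned region.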


Sequences
CDS sequence
ATGTTTGTGCAACAGTCGATTGGTAATATGGAAACAAGCCAAACGAACATATCTGCACCATCGAGTTCCTCTATAGCAACAGAAGCAGCCATCAATCCACTATAT
GAGTCATGGGTAACTACCGACCAGCTACTTCTTGGTTGGTTGTACAACTCTATGACTCCAGAAGTTGCAACACAGGTGATGGGGTACGAAAATGCTTGTGATTTA
TGGGCTGCCATACAAGAACTCTTTGGAGTACAGTCTCAGGCGGAAGAAGATTATCTCCGTCAAGTATTTCAACAAACTCGAAAAGGTTCTCTTAAAATGACTGAT
TTTTTGCGTGTTATGAAGTCTCATGCAGACAATTTGGGTCAAGCTGGAAGCCCCGTACCCACTCGATCTTTGATTTCTCAAGTTTTGCTGGGATTAGATGAAGAG
TATAATCCTGTGGTAGCAACGATCCAAGGAAAACGAGGCATTTCGTGGCCTGAAATGCAAGCCGAATTGTTGGTATTTGAGAAGAGGAACAATACACAAAGCCAG
GGTAAAAACTTCAATGGCGACTCTAACCAGGGGGTTAACAACAACTCTGGACAAGGTACATCTTATGCCTTCACAGCAACCCAAAATAACAATCCTTTTTTGGCC
AATCCAGAAACAGTGATAGACCCGAATTGGTATGTGGATAGTGGTGCTTCAAATCATGTCACCGCCGACTACAATAGTATGGTTCAACCTACTGAATATGGAGGT
ATGGAAAGAGTTACAGTAGGTAATGATGATAAATTAAAAATATCTCATGTTGGCAAATCCTGTTTAGTTTCTGACGGTGGGTTGGTAATGCTTGAAAATGTGTTG
TGCGTACCTAACATAGCTAAAAATCTAGTTAGCGTGTCTAAACTCGCTAAAGACAATAACGTATACCTTGAATTTCATGCTGATTCTTGTCTTGTAAAGGATATA
CGTTCGGGCAAGGTGGTGCTGAAAGGGGCTCTTAAGGATGGACTTTACCGCCTCAATACTGTTGGAGTAGTCATTGGGAGTACTTCGACTCCAGTTGACTGTGGC
TTGGAGTTGGCTGCTAATAAAACTATTTGTTCTGTGTCTCTTCCCAAATCATCCAGTAGTATAAATGTTGTGGCAAAATACATAGATGACGTGTTCCGTCGCCTG
GATATGGAGGGCTTAAAGCCAGCCCCTCCCCCACTGTATTGGGCAAACACTTGTCAATTTCTGATGGAGAGCCCATGA
mRNA sequence
ATGTTTGTGCAACAGTCGATTGGTAATATGGAAACAAGCCAAACGAACATATCTGCACCATCGAGTTCCTCTATAGCAACAGAAGCAGCCATCAATCCACTATAT
GAGTCATGGGTAACTACCGACCAGCTACTTCTTGGTTGGTTGTACAACTCTATGACTCCAGAAGTTGCAACACAGGTGATGGGGTACGAAAATGCTTGTGATTTA
TGGGCTGCCATACAAGAACTCTTTGGAGTACAGTCTCAGGCGGAAGAAGATTATCTCCGTCAAGTATTTCAACAAACTCGAAAAGGTTCTCTTAAAATGACTGAT
TTTTTGCGTGTTATGAAGTCTCATGCAGACAATTTGGGTCAAGCTGGAAGCCCCGTACCCACTCGATCTTTGATTTCTCAAGTTTTGCTGGGATTAGATGAAGAG
TATAATCCTGTGGTAGCAACGATCCAAGGAAAACGAGGCATTTCGTGGCCTGAAATGCAAGCCGAATTGTTGGTATTTGAGAAGAGGAACAATACACAAAGCCAG
GGTAAAAACTTCAATGGCGACTCTAACCAGGGGGTTAACAACAACTCTGGACAAGGTACATCTTATGCCTTCACAGCAACCCAAAATAACAATCCTTTTTTGGCC
AATCCAGAAACAGTGATAGACCCGAATTGGTATGTGGATAGTGGTGCTTCAAATCATGTCACCGCCGACTACAATAGTATGGTTCAACCTACTGAATATGGAGGT
ATGGAAAGAGTTACAGTAGGTAATGATGATAAATTAAAAATATCTCATGTTGGCAAATCCTGTTTAGTTTCTGACGGTGGGTTGGTAATGCTTGAAAATGTGTTG
TGCGTACCTAACATAGCTAAAAATCTAGTTAGCGTGTCTAAACTCGCTAAAGACAATAACGTATACCTTGAATTTCATGCTGATTCTTGTCTTGTAAAGGATATA
CGTTCGGGCAAGGTGGTGCTGAAAGGGGCTCTTAAGGATGGACTTTACCGCCTCAATACTGTTGGAGTAGTCATTGGGAGTACTTCGACTCCAGTTGACTGTGGC
TTGGAGTTGGCTGCTAATAAAACTATTTGTTCTGTGTCTCTTCCCAAATCATCCAGTAGTATAAATGTTGTGGCAAAATACATAGATGACGTGTTCCGTCGCCTG
GATATGGAGGGCTTAAAGCCAGCCCCTCCCCCACTGTATTGGGCAAACACTTGTCAATTTCTGATGGAGAGCCCATGA
Protein sequence
MFVQQSIGNMETSQTNISAPSSSSIATEAAINPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRKGSLKMTD
FLRVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNPVVATIQGKRGISWPEMQAELLVFEKRNNTQSQGKNFNGDSNQGVNNNSGQGTSYAFTATQNNNPFLA
NPETVIDPNWYVDSGASNHVTADYNSMVQPTEYGGMERVTVGNDDKLKISHVGKSCLVSDGGLVMLENVLCVPNIAKNLVSVSKLAKDNNVYLEFHADSCLVKDI
RSGKVVLKGALKDGLYRLNTVGVVIGSTSTPVDCGLELAANKTICSVSLPKSSSSINVVAKYIDDVFRRLDMEGLKPAPPPLYWANTCQFLMESP