CuGenDBv2

Moc04g23970 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g23970
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr4:17327853..17335124
RNA-Seq Expression: Moc04g23970
Synteny: Moc04g23970
Gene Ontology terms: GO:0015074 - DNA integration (biological process)
                     GO:0003676 - nucleic acid binding (molecular function)
InterPro domains: NA
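The genome location above can be turned into a span length with a few lines of Python. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome browsers, but not stated on this page); the function name `parse_location` is hypothetical.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string and return (chrom, start, end, span).

    Assumes 1-based, inclusive coordinates, so span = end - start + 1.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


chrom, start, end, span = parse_location("chr4:17327853..17335124")
print(chrom, start, end, span)
```

Under that assumption the gene model spans 7272 bp on chr4.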


Homology
GenBank top hits (e value, % identity, alignment)
CAN72676.1 hypothetical protein VITISV_020406 [Vitis vinifera] (e value: 3.4e-60, % identity: 44.38)
Query:  RLWENKKKYSKSLVIIQQAVNDSVFSRIVAAKTSKQAWSILQKEFQGNLKVFVVRLQSLRRDFKTLTMKNGESVAVFLSRAMAIVSQMWSYDETITDQTI
        RL EN KK SK+L  IQQAV++S+FS+I  A T+K+AW+ L+  FQG+ KV  V+LQSLRRDF+TL MKNGESV  FLSR  AIV+QM SY E I DQTI
Subjt:  RLWENKKKYSKSLVIIQQAVNDSVFSRIVAAKTSKQAWSILQKEFQGNLKVFVVRLQSLRRDFKTLTMKNGESVAVFLSRAMAIVSQMWSYDETITDQTI

Query:  VEKVLISLTPKFDHVVATIEESKDLFVFTFDELIGSLQAHESRINRSLEKTEEKAFQ-----------GKEVAPKFK------------------EGTKF
        V KVL SLTPKFDHVVA IEESK L  ++FDEL+GSLQ+HE R++R  EK EEKAF            G+E   + +                  +G K 
Subjt:  VEKVLISLTPKFDHVVATIEESKDLFVFTFDELIGSLQAHESRINRSLEKTEEKAFQ-----------GKEVAPKFK------------------EGTKF

Query:  IFRELDKAEKMKVQLGNDKDLQVEVKG-------------------------------------------------------QEMVWVKMTQSKMFPLEV
        +F+ELD++ K+KV+LG+DK +QVE KG                                                       Q +  V+M  +K+FPLEV
Subjt:  IFRELDKAEKMKVQLGNDKDLQVEVKG-------------------------------------------------------QEMVWVKMTQSKMFPLEV

Query:  SNVGSFVLVAEGKDDSKLWHLQYGHINIKGLSLLNHRD
        S++    LV +   +S LWHL+YGH+N+KGL LL+ ++
Subjt:  SNVGSFVLVAEGKDDSKLWHLQYGHINIKGLSLLNHRD

KAA0055915.1 copia protein [Cucumis melo var. makuwa] (e value: 2.3e-56, % identity: 43.13)
Query:  RLWENKKKYSKSLVIIQQAVNDSVFSRIVAAKTSKQAWSILQKEFQGNLKVFVVRLQSLRRDFKTLTMKNGESVAVFLSRAMAIVSQMWSYDETITDQTI
        +L EN++K  K+LVI+QQAV+D+VFSRI AA TSKQAW ILQK FQG+ +V VV+LQSL+RDF+TL MKNGES+A FLSRA  I+SQM +Y ETITDQTI
Subjt:  RLWENKKKYSKSLVIIQQAVNDSVFSRIVAAKTSKQAWSILQKEFQGNLKVFVVRLQSLRRDFKTLTMKNGESVAVFLSRAMAIVSQMWSYDETITDQTI

Query:  VEKVLISLTPKFDHVVATIEESKDLFVFTFDELIGSLQAHESRINRSLEKTEEKAFQGKEVAPKFKE---------------------------------
        VEKVL SLTPKFDHVVA IEESKDL  FTF EL+GSLQAHESRIN S+EK +EKAF+ K+V PK+ +                                 
Subjt:  VEKVLISLTPKFDHVVATIEESKDLFVFTFDELIGSLQAHESRINRSLEKTEEKAFQGKEVAPKFKE---------------------------------

Query:  ------------------------------------------------------------------------------GTKFIFRELDKAEKMKVQLGND
                                                                                      G K +F+EL++ EK+KV+LGN 
Subjt:  ------------------------------------------------------------------------------GTKFIFRELDKAEKMKVQLGND

Query:  KDLQVEVK---------GQEM---VWVKMTQSKMFPLEVSNVGSFVLVAEG----KDDSKLWHL
        K+LQVE K         G  +   V     +SK  P EVSNV +F L A      K++S+LWHL
Subjt:  KDLQVEVK---------GQEM---VWVKMTQSKMFPLEVSNVGSFVLVAEG----KDDSKLWHL

RVW94747.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] (e value: 8.0e-57, % identity: 39.59)
Query:  RLWENKKKYSKSLVIIQQAVNDSVFSRIVAAKTSKQAWSILQKEFQGNLKVFVVRLQSLRRDFKTLTMKNGESVAVFLSRAMAIVSQMWSYDETITDQTI
        RL EN KK SK+L  IQQAV++S+FS+I AA T+K+AW+ L+  FQG+ KV  V+LQSLRRDF+TL MKNGESV  F SR  AIV+QM SY E I DQT+
Subjt:  RLWENKKKYSKSLVIIQQAVNDSVFSRIVAAKTSKQAWSILQKEFQGNLKVFVVRLQSLRRDFKTLTMKNGESVAVFLSRAMAIVSQMWSYDETITDQTI

Query:  VEKVLISLTPKFDHVVATIEESKDLFVFTFDELIGSLQAHESRINRSLEKTEEKAF----------------QGKEVAP---------------------
        V KVL SLTPKFDHVVA IEESKDL  ++FDEL+GSLQ+HE R++R+ EK EEKAF                 GK+VA                      
Subjt:  VEKVLISLTPKFDHVVATIEESKDLFVFTFDELIGSLQAHESRINRSLEKTEEKAF----------------QGKEVAP---------------------

Query:  -------------------------------------------KFKEGTKFIFRELDKAEKMKVQLGNDKDLQVEVKG----------------------
                                                   K K   K +F+ELD++ K+KV+LG+DK +QVE KG                      
Subjt:  -------------------------------------------KFKEGTKFIFRELDKAEKMKVQLGNDKDLQVEVKG----------------------

Query:  ---------------------------------QEMVWVKMTQSKMFPLEVSNVGSFVLVAEGKDDSKLWHLQYGHINIKGLSLLNHRD
                                         Q +V V+M  +K+FPLEVS++    LV +   +S LWHL+YGH+N+KGL LL+ ++
Subjt:  ---------------------------------QEMVWVKMTQSKMFPLEVSNVGSFVLVAEGKDDSKLWHLQYGHINIKGLSLLNHRD

TYK27735.1 putative gag-pol polyprotein, identical [Cucumis melo var. makuwa] (e value: 2.2e-67, % identity: 42.55)
Query:  RLWENKKKYSKSLVIIQQAVNDSVFSRIVAAKTSKQAWSILQKEFQGNLKVFVVRLQSLRRDFKTLTMKNGESVAVFLSRAMAIVSQMWSYDETITDQTI
        +L EN+KK SK+LVIIQQAV+DSVFSRI  A TSKQAW ILQK FQG+ +V +V+LQSLRRDF+TL MKNGES+A FLSRA  I+SQM +Y ETI DQTI
Subjt:  RLWENKKKYSKSLVIIQQAVNDSVFSRIVAAKTSKQAWSILQKEFQGNLKVFVVRLQSLRRDFKTLTMKNGESVAVFLSRAMAIVSQMWSYDETITDQTI

Query:  VEKVLISLTPKFDHVVATIEESKDLFVFTFDELIGSLQAHESRINRSLEKTEEKAFQGKEVAPKFKE---------------------------------
        VEKVL SLTPKFDHVVA IEESK+LF FTF EL+GSL+AHESRINRS+E+ EEKAFQ K+  PK+ +                                 
Subjt:  VEKVLISLTPKFDHVVATIEESKDLFVFTFDELIGSLQAHESRINRSLEKTEEKAFQGKEVAPKFKE---------------------------------

Query:  ------------------------------------------------------------------------------GTKFIFRELDKAEKMKVQLGND
                                                                                      G K +F+EL++ EK+KV+L N 
Subjt:  ------------------------------------------------------------------------------GTKFIFRELDKAEKMKVQLGND

Query:  KDLQVEVK------------------------------------------------------GQEMVWVKMTQSKMFPLEVSNVGSFVLVAEG----KDD
        K+LQVE K                                                      G+ +  VKMTQSKMFPLEVSNV SF L A      K++
Subjt:  KDLQVEVK------------------------------------------------------GQEMVWVKMTQSKMFPLEVSNVGSFVLVAEG----KDD

Query:  SKLWHLQYGHINIKGLSLLNHRD
        S+LWHL+YGH+NIKGLSLLN RD
Subjt:  SKLWHLQYGHINIKGLSLLNHRD

XP_008463459.1 PREDICTED: uncharacterized protein LOC103501626 [Cucumis melo] (e value: 2.7e-57, % identity: 74.25)
Query:  RLWENKKKYSKSLVIIQQAVNDSVFSRIVAAKTSKQAWSILQKEFQGNLKVFVVRLQSLRRDFKTLTMKNGESVAVFLSRAMAIVSQMWSYDETITDQTI
        +LWENKKK SK+LVIIQQ V+DSVFSRIVAA +SKQAW ILQK FQG+ +V +V+LQSLRRDF+TLTMKNGES+A FLSRA  I+SQM +YDE IT+QTI
Subjt:  RLWENKKKYSKSLVIIQQAVNDSVFSRIVAAKTSKQAWSILQKEFQGNLKVFVVRLQSLRRDFKTLTMKNGESVAVFLSRAMAIVSQMWSYDETITDQTI

Query:  VEKVLISLTPKFDHVVATIEESKDLFVFTFDELIGSLQAHESRINRSLEKTEEKAFQGKEVAPKFKE
        VEKVL SLT KFDHVVA IEESK+L  FTF ELIGSLQAHESRINRS+E+ +EK FQ ++V PK+ E
Subjt:  VEKVLISLTPKFDHVVATIEESKDLFVFTFDELIGSLQAHESRINRSLEKTEEKAFQGKEVAPKFKE

TrEMBL top hits (e value, % identity, alignment)
A0A0V0IV83 Putative ovule protein (Fragment) (e value: 1.3e-65, % identity: 41.65)
Query:  RLWENKKKYSKSLVIIQQAVNDSVFSRIVAAKTSKQAWSILQKEFQGNLKVFVVRLQSLRRDFKTLTMKNGESVAVFLSRAMAIVSQMWSYDETITDQTI
        RL +NKKK +K+LV IQQAV+DS+FSRI  A TSKQAWSILQK FQG+ KV VVRLQSLRRDF+TL MK+GES+A FLSRAM IVSQ+ SY E +TDQ I
Subjt:  RLWENKKKYSKSLVIIQQAVNDSVFSRIVAAKTSKQAWSILQKEFQGNLKVFVVRLQSLRRDFKTLTMKNGESVAVFLSRAMAIVSQMWSYDETITDQTI

Query:  VEKVLISLTPKFDHVVATIEESKDLFVFTFDELIGSLQAHESRINRSLEKTEEKAFQGKEVAPKFKE---------------------------------
        VEKVL SL PKFDHVVA IEESKDL VF+FDEL+GSLQAHE+R NRS+EK EEKAFQ K+   K+ +                                 
Subjt:  VEKVLISLTPKFDHVVATIEESKDLFVFTFDELIGSLQAHESRINRSLEKTEEKAFQGKEVAPKFKE---------------------------------

Query:  -------------------------------------------------------------------------GTKFIFRELDKAEKMKVQLGNDKDLQV
                                                                                 G K +FR+LD+ +K KVQLGN K++QV
Subjt:  -------------------------------------------------------------------------GTKFIFRELDKAEKMKVQLGNDKDLQV

Query:  EVKGQ------------------------------------------------------EMVWVKMTQSKMFPLEVSNVGSFVLVAEGKDDSKLWHLQYG
        E KG+                                                      + V +  T + MFPL+VSN+ +F L A  KDDSKLWHL+YG
Subjt:  EVKGQ------------------------------------------------------EMVWVKMTQSKMFPLEVSNVGSFVLVAEGKDDSKLWHLQYG

Query:  HINIKGLSLLNHR
        H+NIKGL LL  +
Subjt:  HINIKGLSLLNHR

A0A1S3CJ95 uncharacterized protein LOC103501626 (e value: 1.3e-57, % identity: 74.25)
Query:  RLWENKKKYSKSLVIIQQAVNDSVFSRIVAAKTSKQAWSILQKEFQGNLKVFVVRLQSLRRDFKTLTMKNGESVAVFLSRAMAIVSQMWSYDETITDQTI
        +LWENKKK SK+LVIIQQ V+DSVFSRIVAA +SKQAW ILQK FQG+ +V +V+LQSLRRDF+TLTMKNGES+A FLSRA  I+SQM +YDE IT+QTI
Subjt:  RLWENKKKYSKSLVIIQQAVNDSVFSRIVAAKTSKQAWSILQKEFQGNLKVFVVRLQSLRRDFKTLTMKNGESVAVFLSRAMAIVSQMWSYDETITDQTI

Query:  VEKVLISLTPKFDHVVATIEESKDLFVFTFDELIGSLQAHESRINRSLEKTEEKAFQGKEVAPKFKE
        VEKVL SLT KFDHVVA IEESK+L  FTF ELIGSLQAHESRINRS+E+ +EK FQ ++V PK+ E
Subjt:  VEKVLISLTPKFDHVVATIEESKDLFVFTFDELIGSLQAHESRINRSLEKTEEKAFQGKEVAPKFKE

A0A438IDK1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 3.8e-57, % identity: 39.59)
Query:  RLWENKKKYSKSLVIIQQAVNDSVFSRIVAAKTSKQAWSILQKEFQGNLKVFVVRLQSLRRDFKTLTMKNGESVAVFLSRAMAIVSQMWSYDETITDQTI
        RL EN KK SK+L  IQQAV++S+FS+I AA T+K+AW+ L+  FQG+ KV  V+LQSLRRDF+TL MKNGESV  F SR  AIV+QM SY E I DQT+
Subjt:  RLWENKKKYSKSLVIIQQAVNDSVFSRIVAAKTSKQAWSILQKEFQGNLKVFVVRLQSLRRDFKTLTMKNGESVAVFLSRAMAIVSQMWSYDETITDQTI

Query:  VEKVLISLTPKFDHVVATIEESKDLFVFTFDELIGSLQAHESRINRSLEKTEEKAF----------------QGKEVAP---------------------
        V KVL SLTPKFDHVVA IEESKDL  ++FDEL+GSLQ+HE R++R+ EK EEKAF                 GK+VA                      
Subjt:  VEKVLISLTPKFDHVVATIEESKDLFVFTFDELIGSLQAHESRINRSLEKTEEKAF----------------QGKEVAP---------------------

Query:  -------------------------------------------KFKEGTKFIFRELDKAEKMKVQLGNDKDLQVEVKG----------------------
                                                   K K   K +F+ELD++ K+KV+LG+DK +QVE KG                      
Subjt:  -------------------------------------------KFKEGTKFIFRELDKAEKMKVQLGNDKDLQVEVKG----------------------

Query:  ---------------------------------QEMVWVKMTQSKMFPLEVSNVGSFVLVAEGKDDSKLWHLQYGHINIKGLSLLNHRD
                                         Q +V V+M  +K+FPLEVS++    LV +   +S LWHL+YGH+N+KGL LL+ ++
Subjt:  ---------------------------------QEMVWVKMTQSKMFPLEVSNVGSFVLVAEGKDDSKLWHLQYGHINIKGLSLLNHRD

A0A5D3DWP2 Putative gag-pol polyprotein, identical (e value: 1.1e-67, % identity: 42.55)
Query:  RLWENKKKYSKSLVIIQQAVNDSVFSRIVAAKTSKQAWSILQKEFQGNLKVFVVRLQSLRRDFKTLTMKNGESVAVFLSRAMAIVSQMWSYDETITDQTI
        +L EN+KK SK+LVIIQQAV+DSVFSRI  A TSKQAW ILQK FQG+ +V +V+LQSLRRDF+TL MKNGES+A FLSRA  I+SQM +Y ETI DQTI
Subjt:  RLWENKKKYSKSLVIIQQAVNDSVFSRIVAAKTSKQAWSILQKEFQGNLKVFVVRLQSLRRDFKTLTMKNGESVAVFLSRAMAIVSQMWSYDETITDQTI

Query:  VEKVLISLTPKFDHVVATIEESKDLFVFTFDELIGSLQAHESRINRSLEKTEEKAFQGKEVAPKFKE---------------------------------
        VEKVL SLTPKFDHVVA IEESK+LF FTF EL+GSL+AHESRINRS+E+ EEKAFQ K+  PK+ +                                 
Subjt:  VEKVLISLTPKFDHVVATIEESKDLFVFTFDELIGSLQAHESRINRSLEKTEEKAFQGKEVAPKFKE---------------------------------

Query:  ------------------------------------------------------------------------------GTKFIFRELDKAEKMKVQLGND
                                                                                      G K +F+EL++ EK+KV+L N 
Subjt:  ------------------------------------------------------------------------------GTKFIFRELDKAEKMKVQLGND

Query:  KDLQVEVK------------------------------------------------------GQEMVWVKMTQSKMFPLEVSNVGSFVLVAEG----KDD
        K+LQVE K                                                      G+ +  VKMTQSKMFPLEVSNV SF L A      K++
Subjt:  KDLQVEVK------------------------------------------------------GQEMVWVKMTQSKMFPLEVSNVGSFVLVAEG----KDD

Query:  SKLWHLQYGHINIKGLSLLNHRD
        S+LWHL+YGH+NIKGLSLLN RD
Subjt:  SKLWHLQYGHINIKGLSLLNHRD

A5BT67 Integrase catalytic domain-containing protein (e value: 1.7e-60, % identity: 44.38)
Query:  RLWENKKKYSKSLVIIQQAVNDSVFSRIVAAKTSKQAWSILQKEFQGNLKVFVVRLQSLRRDFKTLTMKNGESVAVFLSRAMAIVSQMWSYDETITDQTI
        RL EN KK SK+L  IQQAV++S+FS+I  A T+K+AW+ L+  FQG+ KV  V+LQSLRRDF+TL MKNGESV  FLSR  AIV+QM SY E I DQTI
Subjt:  RLWENKKKYSKSLVIIQQAVNDSVFSRIVAAKTSKQAWSILQKEFQGNLKVFVVRLQSLRRDFKTLTMKNGESVAVFLSRAMAIVSQMWSYDETITDQTI

Query:  VEKVLISLTPKFDHVVATIEESKDLFVFTFDELIGSLQAHESRINRSLEKTEEKAFQ-----------GKEVAPKFK------------------EGTKF
        V KVL SLTPKFDHVVA IEESK L  ++FDEL+GSLQ+HE R++R  EK EEKAF            G+E   + +                  +G K 
Subjt:  VEKVLISLTPKFDHVVATIEESKDLFVFTFDELIGSLQAHESRINRSLEKTEEKAFQ-----------GKEVAPKFK------------------EGTKF

Query:  IFRELDKAEKMKVQLGNDKDLQVEVKG-------------------------------------------------------QEMVWVKMTQSKMFPLEV
        +F+ELD++ K+KV+LG+DK +QVE KG                                                       Q +  V+M  +K+FPLEV
Subjt:  IFRELDKAEKMKVQLGNDKDLQVEVKG-------------------------------------------------------QEMVWVKMTQSKMFPLEV

Query:  SNVGSFVLVAEGKDDSKLWHLQYGHINIKGLSLLNHRD
        S++    LV +   +S LWHL+YGH+N+KGL LL+ ++
Subjt:  SNVGSFVLVAEGKDDSKLWHLQYGHINIKGLSLLNHRD

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT3G21000.1 Gag-Pol-related retrotransposon family protein (e value: 4.7e-07, % identity: 25.26)
Query:  KYSKSLVIIQQAVNDSVFSRIVAAKTSKQAWSILQKEFQGNLKVFVVRLQS-----LRRDFKTLTMKNGESVAVFLSRAMAIVSQMWSYDETITDQTIVE
        K +K+L I+Q ++ DSVF + ++A ++K  W +L+K   GN +  + RL+      L +  + L M + ES + +L +A+ I+ ++       +D  I +
Subjt:  KYSKSLVIIQQAVNDSVFSRIVAAKTSKQAWSILQKEFQGNLKVFVVRLQS-----LRRDFKTLTMKNGESVAVFLSRAMAIVSQMWSYDETITDQTIVE

Query:  KVLISLTPKFDHVVATIEESKDLFVFTFDELIGSL--QAHESRINRSLEKTEEKAF---QGKEVAPKFKEGTKFIFRELDKAEKMKVQLGNDKD
         V  +L+  FD + + +EE  D+   T   L+     + HES        TEE  F   +   +  K ++     ++     E  K ++  DK+
Subjt:  KVLISLTPKFDHVVATIEESKDLFVFTFDELIGSL--QAHESRINRSLEKTEEKAF---QGKEVAPKFKEGTKFIFRELDKAEKMKVQLGNDKD


Sequences
CDS sequence
ATGCAATTGCCTACCACCCGCACGCCTCACACGCTTGTTGCCCCAGCTGGGCACACGTCCTCACCAACCCTGAGCGCACCTCCAACCGCACGCCTTTACACATGC
CATGGTGCACGCCCCTACCGCACACATGGAATGCGCCTTATCTACCACATGTCCATGCTTGCCCCTATGCCCCTTACCATGCACCCATTTCCATTGTCTGTTCCC
AACAAATTTGGTATCAGAGTAAGATTGTGGGAGAACAAGAAGAAGTATTCCAAGTCATTGGTGATCATTCAACAGGCAGTCAATGACAGTGTCTTCTCGCGAATC
GTAGCGGCAAAAACGTCAAAGCAGGCATGGTCTATTTTGCAGAAGGAGTTTCAAGGCAATTTGAAAGTCTTCGTGGTGAGATTACAATCACTCAGGCGTGACTTT
AAGACCTTGACGATGAAGAATGGAGAATCGGTCGCTGTTTTTTTGTCAAGAGCAATGGCAATAGTCAGTCAAATGTGGTCCTACGACGAGACGATTACCGACCAG
ACCATTGTTGAAAAGGTACTTATAAGTTTGACTCCAAAATTTGATCATGTAGTAGCTACCATAGAAGAATCAAAGGATTTGTTTGTTTTCACATTTGATGAACTA
ATCGGGTCTCTTCAAGCTCATGAGTCGAGAATTAATAGATCGCTCGAAAAGACCGAAGAAAAAGCATTTCAGGGGAAAGAGGTAGCCCCCAAATTCAAAGAAGGC
ACAAAATTCATCTTCAGAGAACTTGATAAGGCAGAAAAGATGAAGGTACAACTTGGAAACGACAAGGATTTACAAGTTGAAGTCAAAGGTCAAGAGATGGTCTGG
GTTAAGATGACCCAAAGCAAAATGTTCCCATTGGAAGTCTCAAATGTTGGAAGTTTTGTTCTTGTTGCTGAAGGAAAAGATGACTCAAAATTGTGGCACTTGCAA
TATGGGCATATCAACATCAAGGGTTTGTCATTACTAAATCATAGAGATCCTGCTCGCATATTGTTTGGGGAAGGGAATACCCTGGACACGTGGAGGACCCCTATT
GGTGCTCCAAGATGGCTCATGAGTCATGCCCAGCTCGGATGGCCGAGATGGCCGCCACCAACGCCGACAAATGCTAAGTATGAGGGCCGAGGTGAACTTGGCCCA
GGTCTGCCCAAGCGTTCAGGTCAGTCCGGAGGCCGGGTTCGAGCTGCAATCTGGAACACACTGTTGTGCATATCCTTGCATAAACATTTGGCGCCGTCTGTGGGA
ACGACAATCTAA
mRNA sequence
ATGCAATTGCCTACCACCCGCACGCCTCACACGCTTGTTGCCCCAGCTGGGCACACGTCCTCACCAACCCTGAGCGCACCTCCAACCGCACGCCTTTACACATGC
CATGGTGCACGCCCCTACCGCACACATGGAATGCGCCTTATCTACCACATGTCCATGCTTGCCCCTATGCCCCTTACCATGCACCCATTTCCATTGTCTGTTCCC
AACAAATTTGGTATCAGAGTAAGATTGTGGGAGAACAAGAAGAAGTATTCCAAGTCATTGGTGATCATTCAACAGGCAGTCAATGACAGTGTCTTCTCGCGAATC
GTAGCGGCAAAAACGTCAAAGCAGGCATGGTCTATTTTGCAGAAGGAGTTTCAAGGCAATTTGAAAGTCTTCGTGGTGAGATTACAATCACTCAGGCGTGACTTT
AAGACCTTGACGATGAAGAATGGAGAATCGGTCGCTGTTTTTTTGTCAAGAGCAATGGCAATAGTCAGTCAAATGTGGTCCTACGACGAGACGATTACCGACCAG
ACCATTGTTGAAAAGGTACTTATAAGTTTGACTCCAAAATTTGATCATGTAGTAGCTACCATAGAAGAATCAAAGGATTTGTTTGTTTTCACATTTGATGAACTA
ATCGGGTCTCTTCAAGCTCATGAGTCGAGAATTAATAGATCGCTCGAAAAGACCGAAGAAAAAGCATTTCAGGGGAAAGAGGTAGCCCCCAAATTCAAAGAAGGC
ACAAAATTCATCTTCAGAGAACTTGATAAGGCAGAAAAGATGAAGGTACAACTTGGAAACGACAAGGATTTACAAGTTGAAGTCAAAGGTCAAGAGATGGTCTGG
GTTAAGATGACCCAAAGCAAAATGTTCCCATTGGAAGTCTCAAATGTTGGAAGTTTTGTTCTTGTTGCTGAAGGAAAAGATGACTCAAAATTGTGGCACTTGCAA
TATGGGCATATCAACATCAAGGGTTTGTCATTACTAAATCATAGAGATCCTGCTCGCATATTGTTTGGGGAAGGGAATACCCTGGACACGTGGAGGACCCCTATT
GGTGCTCCAAGATGGCTCATGAGTCATGCCCAGCTCGGATGGCCGAGATGGCCGCCACCAACGCCGACAAATGCTAAGTATGAGGGCCGAGGTGAACTTGGCCCA
GGTCTGCCCAAGCGTTCAGGTCAGTCCGGAGGCCGGGTTCGAGCTGCAATCTGGAACACACTGTTGTGCATATCCTTGCATAAACATTTGGCGCCGTCTGTGGGA
ACGACAATCTAA
Protein sequence
MQLPTTRTPHTLVAPAGHTSSPTLSAPPTARLYTCHGARPYRTHGMRLIYHMSMLAPMPLTMHPFPLSVPNKFGIRVRLWENKKKYSKSLVIIQQAVNDSVFSRI
VAAKTSKQAWSILQKEFQGNLKVFVVRLQSLRRDFKTLTMKNGESVAVFLSRAMAIVSQMWSYDETITDQTIVEKVLISLTPKFDHVVATIEESKDLFVFTFDEL
IGSLQAHESRINRSLEKTEEKAFQGKEVAPKFKEGTKFIFRELDKAEKMKVQLGNDKDLQVEVKGQEMVWVKMTQSKMFPLEVSNVGSFVLVAEGKDDSKLWHLQ
YGHINIKGLSLLNHRDPARILFGEGNTLDTWRTPIGAPRWLMSHAQLGWPRWPPPTPTNAKYEGRGELGPGLPKRSGQSGGRVRAAIWNTLLCISLHKHLAPSVG
TTI
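
The relationship between the CDS and protein sequences listed above can be spot-checked by translating codons. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) applies to this gene; `CODON_TABLE` and `translate` are illustrative names, and only short fragments of the CDS are translated here rather than the full record.

```python
from itertools import product

# Standard genetic code laid out in TCAG order (assumed: NCBI translation table 1).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a nucleotide string codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First five codons and last four codons of the CDS shown above.
print(translate("ATGCAATTGCCTACC"))
print(translate("ACGACAATCTAA"))
```

The two fragments correspond to the start (MQLPT) and the end (TTI followed by a stop codon) of the protein sequence. The same function can be applied to the full CDS once its lines are joined into a single string.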