; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g00310 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g00310
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationchr6:217947..218849
RNA-Seq ExpressionMoc06g00310
SyntenyMoc06g00310
Gene Ontology termsNA
InterPro domainsIPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039950.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.3e-4233.57Show/hide
Query:  RNFFWKGSSNKHGQNLVRWDKITHPIEDGGLDISKIKEKNKALLAKWIWRYHTEELALWRQVIEAKYGPLDRHKSSLHHSLASAHGPWKSILKQKQHVLD
        RNF W G+SN H  +L+RW++I  P E GGL I  +   N ALL KW+W++ TE+  LW+++I +KY          H   +S + PWK++ +       
Subjt:  RNFFWKGSSNKHGQNLVRWDKITHPIEDGGLDISKIKEKNKALLAKWIWRYHTEELALWRQVIEAKYGPLDRHKSSLHHSLASAHGPWKSILKQKQHVLD

Query:  RSCFKIGDGSKVLFWKDGWLAPSPLCTVFPLLFALHSCKDALVKEMWSITTKSWDLCFRRNLKDAEIHECLELHLLLSPIHINRGCDRFVWKLKPSGLF-
           +K+ DG  + FW D W   +PL    P LFAL + K   VKE W+ ++  W L   R L+D E +    +   L     NRG  + +W L  + +F 
Subjt:  RSCFKIGDGSKVLFWKDGWLAPSPLCTVFPLLFALHSCKDALVKEMWSITTKSWDLCFRRNLKDAEIHECLELHLLLSPIHINRGCDRFVWKLKPSGLF-

Query:  --SVNSLLQDLQISQTTHEETMLYKSIWKDHYPKRIKVFLWEINKNALNTHDKLQRSCSICVFHPNGVCFVKNKKNPWATFLF
          SV   + +  IS        LYK++WK  +PK+ K F+W +    +NT D+LQ+        PN  C++ NK       LF
Subjt:  --SVNSLLQDLQISQTTHEETMLYKSIWKDHYPKRIKVFLWEINKNALNTHDKLQRSCSICVFHPNGVCFVKNKKNPWATFLF

KAA0041397.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]9.3e-4433.33Show/hide
Query:  RNFFWKGSSNKHGQNLVRWDKITHPIEDGGLDISKIKEKNKALLAKWIWRYHTEELALWRQVIEAKYGPLDRHKSSLHHSLASAHGPWKSILKQKQHVLD
        RNF W G+SN H  +L+RW+++  P E GGL I  +   N ALL KW+W++ TE+  LW+++I +KY      +       +S + PWK++         
Subjt:  RNFFWKGSSNKHGQNLVRWDKITHPIEDGGLDISKIKEKNKALLAKWIWRYHTEELALWRQVIEAKYGPLDRHKSSLHHSLASAHGPWKSILKQKQHVLD

Query:  RSCFKIGDGSKVLFWKDGWLAPSPLCTVFPLLFALHSCKDALVKEMWSITTKSWDLCFRRNLKDAEIHECLELHLLLSPIHINRGCDRFVWKLKPSGLFS
           +K+ DG  + FW D W   SPL  V P LFAL + K   VK++W+ + K W++   R L+D E +    +   L     +RG  + +WKL  + +F 
Subjt:  RSCFKIGDGSKVLFWKDGWLAPSPLCTVFPLLFALHSCKDALVKEMWSITTKSWDLCFRRNLKDAEIHECLELHLLLSPIHINRGCDRFVWKLKPSGLFS

Query:  VNSLLQDLQ--ISQTTHEETMLYKSIWKDHYPKRIKVFLWEINKNALNTHDKLQRSCSICVFHPNGVCFVKNKKNPWATFLF
          S+ +DL    +  T+    LYK++WK  +PK+ K F+W +    +NT D+LQ+        PN  C++ NK       LF
Subjt:  VNSLLQDLQ--ISQTTHEETMLYKSIWKDHYPKRIKVFLWEINKNALNTHDKLQRSCSICVFHPNGVCFVKNKKNPWATFLF

KAA0044556.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]6.1e-4332.98Show/hide
Query:  RNFFWKGSSNKHGQNLVRWDKITHPIEDGGLDISKIKEKNKALLAKWIWRYHTEELALWRQVIEAKYGPLDRHKSSLHHSLASAHGPWKSILKQKQHVLD
        RNF W G+SN H  +L+RW+++  P E GGL I  +   N ALL KW+W++ TE+  LW+++I +KY      +       +S + PWK++         
Subjt:  RNFFWKGSSNKHGQNLVRWDKITHPIEDGGLDISKIKEKNKALLAKWIWRYHTEELALWRQVIEAKYGPLDRHKSSLHHSLASAHGPWKSILKQKQHVLD

Query:  RSCFKIGDGSKVLFWKDGWLAPSPLCTVFPLLFALHSCKDALVKEMWSITTKSWDLCFRRNLKDAEIHECLELHLLLSPIHINRGCDRFVWKLKPSGLFS
           +K+ DG  + FW D W   SPL    P LFAL + K   VK++W+ + K W++   R L+D E +    +   L     +RG  + +WKL  + +F 
Subjt:  RSCFKIGDGSKVLFWKDGWLAPSPLCTVFPLLFALHSCKDALVKEMWSITTKSWDLCFRRNLKDAEIHECLELHLLLSPIHINRGCDRFVWKLKPSGLFS

Query:  VNSLLQDLQ--ISQTTHEETMLYKSIWKDHYPKRIKVFLWEINKNALNTHDKLQRSCSICVFHPNGVCFVKNKKNPWATFLF
          S+ +DL    +  T+    LYK++WK  +PK+ K F+W +    +NT D+LQ+        PN  C++ NK       LF
Subjt:  VNSLLQDLQ--ISQTTHEETMLYKSIWKDHYPKRIKVFLWEINKNALNTHDKLQRSCSICVFHPNGVCFVKNKKNPWATFLF

KAA0056839.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]2.8e-4035.8Show/hide
Query:  RNFFWKGSSNKHGQNLVRWDKITHPIEDGGLDISKIKEKNKALLAKWIWRYHTEELALWRQVIEAKYGPLDRHKSSLHHSLASAHGPWKSILKQKQHVLD
        R+F W GS +K   +L+ W+  T P E GGL ISK+K+ N+ALL KW+WRYH E  +LW++ I+AKY    +    +    +SA+ PW +I K K     
Subjt:  RNFFWKGSSNKHGQNLVRWDKITHPIEDGGLDISKIKEKNKALLAKWIWRYHTEELALWRQVIEAKYGPLDRHKSSLHHSLASAHGPWKSILKQKQHVLD

Query:  RSCFKIGDGSKVLFWKDGWLAPSPLCTVFPLLFALHSCKDALVKEMWSITTKSWDLCFRRNLKDAEIHECLELHLLLSPIHINRGCDRFVWKLKPSGLFS
        +  +   DGS + FW   W    PL   FP L+AL + + A VKE+W   +  W++  RR L + E      + + L  IH NRG  +  W    S  ++
Subjt:  RSCFKIGDGSKVLFWKDGWLAPSPLCTVFPLLFALHSCKDALVKEMWSITTKSWDLCFRRNLKDAEIHECLELHLLLSPIHINRGCDRFVWKLKPSGLFS

Query:  VNS----LLQDLQISQTTHEETMLYKSIWKDHYPKRIKVFLWEINKNALNTHDKLQR
        V S      ++  I + T+ E  L K +W+ H P++ K F+W +    LNT DK+Q+
Subjt:  VNS----LLQDLQISQTTHEETMLYKSIWKDHYPKRIKVFLWEINKNALNTHDKLQR

TQD93576.1 hypothetical protein C1H46_020784 [Malus baccata]3.7e-4032.97Show/hide
Query:  MRNFFWKGSSNKHGQNLVRWDKITHPIEDGGLDISKIKEKNKALLAKWIWRYHTEELALWRQVIEAKYGPLDRHKSSLHHSLASAHGPWKSILKQKQHVL
        M+ F W+G       +LV+W+ +    E+GGL +  ++ +N+ALLAKW+WR+  E  +LW +VI +KYG  +   ++      S+  PWK I    Q  L
Subjt:  MRNFFWKGSSNKHGQNLVRWDKITHPIEDGGLDISKIKEKNKALLAKWIWRYHTEELALWRQVIEAKYGPLDRHKSSLHHSLASAHGPWKSILKQKQHVL

Query:  DRSCFKIGDGSKVLFWKDGWLAPSPLCTVFPLLFALHSCKDALVKEM--WSITTKSWDLCFRRNLKDAEIHECLELHLLLSPIHINRG-CDRFVWKLKPS
            F++G+G +V FW+DGWL   PL   FP LF L    +  +     +S  + SW+  FRRNL +AEI E   L   +  + +++   D   WKL+ S
Subjt:  DRSCFKIGDGSKVLFWKDGWLAPSPLCTVFPLLFALHSCKDALVKEM--WSITTKSWDLCFRRNLKDAEIHECLELHLLLSPIHINRG-CDRFVWKLKPS

Query:  GLFSVNSLLQDLQISQTTHEETMLYKSIWKDHYPKRIKVFLWEINKNALNTHDKLQRSCSICVFHPNGVCFVKNKK
         LF+  S    L  +    +    Y  IWK   P ++K+ +W + K  LNT D++QR        P+     K K+
Subjt:  GLFSVNSLLQDLQISQTTHEETMLYKSIWKDHYPKRIKVFLWEINKNALNTHDKLQRSCSICVFHPNGVCFVKNKK

TrEMBL top hitse value%identityAlignment
A0A5A7T9I7 LINE-1 retrotransposable element ORF2 protein6.5e-4333.57Show/hide
Query:  RNFFWKGSSNKHGQNLVRWDKITHPIEDGGLDISKIKEKNKALLAKWIWRYHTEELALWRQVIEAKYGPLDRHKSSLHHSLASAHGPWKSILKQKQHVLD
        RNF W G+SN H  +L+RW++I  P E GGL I  +   N ALL KW+W++ TE+  LW+++I +KY          H   +S + PWK++ +       
Subjt:  RNFFWKGSSNKHGQNLVRWDKITHPIEDGGLDISKIKEKNKALLAKWIWRYHTEELALWRQVIEAKYGPLDRHKSSLHHSLASAHGPWKSILKQKQHVLD

Query:  RSCFKIGDGSKVLFWKDGWLAPSPLCTVFPLLFALHSCKDALVKEMWSITTKSWDLCFRRNLKDAEIHECLELHLLLSPIHINRGCDRFVWKLKPSGLF-
           +K+ DG  + FW D W   +PL    P LFAL + K   VKE W+ ++  W L   R L+D E +    +   L     NRG  + +W L  + +F 
Subjt:  RSCFKIGDGSKVLFWKDGWLAPSPLCTVFPLLFALHSCKDALVKEMWSITTKSWDLCFRRNLKDAEIHECLELHLLLSPIHINRGCDRFVWKLKPSGLF-

Query:  --SVNSLLQDLQISQTTHEETMLYKSIWKDHYPKRIKVFLWEINKNALNTHDKLQRSCSICVFHPNGVCFVKNKKNPWATFLF
          SV   + +  IS        LYK++WK  +PK+ K F+W +    +NT D+LQ+        PN  C++ NK       LF
Subjt:  --SVNSLLQDLQISQTTHEETMLYKSIWKDHYPKRIKVFLWEINKNALNTHDKLQRSCSICVFHPNGVCFVKNKKNPWATFLF

A0A5A7TIB8 LINE-1 retrotransposable element ORF2 protein4.5e-4433.33Show/hide
Query:  RNFFWKGSSNKHGQNLVRWDKITHPIEDGGLDISKIKEKNKALLAKWIWRYHTEELALWRQVIEAKYGPLDRHKSSLHHSLASAHGPWKSILKQKQHVLD
        RNF W G+SN H  +L+RW+++  P E GGL I  +   N ALL KW+W++ TE+  LW+++I +KY      +       +S + PWK++         
Subjt:  RNFFWKGSSNKHGQNLVRWDKITHPIEDGGLDISKIKEKNKALLAKWIWRYHTEELALWRQVIEAKYGPLDRHKSSLHHSLASAHGPWKSILKQKQHVLD

Query:  RSCFKIGDGSKVLFWKDGWLAPSPLCTVFPLLFALHSCKDALVKEMWSITTKSWDLCFRRNLKDAEIHECLELHLLLSPIHINRGCDRFVWKLKPSGLFS
           +K+ DG  + FW D W   SPL  V P LFAL + K   VK++W+ + K W++   R L+D E +    +   L     +RG  + +WKL  + +F 
Subjt:  RSCFKIGDGSKVLFWKDGWLAPSPLCTVFPLLFALHSCKDALVKEMWSITTKSWDLCFRRNLKDAEIHECLELHLLLSPIHINRGCDRFVWKLKPSGLFS

Query:  VNSLLQDLQ--ISQTTHEETMLYKSIWKDHYPKRIKVFLWEINKNALNTHDKLQRSCSICVFHPNGVCFVKNKKNPWATFLF
          S+ +DL    +  T+    LYK++WK  +PK+ K F+W +    +NT D+LQ+        PN  C++ NK       LF
Subjt:  VNSLLQDLQ--ISQTTHEETMLYKSIWKDHYPKRIKVFLWEINKNALNTHDKLQRSCSICVFHPNGVCFVKNKKNPWATFLF

A0A5A7TR15 LINE-1 retrotransposable element ORF2 protein2.9e-4332.98Show/hide
Query:  RNFFWKGSSNKHGQNLVRWDKITHPIEDGGLDISKIKEKNKALLAKWIWRYHTEELALWRQVIEAKYGPLDRHKSSLHHSLASAHGPWKSILKQKQHVLD
        RNF W G+SN H  +L+RW+++  P E GGL I  +   N ALL KW+W++ TE+  LW+++I +KY      +       +S + PWK++         
Subjt:  RNFFWKGSSNKHGQNLVRWDKITHPIEDGGLDISKIKEKNKALLAKWIWRYHTEELALWRQVIEAKYGPLDRHKSSLHHSLASAHGPWKSILKQKQHVLD

Query:  RSCFKIGDGSKVLFWKDGWLAPSPLCTVFPLLFALHSCKDALVKEMWSITTKSWDLCFRRNLKDAEIHECLELHLLLSPIHINRGCDRFVWKLKPSGLFS
           +K+ DG  + FW D W   SPL    P LFAL + K   VK++W+ + K W++   R L+D E +    +   L     +RG  + +WKL  + +F 
Subjt:  RSCFKIGDGSKVLFWKDGWLAPSPLCTVFPLLFALHSCKDALVKEMWSITTKSWDLCFRRNLKDAEIHECLELHLLLSPIHINRGCDRFVWKLKPSGLFS

Query:  VNSLLQDLQ--ISQTTHEETMLYKSIWKDHYPKRIKVFLWEINKNALNTHDKLQRSCSICVFHPNGVCFVKNKKNPWATFLF
          S+ +DL    +  T+    LYK++WK  +PK+ K F+W +    +NT D+LQ+        PN  C++ NK       LF
Subjt:  VNSLLQDLQ--ISQTTHEETMLYKSIWKDHYPKRIKVFLWEINKNALNTHDKLQRSCSICVFHPNGVCFVKNKKNPWATFLF

A0A5A7UTI6 LINE-1 retrotransposable element ORF2 protein1.4e-4035.8Show/hide
Query:  RNFFWKGSSNKHGQNLVRWDKITHPIEDGGLDISKIKEKNKALLAKWIWRYHTEELALWRQVIEAKYGPLDRHKSSLHHSLASAHGPWKSILKQKQHVLD
        R+F W GS +K   +L+ W+  T P E GGL ISK+K+ N+ALL KW+WRYH E  +LW++ I+AKY    +    +    +SA+ PW +I K K     
Subjt:  RNFFWKGSSNKHGQNLVRWDKITHPIEDGGLDISKIKEKNKALLAKWIWRYHTEELALWRQVIEAKYGPLDRHKSSLHHSLASAHGPWKSILKQKQHVLD

Query:  RSCFKIGDGSKVLFWKDGWLAPSPLCTVFPLLFALHSCKDALVKEMWSITTKSWDLCFRRNLKDAEIHECLELHLLLSPIHINRGCDRFVWKLKPSGLFS
        +  +   DGS + FW   W    PL   FP L+AL + + A VKE+W   +  W++  RR L + E      + + L  IH NRG  +  W    S  ++
Subjt:  RSCFKIGDGSKVLFWKDGWLAPSPLCTVFPLLFALHSCKDALVKEMWSITTKSWDLCFRRNLKDAEIHECLELHLLLSPIHINRGCDRFVWKLKPSGLFS

Query:  VNS----LLQDLQISQTTHEETMLYKSIWKDHYPKRIKVFLWEINKNALNTHDKLQR
        V S      ++  I + T+ E  L K +W+ H P++ K F+W +    LNT DK+Q+
Subjt:  VNS----LLQDLQISQTTHEETMLYKSIWKDHYPKRIKVFLWEINKNALNTHDKLQR

A0A803P465 Uncharacterized protein1.0e-4033.71Show/hide
Query:  MRNFFWKGSSNKHGQNLVRWDKITHPIEDGGLDISKIKEKNKALLAKWIWRYHTEELALWRQVIEAKYGPLDRHKSSLHHSLASAHGPWKSILKQKQHVL
        MR+F W+GS +  G++LV WD++  P  +GGL I +++ +NK+LL KW+WR+  E+ +LW +V+ ++YG  D    S   S  S  GPW+ I       L
Subjt:  MRNFFWKGSSNKHGQNLVRWDKITHPIEDGGLDISKIKEKNKALLAKWIWRYHTEELALWRQVIEAKYGPLDRHKSSLHHSLASAHGPWKSILKQKQHVL

Query:  DRSCFKIGDGSKVLFWKDGWLAPSPLCTVFPLLFALHSCKDALVKEM---------WSITTKSWDLCFRRNLKDAEIHECLELHLLLSPIHI-NRGCDRF
            FK+G G ++ FW+D W+   PL + FP L  +   ++  +KE+         W    +SW+  FRRNL D E+   + L   +  + + +   D  
Subjt:  DRSCFKIGDGSKVLFWKDGWLAPSPLCTVFPLLFALHSCKDALVKEM---------WSITTKSWDLCFRRNLKDAEIHECLELHLLLSPIHI-NRGCDRF

Query:  VWKLKPSGLFSVNSLLQDLQISQTTHEETMLYKSIWKDHYPKRIKVFLWEINKNALNTHDKLQR
        +WK  PSG+FS  S      +S  +       KS+WK   P ++KVF W +    +N HDK+Q+
Subjt:  VWKLKPSGLFSVNSLLQDLQISQTTHEETMLYKSIWKDHYPKRIKVFLWEINKNALNTHDKLQR

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657501.8e-2628.04Show/hide
Query:  RNFFWKGSSNKHGQNLVRWDKITHPIEDGGLDISKIKEKNKALLAKWIWRYHTEELALWRQVIEAKYGPLDRHKSSLHHSLASAHGPWKSILKQKQHVLD
        R F W  ++ K  Q+LV+W K+  P ++GGL +   K  N+AL++K  WR   E+ +LW  V++ KY   +   S       S    W+SI    + V+ 
Subjt:  RNFFWKGSSNKHGQNLVRWDKITHPIEDGGLDISKIKEKNKALLAKWIWRYHTEELALWRQVIEAKYGPLDRHKSSLHHSLASAHGPWKSILKQKQHVLD

Query:  RSCFKI-GDGSKVLFWKDGWLAPSPLCTV----FPLLFALHSCKDALVKEMWSITTKSWDLCFRRNLKDAEIHECLELHLLLSPIHINRGCDRFVWKLKP
             I GDG ++ FW D W++  PL  +     P       C   + K++W I  + WD  F +       +  LEL  ++  + +    DR  WK   
Subjt:  RSCFKI-GDGSKVLFWKDGWLAPSPLCTV----FPLLFALHSCKDALVKEMWSITTKSWDLCFRRNLKDAEIHECLELHLLLSPIHINRGCDRFVWKLKP

Query:  SGLFSVNSLLQDLQISQTTHEE-TMLYKSIWKDHYPKRIKVFLWEINKNALNTHDKLQRS-------CSIC
         G FSV S  + L + +         +  +WK   P+R+K FLW +   A+ T ++  R        C +C
Subjt:  SGLFSVNSLLQDLQISQTTHEE-TMLYKSIWKDHYPKRIKVFLWEINKNALNTHDKLQRS-------CSIC

Arabidopsis top hitse value%identityAlignment
AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.3e-0620.6Show/hide
Query:  NFFWKGSSNKHGQNLVRWDKITHPIEDGGLDISKIKEKNKALLAKWIWRYHTEELALWRQVIEAKYGPLDRHKSSLHHSLASAHGPWKSILKQKQHVLDR
        +F W G      +  V W  +  P ++GGL I  +KE NK               + W                S+  +       WK ILK +      
Subjt:  NFFWKGSSNKHGQNLVRWDKITHPIEDGGLDISKIKEKNKALLAKWIWRYHTEELALWRQVIEAKYGPLDRHKSSLHHSLASAHGPWKSILKQKQHVLDR

Query:  SCFKIGDGSKVLFWKDGWLAPSPLCTVFPLLFALHSCKDALVKEMWSITTKSWDLCFRRNLKDAEIHECLELHLLLSPIH---INRGCDRFVWKLKPSGL
            I +GS   FW D W     L  V         C D  +    S+     +   RR+  D      L +  +++ +    +  G D   WK      
Subjt:  SCFKIGDGSKVLFWKDGWLAPSPLCTVFPLLFALHSCKDALVKEMWSITTKSWDLCFRRNLKDAEIHECLELHLLLSPIH---INRGCDRFVWKLKPSGL

Query:  FSVNSLLQDLQISQTTHEETMLYKSIWKDHYPKRIKVFLWEINKNALNTHDKL-------QRSCSIC
            +  +    ++    +   YK +W  H   +  V  W   KN L T D++         SC +C
Subjt:  FSVNSLLQDLQISQTTHEETMLYKSIWKDHYPKRIKVFLWEINKNALNTHDKL-------QRSCSIC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGAATTTTTTTTGGAAAGGGAGTAGCAACAAACATGGGCAAAACCTGGTGCGATGGGATAAGATCACTCATCCCATTGAGGATGGAGGCCTCGACATCTCC
AAAATCAAGGAAAAGAATAAAGCTCTATTAGCAAAATGGATCTGGAGGTATCATACAGAAGAATTGGCCCTTTGGAGGCAAGTCATTGAAGCCAAATATGGGCCC
CTCGATAGGCACAAAAGCTCCCTTCACCATTCTCTTGCTTCTGCCCATGGTCCTTGGAAATCTATATTAAAACAGAAGCAACATGTTCTGGACAGAAGCTGCTTC
AAGATAGGCGATGGATCTAAAGTTCTCTTTTGGAAAGATGGATGGTTGGCCCCCAGTCCTCTATGTACTGTCTTCCCCCTCCTATTTGCTTTACACTCCTGCAAA
GATGCCTTGGTGAAGGAGATGTGGTCGATAACTACAAAATCATGGGACCTCTGTTTTAGACGGAATCTCAAAGACGCAGAAATCCATGAATGCCTAGAGCTCCAT
CTCCTGCTGTCTCCCATCCACATCAATCGGGGATGTGATAGATTTGTCTGGAAATTGAAACCATCTGGCCTTTTCTCCGTCAACTCCCTTCTACAAGATCTACAA
ATCTCACAAACTACTCATGAAGAGACTATGCTCTACAAATCGATTTGGAAGGATCATTACCCAAAAAGGATCAAAGTCTTTTTATGGGAGATCAACAAGAATGCC
CTTAATACCCATGACAAGCTTCAAAGAAGTTGCAGCATATGTGTCTTTCACCCCAATGGTGTGTGCTTTGTAAAAAACAAGAAGAATCCATGGGCCACATTTTTG
TTTCCTGTGAATACACAGCAAAGCTATGGGCCTCTATTCTTTCATCCTTCGGTTGGCCTATAG
mRNA sequenceShow/hide mRNA sequence
ATGAGGAATTTTTTTTGGAAAGGGAGTAGCAACAAACATGGGCAAAACCTGGTGCGATGGGATAAGATCACTCATCCCATTGAGGATGGAGGCCTCGACATCTCC
AAAATCAAGGAAAAGAATAAAGCTCTATTAGCAAAATGGATCTGGAGGTATCATACAGAAGAATTGGCCCTTTGGAGGCAAGTCATTGAAGCCAAATATGGGCCC
CTCGATAGGCACAAAAGCTCCCTTCACCATTCTCTTGCTTCTGCCCATGGTCCTTGGAAATCTATATTAAAACAGAAGCAACATGTTCTGGACAGAAGCTGCTTC
AAGATAGGCGATGGATCTAAAGTTCTCTTTTGGAAAGATGGATGGTTGGCCCCCAGTCCTCTATGTACTGTCTTCCCCCTCCTATTTGCTTTACACTCCTGCAAA
GATGCCTTGGTGAAGGAGATGTGGTCGATAACTACAAAATCATGGGACCTCTGTTTTAGACGGAATCTCAAAGACGCAGAAATCCATGAATGCCTAGAGCTCCAT
CTCCTGCTGTCTCCCATCCACATCAATCGGGGATGTGATAGATTTGTCTGGAAATTGAAACCATCTGGCCTTTTCTCCGTCAACTCCCTTCTACAAGATCTACAA
ATCTCACAAACTACTCATGAAGAGACTATGCTCTACAAATCGATTTGGAAGGATCATTACCCAAAAAGGATCAAAGTCTTTTTATGGGAGATCAACAAGAATGCC
CTTAATACCCATGACAAGCTTCAAAGAAGTTGCAGCATATGTGTCTTTCACCCCAATGGTGTGTGCTTTGTAAAAAACAAGAAGAATCCATGGGCCACATTTTTG
TTTCCTGTGAATACACAGCAAAGCTATGGGCCTCTATTCTTTCATCCTTCGGTTGGCCTATAG
Protein sequenceShow/hide protein sequence
MRNFFWKGSSNKHGQNLVRWDKITHPIEDGGLDISKIKEKNKALLAKWIWRYHTEELALWRQVIEAKYGPLDRHKSSLHHSLASAHGPWKSILKQKQHVLDRSCF
KIGDGSKVLFWKDGWLAPSPLCTVFPLLFALHSCKDALVKEMWSITTKSWDLCFRRNLKDAEIHECLELHLLLSPIHINRGCDRFVWKLKPSGLFSVNSLLQDLQ
ISQTTHEETMLYKSIWKDHYPKRIKVFLWEINKNALNTHDKLQRSCSICVFHPNGVCFVKNKKNPWATFLFPVNTQQSYGPLFFHPSVGL