; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0014543 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0014543
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRibonuclease H-like superfamily protein
Genome locationchr12:1930806..1932043
RNA-Seq ExpressionLag0014543
SyntenyLag0014543
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
OMO70721.1 Zinc finger, CCHC-type [Corchorus olitorius]2.8e-2128.52Show/hide
Query:  CHVCQEETETTKHALFQCSKARAIWELLLPLMYDDQWGIVDIKDLWLGLTDCQMSDLERICVGAWAIWNDRNSWIHNHPIHDTVSRADWINMYLTEFWQA
        C  C +E ET  HA+ +C KA  +WEL+       +   VD     LG  D    D+    V  WAIW  RN  IH++ ++     A ++  YL E+ + 
Subjt:  CHVCQEETETTKHALFQCSKARAIWELLLPLMYDDQWGIVDIKDLWLGLTDCQMSDLERICVGAWAIWNDRNSWIHNHPIHDTVSRADWINMYLTEFWQA

Query:  NPKGGIPIQSEKDILEIISRGEETILHTDAAFMGDRDTGGIGLVMRDKLGQLLAIKSSGFSNFSSPLVGEVIVVLEGLRMARAFDVRRLMILSDLLNLIN
          K   P  +++   E   + E  + + D A+     TG  G + RD  G +L   +      S   V E    L+  + AR     R+++  D L +I 
Subjt:  NPKGGIPIQSEKDILEIISRGEETILHTDAAFMGDRDTGGIGLVMRDKLGQLLAIKSSGFSNFSSPLVGEVIVVLEGLRMARAFDVRRLMILSDLLNLIN

Query:  IINEEIQGHSSIATVIWDIKVMRNFFENVSFKFVSRKFNRFAHQTARVGLSSPSPILWQGNYPDWMVGLSLTECSLF
          N      S I   I D+K  +N+FE  SF  V RK NR     A++G S    I+W  + P ++  +   +C+ F
Subjt:  IINEEIQGHSSIATVIWDIKVMRNFFENVSFKFVSRKFNRFAHQTARVGLSSPSPILWQGNYPDWMVGLSLTECSLF

TXG50387.1 hypothetical protein EZV62_022911 [Acer yangbiense]1.2e-1926.97Show/hide
Query:  YCHVCQEETETTKHALFQCSKARAIWELLLPLMYDDQWGIVDIKDLWLGL---TDCQMSDLERICVGAWAIWNDRNSWIHNHPIHDTVSRADWINMYLTE
        YC VC +++E+  H L+ C+ A  +W  LL      +  + D   + L L    D  + +L  + +G W +W +RNS +H            WI+ +  E
Subjt:  YCHVCQEETETTKHALFQCSKARAIWELLLPLMYDDQWGIVDIKDLWLGL---TDCQMSDLERICVGAWAIWNDRNSWIHNHPIHDTVSRADWINMYLTE

Query:  FWQANPKGGIPIQSEKDILEIISRGEETILHTDAAFMGDRDTGGIGLVMRDKLGQLLAIKSSGFSNFSSPLVGEVIVVLEGLRMARAFDVRRLMILSDLL
        F  AN      + S +   +   RGE  I + DA+F       G+G+++RD  G  +A +SS     SS  + E    LEG+ +A    V  ++I SD  
Subjt:  FWQANPKGGIPIQSEKDILEIISRGEETILHTDAAFMGDRDTGGIGLVMRDKLGQLLAIKSSGFSNFSSPLVGEVIVVLEGLRMARAFDVRRLMILSDLL

Query:  NLINIINEEIQGHSSIATVIWDIKVMRNFFENVSFKFVSRKFNRFAHQTARVGLSSPSPILWQGNYP
        ++I +++++    + +  +I     +      +S+  V R+ N  AH  A+  LS  SP++W    P
Subjt:  NLINIINEEIQGHSSIATVIWDIKVMRNFFENVSFKFVSRKFNRFAHQTARVGLSSPSPILWQGNYP

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]1.5e-2731.9Show/hide
Query:  CHVCQEETETTKHALFQCSKARAIWELLLP----LMYDDQWGIVDIKDLWLGLTD-CQMSDLERICVGAWAIWNDRNSWIHNHPIHDTVSRADWINMYLT
        C +C +  E+  HA F C +AR IW  L P    L  +D    +   +LW  LT+  +  DL    +  W IWNDRNS IH   +     + +W+  +L 
Subjt:  CHVCQEETETTKHALFQCSKARAIWELLLP----LMYDDQWGIVDIKDLWLGLTD-CQMSDLERICVGAWAIWNDRNSWIHNHPIHDTVSRADWINMYLT

Query:  EFWQANPKGGIP-IQSEKDILEIISRGEETI---LHTDAAFMGDRDTGGIGLVMRDKLGQLLAIKSSGFSNFSSPLVGEVIVVLEGLRMARAFDVRRLMI
           QA      P  QS    +    R   ++   L+TDAA  G   +   G ++RD    L+A  S       SPL+ E+  +LEGL+ A A +   L +
Subjt:  EFWQANPKGGIP-IQSEKDILEIISRGEETI---LHTDAAFMGDRDTGGIGLVMRDKLGQLLAIKSSGFSNFSSPLVGEVIVVLEGLRMARAFDVRRLMI

Query:  LSDLLNLINIINEEIQGHSSIATVIWDIKVMRNFFENVSFKFVSRKFNRFAHQTARVGLSSPS-PILWQGNYPDWMVGL
         SD L  I +I  EI         + +I+ +   F  +SF   SR+ NR AH  A+ G++SPS    W  N+P W++ L
Subjt:  LSDLLNLINIINEEIQGHSSIATVIWDIKVMRNFFENVSFKFVSRKFNRFAHQTARVGLSSPS-PILWQGNYPDWMVGL

XP_030508852.1 uncharacterized protein LOC115723496 [Cannabis sativa]3.1e-2028.22Show/hide
Query:  YCHVCQEETETTKHALFQCSKARAIWEL---LLPLMYDDQWGIVDIKDLWLGLTDCQMSDLERICVGAWAIWNDRNSWIHNHPIHDTVSRADWINMYLTE
        YC +C    ET  HALF C +A+A+WEL    +     ++    D   L L  T    S+LE   V  W+IW++RN+  H + +    + A +   YLTE
Subjt:  YCHVCQEETETTKHALFQCSKARAIWEL---LLPLMYDDQWGIVDIKDLWLGLTDCQMSDLERICVGAWAIWNDRNSWIHNHPIHDTVSRADWINMYLTE

Query:  FWQANPKGGIPIQS------EKDILEIISRGEETI-------LHTDAAFMGDRDTGGIGLVMRDKLGQLLAIKSSGF-SNFSSPLVGEVIVVLEGLRMAR
        F QA  K   P+ +       +   E I   + T        L+TDAA   +R+T GIG V+R+  G ++A  S  F  NF +  + E + +   L    
Subjt:  FWQANPKGGIPIQS------EKDILEIISRGEETI-------LHTDAAFMGDRDTGGIGLVMRDKLGQLLAIKSSGF-SNFSSPLVGEVIVVLEGLRMAR

Query:  AFDVRRLMILSDLLNLINIINEEIQGHSSIATVIWDIKVMRNFFENVSFKFVSRKFNRFAHQTARVGLSSPSPILWQGNYPDWMVGL
        + ++    I +D L ++  +       S+   ++ +I  + +FF       VSR  N +AH  A+  L+  +  +W G +P  ++ L
Subjt:  AFDVRRLMILSDLLNLINIINEEIQGHSSIATVIWDIKVMRNFFENVSFKFVSRKFNRFAHQTARVGLSSPSPILWQGNYPDWMVGL

XP_038704726.1 uncharacterized protein LOC120000672 [Tripterygium wilfordii]9.1e-2026.89Show/hide
Query:  ETTKHALFQCSKARAIWELLLPLMYDDQWGIVDIKDLWLG--LTDCQMSDLERICVGAWAIWNDRNSWIHNHPIHDTVSRADWINMYLTEFWQANPKGGI
        E   HAL +C  A  +W+    + Y  +   +  +D WL   L   +    +R+   AWA+W ++NS+I         S  +++  Y+ E+ QA     +
Subjt:  ETTKHALFQCSKARAIWELLLPLMYDDQWGIVDIKDLWLG--LTDCQMSDLERICVGAWAIWNDRNSWIHNHPIHDTVSRADWINMYLTEFWQANPKGGI

Query:  --PIQSEKDILEIISRGEETI-LHTDAAFMGDRDTGGIGLVMRDKLGQLLAIKSSGFSNFSSPLVGEVIVVLEGLRMARAFDVRRLMILSDLLNLINIIN
          P QS ++ ++       TI L  D A   +    G+G V+RD  GQ+ A  S   +    P V +   +++G ++A    V  L+I SD   L+N I 
Subjt:  --PIQSEKDILEIISRGEETI-LHTDAAFMGDRDTGGIGLVMRDKLGQLLAIKSSGFSNFSSPLVGEVIVVLEGLRMARAFDVRRLMILSDLLNLINIIN

Query:  EEIQGHSSIATVIWDIKVMRNFFENVSFKFVSRKFNRFAHQTARVGLSSPSPILWQGNYPDWMV
         E    +    ++ DI+ + +   +VS ++  R  NR AH+ AR  ++   P+ W G  PD+++
Subjt:  EEIQGHSSIATVIWDIKVMRNFFENVSFKFVSRKFNRFAHQTARVGLSSPSPILWQGNYPDWMV

TrEMBL top hitse value%identityAlignment
A0A1R3HK95 Zinc finger, CCHC-type1.4e-2128.52Show/hide
Query:  CHVCQEETETTKHALFQCSKARAIWELLLPLMYDDQWGIVDIKDLWLGLTDCQMSDLERICVGAWAIWNDRNSWIHNHPIHDTVSRADWINMYLTEFWQA
        C  C +E ET  HA+ +C KA  +WEL+       +   VD     LG  D    D+    V  WAIW  RN  IH++ ++     A ++  YL E+ + 
Subjt:  CHVCQEETETTKHALFQCSKARAIWELLLPLMYDDQWGIVDIKDLWLGLTDCQMSDLERICVGAWAIWNDRNSWIHNHPIHDTVSRADWINMYLTEFWQA

Query:  NPKGGIPIQSEKDILEIISRGEETILHTDAAFMGDRDTGGIGLVMRDKLGQLLAIKSSGFSNFSSPLVGEVIVVLEGLRMARAFDVRRLMILSDLLNLIN
          K   P  +++   E   + E  + + D A+     TG  G + RD  G +L   +      S   V E    L+  + AR     R+++  D L +I 
Subjt:  NPKGGIPIQSEKDILEIISRGEETILHTDAAFMGDRDTGGIGLVMRDKLGQLLAIKSSGFSNFSSPLVGEVIVVLEGLRMARAFDVRRLMILSDLLNLIN

Query:  IINEEIQGHSSIATVIWDIKVMRNFFENVSFKFVSRKFNRFAHQTARVGLSSPSPILWQGNYPDWMVGLSLTECSLF
          N      S I   I D+K  +N+FE  SF  V RK NR     A++G S    I+W  + P ++  +   +C+ F
Subjt:  IINEEIQGHSSIATVIWDIKVMRNFFENVSFKFVSRKFNRFAHQTARVGLSSPSPILWQGNYPDWMVGLSLTECSLF

A0A2P6QT88 Putative ribonuclease H-like domain, reverse transcriptase zinc-binding domain-containing protein7.5e-2026.52Show/hide
Query:  MCWYGYCHVCQEETETTKHALFQCSKARAIWELLLPLMYDDQWGIVDIKDLWLG-LTDCQMSDLERICVGAWAIWNDRNSWIHNHPIHDTVSR----ADW
        +C    C  C E  ET  H L++C   R IW+L         W      DL+   L   Q   +E   +  W IW DRN++IH     D        +DW
Subjt:  MCWYGYCHVCQEETETTKHALFQCSKARAIWELLLPLMYDDQWGIVDIKDLWLG-LTDCQMSDLERICVGAWAIWNDRNSWIHNHPIHDTVSR----ADW

Query:  INMYLTEFWQANP--KGGIPIQSEKDILEIISRGEETILHTDAAFMGDRDTGGIGLVMRDKLGQLLAIKSSGFSNFSSPLVGEVIVVLEGLRMARAFDVR
            L + W+     K  +  +++ ++       +   L+ D A   +    G+G V+R++LG+L+   +  F+    P   E + + EGLR +R   +R
Subjt:  INMYLTEFWQANP--KGGIPIQSEKDILEIISRGEETILHTDAAFMGDRDTGGIGLVMRDKLGQLLAIKSSGFSNFSSPLVGEVIVVLEGLRMARAFDVR

Query:  RLMILSDLLNLINIINEEIQGHSSIATVIWDIKVMRNFFENVSFKFVSRKFNRFAHQTARVGLSSPSPILWQGNYPDWM
         L +  D L +IN +++     + I  V+  ++++   FE V++K V +K N+ AH+ AR  L S   +  +   P W+
Subjt:  RLMILSDLLNLINIINEEIQGHSSIATVIWDIKVMRNFFENVSFKFVSRKFNRFAHQTARVGLSSPSPILWQGNYPDWM

A0A5C7H0P0 Uncharacterized protein5.7e-2026.97Show/hide
Query:  YCHVCQEETETTKHALFQCSKARAIWELLLPLMYDDQWGIVDIKDLWLGL---TDCQMSDLERICVGAWAIWNDRNSWIHNHPIHDTVSRADWINMYLTE
        YC VC +++E+  H L+ C+ A  +W  LL      +  + D   + L L    D  + +L  + +G W +W +RNS +H            WI+ +  E
Subjt:  YCHVCQEETETTKHALFQCSKARAIWELLLPLMYDDQWGIVDIKDLWLGL---TDCQMSDLERICVGAWAIWNDRNSWIHNHPIHDTVSRADWINMYLTE

Query:  FWQANPKGGIPIQSEKDILEIISRGEETILHTDAAFMGDRDTGGIGLVMRDKLGQLLAIKSSGFSNFSSPLVGEVIVVLEGLRMARAFDVRRLMILSDLL
        F  AN      + S +   +   RGE  I + DA+F       G+G+++RD  G  +A +SS     SS  + E    LEG+ +A    V  ++I SD  
Subjt:  FWQANPKGGIPIQSEKDILEIISRGEETILHTDAAFMGDRDTGGIGLVMRDKLGQLLAIKSSGFSNFSSPLVGEVIVVLEGLRMARAFDVRRLMILSDLL

Query:  NLINIINEEIQGHSSIATVIWDIKVMRNFFENVSFKFVSRKFNRFAHQTARVGLSSPSPILWQGNYP
        ++I +++++    + +  +I     +      +S+  V R+ N  AH  A+  LS  SP++W    P
Subjt:  NLINIINEEIQGHSSIATVIWDIKVMRNFFENVSFKFVSRKFNRFAHQTARVGLSSPSPILWQGNYP

A0A6J1DX30 uncharacterized protein LOC1110248747.5e-2831.9Show/hide
Query:  CHVCQEETETTKHALFQCSKARAIWELLLP----LMYDDQWGIVDIKDLWLGLTD-CQMSDLERICVGAWAIWNDRNSWIHNHPIHDTVSRADWINMYLT
        C +C +  E+  HA F C +AR IW  L P    L  +D    +   +LW  LT+  +  DL    +  W IWNDRNS IH   +     + +W+  +L 
Subjt:  CHVCQEETETTKHALFQCSKARAIWELLLP----LMYDDQWGIVDIKDLWLGLTD-CQMSDLERICVGAWAIWNDRNSWIHNHPIHDTVSRADWINMYLT

Query:  EFWQANPKGGIP-IQSEKDILEIISRGEETI---LHTDAAFMGDRDTGGIGLVMRDKLGQLLAIKSSGFSNFSSPLVGEVIVVLEGLRMARAFDVRRLMI
           QA      P  QS    +    R   ++   L+TDAA  G   +   G ++RD    L+A  S       SPL+ E+  +LEGL+ A A +   L +
Subjt:  EFWQANPKGGIP-IQSEKDILEIISRGEETI---LHTDAAFMGDRDTGGIGLVMRDKLGQLLAIKSSGFSNFSSPLVGEVIVVLEGLRMARAFDVRRLMI

Query:  LSDLLNLINIINEEIQGHSSIATVIWDIKVMRNFFENVSFKFVSRKFNRFAHQTARVGLSSPS-PILWQGNYPDWMVGL
         SD L  I +I  EI         + +I+ +   F  +SF   SR+ NR AH  A+ G++SPS    W  N+P W++ L
Subjt:  LSDLLNLINIINEEIQGHSSIATVIWDIKVMRNFFENVSFKFVSRKFNRFAHQTARVGLSSPS-PILWQGNYPDWMVGL

A0A803QDG1 Uncharacterized protein2.6e-2026.79Show/hide
Query:  CHVCQEETETTKHALFQCSKARAIWELLLPLMYDDQWGIVDIKDLWLGL-TDCQMSDLERICVGAWAIWNDRNSWIHNHPIHDTVSRADWINMYLTEFWQ
        C VC+ E ETTKHAL  CS+ R  W       Y D +  +DI   +LGL       D+  +    W++WN RN+++HN          DW  ++  +F  
Subjt:  CHVCQEETETTKHALFQCSKARAIWELLLPLMYDDQWGIVDIKDLWLGL-TDCQMSDLERICVGAWAIWNDRNSWIHNHPIHDTVSRADWINMYLTEFWQ

Query:  ANPKGGIPIQSEKDILEIISRGEETI-LHTDAAFMGDRDTGGIGLVMRDKLGQLLAIKSSGFSNFSSPLVGEVIVVLEGLRMARAFDVRRLMILSDLLNL
        A        QS      + S     + L  DA          IGLV+ D    + A  S+ FS    P V E   + + +  A+   +   ++L+D  ++
Subjt:  ANPKGGIPIQSEKDILEIISRGEETI-LHTDAAFMGDRDTGGIGLVMRDKLGQLLAIKSSGFSNFSSPLVGEVIVVLEGLRMARAFDVRRLMILSDLLNL

Query:  INIINEEIQGHSSIATVIWDIKVMRNFFENVSFKFVSRKFNRFAHQTARVGLSSPSPILWQGNYP
        ++ +N     +S +  V+  I  + +F   ++  ++SR+ N  AH  A++GL   + ++W G+ P
Subjt:  INIINEEIQGHSSIATVIWDIKVMRNFFENVSFKFVSRKFNRFAHQTARVGLSSPSPILWQGNYP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10000.1 Ribonuclease H-like superfamily protein5.3e-1023.74Show/hide
Query:  ETTKHALFQCSKARAIWELLLPLMYDDQWGIVDIKDLWLGLTDCQMSDLERICVG----------AWAIWNDRNSWIHNH---PIHDTVSRADWINMYLT
        ET+ H LF C  A  +W  L PL    Q G + I    L   +     +    VG           W IW  RN  I  +    + +TV++A    +   
Subjt:  ETTKHALFQCSKARAIWELLLPLMYDDQWGIVDIKDLWLGLTDCQMSDLERICVG----------AWAIWNDRNSWIHNH---PIHDTVSRADWINMYLT

Query:  EFWQANPKGGIPIQSEKDILEIISRGEETILHTDAAFMGDRDTGGIGLVMR--DKLGQLLAIKSSGFSNFSSPLVGEVIVVLEGLRMARAFDVRRLMILS
            A PK  +          +     + + + DAA+       G G V +      + +   S+G   F SPL  E   +   +  A   +   L++LS
Subjt:  EFWQANPKGGIPIQSEKDILEIISRGEETILHTDAAFMGDRDTGGIGLVMR--DKLGQLLAIKSSGFSNFSSPLVGEVIVVLEGLRMARAFDVRRLMILS

Query:  DLLNLINIINEEIQGHSSIATVIWDIKVMRNFFENVSFKFVSRKFNRFAHQTARVGL
        D  ++++ +N  +   + I  ++ +I+ +RN F ++SF+F+ R  N  A   A++ L
Subjt:  DLLNLINIINEEIQGHSSIATVIWDIKVMRNFFENVSFKFVSRKFNRFAHQTARVGL

AT2G13980.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.4e-0530.77Show/hide
Query:  NFSSPLVGEVIVVLEGLRMARAFDVRRLMILSDLLNLINIINEEIQGHSSIATVIWDIKVMRNFFENVSFKFVSRKFNRFAHQTARVGLSS
        N S+PL  E   +L  L+        R+++  D   L N+++     H+ +A ++ DI++    F NV F FV R  N+ AH+ A++G +S
Subjt:  NFSSPLVGEVIVVLEGLRMARAFDVRRLMILSDLLNLINIINEEIQGHSSIATVIWDIKVMRNFFENVSFKFVSRKFNRFAHQTARVGLSS

AT3G09510.1 Ribonuclease H-like superfamily protein6.5e-1626.43Show/hide
Query:  CHVCQEETETTKHALFQCSKARAIWELLLPLMYDDQWGIVD----IKDLWLGLTDCQMSDLERICVG--AWAIWNDRNSWIHN----HPIHDTVSRA---
        C  C  E E+  HALF C  A   W L    +  +Q    D    I ++   + D  MSD  ++      W IW  RN+ + N     P    +S     
Subjt:  CHVCQEETETTKHALFQCSKARAIWELLLPLMYDDQWGIVD----IKDLWLGLTDCQMSDLERICVG--AWAIWNDRNSWIHN----HPIHDTVSRA---

Query:  -DWINMYLTEFWQANPKGGIPIQSEKDILEIISRGEETILHTDAAFMGDRDTGGIGLVMRDKLGQLLAIKSSGFSNFSSPLVGEVIVVLEGLRMARAFDV
         DW+N   +     +P   I   +E  I            + DA F   +     G ++R+  G  ++  S   ++ S+PL  E   +L  L+       
Subjt:  -DWINMYLTEFWQANPKGGIPIQSEKDILEIISRGEETILHTDAAFMGDRDTGGIGLVMRDKLGQLLAIKSSGFSNFSSPLVGEVIVVLEGLRMARAFDV

Query:  RRLMILSDLLNLINIINEEIQGHSSIATVIWDIKVMRNFFENVSFKFVSRKFNRFAHQTARVGLSSPSPILWQGNYPDWM
         ++ +  D   LIN+IN  I  HSS+A  + DI    N F ++ F F+ RK N+ AH  A+ G +  +     G+ P W+
Subjt:  RRLMILSDLLNLINIINEEIQGHSSIATVIWDIKVMRNFFENVSFKFVSRKFNRFAHQTARVGLSSPSPILWQGNYPDWM

AT4G09490.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.2e-0631.06Show/hide
Query:  EETILHTDAAFMGDRDTGGIGLVMRDKLGQLLAIK-SSGFSNFSSPLVGEVIVVLEGLRMARAFDVRRLMILSDLLNLINIINEEIQGHSSIATVIWDIK
        +   + TDAA+  +    G G V+R+   +L A+   S   N   PL+ E I +   L+ A++  + +L + SD   LI  I  E    +    +I+DI 
Subjt:  EETILHTDAAFMGDRDTGGIGLVMRDKLGQLLAIK-SSGFSNFSSPLVGEVIVVLEGLRMARAFDVRRLMILSDLLNLINIINEEIQGHSSIATVIWDIK

Query:  VMRNFFENVSFKFVSRKFNRFAHQTARVGLSS
         +   F +VSF FV R  NR A + A+  L S
Subjt:  VMRNFFENVSFKFVSRKFNRFAHQTARVGLSS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGCTGGTATGGTTATTGTCATGTTTGCCAAGAGGAGACTGAGACTACCAAGCATGCCCTTTTCCAATGTTCGAAAGCCCGAGCGATTTGGGAGTTATTACTTCCGTT
AATGTACGACGATCAATGGGGTATAGTGGATATCAAAGATCTTTGGTTGGGATTGACTGATTGCCAAATGTCGGATCTTGAACGTATCTGTGTGGGGGCTTGGGCCATTT
GGAATGATAGAAATAGTTGGATTCACAATCATCCAATTCATGACACAGTGTCTAGGGCGGATTGGATAAATATGTATCTGACAGAATTTTGGCAGGCAAACCCTAAGGGT
GGTATTCCAATCCAGTCCGAGAAAGATATTTTGGAGATCATCTCCCGAGGCGAGGAGACTATTTTGCACACTGATGCAGCGTTTATGGGAGACCGAGACACAGGTGGTAT
TGGGTTAGTAATGCGGGACAAATTGGGGCAACTGCTTGCAATAAAATCATCAGGTTTTTCGAACTTTTCATCCCCTTTGGTGGGGGAAGTGATTGTGGTGCTAGAAGGAC
TTCGAATGGCACGAGCTTTTGATGTGAGGAGGTTGATGATTTTGTCTGACTTGTTGAACTTAATTAACATCATTAACGAGGAGATACAAGGTCATTCTAGTATTGCAACA
GTTATCTGGGATATTAAAGTCATGAGGAATTTTTTTGAGAATGTAAGTTTTAAATTTGTTAGTCGAAAGTTTAATAGGTTTGCTCATCAAACGGCCCGTGTTGGGTTATC
TTCTCCTTCACCGATTTTGTGGCAAGGAAACTATCCTGATTGGATGGTTGGGTTATCGCTCACAGAATGTAGTTTGTTTGTACCCCTGAGATATTGCTCATGA
mRNA sequenceShow/hide mRNA sequence
ATGTGCTGGTATGGTTATTGTCATGTTTGCCAAGAGGAGACTGAGACTACCAAGCATGCCCTTTTCCAATGTTCGAAAGCCCGAGCGATTTGGGAGTTATTACTTCCGTT
AATGTACGACGATCAATGGGGTATAGTGGATATCAAAGATCTTTGGTTGGGATTGACTGATTGCCAAATGTCGGATCTTGAACGTATCTGTGTGGGGGCTTGGGCCATTT
GGAATGATAGAAATAGTTGGATTCACAATCATCCAATTCATGACACAGTGTCTAGGGCGGATTGGATAAATATGTATCTGACAGAATTTTGGCAGGCAAACCCTAAGGGT
GGTATTCCAATCCAGTCCGAGAAAGATATTTTGGAGATCATCTCCCGAGGCGAGGAGACTATTTTGCACACTGATGCAGCGTTTATGGGAGACCGAGACACAGGTGGTAT
TGGGTTAGTAATGCGGGACAAATTGGGGCAACTGCTTGCAATAAAATCATCAGGTTTTTCGAACTTTTCATCCCCTTTGGTGGGGGAAGTGATTGTGGTGCTAGAAGGAC
TTCGAATGGCACGAGCTTTTGATGTGAGGAGGTTGATGATTTTGTCTGACTTGTTGAACTTAATTAACATCATTAACGAGGAGATACAAGGTCATTCTAGTATTGCAACA
GTTATCTGGGATATTAAAGTCATGAGGAATTTTTTTGAGAATGTAAGTTTTAAATTTGTTAGTCGAAAGTTTAATAGGTTTGCTCATCAAACGGCCCGTGTTGGGTTATC
TTCTCCTTCACCGATTTTGTGGCAAGGAAACTATCCTGATTGGATGGTTGGGTTATCGCTCACAGAATGTAGTTTGTTTGTACCCCTGAGATATTGCTCATGA
Protein sequenceShow/hide protein sequence
MCWYGYCHVCQEETETTKHALFQCSKARAIWELLLPLMYDDQWGIVDIKDLWLGLTDCQMSDLERICVGAWAIWNDRNSWIHNHPIHDTVSRADWINMYLTEFWQANPKG
GIPIQSEKDILEIISRGEETILHTDAAFMGDRDTGGIGLVMRDKLGQLLAIKSSGFSNFSSPLVGEVIVVLEGLRMARAFDVRRLMILSDLLNLINIINEEIQGHSSIAT
VIWDIKVMRNFFENVSFKFVSRKFNRFAHQTARVGLSSPSPILWQGNYPDWMVGLSLTECSLFVPLRYCS