; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g11290 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g11290
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionzf-RVT domain-containing protein
Genome locationchr4:8495134..8499846
RNA-Seq ExpressionMoc04g11290
SyntenyMoc04g11290
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022131662.1 uncharacterized protein LOC111004787 [Momordica charantia]7.1e-2242.86Show/hide
Query:  HSSYFWQGMVWCRDLLVTGIRKVVGNGSSINFFKDLWIPRENTFRPLSPCLDPSRIHWVADFITSTRNWDVQKLRSIVGEEDVATISNIHISAVGGLDRW
        + SYFW+G +W RDLL+ G+R  VGNGS+IN F D WIPR  +FRP++    P  +  VAD I     WDV  +  I  EED   I ++ +S    +D W
Subjt:  HSSYFWQGMVWCRDLLVTGIRKVVGNGSSINFFKDLWIPRENTFRPLSPCLDPSRIHWVADFITSTRNWDVQKLRSIVGEEDVATISNIHISAVGGLDRW

Query:  MWHFTKNGSYTEETVDHAL
        +WHF K G Y  ++V +++
Subjt:  MWHFTKNGSYTEETVDHAL

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]2.8e-2625Show/hide
Query:  SSYFWQGMVWCRDLLVTGIRKVVGNGSSINFFKDLWIPRENTFRPL---SPCLDPSRIHWVADFITSTRNWDVQKLRSIVGEEDVATISNIHISAVGGLD
        SSYFW+G +W RDLLV G+R  VGNGS+I  F D W+PR  TF+PL   +  LD +    VA FIT+  NWDV  +      ED   I ++ IS+    D
Subjt:  SSYFWQGMVWCRDLLVTGIRKVVGNGSSINFFKDLWIPRENTFRPL---SPCLDPSRIHWVADFITSTRNWDVQKLRSIVGEEDVATISNIHISAVGGLD

Query:  RWMWHFTKNGSYT--------------------------------------------------------------------------EETVDHALVGCKR
         W+WH+ K G+Y+                                                                           E++ HA   CKR
Subjt:  RWMWHFTKNGSYT--------------------------------------------------------------------------EETVDHALVGCKR

Query:  ASKIWSLLLPALDS-SNNFNGCFVDRWTKVTSDLSQKEVTLAA---------------------------------------------------------
        A +IW  L P L   S   N  F++ W+ +T  L  K++ LAA                                                         
Subjt:  ASKIWSLLLPALDS-SNNFNGCFVDRWTKVTSDLSQKEVTLAA---------------------------------------------------------

Query:  ------------------------FG----------------KPPFE-----------IEGISLAIRMGFSKVMVESDSLEAVKLIEKNEGWLNEVASLI
                                FG                + PF            +EG+  A    F+ + VESDSL A++LI        +  + +
Subjt:  ------------------------FG----------------KPPFE-----------IEGISLAIRMGFSKVMVESDSLEAVKLIEKNEGWLNEVASLI

Query:  VDIRHLCLNFDDFRIQHVRREGNRAAHILARNAI-SINSPLLWLCDFPVWLLSSVKADDRTNVA
        ++I+ L   F      H  R+ NRAAH LA+  I S ++   WL +FP WLL  V+ D  +N A
Subjt:  VDIRHLCLNFDDFRIQHVRREGNRAAHILARNAI-SINSPLLWLCDFPVWLLSSVKADDRTNVA

XP_030503504.1 uncharacterized protein LOC115718825 [Cannabis sativa]7.8e-2125.13Show/hide
Query:  WQGMVWCRDLLVTGIRKVVGNGSSINFFKDLWIPRENTFRPLSPCLDPSRIHWVADFITSTRNWDVQKLRSIVGEEDVATISNIHISAVGGLDRWMWHFT
        WQG+ W +DLL+ G+R  VG+G SI    D WIP +N F P+           VAD+IT  + WDV +L +     DV  I  I +S     D ++WH+T
Subjt:  WQGMVWCRDLLVTGIRKVVGNGSSINFFKDLWIPRENTFRPLSPCLDPSRIHWVADFITSTRNWDVQKLRSIVGEEDVATISNIHISAVGGLDRWMWHFT

Query:  KNGSYTEETVDH---------ALVGCKRASK----IWSLLL----PALDSSNNFNGCFVDRWTKVTSDLSQ-----------------------------
          G YT ++  H            G     K     WSL L    PAL    +  G  +D    V + L++                             
Subjt:  KNGSYTEETVDH---------ALVGCKRASK----IWSLLL----PALDSSNNFNGCFVDRWTKVTSDLSQ-----------------------------

Query:  -------------------------------------------------KEVTLAAFGKPPF------EIEGISLAIRMGFSKVM------VESDSLEAV
                                                         + V LAA  +P        E+E  +L   + +S +M      +E+DSL  V
Subjt:  -------------------------------------------------KEVTLAAFGKPPF------EIEGISLAIRMGFSKVM------VESDSLEAV

Query:  KLIEKNEGWLNEVASLIVDIRHLCLNFDDFRIQHVRREGNRAAHILARNAISINSPLLWLCDFPVWLLSSVKAD
        + I           +L++D+++L  NF +  + HVRR+ N+AAH LA+ A++++S  +WL + P  + S +  D
Subjt:  KLIEKNEGWLNEVASLIVDIRHLCLNFDDFRIQHVRREGNRAAHILARNAISINSPLLWLCDFPVWLLSSVKAD

XP_030509336.1 uncharacterized protein LOC115724021 [Cannabis sativa]6.0e-2125.2Show/hide
Query:  SSYFWQGMVWCRDLLVTGIRKVVGNGSSINFFKDLWIPRENTFRPLSPCLDPSRIHWVADFITSTRNWDVQKLRSIVGEEDVATISNIHISAVGGLDRWM
        SS  WQG+ W RDLLV G+R  +G+GSS+    D WIPR + F P+  C      + V+ +IT  + W++  L       DV  I +I +S+    D+W+
Subjt:  SSYFWQGMVWCRDLLVTGIRKVVGNGSSINFFKDLWIPRENTFRPLSPCLDPSRIHWVADFITSTRNWDVQKLRSIVGEEDVATISNIHISAVGGLDRWM

Query:  WHFTKNGSYT--------EETVDHALVGCKRASKIW-------------------------SLLLPALDS--------------SNNFN-GCF-------
        WHFT +  YT         +  D  L         W                          + LPA+ +               N  N  CF       
Subjt:  WHFTKNGSYT--------EETVDHALVGCKRASKIW-------------------------SLLLPALDS--------------SNNFN-GCF-------

Query:  ------------VDRWTK-----------VTSDLSQKEVT--------------------LAAFGKP------PFEIE------GISLAIRMGFSKVMVE
                       W K            TS  S ++ T                    +AA+ KP      P E+E      GI+ A R   S  + E
Subjt:  ------------VDRWTK-----------VTSDLSQKEVT--------------------LAAFGKP------PFEIE------GISLAIRMGFSKVMVE

Query:  SDSLEAVKLIEKNEGWLNEVASLIVDIRHLCLNFDDFRIQHVRREGNRAAHILARNAISINSPLLWLCDFPVWLLSSVKAD
        SDSL  V  I      ++    L++DI++         + HV+R+ N+AAH LA++A+ ++   +W  + P  + S V  D
Subjt:  SDSLEAVKLIEKNEGWLNEVASLIVDIRHLCLNFDDFRIQHVRREGNRAAHILARNAISINSPLLWLCDFPVWLLSSVKAD

XP_042969199.1 uncharacterized protein LOC122301911 [Carya illinoinensis]3.9e-2026.59Show/hide
Query:  AKADYHSSYFWQGMVWCRDLLVTGIRKVVGNGSSINFFKDLWIPRENTFRPLSPCLDPSRIHWVADFITSTRNWDVQKLRSIVGEEDVATISNIHISAVG
        AKA   SSY W G+    D L    R  VGNG +I  FKD W+P   +   L    +  ++H + D   ST  W+V  +R++     +  I  + IS V 
Subjt:  AKADYHSSYFWQGMVWCRDLLVTGIRKVVGNGSSINFFKDLWIPRENTFRPLSPCLDPSRIHWVADFITSTRNWDVQKLRSIVGEEDVATISNIHISAVG

Query:  GLDRWMWHFTKNGSYTEETVDHALVGCKRAS---------------KIWSLLLP-------------------------ALDSS--------------NN
          D   W   K GS++ ++    L   +  S                +W L LP                          LD +               N
Subjt:  GLDRWMWHFTKNGSYTEETVDHALVGCKRAS---------------KIWSLLLP-------------------------ALDSS--------------NN

Query:  FNGCFVDRWTKVTSDLSQKEVTLAAFGKPPFEIEGISLAIRMGFSKVMVESDSLEAVKLIEKNEGWLNEVASLIVDIRHLCLNFDDFRIQHVRREGNRAA
         NG  V   +KV     +KEV+ A F +    + G+ L ++ G  K+M+++D L  V  + +N   L + A ++ DIR L   F + ++ HV R GN  A
Subjt:  FNGCFVDRWTKVTSDLSQKEVTLAAFGKPPFEIEGISLAIRMGFSKVMVESDSLEAVKLIEKNEGWLNEVASLIVDIRHLCLNFDDFRIQHVRREGNRAA

Query:  HILARNAISINSPLLWLCDFPVWLLSSVKAD
        H+LAR+   IN   +W    P ++  ++  D
Subjt:  HILARNAISINSPLLWLCDFPVWLLSSVKAD

TrEMBL top hitse value%identityAlignment
A0A6J1BRN0 uncharacterized protein LOC1110047873.4e-2242.86Show/hide
Query:  HSSYFWQGMVWCRDLLVTGIRKVVGNGSSINFFKDLWIPRENTFRPLSPCLDPSRIHWVADFITSTRNWDVQKLRSIVGEEDVATISNIHISAVGGLDRW
        + SYFW+G +W RDLL+ G+R  VGNGS+IN F D WIPR  +FRP++    P  +  VAD I     WDV  +  I  EED   I ++ +S    +D W
Subjt:  HSSYFWQGMVWCRDLLVTGIRKVVGNGSSINFFKDLWIPRENTFRPLSPCLDPSRIHWVADFITSTRNWDVQKLRSIVGEEDVATISNIHISAVGGLDRW

Query:  MWHFTKNGSYTEETVDHAL
        +WHF K G Y  ++V +++
Subjt:  MWHFTKNGSYTEETVDHAL

A0A6J1DX30 uncharacterized protein LOC1110248741.3e-2625Show/hide
Query:  SSYFWQGMVWCRDLLVTGIRKVVGNGSSINFFKDLWIPRENTFRPL---SPCLDPSRIHWVADFITSTRNWDVQKLRSIVGEEDVATISNIHISAVGGLD
        SSYFW+G +W RDLLV G+R  VGNGS+I  F D W+PR  TF+PL   +  LD +    VA FIT+  NWDV  +      ED   I ++ IS+    D
Subjt:  SSYFWQGMVWCRDLLVTGIRKVVGNGSSINFFKDLWIPRENTFRPL---SPCLDPSRIHWVADFITSTRNWDVQKLRSIVGEEDVATISNIHISAVGGLD

Query:  RWMWHFTKNGSYT--------------------------------------------------------------------------EETVDHALVGCKR
         W+WH+ K G+Y+                                                                           E++ HA   CKR
Subjt:  RWMWHFTKNGSYT--------------------------------------------------------------------------EETVDHALVGCKR

Query:  ASKIWSLLLPALDS-SNNFNGCFVDRWTKVTSDLSQKEVTLAA---------------------------------------------------------
        A +IW  L P L   S   N  F++ W+ +T  L  K++ LAA                                                         
Subjt:  ASKIWSLLLPALDS-SNNFNGCFVDRWTKVTSDLSQKEVTLAA---------------------------------------------------------

Query:  ------------------------FG----------------KPPFE-----------IEGISLAIRMGFSKVMVESDSLEAVKLIEKNEGWLNEVASLI
                                FG                + PF            +EG+  A    F+ + VESDSL A++LI        +  + +
Subjt:  ------------------------FG----------------KPPFE-----------IEGISLAIRMGFSKVMVESDSLEAVKLIEKNEGWLNEVASLI

Query:  VDIRHLCLNFDDFRIQHVRREGNRAAHILARNAI-SINSPLLWLCDFPVWLLSSVKADDRTNVA
        ++I+ L   F      H  R+ NRAAH LA+  I S ++   WL +FP WLL  V+ D  +N A
Subjt:  VDIRHLCLNFDDFRIQHVRREGNRAAHILARNAI-SINSPLLWLCDFPVWLLSSVKADDRTNVA

A0A803NG99 Uncharacterized protein1.3e-2125.4Show/hide
Query:  SSYFWQGMVWCRDLLVTGIRKVVGNGSSINFFKDLWIPRENTFRPLSPCLDPSRIHWVADFITSTRNWDVQKLRSIVGEEDVATISNIHISAVGGLDRWM
        SS  WQG+ W +DLL+ G+R  VG+G SI    D WIP +N F P+           VAD+IT  + WDV +L +     DV  I  I +S     D ++
Subjt:  SSYFWQGMVWCRDLLVTGIRKVVGNGSSINFFKDLWIPRENTFRPLSPCLDPSRIHWVADFITSTRNWDVQKLRSIVGEEDVATISNIHISAVGGLDRWM

Query:  WHFTKNGSYTEETVDH---------ALVGCKRASK----IWSLLL----PALDSSNNFNGCFVDRWTKVTSDLSQ-------------------------
        WH+T  G YT ++  H            G     K     WSL L    PAL    +  G  +D    V + L++                         
Subjt:  WHFTKNGSYTEETVDH---------ALVGCKRASK----IWSLLL----PALDSSNNFNGCFVDRWTKVTSDLSQ-------------------------

Query:  -----------------------------------------------------KEVTLAAFGKPPF------EIEGISLAIRMGFSKVM------VESDS
                                                             + V LAA  +P        E+E  +L   + +S +M      +E+DS
Subjt:  -----------------------------------------------------KEVTLAAFGKPPF------EIEGISLAIRMGFSKVM------VESDS

Query:  LEAVKLIEKNEGWLNEVASLIVDIRHLCLNFDDFRIQHVRREGNRAAHILARNAISINSPLLWLCDFPVWLLSSVKAD
        L  V+ I           +L++D+++L  NF +  + HVRR+ N+AAH LA+ A++++S  +WL + P  + S +  D
Subjt:  LEAVKLIEKNEGWLNEVASLIVDIRHLCLNFDDFRIQHVRREGNRAAHILARNAISINSPLLWLCDFPVWLLSSVKAD

A0A803NPU7 Uncharacterized protein8.7e-2629.5Show/hide
Query:  ASLWSLSSFSAPALAKADYHSSYFWQGMVWCRDLLVTGIRKVVGNGSSINFFKDLWIPRENTFRPLSPCLDPSRIHWVADFITSTRNWDVQKLRSIVGEE
        A  +S+SSF     +    + S  WQG+VW ++LL+ G+R  VG+G +I +  D WIP    F+PL   +  +  H V+ FITS R+W+++ L +     
Subjt:  ASLWSLSSFSAPALAKADYHSSYFWQGMVWCRDLLVTGIRKVVGNGSSINFFKDLWIPRENTFRPLSPCLDPSRIHWVADFITSTRNWDVQKLRSIVGEE

Query:  DVATISNIHISAVGGLDRWMWHFTKNGSYT-----------EETVDH--ALVGCKRASKIWSLLLPALDSSNNFNGCFVDRWTKVTSDLSQKEVT-----
        DV  I  I ++     D+ +W+F  NG+YT           EE + +   L      +K WSL LP+     ++      R  K+ +D +  +VT     
Subjt:  DVATISNIHISAVGGLDRWMWHFTKNGSYT-----------EETVDH--ALVGCKRASKIWSLLLPALDSSNNFNGCFVDRWTKVTSDLSQKEVT-----

Query:  -----------LAAFGKP------PFEIEGISLAIRM------GFSKVMVESDSLEAVKLIEKNEGWLNEVASLIVDIRHLCLNFDDFRIQHVRREGNRA
                   +AAF KP      P  +E +SL   +      G     +E+DSL   K ++     +++   L+ DI  L   F   +I HV R  N A
Subjt:  -----------LAAFGKP------PFEIEGISLAIRM------GFSKVMVESDSLEAVKLIEKNEGWLNEVASLIVDIRHLCLNFDDFRIQHVRREGNRA

Query:  AHILARNAISINSPLLWLCDFP
        AH+LA+ A+S++    WL + P
Subjt:  AHILARNAISINSPLLWLCDFP

A0A803P9C5 Uncharacterized protein1.7e-2123.56Show/hide
Query:  SSYFWQGMVWCRDLLVTGIRKVVGNGSSINFFKDLWIPRENTFRPLSPCLDPSRIHWVADFITSTRNWDVQKLRSIVGEEDVATISNIHISAVGGLDRWM
        SS  WQG+VW R+LL  G+R  VG GSSI+   D WIP   +F+P     D S +  VAD+I+S R W+++ L +     DV  I  I +S +   DRW+
Subjt:  SSYFWQGMVWCRDLLVTGIRKVVGNGSSINFFKDLWIPRENTFRPLSPCLDPSRIHWVADFITSTRNWDVQKLRSIVGEEDVATISNIHISAVGGLDRWM

Query:  WHFTKNGSY----------------------TEET-------VDHALVGCKRASKIWSLLLPALDSSN--------------------------------
        WH+  +G Y                      T+ET       + HAL  C  A  +W+L    +D SN                                
Subjt:  WHFTKNGSY----------------------TEET-------VDHALVGCKRASKIWSLLLPALDSSN--------------------------------

Query:  -NFNGCFVDRWTKVTSDLSQKEVTLAAFGKPP---FEIE------------GISLAIRMGFSKVM-----------------------------------
         +    ++   +     L    +  A +  PP   +++             GI + +R    +V+                                   
Subjt:  -NFNGCFVDRWTKVTSDLSQKEVTLAAFGKPP---FEIE------------GISLAIRMGFSKVM-----------------------------------

Query:  --VESDSLEAVKLIEKNEGWLNEVASLIVDIRHLCLNFDDFRIQHVRREGNRAAHILARNAISINSPLLWLCDFPVWLLSSV
          VE+D L  V  +       +    L+ D+     +F +  + HV+R+ N+AAH LA+ A+ +++  +WL + P  + S V
Subjt:  --VESDSLEAVKLIEKNEGWLNEVASLIVDIRHLCLNFDDFRIQHVRREGNRAAHILARNAISINSPLLWLCDFPVWLLSSV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G13980.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.4e-0432.14Show/hide
Query:  GFSKVMVESDSLEAVKLIEKNEGWLNEVASLIVDIRHLCLNFDDFRIQHVRREGNRAAHILARNAISINSPLLW--LCDFPVWL
        G+ +V++E D      L+  +    N +A+L+ DIR     F + +   VRR GN+ AH LA+  +  NS   +   C  P+WL
Subjt:  GFSKVMVESDSLEAVKLIEKNEGWLNEVASLIVDIRHLCLNFDDFRIQHVRREGNRAAHILARNAISINSPLLW--LCDFPVWL

AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein5.4e-0427.18Show/hide
Query:  EIEGISLAI----RMGFSKVMVESDSLEAVKLIEKNEGWLNEVASLIVDIRHLCLNFDDFRIQHVRREGNRAAHILARNAISINS--PLLWLCDFPVWLL
        E+E +  A+    R  + +++ ESD+   V L+  ++ W   +   + DI+ L  +F++ + +   R GN+ A  +AR +IS ++  P L+    P WL 
Subjt:  EIEGISLAI----RMGFSKVMVESDSLEAVKLIEKNEGWLNEVASLIVDIRHLCLNFDDFRIQHVRREGNRAAHILARNAISINS--PLLWLCDFPVWLL

Query:  SSV
        S++
Subjt:  SSV

AT5G42965.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein3.1e-0737.14Show/hide
Query:  RMGFSKVMVESDSLEAVKLIEKNEGWLNEVASLIVDIRHLCLNFDDFRIQHVRREGNRAAHILARNAISI
        R  + KV+ ESDS   V L+  +  + N +A  I DIRH+   F++ ++ H++REGN  A  +AR ++S+
Subjt:  RMGFSKVMVESDSLEAVKLIEKNEGWLNEVASLIVDIRHLCLNFDDFRIQHVRREGNRAAHILARNAISI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCGTAGAAGACCTGCAAAACAGAAATCTTCCGTAGCTAACCCTAATTCACCGTTGCAGGGCCTGGGCGACATCGTCGATCGAAGACGGGGGATCACTAGCTGGGC
CGTGGGCCGAGCACGGCTCGGGACCGAGCCCCGGGTCGGGGCCGAGCCCTTCGTGGCTCAGTGGGCTTCCCTCTGGTCGCTTTCCTCGTTCTCTGCCCCGGCGTTAGCAA
AGGCGGACTACCACTCATCTTACTTTTGGCAAGGGATGGTTTGGTGTCGCGATTTGTTGGTTACGGGAATTAGAAAGGTGGTTGGAAATGGATCTTCAATTAATTTTTTC
AAAGATCTGTGGATTCCGCGAGAGAACACTTTCAGGCCTTTAAGTCCCTGTCTTGATCCGTCCAGAATTCATTGGGTAGCTGATTTTATTACTTCCACTCGAAATTGGGA
TGTCCAAAAGCTCAGGTCCATAGTTGGAGAAGAAGATGTGGCTACCATATCTAATATTCATATTAGTGCTGTGGGGGGTTTGGATAGATGGATGTGGCATTTTACGAAGA
ATGGCTCCTATACGGAGGAGACAGTGGATCACGCTCTGGTGGGTTGTAAACGGGCGTCCAAAATCTGGTCTTTGCTTCTGCCAGCATTAGATTCTTCTAATAATTTTAAC
GGCTGTTTTGTCGACCGGTGGACTAAGGTGACGTCGGATCTTTCGCAAAAAGAAGTCACCCTCGCTGCGTTTGGGAAGCCCCCTTTTGAGATAGAAGGAATTAGTTTGGC
GATCCGGATGGGTTTCAGTAAGGTGATGGTCGAATCTGACTCCCTCGAAGCGGTTAAACTTATAGAGAAGAATGAGGGGTGGTTAAATGAAGTGGCTTCATTGATTGTTG
ACATCAGACACTTGTGTTTGAATTTTGACGATTTTCGAATTCAGCATGTTCGTCGTGAGGGAAATAGGGCCGCTCATATTCTGGCTAGAAATGCCATTTCTATAAATAGT
CCCCTTCTTTGGCTTTGTGATTTTCCAGTTTGGCTTCTCAGCAGTGTGAAAGCTGATGATCGCACTAATGTAGCCTTGGCAAGTTATTTTCTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCTCGTAGAAGACCTGCAAAACAGAAATCTTCCGTAGCTAACCCTAATTCACCGTTGCAGGGCCTGGGCGACATCGTCGATCGAAGACGGGGGATCACTAGCTGGGC
CGTGGGCCGAGCACGGCTCGGGACCGAGCCCCGGGTCGGGGCCGAGCCCTTCGTGGCTCAGTGGGCTTCCCTCTGGTCGCTTTCCTCGTTCTCTGCCCCGGCGTTAGCAA
AGGCGGACTACCACTCATCTTACTTTTGGCAAGGGATGGTTTGGTGTCGCGATTTGTTGGTTACGGGAATTAGAAAGGTGGTTGGAAATGGATCTTCAATTAATTTTTTC
AAAGATCTGTGGATTCCGCGAGAGAACACTTTCAGGCCTTTAAGTCCCTGTCTTGATCCGTCCAGAATTCATTGGGTAGCTGATTTTATTACTTCCACTCGAAATTGGGA
TGTCCAAAAGCTCAGGTCCATAGTTGGAGAAGAAGATGTGGCTACCATATCTAATATTCATATTAGTGCTGTGGGGGGTTTGGATAGATGGATGTGGCATTTTACGAAGA
ATGGCTCCTATACGGAGGAGACAGTGGATCACGCTCTGGTGGGTTGTAAACGGGCGTCCAAAATCTGGTCTTTGCTTCTGCCAGCATTAGATTCTTCTAATAATTTTAAC
GGCTGTTTTGTCGACCGGTGGACTAAGGTGACGTCGGATCTTTCGCAAAAAGAAGTCACCCTCGCTGCGTTTGGGAAGCCCCCTTTTGAGATAGAAGGAATTAGTTTGGC
GATCCGGATGGGTTTCAGTAAGGTGATGGTCGAATCTGACTCCCTCGAAGCGGTTAAACTTATAGAGAAGAATGAGGGGTGGTTAAATGAAGTGGCTTCATTGATTGTTG
ACATCAGACACTTGTGTTTGAATTTTGACGATTTTCGAATTCAGCATGTTCGTCGTGAGGGAAATAGGGCCGCTCATATTCTGGCTAGAAATGCCATTTCTATAAATAGT
CCCCTTCTTTGGCTTTGTGATTTTCCAGTTTGGCTTCTCAGCAGTGTGAAAGCTGATGATCGCACTAATGTAGCCTTGGCAAGTTATTTTCTTTAA
Protein sequenceShow/hide protein sequence
MSRRRPAKQKSSVANPNSPLQGLGDIVDRRRGITSWAVGRARLGTEPRVGAEPFVAQWASLWSLSSFSAPALAKADYHSSYFWQGMVWCRDLLVTGIRKVVGNGSSINFF
KDLWIPRENTFRPLSPCLDPSRIHWVADFITSTRNWDVQKLRSIVGEEDVATISNIHISAVGGLDRWMWHFTKNGSYTEETVDHALVGCKRASKIWSLLLPALDSSNNFN
GCFVDRWTKVTSDLSQKEVTLAAFGKPPFEIEGISLAIRMGFSKVMVESDSLEAVKLIEKNEGWLNEVASLIVDIRHLCLNFDDFRIQHVRREGNRAAHILARNAISINS
PLLWLCDFPVWLLSSVKADDRTNVALASYFL