; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0010074 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0010074
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRNase H domain-containing protein
Genome locationchr9:44406640..44409858
RNA-Seq ExpressionLag0010074
SyntenyLag0010074
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EEE69144.1 hypothetical protein OsJ_28268 [Oryza sativa Japonica Group]2.1e-1524.37Show/hide
Query:  WKIKSIPKAKLCAWIIIYDSIPSSSNIRKKGIDSNLPCSFAG-------------------------NMRNQPFTPFGN--------------EKELEIA
        WK  +  K K+ AW  I +S+ +  N +K+ ++ +  C+  G                         +M+  P +   N               ++  + 
Subjt:  WKIKSIPKAKLCAWIIIYDSIPSSSNIRKKGIDSNLPCSFAG-------------------------NMRNQPFTPFGN--------------EKELEIA

Query:  ILILWELWNLRNKVIHNAEIP--------------NQNSIIRSVEANVKEGEAYLN--LDSAREIPRSKNQMSHVPWSSPPLGSWTLNVDASRNDSKFLG
        +++LW +W+ RN+++H    P              +   I +  +AN ++G+  ++  L  +  I RS+ + +   WS P  GS  LNVD S  +S+  G
Subjt:  ILILWELWNLRNKVIHNAEIP--------------NQNSIIRSVEANVKEGEAYLN--LDSAREIPRSKNQMSHVPWSSPPLGSWTLNVDASRNDSKFLG

Query:  GVRWIWRDSSGSPICEGCSSVKGRWAIKMLEMRAILEGLRSLPTIRASRSNLTIPPIVVMSDASGVINLLNKADSDHSEISFMVSEIESLVAEIGSISFV
        G+  + R+ +G  I   C  V    +   +E+ A  +GL       A     T+ PIV+ +D   +++L   A    SE++F+++EI+SL+     ISF 
Subjt:  GVRWIWRDSSGSPICEGCSSVKGRWAIKMLEMRAILEGLRSLPTIRASRSNLTIPPIVVMSDASGVINLLNKADSDHSEISFMVSEIESLVAEIGSISFV

Query:  HCPRSHNEDAHKLARQ
         C RS N  +H LA +
Subjt:  HCPRSHNEDAHKLARQ

KAF4360260.1 hypothetical protein F8388_020551 [Cannabis sativa]3.2e-1623.51Show/hide
Query:  WKRFWKIKSIPKAKLCAWIIIYDSIPSSSNIRKKGIDSNLPCSFAGNM------------------------RNQPFTPFG-------------NEKELE
        WK  W ++  P+ KL  W +  + +PS +N+  +G+  +  C   GN                         ++  +   G             N  E E
Subjt:  WKRFWKIKSIPKAKLCAWIIIYDSIPSSSNIRKKGIDSNLPCSFAGNM------------------------RNQPFTPFG-------------NEKELE

Query:  IAILILWELWNLRNKVIHNAEIPNQNSIIRSVEANVKEGEAYLNLDSAREIPRSKNQMSHVPWSSPPLGSWTLNVDASRNDSKFLGGVRWIWRDSSGSPI
         AI I+W +W  RN+  +N  + N   ++  V +        +  D    +  SK       W  PP G   +N DA+ N      G  +IWR+  G  +
Subjt:  IAILILWELWNLRNKVIHNAEIPNQNSIIRSVEANVKEGEAYLNLDSAREIPRSKNQMSHVPWSSPPLGSWTLNVDASRNDSKFLGGVRWIWRDSSGSPI

Query:  CEGCSSVKGRWAIKMLEMRAILEGLRSLPTIRASRSNLTIPPIVVMSDASGVINLLNKADSDHSEISFMVSEIESLVAEIGSISFVHCPRSHNEDAHKLA
          G +      ++ M E  AILE L ++P       N T  P+ + SD   +++ +   D+  + ++ ++ +I   + ++   S VH  R++NE AH LA
Subjt:  CEGCSSVKGRWAIKMLEMRAILEGLRSLPTIRASRSNLTIPPIVVMSDASGVINLLNKADSDHSEISFMVSEIESLVAEIGSISFVHCPRSHNEDAHKLA

Query:  RQ
        R+
Subjt:  RQ

XP_015382610.1 uncharacterized protein LOC107175578 [Citrus sinensis]2.7e-1527.88Show/hide
Query:  NEKELEIAILILWELWNLRNKVIHNAEIPNQNSIIRSVEANVKEGEAYLNLDSAREIPRSKNQM-SHVPWSSPPLGSWTLNVDASRNDSKFLGGVRWIWR
        N+ +LE+ I+I W +WN RN+++   +  N  +++   EA +   EAY  +    ++ +   Q+ +   W+ P  G   +NVDA+ +  K L G+  I R
Subjt:  NEKELEIAILILWELWNLRNKVIHNAEIPNQNSIIRSVEANVKEGEAYLNLDSAREIPRSKNQM-SHVPWSSPPLGSWTLNVDASRNDSKFLGGVRWIWR

Query:  DSSGSPICEGCSSVKGRWAIKMLEMRAILEGLRSLPTIRASRSNLTIPPIVVMSDASGVINLLNKADSDHSEISFMVSEIESLVAEIGSISFVHCPRSHN
        D  G+ I       K    +   E   +  GL+          N +   ++V SDA  V+ L+N      SEI +++SEI++L+     +S  +  RS N
Subjt:  DSSGSPICEGCSSVKGRWAIKMLEMRAILEGLRSLPTIRASRSNLTIPPIVVMSDASGVINLLNKADSDHSEISFMVSEIESLVAEIGSISFVHCPRSHN

Query:  EDAHKLAR
           H LA+
Subjt:  EDAHKLAR

XP_022143535.1 uncharacterized protein LOC111013412 [Momordica charantia]2.1e-1529.82Show/hide
Query:  EKELEIAILILWELWNLRNKVIHNAEIPNQNSIIRSVE----------ANVKEGEAYLNLDSAREIPRSKNQMSHVPWSSPPLGSWTLNVDASRNDSKFL
        E+E   +++I W++W +RNK I     P    I  +++           N+K      +L   R I       +   W  P   SW LN +A+       
Subjt:  EKELEIAILILWELWNLRNKVIHNAEIPNQNSIIRSVE----------ANVKEGEAYLNLDSAREIPRSKNQMSHVPWSSPPLGSWTLNVDASRNDSKFL

Query:  GGVRWIWRDSSGSPICEGCSSVKGRWAIKMLEMRAILEGLRSLPTIRASRSNLTIPPIVVMSDASGVINLLNKADSDHSEISFMVSEIESLVAEIGSISF
        GG+ WI RD  G  I   C  ++    I  LE+ AI EGLR+   IR         PI + SD+   I+LL++   D +EI +++ EI  ++ ++  +S 
Subjt:  GGVRWIWRDSSGSPICEGCSSVKGRWAIKMLEMRAILEGLRSLPTIRASRSNLTIPPIVVMSDASGVINLLNKADSDHSEISFMVSEIESLVAEIGSISF

Query:  VHCPRSHNEDAHKLARQA
         H  R  N+ AH LAR+A
Subjt:  VHCPRSHNEDAHKLARQA

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia]4.5e-1831.43Show/hide
Query:  NEKELEIAILILWELWNLRNKVIHNAEIPNQNSIIRSVEANVKEGEAYLNLDSAREIPRSKNQMSHVPWSSPPLGSWTLNVDASRNDSKFLGGVRWIWRD
        ++++L++ ++  W +WN RN VI   E  + +++I+ +   V E  +Y +  S   + ++ N  + + W  PP+  WTLN DAS +DS   GG+ WI R 
Subjt:  NEKELEIAILILWELWNLRNKVIHNAEIPNQNSIIRSVEANVKEGEAYLNLDSAREIPRSKNQMSHVPWSSPPLGSWTLNVDASRNDSKFLGGVRWIWRD

Query:  SSGSPICEGCSSVKGRWAIKMLEMRAILEGLRSLPTIRASRSNLTIPPIVVMSDASGVINLLNKADSDHSEISFMVSEIESLVAEIGSISFVHCPRSHNE
          G  +  G   V+    +K+LE  AILEGLR+L  +   R      P+ + +D++ V +LLN+   D ++  ++V EI +L      ++F    R  N 
Subjt:  SSGSPICEGCSSVKGRWAIKMLEMRAILEGLRSLPTIRASRSNLTIPPIVVMSDASGVINLLNKADSDHSEISFMVSEIESLVAEIGSISFVHCPRSHNE

Query:  DAHKLARQAA
         AH LA++A+
Subjt:  DAHKLARQAA

TrEMBL top hitse value%identityAlignment
A0A6J1CP26 uncharacterized protein LOC1110134121.0e-1529.82Show/hide
Query:  EKELEIAILILWELWNLRNKVIHNAEIPNQNSIIRSVE----------ANVKEGEAYLNLDSAREIPRSKNQMSHVPWSSPPLGSWTLNVDASRNDSKFL
        E+E   +++I W++W +RNK I     P    I  +++           N+K      +L   R I       +   W  P   SW LN +A+       
Subjt:  EKELEIAILILWELWNLRNKVIHNAEIPNQNSIIRSVE----------ANVKEGEAYLNLDSAREIPRSKNQMSHVPWSSPPLGSWTLNVDASRNDSKFL

Query:  GGVRWIWRDSSGSPICEGCSSVKGRWAIKMLEMRAILEGLRSLPTIRASRSNLTIPPIVVMSDASGVINLLNKADSDHSEISFMVSEIESLVAEIGSISF
        GG+ WI RD  G  I   C  ++    I  LE+ AI EGLR+   IR         PI + SD+   I+LL++   D +EI +++ EI  ++ ++  +S 
Subjt:  GGVRWIWRDSSGSPICEGCSSVKGRWAIKMLEMRAILEGLRSLPTIRASRSNLTIPPIVVMSDASGVINLLNKADSDHSEISFMVSEIESLVAEIGSISF

Query:  VHCPRSHNEDAHKLARQA
         H  R  N+ AH LAR+A
Subjt:  VHCPRSHNEDAHKLARQA

A0A6J1DNV9 uncharacterized protein LOC1110224032.2e-1831.43Show/hide
Query:  NEKELEIAILILWELWNLRNKVIHNAEIPNQNSIIRSVEANVKEGEAYLNLDSAREIPRSKNQMSHVPWSSPPLGSWTLNVDASRNDSKFLGGVRWIWRD
        ++++L++ ++  W +WN RN VI   E  + +++I+ +   V E  +Y +  S   + ++ N  + + W  PP+  WTLN DAS +DS   GG+ WI R 
Subjt:  NEKELEIAILILWELWNLRNKVIHNAEIPNQNSIIRSVEANVKEGEAYLNLDSAREIPRSKNQMSHVPWSSPPLGSWTLNVDASRNDSKFLGGVRWIWRD

Query:  SSGSPICEGCSSVKGRWAIKMLEMRAILEGLRSLPTIRASRSNLTIPPIVVMSDASGVINLLNKADSDHSEISFMVSEIESLVAEIGSISFVHCPRSHNE
          G  +  G   V+    +K+LE  AILEGLR+L  +   R      P+ + +D++ V +LLN+   D ++  ++V EI +L      ++F    R  N 
Subjt:  SSGSPICEGCSSVKGRWAIKMLEMRAILEGLRSLPTIRASRSNLTIPPIVVMSDASGVINLLNKADSDHSEISFMVSEIESLVAEIGSISFVHCPRSHNE

Query:  DAHKLARQAA
         AH LA++A+
Subjt:  DAHKLARQAA

A0A6J1DSV1 uncharacterized protein LOC1110236085.0e-1530.14Show/hide
Query:  EKELEIAILILWELWNLRNKVI--------HNAEIPNQNSIIRSV--EANVKEGEAYLNLDSAREIPRSKNQMSHVPWSSPPLGSWTLNVDASRNDSKFL
        E+E   +++I W++W +RNK I         + ++     II S   + N+K   A  +L   R I       +   W  P   SW LN DA+       
Subjt:  EKELEIAILILWELWNLRNKVI--------HNAEIPNQNSIIRSV--EANVKEGEAYLNLDSAREIPRSKNQMSHVPWSSPPLGSWTLNVDASRNDSKFL

Query:  GGVRWIWRDSSGSPICEGCSSVKGRWAIKMLEMRAILEGLRSLPTIRASR-SNLTIPPIVVMSDASGVINLLNKADSDHSEISFMVSEIESLVAEIGSIS
        GG+ WI RD  G  I   C  ++    I  LE+ AI EGLR++              PI + SD+   I+LL++   D +EI +++ EI  ++ ++  +S
Subjt:  GGVRWIWRDSSGSPICEGCSSVKGRWAIKMLEMRAILEGLRSLPTIRASR-SNLTIPPIVVMSDASGVINLLNKADSDHSEISFMVSEIESLVAEIGSIS

Query:  FVHCPRSHNEDAHKLARQA
          H  R  N+ AH LAR+A
Subjt:  FVHCPRSHNEDAHKLARQA

A0A7C8ZRF8 Uncharacterized protein (Fragment)2.6e-1927.11Show/hide
Query:  WKRFWKIKSIPKAKLCAWIIIYDSIPSSSNIRKKGIDSNLPCSFAGNMRNQPFTPFGNEKELEIAILILWELWNLRNKVIHNAEIPNQNSII-------R
        W + W+ K+ P+ K  AW    D++P+  N+ +KG+  ++ C            PF +++   + +  LW +WN RNK I   +  N  ++I       R
Subjt:  WKRFWKIKSIPKAKLCAWIIIYDSIPSSSNIRKKGIDSNLPCSFAGNMRNQPFTPFGNEKELEIAILILWELWNLRNKVIHNAEIPNQNSII-------R

Query:  SVEANVKEGEAYLNLDSAREIPRSKNQMSHVPWSSPPLGSWTLNVDASRNDSKFLGGVRWIWRDSSGSPICEGCSSVKGRWAIKMLEMRAILEGLRSLPT
         +   +K+ E  L L S           ++V W  PP G   +N DAS    +F  G+  + RD  G  +   C+ + G W  +  E+ A+  G++    
Subjt:  SVEANVKEGEAYLNLDSAREIPRSKNQMSHVPWSSPPLGSWTLNVDASRNDSKFLGGVRWIWRDSSGSPICEGCSSVKGRWAIKMLEMRAILEGLRSLPT

Query:  IRASRSNLTIPPIVVMSDASGVINLLNKADSDHSEISFMVSEIESLVAEIGSISFVHCPRSHNEDAHKLARQA
              +L +  I++ +DA+GV + LN         S  V +   L+ +  S+ F+H  RS N+ AH+LAR A
Subjt:  IRASRSNLTIPPIVVMSDASGVINLLNKADSDHSEISFMVSEIESLVAEIGSISFVHCPRSHNEDAHKLARQA

B9FYJ3 Uncharacterized protein1.0e-1524.37Show/hide
Query:  WKIKSIPKAKLCAWIIIYDSIPSSSNIRKKGIDSNLPCSFAG-------------------------NMRNQPFTPFGN--------------EKELEIA
        WK  +  K K+ AW  I +S+ +  N +K+ ++ +  C+  G                         +M+  P +   N               ++  + 
Subjt:  WKIKSIPKAKLCAWIIIYDSIPSSSNIRKKGIDSNLPCSFAG-------------------------NMRNQPFTPFGN--------------EKELEIA

Query:  ILILWELWNLRNKVIHNAEIP--------------NQNSIIRSVEANVKEGEAYLN--LDSAREIPRSKNQMSHVPWSSPPLGSWTLNVDASRNDSKFLG
        +++LW +W+ RN+++H    P              +   I +  +AN ++G+  ++  L  +  I RS+ + +   WS P  GS  LNVD S  +S+  G
Subjt:  ILILWELWNLRNKVIHNAEIP--------------NQNSIIRSVEANVKEGEAYLN--LDSAREIPRSKNQMSHVPWSSPPLGSWTLNVDASRNDSKFLG

Query:  GVRWIWRDSSGSPICEGCSSVKGRWAIKMLEMRAILEGLRSLPTIRASRSNLTIPPIVVMSDASGVINLLNKADSDHSEISFMVSEIESLVAEIGSISFV
        G+  + R+ +G  I   C  V    +   +E+ A  +GL       A     T+ PIV+ +D   +++L   A    SE++F+++EI+SL+     ISF 
Subjt:  GVRWIWRDSSGSPICEGCSSVKGRWAIKMLEMRAILEGLRSLPTIRASRSNLTIPPIVVMSDASGVINLLNKADSDHSEISFMVSEIESLVAEIGSISFV

Query:  HCPRSHNEDAHKLARQ
         C RS N  +H LA +
Subjt:  HCPRSHNEDAHKLARQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G52990.1 thioredoxin family protein3.5e-0523.45Show/hide
Query:  SHVPWSSPPLGSWTLNVDASRNDSKFLGGVRWIWRDSSGSPICEGCSSVKGRWAIKMLEMRAILEGLRSLPTIRASRSNLTIPPIVVMSDASGVINLLNK
        ++V  SS P      N DAS ++   + G+ W+ R+S G+ +  G    +GR   +  E  A++  +++      ++       ++   D S V  L+N 
Subjt:  SHVPWSSPPLGSWTLNVDASRNDSKFLGGVRWIWRDSSGSPICEGCSSVKGRWAIKMLEMRAILEGLRSLPTIRASRSNLTIPPIVVMSDASGVINLLNK

Query:  ADSDHSEISFMVSEIESLVAEIGSISFVHCPRSHNEDAHKLARQA
          SD+  +   +  I+S +    S  F+   R  N+ A  L ++A
Subjt:  ADSDHSEISFMVSEIESLVAEIGSISFVHCPRSHNEDAHKLARQA

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.9e-0620.1Show/hide
Query:  ILWELWNLRNKVIHNAEIPNQNSIIRSVEANVKEGEAYLNLDSAREIPRSKNQMSHVPWSSPPLGSWTLNVDASRNDSKFLGGVRWIWRDSSGSPICEGC
        ++W +W   N ++ N       + +     + KE       +  +   R+ +   +  WS P       N DAS ++   + G+ WI R+S G+ I  G 
Subjt:  ILWELWNLRNKVIHNAEIPNQNSIIRSVEANVKEGEAYLNLDSAREIPRSKNQMSHVPWSSPPLGSWTLNVDASRNDSKFLGGVRWIWRDSSGSPICEGC

Query:  SSVKGRWAIKMLEMRAILEGLRSLPTIRASRSNLTIPPIVVMSDASGVINLLNKADSDHSEISFMVSEIESLVAEIGSISFVHCPRSHNEDAHKLARQA
           +GR   +  E   ++  +++       +       ++   D   +  ++N   S +  +   +  I+S +    SI F    R  N  A  LA+QA
Subjt:  SSVKGRWAIKMLEMRAILEGLRSLPTIRASRSNLTIPPIVVMSDASGVINLLNKADSDHSEISFMVSEIESLVAEIGSISFVHCPRSHNEDAHKLARQA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGAAAAGATTTTGGAAGATCAAATCTATTCCTAAGGCGAAGCTCTGCGCTTGGATAATCATTTACGACTCTATCCCTTCTTCTTCTAATATCAGGAAAAAAGGAAT
TGATTCTAACTTACCGTGCTCTTTTGCAGGAAATATGAGGAATCAACCATTCACTCCCTTTGGAAATGAGAAAGAGCTAGAAATCGCCATTCTGATTTTGTGGGAGCTTT
GGAACCTCAGAAATAAAGTTATTCACAACGCAGAAATTCCTAATCAGAATTCCATCATCAGATCAGTGGAAGCAAATGTTAAGGAAGGGGAAGCTTACCTCAATCTCGAT
TCTGCCAGGGAGATTCCAAGATCGAAGAACCAGATGAGTCATGTCCCCTGGTCTTCTCCGCCCCTCGGTTCGTGGACACTCAACGTGGATGCCTCCCGAAATGACTCTAA
ATTCTTAGGAGGGGTGAGGTGGATTTGGCGTGACTCCTCAGGTTCTCCCATCTGTGAGGGTTGTTCTAGCGTTAAAGGAAGATGGGCTATAAAGATGTTGGAGATGAGAG
CGATTCTCGAAGGGTTGCGAAGCCTTCCGACCATCCGGGCTTCGCGCTCAAATCTGACAATCCCGCCTATTGTGGTGATGTCCGACGCTAGTGGCGTGATCAACCTGCTG
AACAAAGCAGATTCAGACCATTCGGAAATTTCCTTTATGGTCTCTGAGATTGAAAGCCTGGTTGCTGAGATCGGGTCAATCTCTTTTGTCCATTGCCCGCGTTCGCATAA
TGAAGATGCTCACAAGCTTGCGCGCCAAGCGGCTTTTGGTTTCTCGGGGGGCTCTAGTTGTTTTTTGGCTTCTTCCAATTCGGAAGAAGGAATTGTTTTGTTGGGCCAAC
TTGGGCCTGACTTTATTTCTCCACGCTTTTATGGAGTTGTTGTAAAAAGAGCTGAGGAATGGATCCAAGGGGTAAAACCGACAAGTGGGACGGGCCAAGACCGAAGGGGT
CGGGCCTTGCCCGACCCCCTGCTCGGCCTCGGCCAAAGGCAGAGGCCGAGGCCATGGTCGGCCGGGCTTGTTCGGTTCCGCTTGGTCCCCACCGCCTCTAGCAGCCTCGG
TTCAGCCTGGTTTGTCCCGAAACGCCTCCGAATCCCTAAAGACCCTAGGAGGATGAGCAGGCCACGTCTTCCCCTCTTCTACAAATTTTACCGTTGGTGGCACACGAAGA
GAGAATTCACCTGGCCTTCACGTGCCGCCAACGGTAAATTTGTAGATAAGGGGGGAAGACGTGGCCTGCAAGACGGTAAACCTACACACCGGTGTGGTGCTAGTCACACC
GACTCCGATGTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTGGAAAAGATTTTGGAAGATCAAATCTATTCCTAAGGCGAAGCTCTGCGCTTGGATAATCATTTACGACTCTATCCCTTCTTCTTCTAATATCAGGAAAAAAGGAAT
TGATTCTAACTTACCGTGCTCTTTTGCAGGAAATATGAGGAATCAACCATTCACTCCCTTTGGAAATGAGAAAGAGCTAGAAATCGCCATTCTGATTTTGTGGGAGCTTT
GGAACCTCAGAAATAAAGTTATTCACAACGCAGAAATTCCTAATCAGAATTCCATCATCAGATCAGTGGAAGCAAATGTTAAGGAAGGGGAAGCTTACCTCAATCTCGAT
TCTGCCAGGGAGATTCCAAGATCGAAGAACCAGATGAGTCATGTCCCCTGGTCTTCTCCGCCCCTCGGTTCGTGGACACTCAACGTGGATGCCTCCCGAAATGACTCTAA
ATTCTTAGGAGGGGTGAGGTGGATTTGGCGTGACTCCTCAGGTTCTCCCATCTGTGAGGGTTGTTCTAGCGTTAAAGGAAGATGGGCTATAAAGATGTTGGAGATGAGAG
CGATTCTCGAAGGGTTGCGAAGCCTTCCGACCATCCGGGCTTCGCGCTCAAATCTGACAATCCCGCCTATTGTGGTGATGTCCGACGCTAGTGGCGTGATCAACCTGCTG
AACAAAGCAGATTCAGACCATTCGGAAATTTCCTTTATGGTCTCTGAGATTGAAAGCCTGGTTGCTGAGATCGGGTCAATCTCTTTTGTCCATTGCCCGCGTTCGCATAA
TGAAGATGCTCACAAGCTTGCGCGCCAAGCGGCTTTTGGTTTCTCGGGGGGCTCTAGTTGTTTTTTGGCTTCTTCCAATTCGGAAGAAGGAATTGTTTTGTTGGGCCAAC
TTGGGCCTGACTTTATTTCTCCACGCTTTTATGGAGTTGTTGTAAAAAGAGCTGAGGAATGGATCCAAGGGGTAAAACCGACAAGTGGGACGGGCCAAGACCGAAGGGGT
CGGGCCTTGCCCGACCCCCTGCTCGGCCTCGGCCAAAGGCAGAGGCCGAGGCCATGGTCGGCCGGGCTTGTTCGGTTCCGCTTGGTCCCCACCGCCTCTAGCAGCCTCGG
TTCAGCCTGGTTTGTCCCGAAACGCCTCCGAATCCCTAAAGACCCTAGGAGGATGAGCAGGCCACGTCTTCCCCTCTTCTACAAATTTTACCGTTGGTGGCACACGAAGA
GAGAATTCACCTGGCCTTCACGTGCCGCCAACGGTAAATTTGTAGATAAGGGGGGAAGACGTGGCCTGCAAGACGGTAAACCTACACACCGGTGTGGTGCTAGTCACACC
GACTCCGATGTTTAA
Protein sequenceShow/hide protein sequence
MWKRFWKIKSIPKAKLCAWIIIYDSIPSSSNIRKKGIDSNLPCSFAGNMRNQPFTPFGNEKELEIAILILWELWNLRNKVIHNAEIPNQNSIIRSVEANVKEGEAYLNLD
SAREIPRSKNQMSHVPWSSPPLGSWTLNVDASRNDSKFLGGVRWIWRDSSGSPICEGCSSVKGRWAIKMLEMRAILEGLRSLPTIRASRSNLTIPPIVVMSDASGVINLL
NKADSDHSEISFMVSEIESLVAEIGSISFVHCPRSHNEDAHKLARQAAFGFSGGSSCFLASSNSEEGIVLLGQLGPDFISPRFYGVVVKRAEEWIQGVKPTSGTGQDRRG
RALPDPLLGLGQRQRPRPWSAGLVRFRLVPTASSSLGSAWFVPKRLRIPKDPRRMSRPRLPLFYKFYRWWHTKREFTWPSRAANGKFVDKGGRRGLQDGKPTHRCGASHT
DSDV