; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg033252 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg033252
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold5:4366974..4372519
RNA-Seq ExpressionSpg033252
SyntenySpg033252
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAB4262994.1 unnamed protein product [Prunus armeniaca]4.8e-2029.43Show/hide
Query:  MVAGFITETGAWNVELLKEAVGDDDFNIIRRIP-ISLRISDSYLWHYDKYDKYTVKSGYGLFMKNRIE------QASSSSNPMSKVWNIVWKLKVPSKVK
        +V    T +G WNV LLK+   D +   I +IP +SL   D  +WHY++   Y+VKSGY L    R+E      ++S+  N  S+ W  +W LK+P+K+K
Subjt:  MVAGFITETGAWNVELLKEAVGDDDFNIIRRIP-ISLRISDSYLWHYDKYDKYTVKSGYGLFMKNRIE------QASSSSNPMSKVWNIVWKLKVPSKVK

Query:  FFCWKAL--------------------RWLALCEELSWEELRIVAVTCWAIWGDRNKKIHEI--QISSQEI-RSRLIINYLVEID---HLRPGLLLQPLS
        FF W+                       W A+      EE  + A   W +W  R+  I E   + ++Q + R   +     +++   H   G    P +
Subjt:  FFCWKAL--------------------RWLALCEELSWEELRIVAVTCWAIWGDRNKKIHEI--QISSQEI-RSRLIINYLVEID---HLRPGLLLQPLS

Query:  G------------KSMLMRAW--NELSTGIGVICRNERGDILRACCKFLDLITLPPPMAELMAIIEGLAFAISMGGNNLIIE
                     K  + RA    +   G+GV+ RNE GD + AC + +   +      ELMA IEGL FAI MG  + ++E
Subjt:  G------------KSMLMRAW--NELSTGIGVICRNERGDILRACCKFLDLITLPPPMAELMAIIEGLAFAISMGGNNLIIE

CAB4309574.1 unnamed protein product [Prunus armeniaca]2.2e-2030.18Show/hide
Query:  MVAGFITETGAWNVELLKEAVGDDDFNIIRRIP-ISLRISDSYLWHYDKYDKYTVKSGY--GLFMKNRI-EQASSSSNPMSKVWNIVWKLKVPSKVKFFC
        +V    T +G WNV LLK+   D + + I RIP +SL   D  LWHY++   Y+V SGY  G   K+++  ++S+ ++  S  W  +W LK+P+K+KFF 
Subjt:  MVAGFITETGAWNVELLKEAVGDDDFNIIRRIP-ISLRISDSYLWHYDKYDKYTVKSGY--GLFMKNRI-EQASSSSNPMSKVWNIVWKLKVPSKVKFFC

Query:  WKALRWLALCEELSWEEL-----------RIVAVTCWAIWGD-------RNKKIHEIQISSQEIRSRLIINYLVEIDHLRPGLLLQPLSGKSMLMRAW--
        W+ +  L  C +  ++             R V     AIWG        RN     +Q+  +  +     +    + H+  G   QP S   +L + W  
Subjt:  WKALRWLALCEELSWEEL-----------RIVAVTCWAIWGD-------RNKKIHEIQISSQEIRSRLIINYLVEIDHLRPGLLLQPLSGKSMLMRAW--

Query:  ----------------NELSTGIGVICRNERGDILRACCKFLDLITLPPPMAELMAIIEGLAFAISMGGNNLIIE
                         +   G+GV+ RNE+G+ + AC ++L   +      ELMA IEGL FAI MG  + I+E
Subjt:  ----------------NELSTGIGVICRNERGDILRACCKFLDLITLPPPMAELMAIIEGLAFAISMGGNNLIIE

EEC68887.1 hypothetical protein OsI_37529 [Oryza sativa Indica Group]1.6e-1827.61Show/hide
Query:  VAGFITETGAWNVELLKEAVGDDDFNIIRRIPISLRISDSYL-WHYDKYDKYTVKSGYGLF--MKNRIEQASSSSNPMSKVWNIVWKLKVPSKVKFFCWK
        V+  ITE G+W+V  + +   + D  +I  I IS R  + ++ WH DK   ++V+S Y L   + N  E +SS +N ++K W ++WK KVP KVK F W+
Subjt:  VAGFITETGAWNVELLKEAVGDDDFNIIRRIPISLRISDSYL-WHYDKYDKYTVKSGYGLF--MKNRIEQASSSSNPMSKVWNIVWKLKVPSKVKFFCWK

Query:  -ALRWLALCEELSWEELRIVAVTCWAIWGDRNKKIHEIQISSQEIRSRLIINYLVEIDHLRPGLLLQPLSGKSMLMR-----------------------
         A   LA         + +  +T W  W  RN+ IH       E   R I +Y+  +  +R       + GK ++                         
Subjt:  -ALRWLALCEELSWEELRIVAVTCWAIWGDRNKKIHEIQISSQEIRSRLIINYLVEIDHLRPGLLLQPLSGKSMLMR-----------------------

Query:  -AWNELST-----------GIGVICRNERGDILRACCKFLDLITLPPPMAELMAIIEGLAFAISMGGNNLIIETYCLQAFNLV-GI--------QCTRSH
          W +L+            G+G+I RN  GDI+   CK L+     P  +EL A +EGL  AI      + +ET C     L+ GI              
Subjt:  -AWNELST-----------GIGVICRNERGDILRACCKFLDLITLPPPMAELMAIIEGLAFAISMGGNNLIIETYCLQAFNLV-GI--------QCTRSH

Query:  RKIMI-EKILIWIKTCTPVWCLPHRL
        R +++ E+++   K C    C+ H L
Subjt:  RKIMI-EKILIWIKTCTPVWCLPHRL

ONI01138.1 hypothetical protein PRUPE_6G123900 [Prunus persica]5.9e-1826.71Show/hide
Query:  VAGFITETGAWNVELLKEAVGDDDFNIIRRIPI-SLRISDSYLWHYDKYDKYTVKSGY---GLFMKNRIEQASSSSNPMSKVWNIVWKLKVPSKVKFFCW
        V    T +G WNV LLK+   D + + I +IP+ SL   D  +WHY++   Y+VKSGY   GL       + S+  +  SK W  +W LK+P+K+KFF W
Subjt:  VAGFITETGAWNVELLKEAVGDDDFNIIRRIPI-SLRISDSYLWHYDKYDKYTVKSGY---GLFMKNRIEQASSSSNPMSKVWNIVWKLKVPSKVKFFCW

Query:  KAL----------------------------------------------------------------RWLALCEELSWEELRIVAVTCWAIWGDRNKKIH
        +                                                                   W AL    S EE  + A  CW +W  RN  I 
Subjt:  KAL----------------------------------------------------------------RWLALCEELSWEELRIVAVTCWAIWGDRNKKIH

Query:  EIQISSQ-EIRSRLI--------INYLVEIDHLRPGLLLQPLSG-KSMLMRAWNELSTGIGVICRNERGDILRACCKFLDLITLPPPMAELMAIIEGLAF
        E +  +  ++ SR+          N ++   H R      PL G +        +   G+GV+ RN  G+ + AC + +   +      ELMA IEGL F
Subjt:  EIQISSQ-EIRSRLI--------INYLVEIDHLRPGLLLQPLSG-KSMLMRAWNELSTGIGVICRNERGDILRACCKFLDLITLPPPMAELMAIIEGLAF

Query:  AISMGGNNLIIETYCLQAFNLV
        AI MG  + I+E       N +
Subjt:  AISMGGNNLIIETYCLQAFNLV

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]4.7e-2326.04Show/hide
Query:  MALNDSYAQEMVAGFITETGAWNVELLKEAVGDDDFNIIRRIPI-SLRISDSYLWHYDKYDKYTVKSGYGLFMKNRIEQASSSSNPMSKVWNIVWKLKVP
        +  N+      VA FIT  G W+V  +  +  ++D ++I  +PI S  + DS+LWHYDK   Y+V+SGY L+M  +    S+S+N     WN +WKL VP
Subjt:  MALNDSYAQEMVAGFITETGAWNVELLKEAVGDDDFNIIRRIPI-SLRISDSYLWHYDKYDKYTVKSGYGLFMKNRIEQASSSSNPMSKVWNIVWKLKVP

Query:  SKVKFFCWKALR-----------------------------------------------------------------WLALCEELSWEELRIVAVTCWAI
        +K+K F W++                                                                   W +L E+L  ++L + A+T W I
Subjt:  SKVKFFCWKALR-----------------------------------------------------------------WLALCEELSWEELRIVAVTCWAI

Query:  WGDRNKKIHEIQISSQEIRSRLIINYL--------------VEIDHLRPGLLLQPLSGKSMLMR---AWNELSTGIGVICRNERGDILRACCKFLDLITL
        W DRN  IH  Q+S  E +   +  +L               + +H       +P S  S+ +    A    ST  G I R+    ++ A         L
Subjt:  WGDRNKKIHEIQISSQEIRSRLIINYL--------------VEIDHLRPGLLLQPLSGKSMLMR---AWNELSTGIGVICRNERGDILRACCKFLDLITL

Query:  PPPMAELMAIIEGLAFAISMGGNNLIIETYCLQAFNLV
         P +AE+  I+EGL FA +    +L +E+  L A  L+
Subjt:  PPPMAELMAIIEGLAFAISMGGNNLIIETYCLQAFNLV

TrEMBL top hitse value%identityAlignment
A0A6J1DX30 uncharacterized protein LOC1110248742.3e-2326.04Show/hide
Query:  MALNDSYAQEMVAGFITETGAWNVELLKEAVGDDDFNIIRRIPI-SLRISDSYLWHYDKYDKYTVKSGYGLFMKNRIEQASSSSNPMSKVWNIVWKLKVP
        +  N+      VA FIT  G W+V  +  +  ++D ++I  +PI S  + DS+LWHYDK   Y+V+SGY L+M  +    S+S+N     WN +WKL VP
Subjt:  MALNDSYAQEMVAGFITETGAWNVELLKEAVGDDDFNIIRRIPI-SLRISDSYLWHYDKYDKYTVKSGYGLFMKNRIEQASSSSNPMSKVWNIVWKLKVP

Query:  SKVKFFCWKALR-----------------------------------------------------------------WLALCEELSWEELRIVAVTCWAI
        +K+K F W++                                                                   W +L E+L  ++L + A+T W I
Subjt:  SKVKFFCWKALR-----------------------------------------------------------------WLALCEELSWEELRIVAVTCWAI

Query:  WGDRNKKIHEIQISSQEIRSRLIINYL--------------VEIDHLRPGLLLQPLSGKSMLMR---AWNELSTGIGVICRNERGDILRACCKFLDLITL
        W DRN  IH  Q+S  E +   +  +L               + +H       +P S  S+ +    A    ST  G I R+    ++ A         L
Subjt:  WGDRNKKIHEIQISSQEIRSRLIINYL--------------VEIDHLRPGLLLQPLSGKSMLMR---AWNELSTGIGVICRNERGDILRACCKFLDLITL

Query:  PPPMAELMAIIEGLAFAISMGGNNLIIETYCLQAFNLV
         P +AE+  I+EGL FA +    +L +E+  L A  L+
Subjt:  PPPMAELMAIIEGLAFAISMGGNNLIIETYCLQAFNLV

A0A6J5TGK7 CCHC-type domain-containing protein2.3e-2029.43Show/hide
Query:  MVAGFITETGAWNVELLKEAVGDDDFNIIRRIP-ISLRISDSYLWHYDKYDKYTVKSGYGLFMKNRIE------QASSSSNPMSKVWNIVWKLKVPSKVK
        +V    T +G WNV LLK+   D +   I +IP +SL   D  +WHY++   Y+VKSGY L    R+E      ++S+  N  S+ W  +W LK+P+K+K
Subjt:  MVAGFITETGAWNVELLKEAVGDDDFNIIRRIP-ISLRISDSYLWHYDKYDKYTVKSGYGLFMKNRIE------QASSSSNPMSKVWNIVWKLKVPSKVK

Query:  FFCWKAL--------------------RWLALCEELSWEELRIVAVTCWAIWGDRNKKIHEI--QISSQEI-RSRLIINYLVEID---HLRPGLLLQPLS
        FF W+                       W A+      EE  + A   W +W  R+  I E   + ++Q + R   +     +++   H   G    P +
Subjt:  FFCWKAL--------------------RWLALCEELSWEELRIVAVTCWAIWGDRNKKIHEI--QISSQEI-RSRLIINYLVEID---HLRPGLLLQPLS

Query:  G------------KSMLMRAW--NELSTGIGVICRNERGDILRACCKFLDLITLPPPMAELMAIIEGLAFAISMGGNNLIIE
                     K  + RA    +   G+GV+ RNE GD + AC + +   +      ELMA IEGL FAI MG  + ++E
Subjt:  G------------KSMLMRAW--NELSTGIGVICRNERGDILRACCKFLDLITLPPPMAELMAIIEGLAFAISMGGNNLIIE

A0A6J5XES2 Uncharacterized protein1.1e-2030.18Show/hide
Query:  MVAGFITETGAWNVELLKEAVGDDDFNIIRRIP-ISLRISDSYLWHYDKYDKYTVKSGY--GLFMKNRI-EQASSSSNPMSKVWNIVWKLKVPSKVKFFC
        +V    T +G WNV LLK+   D + + I RIP +SL   D  LWHY++   Y+V SGY  G   K+++  ++S+ ++  S  W  +W LK+P+K+KFF 
Subjt:  MVAGFITETGAWNVELLKEAVGDDDFNIIRRIP-ISLRISDSYLWHYDKYDKYTVKSGY--GLFMKNRI-EQASSSSNPMSKVWNIVWKLKVPSKVKFFC

Query:  WKALRWLALCEELSWEEL-----------RIVAVTCWAIWGD-------RNKKIHEIQISSQEIRSRLIINYLVEIDHLRPGLLLQPLSGKSMLMRAW--
        W+ +  L  C +  ++             R V     AIWG        RN     +Q+  +  +     +    + H+  G   QP S   +L + W  
Subjt:  WKALRWLALCEELSWEEL-----------RIVAVTCWAIWGD-------RNKKIHEIQISSQEIRSRLIINYLVEIDHLRPGLLLQPLSGKSMLMRAW--

Query:  ----------------NELSTGIGVICRNERGDILRACCKFLDLITLPPPMAELMAIIEGLAFAISMGGNNLIIE
                         +   G+GV+ RNE+G+ + AC ++L   +      ELMA IEGL FAI MG  + I+E
Subjt:  ----------------NELSTGIGVICRNERGDILRACCKFLDLITLPPPMAELMAIIEGLAFAISMGGNNLIIE

B8BN96 Reverse transcriptase domain-containing protein7.5e-1927.61Show/hide
Query:  VAGFITETGAWNVELLKEAVGDDDFNIIRRIPISLRISDSYL-WHYDKYDKYTVKSGYGLF--MKNRIEQASSSSNPMSKVWNIVWKLKVPSKVKFFCWK
        V+  ITE G+W+V  + +   + D  +I  I IS R  + ++ WH DK   ++V+S Y L   + N  E +SS +N ++K W ++WK KVP KVK F W+
Subjt:  VAGFITETGAWNVELLKEAVGDDDFNIIRRIPISLRISDSYL-WHYDKYDKYTVKSGYGLF--MKNRIEQASSSSNPMSKVWNIVWKLKVPSKVKFFCWK

Query:  -ALRWLALCEELSWEELRIVAVTCWAIWGDRNKKIHEIQISSQEIRSRLIINYLVEIDHLRPGLLLQPLSGKSMLMR-----------------------
         A   LA         + +  +T W  W  RN+ IH       E   R I +Y+  +  +R       + GK ++                         
Subjt:  -ALRWLALCEELSWEELRIVAVTCWAIWGDRNKKIHEIQISSQEIRSRLIINYLVEIDHLRPGLLLQPLSGKSMLMR-----------------------

Query:  -AWNELST-----------GIGVICRNERGDILRACCKFLDLITLPPPMAELMAIIEGLAFAISMGGNNLIIETYCLQAFNLV-GI--------QCTRSH
          W +L+            G+G+I RN  GDI+   CK L+     P  +EL A +EGL  AI      + +ET C     L+ GI              
Subjt:  -AWNELST-----------GIGVICRNERGDILRACCKFLDLITLPPPMAELMAIIEGLAFAISMGGNNLIIETYCLQAFNLV-GI--------QCTRSH

Query:  RKIMI-EKILIWIKTCTPVWCLPHRL
        R +++ E+++   K C    C+ H L
Subjt:  RKIMI-EKILIWIKTCTPVWCLPHRL

M5W5F3 Reverse transcriptase domain-containing protein (Fragment)2.9e-1826.71Show/hide
Query:  VAGFITETGAWNVELLKEAVGDDDFNIIRRIPI-SLRISDSYLWHYDKYDKYTVKSGY---GLFMKNRIEQASSSSNPMSKVWNIVWKLKVPSKVKFFCW
        V    T +G WNV LLK+   D + + I +IP+ SL   D  +WHY++   Y+VKSGY   GL       + S+  +  SK W  +W LK+P+K+KFF W
Subjt:  VAGFITETGAWNVELLKEAVGDDDFNIIRRIPI-SLRISDSYLWHYDKYDKYTVKSGY---GLFMKNRIEQASSSSNPMSKVWNIVWKLKVPSKVKFFCW

Query:  KAL----------------------------------------------------------------RWLALCEELSWEELRIVAVTCWAIWGDRNKKIH
        +                                                                   W AL    S EE  + A  CW +W  RN  I 
Subjt:  KAL----------------------------------------------------------------RWLALCEELSWEELRIVAVTCWAIWGDRNKKIH

Query:  EIQISSQ-EIRSRLI--------INYLVEIDHLRPGLLLQPLSG-KSMLMRAWNELSTGIGVICRNERGDILRACCKFLDLITLPPPMAELMAIIEGLAF
        E +  +  ++ SR+          N ++   H R      PL G +        +   G+GV+ RN  G+ + AC + +   +      ELMA IEGL F
Subjt:  EIQISSQ-EIRSRLI--------INYLVEIDHLRPGLLLQPLSG-KSMLMRAWNELSTGIGVICRNERGDILRACCKFLDLITLPPPMAELMAIIEGLAF

Query:  AISMGGNNLIIETYCLQAFNLV
        AI MG  + I+E       N +
Subjt:  AISMGGNNLIIETYCLQAFNLV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G29090.1 Ribonuclease H-like superfamily protein2.5e-0629.81Show/hide
Query:  VAGFITETG-AWNVELLKEAVGDDDFNIIRRI-PISLRISDSYLWHYDKYDKYTVKSGYGLFMK--NRIEQASSSSNP-MSKVWNIVWKLKVPSKVKFFC
        V+  I E+G  W  ++++    + +  +I  + P   RI DSY W Y     YTVKSGY +  +  N+       S P ++ ++  +WK +   K++ F 
Subjt:  VAGFITETG-AWNVELLKEAVGDDDFNIIRRI-PISLRISDSYLWHYDKYDKYTVKSGYGLFMK--NRIEQASSSSNP-MSKVWNIVWKLKVPSKVKFFC

Query:  WKAL
        WK L
Subjt:  WKAL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGTATTTCGCCAGACATACTTTAAACTCTTGGAGATTCGAATCACTACTAACATCCACGCTTCCGGACTTGAATTCACTGTAAATACCCTCAGCACTCCAGACAC
CACCAAACAATATCGTGACCTGTTGAACTGTCGCCATGTTAGATTAGGAAGGAAAATTACAATGGCAATTTTCATGGCTTTGAATGACTCCTATGCTCAGGAAATGGTAG
CTGGTTTTATCACTGAGACGGGAGCCTGGAATGTGGAATTACTAAAGGAAGCAGTGGGTGATGATGACTTTAATATTATCAGACGAATTCCTATTAGCCTACGGATCTCT
GATAGCTATTTGTGGCATTATGACAAGTATGACAAGTATACTGTTAAAAGTGGATATGGGTTGTTTATGAAAAACAGAATTGAACAAGCTTCTTCGAGTAGCAACCCGAT
GAGCAAAGTGTGGAATATTGTTTGGAAGCTCAAAGTCCCTTCGAAGGTTAAATTTTTTTGCTGGAAAGCTTTGAGGTGGTTGGCCCTCTGCGAGGAATTATCTTGGGAGG
AGCTCAGAATTGTGGCGGTAACGTGCTGGGCCATCTGGGGAGACAGGAATAAGAAGATTCATGAAATTCAAATTTCCTCCCAGGAAATTCGTAGTAGATTGATCATAAAT
TACCTGGTGGAGATTGATCATCTCAGACCTGGTCTCCTCCTCCAACCACTTTCTGGAAAATCAATGTTGATGCGCGCGTGGAATGAGTTGTCTACTGGCATTGGTGTGAT
CTGTAGAAATGAGAGGGGAGATATCTTGAGAGCTTGCTGCAAATTTCTAGACTTAATTACCCTCCCTCCCCCTATGGCGGAGCTGATGGCAATCATAGAAGGATTGGCAT
TCGCTATTTCCATGGGAGGAAATAATCTGATTATTGAGACATATTGCCTTCAGGCCTTTAATTTGGTTGGGATCCAGTGTACTCGATCTCATAGGAAGATCATGATCGAG
AAAATTCTGATTTGGATCAAAACCTGCACACCGGTGTGGTGCTTGCCACACCGCCTCCGATGCTTAAGTCAGAAAGCAGAAGGTGGAGACCAAGAGGAGAGAGTAGAGAA
TAAAGTTCGGGATCCTCTCTTTAGTGATGAAGAGTGGTTTAAATACCTGCTCATGCTCCTAGGGTTTTTAAGAATTCGGAGGCGTTTCAGGACGAACCGAACCGGGGCAG
CAGGGACCGAATGGAGGCGAATGAGCTCGGCCGACCATTGGCCCGACCCTTTGGTCTGGTCTTCCTCTGGGTCGGTTTCCTGGTCTTATCTTTGTCCGATTGTCCTCGTC
AGCTCCTTGTGCATCAGGGTGTTGATCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGGTATTTCGCCAGACATACTTTAAACTCTTGGAGATTCGAATCACTACTAACATCCACGCTTCCGGACTTGAATTCACTGTAAATACCCTCAGCACTCCAGACAC
CACCAAACAATATCGTGACCTGTTGAACTGTCGCCATGTTAGATTAGGAAGGAAAATTACAATGGCAATTTTCATGGCTTTGAATGACTCCTATGCTCAGGAAATGGTAG
CTGGTTTTATCACTGAGACGGGAGCCTGGAATGTGGAATTACTAAAGGAAGCAGTGGGTGATGATGACTTTAATATTATCAGACGAATTCCTATTAGCCTACGGATCTCT
GATAGCTATTTGTGGCATTATGACAAGTATGACAAGTATACTGTTAAAAGTGGATATGGGTTGTTTATGAAAAACAGAATTGAACAAGCTTCTTCGAGTAGCAACCCGAT
GAGCAAAGTGTGGAATATTGTTTGGAAGCTCAAAGTCCCTTCGAAGGTTAAATTTTTTTGCTGGAAAGCTTTGAGGTGGTTGGCCCTCTGCGAGGAATTATCTTGGGAGG
AGCTCAGAATTGTGGCGGTAACGTGCTGGGCCATCTGGGGAGACAGGAATAAGAAGATTCATGAAATTCAAATTTCCTCCCAGGAAATTCGTAGTAGATTGATCATAAAT
TACCTGGTGGAGATTGATCATCTCAGACCTGGTCTCCTCCTCCAACCACTTTCTGGAAAATCAATGTTGATGCGCGCGTGGAATGAGTTGTCTACTGGCATTGGTGTGAT
CTGTAGAAATGAGAGGGGAGATATCTTGAGAGCTTGCTGCAAATTTCTAGACTTAATTACCCTCCCTCCCCCTATGGCGGAGCTGATGGCAATCATAGAAGGATTGGCAT
TCGCTATTTCCATGGGAGGAAATAATCTGATTATTGAGACATATTGCCTTCAGGCCTTTAATTTGGTTGGGATCCAGTGTACTCGATCTCATAGGAAGATCATGATCGAG
AAAATTCTGATTTGGATCAAAACCTGCACACCGGTGTGGTGCTTGCCACACCGCCTCCGATGCTTAAGTCAGAAAGCAGAAGGTGGAGACCAAGAGGAGAGAGTAGAGAA
TAAAGTTCGGGATCCTCTCTTTAGTGATGAAGAGTGGTTTAAATACCTGCTCATGCTCCTAGGGTTTTTAAGAATTCGGAGGCGTTTCAGGACGAACCGAACCGGGGCAG
CAGGGACCGAATGGAGGCGAATGAGCTCGGCCGACCATTGGCCCGACCCTTTGGTCTGGTCTTCCTCTGGGTCGGTTTCCTGGTCTTATCTTTGTCCGATTGTCCTCGTC
AGCTCCTTGTGCATCAGGGTGTTGATCTAG
Protein sequenceShow/hide protein sequence
MEVFRQTYFKLLEIRITTNIHASGLEFTVNTLSTPDTTKQYRDLLNCRHVRLGRKITMAIFMALNDSYAQEMVAGFITETGAWNVELLKEAVGDDDFNIIRRIPISLRIS
DSYLWHYDKYDKYTVKSGYGLFMKNRIEQASSSSNPMSKVWNIVWKLKVPSKVKFFCWKALRWLALCEELSWEELRIVAVTCWAIWGDRNKKIHEIQISSQEIRSRLIIN
YLVEIDHLRPGLLLQPLSGKSMLMRAWNELSTGIGVICRNERGDILRACCKFLDLITLPPPMAELMAIIEGLAFAISMGGNNLIIETYCLQAFNLVGIQCTRSHRKIMIE
KILIWIKTCTPVWCLPHRLRCLSQKAEGGDQEERVENKVRDPLFSDEEWFKYLLMLLGFLRIRRRFRTNRTGAAGTEWRRMSSADHWPDPLVWSSSGSVSWSYLCPIVLV
SSLCIRVLI