CuGenDBv2

Spg037524 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg037524
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: scaffold11:31391949..31399401
RNA-Seq Expression: Spg037524
Synteny: Spg037524
Gene Ontology terms: GO:0009987 - cellular process (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
InterPro domains: IPR026960 - Reverse transcriptase zinc-binding domain
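
The genome location field uses a `scaffold:start..end` pattern. As a minimal sketch, the snippet below splits it into its parts and computes the span of the gene model. The helper name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


scaffold, start, end = parse_location("scaffold11:31391949..31399401")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(scaffold, start, end, span)
```

The span covers the whole gene model on the scaffold, so it need not equal the length of the CDS listed further down.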


Homology
GenBank top hits (e value, % identity, alignment)
KAA0067918.1 hypothetical protein E6C27_scaffold138G00850 [Cucumis melo var. makuwa]; e value: 1.4e-22; % identity: 54.74
Query:  KKVQFFIWQVIHGRVNTLDRLSRKIPSLFGPFCCILCRMAEEDLDHMLWNCAFARTVWDEFFGSFGLQFARHRGLRETIEEFLPHPPFREQGKFL
        ++V+FFIWQV+ GRVNT+DRL RK     GPFCC+LC+ AEEDLDH+  +C +AR VW  F    G+ FA  R +R TIEEFL H  F+++  FL
Subjt:  KKVQFFIWQVIHGRVNTLDRLSRKIPSLFGPFCCILCRMAEEDLDHMLWNCAFARTVWDEFFGSFGLQFARHRGLRETIEEFLPHPPFREQGKFL

TYK09969.1 calpain-type cysteine protease DEK1 [Cucumis melo var. makuwa]; e value: 2.4e-27; % identity: 41.18
Query:  RKDIRVWSPDPSLGFSCRSFFRCLLDPVSSQPSIFSLLWKVKVPKKVQFFIWQVIHGRVNTLDRLSRKIPSLFGPFCCILCRMAEEDLDHMLWNCAFART
        R+D+ VWSP P  GF C+SFF+CL++   +  S+ SL+W++KVP+K   F WQV                              EEDLDH+LW+C     
Subjt:  RKDIRVWSPDPSLGFSCRSFFRCLLDPVSSQPSIFSLLWKVKVPKKVQFFIWQVIHGRVNTLDRLSRKIPSLFGPFCCILCRMAEEDLDHMLWNCAFART

Query:  VWDEFFGSFGLQFARHRGLRETIEEFLPHPPFREQGKFLWQAGICAIIWGLWG
        VWD+F  +FGL +ARHR +R T+EEFL + P  E+G FLW A + A + G  G
Subjt:  VWDEFFGSFGLQFARHRGLRETIEEFLPHPPFREQGKFLWQAGICAIIWGLWG

TYK11201.1 SIL1 [Cucumis melo var. makuwa]; e value: 1.1e-27; % identity: 47.79
Query:  FSCRSFFRCLLDPV--SSQPSIFSLLWKVKVPKKVQFFIWQVIHGRVNTLDRLSRKIPSLFGPFCCILCRMAEEDLDHMLWNCAFARTVWDEFFGSFGLQ
        F   S F CL D V    +    S   ++KVPKK +FFI QV+ G +NT+DRL + +  L GPF C LC+ A+EDLD++ W+C   R VW+ F   F   
Subjt:  FSCRSFFRCLLDPV--SSQPSIFSLLWKVKVPKKVQFFIWQVIHGRVNTLDRLSRKIPSLFGPFCCILCRMAEEDLDHMLWNCAFARTVWDEFFGSFGLQ

Query:  FARHRGLRETIEEFLPHPPFREQGKFLWQAGICAII
        F   R +R TIEEFL HPPFRE+   LW AG+CA+I
Subjt:  FARHRGLRETIEEFLPHPPFREQGKFLWQAGICAII

TYK14183.1 hypothetical protein E5676_scaffold8046G00070 [Cucumis melo var. makuwa]; e value: 1.9e-24; % identity: 31.85
Query:  MRQVLIDRKTFTLRKSGCGKKISIEECNDSKTGRVELDVGIAAWVRDCLLTVSKVGRSLGFWRRRRLETTIIFFQVHSNARGKYGLLSLEPFKDRRIRIF
        M ++ IDRK F +      +KI IEE N    G++ELD G +AWVRDCL+  +    S+ FW +RRLE  IIFFQV  N +G++ +LSLE FK+R+ RIF
Subjt:  MRQVLIDRKTFTLRKSGCGKKISIEECNDSKTGRVELDVGIAAWVRDCLLTVSKVGRSLGFWRRRRLETTIIFFQVHSNARGKYGLLSLEPFKDRRIRIF

Query:  FSEGAKGKG-----------------------------------------------------------------------DAWFSIRDAIKKSVQVGLMF
          EG++G G                                                                         W +++  +++   +   F
Subjt:  FSEGAKGKG-----------------------------------------------------------------------DAWFSIRDAIKKSVQVGLMF

Query:  KPFCANLEVGICGSSMEAKEVLELGNKADSKVFRVTDLCGGLMEDEEMIDYEVSMEDISFKAKGKSDGFM
        K FCANL VGI GS  EAK  L LG+  +  V      CGG  E+E+       M ++ FKA G  DGF+
Subjt:  KPFCANLEVGICGSSMEAKEVLELGNKADSKVFRVTDLCGGLMEDEEMIDYEVSMEDISFKAKGKSDGFM

TYK29954.1 reverse transcriptase [Cucumis melo var. makuwa]; e value: 2.6e-29; % identity: 49.21
Query:  KDIRVWSPDPSLGFSCRSFFRCLLDPVSSQPSIFSLLWKVKVPKKVQFFIWQVIHGRVNTLDRLSRKIPSLFGPFCCILCRMAEEDLDHMLWNCAFARTV
        +++ VWS +PS GFS +S F  LLDP  ++  +F  +W++KV KKV+FF WQV+  R N +DRL R+    +   CCILCR A+EDLDH+LW+C +AR V
Subjt:  KDIRVWSPDPSLGFSCRSFFRCLLDPVSSQPSIFSLLWKVKVPKKVQFFIWQVIHGRVNTLDRLSRKIPSLFGPFCCILCRMAEEDLDHMLWNCAFARTV

Query:  WDEFFGSFGLQFARHRGLRETIEEFL
        W  F   F ++ A  R +R+TIEEFL
Subjt:  WDEFFGSFGLQFARHRGLRETIEEFL
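
The % identity column can be related to the alignment rows. As a rough sketch, assuming the usual BLAST convention that a letter in the middle row marks an identical residue and `+` marks a conservative substitution, the snippet below recomputes the value for the KAA0067918.1 alignment. The function name `percent_identity` is hypothetical, and the alignment length is taken from the query row.

```python
def percent_identity(query_row: str, midline: str) -> float:
    """Percent of alignment columns whose midline character is a letter."""
    # Letters in the midline are identities; '+' and spaces are not.
    identities = sum(ch.isalpha() for ch in midline)
    # Alignment length is the length of the query row (gaps included).
    return round(100.0 * identities / len(query_row), 2)


# Query row and middle row of the KAA0067918.1 alignment in this record.
query_row = (
    "KKVQFFIWQVIHGRVNTLDRLSRKIPSLFGPFCCILCRMAEEDLDHMLWN"
    "CAFARTVWDEFFGSFGLQFARHRGLRETIEEFLPHPPFREQGKFL"
)
midline = (
    "++V+FFIWQV+ GRVNT+DRL RK     GPFCC+LC+ AEEDLDH+  "
    "+C +AR VW  F    G+ FA  R +R TIEEFL H  F+++  FL"
)

print(percent_identity(query_row, midline))
```

For this alignment the count is 52 identities over 95 columns, which agrees with the 54.74 listed for the hit.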

TrEMBL top hits (e value, % identity, alignment)
A0A5D3CHC7 SIL1; e value: 5.2e-28; % identity: 47.79
Query:  FSCRSFFRCLLDPV--SSQPSIFSLLWKVKVPKKVQFFIWQVIHGRVNTLDRLSRKIPSLFGPFCCILCRMAEEDLDHMLWNCAFARTVWDEFFGSFGLQ
        F   S F CL D V    +    S   ++KVPKK +FFI QV+ G +NT+DRL + +  L GPF C LC+ A+EDLD++ W+C   R VW+ F   F   
Subjt:  FSCRSFFRCLLDPV--SSQPSIFSLLWKVKVPKKVQFFIWQVIHGRVNTLDRLSRKIPSLFGPFCCILCRMAEEDLDHMLWNCAFARTVWDEFFGSFGLQ

Query:  FARHRGLRETIEEFLPHPPFREQGKFLWQAGICAII
        F   R +R TIEEFL HPPFRE+   LW AG+CA+I
Subjt:  FARHRGLRETIEEFLPHPPFREQGKFLWQAGICAII

A0A5D3CI74 Calpain-type cysteine protease DEK1; e value: 1.2e-27; % identity: 41.18
Query:  RKDIRVWSPDPSLGFSCRSFFRCLLDPVSSQPSIFSLLWKVKVPKKVQFFIWQVIHGRVNTLDRLSRKIPSLFGPFCCILCRMAEEDLDHMLWNCAFART
        R+D+ VWSP P  GF C+SFF+CL++   +  S+ SL+W++KVP+K   F WQV                              EEDLDH+LW+C     
Subjt:  RKDIRVWSPDPSLGFSCRSFFRCLLDPVSSQPSIFSLLWKVKVPKKVQFFIWQVIHGRVNTLDRLSRKIPSLFGPFCCILCRMAEEDLDHMLWNCAFART

Query:  VWDEFFGSFGLQFARHRGLRETIEEFLPHPPFREQGKFLWQAGICAIIWGLWG
        VWD+F  +FGL +ARHR +R T+EEFL + P  E+G FLW A + A + G  G
Subjt:  VWDEFFGSFGLQFARHRGLRETIEEFLPHPPFREQGKFLWQAGICAIIWGLWG

A0A5D3CQQ0 Uncharacterized protein; e value: 9.2e-25; % identity: 31.85
Query:  MRQVLIDRKTFTLRKSGCGKKISIEECNDSKTGRVELDVGIAAWVRDCLLTVSKVGRSLGFWRRRRLETTIIFFQVHSNARGKYGLLSLEPFKDRRIRIF
        M ++ IDRK F +      +KI IEE N    G++ELD G +AWVRDCL+  +    S+ FW +RRLE  IIFFQV  N +G++ +LSLE FK+R+ RIF
Subjt:  MRQVLIDRKTFTLRKSGCGKKISIEECNDSKTGRVELDVGIAAWVRDCLLTVSKVGRSLGFWRRRRLETTIIFFQVHSNARGKYGLLSLEPFKDRRIRIF

Query:  FSEGAKGKG-----------------------------------------------------------------------DAWFSIRDAIKKSVQVGLMF
          EG++G G                                                                         W +++  +++   +   F
Subjt:  FSEGAKGKG-----------------------------------------------------------------------DAWFSIRDAIKKSVQVGLMF

Query:  KPFCANLEVGICGSSMEAKEVLELGNKADSKVFRVTDLCGGLMEDEEMIDYEVSMEDISFKAKGKSDGFM
        K FCANL VGI GS  EAK  L LG+  +  V      CGG  E+E+       M ++ FKA G  DGF+
Subjt:  KPFCANLEVGICGSSMEAKEVLELGNKADSKVFRVTDLCGGLMEDEEMIDYEVSMEDISFKAKGKSDGFM

A0A5D3E255 Reverse transcriptase; e value: 1.2e-29; % identity: 49.21
Query:  KDIRVWSPDPSLGFSCRSFFRCLLDPVSSQPSIFSLLWKVKVPKKVQFFIWQVIHGRVNTLDRLSRKIPSLFGPFCCILCRMAEEDLDHMLWNCAFARTV
        +++ VWS +PS GFS +S F  LLDP  ++  +F  +W++KV KKV+FF WQV+  R N +DRL R+    +   CCILCR A+EDLDH+LW+C +AR V
Subjt:  KDIRVWSPDPSLGFSCRSFFRCLLDPVSSQPSIFSLLWKVKVPKKVQFFIWQVIHGRVNTLDRLSRKIPSLFGPFCCILCRMAEEDLDHMLWNCAFARTV

Query:  WDEFFGSFGLQFARHRGLRETIEEFL
        W  F   F ++ A  R +R+TIEEFL
Subjt:  WDEFFGSFGLQFARHRGLRETIEEFL

M5XV38 zf-RVT domain-containing protein; e value: 3.0e-23; % identity: 31.98
Query:  IGALEPFTTRKDIRVWSPDPSLGFSCRSFFRCLLDPVSSQPSIFSLLWKVKVPKKVQFFIWQVIHGRVNTLDRLSRKIPSL-FGPFCCILCRMAEEDLDH
        +G +  + +R D R W  +    FSC+SF   LL         FS +WK K P K+QFF+W   +GR+NT D + R+ P +   P  C+LC+   E++DH
Subjt:  IGALEPFTTRKDIRVWSPDPSLGFSCRSFFRCLLDPVSSQPSIFSLLWKVKVPKKVQFFIWQVIHGRVNTLDRLSRKIPSL-FGPFCCILCRMAEEDLDH

Query:  MLWNCAFARTVWDEFFGSFGLQFARHRGLRETIEEFLPHPPFREQGKFLWQAGICAIIWGLWGERNNRTFRG
        +  +C+++  +W +  G+ G+++   +G  E +   L      ++   L    + AI W +W ERN R F+G
Subjt:  MLWNCAFARTVWDEFFGSFGLQFARHRGLRETIEEFLPHPPFREQGKFLWQAGICAIIWGLWGERNNRTFRG

SwissProt top hits (e value, % identity, alignment)
P0C2F6 Putative ribonuclease H protein At1g65750; e value: 1.1e-06; % identity: 26.63
Query:  LEPFTTRKDIRVWSPDPSLGFSCRSFFRCLL---DPVSSQPSIFSLLWKVKVPKKVQFFIWQVIHGRVNTLDRLSRKIPSLFGPFCCILCRMAEEDLDHM
        L+  T  +D   W       FS RS +  L     P  +  S F+ LWKV+VP++V+ F+W V +  V T +   R+   L     C +C+   E + H+
Subjt:  LEPFTTRKDIRVWSPDPSLGFSCRSFFRCLL---DPVSSQPSIFSLLWKVKVPKKVQFFIWQVIHGRVNTLDRLSRKIPSLFGPFCCILCRMAEEDLDHM

Query:  LWNCAFARTVWDEFFGSFGLQFARHRGLRETIEEFLPHPPFREQGKFLWQAGICAIIWGLWGERNNRTF
        L +C     +W         Q    + L E + + L      E     W      IIW  W  R    F
Subjt:  LWNCAFARTVWDEFFGSFGLQFARHRGLRETIEEFLPHPPFREQGKFLWQAGICAIIWGLWGERNNRTF

Arabidopsis top hits (e value, % identity, alignment)
AT2G02520.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein; e value: 4.3e-06; % identity: 24.85
Query:  LDPVSSQPSIFSLLW-KVKVPKKVQFFIWQVIHGRVNTLDRLSRKIPSLFGPFCCILCRMAEEDLDHMLWNCAFARTVWDEFFGSFGL--QFARHRGLRE
        L+P+  +   F  +W K K+PK   F  W  +  R++T DR+         P  C+ C   +E   H+ ++C FAR VW  F     +        G+R 
Subjt:  LDPVSSQPSIFSLLW-KVKVPKKVQFFIWQVIHGRVNTLDRLSRKIPSLFGPFCCILCRMAEEDLDHMLWNCAFARTVWDEFFGSFGL--QFARHRGLRE

Query:  TIEEFLPHPPFREQGKFLWQAGICAIIWGLWGERNNRTFRGLERDPIEDDNRLEEAMLAMEAKKI
            +L +P   +    + +    A ++ +W ERN R         + D      A L +E K +
Subjt:  TIEEFLPHPPFREQGKFLWQAGICAIIWGLWGERNNRTFRGLERDPIEDDNRLEEAMLAMEAKKI

AT2G02650.1 Ribonuclease H-like superfamily protein; e value: 1.7e-07; % identity: 25.36
Query:  LDPVSSQPSIFSLLWKVKVPKKVQFFIWQVIHGRVNTLDRL-SRKIPSLFGPFCCILCRMAEEDLDHMLWNCAFARTVWDEFFGSFGLQFARHRGLRETI
        + P      +   +WK+ V  K++ F+W+ + G + T  RL SR I +   P C   C + EE + H+++NC + ++VW       G Q+       + +
Subjt:  LDPVSSQPSIFSLLWKVKVPKKVQFFIWQVIHGRVNTLDRL-SRKIPSLFGPFCCILCRMAEEDLDHMLWNCAFARTVWDEFFGSFGLQFARHRGLRETI

Query:  EEFLPHPPFREQG---KFL--WQAGICAIIWGLWGERN
           +     +      +FL  W      I+W LW  RN
Subjt:  EEFLPHPPFREQG---KFL--WQAGICAIIWGLWGERN

AT4G29090.1 Ribonuclease H-like superfamily protein; e value: 1.7e-07; % identity: 24.07
Query:  ELERSMGIGALEPFTTR-KDIRVWSPDPSLGFSCRSFFRCLLDPVS--------SQPS---IFSLLWKVKVPKKVQFFIWQVIHGRVNTLDRLSRKIPSL
        E+ER + IG L P   R  D   W    S  ++ +S +  L   ++        S+PS   I+  +WK +   K+Q F+W+ +   +     L+ +   L
Subjt:  ELERSMGIGALEPFTTR-KDIRVWSPDPSLGFSCRSFFRCLLDPVS--------SQPS---IFSLLWKVKVPKKVQFFIWQVIHGRVNTLDRLSRKIPSL

Query:  FGPFCCILCRMAEEDLDHMLWNCAFARTVW--------------DEFFGSFGLQFARHRGLRETIEEFLPHPPFREQGKFL-WQAGICAIIWGLWGERNN
             CI C   +E ++H+L+ C FAR  W              D  + +    F    G          +P + +  + + W      ++W LW  RN 
Subjt:  FGPFCCILCRMAEEDLDHMLWNCAFARTVW--------------DEFFGSFGLQFARHRGLRETIEEFLPHPPFREQGKFL-WQAGICAIIWGLWGERNN

Query:  RTFRGLERDPIEDDNRLEEAMLAMEAKKIEKTQEEEGDKQE
          FRG E +  E   R E+    +E  +I    E  G K +
Subjt:  RTFRGLERDPIEDDNRLEEAMLAMEAKKIEKTQEEEGDKQE


Sequences
CDS sequence
ATGGTTGAGCTTCTCTTTGATGATATTTTTAAGATTGAGCGATTAAACCCAGATGGTAAAAAGTTTGATAAAGGGAGTTTCTTGCCATCTAATTGGATTTTTGTGGGGAT
GGTTGTAGTTGCAGAGCCCCTTTTCTGGGAAGATAGTTTCAGAACAAATAGGATTGAGAAACCTCCTGAATGTGGCGCCATCTCCTTCCTTTTCAACGATGGAAAAATGA
GGCAAGTACTTATAGATAGAAAAACGTTTACACTCCGTAAGAGTGGGTGTGGAAAGAAAATCTCGATTGAGGAATGCAATGACTCTAAAACAGGGAGGGTGGAGCTAGAT
GTGGGTATCGCGGCTTGGGTCCGAGATTGTCTATTGACGGTGTCTAAAGTTGGTCGTTCGTTGGGATTTTGGAGAAGACGAAGGCTAGAGACAACAATTATATTTTTTCA
AGTTCATTCAAATGCTAGAGGAAAATATGGTTTGCTATCGTTGGAACCTTTCAAAGACAGAAGAATCAGAATTTTCTTCTCAGAGGGAGCGAAGGGGAAAGGGGATGCGT
GGTTCTCGATTAGAGATGCTATTAAAAAATCGGTGCAAGTGGGCTTGATGTTCAAGCCGTTCTGTGCTAACCTAGAGGTAGGAATTTGTGGGAGTTCCATGGAAGCTAAA
GAAGTTCTGGAATTGGGTAACAAAGCCGATTCAAAGGTTTTTCGAGTGACGGACTTATGTGGGGGGTTGATGGAGGATGAGGAGATGATCGACTATGAAGTGTCGATGGA
AGATATTAGCTTCAAGGCAAAGGGTAAGAGCGATGGCTTCATGGGTACAAGTCACGTGAAGGAAAAAGGAAAAGAAAGGGAGACTAAGCTCGTGTATAATGAAAAGAAAG
AGGTTAGGCCTTCCGTGGACGTGAACTTGGGCCAGGAAAAATCTGATAACTTCAAGCCCACAAGGGACCCTCAACATCAAAAGGCCTGTCATTTGAAGGGGAGGACAAAA
TTTAAAATCCTAGAACGAGGGGAGAATAGTAAGACAACATATGAGGAGCATGGGCTGGGCTCAGTTGTTAATTCGGAAGCCCAAGTAGGAGATGTGGATGGGCTAGACTC
CAAAATTGAGCTGGAAAGGAGTATGGGCATTGGGGCGTTAGAACCTTTTACCACTAGGAAAGACATTCGAGTCTGGAGTCCCGACCCTTCCCTCGGGTTTTCTTGTCGAT
CCTTCTTCCGATGCCTTTTGGACCCTGTTTCCTCTCAGCCGTCCATTTTCTCTTTATTGTGGAAGGTGAAAGTTCCAAAGAAGGTGCAGTTTTTTATTTGGCAGGTGATC
CATGGAAGAGTTAATACTCTTGATCGGCTGTCCAGAAAGATTCCTAGTCTATTTGGGCCTTTTTGTTGCATTCTTTGTCGGATGGCGGAGGAAGACCTCGATCATATGTT
ATGGAACTGCGCTTTTGCTAGGACAGTGTGGGATGAGTTCTTTGGCTCGTTCGGGTTGCAGTTTGCCAGACACAGAGGCCTCAGAGAGACGATCGAGGAGTTCCTTCCCC
ATCCTCCCTTTAGGGAGCAAGGAAAGTTTTTGTGGCAAGCTGGGATTTGTGCTATTATTTGGGGGTTGTGGGGTGAGAGAAACAATAGAACTTTTAGAGGGTTAGAGAGG
GATCCTATTGAGGATGATAACCGATTAGAGGAAGCTATGCTAGCTATGGAAGCCAAGAAAATTGAGAAGACCCAAGAGGAAGAAGGTGACAAGCAGGAGTCCAAAGTCTA
TGTTCCGTAG
mRNA sequence
ATGGTTGAGCTTCTCTTTGATGATATTTTTAAGATTGAGCGATTAAACCCAGATGGTAAAAAGTTTGATAAAGGGAGTTTCTTGCCATCTAATTGGATTTTTGTGGGGAT
GGTTGTAGTTGCAGAGCCCCTTTTCTGGGAAGATAGTTTCAGAACAAATAGGATTGAGAAACCTCCTGAATGTGGCGCCATCTCCTTCCTTTTCAACGATGGAAAAATGA
GGCAAGTACTTATAGATAGAAAAACGTTTACACTCCGTAAGAGTGGGTGTGGAAAGAAAATCTCGATTGAGGAATGCAATGACTCTAAAACAGGGAGGGTGGAGCTAGAT
GTGGGTATCGCGGCTTGGGTCCGAGATTGTCTATTGACGGTGTCTAAAGTTGGTCGTTCGTTGGGATTTTGGAGAAGACGAAGGCTAGAGACAACAATTATATTTTTTCA
AGTTCATTCAAATGCTAGAGGAAAATATGGTTTGCTATCGTTGGAACCTTTCAAAGACAGAAGAATCAGAATTTTCTTCTCAGAGGGAGCGAAGGGGAAAGGGGATGCGT
GGTTCTCGATTAGAGATGCTATTAAAAAATCGGTGCAAGTGGGCTTGATGTTCAAGCCGTTCTGTGCTAACCTAGAGGTAGGAATTTGTGGGAGTTCCATGGAAGCTAAA
GAAGTTCTGGAATTGGGTAACAAAGCCGATTCAAAGGTTTTTCGAGTGACGGACTTATGTGGGGGGTTGATGGAGGATGAGGAGATGATCGACTATGAAGTGTCGATGGA
AGATATTAGCTTCAAGGCAAAGGGTAAGAGCGATGGCTTCATGGGTACAAGTCACGTGAAGGAAAAAGGAAAAGAAAGGGAGACTAAGCTCGTGTATAATGAAAAGAAAG
AGGTTAGGCCTTCCGTGGACGTGAACTTGGGCCAGGAAAAATCTGATAACTTCAAGCCCACAAGGGACCCTCAACATCAAAAGGCCTGTCATTTGAAGGGGAGGACAAAA
TTTAAAATCCTAGAACGAGGGGAGAATAGTAAGACAACATATGAGGAGCATGGGCTGGGCTCAGTTGTTAATTCGGAAGCCCAAGTAGGAGATGTGGATGGGCTAGACTC
CAAAATTGAGCTGGAAAGGAGTATGGGCATTGGGGCGTTAGAACCTTTTACCACTAGGAAAGACATTCGAGTCTGGAGTCCCGACCCTTCCCTCGGGTTTTCTTGTCGAT
CCTTCTTCCGATGCCTTTTGGACCCTGTTTCCTCTCAGCCGTCCATTTTCTCTTTATTGTGGAAGGTGAAAGTTCCAAAGAAGGTGCAGTTTTTTATTTGGCAGGTGATC
CATGGAAGAGTTAATACTCTTGATCGGCTGTCCAGAAAGATTCCTAGTCTATTTGGGCCTTTTTGTTGCATTCTTTGTCGGATGGCGGAGGAAGACCTCGATCATATGTT
ATGGAACTGCGCTTTTGCTAGGACAGTGTGGGATGAGTTCTTTGGCTCGTTCGGGTTGCAGTTTGCCAGACACAGAGGCCTCAGAGAGACGATCGAGGAGTTCCTTCCCC
ATCCTCCCTTTAGGGAGCAAGGAAAGTTTTTGTGGCAAGCTGGGATTTGTGCTATTATTTGGGGGTTGTGGGGTGAGAGAAACAATAGAACTTTTAGAGGGTTAGAGAGG
GATCCTATTGAGGATGATAACCGATTAGAGGAAGCTATGCTAGCTATGGAAGCCAAGAAAATTGAGAAGACCCAAGAGGAAGAAGGTGACAAGCAGGAGTCCAAAGTCTA
TGTTCCGTAG
Protein sequence
MVELLFDDIFKIERLNPDGKKFDKGSFLPSNWIFVGMVVVAEPLFWEDSFRTNRIEKPPECGAISFLFNDGKMRQVLIDRKTFTLRKSGCGKKISIEECNDSKTGRVELD
VGIAAWVRDCLLTVSKVGRSLGFWRRRRLETTIIFFQVHSNARGKYGLLSLEPFKDRRIRIFFSEGAKGKGDAWFSIRDAIKKSVQVGLMFKPFCANLEVGICGSSMEAK
EVLELGNKADSKVFRVTDLCGGLMEDEEMIDYEVSMEDISFKAKGKSDGFMGTSHVKEKGKERETKLVYNEKKEVRPSVDVNLGQEKSDNFKPTRDPQHQKACHLKGRTK
FKILERGENSKTTYEEHGLGSVVNSEAQVGDVDGLDSKIELERSMGIGALEPFTTRKDIRVWSPDPSLGFSCRSFFRCLLDPVSSQPSIFSLLWKVKVPKKVQFFIWQVI
HGRVNTLDRLSRKIPSLFGPFCCILCRMAEEDLDHMLWNCAFARTVWDEFFGSFGLQFARHRGLRETIEEFLPHPPFREQGKFLWQAGICAIIWGLWGERNNRTFRGLER
DPIEDDNRLEEAMLAMEAKKIEKTQEEEGDKQESKVYVP
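
The CDS and protein sequences can be cross-checked by translation. The sketch below assumes the standard genetic code (the record does not name a translation table) and translates only the first 30 and last 33 nucleotides of the CDS listed above. The function name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code, with codons ordered by T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA string in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 30 nt and last 33 nt of the CDS in this record.
cds_start = "ATGGTTGAGCTTCTCTTTGATGATATTTTT"
cds_end = "GACAAGCAGGAGTCCAAAGTCTATGTTCCGTAG"

print(translate(cds_start))
print(translate(cds_end))
```

The two translated excerpts correspond to the first ten residues (MVELLFDDIF) and the last ten residues (DKQESKVYVP) of the protein sequence, followed by the stop codon.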