; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg018347 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg018347
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionTransposase, Ptta/En/Spm, plant
Genome locationscaffold3:14014817..14016184
RNA-Seq ExpressionSpg018347
SyntenySpg018347
Gene Ontology termsNA
InterPro domainsIPR004252 - Probable transposase, Ptta/En/Spm, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_038887409.1 poly [ADP-ribose] polymerase 1-like isoform X2 [Benincasa hispida]1.1e-4637.16Show/hide
Query:  QVRGSSRGVGLDRIIETTGNRVSISWTPEQGKPVGRVANMFNTEIGTLSRTFIPLKYHKKKDIPNEIMMNIVDRLLNKFDVNFSEYHVRKFILYEIGNRY
        + RG SR + LDR +   G R+ I    E GKPV   A  F+  IGT++R  IPL+     D+  E+   +VD+LL+ FD +  + HV+K++L  + N +
Subjt:  QVRGSSRGVGLDRIIETTGNRVSISWTPEQGKPVGRVANMFNTEIGTLSRTFIPLKYHKKKDIPNEIMMNIVDRLLNKFDVNFSEYHVRKFILYEIGNRY

Query:  KDWRARLHRYYKKIGDPEVARTRPHKDVTQ-EDWNMLCDRWETPDWKVKSNRNKNSRSKLPFNHCAGTKSFLSHREENKNEDGTYMNPIEMFRQTHWSTA
        K++R+ L+++Y++  DP+ AR  P K +T   DWN+LC+RWETP+WK K+  NK SRSK+P+ H  G+KSF+  + E K ++G  ++ +++FRQ+H+   
Subjt:  KDWRARLHRYYKKIGDPEVARTRPHKDVTQ-EDWNMLCDRWETPDWKVKSNRNKNSRSKLPFNHCAGTKSFLSHREENKNEDGTYMNPIEMFRQTHWSTA

Query:  KGWTDGAASQAHEQMVALSQEQ-ASSSTPLSDEEILATVLGTRSSYVKGMGYGPKPPPHQKKASYSQEYVQSLEANLQKSQELLQSQREENEKTQE
         GW +  A  A+ +M  L +       TP+S  E+   VLG RS Y+KG+G  PKP       SY Q+  + LE  ++K ++ +   +   E  +E
Subjt:  KGWTDGAASQAHEQMVALSQEQ-ASSSTPLSDEEILATVLGTRSSYVKGMGYGPKPPPHQKKASYSQEYVQSLEANLQKSQELLQSQREENEKTQE

XP_038887413.1 uncharacterized protein LOC120077557 isoform X5 [Benincasa hispida]2.7e-4234.06Show/hide
Query:  QVRGSSRGVGLDRIIETTGNRVSISWTPEQGKPVGRVANMFNTEIGTLSRTFIPLKYHKKKDIPNEIMMNIVDRLL------------------------
        + RG SR + LDR +   G R+ I    E GKPV   A  F+  IGT++R  IPL+     D+  E+   +VD+LL                        
Subjt:  QVRGSSRGVGLDRIIETTGNRVSISWTPEQGKPVGRVANMFNTEIGTLSRTFIPLKYHKKKDIPNEIMMNIVDRLL------------------------

Query:  ---NKFDVNFSEYHVRKFILYEIGNRYKDWRARLHRYYKKIGDPEVARTRPHKDVTQ-EDWNMLCDRWETPDWKVKSNRNKNSRSKLPFNHCAGTKSFLS
           + FD +  + HV+K++L  + N +K++R+ L+++Y++  DP+ AR  P K +T   DWN+LC+RWETP+WK K+  NK SRSK+P+ H  G+KSF+ 
Subjt:  ---NKFDVNFSEYHVRKFILYEIGNRYKDWRARLHRYYKKIGDPEVARTRPHKDVTQ-EDWNMLCDRWETPDWKVKSNRNKNSRSKLPFNHCAGTKSFLS

Query:  HREENKNEDGTYMNPIEMFRQTHWSTAKGWTDGAASQAHEQMVALSQEQ-ASSSTPLSDEEILATVLGTRSSYVKGMGYGPKPPPHQKKASYSQEYVQSL
         + E K ++G  ++ +++FRQ+H+    GW +  A  A+ +M  L +       TP+S  E+   VLG RS Y+KG+G  PKP       SY Q+  + L
Subjt:  HREENKNEDGTYMNPIEMFRQTHWSTAKGWTDGAASQAHEQMVALSQEQ-ASSSTPLSDEEILATVLGTRSSYVKGMGYGPKPPPHQKKASYSQEYVQSL

Query:  EANLQKSQELLQSQREENEKTQE
        E  ++K ++ +   +   E  +E
Subjt:  EANLQKSQELLQSQREENEKTQE

XP_038895319.1 uncharacterized protein LOC120083572 isoform X1 [Benincasa hispida]7.8e-7450.89Show/hide
Query:  NEDEIV---EMDEASSNSSQVRGSSRGVGLDRIIETTGNRVSISWTPEQGKPVGRVANMFNTEIGTLSRTFIPLKYHKKKDIPNEIMMNIVDRLLNKFDV
        N D+ V     ++ +    +VRG+SRGV L++   T G R+ ++WTP QGKP+G +A++FN EIG L R FIPLKY K+KDIPNE+   + ++LLN+FDV
Subjt:  NEDEIV---EMDEASSNSSQVRGSSRGVGLDRIIETTGNRVSISWTPEQGKPVGRVANMFNTEIGTLSRTFIPLKYHKKKDIPNEIMMNIVDRLLNKFDV

Query:  NFSEYHVRKFILYEIGNRYKDWRARLHRYYKKIGDPEVARTRPHKDVTQEDWNMLCDRWETPDWKVKSNRNKNSRSKLPFNHCAGTKSFLSHREENKNED
        + S+ H++++I YEIGNR+KD+R  L+++Y+K  DP  AR  P+K  T +DWN+LCDRWE+  WK KS RNK SRSK+ FNHC G+KSFLS R +   ED
Subjt:  NFSEYHVRKFILYEIGNRYKDWRARLHRYYKKIGDPEVARTRPHKDVTQEDWNMLCDRWETPDWKVKSNRNKNSRSKLPFNHCAGTKSFLSHREENKNED

Query:  GTYMNPIEMFRQTHWSTAKGWTDGAASQAHEQMVALSQEQASSSTPLSDEEILATVLGTRSSYVKGMGYGPKPPPHQKKAS
        GTY++ IE+F +TH S +KGW D AA +A+E M+ L + +       +DEEI+  VLG RSSY+ G GYGPKPP  ++ +S
Subjt:  GTYMNPIEMFRQTHWSTAKGWTDGAASQAHEQMVALSQEQASSSTPLSDEEILATVLGTRSSYVKGMGYGPKPPPHQKKAS

XP_038895320.1 uncharacterized protein LOC120083572 isoform X2 [Benincasa hispida]3.1e-6252.17Show/hide
Query:  NEDEIV---EMDEASSNSSQVRGSSRGVGLDRIIETTGNRVSISWTPEQGKPVGRVANMFNTEIGTLSRTFIPLKYHKKKDIPNEIMMNIVDRLLNKFDV
        N D+ V     ++ +    +VRG+SRGV L++   T G R+ ++WTP QGKP+G +A++FN EIG L R FIPLKY K+KDIPNE+   + ++LLN+FDV
Subjt:  NEDEIV---EMDEASSNSSQVRGSSRGVGLDRIIETTGNRVSISWTPEQGKPVGRVANMFNTEIGTLSRTFIPLKYHKKKDIPNEIMMNIVDRLLNKFDV

Query:  NFSEYHVRKFILYEIGNRYKDWRARLHRYYKKIGDPEVARTRPHKDVTQEDWNMLCDRWETPDWKVKSNRNKNSRSKLPFNHCAGTKSFLSHREENKNED
        + S+ H++++I YEIGNR+KD+R  L+++Y+K  DP  AR  P+K  T +DWN+LCDRWE+  WK KS RNK SRSK+ FNHC G+KSFLS R +   ED
Subjt:  NFSEYHVRKFILYEIGNRYKDWRARLHRYYKKIGDPEVARTRPHKDVTQEDWNMLCDRWETPDWKVKSNRNKNSRSKLPFNHCAGTKSFLSHREENKNED

Query:  GTYMNPIEMFRQTHWSTAKGWTDGAASQAH
        GTY++ IE+F +TH S +KGW D AA +A+
Subjt:  GTYMNPIEMFRQTHWSTAKGWTDGAASQAH

XP_038895321.1 uncharacterized protein LOC120083572 isoform X3 [Benincasa hispida]3.1e-6252.17Show/hide
Query:  NEDEIV---EMDEASSNSSQVRGSSRGVGLDRIIETTGNRVSISWTPEQGKPVGRVANMFNTEIGTLSRTFIPLKYHKKKDIPNEIMMNIVDRLLNKFDV
        N D+ V     ++ +    +VRG+SRGV L++   T G R+ ++WTP QGKP+G +A++FN EIG L R FIPLKY K+KDIPNE+   + ++LLN+FDV
Subjt:  NEDEIV---EMDEASSNSSQVRGSSRGVGLDRIIETTGNRVSISWTPEQGKPVGRVANMFNTEIGTLSRTFIPLKYHKKKDIPNEIMMNIVDRLLNKFDV

Query:  NFSEYHVRKFILYEIGNRYKDWRARLHRYYKKIGDPEVARTRPHKDVTQEDWNMLCDRWETPDWKVKSNRNKNSRSKLPFNHCAGTKSFLSHREENKNED
        + S+ H++++I YEIGNR+KD+R  L+++Y+K  DP  AR  P+K  T +DWN+LCDRWE+  WK KS RNK SRSK+ FNHC G+KSFLS R +   ED
Subjt:  NFSEYHVRKFILYEIGNRYKDWRARLHRYYKKIGDPEVARTRPHKDVTQEDWNMLCDRWETPDWKVKSNRNKNSRSKLPFNHCAGTKSFLSHREENKNED

Query:  GTYMNPIEMFRQTHWSTAKGWTDGAASQAH
        GTY++ IE+F +TH S +KGW D AA +A+
Subjt:  GTYMNPIEMFRQTHWSTAKGWTDGAASQAH

TrEMBL top hitse value%identityAlignment
A0A314YX60 Uncharacterized protein3.8e-4232.92Show/hide
Query:  DEASSNSSQVRGSSRGVGLDRIIETTGNRVSISWTPEQGKPVGRVANMFNTEIGTLSRTFIPLKYHKKKDIPNEIMMNIVDRLLNKFDVNFSEYHVRKFI
        +  +  + +VR  +RG+  D++    G+++ I+ T   G P G  ++MF  ++G + RT+ PL   + KDI  + +  +V R+++K+D++ S  HV +F+
Subjt:  DEASSNSSQVRGSSRGVGLDRIIETTGNRVSISWTPEQGKPVGRVANMFNTEIGTLSRTFIPLKYHKKKDIPNEIMMNIVDRLLNKFDVNFSEYHVRKFI

Query:  LYEIGNRYKDWRARLHRYYKKIGDPEVA-RTRPHKDVTQEDWNMLCDRWETPDWKVKSNRNKNSRSKLPFNHCAGTKSFLSHREENKNEDGTYMNPIEMF
        + +   RY D+R +L  Y+ K    E A + +P +  TQE+WN LC  + T  ++ +S +N  +RS L +NH  G+KSF++H+ E     G  + PIE F
Subjt:  LYEIGNRYKDWRARLHRYYKKIGDPEVA-RTRPHKDVTQEDWNMLCDRWETPDWKVKSNRNKNSRSKLPFNHCAGTKSFLSHREENKNEDGTYMNPIEMF

Query:  RQTHWSTAKGWTDGAASQAHEQMVALSQEQASSST-PLSDEEILATVLGTRSSYVKGMGYGPKPPPHQKKASYSQEYVQSLEANLQKSQELLQSQREENE
         + H +   GW  GA S+  E+M+A+  E ++  T  ++D+EI+A VLG +S Y+KG G+GP+P     KA  SQ        + ++ ++++QSQ+E+ +
Subjt:  RQTHWSTAKGWTDGAASQAHEQMVALSQEQASSST-PLSDEEILATVLGTRSSYVKGMGYGPKPPPHQKKASYSQEYVQSLEANLQKSQELLQSQREENE

Query:  KTQELLQTQREENARMEKRIEERME
        + +EL++   E+    +K+ EE ME
Subjt:  KTQELLQTQREENARMEKRIEERME

A0A438CMH8 Uncharacterized protein1.5e-4132.73Show/hide
Query:  NSSQVRGSSRGVGLDRIIETTGNR-VSISWTPEQGKPVGRVANMFNTEIGTLSRTFIPLKYHKKKDIPNEIMMNIVDR--------LLNKFDVNFSEYHV
        N   VRG +RGV LD++IE  G + + I+  P  GK  G+     + EIG   R   P++  K K +P   +  ++DR        +  KF ++ ++ HV
Subjt:  NSSQVRGSSRGVGLDRIIETTGNR-VSISWTPEQGKPVGRVANMFNTEIGTLSRTFIPLKYHKKKDIPNEIMMNIVDR--------LLNKFDVNFSEYHV

Query:  RKFILYEIGNRYKDWRARLHRYYKKIGDPEVARTRPHKDVT-QEDWNMLCDRWETPDWKVKSNRNKNSRSKLPFNHCAGTKSFLSHREENKNEDGTYMNP
        +K +  ++ +R+++WR  LH+++KK      A+  PH+ V+ QEDW+ LCDR+ + ++K +S  N  +RSK+PF+H  G++SF+ H  +   E+G  +  
Subjt:  RKFILYEIGNRYKDWRARLHRYYKKIGDPEVARTRPHKDVT-QEDWNMLCDRWETPDWKVKSNRNKNSRSKLPFNHCAGTKSFLSHREENKNEDGTYMNP

Query:  IEMFRQTHWSTAKGWTDGAASQAHEQMVALSQEQ-ASSSTPLSDEEILATVLGTRSSYVKGMGYGPKPPPHQKKASYSQEYVQSLEANLQKSQELLQSQR
        IE+F+  HW +  GW +  A   +E+M+ L ++  A  +  +++ EI   VLG +S YVKG+G+GPKP    K    S E+   LE  L ++Q L+++Q+
Subjt:  IEMFRQTHWSTAKGWTDGAASQAHEQMVALSQEQ-ASSSTPLSDEEILATVLGTRSSYVKGMGYGPKPPPHQKKASYSQEYVQSLEANLQKSQELLQSQR

Query:  EENEKTQ----ELLQTQREENARMEKRIEE
        ++ E  Q    +L    +++N +  ++ EE
Subjt:  EENEKTQ----ELLQTQREENARMEKRIEE

A0A438HRB4 Uncharacterized protein2.5e-4132.62Show/hide
Query:  NSSQVRGSSRGVGLDRIIETTGNR-VSISWTPEQGKPVGRVANMFNTEIGTLSRTFIPLKYHKKKDIPNEIMMNIVDR--------LLNKFDVNFSEYHV
        N   VRG +RGV LD++IE  G + + I+  P  GK  G+     + EIG   R   P++  K K +P   +  ++DR        +  KF ++ ++ HV
Subjt:  NSSQVRGSSRGVGLDRIIETTGNR-VSISWTPEQGKPVGRVANMFNTEIGTLSRTFIPLKYHKKKDIPNEIMMNIVDR--------LLNKFDVNFSEYHV

Query:  RKFILYEIGNRYKDWRARLHRYYKKIGDPEVARTRPHKDVT-QEDWNMLCDRWETPDWKVKSNRNKNSRSKLPFNHCAGTKSFLSHREENKNEDGTYMNP
        +K +  ++ +R+++WR  LH+++KK      A+  PH+ V+ QEDW+ LCDR+ + ++K +S  N  +RSK+PF+H  G++SF+ H  +   E+G  +  
Subjt:  RKFILYEIGNRYKDWRARLHRYYKKIGDPEVARTRPHKDVT-QEDWNMLCDRWETPDWKVKSNRNKNSRSKLPFNHCAGTKSFLSHREENKNEDGTYMNP

Query:  IEMFRQTHWSTAKGWTDGAASQAHEQMVALSQEQ-ASSSTPLSDEEILATVLGTRSSYVKGMGYGPKPPPHQKKASYSQEYVQSLEANL-QKSQELLQSQ
        IE+F+  HW +  GW +  A   +E+M+ L ++  A  +  +++ EI   VLG +S YVKG+G+GPKP    K    S E    LE  L +  Q+ L++Q
Subjt:  IEMFRQTHWSTAKGWTDGAASQAHEQMVALSQEQ-ASSSTPLSDEEILATVLGTRSSYVKGMGYGPKPPPHQKKASYSQEYVQSLEANL-QKSQELLQSQ

Query:  REENEKTQELLQTQREENARMEKRI
        ++  ++ + L+Q Q +++ +  + I
Subjt:  REENEKTQELLQTQREENARMEKRI

A0A443P6A9 Transposase, Ptta/En/Spm, plant1.9e-4131.63Show/hide
Query:  ASSNSSQVRGSSRGVGLDRIIETTGNRVSISWTPEQGKPVGRVANMFNTEIGTLSRTFIPLKYHKKKDIPNEIMMNIVDRLLNKFDVNFSEYHVRKFILY
        AS+++ + RG S+ + L ++I+ TG ++ I       +PVG+ A  F TE+G + RT+ PL  +    +  E      DR+L+KFD++     ++K I  
Subjt:  ASSNSSQVRGSSRGVGLDRIIETTGNRVSISWTPEQGKPVGRVANMFNTEIGTLSRTFIPLKYHKKKDIPNEIMMNIVDRLLNKFDVNFSEYHVRKFILY

Query:  EIGNRYKDWRARLHRYYKKIGDPEVARTRPHKDVTQEDWNMLCDRWETPDWKVKSNRNKNSRSKLPFNHCAGTKSFLSHREENKNEDGTYMNPIEMFRQT
         + + +K++R RLH +YK++G+   A  +P++ V+QEDW + C+R+ + +++  S +N  +R  L  NHC G+KSF+ +  E ++     ++ IE++ +T
Subjt:  EIGNRYKDWRARLHRYYKKIGDPEVARTRPHKDVTQEDWNMLCDRWETPDWKVKSNRNKNSRSKLPFNHCAGTKSFLSHREENKNEDGTYMNPIEMFRQT

Query:  HWSTAKGWTDGAASQAHEQMVAL-SQEQASSSTPLSDEEILATVLGTRSSYVKGMGYGPKPPPHQKKASYSQEYVQSLEANLQKSQELLQSQREENEKTQ
        H+S  KGW+       +E+M+ L SQ     S PL+D+EI   VLGTR  YV+G+G+G   PP      Y+ + V+ L      ++   Q   E+ ++ +
Subjt:  HWSTAKGWTDGAASQAHEQMVAL-SQEQASSSTPLSDEEILATVLGTRSSYVKGMGYGPKPPPHQKKASYSQEYVQSLEANLQKSQELLQSQREENEKTQ

Query:  ELLQTQREE-NARMEKRI---EERMEARMKEL
          +++QR++  A+ME ++   +++MEA+M ++
Subjt:  ELLQTQREE-NARMEKRI---EERMEARMKEL

A0A6J1DXU5 uncharacterized protein LOC1110255251.7e-4249.18Show/hide
Query:  EMDEASSNSSQVRGSSRGVGLDRIIETTGNRVSISWTPEQGKPVGRVANMFNTEIGTLSRTFIPLKYHKKKDIPNEIMMNIVDRLLNKFDVNFSEYHVRK
        E ++  +  ++ +G +RG  L R++   G ++ + WT +QG+PVG  +  FN+EIG L+R +I  K  KKK+I       I+  LL KF V+ S+ HV +
Subjt:  EMDEASSNSSQVRGSSRGVGLDRIIETTGNRVSISWTPEQGKPVGRVANMFNTEIGTLSRTFIPLKYHKKKDIPNEIMMNIVDRLLNKFDVNFSEYHVRK

Query:  FILYEIGNRYKDWRARLHRYYKKIGDPEVARTRPHKDVTQEDWNMLCDRWETPDWKVKSNRNKNSRSKLPFNHCAGTKSFLSH
        +ILYEIG R+KD+RA+LHR+YKK  DP  AR +P+KD+ QE WN+LCDRWE+P WK KS RNK +RSKL FNH  G K FL H
Subjt:  FILYEIGNRYKDWRARLHRYYKKIGDPEVARTRPHKDVTQEDWNMLCDRWETPDWKVKSNRNKNSRSKLPFNHCAGTKSFLSH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GATACAATGCTTTCAAATAACGAAGACGAAATCGTAGAAATGGATGAGGCATCCTCTAACAGTTCCCAAGTTCGTGGATCTTCCCGTGGGGTTGGTCTTGATAGGATTAT
TGAGACCACTGGTAACAGAGTTTCTATTTCATGGACTCCAGAGCAAGGTAAACCAGTTGGACGAGTAGCAAATATGTTCAACACTGAAATTGGAACACTATCGAGGACTT
TCATCCCTTTAAAATACCATAAGAAGAAAGACATTCCAAACGAAATCATGATGAACATAGTAGACAGGTTGTTGAACAAATTTGATGTAAACTTCTCGGAGTACCATGTT
AGGAAATTCATCCTTTACGAGATTGGTAATAGATATAAAGATTGGAGAGCAAGATTACATCGATACTACAAAAAGATTGGTGATCCGGAAGTAGCTCGTACACGTCCACA
CAAGGATGTCACTCAAGAAGATTGGAACATGTTATGTGATAGATGGGAGACTCCCGATTGGAAGGTTAAATCAAATAGAAATAAAAATAGCAGAAGTAAACTCCCATTTA
ACCATTGTGCTGGAACGAAATCATTTCTCTCTCATAGAGAAGAAAATAAAAATGAAGATGGTACATATATGAATCCAATTGAGATGTTTCGTCAAACTCATTGGTCTACT
GCAAAAGGATGGACAGACGGTGCAGCAAGTCAAGCACATGAACAAATGGTGGCATTATCTCAAGAGCAAGCTAGCTCAAGTACACCTTTGAGCGATGAAGAAATTTTGGC
CACTGTTCTTGGAACAAGATCGTCTTATGTCAAAGGAATGGGATATGGACCTAAACCACCACCTCATCAGAAAAAAGCATCTTACTCACAGGAGTATGTCCAATCCCTCG
AGGCTAACCTTCAAAAGTCTCAAGAATTACTTCAGAGTCAACGAGAAGAAAATGAAAAGACTCAAGAATTACTACAGACTCAACGTGAGGAAAATGCAAGAATGGAAAAA
CGAATCGAAGAACGAATGGAAGCAAGAATGAAAGAACTTATGGATAGATTCACTGGAGCACACTCATCTAATTAA
mRNA sequenceShow/hide mRNA sequence
GATACAATGCTTTCAAATAACGAAGACGAAATCGTAGAAATGGATGAGGCATCCTCTAACAGTTCCCAAGTTCGTGGATCTTCCCGTGGGGTTGGTCTTGATAGGATTAT
TGAGACCACTGGTAACAGAGTTTCTATTTCATGGACTCCAGAGCAAGGTAAACCAGTTGGACGAGTAGCAAATATGTTCAACACTGAAATTGGAACACTATCGAGGACTT
TCATCCCTTTAAAATACCATAAGAAGAAAGACATTCCAAACGAAATCATGATGAACATAGTAGACAGGTTGTTGAACAAATTTGATGTAAACTTCTCGGAGTACCATGTT
AGGAAATTCATCCTTTACGAGATTGGTAATAGATATAAAGATTGGAGAGCAAGATTACATCGATACTACAAAAAGATTGGTGATCCGGAAGTAGCTCGTACACGTCCACA
CAAGGATGTCACTCAAGAAGATTGGAACATGTTATGTGATAGATGGGAGACTCCCGATTGGAAGGTTAAATCAAATAGAAATAAAAATAGCAGAAGTAAACTCCCATTTA
ACCATTGTGCTGGAACGAAATCATTTCTCTCTCATAGAGAAGAAAATAAAAATGAAGATGGTACATATATGAATCCAATTGAGATGTTTCGTCAAACTCATTGGTCTACT
GCAAAAGGATGGACAGACGGTGCAGCAAGTCAAGCACATGAACAAATGGTGGCATTATCTCAAGAGCAAGCTAGCTCAAGTACACCTTTGAGCGATGAAGAAATTTTGGC
CACTGTTCTTGGAACAAGATCGTCTTATGTCAAAGGAATGGGATATGGACCTAAACCACCACCTCATCAGAAAAAAGCATCTTACTCACAGGAGTATGTCCAATCCCTCG
AGGCTAACCTTCAAAAGTCTCAAGAATTACTTCAGAGTCAACGAGAAGAAAATGAAAAGACTCAAGAATTACTACAGACTCAACGTGAGGAAAATGCAAGAATGGAAAAA
CGAATCGAAGAACGAATGGAAGCAAGAATGAAAGAACTTATGGATAGATTCACTGGAGCACACTCATCTAATTAA
Protein sequenceShow/hide protein sequence
DTMLSNNEDEIVEMDEASSNSSQVRGSSRGVGLDRIIETTGNRVSISWTPEQGKPVGRVANMFNTEIGTLSRTFIPLKYHKKKDIPNEIMMNIVDRLLNKFDVNFSEYHV
RKFILYEIGNRYKDWRARLHRYYKKIGDPEVARTRPHKDVTQEDWNMLCDRWETPDWKVKSNRNKNSRSKLPFNHCAGTKSFLSHREENKNEDGTYMNPIEMFRQTHWST
AKGWTDGAASQAHEQMVALSQEQASSSTPLSDEEILATVLGTRSSYVKGMGYGPKPPPHQKKASYSQEYVQSLEANLQKSQELLQSQREENEKTQELLQTQREENARMEK
RIEERMEARMKELMDRFTGAHSSN