; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G15230 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G15230
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionCCHC-type domain-containing protein
Genome locationChr3:11373456..11374625
RNA-Seq ExpressionCSPI03G15230
SyntenyCSPI03G15230
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR005162 - Retrotransposon gag domain
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK11204.1 reverse transcriptase [Cucumis melo var. makuwa]2.3e-10556.15Show/hide
Query:  MDNLDSLNRRIDGLPAQARIE---DYDRHGGNKGGRRARRNFRHLPNPRNDQRRRQMDAPPRYADDDDQEDYDEWQNAQDHESSSGDERRNIWNGHEECR
        MD L++L RR+DGLPA ARIE   + DR+ GN+GGRRARRNFR+LPN RNDQR+R M+ P RYAD+D  E+Y+ WQN Q+ +SSS DE+ NIWN + + +
Subjt:  MDNLDSLNRRIDGLPAQARIE---DYDRHGGNKGGRRARRNFRHLPNPRNDQRRRQMDAPPRYADDDDQEDYDEWQNAQDHESSSGDERRNIWNGHEECR

Query:  MPQGYRVQEARRETYHDYKMKIDLPTYNGKRDIESFLDWIKNTENFFNYMDTPERKK-----------------------------------KMKKLLKT
        M Q YR  EA+RE +HDYKMKIDLPTY GK D+ESFLDWI+NTENFFNYMDT +RKK                                   KMKKL+K 
Subjt:  MPQGYRVQEARRETYHDYKMKIDLPTYNGKRDIESFLDWIKNTENFFNYMDTPERKK-----------------------------------KMKKLLKT

Query:  RFLPPNYEQTLYNQYQNCRQGSRIVAEYIEEFHRLSARTNLSENEQHQIARFIGGLRFDIKEKVKLQPFRFLSEAISLAETVEEMITARLKNSNRKITWE
        RFLPPNYEQTLYNQYQNCRQGS+ VAEYI+EFH+LSARTNL                                      +TVEEM+ ARL+N+NRK TWE
Subjt:  RFLPPNYEQTLYNQYQNCRQGSRIVAEYIEEFHRLSARTNLSENEQHQIARFIGGLRFDIKEKVKLQPFRFLSEAISLAETVEEMITARLKNSNRKITWE

Query:  TNSTKKQPYNRRTEEQPATSVAEKGKEFDTQEASKKKEGAGRGKSLNNYTRPSLGKCFRCGEPGHLSNNCPQRKTIALAEDEGSDMSKED
        T+ +KKQ Y+ +T EQP+TSVAEK K+ + QEA+KKKE AG+GK  NNY RPSLGKCFRC +  +LSN CPQRKTIALAE+E  D +++D
Subjt:  TNSTKKQPYNRRTEEQPATSVAEKGKEFDTQEASKKKEGAGRGKSLNNYTRPSLGKCFRCGEPGHLSNNCPQRKTIALAEDEGSDMSKED

TYK30863.1 transposon Ty3-I Gag-Pol polyprotein isoform X1 [Cucumis melo var. makuwa]1.1e-12370.55Show/hide
Query:  ARIEDYDRHGGNKGGRRARRNFRHLPNPRNDQRRRQMDAPPRYADDDDQEDYDEWQNAQDHESSSGDERRNIWNGHEECRMPQGYRVQEARRETYHDYKM
        ARIE Y R+  N+GGRRARRN+++ PN    QRRR  D P +YADD+ QE+Y+ WQN QDH+SS GDE+ NIWN   E RM QGYR QEARRETYHDYKM
Subjt:  ARIEDYDRHGGNKGGRRARRNFRHLPNPRNDQRRRQMDAPPRYADDDDQEDYDEWQNAQDHESSSGDERRNIWNGHEECRMPQGYRVQEARRETYHDYKM

Query:  KIDLPTYNGKRDIESFLDWIKNTENFFNYMDTPERKKKMKKLLKTR----FLPPNYEQTL---YNQYQNCRQGSRIVAEYIEEFHRLSARTNLSENEQHQ
        KIDLPTYNGKRDIESFLDWIKNTENFF YM  P+RKK     LK +      P +Y Q +   Y+QYQNCRQGS++VAEYIEEFHRL AR NLSENEQHQ
Subjt:  KIDLPTYNGKRDIESFLDWIKNTENFFNYMDTPERKKKMKKLLKTR----FLPPNYEQTL---YNQYQNCRQGSRIVAEYIEEFHRLSARTNLSENEQHQ

Query:  IARFIGGLRFDIKEKVKLQPFRFLSEAISLAETVEEMITARLKNSNRKITWETNSTKKQPYNRRTEEQPATSVAEKGKEFDTQEASKKKEGAGRGKSLNN
        IARFIGGLRFDIKEKVKL  FR LSEAISLAETVEEM+T RLKNSNR+  WETN +KKQ Y ++T+EQP+TS+ +KGK  D QE +KKKE   RGK+ NN
Subjt:  IARFIGGLRFDIKEKVKLQPFRFLSEAISLAETVEEMITARLKNSNRKITWETNSTKKQPYNRRTEEQPATSVAEKGKEFDTQEASKKKEGAGRGKSLNN

Query:  YTRPSLGKCFRCGEPGHLSNNCPQRKTIALAEDEGSDMSKEDK
        YTRPSLGKCFRCGEPGHLSNNC QRKTIALAEDE + MS  D+
Subjt:  YTRPSLGKCFRCGEPGHLSNNCPQRKTIALAEDEGSDMSKEDK

XP_031741035.1 uncharacterized protein LOC116403692 [Cucumis sativus]9.2e-10758.57Show/hide
Query:  MDNLDSLNRRIDGLPAQARIE---DYDRHGGNKGGRRARRNFRHLPNPRNDQRRRQMDAPPRYADDDDQEDYDEWQNAQDHESSSGDERRNIWNGHEECR
        M++++ LNRR +      R E     D+  G   GRRAR   R++ NPR  QRRR   A  +  D+D QED + WQ  Q+ +SSSGDE+ N+WN ++E R
Subjt:  MDNLDSLNRRIDGLPAQARIE---DYDRHGGNKGGRRARRNFRHLPNPRNDQRRRQMDAPPRYADDDDQEDYDEWQNAQDHESSSGDERRNIWNGHEECR

Query:  MPQGYRVQEARRETYHDYKMKIDLPTYNGKRDIESFLDWIKNTENFFNYMDTPERKK-----------------------------------KMKKLLKT
          +  +  EARR  YHDYKMKIDLP Y+GKR+IE+FLDWIK+TENFFNYMDTPERKK                                   KMKKLLK 
Subjt:  MPQGYRVQEARRETYHDYKMKIDLPTYNGKRDIESFLDWIKNTENFFNYMDTPERKK-----------------------------------KMKKLLKT

Query:  RFLPPNYEQTLYNQYQNCRQGSRIVAEYIEEFHRLSARTNLSENEQHQIARFIGGLRFDIKEKVKLQPFRFLSEAISLAETVEEMITARLKNSNRKITWE
        RFLPPNYEQTLYNQYQNCRQG R VAEYIEEFHRLSARTNLSENEQHQ+ARF+GGLRFDIKEKV+LQPFRFLSEAIS AETVEEMI  R KN NR+  WE
Subjt:  RFLPPNYEQTLYNQYQNCRQGSRIVAEYIEEFHRLSARTNLSENEQHQIARFIGGLRFDIKEKVKLQPFRFLSEAISLAETVEEMITARLKNSNRKITWE

Query:  TNSTKKQPYNRRTEEQPATSVAEKGKEFDTQEAS--KKKEGAGRGKSLNNYTRPSLGKCFRCGEPGHLSNNCPQRKTIALAEDEGSDMSKE
        TNSTK      +T +QP+TS   KGKE D QE +  +KKE   +    N+Y+RPSLGKCFRCG+ GHLS+NCPQRKTIA+AE EG  +S++
Subjt:  TNSTKKQPYNRRTEEQPATSVAEKGKEFDTQEAS--KKKEGAGRGKSLNNYTRPSLGKCFRCGEPGHLSNNCPQRKTIALAEDEGSDMSKE

XP_031743026.1 uncharacterized protein LOC116404533 [Cucumis sativus]1.1e-10758.63Show/hide
Query:  MDNLDSLNRRIDGLPAQARIE---DYDRHGGNKGGRRARRNFRHLPNPRNDQRRRQMDAPPRYADDDDQEDYDEWQNAQDHESSSGDERRNIWNGHEECR
        M++++ LNRR +      R E     D+  G   GRRAR   R++ NPR  QRRR   A  +  D+D QED ++WQ  Q+ +SSSGDE+ N+WN ++E R
Subjt:  MDNLDSLNRRIDGLPAQARIE---DYDRHGGNKGGRRARRNFRHLPNPRNDQRRRQMDAPPRYADDDDQEDYDEWQNAQDHESSSGDERRNIWNGHEECR

Query:  MPQGYRVQEARRETYHDYKMKIDLPTYNGKRDIESFLDWIKNTENFFNYMDTPERKK-----------------------------------KMKKLLKT
          +  R  EARR  YHDYKMKIDLP Y GKR+IE+FLDWIK+TENFF YMDTPERKK                                   KMKKLLK 
Subjt:  MPQGYRVQEARRETYHDYKMKIDLPTYNGKRDIESFLDWIKNTENFFNYMDTPERKK-----------------------------------KMKKLLKT

Query:  RFLPPNYEQTLYNQYQNCRQGSRIVAEYIEEFHRLSARTNLSENEQHQIARFIGGLRFDIKEKVKLQPFRFLSEAISLAETVEEMITARLKNSNRKITWE
        RFLPPNYEQTLYNQYQNCRQG R VAEYIEEFHRLSARTNLSENEQHQ+ARF+GGLRFDIKEKV+LQPFRFLSEAIS AETVEEMI  R KN NR+  WE
Subjt:  RFLPPNYEQTLYNQYQNCRQGSRIVAEYIEEFHRLSARTNLSENEQHQIARFIGGLRFDIKEKVKLQPFRFLSEAISLAETVEEMITARLKNSNRKITWE

Query:  TNSTKKQPYNRRTEEQPATSVAEKGKEFDTQEAS--KKKEGAGRGKSLNNYTRPSLGKCFRCGEPGHLSNNCPQRKTIALAEDEGSDMSKEDKV
        TNSTK      +T +QP+TS   KGKE D QE +  +KKE   +    NNY+RPSLGKCFRCG+ GHLSNNCPQRKTIA+AE+ G   + ED +
Subjt:  TNSTKKQPYNRRTEEQPATSVAEKGKEFDTQEAS--KKKEGAGRGKSLNNYTRPSLGKCFRCGEPGHLSNNCPQRKTIALAEDEGSDMSKEDKV

XP_031745523.1 uncharacterized protein LOC116405899 [Cucumis sativus]6.6e-8966.29Show/hide
Query:  MKIDLPTYNGKRDIESFLDWIKNTENFFNYMDTPERKK-----------------------------------KMKKLLKTRFLPPNYEQTLYNQYQNCR
        MK+DLP+Y+GKRDIESFLDW+K+TENFF+YMDTPE+KK                                   KMKKLLK RFLPPNYEQTLYNQYQNCR
Subjt:  MKIDLPTYNGKRDIESFLDWIKNTENFFNYMDTPERKK-----------------------------------KMKKLLKTRFLPPNYEQTLYNQYQNCR

Query:  QGSRIVAEYIEEFHRLSARTNLSENEQHQIARFIGGLRFDIKEKVKLQPFRFLSEAISLAETVEEMITARLKNSNRKITWETNSTKKQPYNRRTEEQPAT
        QG+R V EYIEEFHRLSARTNLSENEQHQIARF+GGLRFDIKEKVKLQP RFLSEAISLAETVEEMI  + K  NR+ TWE   TKK  Y  +T +QP  
Subjt:  QGSRIVAEYIEEFHRLSARTNLSENEQHQIARFIGGLRFDIKEKVKLQPFRFLSEAISLAETVEEMITARLKNSNRKITWETNSTKKQPYNRRTEEQPAT

Query:  SVAEKGKEFDTQEAS--KKKEGAGRGKSLNNYTRPSLGKCFRCGEPGHLSNNCPQRKTIALAEDEGS
         +  KGKE D+Q A+  KK E   + K+ NNYTRPSLGKCFRCG+PGHLSN+CPQRKTIALAE+EG+
Subjt:  SVAEKGKEFDTQEAS--KKKEGAGRGKSLNNYTRPSLGKCFRCGEPGHLSNNCPQRKTIALAEDEGS

TrEMBL top hitse value%identityAlignment
A0A5A7T256 Reverse transcriptase1.1e-7077.78Show/hide
Query:  YNQYQNCRQGSRIVAEYIEEFHRLSARTNLSENEQHQIARFIGGLRFDIKEKVKLQPFRFLSEAISLAETVEEMITARLKNSNRKITWETNSTKKQPYNR
        Y+QYQNCRQGS+ VAEYIEEFHRLSAR NLSENEQHQIARFIGGLRFDIKEKVKL  FR LSEAISLAETVEEM+T RLKNSNR+  WETN +KKQ Y +
Subjt:  YNQYQNCRQGSRIVAEYIEEFHRLSARTNLSENEQHQIARFIGGLRFDIKEKVKLQPFRFLSEAISLAETVEEMITARLKNSNRKITWETNSTKKQPYNR

Query:  RTEEQPATSVAEKGKEFDTQEASKKKEGAGRGKSLNNYTRPSLGKCFRCGEPGHLSNNCPQRKTIALAEDEGSDMSKEDK
        +T+EQP+TS+ +KGK  D QE +KKKE   RGK+ NNYTRPSLGKCFRCGEPGHLSNNC QRKTIALAEDE + MS  D+
Subjt:  RTEEQPATSVAEKGKEFDTQEASKKKEGAGRGKSLNNYTRPSLGKCFRCGEPGHLSNNCPQRKTIALAEDEGSDMSKEDK

A0A5A7UXS4 CCHC-type domain-containing protein5.3e-7658.09Show/hide
Query:  MKIDLPTYNGKRDIESFLDWIKNTENFFNYMDTPERKK-----------------------------------KMKKLLKTRFLPPNYEQTLYNQYQNCR
        MKIDLP YNGKRD ESFLDW+K+T+NFFNYMDT +RKK                                   KMKKLLK RFLPPNYEQT+YNQYQNC 
Subjt:  MKIDLPTYNGKRDIESFLDWIKNTENFFNYMDTPERKK-----------------------------------KMKKLLKTRFLPPNYEQTLYNQYQNCR

Query:  QGSRIVAEYIEEFHRLSARTNLSENEQHQIARFIGGLRFDIKEKVKLQPFRFLSEAISLAETVEEMITARLKNSNRKITWETNSTKKQPYNRRTEEQPAT
        QGSR +AEYIEEFHRLSARTNL ENEQHQIARFIG                         ETVEEM+ A LK+SNRK TW+ N +KKQ Y+ RT EQP+T
Subjt:  QGSRIVAEYIEEFHRLSARTNLSENEQHQIARFIGGLRFDIKEKVKLQPFRFLSEAISLAETVEEMITARLKNSNRKITWETNSTKKQPYNRRTEEQPAT

Query:  SVAEKGKEFDTQEASKKKEGAGRGKSLNNYTRPSLGKCFRCGEPGHLSNNCPQRKTIALAEDEGSDMSKEDK
        SV  K K+ DTQ+A+KKK+   +GKS N YTRPSL KCFRCG+ GHLSNNCPQR+TI+LA+ E + +S++DK
Subjt:  SVAEKGKEFDTQEASKKKEGAGRGKSLNNYTRPSLGKCFRCGEPGHLSNNCPQRKTIALAEDEGSDMSKEDK

A0A5D3CJ99 Reverse transcriptase1.1e-10556.15Show/hide
Query:  MDNLDSLNRRIDGLPAQARIE---DYDRHGGNKGGRRARRNFRHLPNPRNDQRRRQMDAPPRYADDDDQEDYDEWQNAQDHESSSGDERRNIWNGHEECR
        MD L++L RR+DGLPA ARIE   + DR+ GN+GGRRARRNFR+LPN RNDQR+R M+ P RYAD+D  E+Y+ WQN Q+ +SSS DE+ NIWN + + +
Subjt:  MDNLDSLNRRIDGLPAQARIE---DYDRHGGNKGGRRARRNFRHLPNPRNDQRRRQMDAPPRYADDDDQEDYDEWQNAQDHESSSGDERRNIWNGHEECR

Query:  MPQGYRVQEARRETYHDYKMKIDLPTYNGKRDIESFLDWIKNTENFFNYMDTPERKK-----------------------------------KMKKLLKT
        M Q YR  EA+RE +HDYKMKIDLPTY GK D+ESFLDWI+NTENFFNYMDT +RKK                                   KMKKL+K 
Subjt:  MPQGYRVQEARRETYHDYKMKIDLPTYNGKRDIESFLDWIKNTENFFNYMDTPERKK-----------------------------------KMKKLLKT

Query:  RFLPPNYEQTLYNQYQNCRQGSRIVAEYIEEFHRLSARTNLSENEQHQIARFIGGLRFDIKEKVKLQPFRFLSEAISLAETVEEMITARLKNSNRKITWE
        RFLPPNYEQTLYNQYQNCRQGS+ VAEYI+EFH+LSARTNL                                      +TVEEM+ ARL+N+NRK TWE
Subjt:  RFLPPNYEQTLYNQYQNCRQGSRIVAEYIEEFHRLSARTNLSENEQHQIARFIGGLRFDIKEKVKLQPFRFLSEAISLAETVEEMITARLKNSNRKITWE

Query:  TNSTKKQPYNRRTEEQPATSVAEKGKEFDTQEASKKKEGAGRGKSLNNYTRPSLGKCFRCGEPGHLSNNCPQRKTIALAEDEGSDMSKED
        T+ +KKQ Y+ +T EQP+TSVAEK K+ + QEA+KKKE AG+GK  NNY RPSLGKCFRC +  +LSN CPQRKTIALAE+E  D +++D
Subjt:  TNSTKKQPYNRRTEEQPATSVAEKGKEFDTQEASKKKEGAGRGKSLNNYTRPSLGKCFRCGEPGHLSNNCPQRKTIALAEDEGSDMSKED

A0A5D3DGR0 Reverse transcriptase1.1e-6244.53Show/hide
Query:  LDSLNRRIDGLPAQARIEDYDRHGGNKGGRRARRNFRHLPNPRNDQRRRQMDAPPRYADDDDQEDYDEWQNAQDHESSSGDERRNIWNGHEECRMPQGYR
        L+ +  R+D   AQ    D +       GRR RR   +    RN Q  R +           ++   EWQ  ++   +S     +  +   E R  +  +
Subjt:  LDSLNRRIDGLPAQARIEDYDRHGGNKGGRRARRNFRHLPNPRNDQRRRQMDAPPRYADDDDQEDYDEWQNAQDHESSSGDERRNIWNGHEECRMPQGYR

Query:  VQEARRETYHDYKMKIDLPTYNGKRDIESFLDWIKNTENFFNYMDTPERKK-----------------------------------KMKKLLKTRFLPPN
         +  +RE   +YKMKIDLP+Y+GKR+IE+FLDW+KNTENFF YM T + KK                                   KMKKL+K RF+PPN
Subjt:  VQEARRETYHDYKMKIDLPTYNGKRDIESFLDWIKNTENFFNYMDTPERKK-----------------------------------KMKKLLKTRFLPPN

Query:  YEQTLYNQYQNCRQGSRIVAEYIEEFHRLSARTNLSENEQHQIARFIGGLRFDIKEKVKLQPFRFLSEAISLAETVEEMITARLKNSNRKITWETNSTKK
        YEQTLY QYQNCRQG R  AEYIEEFHRL  RTNL E E+H I+ F+GGLRFD+KEKVKLQPF+ LSEAI+ AETVEEMI  R K S RK  WE +++KK
Subjt:  YEQTLYNQYQNCRQGSRIVAEYIEEFHRLSARTNLSENEQHQIARFIGGLRFDIKEKVKLQPFRFLSEAISLAETVEEMITARLKNSNRKITWETNSTKK

Query:  QPYNRRTEEQPATSVAEKGKEFDTQEASKKKE-GAGRGKSLNNYTRPSLGKCFRCGEPGHLSNNCPQRKTIALAE--DEGSDMS
              T        A   K  + +E+S KKE   G  K  N Y RP  G C+RCG+ GH SN CPQRKTIA+A+  D+GS+ S
Subjt:  QPYNRRTEEQPATSVAEKGKEFDTQEASKKKE-GAGRGKSLNNYTRPSLGKCFRCGEPGHLSNNCPQRKTIALAE--DEGSDMS

A0A5D3E417 Transposon Ty3-I Gag-Pol polyprotein isoform X15.2e-12470.55Show/hide
Query:  ARIEDYDRHGGNKGGRRARRNFRHLPNPRNDQRRRQMDAPPRYADDDDQEDYDEWQNAQDHESSSGDERRNIWNGHEECRMPQGYRVQEARRETYHDYKM
        ARIE Y R+  N+GGRRARRN+++ PN    QRRR  D P +YADD+ QE+Y+ WQN QDH+SS GDE+ NIWN   E RM QGYR QEARRETYHDYKM
Subjt:  ARIEDYDRHGGNKGGRRARRNFRHLPNPRNDQRRRQMDAPPRYADDDDQEDYDEWQNAQDHESSSGDERRNIWNGHEECRMPQGYRVQEARRETYHDYKM

Query:  KIDLPTYNGKRDIESFLDWIKNTENFFNYMDTPERKKKMKKLLKTR----FLPPNYEQTL---YNQYQNCRQGSRIVAEYIEEFHRLSARTNLSENEQHQ
        KIDLPTYNGKRDIESFLDWIKNTENFF YM  P+RKK     LK +      P +Y Q +   Y+QYQNCRQGS++VAEYIEEFHRL AR NLSENEQHQ
Subjt:  KIDLPTYNGKRDIESFLDWIKNTENFFNYMDTPERKKKMKKLLKTR----FLPPNYEQTL---YNQYQNCRQGSRIVAEYIEEFHRLSARTNLSENEQHQ

Query:  IARFIGGLRFDIKEKVKLQPFRFLSEAISLAETVEEMITARLKNSNRKITWETNSTKKQPYNRRTEEQPATSVAEKGKEFDTQEASKKKEGAGRGKSLNN
        IARFIGGLRFDIKEKVKL  FR LSEAISLAETVEEM+T RLKNSNR+  WETN +KKQ Y ++T+EQP+TS+ +KGK  D QE +KKKE   RGK+ NN
Subjt:  IARFIGGLRFDIKEKVKLQPFRFLSEAISLAETVEEMITARLKNSNRKITWETNSTKKQPYNRRTEEQPATSVAEKGKEFDTQEASKKKEGAGRGKSLNN

Query:  YTRPSLGKCFRCGEPGHLSNNCPQRKTIALAEDEGSDMSKEDK
        YTRPSLGKCFRCGEPGHLSNNC QRKTIALAEDE + MS  D+
Subjt:  YTRPSLGKCFRCGEPGHLSNNCPQRKTIALAEDEGSDMSKEDK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G47350.1 F-box associated ubiquitination effector family protein4.1e-0426.32Show/hide
Query:  LYNQYQNCRQGSRIVAEYIEEFHRLSARTNLSENEQHQIARFIGGLRFDIKEKVKLQPFRFLSEAISLAETVEEMITARLKNSNRKITWETNSTKKQPYN
        +YN+ QN R  +R V EY EEF+ L    ++++++   ++R IG LR  ++  +       +SEA   A + E+ +        R  +W   +T+ +   
Subjt:  LYNQYQNCRQGSRIVAEYIEEFHRLSARTNLSENEQHQIARFIGGLRFDIKEKVKLQPFRFLSEAISLAETVEEMITARLKNSNRKITWETNSTKKQPYN

Query:  RRTEEQPATSVAEK
        ++T   P T++A +
Subjt:  RRTEEQPATSVAEK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATAACTTGGACTCCCTCAATCGAAGAATAGACGGCCTGCCGGCACAGGCAAGGATAGAAGACTATGATCGGCATGGAGGAAACAAAGGAGGACGACGGGCACGAAG
GAATTTTAGACACCTGCCCAATCCAAGAAACGATCAAAGGAGGAGACAAATGGATGCCCCACCACGATACGCTGACGACGATGACCAAGAAGACTATGACGAGTGGCAAA
ACGCGCAAGACCACGAATCATCAAGTGGAGACGAGCGAAGAAATATTTGGAACGGCCACGAAGAATGCCGAATGCCCCAAGGGTACAGAGTGCAGGAGGCACGACGAGAA
ACTTATCATGACTACAAAATGAAAATTGACTTACCAACGTACAACGGCAAGCGCGATATTGAATCTTTCTTGGACTGGATTAAAAATACAGAGAACTTCTTCAACTATAT
GGATACACCCGAGAGAAAAAAGAAGATGAAGAAGCTCTTGAAGACACGCTTTCTGCCACCGAACTACGAACAAACATTGTATAATCAATATCAGAATTGCCGCCAAGGGA
GCCGAATTGTGGCAGAATATATTGAAGAATTCCACAGATTGAGCGCAAGAACCAATCTGAGTGAGAACGAACAGCATCAGATTGCAAGGTTCATTGGCGGATTACGATTC
GATATCAAGGAAAAGGTAAAGTTACAGCCCTTTCGCTTCTTGTCGGAAGCTATTTCTCTTGCGGAGACAGTAGAGGAAATGATCACAGCACGATTGAAGAACTCTAACAG
AAAGATTACATGGGAGACGAACTCCACCAAGAAGCAACCTTACAACAGGAGGACGGAGGAACAACCAGCAACATCAGTGGCTGAGAAGGGTAAAGAGTTCGATACTCAAG
AGGCAAGCAAAAAGAAAGAAGGAGCAGGCAGGGGGAAGAGTCTAAACAATTACACTCGCCCGTCCTTAGGGAAGTGTTTTCGATGTGGTGAACCTGGCCACTTATCCAAC
AACTGCCCCCAAAGGAAAACAATAGCACTAGCTGAAGATGAAGGCAGTGATATGAGTAAAGAAGACAAAGTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATAACTTGGACTCCCTCAATCGAAGAATAGACGGCCTGCCGGCACAGGCAAGGATAGAAGACTATGATCGGCATGGAGGAAACAAAGGAGGACGACGGGCACGAAG
GAATTTTAGACACCTGCCCAATCCAAGAAACGATCAAAGGAGGAGACAAATGGATGCCCCACCACGATACGCTGACGACGATGACCAAGAAGACTATGACGAGTGGCAAA
ACGCGCAAGACCACGAATCATCAAGTGGAGACGAGCGAAGAAATATTTGGAACGGCCACGAAGAATGCCGAATGCCCCAAGGGTACAGAGTGCAGGAGGCACGACGAGAA
ACTTATCATGACTACAAAATGAAAATTGACTTACCAACGTACAACGGCAAGCGCGATATTGAATCTTTCTTGGACTGGATTAAAAATACAGAGAACTTCTTCAACTATAT
GGATACACCCGAGAGAAAAAAGAAGATGAAGAAGCTCTTGAAGACACGCTTTCTGCCACCGAACTACGAACAAACATTGTATAATCAATATCAGAATTGCCGCCAAGGGA
GCCGAATTGTGGCAGAATATATTGAAGAATTCCACAGATTGAGCGCAAGAACCAATCTGAGTGAGAACGAACAGCATCAGATTGCAAGGTTCATTGGCGGATTACGATTC
GATATCAAGGAAAAGGTAAAGTTACAGCCCTTTCGCTTCTTGTCGGAAGCTATTTCTCTTGCGGAGACAGTAGAGGAAATGATCACAGCACGATTGAAGAACTCTAACAG
AAAGATTACATGGGAGACGAACTCCACCAAGAAGCAACCTTACAACAGGAGGACGGAGGAACAACCAGCAACATCAGTGGCTGAGAAGGGTAAAGAGTTCGATACTCAAG
AGGCAAGCAAAAAGAAAGAAGGAGCAGGCAGGGGGAAGAGTCTAAACAATTACACTCGCCCGTCCTTAGGGAAGTGTTTTCGATGTGGTGAACCTGGCCACTTATCCAAC
AACTGCCCCCAAAGGAAAACAATAGCACTAGCTGAAGATGAAGGCAGTGATATGAGTAAAGAAGACAAAGTTTAA
Protein sequenceShow/hide protein sequence
MDNLDSLNRRIDGLPAQARIEDYDRHGGNKGGRRARRNFRHLPNPRNDQRRRQMDAPPRYADDDDQEDYDEWQNAQDHESSSGDERRNIWNGHEECRMPQGYRVQEARRE
TYHDYKMKIDLPTYNGKRDIESFLDWIKNTENFFNYMDTPERKKKMKKLLKTRFLPPNYEQTLYNQYQNCRQGSRIVAEYIEEFHRLSARTNLSENEQHQIARFIGGLRF
DIKEKVKLQPFRFLSEAISLAETVEEMITARLKNSNRKITWETNSTKKQPYNRRTEEQPATSVAEKGKEFDTQEASKKKEGAGRGKSLNNYTRPSLGKCFRCGEPGHLSN
NCPQRKTIALAEDEGSDMSKEDKV