CuGenDBv2

CSPI01G11050 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI01G11050
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: Chr1:6915177..6917432
RNA-Seq Expression: CSPI01G11050
Synteny: CSPI01G11050
Gene Ontology terms: NA
InterPro domains: NA
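The genome location above can be split into a chromosome name and two coordinates. The sketch below is illustrative only: the names `Location` and `parse_location` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed location string such as 'Chr1:6915177..6917432'."""
    chrom: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: both coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse 'CHROM:START..END' into a Location (hypothetical helper)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end = match.groups()
    return Location(chrom, int(start), int(end))


loc = parse_location("Chr1:6915177..6917432")
print(loc.chrom, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption the gene spans 2256 bp on Chr1.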


Homology
GenBank top hits
KAE8652677.1 hypothetical protein Csa_014062 [Cucumis sativus] (e value: 6.5e-113, identity: 99.56%)
Query:  MAMEAPTNILIHPSLSAPISPSHQTLLTSLHLLNASAFDPPTPNTFPPPPLDAPPPAQSSTKSNPSPSPSSSLSSPSTPLPPPTSRRSPISVGNRSAPVQ
        MAMEAPTNILIHPSLSAPISPSHQTLLTSLHLLNASAFDPPTPNTFPPPPLDAPPPAQSSTKSNPSPSPSSSLSSPSTPLPPPTSRRSPISVGNRSAPVQ
Subjt:  MAMEAPTNILIHPSLSAPISPSHQTLLTSLHLLNASAFDPPTPNTFPPPPLDAPPPAQSSTKSNPSPSPSSSLSSPSTPLPPPTSRRSPISVGNRSAPVQ

Query:  IVAIVIATTIIFIAALVGGFCYLRRRVLQNPPLVASEVVNLIMSRELWLALEELYGATSKSPYKSIGIILQNTRKGTMRMTEYLSMMKQTLENMQLAGSP
        IVAIVIATTIIFI ALVGGFCYLRRRVLQNPPLVASEVVNLIMSRELWLALEELYGATSKSPYKSIGIILQNTRKGTMRMTEYLSMMKQTLENMQLAGSP
Subjt:  IVAIVIATTIIFIAALVGGFCYLRRRVLQNPPLVASEVVNLIMSRELWLALEELYGATSKSPYKSIGIILQNTRKGTMRMTEYLSMMKQTLENMQLAGSP

Query:  ISHEDLFSYVLVGLDVEYIPIVCDIE
        ISHEDLFSYVLVGLDVEYIPIVCDIE
Subjt:  ISHEDLFSYVLVGLDVEYIPIVCDIE

TXG55646.1 hypothetical protein EZV62_020902 [Acer yangbiense] (e value: 2.9e-36, identity: 38.08%)
Query:  VASEVVNLIMSRELWLALEELYGATSKSPYKSIGIILQNTRKGTMRMTEYLSMMKQTLENMQLAGSPISHEDLFSYVLVGLDVEYIPIVCDIEAGK-LAM
        VA  V+    +  LW ALE L+GA SKS   +I   +Q TRKG+  M EYL+ MK   +++ +AG P     LF+ +L GLD EY+PIV  IEA +    
Subjt:  VASEVVNLIMSRELWLALEELYGATSKSPYKSIGIILQNTRKGTMRMTEYLSMMKQTLENMQLAGSPISHEDLFSYVLVGLDVEYIPIVCDIEAGK-LAM

Query:  LRMSNNIVFQDIALEIITTILPLGF------------------NNNQSQGRGN-NVNGDTLQN----RQGGGRFNRH--RDNNCKPTCALFGKFSHGVVV
          + + ++  D  LE I  +   G                   N N++  + N N  G+   N    R GGGRF     R+NN +PTC + GKF H   V
Subjt:  LRMSNNIVFQDIALEIITTILPLGF------------------NNNQSQGRGN-NVNGDTLQN----RQGGGRFNRH--RDNNCKPTCALFGKFSHGVVV

Query:  CYHRFEEEFNNPIRGNNIQVNNNTTAFIATPEIVTYPNWSEDSGAINHVTNDFGNLQFKTKYYDTGSLTVGDGTKLTIAHV
        CY R+++ +   +   N    N+ + F+ATPE V    W  DSGA NHVTND GNL  K+ Y    SL VG+G +L I+HV
Subjt:  CYHRFEEEFNNPIRGNNIQVNNNTTAFIATPEIVTYPNWSEDSGAINHVTNDFGNLQFKTKYYDTGSLTVGDGTKLTIAHV

XP_022142770.1 uncharacterized protein LOC111012809 [Momordica charantia] (e value: 1.7e-41, identity: 36.99%)
Query:  PLVASEVVNLIMSRELWLALEELYGATSKSPYKSIGIILQNTRKGTMRMTEYLSMMKQTLENMQLAGSPISHEDLFSYVLVGLDVEYIPIVCDIEAGKLA
        P++A +V+++  SR++W ALE+LY   +K+    +   LQ TRK  ++M++YLS MKQ  + + LAG PIS   L S VL GL+ EY+ I+C I A    
Subjt:  PLVASEVVNLIMSRELWLALEELYGATSKSPYKSIGIILQNTRKGTMRMTEYLSMMKQTLENMQLAGSPISHEDLFSYVLVGLDVEYIPIVCDIEAGKLA

Query:  MLRMSNNIVFQDIALEIITTILPLGFNNN---------------------------QSQGRGNNVNGDTLQNRQGGGRFNRHRDNNCKPTCALFGKFSHG
              NI +Q++   +IT    L   NN                           Q QGRG   N    + R  GGRF   R N+ +PTC + GK  H 
Subjt:  MLRMSNNIVFQDIALEIITTILPLGFNNN---------------------------QSQGRGNNVNGDTLQNRQGGGRFNRHRDNNCKPTCALFGKFSHG

Query:  VVVCYHRFEEEF--NNPIRGNNIQVNNNTTAFIATPEIVTYPNWSEDSGAINHVTNDFGNLQFKTKYYDTGSLTVGDGTKLTIAHV------------KE
         VVCYHR   ++  N P  GN         A+I  PE++  PNW  DSGA NH TND  NL  + +Y    +LTVG+  KL IAHV            KE
Subjt:  VVVCYHRFEEEF--NNPIRGNNIQVNNNTTAFIATPEIVTYPNWSEDSGAINHVTNDFGNLQFKTKYYDTGSLTVGDGTKLTIAHV------------KE

Query:  ITKVVPQGTLKNGLYQLSL
          +++ +G L  GLYQL L
Subjt:  ITKVVPQGTLKNGLYQLSL

XP_022157748.1 uncharacterized protein LOC111024384 isoform X1 [Momordica charantia] (e value: 7.1e-43, identity: 39.85%)
Query:  PLVASEVVNLIMSRELWLALEELYGATSKSPYKSIGIILQNTRKGTMRMTEYLSMMKQTLENMQLAGSPISHEDLFSYVLVGLDVEYIPIVCDIEAGKLA
        P +A +VV+   SRE+W ALE+LYGATSK+    +  +LQNT+K +++M+EYL +MKQ  E+++LAG P++   L S VL GL+ EY+PIVC IE     
Subjt:  PLVASEVVNLIMSRELWLALEELYGATSKSPYKSIGIILQNTRKGTMRMTEYLSMMKQTLENMQLAGSPISHEDLFSYVLVGLDVEYIPIVCDIEAGKLA

Query:  MLR--MSNNIVFQD--IALEIITTILPLGFN-----------------------NNQSQGRGNNVNGDTLQN--RQGGGRFNRHRDNNCKPTCALFGKFS
          +   +  + F++  + L I++T    G +                       + Q QGRG+  + D   N   +G GRF+ +R NN KP+C L GK+ 
Subjt:  MLR--MSNNIVFQD--IALEIITTILPLGFN-----------------------NNQSQGRGNNVNGDTLQN--RQGGGRFNRHRDNNCKPTCALFGKFS

Query:  HGVVVCYHRFEEEFNNPIRGNNIQVNNNTTAFIATPEIVTYPNWSEDSGAINHVTNDFGNLQFKTKYYDTG
        H   VCY RF+E FNN    N    NN  +A++A PEIV  P+W  DSGA +HVT+D  NL  K+ Y   G
Subjt:  HGVVVCYHRFEEEFNNPIRGNNIQVNNNTTAFIATPEIVTYPNWSEDSGAINHVTNDFGNLQFKTKYYDTG

XP_022157750.1 uncharacterized protein LOC111024384 isoform X2 [Momordica charantia] (e value: 5.4e-43, identity: 39.71%)
Query:  PLVASEVVNLIMSRELWLALEELYGATSKSPYKSIGIILQNTRKGTMRMTEYLSMMKQTLENMQLAGSPISHEDLFSYVLVGLDVEYIPIVCDIEAGKLA
        P +A +VV+   SRE+W ALE+LYGATSK+    +  +LQNT+K +++M+EYL +MKQ  E+++LAG P++   L S VL GL+ EY+PIVC IE     
Subjt:  PLVASEVVNLIMSRELWLALEELYGATSKSPYKSIGIILQNTRKGTMRMTEYLSMMKQTLENMQLAGSPISHEDLFSYVLVGLDVEYIPIVCDIEAGKLA

Query:  MLR--MSNNIVFQD--IALEIITTILPLGFN-----------------------NNQSQGRGNNVNGDTLQN--RQGGGRFNRHRDNNCKPTCALFGKFS
          +   +  + F++  + L I++T    G +                       + Q QGRG+  + D   N   +G GRF+ +R NN KP+C L GK+ 
Subjt:  MLR--MSNNIVFQD--IALEIITTILPLGFN-----------------------NNQSQGRGNNVNGDTLQN--RQGGGRFNRHRDNNCKPTCALFGKFS

Query:  HGVVVCYHRFEEEFNNPIRGNNIQVNNNTTAFIATPEIVTYPNWSEDSGAINHVTNDFGNLQFKTKYYDTGS
        H   VCY RF+E FNN    N    NN  +A++A PEIV  P+W  DSGA +HVT+D  NL  K+ Y   G+
Subjt:  HGVVVCYHRFEEEFNNPIRGNNIQVNNNTTAFIATPEIVTYPNWSEDSGAINHVTNDFGNLQFKTKYYDTGS

TrEMBL top hits
A0A0A0LUB0 Uncharacterized protein (e value: 3.2e-126, identity: 99.6%)
Query:  MKRYSIRFVNMRSFVVFLVSSLVSLSMAMEAPTNILIHPSLSAPISPSHQTLLTSLHLLNASAFDPPTPNTFPPPPLDAPPPAQSSTKSNPSPSPSSSLS
        MKRYSIRFVNMRSFVVFLVSSLVSLSMAMEAPTNILIHPSLSAPISPSHQTLLTSLHLLNASAFDPPTPNTFPPPPLDAPPPAQSSTKSNPSPSPSSSLS
Subjt:  MKRYSIRFVNMRSFVVFLVSSLVSLSMAMEAPTNILIHPSLSAPISPSHQTLLTSLHLLNASAFDPPTPNTFPPPPLDAPPPAQSSTKSNPSPSPSSSLS

Query:  SPSTPLPPPTSRRSPISVGNRSAPVQIVAIVIATTIIFIAALVGGFCYLRRRVLQNPPLVASEVVNLIMSRELWLALEELYGATSKSPYKSIGIILQNTR
        SPSTPLPPPTSRRSPISVGNRSAPVQIVAIVIATTIIFI ALVGGFCYLRRRVLQNPPLVASEVVNLIMSRELWLALEELYGATSKSPYKSIGIILQNTR
Subjt:  SPSTPLPPPTSRRSPISVGNRSAPVQIVAIVIATTIIFIAALVGGFCYLRRRVLQNPPLVASEVVNLIMSRELWLALEELYGATSKSPYKSIGIILQNTR

Query:  KGTMRMTEYLSMMKQTLENMQLAGSPISHEDLFSYVLVGLDVEYIPIVCDIE
        KGTMRMTEYLSMMKQTLENMQLAGSPISHEDLFSYVLVGLDVEYIPIVCDIE
Subjt:  KGTMRMTEYLSMMKQTLENMQLAGSPISHEDLFSYVLVGLDVEYIPIVCDIE

A0A5C7HHE9 Uncharacterized protein (e value: 1.4e-36, identity: 38.08%)
Query:  VASEVVNLIMSRELWLALEELYGATSKSPYKSIGIILQNTRKGTMRMTEYLSMMKQTLENMQLAGSPISHEDLFSYVLVGLDVEYIPIVCDIEAGK-LAM
        VA  V+    +  LW ALE L+GA SKS   +I   +Q TRKG+  M EYL+ MK   +++ +AG P     LF+ +L GLD EY+PIV  IEA +    
Subjt:  VASEVVNLIMSRELWLALEELYGATSKSPYKSIGIILQNTRKGTMRMTEYLSMMKQTLENMQLAGSPISHEDLFSYVLVGLDVEYIPIVCDIEAGK-LAM

Query:  LRMSNNIVFQDIALEIITTILPLGF------------------NNNQSQGRGN-NVNGDTLQN----RQGGGRFNRH--RDNNCKPTCALFGKFSHGVVV
          + + ++  D  LE I  +   G                   N N++  + N N  G+   N    R GGGRF     R+NN +PTC + GKF H   V
Subjt:  LRMSNNIVFQDIALEIITTILPLGF------------------NNNQSQGRGN-NVNGDTLQN----RQGGGRFNRH--RDNNCKPTCALFGKFSHGVVV

Query:  CYHRFEEEFNNPIRGNNIQVNNNTTAFIATPEIVTYPNWSEDSGAINHVTNDFGNLQFKTKYYDTGSLTVGDGTKLTIAHV
        CY R+++ +   +   N    N+ + F+ATPE V    W  DSGA NHVTND GNL  K+ Y    SL VG+G +L I+HV
Subjt:  CYHRFEEEFNNPIRGNNIQVNNNTTAFIATPEIVTYPNWSEDSGAINHVTNDFGNLQFKTKYYDTGSLTVGDGTKLTIAHV

A0A6J1CLV9 uncharacterized protein LOC111012809 (e value: 8.4e-42, identity: 36.99%)
Query:  PLVASEVVNLIMSRELWLALEELYGATSKSPYKSIGIILQNTRKGTMRMTEYLSMMKQTLENMQLAGSPISHEDLFSYVLVGLDVEYIPIVCDIEAGKLA
        P++A +V+++  SR++W ALE+LY   +K+    +   LQ TRK  ++M++YLS MKQ  + + LAG PIS   L S VL GL+ EY+ I+C I A    
Subjt:  PLVASEVVNLIMSRELWLALEELYGATSKSPYKSIGIILQNTRKGTMRMTEYLSMMKQTLENMQLAGSPISHEDLFSYVLVGLDVEYIPIVCDIEAGKLA

Query:  MLRMSNNIVFQDIALEIITTILPLGFNNN---------------------------QSQGRGNNVNGDTLQNRQGGGRFNRHRDNNCKPTCALFGKFSHG
              NI +Q++   +IT    L   NN                           Q QGRG   N    + R  GGRF   R N+ +PTC + GK  H 
Subjt:  MLRMSNNIVFQDIALEIITTILPLGFNNN---------------------------QSQGRGNNVNGDTLQNRQGGGRFNRHRDNNCKPTCALFGKFSHG

Query:  VVVCYHRFEEEF--NNPIRGNNIQVNNNTTAFIATPEIVTYPNWSEDSGAINHVTNDFGNLQFKTKYYDTGSLTVGDGTKLTIAHV------------KE
         VVCYHR   ++  N P  GN         A+I  PE++  PNW  DSGA NH TND  NL  + +Y    +LTVG+  KL IAHV            KE
Subjt:  VVVCYHRFEEEF--NNPIRGNNIQVNNNTTAFIATPEIVTYPNWSEDSGAINHVTNDFGNLQFKTKYYDTGSLTVGDGTKLTIAHV------------KE

Query:  ITKVVPQGTLKNGLYQLSL
          +++ +G L  GLYQL L
Subjt:  ITKVVPQGTLKNGLYQLSL

A0A6J1DTZ7 uncharacterized protein LOC111024384 isoform X2 (e value: 2.6e-43, identity: 39.71%)
Query:  PLVASEVVNLIMSRELWLALEELYGATSKSPYKSIGIILQNTRKGTMRMTEYLSMMKQTLENMQLAGSPISHEDLFSYVLVGLDVEYIPIVCDIEAGKLA
        P +A +VV+   SRE+W ALE+LYGATSK+    +  +LQNT+K +++M+EYL +MKQ  E+++LAG P++   L S VL GL+ EY+PIVC IE     
Subjt:  PLVASEVVNLIMSRELWLALEELYGATSKSPYKSIGIILQNTRKGTMRMTEYLSMMKQTLENMQLAGSPISHEDLFSYVLVGLDVEYIPIVCDIEAGKLA

Query:  MLR--MSNNIVFQD--IALEIITTILPLGFN-----------------------NNQSQGRGNNVNGDTLQN--RQGGGRFNRHRDNNCKPTCALFGKFS
          +   +  + F++  + L I++T    G +                       + Q QGRG+  + D   N   +G GRF+ +R NN KP+C L GK+ 
Subjt:  MLR--MSNNIVFQD--IALEIITTILPLGFN-----------------------NNQSQGRGNNVNGDTLQN--RQGGGRFNRHRDNNCKPTCALFGKFS

Query:  HGVVVCYHRFEEEFNNPIRGNNIQVNNNTTAFIATPEIVTYPNWSEDSGAINHVTNDFGNLQFKTKYYDTGS
        H   VCY RF+E FNN    N    NN  +A++A PEIV  P+W  DSGA +HVT+D  NL  K+ Y   G+
Subjt:  HGVVVCYHRFEEEFNNPIRGNNIQVNNNTTAFIATPEIVTYPNWSEDSGAINHVTNDFGNLQFKTKYYDTGS

A0A6J1DU77 uncharacterized protein LOC111024384 isoform X1 (e value: 3.4e-43, identity: 39.85%)
Query:  PLVASEVVNLIMSRELWLALEELYGATSKSPYKSIGIILQNTRKGTMRMTEYLSMMKQTLENMQLAGSPISHEDLFSYVLVGLDVEYIPIVCDIEAGKLA
        P +A +VV+   SRE+W ALE+LYGATSK+    +  +LQNT+K +++M+EYL +MKQ  E+++LAG P++   L S VL GL+ EY+PIVC IE     
Subjt:  PLVASEVVNLIMSRELWLALEELYGATSKSPYKSIGIILQNTRKGTMRMTEYLSMMKQTLENMQLAGSPISHEDLFSYVLVGLDVEYIPIVCDIEAGKLA

Query:  MLR--MSNNIVFQD--IALEIITTILPLGFN-----------------------NNQSQGRGNNVNGDTLQN--RQGGGRFNRHRDNNCKPTCALFGKFS
          +   +  + F++  + L I++T    G +                       + Q QGRG+  + D   N   +G GRF+ +R NN KP+C L GK+ 
Subjt:  MLR--MSNNIVFQD--IALEIITTILPLGFN-----------------------NNQSQGRGNNVNGDTLQN--RQGGGRFNRHRDNNCKPTCALFGKFS

Query:  HGVVVCYHRFEEEFNNPIRGNNIQVNNNTTAFIATPEIVTYPNWSEDSGAINHVTNDFGNLQFKTKYYDTG
        H   VCY RF+E FNN    N    NN  +A++A PEIV  P+W  DSGA +HVT+D  NL  K+ Y   G
Subjt:  HGVVVCYHRFEEEFNNPIRGNNIQVNNNTTAFIATPEIVTYPNWSEDSGAINHVTNDFGNLQFKTKYYDTG

SwissProt top hits
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 1.4e-09, identity: 23.08%)
Query:  ELWLALEELYGATSKSPYKSIGIILQNTRKGTMRMTEYLSMMKQTLENMQLAGSPISHEDLFSYVLVGLDVEYIPIVCDIEAGKL--AMLRMSNNIVFQD
        ++W  L ++Y   S      +   L+   KGT  + +Y+  +    + + L G P+ H++    VL  L  EY P++  I A      +  +   ++  +
Subjt:  ELWLALEELYGATSKSPYKSIGIILQNTRKGTMRMTEYLSMMKQTLENMQLAGSPISHEDLFSYVLVGLDVEYIPIVCDIEAGKL--AMLRMSNNIVFQD

Query:  IALEIIT--TILPLGFN--NNQSQGRGNNVNGDTLQNRQGGGRFNRHRDNNCKP--------------------TCALFGKFSHGVVVCYHRFEEEFNNP
          +  ++  T++P+  N  ++++    NN N     NR      NR+ +NN KP                     C + G   H    C     + F + 
Subjt:  IALEIIT--TILPLGFN--NNQSQGRGNNVNGDTLQNRQGGGRFNRHRDNNCKP--------------------TCALFGKFSHGVVVCYHRFEEEFNNP

Query:  IRGNNIQVNNNTT-----AFIATPEIVTYPNWSEDSGAINHVTNDFGNLQFKTKYYDTGSLTVGDGTKLTIAH
        +  N+ Q  +  T     A +A     +  NW  DSGA +H+T+DF NL     Y     + V DG+ + I+H
Subjt:  IRGNNIQVNNNTT-----AFIATPEIVTYPNWSEDSGAINHVTNDFGNLQFKTKYYDTGSLTVGDGTKLTIAH

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 7.2e-06, identity: 21.43%)
Query:  TSKSPYKSIGIILQNTRKGTMRMTEYLSMMKQTLENMQLAGSPISHEDLFSYVLVGLDVEYIPIVCDIEAGKL--AMLRMSNNIVFQDIALEIITT--IL
        T+   ++++  I  N   G +    +++   Q    + L G P+ H++    VL  L  +Y P++  I A     ++  +   ++ ++  L  + +  ++
Subjt:  TSKSPYKSIGIILQNTRKGTMRMTEYLSMMKQTLENMQLAGSPISHEDLFSYVLVGLDVEYIPIVCDIEAGKL--AMLRMSNNIVFQDIALEIITT--IL

Query:  PLGFN---------NNQSQGRGNNVNGDTLQNR-------QGGGRFNRHRDNNCKPTCALFGKFSHGVVVC--YHRFEEEFNNPIRGNNIQVNNNTTAFI
        P+  N         N     RG+N N +   NR         G R +  +       C +     H    C   H+F+   N   +  +        A +
Subjt:  PLGFN---------NNQSQGRGNNVNGDTLQNR-------QGGGRFNRHRDNNCKPTCALFGKFSHGVVVC--YHRFEEEFNNPIRGNNIQVNNNTTAFI

Query:  ATPEIVTYPNWSEDSGAINHVTNDFGNLQFKTKYYDTGSLTVGDGTKLTIAH
        A        NW  DSGA +H+T+DF NL F   Y     + + DG+ + I H
Subjt:  ATPEIVTYPNWSEDSGAINHVTNDFGNLQFKTKYYDTGSLTVGDGTKLTIAH

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGAAAAGGTATAGTATAAGATTTGTAAATATGCGGAGTTTTGTGGTGTTTTTAGTTTCCTCCTTAGTTTCCCTATCCATGGCCATGGAAGCTCCCACGAATATTTTGAT
ACACCCTTCCCTTTCTGCACCAATTTCACCTTCTCACCAAACTCTACTTACCTCTCTTCACCTCCTAAATGCTTCCGCCTTTGATCCTCCAACTCCAAATACTTTTCCTC
CACCTCCTCTGGATGCGCCTCCTCCAGCCCAATCTTCGACGAAATCGAACCCATCTCCCTCACCTTCTTCCTCGCTCTCCTCTCCTTCCACGCCACTACCACCCCCTACC
TCGCGTCGTTCACCAATTTCAGTGGGAAACAGATCCGCACCAGTGCAAATTGTGGCGATTGTGATTGCAACTACAATTATTTTCATAGCTGCACTTGTTGGGGGATTTTG
TTACCTACGCCGACGAGTGTTACAAAATCCCCCGCTTGTGGCATCAGAGGTCGTTAATCTGATAATGTCAAGAGAGTTATGGTTGGCCTTAGAGGAACTCTATGGTGCAA
CTAGCAAGAGTCCATATAAATCAATTGGGATAATCCTTCAAAATACTAGAAAAGGAACAATGAGAATGACAGAATATCTATCCATGATGAAACAAACATTGGAAAATATG
CAACTAGCAGGATCACCAATTTCTCATGAAGATTTATTCTCTTATGTCTTAGTCGGCCTTGACGTCGAATATATTCCAATTGTGTGTGATATAGAAGCAGGCAAACTAGC
AATGTTAAGAATGAGCAACAACATCGTTTTCCAGGATATAGCTCTTGAAATCATAACAACAATCCTACCTTTAGGCTTTAACAATAATCAATCTCAAGGAAGAGGTAACA
ATGTTAATGGTGACACACTTCAAAATAGACAAGGCGGAGGTAGGTTCAACCGTCATAGAGATAATAATTGTAAACCCACTTGTGCACTCTTTGGAAAATTTAGTCATGGA
GTTGTTGTTTGCTATCATCGTTTTGAAGAAGAATTTAATAATCCAATTAGAGGAAACAACATTCAGGTGAACAATAATACAACAGCCTTTATTGCTACTCCTGAAATTGT
GACATATCCAAATTGGTCGGAAGATAGTGGAGCCATAAACCATGTCACCAACGATTTTGGTAATCTCCAATTCAAAACTAAATACTATGATACTGGGTCTTTAACTGTTG
GTGATGGAACAAAATTGACTATAGCACATGTCAAGGAAATCACAAAGGTGGTGCCGCAAGGAACACTTAAAAATGGACTCTACCAACTATCTCTCCCTCATCAATCTCCT
CCTAATGGCAAAGTTCAAATCTCCTTTACTTCTCCATCTCCTCTCCAAAAGTGA
mRNA sequence
ATGAAAAGGTATAGTATAAGATTTGTAAATATGCGGAGTTTTGTGGTGTTTTTAGTTTCCTCCTTAGTTTCCCTATCCATGGCCATGGAAGCTCCCACGAATATTTTGAT
ACACCCTTCCCTTTCTGCACCAATTTCACCTTCTCACCAAACTCTACTTACCTCTCTTCACCTCCTAAATGCTTCCGCCTTTGATCCTCCAACTCCAAATACTTTTCCTC
CACCTCCTCTGGATGCGCCTCCTCCAGCCCAATCTTCGACGAAATCGAACCCATCTCCCTCACCTTCTTCCTCGCTCTCCTCTCCTTCCACGCCACTACCACCCCCTACC
TCGCGTCGTTCACCAATTTCAGTGGGAAACAGATCCGCACCAGTGCAAATTGTGGCGATTGTGATTGCAACTACAATTATTTTCATAGCTGCACTTGTTGGGGGATTTTG
TTACCTACGCCGACGAGTGTTACAAAATCCCCCGCTTGTGGCATCAGAGGTCGTTAATCTGATAATGTCAAGAGAGTTATGGTTGGCCTTAGAGGAACTCTATGGTGCAA
CTAGCAAGAGTCCATATAAATCAATTGGGATAATCCTTCAAAATACTAGAAAAGGAACAATGAGAATGACAGAATATCTATCCATGATGAAACAAACATTGGAAAATATG
CAACTAGCAGGATCACCAATTTCTCATGAAGATTTATTCTCTTATGTCTTAGTCGGCCTTGACGTCGAATATATTCCAATTGTGTGTGATATAGAAGCAGGCAAACTAGC
AATGTTAAGAATGAGCAACAACATCGTTTTCCAGGATATAGCTCTTGAAATCATAACAACAATCCTACCTTTAGGCTTTAACAATAATCAATCTCAAGGAAGAGGTAACA
ATGTTAATGGTGACACACTTCAAAATAGACAAGGCGGAGGTAGGTTCAACCGTCATAGAGATAATAATTGTAAACCCACTTGTGCACTCTTTGGAAAATTTAGTCATGGA
GTTGTTGTTTGCTATCATCGTTTTGAAGAAGAATTTAATAATCCAATTAGAGGAAACAACATTCAGGTGAACAATAATACAACAGCCTTTATTGCTACTCCTGAAATTGT
GACATATCCAAATTGGTCGGAAGATAGTGGAGCCATAAACCATGTCACCAACGATTTTGGTAATCTCCAATTCAAAACTAAATACTATGATACTGGGTCTTTAACTGTTG
GTGATGGAACAAAATTGACTATAGCACATGTCAAGGAAATCACAAAGGTGGTGCCGCAAGGAACACTTAAAAATGGACTCTACCAACTATCTCTCCCTCATCAATCTCCT
CCTAATGGCAAAGTTCAAATCTCCTTTACTTCTCCATCTCCTCTCCAAAAGTGA
Protein sequence
MKRYSIRFVNMRSFVVFLVSSLVSLSMAMEAPTNILIHPSLSAPISPSHQTLLTSLHLLNASAFDPPTPNTFPPPPLDAPPPAQSSTKSNPSPSPSSSLSSPSTPLPPPT
SRRSPISVGNRSAPVQIVAIVIATTIIFIAALVGGFCYLRRRVLQNPPLVASEVVNLIMSRELWLALEELYGATSKSPYKSIGIILQNTRKGTMRMTEYLSMMKQTLENM
QLAGSPISHEDLFSYVLVGLDVEYIPIVCDIEAGKLAMLRMSNNIVFQDIALEIITTILPLGFNNNQSQGRGNNVNGDTLQNRQGGGRFNRHRDNNCKPTCALFGKFSHG
VVVCYHRFEEEFNNPIRGNNIQVNNNTTAFIATPEIVTYPNWSEDSGAINHVTNDFGNLQFKTKYYDTGSLTVGDGTKLTIAHVKEITKVVPQGTLKNGLYQLSLPHQSP
PNGKVQISFTSPSPLQK
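
The CDS and protein sequences above can be checked against each other by translating the CDS. The sketch below is illustrative only and is not part of the database record: it assumes the standard genetic code, the helper name `translate` is hypothetical, and only the first and last few codons of the CDS are used as inputs.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
# Assumption: the standard nuclear code applies to this gene.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    # Raises KeyError on any codon containing a non-ACGT symbol.
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First ten codons of the CDS listed above.
print(translate("ATGAAAAGGTATAGTATAAGATTTGTAAAT"))
# Final line of the CDS listed above, including the stop codon.
print(translate("CCTAATGGCAAAGTTCAAATCTCCTTTACTTCTCCATCTCCTCTCCAAAAGTGA"))
```

The two fragments translate to the start (MKRYSIRFVN) and end (PNGKVQISFTSPSPLQK, followed by the stop) of the listed protein sequence.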