CuGenDBv2

Lag0015842 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0015842
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase
Genome location: chr12:26872548..26883867
RNA-Seq Expression: Lag0015842
Synteny: Lag0015842
Gene Ontology terms:
GO:0006259 - DNA metabolic process (biological process)
GO:0006412 - translation (biological process)
GO:0005840 - ribosome (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0003735 - structural constituent of ribosome (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domains:
IPR002583 - Ribosomal protein S20
IPR036510 - Ribosomal protein S20 superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e value; % identity; alignment)
GAU45471.1 hypothetical protein TSUD_13140 [Trifolium subterraneum] (e value: 2.0e-118; % identity: 44.64)
Query:  GFVPNDLQG----RWTFRIPTHQEFK-----ERKVNSNIQ--CKKEVNKWLDAGIIYPIVDSNWVSPIQCVPKKGGITVVSNKDNKLIPTRTVTGWRVCM
        G+  +DL+G        +I    E+K     +R++N  ++   KKEV K L+AG+IYPI DS WVSP+  VPKKGG+TV+ N  N+LIPTRTVTGWR+C+
Subjt:  GFVPNDLQG----RWTFRIPTHQEFK-----ERKVNSNIQ--CKKEVNKWLDAGIIYPIVDSNWVSPIQCVPKKGGITVVSNKDNKLIPTRTVTGWRVCM

Query:  DYRRLNKANRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIALEDQEKTTFTCPYGTFALR----------------------------------
        DYRRLNKA RKDH+PLP +DQML+RLAGQ YYCFLDGYSGYNQI +   DQEKT FTCP+G FA R                                  
Subjt:  DYRRLNKANRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIALEDQEKTTFTCPYGTFALR----------------------------------

Query:  --------------------------------------------------------------------------------------------------VL
                                                                                                          VL
Subjt:  --------------------------------------------------------------------------------------------------VL

Query:  NEAQVNYTTTEKELLAVVFAFEKFGSYFVGSKVTVFTDHAAIRYLMAKKDAKPRLIRWVLLLQKFDLEIKDKKGSENVIADHLSRLDPSSSLLKQSAISD
        NE QVNYTTTEKELLA+VFA EKF SY +GSKV VFTDHAA+R+L+ K ++KPRL+RW+LLLQ+FD+EIKDKKG ENV+ADHLSRL+      K+  IS+
Subjt:  NEAQVNYTTTEKELLAVVFAFEKFGSYFVGSKVTVFTDHAAIRYLMAKKDAKPRLIRWVLLLQKFDLEIKDKKGSENVIADHLSRLDPSSSLLKQSAISD

Query:  AFPDEQLFAIEVKVVRGVPWYADIANFLVKGVTPIDMDWRHKKKFKHDAKFFYWDEPFIYKQCSNGIIRRCVSSDEAKEILEQCHSSPYGGHFSGQRTAM
        AFPDE L AI  +     PW+AD+ N+ V G  P D+    +KKF HD+KF++WD PF++K+  +GIIRRCV + E+ +I+  CH+SP G H SG RTA 
Subjt:  AFPDEQLFAIEVKVVRGVPWYADIANFLVKGVTPIDMDWRHKKKFKHDAKFFYWDEPFIYKQCSNGIIRRCVSSDEAKEILEQCHSSPYGGHFSGQRTAM

Query:  RILHCGFFWPTLFKDAHEWNKS
        ++LH GFFWPTLFKD  ++ K+
Subjt:  RILHCGFFWPTLFKDAHEWNKS

PIM97577.1 DNA-directed DNA polymerase [Handroanthus impetiginosus] (e value: 1.4e-119; % identity: 44.65)
Query:  KKEVNKWLDAGIIYPIVDSNWVSPIQCVPKKGGITVVSNKDNKLIPTRTVTGWRVCMDYRRLNKANRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQ
        KKE+ KWLDAGIIYPI DS+WVSP+QCVPKKGGITVV N  N+LIPTRTVTGWRVCMDYR+LNKA RKDHFPL FIDQMLDRLAG+ +YCFLDGYSGYNQ
Subjt:  KKEVNKWLDAGIIYPIVDSNWVSPIQCVPKKGGITVVSNKDNKLIPTRTVTGWRVCMDYRRLNKANRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQ

Query:  ITIALEDQEKTTFTCPYGTFALR-----------------------------------------------------------------------------
        I IA EDQEK TFTCPYGTFA R                                                                             
Subjt:  ITIALEDQEKTTFTCPYGTFALR-----------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------VLNEAQVNYTTTEKELLAVVFAFEKFGSYFVGSKVTVFTDHAAIRYLMAKKDAKPRLIRWVLLLQKFDLE
                                       LN+AQ+NYTTTEKELLAVVFAF+KF SY VG+KV V+TDHAAIRYL+ KKDAKPRLIRWVLLLQ+FDLE
Subjt:  ------------------------------VLNEAQVNYTTTEKELLAVVFAFEKFGSYFVGSKVTVFTDHAAIRYLMAKKDAKPRLIRWVLLLQKFDLE

Query:  IKDKKGSENVIADHLSRLDPSSSLLKQSAISDAFPDEQLFAIEVKVVRGVPWYADIANFLVKGVTPIDMDWRHKKKFKHDAKFFYWDEPFIYKQCSNGII
        I+D+KG+EN IADHLSRL+  +   + + I+D FPDEQL AI   V   VPWYADI N+L  G+ P D+  + KKKF  D + ++WD+PF++KQ  + I+
Subjt:  IKDKKGSENVIADHLSRLDPSSSLLKQSAISDAFPDEQLFAIEVKVVRGVPWYADIANFLVKGVTPIDMDWRHKKKFKHDAKFFYWDEPFIYKQCSNGII

Query:  RRCVSSDEAKEILEQCHSSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHEW
        RRCV   E  +ILEQCH+SPYGGHF G RTA +IL  GFFWP LFKDAH +
Subjt:  RRCVSSDEAKEILEQCHSSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHEW

PIN05661.1 DNA-directed DNA polymerase [Handroanthus impetiginosus] (e value: 1.5e-121; % identity: 56.1)
Query:  KKEVNKWLDAGIIYPIVDSNWVSPIQCVPKKGGITVVSNKDNKLIPTRTVTGWRVCMDYRRLNKANRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQ
        KKE+ KWLDAGIIYPI D              GITVV N  N+LIPTRTVTGWRVCMDYR+LNKA RKDHFPLPFIDQMLDRLAG+ +YCFLDGYSGYNQ
Subjt:  KKEVNKWLDAGIIYPIVDSNWVSPIQCVPKKGGITVVSNKDNKLIPTRTVTGWRVCMDYRRLNKANRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQ

Query:  ITIALEDQEKTTFTCPYGTFALR------------------------------------------------------------------VLNEAQVNYTT
        I IA EDQEK TFTCPYGTFA R                                                                   LN+AQ+NYTT
Subjt:  ITIALEDQEKTTFTCPYGTFALR------------------------------------------------------------------VLNEAQVNYTT

Query:  TEKELLAVVFAFEKFGSYFVGSKVTVFTDHAAIRYLMAKKDAKPRLIRWVLLLQKFDLEIKDKKGSENVIADHLSRLDPSSSLLKQSAISDAFPDEQLFA
        TEKELLAVVFAF+KF SY VG+KV V+TDHA IRYL+ KKDAKPRLIRWVLLLQ+FDLEI+D+KG+EN IADHLSRL+  +   + + I+D FPD+QL A
Subjt:  TEKELLAVVFAFEKFGSYFVGSKVTVFTDHAAIRYLMAKKDAKPRLIRWVLLLQKFDLEIKDKKGSENVIADHLSRLDPSSSLLKQSAISDAFPDEQLFA

Query:  IEVKVVRGVPWYADIANFLVKGVTPIDMDWRHKKKFKHDAKFFYWDEPFIYKQCSNGIIRRCVSSDEAKEILEQCHSSPYGGHFSGQRTAMRILHCGFFW
        I   V   VPWYADI N+L  G+ P D+  + KKKF  D + ++ D+PF++KQ  + I+R CV   E  +ILEQCH+SPYGGHF G RTA +IL   FFW
Subjt:  IEVKVVRGVPWYADIANFLVKGVTPIDMDWRHKKKFKHDAKFFYWDEPFIYKQCSNGIIRRCVSSDEAKEILEQCHSSPYGGHFSGQRTAMRILHCGFFW

Query:  PTLFKDAHEW
        P LFKDAH +
Subjt:  PTLFKDAHEW

PIN22487.1 DNA-directed DNA polymerase [Handroanthus impetiginosus] (e value: 1.5e-121; % identity: 45.01)
Query:  KKEVNKWLDAGIIYPIVDSNWVSPIQCVPKKGGITVVSNKDNKLIPTRTVTGWRVCMDYRRLNKANRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQ
        KKE+ KWLDAGIIYPI DS+WVSP+QCVPKKGGITVV N  N+LIPTRTVTGWRVCMDYR+LNKA RKDHFPLPFIDQMLDRLAG+ +YCFLDGYSGYNQ
Subjt:  KKEVNKWLDAGIIYPIVDSNWVSPIQCVPKKGGITVVSNKDNKLIPTRTVTGWRVCMDYRRLNKANRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQ

Query:  ITIALEDQEKTTFTCPYGTFALR-----------------------------------------------------------------------------
        I IA EDQEKTTFTCPYGTFA R                                                                             
Subjt:  ITIALEDQEKTTFTCPYGTFALR-----------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------VLNEAQVNYTTTEKELLAVVFAFEKFGSYFVGSKVTVFTDHAAIRYLMAKKDAKPRLIRWVLLLQKFDLE
                                       LN+AQ+NYTTTEKELLAVVFAF+KF SY VG+KV V+TDHAAIRYL+ KKDAKPRLIRWVLLLQ+FDLE
Subjt:  ------------------------------VLNEAQVNYTTTEKELLAVVFAFEKFGSYFVGSKVTVFTDHAAIRYLMAKKDAKPRLIRWVLLLQKFDLE

Query:  IKDKKGSENVIADHLSRLDPSSSLLKQSAISDAFPDEQLFAIEVKVVRGVPWYADIANFLVKGVTPIDMDWRHKKKFKHDAKFFYWDEPFIYKQCSNGII
        I+D+KG+EN IADHLSRL+  +   + + I+D FPDEQL AI   V   VPWYADI N+L  G+ P D+  + KKKF  D + ++WD+PF++KQ  + I+
Subjt:  IKDKKGSENVIADHLSRLDPSSSLLKQSAISDAFPDEQLFAIEVKVVRGVPWYADIANFLVKGVTPIDMDWRHKKKFKHDAKFFYWDEPFIYKQCSNGII

Query:  RRCVSSDEAKEILEQCHSSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHEW
        RRCV   E  +ILEQCH+SPYGGHF G RTA +IL  GFFWP LFKDAH +
Subjt:  RRCVSSDEAKEILEQCHSSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHEW

PIN22518.1 DNA-directed DNA polymerase [Handroanthus impetiginosus] (e value: 2.9e-122; % identity: 44.58)
Query:  ERKVNSNIQ--CKKEVNKWLDAGIIYPIVDSNWVSPIQCVPKKGGITVVSNKDNKLIPTRTVTGWRVCMDYRRLNKANRKDHFPLPFIDQMLDRLAGQAY
        +R++NS ++   KKE+ KWLDAGIIYPI DS+WVSP+QCVPKKGGITVV N  N+LIPTRTVTGWRVCMDYR+LNKA RKDHFPLPFIDQMLDRLAG+ +
Subjt:  ERKVNSNIQ--CKKEVNKWLDAGIIYPIVDSNWVSPIQCVPKKGGITVVSNKDNKLIPTRTVTGWRVCMDYRRLNKANRKDHFPLPFIDQMLDRLAGQAY

Query:  YCFLDGYSGYNQITIALEDQEKTTFTCPYGTFALR-----------------------------------------------------------------
        YCFLDGYSGYNQI IA EDQEKTTFTCPYGTFA R                                                                 
Subjt:  YCFLDGYSGYNQITIALEDQEKTTFTCPYGTFALR-----------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------VLNEAQVNYTTTEKELLAVVFAFEKFGSYFVGSKVTVFTDHAAIRYLMAKKDAKPRLI
                                                   LN+AQ+NYTTTEKELLAVVFAF+KF SY VG+KV V+TDHAAIRYL+ KKDAKPRLI
Subjt:  ------------------------------------------VLNEAQVNYTTTEKELLAVVFAFEKFGSYFVGSKVTVFTDHAAIRYLMAKKDAKPRLI

Query:  RWVLLLQKFDLEIKDKKGSENVIADHLSRLDPSSSLLKQSAISDAFPDEQLFAIEVKVVRGVPWYADIANFLVKGVTPIDMDWRHKKKFKHDAKFFYWDE
        RWVLLLQ+FDLEI+D+KG+EN IADHLSRL+  +   + + I+D FPDEQL AI   V   VPWYADI N+L  G+ P D+  + KKKF  D + ++WD+
Subjt:  RWVLLLQKFDLEIKDKKGSENVIADHLSRLDPSSSLLKQSAISDAFPDEQLFAIEVKVVRGVPWYADIANFLVKGVTPIDMDWRHKKKFKHDAKFFYWDE

Query:  PFIYKQCSNGIIRRCVSSDEAKEILEQCHSSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHEW
        PF++KQ  + I+RRCV   E  +ILEQCH+SPYGGHF G RTA +IL  GFFWP LFKDAH +
Subjt:  PFIYKQCSNGIIRRCVSSDEAKEILEQCHSSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHEW

TrEMBL top hits (e value; % identity; alignment)
A0A2G9FWY3 Reverse transcriptase (e value: 6.6e-120; % identity: 44.65)
Query:  KKEVNKWLDAGIIYPIVDSNWVSPIQCVPKKGGITVVSNKDNKLIPTRTVTGWRVCMDYRRLNKANRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQ
        KKE+ KWLDAGIIYPI DS+WVSP+QCVPKKGGITVV N  N+LIPTRTVTGWRVCMDYR+LNKA RKDHFPL FIDQMLDRLAG+ +YCFLDGYSGYNQ
Subjt:  KKEVNKWLDAGIIYPIVDSNWVSPIQCVPKKGGITVVSNKDNKLIPTRTVTGWRVCMDYRRLNKANRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQ

Query:  ITIALEDQEKTTFTCPYGTFALR-----------------------------------------------------------------------------
        I IA EDQEK TFTCPYGTFA R                                                                             
Subjt:  ITIALEDQEKTTFTCPYGTFALR-----------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------VLNEAQVNYTTTEKELLAVVFAFEKFGSYFVGSKVTVFTDHAAIRYLMAKKDAKPRLIRWVLLLQKFDLE
                                       LN+AQ+NYTTTEKELLAVVFAF+KF SY VG+KV V+TDHAAIRYL+ KKDAKPRLIRWVLLLQ+FDLE
Subjt:  ------------------------------VLNEAQVNYTTTEKELLAVVFAFEKFGSYFVGSKVTVFTDHAAIRYLMAKKDAKPRLIRWVLLLQKFDLE

Query:  IKDKKGSENVIADHLSRLDPSSSLLKQSAISDAFPDEQLFAIEVKVVRGVPWYADIANFLVKGVTPIDMDWRHKKKFKHDAKFFYWDEPFIYKQCSNGII
        I+D+KG+EN IADHLSRL+  +   + + I+D FPDEQL AI   V   VPWYADI N+L  G+ P D+  + KKKF  D + ++WD+PF++KQ  + I+
Subjt:  IKDKKGSENVIADHLSRLDPSSSLLKQSAISDAFPDEQLFAIEVKVVRGVPWYADIANFLVKGVTPIDMDWRHKKKFKHDAKFFYWDEPFIYKQCSNGII

Query:  RRCVSSDEAKEILEQCHSSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHEW
        RRCV   E  +ILEQCH+SPYGGHF G RTA +IL  GFFWP LFKDAH +
Subjt:  RRCVSSDEAKEILEQCHSSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHEW

A0A2G9GK35 Reverse transcriptase (e value: 7.0e-122; % identity: 56.1)
Query:  KKEVNKWLDAGIIYPIVDSNWVSPIQCVPKKGGITVVSNKDNKLIPTRTVTGWRVCMDYRRLNKANRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQ
        KKE+ KWLDAGIIYPI D              GITVV N  N+LIPTRTVTGWRVCMDYR+LNKA RKDHFPLPFIDQMLDRLAG+ +YCFLDGYSGYNQ
Subjt:  KKEVNKWLDAGIIYPIVDSNWVSPIQCVPKKGGITVVSNKDNKLIPTRTVTGWRVCMDYRRLNKANRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQ

Query:  ITIALEDQEKTTFTCPYGTFALR------------------------------------------------------------------VLNEAQVNYTT
        I IA EDQEK TFTCPYGTFA R                                                                   LN+AQ+NYTT
Subjt:  ITIALEDQEKTTFTCPYGTFALR------------------------------------------------------------------VLNEAQVNYTT

Query:  TEKELLAVVFAFEKFGSYFVGSKVTVFTDHAAIRYLMAKKDAKPRLIRWVLLLQKFDLEIKDKKGSENVIADHLSRLDPSSSLLKQSAISDAFPDEQLFA
        TEKELLAVVFAF+KF SY VG+KV V+TDHA IRYL+ KKDAKPRLIRWVLLLQ+FDLEI+D+KG+EN IADHLSRL+  +   + + I+D FPD+QL A
Subjt:  TEKELLAVVFAFEKFGSYFVGSKVTVFTDHAAIRYLMAKKDAKPRLIRWVLLLQKFDLEIKDKKGSENVIADHLSRLDPSSSLLKQSAISDAFPDEQLFA

Query:  IEVKVVRGVPWYADIANFLVKGVTPIDMDWRHKKKFKHDAKFFYWDEPFIYKQCSNGIIRRCVSSDEAKEILEQCHSSPYGGHFSGQRTAMRILHCGFFW
        I   V   VPWYADI N+L  G+ P D+  + KKKF  D + ++ D+PF++KQ  + I+R CV   E  +ILEQCH+SPYGGHF G RTA +IL   FFW
Subjt:  IEVKVVRGVPWYADIANFLVKGVTPIDMDWRHKKKFKHDAKFFYWDEPFIYKQCSNGIIRRCVSSDEAKEILEQCHSSPYGGHFSGQRTAMRILHCGFFW

Query:  PTLFKDAHEW
        P LFKDAH +
Subjt:  PTLFKDAHEW

A0A2G9HYA0 Reverse transcriptase (e value: 7.0e-122; % identity: 45.01)
Query:  KKEVNKWLDAGIIYPIVDSNWVSPIQCVPKKGGITVVSNKDNKLIPTRTVTGWRVCMDYRRLNKANRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQ
        KKE+ KWLDAGIIYPI DS+WVSP+QCVPKKGGITVV N  N+LIPTRTVTGWRVCMDYR+LNKA RKDHFPLPFIDQMLDRLAG+ +YCFLDGYSGYNQ
Subjt:  KKEVNKWLDAGIIYPIVDSNWVSPIQCVPKKGGITVVSNKDNKLIPTRTVTGWRVCMDYRRLNKANRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQ

Query:  ITIALEDQEKTTFTCPYGTFALR-----------------------------------------------------------------------------
        I IA EDQEKTTFTCPYGTFA R                                                                             
Subjt:  ITIALEDQEKTTFTCPYGTFALR-----------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------VLNEAQVNYTTTEKELLAVVFAFEKFGSYFVGSKVTVFTDHAAIRYLMAKKDAKPRLIRWVLLLQKFDLE
                                       LN+AQ+NYTTTEKELLAVVFAF+KF SY VG+KV V+TDHAAIRYL+ KKDAKPRLIRWVLLLQ+FDLE
Subjt:  ------------------------------VLNEAQVNYTTTEKELLAVVFAFEKFGSYFVGSKVTVFTDHAAIRYLMAKKDAKPRLIRWVLLLQKFDLE

Query:  IKDKKGSENVIADHLSRLDPSSSLLKQSAISDAFPDEQLFAIEVKVVRGVPWYADIANFLVKGVTPIDMDWRHKKKFKHDAKFFYWDEPFIYKQCSNGII
        I+D+KG+EN IADHLSRL+  +   + + I+D FPDEQL AI   V   VPWYADI N+L  G+ P D+  + KKKF  D + ++WD+PF++KQ  + I+
Subjt:  IKDKKGSENVIADHLSRLDPSSSLLKQSAISDAFPDEQLFAIEVKVVRGVPWYADIANFLVKGVTPIDMDWRHKKKFKHDAKFFYWDEPFIYKQCSNGII

Query:  RRCVSSDEAKEILEQCHSSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHEW
        RRCV   E  +ILEQCH+SPYGGHF G RTA +IL  GFFWP LFKDAH +
Subjt:  RRCVSSDEAKEILEQCHSSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHEW

A0A2G9HYD8 Reverse transcriptase (e value: 1.4e-122; % identity: 44.58)
Query:  ERKVNSNIQ--CKKEVNKWLDAGIIYPIVDSNWVSPIQCVPKKGGITVVSNKDNKLIPTRTVTGWRVCMDYRRLNKANRKDHFPLPFIDQMLDRLAGQAY
        +R++NS ++   KKE+ KWLDAGIIYPI DS+WVSP+QCVPKKGGITVV N  N+LIPTRTVTGWRVCMDYR+LNKA RKDHFPLPFIDQMLDRLAG+ +
Subjt:  ERKVNSNIQ--CKKEVNKWLDAGIIYPIVDSNWVSPIQCVPKKGGITVVSNKDNKLIPTRTVTGWRVCMDYRRLNKANRKDHFPLPFIDQMLDRLAGQAY

Query:  YCFLDGYSGYNQITIALEDQEKTTFTCPYGTFALR-----------------------------------------------------------------
        YCFLDGYSGYNQI IA EDQEKTTFTCPYGTFA R                                                                 
Subjt:  YCFLDGYSGYNQITIALEDQEKTTFTCPYGTFALR-----------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------VLNEAQVNYTTTEKELLAVVFAFEKFGSYFVGSKVTVFTDHAAIRYLMAKKDAKPRLI
                                                   LN+AQ+NYTTTEKELLAVVFAF+KF SY VG+KV V+TDHAAIRYL+ KKDAKPRLI
Subjt:  ------------------------------------------VLNEAQVNYTTTEKELLAVVFAFEKFGSYFVGSKVTVFTDHAAIRYLMAKKDAKPRLI

Query:  RWVLLLQKFDLEIKDKKGSENVIADHLSRLDPSSSLLKQSAISDAFPDEQLFAIEVKVVRGVPWYADIANFLVKGVTPIDMDWRHKKKFKHDAKFFYWDE
        RWVLLLQ+FDLEI+D+KG+EN IADHLSRL+  +   + + I+D FPDEQL AI   V   VPWYADI N+L  G+ P D+  + KKKF  D + ++WD+
Subjt:  RWVLLLQKFDLEIKDKKGSENVIADHLSRLDPSSSLLKQSAISDAFPDEQLFAIEVKVVRGVPWYADIANFLVKGVTPIDMDWRHKKKFKHDAKFFYWDE

Query:  PFIYKQCSNGIIRRCVSSDEAKEILEQCHSSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHEW
        PF++KQ  + I+RRCV   E  +ILEQCH+SPYGGHF G RTA +IL  GFFWP LFKDAH +
Subjt:  PFIYKQCSNGIIRRCVSSDEAKEILEQCHSSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHEW

A0A2Z6NN66 Reverse transcriptase (e value: 9.5e-119; % identity: 44.64)
Query:  GFVPNDLQG----RWTFRIPTHQEFK-----ERKVNSNIQ--CKKEVNKWLDAGIIYPIVDSNWVSPIQCVPKKGGITVVSNKDNKLIPTRTVTGWRVCM
        G+  +DL+G        +I    E+K     +R++N  ++   KKEV K L+AG+IYPI DS WVSP+  VPKKGG+TV+ N  N+LIPTRTVTGWR+C+
Subjt:  GFVPNDLQG----RWTFRIPTHQEFK-----ERKVNSNIQ--CKKEVNKWLDAGIIYPIVDSNWVSPIQCVPKKGGITVVSNKDNKLIPTRTVTGWRVCM

Query:  DYRRLNKANRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIALEDQEKTTFTCPYGTFALR----------------------------------
        DYRRLNKA RKDH+PLP +DQML+RLAGQ YYCFLDGYSGYNQI +   DQEKT FTCP+G FA R                                  
Subjt:  DYRRLNKANRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIALEDQEKTTFTCPYGTFALR----------------------------------

Query:  --------------------------------------------------------------------------------------------------VL
                                                                                                          VL
Subjt:  --------------------------------------------------------------------------------------------------VL

Query:  NEAQVNYTTTEKELLAVVFAFEKFGSYFVGSKVTVFTDHAAIRYLMAKKDAKPRLIRWVLLLQKFDLEIKDKKGSENVIADHLSRLDPSSSLLKQSAISD
        NE QVNYTTTEKELLA+VFA EKF SY +GSKV VFTDHAA+R+L+ K ++KPRL+RW+LLLQ+FD+EIKDKKG ENV+ADHLSRL+      K+  IS+
Subjt:  NEAQVNYTTTEKELLAVVFAFEKFGSYFVGSKVTVFTDHAAIRYLMAKKDAKPRLIRWVLLLQKFDLEIKDKKGSENVIADHLSRLDPSSSLLKQSAISD

Query:  AFPDEQLFAIEVKVVRGVPWYADIANFLVKGVTPIDMDWRHKKKFKHDAKFFYWDEPFIYKQCSNGIIRRCVSSDEAKEILEQCHSSPYGGHFSGQRTAM
        AFPDE L AI  +     PW+AD+ N+ V G  P D+    +KKF HD+KF++WD PF++K+  +GIIRRCV + E+ +I+  CH+SP G H SG RTA 
Subjt:  AFPDEQLFAIEVKVVRGVPWYADIANFLVKGVTPIDMDWRHKKKFKHDAKFFYWDEPFIYKQCSNGIIRRCVSSDEAKEILEQCHSSPYGGHFSGQRTAM

Query:  RILHCGFFWPTLFKDAHEWNKS
        ++LH GFFWPTLFKD  ++ K+
Subjt:  RILHCGFFWPTLFKDAHEWNKS

SwissProt top hits (e value; % identity; alignment)
P04323 Retrovirus-related Pol polyprotein from transposon 17.6 (e value: 1.6e-14; % identity: 38.74)
Query:  TFALRVLNEAQVNYTTTEKELLAVVFAFEKFGSYFVGSKVTVFTDHAAIRYLMAKKDAKPRLIRWVLLLQKFDLEIKDKKGSENVIADHLSRLDPSSSLL
        ++  R LNE ++NY+T EKELLA+V+A + F  Y +G    + +DH  + +L   KD   +L RW + L +FD +IK  KG EN +AD LSR+    + L
Subjt:  TFALRVLNEAQVNYTTTEKELLAVVFAFEKFGSYFVGSKVTVFTDHAAIRYLMAKKDAKPRLIRWVLLLQKFDLEIKDKKGSENVIADHLSRLDPSSSLL

Query:  KQSAISDAFPD
         +     A  D
Subjt:  KQSAISDAFPD

P04323 Retrovirus-related Pol polyprotein from transposon 17.6 (e value: 9.5e-07; % identity: 29.51)
Query:  QCKKEVNKWLDAGIIYPIVDSNWVSPIQCVPKKGGITVVSNKDNKLIPTRTVTGWRVCMDYRRLNKANRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGY
        + + ++   L+ GII    +S + SPI  VPKK      S K            +R+ +DYR+LN+    D  P+P +D++L +L    Y+  +D   G+
Subjt:  QCKKEVNKWLDAGIIYPIVDSNWVSPIQCVPKKGGITVVSNKDNKLIPTRTVTGWRVCMDYRRLNKANRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGY

Query:  NQITIALEDQEKTTFTCPYGTF
        +QI +  E   KT F+  +G +
Subjt:  NQITIALEDQEKTTFTCPYGTF

P20825 Retrovirus-related Pol polyprotein from transposon 297 (e value: 3.7e-11; % identity: 38.71)
Query:  TFALRVLNEAQVNYTTTEKELLAVVFAFEKFGSYFVGSKVTVFTDHAAIRYLMAKKDAKPRLIRWVLLLQKFDLEIKDKKGSENVIADHLSRL
        +F  R LN+ ++NY+  EKELLA+V+A + F  Y +G +  + +DH  +R+L   K+   +L RW + L ++  +I   KG EN +AD LSR+
Subjt:  TFALRVLNEAQVNYTTTEKELLAVVFAFEKFGSYFVGSKVTVFTDHAAIRYLMAKKDAKPRLIRWVLLLQKFDLEIKDKKGSENVIADHLSRL

P20825 Retrovirus-related Pol polyprotein from transposon 297 (e value: 3.3e-07; % identity: 30.08)
Query:  IQCKKEVNKWLDAGIIYPIVDSNWVSPIQCVPKKGGITVVSNKDNKLIPTRTVTGWRVCMDYRRLNKANRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSG
        I+ + +V + L+ G+I    +S + SP   VPKK      ++  NK         +RV +DYR+LN+    D +P+P +D++L +L    Y+  +D   G
Subjt:  IQCKKEVNKWLDAGIIYPIVDSNWVSPIQCVPKKGGITVVSNKDNKLIPTRTVTGWRVCMDYRRLNKANRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSG

Query:  YNQITIALEDQEKTTFTCPYGTF
        ++QI +  E   KT F+   G +
Subjt:  YNQITIALEDQEKTTFTCPYGTF

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein (e value: 1.9e-15; % identity: 31.09)
Query:  VPNDLQGRWTFRIPTHQEFKERKVNSNIQCKKEVNKWLDAGIIYPIVDSNWVSPIQCVPKKGGITVVSNKDNKLIPTRTVTGWRVCMDYRRLNKANRKDH
        V +D++ +   R+P  Q +   + N   +  K V K LD   I P   S   SP+  VPKK G                   +R+C+DYR LNKA   D 
Subjt:  VPNDLQGRWTFRIPTHQEFKERKVNSNIQCKKEVNKWLDAGIIYPIVDSNWVSPIQCVPKKGGITVVSNKDNKLIPTRTVTGWRVCMDYRRLNKANRKDH

Query:  FPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIALEDQEKTTFTCPYGTFALRVLNEAQVNYTTTEKELLAVVFAFEKFGSYFVGSKVTVFTD
        FPLP ID +L R+     +  LD +SGY+QI +  +D+ KT F  P G +   V+    VN  +T    +A  F   +F + ++   + +F++
Subjt:  FPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIALEDQEKTTFTCPYGTFALRVLNEAQVNYTTTEKELLAVVFAFEKFGSYFVGSKVTVFTD

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus (e value: 4.9e-11; % identity: 32.43)
Query:  RVLNEAQVNYTTTEKELLAVVFAFEKFGSYFVGS-KVTVFTDHAAIRYLMAKKDAKPRLIRWVLLLQKFDLEIKDKKGSENVIADHLSRLDPSSSLLKQS
        R LN+ + NY T EKE+LA++++ +   +Y  G+  + V+TDH  + + +  ++   +L RW   +++++ E+  K G  NV+AD LSR+ P  + L  S
Subjt:  RVLNEAQVNYTTTEKELLAVVFAFEKFGSYFVGS-KVTVFTDHAAIRYLMAKKDAKPRLIRWVLLLQKFDLEIKDKKGSENVIADHLSRLDPSSSLLKQS

Query:  AISDAFPDEQL
           DA P++ +
Subjt:  AISDAFPDEQL

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus (e value: 4.3e-07; % identity: 30.71)
Query:  VNSNIQCKKEVNKWLDAGIIYPIVDSNWVSPIQCVPKKGGITVVSNKDNKLIPTRTVTGWRVCMDYRRLNKANRKDHFPLPFIDQMLDRLAGQAYYCFLD
        VN   + ++++++ L  GII P  +S + SPI  VPKK       N + +         +R+ +D++RLN     D +P+P I+  L  L    Y+  LD
Subjt:  VNSNIQCKKEVNKWLDAGIIYPIVDSNWVSPIQCVPKKGGITVVSNKDNKLIPTRTVTGWRVCMDYRRLNKANRKDHFPLPFIDQMLDRLAGQAYYCFLD

Query:  GYSGYNQITIALEDQEKTTFTCPYGTF
          SG++QI +   D  KT F+   G +
Subjt:  GYSGYNQITIALEDQEKTTFTCPYGTF

Q99315 Transposon Ty3-G Gag-Pol polyprotein (e value: 1.9e-15; % identity: 31.09)
Query:  VPNDLQGRWTFRIPTHQEFKERKVNSNIQCKKEVNKWLDAGIIYPIVDSNWVSPIQCVPKKGGITVVSNKDNKLIPTRTVTGWRVCMDYRRLNKANRKDH
        V +D++ +   R+P  Q +   + N   +  K V K LD   I P   S   SP+  VPKK G                   +R+C+DYR LNKA   D 
Subjt:  VPNDLQGRWTFRIPTHQEFKERKVNSNIQCKKEVNKWLDAGIIYPIVDSNWVSPIQCVPKKGGITVVSNKDNKLIPTRTVTGWRVCMDYRRLNKANRKDH

Query:  FPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIALEDQEKTTFTCPYGTFALRVLNEAQVNYTTTEKELLAVVFAFEKFGSYFVGSKVTVFTD
        FPLP ID +L R+     +  LD +SGY+QI +  +D+ KT F  P G +   V+    VN  +T    +A  F   +F + ++   + +F++
Subjt:  FPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIALEDQEKTTFTCPYGTFALRVLNEAQVNYTTTEKELLAVVFAFEKFGSYFVGSKVTVFTD

Arabidopsis top hits (e value; % identity; alignment)
AT3G15190.1 chloroplast 30S ribosomal protein S20, putative (e value: 3.8e-11; % identity: 70)
Query:  QSPFRYFVVCE-AAPTKKVDSTAKRARQAEKRRIYNKARKFEIKTRIKKV
        Q P R  +VCE AAPTKK DS AKRARQAEKRR+YNK++K E +TR+KKV
Subjt:  QSPFRYFVVCE-AAPTKKVDSTAKRARQAEKRRIYNKARKFEIKTRIKKV


Sequences
CDS sequence
ATGAGTAAAGCTCAGAGCCCATTTCGATACTTTGTGGTCTGCGAGGCGGCTCCTACGAAAAAGGTCGATTCTACTGCAAAGAGAGCTCGGCAGGCTGAGAAAAGACGCAT
TTACAACAAAGCCCGGAAGTTTGAAATCAAAACCAGGATCAAGAAGGTACCTAATTTTGCTGATTTAGGTGGAACAGGCTTGGCGCTAATGAATGCTATATTACTTCTTT
GTGGAAATCATGCAGTCACCTCTTCTATTGACCAAATGAGCGTGATATTGGAGGAGTCCATGGAATGTAAGTCTGTAAAGGAGATTTTTCATGATCTCTTCACCAATGGA
GCTCGTTGCTCTCTGAGCAGGACTAGTTGGAACCACACGTGCGCTGCTAGTGATGATGGATTTGTTCCCAATGACCTTCAGGGAAGGTGGACTTTTAGAATTCCCACCCA
TCAAGAATTCAAAGAGAGGAAGGTGAACTCTAATATACAGTGTAAAAAGGAGGTGAATAAATGGTTGGATGCTGGGATCATTTATCCAATTGTCGATAGCAATTGGGTAA
GCCCAATCCAATGTGTTCCTAAGAAAGGAGGTATCACTGTGGTGAGCAATAAAGACAATAAGTTGATCCCAACTAGGACAGTAACTGGCTGGAGGGTTTGCATGGACTAC
AGGAGGCTTAATAAAGCTAACCGTAAGGACCACTTCCCTCTACCATTTATTGACCAGATGTTGGATAGATTGGCTGGTCAGGCCTACTACTGTTTCTTAGATGGTTATTC
TGGGTATAACCAGATTACTATTGCTCTTGAGGATCAGGAAAAAACCACTTTCACCTGCCCTTATGGGACGTTTGCTTTAAGGGTTTTAAATGAAGCACAAGTCAACTATA
CAACTACTGAAAAGGAGTTGTTAGCTGTGGTGTTTGCTTTTGAGAAATTCGGGTCATATTTTGTTGGATCCAAAGTCACGGTGTTCACGGATCATGCCGCAATAAGGTAT
CTAATGGCTAAGAAAGATGCAAAGCCTAGGCTAATTCGTTGGGTTTTATTATTGCAGAAGTTCGACTTAGAGATAAAGGACAAGAAGGGATCAGAAAATGTCATTGCAGA
TCATTTGTCTCGTCTTGATCCATCATCATCTTTGCTGAAACAATCTGCCATTTCAGATGCTTTTCCAGATGAACAACTTTTTGCTATTGAGGTAAAAGTAGTCAGGGGTG
TCCCTTGGTATGCTGATATTGCCAACTTTTTGGTAAAGGGAGTCACTCCTATTGACATGGATTGGAGGCATAAGAAAAAGTTTAAGCATGATGCAAAATTTTTCTATTGG
GATGAGCCATTTATTTATAAGCAATGTTCTAACGGTATTATTCGTAGGTGTGTTTCAAGTGATGAAGCAAAGGAAATCCTGGAGCAATGTCACTCTTCACCGTATGGAGG
TCATTTCAGCGGTCAGAGGACAGCTATGAGGATTTTGCATTGTGGATTCTTTTGGCCTACCTTATTTAAGGATGCCCATGAATGGAATAAAAGTCCCCACGCAGCGGAAG
CGCATCGATTGGACCTTACGTCTTTAAATATGGCGTTAAATCGATTTGTCCCTAAAACGCTTGATATTTATCAGTTGGCAGTTAGGTCTGATCAAGGCGTTGAATTTCCA
CGCTTAGTGAGGAAATCAAGCAAGTGGTGGGCTAATTTTGGAGTTTCTAATTTGGGCCAGCTAGCCGCCGCCGTTCGCTCAGCCGCGTCGCACACCCAGCCGCTCTCGTT
TCGCCATCCAGTCGCCGCACGTTCAGCCGCCGCTGTGGGTTTTCCCTCTTCTCCTCTCCGCGCACTTCTCTCTCCCTCTCTTCCTCGTGCAAGGCGCGAGCCCAGCCGTC
GTTCGTCAGCCAGCCAACGCGCCGCCGCTGCCTCTCTTTCTCCTCTCGATCCCTTGCGTTTTCGGCCGAGGAAAATTCCTGGATCTTGCGCGGACAGCAGCCTGAAGCTT
CGTCTTCTTCGTGATTTCTCCCTCTTTGTTGCGTTTTTTGGCCGTAAATCGTTGGGTAAGCTCTCTTCCTTCGATCTTAGTCGTTTGAGACCCGATCTATGTCATCAATC
TGGGCGCTTTCGGCTTCGTTTAGCGATTTCGGCGAGCTTTAACTCATACCCATTCTTGGTGTTGGGTTTTAATTTGGGGGATCTTGAGTGGCTGTCCGGCAAGGAAATAA
GTCCTCTCTTAGTTGAAATAGGCCTAGAGTTTCTCGTGGCTGTTTTAGGATTGTTGGAGTTGCTTATGAGCGCTATAGCGGAGCGTTACGCGGAAATCACGTGGGTTGTG
AGTGCGGAGCGTGACGCGGAAATCACTTTTAAGCGTATTGTTAGGTTATGGTATAAAAGCATGTTGGGTGTATGTGGTTACTTAAGCATGTTGCCTGTTCATGGTTATGA
AAGCCTGTTGGATGCGTGTGCTAAGCATGTTGTTTGTTTTGGTTGTTGTTTGTGGTTTGGTAAGTGTGACCTGCTTACCAGTACCACGGTTGTACTGATACCCCCTTCCC
CCCTTCCCCCCAACATTTTAGATGTTGCAGGTTACGTTGATGAGCTAGATCCTGGTGAGGAAGAGGAGAACTACGAGGAGGAACCCTAG
mRNA sequence
ATGAGTAAAGCTCAGAGCCCATTTCGATACTTTGTGGTCTGCGAGGCGGCTCCTACGAAAAAGGTCGATTCTACTGCAAAGAGAGCTCGGCAGGCTGAGAAAAGACGCAT
TTACAACAAAGCCCGGAAGTTTGAAATCAAAACCAGGATCAAGAAGGTACCTAATTTTGCTGATTTAGGTGGAACAGGCTTGGCGCTAATGAATGCTATATTACTTCTTT
GTGGAAATCATGCAGTCACCTCTTCTATTGACCAAATGAGCGTGATATTGGAGGAGTCCATGGAATGTAAGTCTGTAAAGGAGATTTTTCATGATCTCTTCACCAATGGA
GCTCGTTGCTCTCTGAGCAGGACTAGTTGGAACCACACGTGCGCTGCTAGTGATGATGGATTTGTTCCCAATGACCTTCAGGGAAGGTGGACTTTTAGAATTCCCACCCA
TCAAGAATTCAAAGAGAGGAAGGTGAACTCTAATATACAGTGTAAAAAGGAGGTGAATAAATGGTTGGATGCTGGGATCATTTATCCAATTGTCGATAGCAATTGGGTAA
GCCCAATCCAATGTGTTCCTAAGAAAGGAGGTATCACTGTGGTGAGCAATAAAGACAATAAGTTGATCCCAACTAGGACAGTAACTGGCTGGAGGGTTTGCATGGACTAC
AGGAGGCTTAATAAAGCTAACCGTAAGGACCACTTCCCTCTACCATTTATTGACCAGATGTTGGATAGATTGGCTGGTCAGGCCTACTACTGTTTCTTAGATGGTTATTC
TGGGTATAACCAGATTACTATTGCTCTTGAGGATCAGGAAAAAACCACTTTCACCTGCCCTTATGGGACGTTTGCTTTAAGGGTTTTAAATGAAGCACAAGTCAACTATA
CAACTACTGAAAAGGAGTTGTTAGCTGTGGTGTTTGCTTTTGAGAAATTCGGGTCATATTTTGTTGGATCCAAAGTCACGGTGTTCACGGATCATGCCGCAATAAGGTAT
CTAATGGCTAAGAAAGATGCAAAGCCTAGGCTAATTCGTTGGGTTTTATTATTGCAGAAGTTCGACTTAGAGATAAAGGACAAGAAGGGATCAGAAAATGTCATTGCAGA
TCATTTGTCTCGTCTTGATCCATCATCATCTTTGCTGAAACAATCTGCCATTTCAGATGCTTTTCCAGATGAACAACTTTTTGCTATTGAGGTAAAAGTAGTCAGGGGTG
TCCCTTGGTATGCTGATATTGCCAACTTTTTGGTAAAGGGAGTCACTCCTATTGACATGGATTGGAGGCATAAGAAAAAGTTTAAGCATGATGCAAAATTTTTCTATTGG
GATGAGCCATTTATTTATAAGCAATGTTCTAACGGTATTATTCGTAGGTGTGTTTCAAGTGATGAAGCAAAGGAAATCCTGGAGCAATGTCACTCTTCACCGTATGGAGG
TCATTTCAGCGGTCAGAGGACAGCTATGAGGATTTTGCATTGTGGATTCTTTTGGCCTACCTTATTTAAGGATGCCCATGAATGGAATAAAAGTCCCCACGCAGCGGAAG
CGCATCGATTGGACCTTACGTCTTTAAATATGGCGTTAAATCGATTTGTCCCTAAAACGCTTGATATTTATCAGTTGGCAGTTAGGTCTGATCAAGGCGTTGAATTTCCA
CGCTTAGTGAGGAAATCAAGCAAGTGGTGGGCTAATTTTGGAGTTTCTAATTTGGGCCAGCTAGCCGCCGCCGTTCGCTCAGCCGCGTCGCACACCCAGCCGCTCTCGTT
TCGCCATCCAGTCGCCGCACGTTCAGCCGCCGCTGTGGGTTTTCCCTCTTCTCCTCTCCGCGCACTTCTCTCTCCCTCTCTTCCTCGTGCAAGGCGCGAGCCCAGCCGTC
GTTCGTCAGCCAGCCAACGCGCCGCCGCTGCCTCTCTTTCTCCTCTCGATCCCTTGCGTTTTCGGCCGAGGAAAATTCCTGGATCTTGCGCGGACAGCAGCCTGAAGCTT
CGTCTTCTTCGTGATTTCTCCCTCTTTGTTGCGTTTTTTGGCCGTAAATCGTTGGGTAAGCTCTCTTCCTTCGATCTTAGTCGTTTGAGACCCGATCTATGTCATCAATC
TGGGCGCTTTCGGCTTCGTTTAGCGATTTCGGCGAGCTTTAACTCATACCCATTCTTGGTGTTGGGTTTTAATTTGGGGGATCTTGAGTGGCTGTCCGGCAAGGAAATAA
GTCCTCTCTTAGTTGAAATAGGCCTAGAGTTTCTCGTGGCTGTTTTAGGATTGTTGGAGTTGCTTATGAGCGCTATAGCGGAGCGTTACGCGGAAATCACGTGGGTTGTG
AGTGCGGAGCGTGACGCGGAAATCACTTTTAAGCGTATTGTTAGGTTATGGTATAAAAGCATGTTGGGTGTATGTGGTTACTTAAGCATGTTGCCTGTTCATGGTTATGA
AAGCCTGTTGGATGCGTGTGCTAAGCATGTTGTTTGTTTTGGTTGTTGTTTGTGGTTTGGTAAGTGTGACCTGCTTACCAGTACCACGGTTGTACTGATACCCCCTTCCC
CCCTTCCCCCCAACATTTTAGATGTTGCAGGTTACGTTGATGAGCTAGATCCTGGTGAGGAAGAGGAGAACTACGAGGAGGAACCCTAG
Protein sequence
MSKAQSPFRYFVVCEAAPTKKVDSTAKRARQAEKRRIYNKARKFEIKTRIKKVPNFADLGGTGLALMNAILLLCGNHAVTSSIDQMSVILEESMECKSVKEIFHDLFTNG
ARCSLSRTSWNHTCAASDDGFVPNDLQGRWTFRIPTHQEFKERKVNSNIQCKKEVNKWLDAGIIYPIVDSNWVSPIQCVPKKGGITVVSNKDNKLIPTRTVTGWRVCMDY
RRLNKANRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIALEDQEKTTFTCPYGTFALRVLNEAQVNYTTTEKELLAVVFAFEKFGSYFVGSKVTVFTDHAAIRY
LMAKKDAKPRLIRWVLLLQKFDLEIKDKKGSENVIADHLSRLDPSSSLLKQSAISDAFPDEQLFAIEVKVVRGVPWYADIANFLVKGVTPIDMDWRHKKKFKHDAKFFYW
DEPFIYKQCSNGIIRRCVSSDEAKEILEQCHSSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHEWNKSPHAAEAHRLDLTSLNMALNRFVPKTLDIYQLAVRSDQGVEFP
RLVRKSSKWWANFGVSNLGQLAAAVRSAASHTQPLSFRHPVAARSAAAVGFPSSPLRALLSPSLPRARREPSRRSSASQRAAAASLSPLDPLRFRPRKIPGSCADSSLKL
RLLRDFSLFVAFFGRKSLGKLSSFDLSRLRPDLCHQSGRFRLRLAISASFNSYPFLVLGFNLGDLEWLSGKEISPLLVEIGLEFLVAVLGLLELLMSAIAERYAEITWVV
SAERDAEITFKRIVRLWYKSMLGVCGYLSMLPVHGYESLLDACAKHVVCFGCCLWFGKCDLLTSTTVVLIPPSPLPPNILDVAGYVDELDPGEEEENYEEEP