CuGenDBv2

Sed0019323 (gene) of Chayote v1 genome

Gene ID: Sed0019323
Organism: Sechium edule (Chayote v1)
Description: Reverse transcriptase
Genome location: LG03:40966487..40968649
RNA-Seq Expression: Sed0019323
Synteny: Sed0019323
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001878 - Zinc finger, CCHC-type
IPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily
IPR032567 - LDOC1-related
IPR036875 - Zinc finger, CCHC-type superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits | e value | % identity | Alignment
KAA0025242.1 pol protein [Cucumis melo var. makuwa] | 1.0e-94 | 36.99
Query:  RGTRIRSDTAGTSSQAETDGIPPVAPPVAPRGGVEASERREIALPEREFADSMRALMG------------------------------AAAPPAPQEQVA
        RGTR      G  +       PPVAP V P   V    + ++A  E+ + D ++A +                               A APPAP+E  A
Subjt:  RGTRIRSDTAGTSSQAETDGIPPVAPPVAPRGGVEASERREIALPEREFADSMRALMG------------------------------AAAPPAPQEQVA

Query:  PEADVAEGAEELGRWLKDFVKWKLEKFDASGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCAAHVLGGTARWWWESTLSERPAGSPPA--TWEFFKSEFK
            V   AE   + L+DF K+  + FD S D    A  W+  +E  F  M CP+ Q+ +CA   L      WWE+  +ER  G   +  TWE FK  F 
Subjt:  PEADVAEGAEELGRWLKDFVKWKLEKFDASGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCAAHVLGGTARWWWESTLSERPAGSPPA--TWEFFKSEFK

Query:  AKYISDEAQEKMVELFQNLKQGSDSIEEYERKFSEYGYFAPQLIATPELKIRKFISGLR--QATIYNGKYPVERV----VAAPVQRPVPA----------
        AK+ S   +   ++ F NL+QG  ++E+Y+ +F     FAP ++     +  KF+ GLR     I     P        +A  +  P  A          
Subjt:  AKYISDEAQEKMVELFQNLKQGSDSIEEYERKFSEYGYFAPQLIATPELKIRKFISGLR--QATIYNGKYPVERV----VAAPVQRPVPA----------

Query:  TVGQKRKFDFRG--GGQR--RQGQFQNRDQRRAPARFQPSQQGGPRQFPACATCGRAHAEACQTRPRLCFRCGREGHLARFCDQPAMAV---QPAQQQQG
         +GQKRK + +     QR  R G    R +R   A  +       R+ PAC TCGR H   C     +CFRC + GH A  C +        QP+  QQG
Subjt:  TVGQKRKFDFRG--GGQR--RQGQFQNRDQRRAPARFQPSQQGGPRQFPACATCGRAHAEACQTRPRLCFRCGREGHLARFCDQPAMAV---QPAQQQQG

Query:  RVFATTRQEAERTDAAIAG------------FE--------------------------LSVATFSGVNMLARDKIKDGRIEISGELLVATLIVLPMSGF
        RVFATTRQEAER    + G            F+                          LSV+T SG  +L++++IK  R+EI+  +L  TL+VL M  F
Subjt:  RVFATTRQEAERTDAAIAG------------FE--------------------------LSVATFSGVNMLARDKIKDGRIEISGELLVATLIVLPMSGF

Query:  DVILGMEWLAANHALIECHRKEVVF-------------------------------SSWSLGLSSQVVRGEPPLAIVDQSLVVREFADVFLRSSPG----
        DVILGM+WL+ANHA I+C+ KEVVF                               S  + G+ + VV    P   +    VVRE+ DVF    PG    
Subjt:  DVILGMEWLAANHALIECHRKEVVF-------------------------------SSWSLGLSSQVVRGEPPLAIVDQSLVVREFADVFLRSSPG----

Query:  -------------APQERGSVSYGANELKESRFRLQEMLDRGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPRIDDLFDQLQGAFV
                     AP  R        ELKE + +LQE+LD+GFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKN+YPLPRIDDLFDQLQGA V
Subjt:  -------------APQERGSVSYGANELKESRFRLQEMLDRGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPRIDDLFDQLQGAFV

Query:  FSKIDLRSDYH
        FSKIDLRS YH
Subjt:  FSKIDLRSDYH

KAA0048442.1 pol protein [Cucumis melo var. makuwa] | 5.7e-98 | 39.12
Query:  RGTRIRSDTAGTSSQAETDGIPPVAPPVAPRGGVEASERREIALPEREFADSMRALMG----------------AAAPPAPQEQVAPEADVAEGAEELGR
        RG R      G  +       PPVAP V P   V    + ++A  E+ + D ++A +                 A APPAP+E  A    V   AE   +
Subjt:  RGTRIRSDTAGTSSQAETDGIPPVAPPVAPRGGVEASERREIALPEREFADSMRALMG----------------AAAPPAPQEQVAPEADVAEGAEELGR

Query:  WLKDFVKWKLEKFDASGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCAAHVLGGTARWWWESTLSERPAGSP--PATWEFFKSEFKAKYISDEAQEKMVE
         L+DF K+  + FD S D    A  W+  +E  F  M CP+ Q+ +CA   L      WWE+  +ER  G      TWE FK  F AK+ S   +   ++
Subjt:  WLKDFVKWKLEKFDASGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCAAHVLGGTARWWWESTLSERPAGSP--PATWEFFKSEFKAKYISDEAQEKMVE

Query:  LFQNLKQGSDSIEEYERKFSEYGYFAPQLIATPELKIRKFISGLR--QATIYNGKYPVERV----VAAPVQRPVPA----------TVGQKRKF----DF
         F NL+QG  ++E+Y+ +F     FAP +I     +  KF+ GLR     I     P        +A  +  P  A           +GQKRK     D 
Subjt:  LFQNLKQGSDSIEEYERKFSEYGYFAPQLIATPELKIRKFISGLR--QATIYNGKYPVERV----VAAPVQRPVPA----------TVGQKRKF----DF

Query:  RGGGQRRQGQFQNRDQRRAPARFQPSQQGGPRQFPACATCGRAHAEACQTRPRLCFRCGREGHLARFCDQPAMAV---QPAQQQQGRVFATTRQEAERTD
              R G    R +R   A  +       R+ PAC TCGR H   C     +CFRC + GH A  C +        QP+  QQGRVFATTRQEAER  
Subjt:  RGGGQRRQGQFQNRDQRRAPARFQPSQQGGPRQFPACATCGRAHAEACQTRPRLCFRCGREGHLARFCDQPAMAV---QPAQQQQGRVFATTRQEAERTD

Query:  AAIA---GFELSVATFSGVNMLARDKIKDGRIEISGELLVATLIVLPMSGFDVILGMEWLAANHALIECHRKEVVF------------------------
          +    G  LSV+T SG  + ++++IK  R+EI+  +L  TL+VL M  FDVILGM+WL+ANHA I+C  KEVVF                        
Subjt:  AAIA---GFELSVATFSGVNMLARDKIKDGRIEISGELLVATLIVLPMSGFDVILGMEWLAANHALIECHRKEVVF------------------------

Query:  -------SSWSLGLSSQVVRGEPPLAIVDQSLVVREFADVFLRSSPG-----------------APQERGSVSYGANELKESRFRLQEMLDRGFIRPSVS
               S  + G+ + VV    P   +    VVRE+ DVF    PG                 AP  R        ELKE + +LQE+LD+GFIRPSVS
Subjt:  -------SSWSLGLSSQVVRGEPPLAIVDQSLVVREFADVFLRSSPG-----------------APQERGSVSYGANELKESRFRLQEMLDRGFIRPSVS

Query:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPRIDDLFDQLQGAFVFSKIDLRSDYH
        PWGAPVLFVK KDGSMRLCIDYRELNKVTVKN+YPLPRIDDLFDQLQGA VFSKIDLRS YH
Subjt:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPRIDDLFDQLQGAFVFSKIDLRSDYH

KAA0053272.1 gag protease polyprotein [Cucumis melo var. makuwa] | 1.0e-94 | 37.39
Query:  RGTRIRSDTAGTSSQAETDGIPPVAPPVAPRGGVEASERREIALPEREFADSMRALMG----------------AAAPPAPQEQVAPEADVAEGAEELGR
        RG R      G  +       PPVAP V P   V    + ++A  E+ + D ++A +                 A APPAP+E  A    V   AE   +
Subjt:  RGTRIRSDTAGTSSQAETDGIPPVAPPVAPRGGVEASERREIALPEREFADSMRALMG----------------AAAPPAPQEQVAPEADVAEGAEELGR

Query:  WLKDFVKWKLEKFDASGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCAAHVLGGTARWWWESTLSERPAGSPPA--TWEFFKSEFKAKYISDEAQEKMVE
         L+DF K+  + FD S D    A  W+  +E  F  M CP+ Q+ +CA   L      WWE+  +ER  G   +  TWE FK  F AK+ S   +   ++
Subjt:  WLKDFVKWKLEKFDASGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCAAHVLGGTARWWWESTLSERPAGSPPA--TWEFFKSEFKAKYISDEAQEKMVE

Query:  LFQNLKQGSDSIEEYERKFSEYGYFAPQLIATPELKIRKFISGLR--QATIYNGKYPVERV----VAAPVQRPVPA----------TVGQKRKFDFRGG-
         F NL+QG  ++E+Y+ +F     FAP ++     +  KF+ GLR     I     P        +A  +  P  A           +GQKRK + +   
Subjt:  LFQNLKQGSDSIEEYERKFSEYGYFAPQLIATPELKIRKFISGLR--QATIYNGKYPVERV----VAAPVQRPVPA----------TVGQKRKFDFRGG-

Query:  ----GQRRQGQFQNRDQRRAPARFQPSQQGGPRQFPACATCGRAHAEACQTRPRLCFRCGREGHLARFCDQPAMAV---QPAQQQQGRVFATTRQEAERT
              R  G FQ   Q  A A          R+ PAC TCGR H   C     +CFRC + GH A  C +        QP+  QQGRVFATTRQEAER 
Subjt:  ----GQRRQGQFQNRDQRRAPARFQPSQQGGPRQFPACATCGRAHAEACQTRPRLCFRCGREGHLARFCDQPAMAV---QPAQQQQGRVFATTRQEAERT

Query:  DAAIAG------------FE--------------------------LSVATFSGVNMLARDKIKDGRIEISGELLVATLIVLPMSGFDVILGMEWLAANH
           + G            F+                          LSV+T SG  +L++++IK  R+EI+  +L  TL+VL M  FDVILGM+WL+ANH
Subjt:  DAAIAG------------FE--------------------------LSVATFSGVNMLARDKIKDGRIEISGELLVATLIVLPMSGFDVILGMEWLAANH

Query:  ALIECHRKEVVF-------------------------------SSWSLGLSSQVVRGEPPLAIVDQSLVVREFADVFLRSSPG-----------------
        A I+C  KEVVF                               S  + G+ + VV    P   +    VVRE+ DVF    PG                 
Subjt:  ALIECHRKEVVF-------------------------------SSWSLGLSSQVVRGEPPLAIVDQSLVVREFADVFLRSSPG-----------------

Query:  APQERGSVSYGANELKESRFRLQEMLDRGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPRIDDLFDQLQGAFVFSKIDLRSDYH
        AP  R        ELKE + +LQE+LD+GFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKN+YPLPRI+DLFDQLQGA VFSKIDLRS YH
Subjt:  APQERGSVSYGANELKESRFRLQEMLDRGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPRIDDLFDQLQGAFVFSKIDLRSDYH

KAA0054800.1 reverse transcriptase [Cucumis melo var. makuwa] | 1.3e-94 | 38.34
Query:  MPVTRDG-RGRPRAGRGTRIRSDTAGTSSQAETDGIPPVAPPVAPRGGVEASERREIALPEREFADSMRALMG----------------AAAPPAPQEQV
        MP  R   RG  R GRG         +  Q E     P AP V P   V    + ++A  E+ + D ++A +                 A  PPAP+E  
Subjt:  MPVTRDG-RGRPRAGRGTRIRSDTAGTSSQAETDGIPPVAPPVAPRGGVEASERREIALPEREFADSMRALMG----------------AAAPPAPQEQV

Query:  APEADVAEGAEELGRWLKDFVKWKLEKFDASGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCAAHVLGGTARWWWESTLSERPAGSPPA--TWEFFKSEF
        A    +   AE   + L+DF K+  + FD S D    A  W+  +E  F  M CP+ Q+ +C    L      WWE+  +ER  G   +  TWE FK  F
Subjt:  APEADVAEGAEELGRWLKDFVKWKLEKFDASGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCAAHVLGGTARWWWESTLSERPAGSPPA--TWEFFKSEF

Query:  KAKYISDEAQEKMVELFQNLKQGSDSIEEYERKFSEYGYFAPQLIATPELKIRKFISGLR--QATIYNGKYPVERVVAAPV-------QRPVP-------
         AK+ S   +   ++ F NL+QG  ++E+Y+ +F     FAP ++     +  KF+ GLR     I     P     A  +       +R  P       
Subjt:  KAKYISDEAQEKMVELFQNLKQGSDSIEEYERKFSEYGYFAPQLIATPELKIRKFISGLR--QATIYNGKYPVERVVAAPV-------QRPVP-------

Query:  ATVGQKRKFDFRGG--GQR--RQGQFQNRDQRRAPARFQPSQQGGPRQFPACATCGRAHAEACQTRPRLCFRCGREGHLARFCDQPAMAV---QPAQQQQ
        + VGQKRK + +     QR  R G    R +R   A  +       R+ PAC TCGR H   C     +CFRC + GH A  C +        QP+  QQ
Subjt:  ATVGQKRKFDFRGG--GQR--RQGQFQNRDQRRAPARFQPSQQGGPRQFPACATCGRAHAEACQTRPRLCFRCGREGHLARFCDQPAMAV---QPAQQQQ

Query:  GRVFATTRQEAERTDAAIAG------------FE--------------------------LSVATFSGVNMLARDKIKDGRIEISGELLVATLIVLPMSG
        GRVFATTRQEAER    + G            F+                          LSV+T SG  +L+++KIK  R+EI+  +L  TL+VL M  
Subjt:  GRVFATTRQEAERTDAAIAG------------FE--------------------------LSVATFSGVNMLARDKIKDGRIEISGELLVATLIVLPMSG

Query:  FDVILGMEWLAANHALIECHRKEVVF-----SSWSLGLSSQVVRGEPPLAIVDQSLVVREFADVFLRSSPG-----------------APQERGSVSYGA
        FDVILGM+WL+ANHA I+C  KEVVF     +S+    +  V+  + P   +    VVRE+ DVF    PG                 AP  R       
Subjt:  FDVILGMEWLAANHALIECHRKEVVF-----SSWSLGLSSQVVRGEPPLAIVDQSLVVREFADVFLRSSPG-----------------APQERGSVSYGA

Query:  NELKESRFRLQEMLDRGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPRIDDLFDQLQGAFVFSKIDLRSDYH
         ELKE + +LQE+LD+GFIRPSVSPWGAPVLF+KKKDGSM LCIDYRELNKVTVKN+YPLPRIDDLFDQLQGA VFSKIDLRSDYH
Subjt:  NELKESRFRLQEMLDRGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPRIDDLFDQLQGAFVFSKIDLRSDYH

TYJ95850.1 pol protein [Cucumis melo var. makuwa] | 1.3e-94 | 36.99
Query:  RGTRIRSDTAGTSSQAETDGIPPVAPPVAPRGGVEASERREIALPEREFADSMRALMG------------------------------AAAPPAPQEQVA
        RGTR      G  +       PPVAP V P   V    + ++A  E+ + D ++A +                               A APPAP+E  A
Subjt:  RGTRIRSDTAGTSSQAETDGIPPVAPPVAPRGGVEASERREIALPEREFADSMRALMG------------------------------AAAPPAPQEQVA

Query:  PEADVAEGAEELGRWLKDFVKWKLEKFDASGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCAAHVLGGTARWWWESTLSERPAGSPPA--TWEFFKSEFK
            V   AE   + L+DF K+  + FD S D    A  W+  +E  F  M CP+ Q+ +CA   L      WWE+  +ER  G   +  TWE FK  F 
Subjt:  PEADVAEGAEELGRWLKDFVKWKLEKFDASGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCAAHVLGGTARWWWESTLSERPAGSPPA--TWEFFKSEFK

Query:  AKYISDEAQEKMVELFQNLKQGSDSIEEYERKFSEYGYFAPQLIATPELKIRKFISGLR--QATIYNGKYPVERV----VAAPVQRPVPA----------
        AK+ S   +   ++ F NL+QG  ++E+Y+ +F     FAP ++     +  KF+ GLR     I     P        +A  +  P  A          
Subjt:  AKYISDEAQEKMVELFQNLKQGSDSIEEYERKFSEYGYFAPQLIATPELKIRKFISGLR--QATIYNGKYPVERV----VAAPVQRPVPA----------

Query:  TVGQKRKFDFRGG--GQR--RQGQFQNRDQRRAPARFQPSQQGGPRQFPACATCGRAHAEACQTRPRLCFRCGREGHLARFCDQPAMAV---QPAQQQQG
         +GQKRK + +     QR  R G    R +R   A  +       R+ PAC TCGR H   C     +CFRC + GH A  C +        QP+  QQG
Subjt:  TVGQKRKFDFRGG--GQR--RQGQFQNRDQRRAPARFQPSQQGGPRQFPACATCGRAHAEACQTRPRLCFRCGREGHLARFCDQPAMAV---QPAQQQQG

Query:  RVFATTRQEAERTDAAIAG------------FE--------------------------LSVATFSGVNMLARDKIKDGRIEISGELLVATLIVLPMSGF
        RVFATTRQEAER    + G            F+                          LSV+T SG  +L++++IK  R+EI+  +L  TL+VL M  F
Subjt:  RVFATTRQEAERTDAAIAG------------FE--------------------------LSVATFSGVNMLARDKIKDGRIEISGELLVATLIVLPMSGF

Query:  DVILGMEWLAANHALIECHRKEVVF-------------------------------SSWSLGLSSQVVRGEPPLAIVDQSLVVREFADVFLRSSPG----
        DVILGM+WL+ANHA I+C+ KEVVF                               S  + G+ + VV    P   +    VVRE+ DVF    PG    
Subjt:  DVILGMEWLAANHALIECHRKEVVF-------------------------------SSWSLGLSSQVVRGEPPLAIVDQSLVVREFADVFLRSSPG----

Query:  -------------APQERGSVSYGANELKESRFRLQEMLDRGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPRIDDLFDQLQGAFV
                     AP  R        ELKE + +LQE+LD+GFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKN+YPLPRIDDLFDQLQGA V
Subjt:  -------------APQERGSVSYGANELKESRFRLQEMLDRGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPRIDDLFDQLQGAFV

Query:  FSKIDLRSDYH
        FSKIDLRS YH
Subjt:  FSKIDLRSDYH

TrEMBL top hits | e value | % identity | Alignment
A0A5A7TSL0 Reverse transcriptase | 4.9e-95 | 36.99
Query:  RGTRIRSDTAGTSSQAETDGIPPVAPPVAPRGGVEASERREIALPEREFADSMRALMG------------------------------AAAPPAPQEQVA
        RGTR      G  +       PPVAP V P   V    + ++A  E+ + D ++A +                               A APPAP+E  A
Subjt:  RGTRIRSDTAGTSSQAETDGIPPVAPPVAPRGGVEASERREIALPEREFADSMRALMG------------------------------AAAPPAPQEQVA

Query:  PEADVAEGAEELGRWLKDFVKWKLEKFDASGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCAAHVLGGTARWWWESTLSERPAGSPPA--TWEFFKSEFK
            V   AE   + L+DF K+  + FD S D    A  W+  +E  F  M CP+ Q+ +CA   L      WWE+  +ER  G   +  TWE FK  F 
Subjt:  PEADVAEGAEELGRWLKDFVKWKLEKFDASGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCAAHVLGGTARWWWESTLSERPAGSPPA--TWEFFKSEFK

Query:  AKYISDEAQEKMVELFQNLKQGSDSIEEYERKFSEYGYFAPQLIATPELKIRKFISGLR--QATIYNGKYPVERV----VAAPVQRPVPA----------
        AK+ S   +   ++ F NL+QG  ++E+Y+ +F     FAP ++     +  KF+ GLR     I     P        +A  +  P  A          
Subjt:  AKYISDEAQEKMVELFQNLKQGSDSIEEYERKFSEYGYFAPQLIATPELKIRKFISGLR--QATIYNGKYPVERV----VAAPVQRPVPA----------

Query:  TVGQKRKFDFRG--GGQR--RQGQFQNRDQRRAPARFQPSQQGGPRQFPACATCGRAHAEACQTRPRLCFRCGREGHLARFCDQPAMAV---QPAQQQQG
         +GQKRK + +     QR  R G    R +R   A  +       R+ PAC TCGR H   C     +CFRC + GH A  C +        QP+  QQG
Subjt:  TVGQKRKFDFRG--GGQR--RQGQFQNRDQRRAPARFQPSQQGGPRQFPACATCGRAHAEACQTRPRLCFRCGREGHLARFCDQPAMAV---QPAQQQQG

Query:  RVFATTRQEAERTDAAIAG------------FE--------------------------LSVATFSGVNMLARDKIKDGRIEISGELLVATLIVLPMSGF
        RVFATTRQEAER    + G            F+                          LSV+T SG  +L++++IK  R+EI+  +L  TL+VL M  F
Subjt:  RVFATTRQEAERTDAAIAG------------FE--------------------------LSVATFSGVNMLARDKIKDGRIEISGELLVATLIVLPMSGF

Query:  DVILGMEWLAANHALIECHRKEVVF-------------------------------SSWSLGLSSQVVRGEPPLAIVDQSLVVREFADVFLRSSPG----
        DVILGM+WL+ANHA I+C+ KEVVF                               S  + G+ + VV    P   +    VVRE+ DVF    PG    
Subjt:  DVILGMEWLAANHALIECHRKEVVF-------------------------------SSWSLGLSSQVVRGEPPLAIVDQSLVVREFADVFLRSSPG----

Query:  -------------APQERGSVSYGANELKESRFRLQEMLDRGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPRIDDLFDQLQGAFV
                     AP  R        ELKE + +LQE+LD+GFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKN+YPLPRIDDLFDQLQGA V
Subjt:  -------------APQERGSVSYGANELKESRFRLQEMLDRGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPRIDDLFDQLQGAFV

Query:  FSKIDLRSDYH
        FSKIDLRS YH
Subjt:  FSKIDLRSDYH

A0A5A7TY28 Reverse transcriptase | 2.8e-98 | 39.12
Query:  RGTRIRSDTAGTSSQAETDGIPPVAPPVAPRGGVEASERREIALPEREFADSMRALMG----------------AAAPPAPQEQVAPEADVAEGAEELGR
        RG R      G  +       PPVAP V P   V    + ++A  E+ + D ++A +                 A APPAP+E  A    V   AE   +
Subjt:  RGTRIRSDTAGTSSQAETDGIPPVAPPVAPRGGVEASERREIALPEREFADSMRALMG----------------AAAPPAPQEQVAPEADVAEGAEELGR

Query:  WLKDFVKWKLEKFDASGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCAAHVLGGTARWWWESTLSERPAGSP--PATWEFFKSEFKAKYISDEAQEKMVE
         L+DF K+  + FD S D    A  W+  +E  F  M CP+ Q+ +CA   L      WWE+  +ER  G      TWE FK  F AK+ S   +   ++
Subjt:  WLKDFVKWKLEKFDASGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCAAHVLGGTARWWWESTLSERPAGSP--PATWEFFKSEFKAKYISDEAQEKMVE

Query:  LFQNLKQGSDSIEEYERKFSEYGYFAPQLIATPELKIRKFISGLR--QATIYNGKYPVERV----VAAPVQRPVPA----------TVGQKRKF----DF
         F NL+QG  ++E+Y+ +F     FAP +I     +  KF+ GLR     I     P        +A  +  P  A           +GQKRK     D 
Subjt:  LFQNLKQGSDSIEEYERKFSEYGYFAPQLIATPELKIRKFISGLR--QATIYNGKYPVERV----VAAPVQRPVPA----------TVGQKRKF----DF

Query:  RGGGQRRQGQFQNRDQRRAPARFQPSQQGGPRQFPACATCGRAHAEACQTRPRLCFRCGREGHLARFCDQPAMAV---QPAQQQQGRVFATTRQEAERTD
              R G    R +R   A  +       R+ PAC TCGR H   C     +CFRC + GH A  C +        QP+  QQGRVFATTRQEAER  
Subjt:  RGGGQRRQGQFQNRDQRRAPARFQPSQQGGPRQFPACATCGRAHAEACQTRPRLCFRCGREGHLARFCDQPAMAV---QPAQQQQGRVFATTRQEAERTD

Query:  AAIA---GFELSVATFSGVNMLARDKIKDGRIEISGELLVATLIVLPMSGFDVILGMEWLAANHALIECHRKEVVF------------------------
          +    G  LSV+T SG  + ++++IK  R+EI+  +L  TL+VL M  FDVILGM+WL+ANHA I+C  KEVVF                        
Subjt:  AAIA---GFELSVATFSGVNMLARDKIKDGRIEISGELLVATLIVLPMSGFDVILGMEWLAANHALIECHRKEVVF------------------------

Query:  -------SSWSLGLSSQVVRGEPPLAIVDQSLVVREFADVFLRSSPG-----------------APQERGSVSYGANELKESRFRLQEMLDRGFIRPSVS
               S  + G+ + VV    P   +    VVRE+ DVF    PG                 AP  R        ELKE + +LQE+LD+GFIRPSVS
Subjt:  -------SSWSLGLSSQVVRGEPPLAIVDQSLVVREFADVFLRSSPG-----------------APQERGSVSYGANELKESRFRLQEMLDRGFIRPSVS

Query:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPRIDDLFDQLQGAFVFSKIDLRSDYH
        PWGAPVLFVK KDGSMRLCIDYRELNKVTVKN+YPLPRIDDLFDQLQGA VFSKIDLRS YH
Subjt:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPRIDDLFDQLQGAFVFSKIDLRSDYH

A0A5A7UC03 Gag protease polyprotein | 4.9e-95 | 37.39
Query:  RGTRIRSDTAGTSSQAETDGIPPVAPPVAPRGGVEASERREIALPEREFADSMRALMG----------------AAAPPAPQEQVAPEADVAEGAEELGR
        RG R      G  +       PPVAP V P   V    + ++A  E+ + D ++A +                 A APPAP+E  A    V   AE   +
Subjt:  RGTRIRSDTAGTSSQAETDGIPPVAPPVAPRGGVEASERREIALPEREFADSMRALMG----------------AAAPPAPQEQVAPEADVAEGAEELGR

Query:  WLKDFVKWKLEKFDASGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCAAHVLGGTARWWWESTLSERPAGSPPA--TWEFFKSEFKAKYISDEAQEKMVE
         L+DF K+  + FD S D    A  W+  +E  F  M CP+ Q+ +CA   L      WWE+  +ER  G   +  TWE FK  F AK+ S   +   ++
Subjt:  WLKDFVKWKLEKFDASGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCAAHVLGGTARWWWESTLSERPAGSPPA--TWEFFKSEFKAKYISDEAQEKMVE

Query:  LFQNLKQGSDSIEEYERKFSEYGYFAPQLIATPELKIRKFISGLR--QATIYNGKYPVERV----VAAPVQRPVPA----------TVGQKRKFDFRGG-
         F NL+QG  ++E+Y+ +F     FAP ++     +  KF+ GLR     I     P        +A  +  P  A           +GQKRK + +   
Subjt:  LFQNLKQGSDSIEEYERKFSEYGYFAPQLIATPELKIRKFISGLR--QATIYNGKYPVERV----VAAPVQRPVPA----------TVGQKRKFDFRGG-

Query:  ----GQRRQGQFQNRDQRRAPARFQPSQQGGPRQFPACATCGRAHAEACQTRPRLCFRCGREGHLARFCDQPAMAV---QPAQQQQGRVFATTRQEAERT
              R  G FQ   Q  A A          R+ PAC TCGR H   C     +CFRC + GH A  C +        QP+  QQGRVFATTRQEAER 
Subjt:  ----GQRRQGQFQNRDQRRAPARFQPSQQGGPRQFPACATCGRAHAEACQTRPRLCFRCGREGHLARFCDQPAMAV---QPAQQQQGRVFATTRQEAERT

Query:  DAAIAG------------FE--------------------------LSVATFSGVNMLARDKIKDGRIEISGELLVATLIVLPMSGFDVILGMEWLAANH
           + G            F+                          LSV+T SG  +L++++IK  R+EI+  +L  TL+VL M  FDVILGM+WL+ANH
Subjt:  DAAIAG------------FE--------------------------LSVATFSGVNMLARDKIKDGRIEISGELLVATLIVLPMSGFDVILGMEWLAANH

Query:  ALIECHRKEVVF-------------------------------SSWSLGLSSQVVRGEPPLAIVDQSLVVREFADVFLRSSPG-----------------
        A I+C  KEVVF                               S  + G+ + VV    P   +    VVRE+ DVF    PG                 
Subjt:  ALIECHRKEVVF-------------------------------SSWSLGLSSQVVRGEPPLAIVDQSLVVREFADVFLRSSPG-----------------

Query:  APQERGSVSYGANELKESRFRLQEMLDRGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPRIDDLFDQLQGAFVFSKIDLRSDYH
        AP  R        ELKE + +LQE+LD+GFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKN+YPLPRI+DLFDQLQGA VFSKIDLRS YH
Subjt:  APQERGSVSYGANELKESRFRLQEMLDRGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPRIDDLFDQLQGAFVFSKIDLRSDYH

A0A5A7UHN4 Reverse transcriptase | 6.4e-95 | 38.34
Query:  MPVTRDG-RGRPRAGRGTRIRSDTAGTSSQAETDGIPPVAPPVAPRGGVEASERREIALPEREFADSMRALMG----------------AAAPPAPQEQV
        MP  R   RG  R GRG         +  Q E     P AP V P   V    + ++A  E+ + D ++A +                 A  PPAP+E  
Subjt:  MPVTRDG-RGRPRAGRGTRIRSDTAGTSSQAETDGIPPVAPPVAPRGGVEASERREIALPEREFADSMRALMG----------------AAAPPAPQEQV

Query:  APEADVAEGAEELGRWLKDFVKWKLEKFDASGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCAAHVLGGTARWWWESTLSERPAGSPPA--TWEFFKSEF
        A    +   AE   + L+DF K+  + FD S D    A  W+  +E  F  M CP+ Q+ +C    L      WWE+  +ER  G   +  TWE FK  F
Subjt:  APEADVAEGAEELGRWLKDFVKWKLEKFDASGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCAAHVLGGTARWWWESTLSERPAGSPPA--TWEFFKSEF

Query:  KAKYISDEAQEKMVELFQNLKQGSDSIEEYERKFSEYGYFAPQLIATPELKIRKFISGLR--QATIYNGKYPVERVVAAPV-------QRPVP-------
         AK+ S   +   ++ F NL+QG  ++E+Y+ +F     FAP ++     +  KF+ GLR     I     P     A  +       +R  P       
Subjt:  KAKYISDEAQEKMVELFQNLKQGSDSIEEYERKFSEYGYFAPQLIATPELKIRKFISGLR--QATIYNGKYPVERVVAAPV-------QRPVP-------

Query:  ATVGQKRKFDFRGG--GQR--RQGQFQNRDQRRAPARFQPSQQGGPRQFPACATCGRAHAEACQTRPRLCFRCGREGHLARFCDQPAMAV---QPAQQQQ
        + VGQKRK + +     QR  R G    R +R   A  +       R+ PAC TCGR H   C     +CFRC + GH A  C +        QP+  QQ
Subjt:  ATVGQKRKFDFRGG--GQR--RQGQFQNRDQRRAPARFQPSQQGGPRQFPACATCGRAHAEACQTRPRLCFRCGREGHLARFCDQPAMAV---QPAQQQQ

Query:  GRVFATTRQEAERTDAAIAG------------FE--------------------------LSVATFSGVNMLARDKIKDGRIEISGELLVATLIVLPMSG
        GRVFATTRQEAER    + G            F+                          LSV+T SG  +L+++KIK  R+EI+  +L  TL+VL M  
Subjt:  GRVFATTRQEAERTDAAIAG------------FE--------------------------LSVATFSGVNMLARDKIKDGRIEISGELLVATLIVLPMSG

Query:  FDVILGMEWLAANHALIECHRKEVVF-----SSWSLGLSSQVVRGEPPLAIVDQSLVVREFADVFLRSSPG-----------------APQERGSVSYGA
        FDVILGM+WL+ANHA I+C  KEVVF     +S+    +  V+  + P   +    VVRE+ DVF    PG                 AP  R       
Subjt:  FDVILGMEWLAANHALIECHRKEVVF-----SSWSLGLSSQVVRGEPPLAIVDQSLVVREFADVFLRSSPG-----------------APQERGSVSYGA

Query:  NELKESRFRLQEMLDRGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPRIDDLFDQLQGAFVFSKIDLRSDYH
         ELKE + +LQE+LD+GFIRPSVSPWGAPVLF+KKKDGSM LCIDYRELNKVTVKN+YPLPRIDDLFDQLQGA VFSKIDLRSDYH
Subjt:  NELKESRFRLQEMLDRGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPRIDDLFDQLQGAFVFSKIDLRSDYH

A0A5D3CQB5 Reverse transcriptase | 6.4e-95 | 36.99
Query:  RGTRIRSDTAGTSSQAETDGIPPVAPPVAPRGGVEASERREIALPEREFADSMRALMG------------------------------AAAPPAPQEQVA
        RGTR      G  +       PPVAP V P   V    + ++A  E+ + D ++A +                               A APPAP+E  A
Subjt:  RGTRIRSDTAGTSSQAETDGIPPVAPPVAPRGGVEASERREIALPEREFADSMRALMG------------------------------AAAPPAPQEQVA

Query:  PEADVAEGAEELGRWLKDFVKWKLEKFDASGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCAAHVLGGTARWWWESTLSERPAGSPPA--TWEFFKSEFK
            V   AE   + L+DF K+  + FD S D    A  W+  +E  F  M CP+ Q+ +CA   L      WWE+  +ER  G   +  TWE FK  F 
Subjt:  PEADVAEGAEELGRWLKDFVKWKLEKFDASGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCAAHVLGGTARWWWESTLSERPAGSPPA--TWEFFKSEFK

Query:  AKYISDEAQEKMVELFQNLKQGSDSIEEYERKFSEYGYFAPQLIATPELKIRKFISGLR--QATIYNGKYPVERV----VAAPVQRPVPA----------
        AK+ S   +   ++ F NL+QG  ++E+Y+ +F     FAP ++     +  KF+ GLR     I     P        +A  +  P  A          
Subjt:  AKYISDEAQEKMVELFQNLKQGSDSIEEYERKFSEYGYFAPQLIATPELKIRKFISGLR--QATIYNGKYPVERV----VAAPVQRPVPA----------

Query:  TVGQKRKFDFRGG--GQR--RQGQFQNRDQRRAPARFQPSQQGGPRQFPACATCGRAHAEACQTRPRLCFRCGREGHLARFCDQPAMAV---QPAQQQQG
         +GQKRK + +     QR  R G    R +R   A  +       R+ PAC TCGR H   C     +CFRC + GH A  C +        QP+  QQG
Subjt:  TVGQKRKFDFRGG--GQR--RQGQFQNRDQRRAPARFQPSQQGGPRQFPACATCGRAHAEACQTRPRLCFRCGREGHLARFCDQPAMAV---QPAQQQQG

Query:  RVFATTRQEAERTDAAIAG------------FE--------------------------LSVATFSGVNMLARDKIKDGRIEISGELLVATLIVLPMSGF
        RVFATTRQEAER    + G            F+                          LSV+T SG  +L++++IK  R+EI+  +L  TL+VL M  F
Subjt:  RVFATTRQEAERTDAAIAG------------FE--------------------------LSVATFSGVNMLARDKIKDGRIEISGELLVATLIVLPMSGF

Query:  DVILGMEWLAANHALIECHRKEVVF-------------------------------SSWSLGLSSQVVRGEPPLAIVDQSLVVREFADVFLRSSPG----
        DVILGM+WL+ANHA I+C+ KEVVF                               S  + G+ + VV    P   +    VVRE+ DVF    PG    
Subjt:  DVILGMEWLAANHALIECHRKEVVF-------------------------------SSWSLGLSSQVVRGEPPLAIVDQSLVVREFADVFLRSSPG----

Query:  -------------APQERGSVSYGANELKESRFRLQEMLDRGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPRIDDLFDQLQGAFV
                     AP  R        ELKE + +LQE+LD+GFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKN+YPLPRIDDLFDQLQGA V
Subjt:  -------------APQERGSVSYGANELKESRFRLQEMLDRGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPRIDDLFDQLQGAFV

Query:  FSKIDLRSDYH
        FSKIDLRS YH
Subjt:  FSKIDLRSDYH

SwissProt top hits | e value | % identity | Alignment
P0CT34 Transposon Tf2-1 polyprotein | 1.6e-13 | 44.16
Query:  LQEMLDRGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPRIDDLFDQLQGAFVFSKIDLRSDYH
        + + L  G IR S +    PV+FV KK+G++R+ +DY+ LNK    N YPLP I+ L  ++QG+ +F+K+DL+S YH
Subjt:  LQEMLDRGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPRIDDLFDQLQGAFVFSKIDLRSDYH

P0CT41 Transposon Tf2-12 polyprotein | 1.6e-13 | 44.16
Query:  LQEMLDRGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPRIDDLFDQLQGAFVFSKIDLRSDYH
        + + L  G IR S +    PV+FV KK+G++R+ +DY+ LNK    N YPLP I+ L  ++QG+ +F+K+DL+S YH
Subjt:  LQEMLDRGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPRIDDLFDQLQGAFVFSKIDLRSDYH

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein | 2.7e-18 | 51.95
Query:  LQEMLDRGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPRIDDLFDQLQGAFVFSKIDLRSDYH
        +Q++LD  FI PS SP  +PV+ V KKDG+ RLC+DYR LNK T+ + +PLPRID+L  ++  A +F+ +DL S YH
Subjt:  LQEMLDRGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPRIDDLFDQLQGAFVFSKIDLRSDYH

Q99315 Transposon Ty3-G Gag-Pol polyprotein | 2.7e-18 | 51.95
Query:  LQEMLDRGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPRIDDLFDQLQGAFVFSKIDLRSDYH
        +Q++LD  FI PS SP  +PV+ V KKDG+ RLC+DYR LNK T+ + +PLPRID+L  ++  A +F+ +DL S YH
Subjt:  LQEMLDRGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPRIDDLFDQLQGAFVFSKIDLRSDYH

Q9UR07 Transposon Tf2-11 polyprotein | 1.6e-13 | 44.16
Query:  LQEMLDRGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPRIDDLFDQLQGAFVFSKIDLRSDYH
        + + L  G IR S +    PV+FV KK+G++R+ +DY+ LNK    N YPLP I+ L  ++QG+ +F+K+DL+S YH
Subjt:  LQEMLDRGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPRIDDLFDQLQGAFVFSKIDLRSDYH

Arabidopsis top hits | e value | % identity | Alignment
ATMG00850.1 DNA/RNA polymerases superfamily protein | 3.9e-04 | 47.83
Query:  SVSYGANELKESRFR--LQEMLDRGFIRPSVSPWGAPVLFVKKKDG
        SV  G + L+ +R +  L EML+   I+PS+SP+ +PVL V+KKDG
Subjt:  SVSYGANELKESRFR--LQEMLDRGFIRPSVSPWGAPVLFVKKKDG


Sequences
CDS sequence
ATGCCTGTCACCCGCGATGGAAGAGGACGTCCGAGAGCAGGTCGAGGTACCAGGATCAGATCTGATACTGCAGGGACATCGTCCCAAGCTGAGACTGATGGCATT
CCACCAGTAGCACCTCCTGTTGCGCCTAGAGGAGGAGTTGAGGCGTCTGAGCGCCGTGAGATCGCTCTCCCCGAGCGAGAATTTGCTGATTCTATGAGGGCTTTG
ATGGGCGCAGCAGCACCCCCAGCTCCACAGGAGCAAGTAGCCCCTGAGGCAGATGTAGCAGAGGGTGCCGAGGAGCTTGGTAGATGGTTAAAGGACTTCGTGAAG
TGGAAGCTAGAGAAGTTTGATGCCTCGGGAGATGCTTTAGCGGCAGCCAGATGGATTGCCCATTTGGAGTACACCTTCCTGATTATGGTGTGCCCCGATGTTCAG
AGGCCGAGGTGTGCAGCCCATGTGCTAGGAGGCACAGCTAGATGGTGGTGGGAGTCCACCTTGAGTGAGAGACCAGCTGGGTCTCCACCTGCTACTTGGGAGTTC
TTCAAGTCGGAGTTCAAGGCCAAGTACATTAGTGATGAGGCTCAAGAGAAGATGGTGGAGCTTTTCCAAAATCTGAAGCAGGGTTCTGACTCTATCGAGGAGTAT
GAGAGGAAGTTCTCGGAGTATGGTTATTTTGCTCCGCAGTTGATAGCTACTCCTGAGCTGAAGATTAGGAAGTTCATTTCTGGTTTGAGGCAGGCCACCATTTAT
AATGGGAAGTATCCTGTTGAGCGGGTTGTTGCCGCTCCGGTTCAGCGTCCTGTCCCAGCTACAGTGGGTCAGAAGAGGAAGTTTGATTTCCGTGGAGGCGGCCAG
CGCCGTCAAGGTCAGTTTCAGAACCGTGATCAGCGTAGAGCTCCAGCCCGTTTCCAGCCTAGCCAGCAGGGCGGTCCCAGGCAGTTTCCCGCATGTGCTACTTGT
GGGAGAGCCCACGCTGAAGCTTGTCAGACCAGGCCCAGGCTGTGTTTTCGTTGTGGAAGAGAGGGGCACTTGGCCCGTTTTTGTGATCAGCCAGCTATGGCAGTT
CAGCCAGCGCAGCAGCAGCAGGGTCGGGTGTTTGCTACGACCCGTCAGGAGGCTGAGCGTACGGATGCCGCTATTGCAGGTTTTGAGTTGTCTGTTGCAACTTTT
TCGGGAGTTAATATGTTAGCTAGAGATAAGATTAAGGATGGCCGGATCGAGATATCTGGAGAGTTGCTTGTAGCTACGCTCATAGTTCTGCCTATGAGCGGTTTT
GATGTGATTCTTGGTATGGAGTGGTTAGCTGCTAACCACGCATTGATTGAATGCCACCGGAAAGAAGTAGTGTTCAGCTCATGGAGCTTGGGGCTATCTAGCCAA
GTTGTTCGAGGAGAGCCGCCCCTCGCGATCGTTGATCAGTCTTTGGTTGTGCGGGAGTTTGCGGATGTGTTTCTGAGGAGTTCACCGGGCGCCCCTCAAGAGAGA
GGCTCCGTATCGTATGGCGCCAATGAGTTAAAGGAATCAAGATTCAGGCTGCAGGAGATGTTAGATCGAGGGTTCATCCGTCCGAGTGTGTCCCCGTGGGGAGCA
CCAGTTCTGTTTGTGAAAAAGAAGGATGGTTCTATGCGCTTATGTATTGATTATCGCGAGCTTAATAAGGTGACAGTGAAGAATAAGTATCCGTTGCCCCGTATT
GATGATTTGTTCGACCAGCTTCAGGGAGCTTTCGTGTTCTCCAAGATAGATTTGAGATCCGATTACCATTAG
mRNA sequence
ATGCCTGTCACCCGCGATGGAAGAGGACGTCCGAGAGCAGGTCGAGGTACCAGGATCAGATCTGATACTGCAGGGACATCGTCCCAAGCTGAGACTGATGGCATT
CCACCAGTAGCACCTCCTGTTGCGCCTAGAGGAGGAGTTGAGGCGTCTGAGCGCCGTGAGATCGCTCTCCCCGAGCGAGAATTTGCTGATTCTATGAGGGCTTTG
ATGGGCGCAGCAGCACCCCCAGCTCCACAGGAGCAAGTAGCCCCTGAGGCAGATGTAGCAGAGGGTGCCGAGGAGCTTGGTAGATGGTTAAAGGACTTCGTGAAG
TGGAAGCTAGAGAAGTTTGATGCCTCGGGAGATGCTTTAGCGGCAGCCAGATGGATTGCCCATTTGGAGTACACCTTCCTGATTATGGTGTGCCCCGATGTTCAG
AGGCCGAGGTGTGCAGCCCATGTGCTAGGAGGCACAGCTAGATGGTGGTGGGAGTCCACCTTGAGTGAGAGACCAGCTGGGTCTCCACCTGCTACTTGGGAGTTC
TTCAAGTCGGAGTTCAAGGCCAAGTACATTAGTGATGAGGCTCAAGAGAAGATGGTGGAGCTTTTCCAAAATCTGAAGCAGGGTTCTGACTCTATCGAGGAGTAT
GAGAGGAAGTTCTCGGAGTATGGTTATTTTGCTCCGCAGTTGATAGCTACTCCTGAGCTGAAGATTAGGAAGTTCATTTCTGGTTTGAGGCAGGCCACCATTTAT
AATGGGAAGTATCCTGTTGAGCGGGTTGTTGCCGCTCCGGTTCAGCGTCCTGTCCCAGCTACAGTGGGTCAGAAGAGGAAGTTTGATTTCCGTGGAGGCGGCCAG
CGCCGTCAAGGTCAGTTTCAGAACCGTGATCAGCGTAGAGCTCCAGCCCGTTTCCAGCCTAGCCAGCAGGGCGGTCCCAGGCAGTTTCCCGCATGTGCTACTTGT
GGGAGAGCCCACGCTGAAGCTTGTCAGACCAGGCCCAGGCTGTGTTTTCGTTGTGGAAGAGAGGGGCACTTGGCCCGTTTTTGTGATCAGCCAGCTATGGCAGTT
CAGCCAGCGCAGCAGCAGCAGGGTCGGGTGTTTGCTACGACCCGTCAGGAGGCTGAGCGTACGGATGCCGCTATTGCAGGTTTTGAGTTGTCTGTTGCAACTTTT
TCGGGAGTTAATATGTTAGCTAGAGATAAGATTAAGGATGGCCGGATCGAGATATCTGGAGAGTTGCTTGTAGCTACGCTCATAGTTCTGCCTATGAGCGGTTTT
GATGTGATTCTTGGTATGGAGTGGTTAGCTGCTAACCACGCATTGATTGAATGCCACCGGAAAGAAGTAGTGTTCAGCTCATGGAGCTTGGGGCTATCTAGCCAA
GTTGTTCGAGGAGAGCCGCCCCTCGCGATCGTTGATCAGTCTTTGGTTGTGCGGGAGTTTGCGGATGTGTTTCTGAGGAGTTCACCGGGCGCCCCTCAAGAGAGA
GGCTCCGTATCGTATGGCGCCAATGAGTTAAAGGAATCAAGATTCAGGCTGCAGGAGATGTTAGATCGAGGGTTCATCCGTCCGAGTGTGTCCCCGTGGGGAGCA
CCAGTTCTGTTTGTGAAAAAGAAGGATGGTTCTATGCGCTTATGTATTGATTATCGCGAGCTTAATAAGGTGACAGTGAAGAATAAGTATCCGTTGCCCCGTATT
GATGATTTGTTCGACCAGCTTCAGGGAGCTTTCGTGTTCTCCAAGATAGATTTGAGATCCGATTACCATTAG
Protein sequence
MPVTRDGRGRPRAGRGTRIRSDTAGTSSQAETDGIPPVAPPVAPRGGVEASERREIALPEREFADSMRALMGAAAPPAPQEQVAPEADVAEGAEELGRWLKDFVK
WKLEKFDASGDALAAARWIAHLEYTFLIMVCPDVQRPRCAAHVLGGTARWWWESTLSERPAGSPPATWEFFKSEFKAKYISDEAQEKMVELFQNLKQGSDSIEEY
ERKFSEYGYFAPQLIATPELKIRKFISGLRQATIYNGKYPVERVVAAPVQRPVPATVGQKRKFDFRGGGQRRQGQFQNRDQRRAPARFQPSQQGGPRQFPACATC
GRAHAEACQTRPRLCFRCGREGHLARFCDQPAMAVQPAQQQQGRVFATTRQEAERTDAAIAGFELSVATFSGVNMLARDKIKDGRIEISGELLVATLIVLPMSGF
DVILGMEWLAANHALIECHRKEVVFSSWSLGLSSQVVRGEPPLAIVDQSLVVREFADVFLRSSPGAPQERGSVSYGANELKESRFRLQEMLDRGFIRPSVSPWGA
PVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPRIDDLFDQLQGAFVFSKIDLRSDYH