; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI06G16040 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI06G16040
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionReverse transcriptase
Genome locationChr6:14447962..14451230
RNA-Seq ExpressionCSPI06G16040
SyntenyCSPI06G16040
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR005162 - Retrotransposon gag domain
IPR012337 - Ribonuclease H-like superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031931.1 pol protein [Cucumis melo var. makuwa]2.0e-24447.06Show/hide
Query:  RMVGGDVRQITWDQFRDRFYTKFFSANLRDVKSQKFLELKQGHMTVEEYDQEFDMLSRFALEL------------------------ALKPTTQAEALCL
        RM+GGDV QITW QF++ FY KFFSA+LRD K Q+FL L+QG MTVE+YD EFDMLSRFA E+                        A +P T A+AL L
Subjt:  RMVGGDVRQITWDQFRDRFYTKFFSANLRDVKSQKFLELKQGHMTVEEYDQEFDMLSRFALEL------------------------ALKPTTQAEALCL

Query:  VVDISIGKDEVHAKNFDKGKSYGQKRKVEQWFVEVPKRNLRLSGPLCSFQQSPGGAGDTTREKPLCNTCGKLHLGRCLGGTRVCFKYRQEGHMADRCPLR
         VD+S+ +    +K   +G + GQKRK EQ  V VP+RN R  G    FQQ P  AG+  R KPLC TCGK HLGRCL GTR CFK RQEGH ADRCPLR
Subjt:  VVDISIGKDEVHAKNFDKGKSYGQKRKVEQWFVEVPKRNLRLSGPLCSFQQSPGGAGDTTREKPLCNTCGKLHLGRCLGGTRVCFKYRQEGHMADRCPLR

Query:  FTETGQS--------SQRFAISRLEAESADTVVT---------------------------------EVEPLDYVLSVSTPFGKIMLSKEKIKACKIEIT
         T   Q+         + FA ++ EAE A TVVT                                 EVEPL +VLSVSTP G+ MLSKEK+KAC+IEI 
Subjt:  FTETGQS--------SQRFAISRLEAESADTVVT---------------------------------EVEPLDYVLSVSTPFGKIMLSKEKIKACKIEIT

Query:  GRVLD--------------------------------KVVFSPPTKPSFKFKAVGTVVLPKVISAMKASKLLNQGTWSILASMVDTREGEISLTSKPVVK
        G V++                                +V F+PP+  SFKFK  G+  LP+VISA++ASKLL+QGTW ILAS+VDTRE ++SL+S+PVV+
Subjt:  GRVLD--------------------------------KVVFSPPTKPSFKFKAVGTVVLPKVISAMKASKLLNQGTWSILASMVDTREGEISLTSKPVVK

Query:  EYPDVFPEELLGLPPHREIDFAIVLEPGTTPISRASYRMAPTELKELKVQLQELLDKGFIRPSVSLWDAPALFVKKKDGSMRLCIDYRELN---------
        +YPDVFPEEL GLPPHRE++FAI LEPGT PISRA YRMAP ELKELKVQLQELLDKGFIRPS+S W AP LFVKKKDGSMRLCIDYRELN         
Subjt:  EYPDVFPEELLGLPPHREIDFAIVLEPGTTPISRASYRMAPTELKELKVQLQELLDKGFIRPSVSLWDAPALFVKKKDGSMRLCIDYRELN---------

Query:  ------------------KIDLRSGYHELRIRDSYIPKT--------------------------------------VFVRVFIDDILVYSKIETEHEEH
                          KIDLRSGYH+LRI+D  +PKT                                       FV VFIDDIL+YSK E EHEEH
Subjt:  ------------------KIDLRSGYHELRIRDSYIPKT--------------------------------------VFVRVFIDDILVYSKIETEHEEH

Query:  LHTFLETLRANKLYAKLSKCEFWLKQVAFLGHVVSSEGVFVDPVKIEVVTSWSRPSTVSE--DIIG--GLWRTFL------------VTRKGTPFVWSLT
        L   L+TLR NKLYAK SKCEFWLKQV+FLGHVVS  GV VDP KIE VT W+RPSTVSE    +G  G +R F+            +TRKG PFVWS  
Subjt:  LHTFLETLRANKLYAKLSKCEFWLKQVAFLGHVVSSEGVFVDPVKIEVVTSWSRPSTVSE--DIIG--GLWRTFL------------VTRKGTPFVWSLT

Query:  CDDSFKNLKQKLVIAPVLTVPDGSG---------------------------------------------------------------------------
        C+DSF+NLKQKLV A VLTVPDGSG                                                                           
Subjt:  CDDSFKNLKQKLVIAPVLTVPDGSG---------------------------------------------------------------------------

Query:  ------------------------------KANVVADALSGKVSHSTALITRQAPLHRDIERAEIS----------------------------SDPCLV
                                      KANVVADALS KVSHS ALITRQAPLHRD+ERAEI+                            +DP LV
Subjt:  ------------------------------KANVVADALSGKVSHSTALITRQAPLHRDIERAEIS----------------------------SDPCLV

Query:  ERRHLVGSVKTDEFSISSNGGLLFERRLCVLVDIAVKTDLLNEAHNSPFSMHPGSTKMYEDLN-------------EFIG------GVKAPRQKPAGLLQ
        E+R L  + +  EFS+SS+GGLLFERRLCV  D AVKT+LL+EAH+SPFSMHPGSTKMY+DL              EF+        VKAPRQKPAGLLQ
Subjt:  ERRHLVGSVKTDEFSISSNGGLLFERRLCVLVDIAVKTDLLNEAHNSPFSMHPGSTKMYEDLN-------------EFIG------GVKAPRQKPAGLLQ

Query:  PLSVLEWKWENVSMDFITGLPRILKGY-------TRCT------------------------------------------FYFQVLEGTSYCHGHELDFS
        PLS+ EWKWEN+SMDFITGLPR L+G+        R T                                          F  +  +G     G  LDFS
Subjt:  PLSVLEWKWENVSMDFITGLPRILKGY-------TRCT------------------------------------------FYFQVLEGTSYCHGHELDFS

Query:  TTFHAQTDGQTERLNQVLEDMLRA
        T FH QTDGQTERLNQVLEDMLRA
Subjt:  TTFHAQTDGQTERLNQVLEDMLRA

KAA0043391.1 pol protein [Cucumis melo var. makuwa]6.0e-24948.14Show/hide
Query:  RMVGGDVRQITWDQFRDRFYTKFFSANLRDVKSQKFLELKQGHMTVEEYDQEFDMLSRFALEL------------------------ALKPTTQAEALCL
        RM+GGDV QITW QF++ FY KFFSA+LRD K Q+FL ++QG MTVE+YD EFDMLSRFA E+                        A +P T A+AL L
Subjt:  RMVGGDVRQITWDQFRDRFYTKFFSANLRDVKSQKFLELKQGHMTVEEYDQEFDMLSRFALEL------------------------ALKPTTQAEALCL

Query:  VVDISIGKDEVHAKNFDKGKSYGQKRKVEQWFVEVPKRNLRLSGPLCSFQQSPGGAGDTTREKPLCNTCGKLHLGRCLGGTRVCFKYRQEGHMADRCPLR
         VD+S+ +    +K   +G + GQKRK EQ  V VP+RN R  G    FQQ P  AG+  R KPLC TCGK HLGRCL GTR CFK RQEGH ADRCPLR
Subjt:  VVDISIGKDEVHAKNFDKGKSYGQKRKVEQWFVEVPKRNLRLSGPLCSFQQSPGGAGDTTREKPLCNTCGKLHLGRCLGGTRVCFKYRQEGHMADRCPLR

Query:  FTETGQS--------SQRFAISRLEAESADTVVT---------------------------------EVEPLDYVLSVSTPFGKIMLSKEKIKACKIEIT
         T   Q+         + FA ++ EAE A TVVT                                 EVEPL +VLSVSTP G+ MLSKEK+KAC+IEI 
Subjt:  FTETGQS--------SQRFAISRLEAESADTVVT---------------------------------EVEPLDYVLSVSTPFGKIMLSKEKIKACKIEIT

Query:  GRVLD--------------------------------KVVFSPPTKPSFKFKAVGTVVLPKVISAMKASKLLNQGTWSILASMVDTREGEISLTSKPVVK
        G V++                                +V F+PP+  SFKFK  G+  LP+VISA++ASKLL+QGTW ILAS+VDTRE ++SL+S+PVV+
Subjt:  GRVLD--------------------------------KVVFSPPTKPSFKFKAVGTVVLPKVISAMKASKLLNQGTWSILASMVDTREGEISLTSKPVVK

Query:  EYPDVFPEELLGLPPHREIDFAIVLEPGTTPISRASYRMAPTELKELKVQLQELLDKGFIRPSVSLWDAPALFVKKKDGSMRLCIDYRELN---------
        +YPDVFPEEL GLPPHRE++FAI LEPGT PISRA YRMAP ELKELKVQLQELLDKGFIRPSVS W AP LFVKKKDGSMRLCIDYRELN         
Subjt:  EYPDVFPEELLGLPPHREIDFAIVLEPGTTPISRASYRMAPTELKELKVQLQELLDKGFIRPSVSLWDAPALFVKKKDGSMRLCIDYRELN---------

Query:  ------------------KIDLRSGYHELRIRDSYIPKT--------------------------------------VFVRVFIDDILVYSKIETEHEEH
                          KIDLRSGYH+LRI+D  +PKT                                       FV VFIDDIL+YSK E EHEEH
Subjt:  ------------------KIDLRSGYHELRIRDSYIPKT--------------------------------------VFVRVFIDDILVYSKIETEHEEH

Query:  LHTFLETLRANKLYAKLSKCEFWLKQVAFLGHVVSSEGVFVDPVKIEVVTSWSRPSTVSE--DIIG--GLWRTFL------------VTRKGTPFVWSLT
        L   L+TLR NKLYAK SKCEFWLKQV+FLGHVVS  GV VDP KIE VT W+RPSTVSE    +G  G +  F+            +TRKG PFVWS  
Subjt:  LHTFLETLRANKLYAKLSKCEFWLKQVAFLGHVVSSEGVFVDPVKIEVVTSWSRPSTVSE--DIIG--GLWRTFL------------VTRKGTPFVWSLT

Query:  CDDSFKNLKQKLVIAPVLTVPDGS----------------------------------------------------------------------------
        C+DSF+NLKQKLV APVLTVPDGS                                                                            
Subjt:  CDDSFKNLKQKLVIAPVLTVPDGS----------------------------------------------------------------------------

Query:  -----------------------------GKANVVADALSGKVSHSTALITRQAPLHRDIERAEI----------------------------SSDPCLV
                                     GKANVVADALS KVSHS ALITRQA LHRD+ERA+I                            S+DP LV
Subjt:  -----------------------------GKANVVADALSGKVSHSTALITRQAPLHRDIERAEI----------------------------SSDPCLV

Query:  ERRHLVGSVKTDEFSISSNGGLLFERRLCVLVDIAVKTDLLNEAHNSPFSMHPGSTKMYEDLN-------------EFIG------GVKAPRQKPAGLLQ
        E+R L  + +  EFS+SS+GGLLFERRLCV  D AVK +LL+EAH+SPFSMHPGSTKMY+DL              EF+        VKAPRQKPAGLLQ
Subjt:  ERRHLVGSVKTDEFSISSNGGLLFERRLCVLVDIAVKTDLLNEAHNSPFSMHPGSTKMYEDLN-------------EFIG------GVKAPRQKPAGLLQ

Query:  PLSVLEWKWENVSMDFITGLPRILKGYT-------RCT-------------------FYFQVLEGTSYCHGHELDFSTTFHAQTDGQTERLNQVLEDMLR
        PLS+ EWKWENVSMDFITGLPR L+G+T       R T                    Y   ++G     G  LDFSTTFH QTDGQTERLNQVLEDMLR
Subjt:  PLSVLEWKWENVSMDFITGLPRILKGYT-------RCT-------------------FYFQVLEGTSYCHGHELDFSTTFHAQTDGQTERLNQVLEDMLR

Query:  A
        A
Subjt:  A

KAA0053368.1 pol protein [Cucumis melo var. makuwa]3.4e-24446.8Show/hide
Query:  RMVGGDVRQITWDQFRDRFYTKFFSANLRDVKSQKFLELKQGHMTVEEYDQEFDMLSRFALEL------------------------ALKPTTQAEALCL
        RM+G DV QITW QF++ FY KFFSA+LRD K Q+FL L+QG MTVE+YD EFDMLSRFA E+                        A +P T A+AL L
Subjt:  RMVGGDVRQITWDQFRDRFYTKFFSANLRDVKSQKFLELKQGHMTVEEYDQEFDMLSRFALEL------------------------ALKPTTQAEALCL

Query:  VVDISIGKDEVHAKNFDKGKSYGQKRKVEQWFVEVPKRNLRLSGPLCSFQQSPGGAGDTTREKPLCNTCGKLHLGRCLGGTRVCFKYRQEGHMADRCPLR
         VD+S+ +    +K   +G + GQKRK EQ  V VP+RN R  G    FQQ P   G+  R KPLC TCGK HLGRCL GTR CFK RQEGH+ADRCPLR
Subjt:  VVDISIGKDEVHAKNFDKGKSYGQKRKVEQWFVEVPKRNLRLSGPLCSFQQSPGGAGDTTREKPLCNTCGKLHLGRCLGGTRVCFKYRQEGHMADRCPLR

Query:  FTETGQS--------SQRFAISRLEAESADTVVT---------------------------------EVEPLDYVLSVSTPFGKIMLSKEKIKACKIEIT
         T   Q+         + FA ++ EAE A TVVT                                 EVEPL +VLSVSTP G+ MLSKE++KAC+IEI 
Subjt:  FTETGQS--------SQRFAISRLEAESADTVVT---------------------------------EVEPLDYVLSVSTPFGKIMLSKEKIKACKIEIT

Query:  GRVLD--------------------------------KVVFSPPTKPSFKFKAVGTVVLPKVISAMKASKLLNQGTWSILASMVDTREGEISLTSKPVVK
        G V++                                 V F+PP+  SFKFK  G+  LP+VISA++ASKL +QGTW ILAS+VDTRE ++SL+S+PVV+
Subjt:  GRVLD--------------------------------KVVFSPPTKPSFKFKAVGTVVLPKVISAMKASKLLNQGTWSILASMVDTREGEISLTSKPVVK

Query:  EYPDVFPEELLGLPPHREIDFAIVLEPGTTPISRASYRMAPTELKELKVQLQELLDKGFIRPSVSLWDAPALFVKKKDGSMRLCIDYRELN---------
        +YPDVFPEEL GLPPHRE++FAI LEPGT PISRA YRMAP ELKELKVQLQELLDKGFIRPSVS W AP LFVKKKDGSMRLCIDYRELN         
Subjt:  EYPDVFPEELLGLPPHREIDFAIVLEPGTTPISRASYRMAPTELKELKVQLQELLDKGFIRPSVSLWDAPALFVKKKDGSMRLCIDYRELN---------

Query:  ------------------KIDLRSGYHELRIRDSYIPKT--------------------------------------VFVRVFIDDILVYSKIETEHEEH
                          KIDLRSGYH+LRI+D  +PKT                                       FV VFIDDIL+YSK E EHEEH
Subjt:  ------------------KIDLRSGYHELRIRDSYIPKT--------------------------------------VFVRVFIDDILVYSKIETEHEEH

Query:  LHTFLETLRANKLYAKLSKCEFWLKQVAFLGHVVSSEGVFVDPVKIEVVTSWSRPSTVSE--DIIG--GLWRTFL------------VTRKGTPFVWSLT
        L   L+TLR NKLYAK SKCEFWLKQV+FLGHVVS  GV VDP KIE VT W+RPSTVSE    +G  G +R F+            +TRKG PFVWS  
Subjt:  LHTFLETLRANKLYAKLSKCEFWLKQVAFLGHVVSSEGVFVDPVKIEVVTSWSRPSTVSE--DIIG--GLWRTFL------------VTRKGTPFVWSLT

Query:  CDDSFKNLKQKLVIAPVLTVPDGS----------------------------------------------------------------------------
        C+DSF+NLKQKLV AP+LTVPDGS                                                                            
Subjt:  CDDSFKNLKQKLVIAPVLTVPDGS----------------------------------------------------------------------------

Query:  -----------------------------GKANVVADALSGKVSHSTALITRQAPLHRDIERAEI----------------------------SSDPCLV
                                     GKANVVADALS KVSHS ALITRQAPLHRD+ERAEI                            S+DP LV
Subjt:  -----------------------------GKANVVADALSGKVSHSTALITRQAPLHRDIERAEI----------------------------SSDPCLV

Query:  ERRHLVGSVKTDEFSISSNGGLLFERRLCVLVDIAVKTDLLNEAHNSPFSMHPGSTKMYEDLN-------------EFIG------GVKAPRQKPAGLLQ
        E+R L  + + + FSISS+GGL FERRLCV  D  +KT+LL+EAH+SPFSMHPGSTKMY+DL              EF+        VKAPRQKPAGLLQ
Subjt:  ERRHLVGSVKTDEFSISSNGGLLFERRLCVLVDIAVKTDLLNEAHNSPFSMHPGSTKMYEDLN-------------EFIG------GVKAPRQKPAGLLQ

Query:  PLSVLEWKWENVSMDFITGLPRILKGYT--------------------RCT-----------------------------FYFQVLEGTSYCHGHELDFS
        PLS+ EWKWENVSMDFITGLPR L+G+T                     CT                             F  +  +G     G  LDFS
Subjt:  PLSVLEWKWENVSMDFITGLPRILKGYT--------------------RCT-----------------------------FYFQVLEGTSYCHGHELDFS

Query:  TTFHAQTDGQTERLNQVLEDMLRA
        T FH QTDGQTERLNQVLEDMLRA
Subjt:  TTFHAQTDGQTERLNQVLEDMLRA

KAA0056806.1 pol protein [Cucumis melo var. makuwa]2.1e-24649.24Show/hide
Query:  RMVGGDVRQITWDQFRDRFYTKFFSANLRDVKSQKFLELKQGHMTVEEYDQEFDMLSRFALEL------------------------ALKPTTQAEALCL
        RM+GGDV QITW QF++ FY KFF A+LRD K Q+FL L+QG MTVE+YD EFDMLSRFA E+                        A +P T  +AL L
Subjt:  RMVGGDVRQITWDQFRDRFYTKFFSANLRDVKSQKFLELKQGHMTVEEYDQEFDMLSRFALEL------------------------ALKPTTQAEALCL

Query:  VVDISIGKDEVHAKNFDKGKSYGQKRKVEQWFVEVPKRNLRLSGPLCSFQQSPGGAGDTTREKPLCNTCGKLHLGRCLGGTRVCFKYRQEGHMADRCPLR
         VD+S+ +    +K   KG +  QKRK EQ  V VP+RN R  G    FQQ P   G+  R KPLC TCGK HLGRCL GTR CFK RQEGH ADR PLR
Subjt:  VVDISIGKDEVHAKNFDKGKSYGQKRKVEQWFVEVPKRNLRLSGPLCSFQQSPGGAGDTTREKPLCNTCGKLHLGRCLGGTRVCFKYRQEGHMADRCPLR

Query:  FTETGQS--------SQRFAISRLEAESADTVVT---------------------------------EVEPLDYVLSVSTPFGKIMLSKEKIKACKIEIT
         T   Q+         + FA ++ EAE A TVVT                                 EVEPL +VLSVSTP G+ MLSKEK+KAC+IEI 
Subjt:  FTETGQS--------SQRFAISRLEAESADTVVT---------------------------------EVEPLDYVLSVSTPFGKIMLSKEKIKACKIEIT

Query:  GRVLD--------------------------------KVVFSPPTKPSFKFKAVGTVVLPKVISAMKASKLLNQGTWSILASMVDTREGEISLTSKPVVK
        G V++                                +V F+PP+  SFKFK  G+  LP+VISA++ASKLL+QGTW ILAS+VDTRE ++SL+S+ VV+
Subjt:  GRVLD--------------------------------KVVFSPPTKPSFKFKAVGTVVLPKVISAMKASKLLNQGTWSILASMVDTREGEISLTSKPVVK

Query:  EYPDVFPEELLGLPPHREIDFAIVLEPGTTPISRASYRMAPTELKELKVQLQELLDKGFIRPSVSLWDAPALFVKKKDGSMRLCIDYRELN---------
        +YPDVFPEEL GLPPHRE++FAI LEPGT PISRA YRMAP ELKELKVQLQELLDKGFIRPSVS W AP LFVKKKDGSMRLCIDYRELN         
Subjt:  EYPDVFPEELLGLPPHREIDFAIVLEPGTTPISRASYRMAPTELKELKVQLQELLDKGFIRPSVSLWDAPALFVKKKDGSMRLCIDYRELN---------

Query:  ------------------KIDLRSGYHELRIRDSYIPKT--------------------------------------VFVRVFIDDILVYSKIETEHEEH
                          KIDLRSGYH+LRI+D  +PKT                                       FV VFIDDIL+YSK E EHEEH
Subjt:  ------------------KIDLRSGYHELRIRDSYIPKT--------------------------------------VFVRVFIDDILVYSKIETEHEEH

Query:  LHTFLETLRANKLYAKLSKCEFWLKQVAFLGHVVSSEGVFVDPVKIEVVTSWSRPSTVSE--DIIG--GLWRTFL------------VTRKGTPFVWSLT
        L   L+TLR NKLYAK SKCEFWLKQV+FLGHVVS  GV VDP KIE VT W+RPSTVSE    +G  G +R F+            +TRKG  FVWS  
Subjt:  LHTFLETLRANKLYAKLSKCEFWLKQVAFLGHVVSSEGVFVDPVKIEVVTSWSRPSTVSE--DIIG--GLWRTFL------------VTRKGTPFVWSLT

Query:  CDDSFKNLKQKLVIAPVLTVPDGS----------------------------------------------------------------------------
        C+DSF+NLKQKLV APVLT+PDGS                                                                            
Subjt:  CDDSFKNLKQKLVIAPVLTVPDGS----------------------------------------------------------------------------

Query:  -----------------------------GKANVVADALSGKVSHSTALITRQAPLHRDIERAEIS---SDPCLVERRHLVGSVKTDEFSISSNGGLLFE
                                     GKANVVADALS KVSHS ALITRQ PLHRD+ERAEI+   +DP LVE+R L  + +  EFSISS+GGLLFE
Subjt:  -----------------------------GKANVVADALSGKVSHSTALITRQAPLHRDIERAEIS---SDPCLVERRHLVGSVKTDEFSISSNGGLLFE

Query:  RRLCVLVDIAVKTDLLNEAHNSPFSMHPGSTKMYEDLNEFIGGVKAPRQKPAGLLQPLSVLEWKWENVSMDFITGLPRILKGYTRCTFYFQVLE------
        RRLCV  D  VKT+LL+EAH+SPFSMHPGSTKMY+DL      VKAPRQKPAGLLQPLS+ EWKWENVSMDFITGLPR L+G+T       +L       
Subjt:  RRLCVLVDIAVKTDLLNEAHNSPFSMHPGSTKMYEDLNEFIGGVKAPRQKPAGLLQPLSVLEWKWENVSMDFITGLPRILKGYTRCTFYFQVLE------

Query:  -------------------GTSYCHGHELDFSTTFHAQTDGQTERLNQVLEDMLRA
                           G     G  LDFST FH QTD QTERLNQVLEDMLRA
Subjt:  -------------------GTSYCHGHELDFSTTFHAQTDGQTERLNQVLEDMLRA

TYK01613.1 pol protein [Cucumis melo var. makuwa]1.3e-24847.69Show/hide
Query:  RMVGGDVRQITWDQFRDRFYTKFFSANLRDVKSQKFLELKQGHMTVEEYDQEFDMLSRFALEL------------------------ALKPTTQAEALCL
        RM+GGDV QITW QF++ FY KFFSA+LRD K Q+FL L+QG MTVE+YD EFDMLSRFA E+                        A +P T A+AL L
Subjt:  RMVGGDVRQITWDQFRDRFYTKFFSANLRDVKSQKFLELKQGHMTVEEYDQEFDMLSRFALEL------------------------ALKPTTQAEALCL

Query:  VVDISIGKDEVHAKNFDKGKSYGQKRKVEQWFVEVPKRNLRLSGPLCSFQQSPGGAGDTTREKPLCNTCGKLHLGRCLGGTRVCFKYRQEGHMADRCPLR
         VD+S+ +    +K   +G + GQKRK EQ  V VP+RN R  G   SFQQ P  AG+  R KPLC TCGK HLGRCL GTR CFK RQEGH ADRCPLR
Subjt:  VVDISIGKDEVHAKNFDKGKSYGQKRKVEQWFVEVPKRNLRLSGPLCSFQQSPGGAGDTTREKPLCNTCGKLHLGRCLGGTRVCFKYRQEGHMADRCPLR

Query:  FTETGQS--------SQRFAISRLEAESADTVVT---------------------------------EVEPLDYVLSVSTPFGKIMLSKEKIKACKIEIT
         T   Q+         + FA +R EAE A TVVT                                 EVEPL +VLSVSTP G+ MLSKEK+KAC+IEI 
Subjt:  FTETGQS--------SQRFAISRLEAESADTVVT---------------------------------EVEPLDYVLSVSTPFGKIMLSKEKIKACKIEIT

Query:  GRVLD--------------------------------KVVFSPPTKPSFKFKAVGTVVLPKVISAMKASKLLNQGTWSILASMVDTREGEISLTSKPVVK
        G V++                                +V F+PP+  SFKFK  G+  LP+VISA++ASKLL+QGTW ILAS+VDTRE ++SL+S+PVV+
Subjt:  GRVLD--------------------------------KVVFSPPTKPSFKFKAVGTVVLPKVISAMKASKLLNQGTWSILASMVDTREGEISLTSKPVVK

Query:  EYPDVFPEELLGLPPHREIDFAIVLEPGTTPISRASYRMAPTELKELKVQLQELLDKGFIRPSVSLWDAPALFVKKKDGSMRLCIDYRELN---------
        +YPDVFPEEL GLPPHRE++FAI LEPGT PISRA YRMAP ELKELKVQLQELLDKGFIRPSVS W AP LFVKKKDGSMRLCIDYRELN         
Subjt:  EYPDVFPEELLGLPPHREIDFAIVLEPGTTPISRASYRMAPTELKELKVQLQELLDKGFIRPSVSLWDAPALFVKKKDGSMRLCIDYRELN---------

Query:  ------------------KIDLRSGYHELRIRDSYIPKT--------------------------------------VFVRVFIDDILVYSKIETEHEEH
                          KIDLRSGYH+LRI+D  +PKT                                       FV VFIDDIL+YSK E EHEEH
Subjt:  ------------------KIDLRSGYHELRIRDSYIPKT--------------------------------------VFVRVFIDDILVYSKIETEHEEH

Query:  LHTFLETLRANKLYAKLSKCEFWLKQVAFLGHVVSSEGVFVDPVKIEVVTSWSRPSTVSE--DIIG--GLWRTFL------------VTRKGTPFVWSLT
        L   L+TLR NKLYAK SKCEFWLKQV+FLGHVVS  GV VDP KIE VT W+RPSTVSE    +G  G +R F+            +TRKG PFVWS  
Subjt:  LHTFLETLRANKLYAKLSKCEFWLKQVAFLGHVVSSEGVFVDPVKIEVVTSWSRPSTVSE--DIIG--GLWRTFL------------VTRKGTPFVWSLT

Query:  CDDSFKNLKQKLVIAPVLTVPDGS----------------------------------------------------------------------------
        C+DSF+ LKQKLV APVLTVPDGS                                                                            
Subjt:  CDDSFKNLKQKLVIAPVLTVPDGS----------------------------------------------------------------------------

Query:  -----------------------------GKANVVADALSGKVSHSTALITRQAPLHRDIERAEI----------------------------SSDPCLV
                                     GKANVVADALS KVSHS ALITRQAPLHRD+ERAEI                            S+DP LV
Subjt:  -----------------------------GKANVVADALSGKVSHSTALITRQAPLHRDIERAEI----------------------------SSDPCLV

Query:  ERRHLVGSVKTDEFSISSNGGLLFERRLCVLVDIAVKTDLLNEAHNSPFSMHPGSTKMYEDLN-------------EFIG------GVKAPRQKPAGLLQ
        E+R L  + +T EFS+SS+GGLLFERRLCV  D AVKT+LL+EAH+SPFSMHPGSTKMY+DL              EF+        VKAPRQKPAGLLQ
Subjt:  ERRHLVGSVKTDEFSISSNGGLLFERRLCVLVDIAVKTDLLNEAHNSPFSMHPGSTKMYEDLN-------------EFIG------GVKAPRQKPAGLLQ

Query:  PLSVLEWKWENVSMDFITGLPRILKGYT-------RCT------------------------------------------FYFQVLEGTSYCHGHELDFS
        PLS+ EWKWENVSMDFITGLPR L+G+T       R T                                          F  +  +G     G  LDFS
Subjt:  PLSVLEWKWENVSMDFITGLPRILKGYT-------RCT------------------------------------------FYFQVLEGTSYCHGHELDFS

Query:  TTFHAQTDGQTERLNQVLEDMLRA
        T FH QTDGQTERLNQVLEDMLRA
Subjt:  TTFHAQTDGQTERLNQVLEDMLRA

TrEMBL top hitse value%identityAlignment
A0A5A7SQU8 Reverse transcriptase9.6e-24547.06Show/hide
Query:  RMVGGDVRQITWDQFRDRFYTKFFSANLRDVKSQKFLELKQGHMTVEEYDQEFDMLSRFALEL------------------------ALKPTTQAEALCL
        RM+GGDV QITW QF++ FY KFFSA+LRD K Q+FL L+QG MTVE+YD EFDMLSRFA E+                        A +P T A+AL L
Subjt:  RMVGGDVRQITWDQFRDRFYTKFFSANLRDVKSQKFLELKQGHMTVEEYDQEFDMLSRFALEL------------------------ALKPTTQAEALCL

Query:  VVDISIGKDEVHAKNFDKGKSYGQKRKVEQWFVEVPKRNLRLSGPLCSFQQSPGGAGDTTREKPLCNTCGKLHLGRCLGGTRVCFKYRQEGHMADRCPLR
         VD+S+ +    +K   +G + GQKRK EQ  V VP+RN R  G    FQQ P  AG+  R KPLC TCGK HLGRCL GTR CFK RQEGH ADRCPLR
Subjt:  VVDISIGKDEVHAKNFDKGKSYGQKRKVEQWFVEVPKRNLRLSGPLCSFQQSPGGAGDTTREKPLCNTCGKLHLGRCLGGTRVCFKYRQEGHMADRCPLR

Query:  FTETGQS--------SQRFAISRLEAESADTVVT---------------------------------EVEPLDYVLSVSTPFGKIMLSKEKIKACKIEIT
         T   Q+         + FA ++ EAE A TVVT                                 EVEPL +VLSVSTP G+ MLSKEK+KAC+IEI 
Subjt:  FTETGQS--------SQRFAISRLEAESADTVVT---------------------------------EVEPLDYVLSVSTPFGKIMLSKEKIKACKIEIT

Query:  GRVLD--------------------------------KVVFSPPTKPSFKFKAVGTVVLPKVISAMKASKLLNQGTWSILASMVDTREGEISLTSKPVVK
        G V++                                +V F+PP+  SFKFK  G+  LP+VISA++ASKLL+QGTW ILAS+VDTRE ++SL+S+PVV+
Subjt:  GRVLD--------------------------------KVVFSPPTKPSFKFKAVGTVVLPKVISAMKASKLLNQGTWSILASMVDTREGEISLTSKPVVK

Query:  EYPDVFPEELLGLPPHREIDFAIVLEPGTTPISRASYRMAPTELKELKVQLQELLDKGFIRPSVSLWDAPALFVKKKDGSMRLCIDYRELN---------
        +YPDVFPEEL GLPPHRE++FAI LEPGT PISRA YRMAP ELKELKVQLQELLDKGFIRPS+S W AP LFVKKKDGSMRLCIDYRELN         
Subjt:  EYPDVFPEELLGLPPHREIDFAIVLEPGTTPISRASYRMAPTELKELKVQLQELLDKGFIRPSVSLWDAPALFVKKKDGSMRLCIDYRELN---------

Query:  ------------------KIDLRSGYHELRIRDSYIPKT--------------------------------------VFVRVFIDDILVYSKIETEHEEH
                          KIDLRSGYH+LRI+D  +PKT                                       FV VFIDDIL+YSK E EHEEH
Subjt:  ------------------KIDLRSGYHELRIRDSYIPKT--------------------------------------VFVRVFIDDILVYSKIETEHEEH

Query:  LHTFLETLRANKLYAKLSKCEFWLKQVAFLGHVVSSEGVFVDPVKIEVVTSWSRPSTVSE--DIIG--GLWRTFL------------VTRKGTPFVWSLT
        L   L+TLR NKLYAK SKCEFWLKQV+FLGHVVS  GV VDP KIE VT W+RPSTVSE    +G  G +R F+            +TRKG PFVWS  
Subjt:  LHTFLETLRANKLYAKLSKCEFWLKQVAFLGHVVSSEGVFVDPVKIEVVTSWSRPSTVSE--DIIG--GLWRTFL------------VTRKGTPFVWSLT

Query:  CDDSFKNLKQKLVIAPVLTVPDGSG---------------------------------------------------------------------------
        C+DSF+NLKQKLV A VLTVPDGSG                                                                           
Subjt:  CDDSFKNLKQKLVIAPVLTVPDGSG---------------------------------------------------------------------------

Query:  ------------------------------KANVVADALSGKVSHSTALITRQAPLHRDIERAEIS----------------------------SDPCLV
                                      KANVVADALS KVSHS ALITRQAPLHRD+ERAEI+                            +DP LV
Subjt:  ------------------------------KANVVADALSGKVSHSTALITRQAPLHRDIERAEIS----------------------------SDPCLV

Query:  ERRHLVGSVKTDEFSISSNGGLLFERRLCVLVDIAVKTDLLNEAHNSPFSMHPGSTKMYEDLN-------------EFIG------GVKAPRQKPAGLLQ
        E+R L  + +  EFS+SS+GGLLFERRLCV  D AVKT+LL+EAH+SPFSMHPGSTKMY+DL              EF+        VKAPRQKPAGLLQ
Subjt:  ERRHLVGSVKTDEFSISSNGGLLFERRLCVLVDIAVKTDLLNEAHNSPFSMHPGSTKMYEDLN-------------EFIG------GVKAPRQKPAGLLQ

Query:  PLSVLEWKWENVSMDFITGLPRILKGY-------TRCT------------------------------------------FYFQVLEGTSYCHGHELDFS
        PLS+ EWKWEN+SMDFITGLPR L+G+        R T                                          F  +  +G     G  LDFS
Subjt:  PLSVLEWKWENVSMDFITGLPRILKGY-------TRCT------------------------------------------FYFQVLEGTSYCHGHELDFS

Query:  TTFHAQTDGQTERLNQVLEDMLRA
        T FH QTDGQTERLNQVLEDMLRA
Subjt:  TTFHAQTDGQTERLNQVLEDMLRA

A0A5A7TP96 Reverse transcriptase2.9e-24948.14Show/hide
Query:  RMVGGDVRQITWDQFRDRFYTKFFSANLRDVKSQKFLELKQGHMTVEEYDQEFDMLSRFALEL------------------------ALKPTTQAEALCL
        RM+GGDV QITW QF++ FY KFFSA+LRD K Q+FL ++QG MTVE+YD EFDMLSRFA E+                        A +P T A+AL L
Subjt:  RMVGGDVRQITWDQFRDRFYTKFFSANLRDVKSQKFLELKQGHMTVEEYDQEFDMLSRFALEL------------------------ALKPTTQAEALCL

Query:  VVDISIGKDEVHAKNFDKGKSYGQKRKVEQWFVEVPKRNLRLSGPLCSFQQSPGGAGDTTREKPLCNTCGKLHLGRCLGGTRVCFKYRQEGHMADRCPLR
         VD+S+ +    +K   +G + GQKRK EQ  V VP+RN R  G    FQQ P  AG+  R KPLC TCGK HLGRCL GTR CFK RQEGH ADRCPLR
Subjt:  VVDISIGKDEVHAKNFDKGKSYGQKRKVEQWFVEVPKRNLRLSGPLCSFQQSPGGAGDTTREKPLCNTCGKLHLGRCLGGTRVCFKYRQEGHMADRCPLR

Query:  FTETGQS--------SQRFAISRLEAESADTVVT---------------------------------EVEPLDYVLSVSTPFGKIMLSKEKIKACKIEIT
         T   Q+         + FA ++ EAE A TVVT                                 EVEPL +VLSVSTP G+ MLSKEK+KAC+IEI 
Subjt:  FTETGQS--------SQRFAISRLEAESADTVVT---------------------------------EVEPLDYVLSVSTPFGKIMLSKEKIKACKIEIT

Query:  GRVLD--------------------------------KVVFSPPTKPSFKFKAVGTVVLPKVISAMKASKLLNQGTWSILASMVDTREGEISLTSKPVVK
        G V++                                +V F+PP+  SFKFK  G+  LP+VISA++ASKLL+QGTW ILAS+VDTRE ++SL+S+PVV+
Subjt:  GRVLD--------------------------------KVVFSPPTKPSFKFKAVGTVVLPKVISAMKASKLLNQGTWSILASMVDTREGEISLTSKPVVK

Query:  EYPDVFPEELLGLPPHREIDFAIVLEPGTTPISRASYRMAPTELKELKVQLQELLDKGFIRPSVSLWDAPALFVKKKDGSMRLCIDYRELN---------
        +YPDVFPEEL GLPPHRE++FAI LEPGT PISRA YRMAP ELKELKVQLQELLDKGFIRPSVS W AP LFVKKKDGSMRLCIDYRELN         
Subjt:  EYPDVFPEELLGLPPHREIDFAIVLEPGTTPISRASYRMAPTELKELKVQLQELLDKGFIRPSVSLWDAPALFVKKKDGSMRLCIDYRELN---------

Query:  ------------------KIDLRSGYHELRIRDSYIPKT--------------------------------------VFVRVFIDDILVYSKIETEHEEH
                          KIDLRSGYH+LRI+D  +PKT                                       FV VFIDDIL+YSK E EHEEH
Subjt:  ------------------KIDLRSGYHELRIRDSYIPKT--------------------------------------VFVRVFIDDILVYSKIETEHEEH

Query:  LHTFLETLRANKLYAKLSKCEFWLKQVAFLGHVVSSEGVFVDPVKIEVVTSWSRPSTVSE--DIIG--GLWRTFL------------VTRKGTPFVWSLT
        L   L+TLR NKLYAK SKCEFWLKQV+FLGHVVS  GV VDP KIE VT W+RPSTVSE    +G  G +  F+            +TRKG PFVWS  
Subjt:  LHTFLETLRANKLYAKLSKCEFWLKQVAFLGHVVSSEGVFVDPVKIEVVTSWSRPSTVSE--DIIG--GLWRTFL------------VTRKGTPFVWSLT

Query:  CDDSFKNLKQKLVIAPVLTVPDGS----------------------------------------------------------------------------
        C+DSF+NLKQKLV APVLTVPDGS                                                                            
Subjt:  CDDSFKNLKQKLVIAPVLTVPDGS----------------------------------------------------------------------------

Query:  -----------------------------GKANVVADALSGKVSHSTALITRQAPLHRDIERAEI----------------------------SSDPCLV
                                     GKANVVADALS KVSHS ALITRQA LHRD+ERA+I                            S+DP LV
Subjt:  -----------------------------GKANVVADALSGKVSHSTALITRQAPLHRDIERAEI----------------------------SSDPCLV

Query:  ERRHLVGSVKTDEFSISSNGGLLFERRLCVLVDIAVKTDLLNEAHNSPFSMHPGSTKMYEDLN-------------EFIG------GVKAPRQKPAGLLQ
        E+R L  + +  EFS+SS+GGLLFERRLCV  D AVK +LL+EAH+SPFSMHPGSTKMY+DL              EF+        VKAPRQKPAGLLQ
Subjt:  ERRHLVGSVKTDEFSISSNGGLLFERRLCVLVDIAVKTDLLNEAHNSPFSMHPGSTKMYEDLN-------------EFIG------GVKAPRQKPAGLLQ

Query:  PLSVLEWKWENVSMDFITGLPRILKGYT-------RCT-------------------FYFQVLEGTSYCHGHELDFSTTFHAQTDGQTERLNQVLEDMLR
        PLS+ EWKWENVSMDFITGLPR L+G+T       R T                    Y   ++G     G  LDFSTTFH QTDGQTERLNQVLEDMLR
Subjt:  PLSVLEWKWENVSMDFITGLPRILKGYT-------RCT-------------------FYFQVLEGTSYCHGHELDFSTTFHAQTDGQTERLNQVLEDMLR

Query:  A
        A
Subjt:  A

A0A5A7UE75 Reverse transcriptase1.6e-24446.8Show/hide
Query:  RMVGGDVRQITWDQFRDRFYTKFFSANLRDVKSQKFLELKQGHMTVEEYDQEFDMLSRFALEL------------------------ALKPTTQAEALCL
        RM+G DV QITW QF++ FY KFFSA+LRD K Q+FL L+QG MTVE+YD EFDMLSRFA E+                        A +P T A+AL L
Subjt:  RMVGGDVRQITWDQFRDRFYTKFFSANLRDVKSQKFLELKQGHMTVEEYDQEFDMLSRFALEL------------------------ALKPTTQAEALCL

Query:  VVDISIGKDEVHAKNFDKGKSYGQKRKVEQWFVEVPKRNLRLSGPLCSFQQSPGGAGDTTREKPLCNTCGKLHLGRCLGGTRVCFKYRQEGHMADRCPLR
         VD+S+ +    +K   +G + GQKRK EQ  V VP+RN R  G    FQQ P   G+  R KPLC TCGK HLGRCL GTR CFK RQEGH+ADRCPLR
Subjt:  VVDISIGKDEVHAKNFDKGKSYGQKRKVEQWFVEVPKRNLRLSGPLCSFQQSPGGAGDTTREKPLCNTCGKLHLGRCLGGTRVCFKYRQEGHMADRCPLR

Query:  FTETGQS--------SQRFAISRLEAESADTVVT---------------------------------EVEPLDYVLSVSTPFGKIMLSKEKIKACKIEIT
         T   Q+         + FA ++ EAE A TVVT                                 EVEPL +VLSVSTP G+ MLSKE++KAC+IEI 
Subjt:  FTETGQS--------SQRFAISRLEAESADTVVT---------------------------------EVEPLDYVLSVSTPFGKIMLSKEKIKACKIEIT

Query:  GRVLD--------------------------------KVVFSPPTKPSFKFKAVGTVVLPKVISAMKASKLLNQGTWSILASMVDTREGEISLTSKPVVK
        G V++                                 V F+PP+  SFKFK  G+  LP+VISA++ASKL +QGTW ILAS+VDTRE ++SL+S+PVV+
Subjt:  GRVLD--------------------------------KVVFSPPTKPSFKFKAVGTVVLPKVISAMKASKLLNQGTWSILASMVDTREGEISLTSKPVVK

Query:  EYPDVFPEELLGLPPHREIDFAIVLEPGTTPISRASYRMAPTELKELKVQLQELLDKGFIRPSVSLWDAPALFVKKKDGSMRLCIDYRELN---------
        +YPDVFPEEL GLPPHRE++FAI LEPGT PISRA YRMAP ELKELKVQLQELLDKGFIRPSVS W AP LFVKKKDGSMRLCIDYRELN         
Subjt:  EYPDVFPEELLGLPPHREIDFAIVLEPGTTPISRASYRMAPTELKELKVQLQELLDKGFIRPSVSLWDAPALFVKKKDGSMRLCIDYRELN---------

Query:  ------------------KIDLRSGYHELRIRDSYIPKT--------------------------------------VFVRVFIDDILVYSKIETEHEEH
                          KIDLRSGYH+LRI+D  +PKT                                       FV VFIDDIL+YSK E EHEEH
Subjt:  ------------------KIDLRSGYHELRIRDSYIPKT--------------------------------------VFVRVFIDDILVYSKIETEHEEH

Query:  LHTFLETLRANKLYAKLSKCEFWLKQVAFLGHVVSSEGVFVDPVKIEVVTSWSRPSTVSE--DIIG--GLWRTFL------------VTRKGTPFVWSLT
        L   L+TLR NKLYAK SKCEFWLKQV+FLGHVVS  GV VDP KIE VT W+RPSTVSE    +G  G +R F+            +TRKG PFVWS  
Subjt:  LHTFLETLRANKLYAKLSKCEFWLKQVAFLGHVVSSEGVFVDPVKIEVVTSWSRPSTVSE--DIIG--GLWRTFL------------VTRKGTPFVWSLT

Query:  CDDSFKNLKQKLVIAPVLTVPDGS----------------------------------------------------------------------------
        C+DSF+NLKQKLV AP+LTVPDGS                                                                            
Subjt:  CDDSFKNLKQKLVIAPVLTVPDGS----------------------------------------------------------------------------

Query:  -----------------------------GKANVVADALSGKVSHSTALITRQAPLHRDIERAEI----------------------------SSDPCLV
                                     GKANVVADALS KVSHS ALITRQAPLHRD+ERAEI                            S+DP LV
Subjt:  -----------------------------GKANVVADALSGKVSHSTALITRQAPLHRDIERAEI----------------------------SSDPCLV

Query:  ERRHLVGSVKTDEFSISSNGGLLFERRLCVLVDIAVKTDLLNEAHNSPFSMHPGSTKMYEDLN-------------EFIG------GVKAPRQKPAGLLQ
        E+R L  + + + FSISS+GGL FERRLCV  D  +KT+LL+EAH+SPFSMHPGSTKMY+DL              EF+        VKAPRQKPAGLLQ
Subjt:  ERRHLVGSVKTDEFSISSNGGLLFERRLCVLVDIAVKTDLLNEAHNSPFSMHPGSTKMYEDLN-------------EFIG------GVKAPRQKPAGLLQ

Query:  PLSVLEWKWENVSMDFITGLPRILKGYT--------------------RCT-----------------------------FYFQVLEGTSYCHGHELDFS
        PLS+ EWKWENVSMDFITGLPR L+G+T                     CT                             F  +  +G     G  LDFS
Subjt:  PLSVLEWKWENVSMDFITGLPRILKGYT--------------------RCT-----------------------------FYFQVLEGTSYCHGHELDFS

Query:  TTFHAQTDGQTERLNQVLEDMLRA
        T FH QTDGQTERLNQVLEDMLRA
Subjt:  TTFHAQTDGQTERLNQVLEDMLRA

A0A5A7UR62 Reverse transcriptase1.0e-24649.24Show/hide
Query:  RMVGGDVRQITWDQFRDRFYTKFFSANLRDVKSQKFLELKQGHMTVEEYDQEFDMLSRFALEL------------------------ALKPTTQAEALCL
        RM+GGDV QITW QF++ FY KFF A+LRD K Q+FL L+QG MTVE+YD EFDMLSRFA E+                        A +P T  +AL L
Subjt:  RMVGGDVRQITWDQFRDRFYTKFFSANLRDVKSQKFLELKQGHMTVEEYDQEFDMLSRFALEL------------------------ALKPTTQAEALCL

Query:  VVDISIGKDEVHAKNFDKGKSYGQKRKVEQWFVEVPKRNLRLSGPLCSFQQSPGGAGDTTREKPLCNTCGKLHLGRCLGGTRVCFKYRQEGHMADRCPLR
         VD+S+ +    +K   KG +  QKRK EQ  V VP+RN R  G    FQQ P   G+  R KPLC TCGK HLGRCL GTR CFK RQEGH ADR PLR
Subjt:  VVDISIGKDEVHAKNFDKGKSYGQKRKVEQWFVEVPKRNLRLSGPLCSFQQSPGGAGDTTREKPLCNTCGKLHLGRCLGGTRVCFKYRQEGHMADRCPLR

Query:  FTETGQS--------SQRFAISRLEAESADTVVT---------------------------------EVEPLDYVLSVSTPFGKIMLSKEKIKACKIEIT
         T   Q+         + FA ++ EAE A TVVT                                 EVEPL +VLSVSTP G+ MLSKEK+KAC+IEI 
Subjt:  FTETGQS--------SQRFAISRLEAESADTVVT---------------------------------EVEPLDYVLSVSTPFGKIMLSKEKIKACKIEIT

Query:  GRVLD--------------------------------KVVFSPPTKPSFKFKAVGTVVLPKVISAMKASKLLNQGTWSILASMVDTREGEISLTSKPVVK
        G V++                                +V F+PP+  SFKFK  G+  LP+VISA++ASKLL+QGTW ILAS+VDTRE ++SL+S+ VV+
Subjt:  GRVLD--------------------------------KVVFSPPTKPSFKFKAVGTVVLPKVISAMKASKLLNQGTWSILASMVDTREGEISLTSKPVVK

Query:  EYPDVFPEELLGLPPHREIDFAIVLEPGTTPISRASYRMAPTELKELKVQLQELLDKGFIRPSVSLWDAPALFVKKKDGSMRLCIDYRELN---------
        +YPDVFPEEL GLPPHRE++FAI LEPGT PISRA YRMAP ELKELKVQLQELLDKGFIRPSVS W AP LFVKKKDGSMRLCIDYRELN         
Subjt:  EYPDVFPEELLGLPPHREIDFAIVLEPGTTPISRASYRMAPTELKELKVQLQELLDKGFIRPSVSLWDAPALFVKKKDGSMRLCIDYRELN---------

Query:  ------------------KIDLRSGYHELRIRDSYIPKT--------------------------------------VFVRVFIDDILVYSKIETEHEEH
                          KIDLRSGYH+LRI+D  +PKT                                       FV VFIDDIL+YSK E EHEEH
Subjt:  ------------------KIDLRSGYHELRIRDSYIPKT--------------------------------------VFVRVFIDDILVYSKIETEHEEH

Query:  LHTFLETLRANKLYAKLSKCEFWLKQVAFLGHVVSSEGVFVDPVKIEVVTSWSRPSTVSE--DIIG--GLWRTFL------------VTRKGTPFVWSLT
        L   L+TLR NKLYAK SKCEFWLKQV+FLGHVVS  GV VDP KIE VT W+RPSTVSE    +G  G +R F+            +TRKG  FVWS  
Subjt:  LHTFLETLRANKLYAKLSKCEFWLKQVAFLGHVVSSEGVFVDPVKIEVVTSWSRPSTVSE--DIIG--GLWRTFL------------VTRKGTPFVWSLT

Query:  CDDSFKNLKQKLVIAPVLTVPDGS----------------------------------------------------------------------------
        C+DSF+NLKQKLV APVLT+PDGS                                                                            
Subjt:  CDDSFKNLKQKLVIAPVLTVPDGS----------------------------------------------------------------------------

Query:  -----------------------------GKANVVADALSGKVSHSTALITRQAPLHRDIERAEIS---SDPCLVERRHLVGSVKTDEFSISSNGGLLFE
                                     GKANVVADALS KVSHS ALITRQ PLHRD+ERAEI+   +DP LVE+R L  + +  EFSISS+GGLLFE
Subjt:  -----------------------------GKANVVADALSGKVSHSTALITRQAPLHRDIERAEIS---SDPCLVERRHLVGSVKTDEFSISSNGGLLFE

Query:  RRLCVLVDIAVKTDLLNEAHNSPFSMHPGSTKMYEDLNEFIGGVKAPRQKPAGLLQPLSVLEWKWENVSMDFITGLPRILKGYTRCTFYFQVLE------
        RRLCV  D  VKT+LL+EAH+SPFSMHPGSTKMY+DL      VKAPRQKPAGLLQPLS+ EWKWENVSMDFITGLPR L+G+T       +L       
Subjt:  RRLCVLVDIAVKTDLLNEAHNSPFSMHPGSTKMYEDLNEFIGGVKAPRQKPAGLLQPLSVLEWKWENVSMDFITGLPRILKGYTRCTFYFQVLE------

Query:  -------------------GTSYCHGHELDFSTTFHAQTDGQTERLNQVLEDMLRA
                           G     G  LDFST FH QTD QTERLNQVLEDMLRA
Subjt:  -------------------GTSYCHGHELDFSTTFHAQTDGQTERLNQVLEDMLRA

A0A5D3BPI1 Reverse transcriptase6.4e-24947.69Show/hide
Query:  RMVGGDVRQITWDQFRDRFYTKFFSANLRDVKSQKFLELKQGHMTVEEYDQEFDMLSRFALEL------------------------ALKPTTQAEALCL
        RM+GGDV QITW QF++ FY KFFSA+LRD K Q+FL L+QG MTVE+YD EFDMLSRFA E+                        A +P T A+AL L
Subjt:  RMVGGDVRQITWDQFRDRFYTKFFSANLRDVKSQKFLELKQGHMTVEEYDQEFDMLSRFALEL------------------------ALKPTTQAEALCL

Query:  VVDISIGKDEVHAKNFDKGKSYGQKRKVEQWFVEVPKRNLRLSGPLCSFQQSPGGAGDTTREKPLCNTCGKLHLGRCLGGTRVCFKYRQEGHMADRCPLR
         VD+S+ +    +K   +G + GQKRK EQ  V VP+RN R  G   SFQQ P  AG+  R KPLC TCGK HLGRCL GTR CFK RQEGH ADRCPLR
Subjt:  VVDISIGKDEVHAKNFDKGKSYGQKRKVEQWFVEVPKRNLRLSGPLCSFQQSPGGAGDTTREKPLCNTCGKLHLGRCLGGTRVCFKYRQEGHMADRCPLR

Query:  FTETGQS--------SQRFAISRLEAESADTVVT---------------------------------EVEPLDYVLSVSTPFGKIMLSKEKIKACKIEIT
         T   Q+         + FA +R EAE A TVVT                                 EVEPL +VLSVSTP G+ MLSKEK+KAC+IEI 
Subjt:  FTETGQS--------SQRFAISRLEAESADTVVT---------------------------------EVEPLDYVLSVSTPFGKIMLSKEKIKACKIEIT

Query:  GRVLD--------------------------------KVVFSPPTKPSFKFKAVGTVVLPKVISAMKASKLLNQGTWSILASMVDTREGEISLTSKPVVK
        G V++                                +V F+PP+  SFKFK  G+  LP+VISA++ASKLL+QGTW ILAS+VDTRE ++SL+S+PVV+
Subjt:  GRVLD--------------------------------KVVFSPPTKPSFKFKAVGTVVLPKVISAMKASKLLNQGTWSILASMVDTREGEISLTSKPVVK

Query:  EYPDVFPEELLGLPPHREIDFAIVLEPGTTPISRASYRMAPTELKELKVQLQELLDKGFIRPSVSLWDAPALFVKKKDGSMRLCIDYRELN---------
        +YPDVFPEEL GLPPHRE++FAI LEPGT PISRA YRMAP ELKELKVQLQELLDKGFIRPSVS W AP LFVKKKDGSMRLCIDYRELN         
Subjt:  EYPDVFPEELLGLPPHREIDFAIVLEPGTTPISRASYRMAPTELKELKVQLQELLDKGFIRPSVSLWDAPALFVKKKDGSMRLCIDYRELN---------

Query:  ------------------KIDLRSGYHELRIRDSYIPKT--------------------------------------VFVRVFIDDILVYSKIETEHEEH
                          KIDLRSGYH+LRI+D  +PKT                                       FV VFIDDIL+YSK E EHEEH
Subjt:  ------------------KIDLRSGYHELRIRDSYIPKT--------------------------------------VFVRVFIDDILVYSKIETEHEEH

Query:  LHTFLETLRANKLYAKLSKCEFWLKQVAFLGHVVSSEGVFVDPVKIEVVTSWSRPSTVSE--DIIG--GLWRTFL------------VTRKGTPFVWSLT
        L   L+TLR NKLYAK SKCEFWLKQV+FLGHVVS  GV VDP KIE VT W+RPSTVSE    +G  G +R F+            +TRKG PFVWS  
Subjt:  LHTFLETLRANKLYAKLSKCEFWLKQVAFLGHVVSSEGVFVDPVKIEVVTSWSRPSTVSE--DIIG--GLWRTFL------------VTRKGTPFVWSLT

Query:  CDDSFKNLKQKLVIAPVLTVPDGS----------------------------------------------------------------------------
        C+DSF+ LKQKLV APVLTVPDGS                                                                            
Subjt:  CDDSFKNLKQKLVIAPVLTVPDGS----------------------------------------------------------------------------

Query:  -----------------------------GKANVVADALSGKVSHSTALITRQAPLHRDIERAEI----------------------------SSDPCLV
                                     GKANVVADALS KVSHS ALITRQAPLHRD+ERAEI                            S+DP LV
Subjt:  -----------------------------GKANVVADALSGKVSHSTALITRQAPLHRDIERAEI----------------------------SSDPCLV

Query:  ERRHLVGSVKTDEFSISSNGGLLFERRLCVLVDIAVKTDLLNEAHNSPFSMHPGSTKMYEDLN-------------EFIG------GVKAPRQKPAGLLQ
        E+R L  + +T EFS+SS+GGLLFERRLCV  D AVKT+LL+EAH+SPFSMHPGSTKMY+DL              EF+        VKAPRQKPAGLLQ
Subjt:  ERRHLVGSVKTDEFSISSNGGLLFERRLCVLVDIAVKTDLLNEAHNSPFSMHPGSTKMYEDLN-------------EFIG------GVKAPRQKPAGLLQ

Query:  PLSVLEWKWENVSMDFITGLPRILKGYT-------RCT------------------------------------------FYFQVLEGTSYCHGHELDFS
        PLS+ EWKWENVSMDFITGLPR L+G+T       R T                                          F  +  +G     G  LDFS
Subjt:  PLSVLEWKWENVSMDFITGLPRILKGYT-------RCT------------------------------------------FYFQVLEGTSYCHGHELDFS

Query:  TTFHAQTDGQTERLNQVLEDMLRA
        T FH QTDGQTERLNQVLEDMLRA
Subjt:  TTFHAQTDGQTERLNQVLEDMLRA

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein2.2e-2025.44Show/hide
Query:  VVKEYPDVFPE-ELLGLP-PHREIDFAIVLEPGTTPISRASYRMAPTELKELKVQLQELLDKGFIRPSVSLWDAPALFVKKKDGSMRLCIDYRELN----
        + KE+ D+  E     LP P + ++F + L      +   +Y + P +++ +  ++ + L  G IR S ++   P +FV KK+G++R+ +DY+ LN    
Subjt:  VVKEYPDVFPE-ELLGLP-PHREIDFAIVLEPGTTPISRASYRMAPTELKELKVQLQELLDKGFIRPSVSLWDAPALFVKKKDGSMRLCIDYRELN----

Query:  -----------------------KIDLRSGYHELRIRDS-------YIPKTVF-------------------------------VRVFIDDILVYSKIET
                               K+DL+S YH +R+R           P+ VF                               V  ++DDIL++SK E+
Subjt:  -----------------------KIDLRSGYHELRIRDS-------YIPKTVF-------------------------------VRVFIDDILVYSKIET

Query:  EHEEHLHTFLETLRANKLYAKLSKCEFWLKQVAFLGHVVSSEGVFVDPVKIEVVTSWSRPSTVSE--DIIGGL--WRTFL------------VTRKGTPF
        EH +H+   L+ L+   L    +KCEF   QV F+G+ +S +G       I+ V  W +P    E    +G +   R F+            + +K   +
Subjt:  EHEEHLHTFLETLRANKLYAKLSKCEFWLKQVAFLGHVVSSEGVFVDPVKIEVVTSWSRPSTVSE--DIIGGL--WRTFL------------VTRKGTPF

Query:  VWSLTCDDSFKNLKQKLVIAPVLTVPDGSGKANVVADA
         W+ T   + +N+KQ LV  PVL   D S K  +  DA
Subjt:  VWSLTCDDSFKNLKQKLVIAPVLTVPDGSGKANVVADA

P0CT35 Transposon Tf2-2 polyprotein2.2e-2025.44Show/hide
Query:  VVKEYPDVFPE-ELLGLP-PHREIDFAIVLEPGTTPISRASYRMAPTELKELKVQLQELLDKGFIRPSVSLWDAPALFVKKKDGSMRLCIDYRELN----
        + KE+ D+  E     LP P + ++F + L      +   +Y + P +++ +  ++ + L  G IR S ++   P +FV KK+G++R+ +DY+ LN    
Subjt:  VVKEYPDVFPE-ELLGLP-PHREIDFAIVLEPGTTPISRASYRMAPTELKELKVQLQELLDKGFIRPSVSLWDAPALFVKKKDGSMRLCIDYRELN----

Query:  -----------------------KIDLRSGYHELRIRDS-------YIPKTVF-------------------------------VRVFIDDILVYSKIET
                               K+DL+S YH +R+R           P+ VF                               V  ++DDIL++SK E+
Subjt:  -----------------------KIDLRSGYHELRIRDS-------YIPKTVF-------------------------------VRVFIDDILVYSKIET

Query:  EHEEHLHTFLETLRANKLYAKLSKCEFWLKQVAFLGHVVSSEGVFVDPVKIEVVTSWSRPSTVSE--DIIGGL--WRTFL------------VTRKGTPF
        EH +H+   L+ L+   L    +KCEF   QV F+G+ +S +G       I+ V  W +P    E    +G +   R F+            + +K   +
Subjt:  EHEEHLHTFLETLRANKLYAKLSKCEFWLKQVAFLGHVVSSEGVFVDPVKIEVVTSWSRPSTVSE--DIIGGL--WRTFL------------VTRKGTPF

Query:  VWSLTCDDSFKNLKQKLVIAPVLTVPDGSGKANVVADA
         W+ T   + +N+KQ LV  PVL   D S K  +  DA
Subjt:  VWSLTCDDSFKNLKQKLVIAPVLTVPDGSGKANVVADA

P0CT36 Transposon Tf2-3 polyprotein2.2e-2025.44Show/hide
Query:  VVKEYPDVFPE-ELLGLP-PHREIDFAIVLEPGTTPISRASYRMAPTELKELKVQLQELLDKGFIRPSVSLWDAPALFVKKKDGSMRLCIDYRELN----
        + KE+ D+  E     LP P + ++F + L      +   +Y + P +++ +  ++ + L  G IR S ++   P +FV KK+G++R+ +DY+ LN    
Subjt:  VVKEYPDVFPE-ELLGLP-PHREIDFAIVLEPGTTPISRASYRMAPTELKELKVQLQELLDKGFIRPSVSLWDAPALFVKKKDGSMRLCIDYRELN----

Query:  -----------------------KIDLRSGYHELRIRDS-------YIPKTVF-------------------------------VRVFIDDILVYSKIET
                               K+DL+S YH +R+R           P+ VF                               V  ++DDIL++SK E+
Subjt:  -----------------------KIDLRSGYHELRIRDS-------YIPKTVF-------------------------------VRVFIDDILVYSKIET

Query:  EHEEHLHTFLETLRANKLYAKLSKCEFWLKQVAFLGHVVSSEGVFVDPVKIEVVTSWSRPSTVSE--DIIGGL--WRTFL------------VTRKGTPF
        EH +H+   L+ L+   L    +KCEF   QV F+G+ +S +G       I+ V  W +P    E    +G +   R F+            + +K   +
Subjt:  EHEEHLHTFLETLRANKLYAKLSKCEFWLKQVAFLGHVVSSEGVFVDPVKIEVVTSWSRPSTVSE--DIIGGL--WRTFL------------VTRKGTPF

Query:  VWSLTCDDSFKNLKQKLVIAPVLTVPDGSGKANVVADA
         W+ T   + +N+KQ LV  PVL   D S K  +  DA
Subjt:  VWSLTCDDSFKNLKQKLVIAPVLTVPDGSGKANVVADA

P0CT37 Transposon Tf2-4 polyprotein2.2e-2025.44Show/hide
Query:  VVKEYPDVFPE-ELLGLP-PHREIDFAIVLEPGTTPISRASYRMAPTELKELKVQLQELLDKGFIRPSVSLWDAPALFVKKKDGSMRLCIDYRELN----
        + KE+ D+  E     LP P + ++F + L      +   +Y + P +++ +  ++ + L  G IR S ++   P +FV KK+G++R+ +DY+ LN    
Subjt:  VVKEYPDVFPE-ELLGLP-PHREIDFAIVLEPGTTPISRASYRMAPTELKELKVQLQELLDKGFIRPSVSLWDAPALFVKKKDGSMRLCIDYRELN----

Query:  -----------------------KIDLRSGYHELRIRDS-------YIPKTVF-------------------------------VRVFIDDILVYSKIET
                               K+DL+S YH +R+R           P+ VF                               V  ++DDIL++SK E+
Subjt:  -----------------------KIDLRSGYHELRIRDS-------YIPKTVF-------------------------------VRVFIDDILVYSKIET

Query:  EHEEHLHTFLETLRANKLYAKLSKCEFWLKQVAFLGHVVSSEGVFVDPVKIEVVTSWSRPSTVSE--DIIGGL--WRTFL------------VTRKGTPF
        EH +H+   L+ L+   L    +KCEF   QV F+G+ +S +G       I+ V  W +P    E    +G +   R F+            + +K   +
Subjt:  EHEEHLHTFLETLRANKLYAKLSKCEFWLKQVAFLGHVVSSEGVFVDPVKIEVVTSWSRPSTVSE--DIIGGL--WRTFL------------VTRKGTPF

Query:  VWSLTCDDSFKNLKQKLVIAPVLTVPDGSGKANVVADA
         W+ T   + +N+KQ LV  PVL   D S K  +  DA
Subjt:  VWSLTCDDSFKNLKQKLVIAPVLTVPDGSGKANVVADA

P0CT41 Transposon Tf2-12 polyprotein2.2e-2025.44Show/hide
Query:  VVKEYPDVFPE-ELLGLP-PHREIDFAIVLEPGTTPISRASYRMAPTELKELKVQLQELLDKGFIRPSVSLWDAPALFVKKKDGSMRLCIDYRELN----
        + KE+ D+  E     LP P + ++F + L      +   +Y + P +++ +  ++ + L  G IR S ++   P +FV KK+G++R+ +DY+ LN    
Subjt:  VVKEYPDVFPE-ELLGLP-PHREIDFAIVLEPGTTPISRASYRMAPTELKELKVQLQELLDKGFIRPSVSLWDAPALFVKKKDGSMRLCIDYRELN----

Query:  -----------------------KIDLRSGYHELRIRDS-------YIPKTVF-------------------------------VRVFIDDILVYSKIET
                               K+DL+S YH +R+R           P+ VF                               V  ++DDIL++SK E+
Subjt:  -----------------------KIDLRSGYHELRIRDS-------YIPKTVF-------------------------------VRVFIDDILVYSKIET

Query:  EHEEHLHTFLETLRANKLYAKLSKCEFWLKQVAFLGHVVSSEGVFVDPVKIEVVTSWSRPSTVSE--DIIGGL--WRTFL------------VTRKGTPF
        EH +H+   L+ L+   L    +KCEF   QV F+G+ +S +G       I+ V  W +P    E    +G +   R F+            + +K   +
Subjt:  EHEEHLHTFLETLRANKLYAKLSKCEFWLKQVAFLGHVVSSEGVFVDPVKIEVVTSWSRPSTVSE--DIIGGL--WRTFL------------VTRKGTPF

Query:  VWSLTCDDSFKNLKQKLVIAPVLTVPDGSGKANVVADA
         W+ T   + +N+KQ LV  PVL   D S K  +  DA
Subjt:  VWSLTCDDSFKNLKQKLVIAPVLTVPDGSGKANVVADA

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein1.8e-0932.26Show/hide
Query:  HLHTFLETLRANKLYAKLSKCEFWLKQVAFLG--HVVSSEGVFVDPVKIEVVTSWSRPSTVSE--DIIG--GLWRTFL-----VTR------KGTPFVWS
        HL   L+    ++ YA   KC F   Q+A+LG  H++S EGV  DP K+E +  W  P   +E    +G  G +R F+     + R      K     W+
Subjt:  HLHTFLETLRANKLYAKLSKCEFWLKQVAFLG--HVVSSEGVFVDPVKIEVVTSWSRPSTVSE--DIIG--GLWRTFL-----VTR------KGTPFVWS

Query:  LTCDDSFKNLKQKLVIAPVLTVPD
             +FK LK  +   PVL +PD
Subjt:  LTCDDSFKNLKQKLVIAPVLTVPD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGTATGGTAGGTGGGGATGTGAGACAGATCACTTGGGACCAGTTTAGAGACCGCTTCTATACCAAGTTTTTCTCAGCTAACCTTAGAGATGTTAAAAGCCAGAAATT
CTTGGAATTGAAGCAAGGACACATGACAGTTGAGGAGTATGACCAGGAGTTCGACATGCTGTCGCGCTTTGCCCTTGAACTTGCACTAAAGCCCACCACTCAGGCTGAAG
CACTGTGTCTAGTAGTGGATATAAGTATTGGGAAGGACGAGGTTCATGCAAAGAATTTTGATAAGGGAAAATCGTATGGTCAGAAAAGGAAAGTAGAACAGTGGTTTGTG
GAAGTTCCTAAGAGGAACTTGAGATTAAGTGGTCCTCTTTGCAGTTTCCAGCAGAGTCCTGGTGGGGCAGGGGACACTACTAGAGAGAAGCCATTATGCAACACGTGTGG
AAAGCTCCATCTAGGTCGTTGTTTAGGTGGGACCAGAGTTTGCTTCAAGTACAGGCAGGAGGGACACATGGCTGATCGTTGTCCCTTGAGATTTACTGAGACTGGACAGA
GTAGTCAGAGATTTGCCATTAGTAGACTAGAGGCAGAGAGTGCTGACACAGTTGTGACAGAGGTGGAGCCCTTAGATTATGTTTTATCAGTGTCTACACCGTTTGGGAAA
ATTATGTTGTCTAAGGAAAAGATCAAAGCATGTAAAATTGAGATAACTGGTCGTGTGCTGGATAAGGTAGTGTTCAGTCCTCCTACCAAGCCTAGTTTTAAATTCAAAGC
GGTAGGAACAGTGGTACTACCTAAAGTGATTTCAGCTATGAAAGCTAGTAAACTGCTCAACCAGGGTACCTGGAGTATTTTGGCCAGTATGGTGGACACTAGGGAAGGTG
AGATTTCTTTAACTTCAAAACCTGTGGTAAAAGAATACCCAGATGTGTTTCCAGAAGAGCTTCTAGGACTTCCGCCTCATAGGGAGATTGATTTTGCTATTGTGTTGGAG
CCAGGTACTACTCCCATTTCTAGAGCCTCGTATAGAATGGCTCCGACTGAATTGAAAGAACTGAAGGTACAGTTACAAGAGTTGCTAGACAAGGGTTTTATTCGACCTAG
TGTGTCCCTTTGGGATGCACCAGCACTGTTTGTGAAGAAGAAGGATGGATCGATGCGTCTTTGTATTGATTACAGAGAACTTAATAAGATTGACTTGCGGTCAGGTTATC
ACGAGTTGAGGATTAGAGACAGTTACATTCCTAAGACTGTTTTCGTTCGAGTCTTCATTGATGATATTTTGGTTTATTCCAAGATAGAGACTGAACATGAGGAGCATTTA
CATACATTTTTAGAGACTCTTCGAGCCAATAAACTTTATGCTAAATTATCAAAGTGTGAATTCTGGTTGAAGCAGGTGGCTTTCCTTGGTCATGTGGTATCTAGTGAGGG
AGTGTTTGTGGATCCTGTAAAAATTGAAGTTGTTACCAGTTGGTCTCGACCCTCTACAGTTAGTGAGGATATTATAGGAGGTTTGTGGAGGACTTTTCTCGTTACCAGGA
AGGGGACTCCTTTTGTTTGGAGCTTAACATGTGACGATAGTTTTAAGAACCTTAAGCAAAAGCTGGTTATTGCGCCGGTTCTTACAGTACCAGATGGATCTGGTAAAGCG
AATGTGGTAGCTGATGCTCTTAGTGGGAAAGTATCACATTCAACAGCACTTATTACCAGACAGGCCCCATTACATCGAGACATTGAGAGAGCTGAGATTAGTAGTGATCC
TTGTTTGGTGGAGAGACGTCACCTTGTAGGATCAGTGAAAACTGATGAGTTCTCCATATCCTCTAATGGTGGGCTGTTGTTTGAGAGACGTCTATGTGTGCTAGTAGACA
TTGCAGTTAAGACTGACTTACTAAATGAAGCCCATAATTCCCCATTTTCCATGCATCCTGGTAGTACGAAAATGTATGAGGATCTAAACGAGTTTATTGGTGGCGTTAAG
GCACCTAGACAAAAACCAGCAGGTTTGTTACAGCCTTTGAGTGTGCTAGAATGGAAGTGGGAGAATGTGTCGATGGATTTTATTACAGGACTGCCTAGAATCCTGAAGGG
TTACACAAGATGCACGTTTTACTTCCAAGTTTTAGAAGGAACTTCATACTGCCATGGGCACGAGTTAGATTTTAGTACAACCTTTCATGCACAGACTGATGGTCAAACTG
AACGTTTGAACCAAGTGTTGGAGGATATGCTGCGAGCTTTGCGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGCGTATGGTAGGTGGGGATGTGAGACAGATCACTTGGGACCAGTTTAGAGACCGCTTCTATACCAAGTTTTTCTCAGCTAACCTTAGAGATGTTAAAAGCCAGAAATT
CTTGGAATTGAAGCAAGGACACATGACAGTTGAGGAGTATGACCAGGAGTTCGACATGCTGTCGCGCTTTGCCCTTGAACTTGCACTAAAGCCCACCACTCAGGCTGAAG
CACTGTGTCTAGTAGTGGATATAAGTATTGGGAAGGACGAGGTTCATGCAAAGAATTTTGATAAGGGAAAATCGTATGGTCAGAAAAGGAAAGTAGAACAGTGGTTTGTG
GAAGTTCCTAAGAGGAACTTGAGATTAAGTGGTCCTCTTTGCAGTTTCCAGCAGAGTCCTGGTGGGGCAGGGGACACTACTAGAGAGAAGCCATTATGCAACACGTGTGG
AAAGCTCCATCTAGGTCGTTGTTTAGGTGGGACCAGAGTTTGCTTCAAGTACAGGCAGGAGGGACACATGGCTGATCGTTGTCCCTTGAGATTTACTGAGACTGGACAGA
GTAGTCAGAGATTTGCCATTAGTAGACTAGAGGCAGAGAGTGCTGACACAGTTGTGACAGAGGTGGAGCCCTTAGATTATGTTTTATCAGTGTCTACACCGTTTGGGAAA
ATTATGTTGTCTAAGGAAAAGATCAAAGCATGTAAAATTGAGATAACTGGTCGTGTGCTGGATAAGGTAGTGTTCAGTCCTCCTACCAAGCCTAGTTTTAAATTCAAAGC
GGTAGGAACAGTGGTACTACCTAAAGTGATTTCAGCTATGAAAGCTAGTAAACTGCTCAACCAGGGTACCTGGAGTATTTTGGCCAGTATGGTGGACACTAGGGAAGGTG
AGATTTCTTTAACTTCAAAACCTGTGGTAAAAGAATACCCAGATGTGTTTCCAGAAGAGCTTCTAGGACTTCCGCCTCATAGGGAGATTGATTTTGCTATTGTGTTGGAG
CCAGGTACTACTCCCATTTCTAGAGCCTCGTATAGAATGGCTCCGACTGAATTGAAAGAACTGAAGGTACAGTTACAAGAGTTGCTAGACAAGGGTTTTATTCGACCTAG
TGTGTCCCTTTGGGATGCACCAGCACTGTTTGTGAAGAAGAAGGATGGATCGATGCGTCTTTGTATTGATTACAGAGAACTTAATAAGATTGACTTGCGGTCAGGTTATC
ACGAGTTGAGGATTAGAGACAGTTACATTCCTAAGACTGTTTTCGTTCGAGTCTTCATTGATGATATTTTGGTTTATTCCAAGATAGAGACTGAACATGAGGAGCATTTA
CATACATTTTTAGAGACTCTTCGAGCCAATAAACTTTATGCTAAATTATCAAAGTGTGAATTCTGGTTGAAGCAGGTGGCTTTCCTTGGTCATGTGGTATCTAGTGAGGG
AGTGTTTGTGGATCCTGTAAAAATTGAAGTTGTTACCAGTTGGTCTCGACCCTCTACAGTTAGTGAGGATATTATAGGAGGTTTGTGGAGGACTTTTCTCGTTACCAGGA
AGGGGACTCCTTTTGTTTGGAGCTTAACATGTGACGATAGTTTTAAGAACCTTAAGCAAAAGCTGGTTATTGCGCCGGTTCTTACAGTACCAGATGGATCTGGTAAAGCG
AATGTGGTAGCTGATGCTCTTAGTGGGAAAGTATCACATTCAACAGCACTTATTACCAGACAGGCCCCATTACATCGAGACATTGAGAGAGCTGAGATTAGTAGTGATCC
TTGTTTGGTGGAGAGACGTCACCTTGTAGGATCAGTGAAAACTGATGAGTTCTCCATATCCTCTAATGGTGGGCTGTTGTTTGAGAGACGTCTATGTGTGCTAGTAGACA
TTGCAGTTAAGACTGACTTACTAAATGAAGCCCATAATTCCCCATTTTCCATGCATCCTGGTAGTACGAAAATGTATGAGGATCTAAACGAGTTTATTGGTGGCGTTAAG
GCACCTAGACAAAAACCAGCAGGTTTGTTACAGCCTTTGAGTGTGCTAGAATGGAAGTGGGAGAATGTGTCGATGGATTTTATTACAGGACTGCCTAGAATCCTGAAGGG
TTACACAAGATGCACGTTTTACTTCCAAGTTTTAGAAGGAACTTCATACTGCCATGGGCACGAGTTAGATTTTAGTACAACCTTTCATGCACAGACTGATGGTCAAACTG
AACGTTTGAACCAAGTGTTGGAGGATATGCTGCGAGCTTTGCGTTAG
Protein sequenceShow/hide protein sequence
MRMVGGDVRQITWDQFRDRFYTKFFSANLRDVKSQKFLELKQGHMTVEEYDQEFDMLSRFALELALKPTTQAEALCLVVDISIGKDEVHAKNFDKGKSYGQKRKVEQWFV
EVPKRNLRLSGPLCSFQQSPGGAGDTTREKPLCNTCGKLHLGRCLGGTRVCFKYRQEGHMADRCPLRFTETGQSSQRFAISRLEAESADTVVTEVEPLDYVLSVSTPFGK
IMLSKEKIKACKIEITGRVLDKVVFSPPTKPSFKFKAVGTVVLPKVISAMKASKLLNQGTWSILASMVDTREGEISLTSKPVVKEYPDVFPEELLGLPPHREIDFAIVLE
PGTTPISRASYRMAPTELKELKVQLQELLDKGFIRPSVSLWDAPALFVKKKDGSMRLCIDYRELNKIDLRSGYHELRIRDSYIPKTVFVRVFIDDILVYSKIETEHEEHL
HTFLETLRANKLYAKLSKCEFWLKQVAFLGHVVSSEGVFVDPVKIEVVTSWSRPSTVSEDIIGGLWRTFLVTRKGTPFVWSLTCDDSFKNLKQKLVIAPVLTVPDGSGKA
NVVADALSGKVSHSTALITRQAPLHRDIERAEISSDPCLVERRHLVGSVKTDEFSISSNGGLLFERRLCVLVDIAVKTDLLNEAHNSPFSMHPGSTKMYEDLNEFIGGVK
APRQKPAGLLQPLSVLEWKWENVSMDFITGLPRILKGYTRCTFYFQVLEGTSYCHGHELDFSTTFHAQTDGQTERLNQVLEDMLRALR