; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh04G020500 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh04G020500
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionTy3-gypsy retrotransposon protein
Genome locationCmo_Chr04:11435802..11445728
RNA-Seq ExpressionCmoCh04G020500
SyntenyCmoCh04G020500
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033091.1 putative Gag-pol protein [Cucumis melo var. makuwa]1.3e-10335.15Show/hide
Query:  DMIDLASGVIRISNKIDLRSGYHQLRIKESDISKTVFRTRYGHYEFRVMSFGLTNDLAAFMGLMNKVLKEFLDNFVIVFIDDILVYSKTREQHEEHLRKV
        D+ D   G   I +KIDLRSGYHQLRI++S I KT FR+RY HYEF VMSFGLTN LA FM LMN+V K+FLD+FVIVFIDDIL+YSKT  +HEEHL +V
Subjt:  DMIDLASGVIRISNKIDLRSGYHQLRIKESDISKTVFRTRYGHYEFRVMSFGLTNDLAAFMGLMNKVLKEFLDNFVIVFIDDILVYSKTREQHEEHLRKV

Query:  LTTLRTNKLFAKFSKCEFWAEQVGFLGHVVSRMRIIVEPAKIEAVTNWPRQTTMTE--------------------------------------------
        L TLR N+L+ KFSKCEFW ++V FLGHVVS   + V+PAKIEA+TNWPR +T++E                                            
Subjt:  LTTLRTNKLFAKFSKCEFWAEQVGFLGHVVSRMRIIVEPAKIEAVTNWPRQTTMTE--------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------NMKCDVAEFVSKCLVCQQVKVPRQKPAELLQ
                                                                             NMK +VA+FVS+CLVCQQVK  RQ+PA LLQ
Subjt:  ---------------------------------------------------------------------NMKCDVAEFVSKCLVCQQVKVPRQKPAELLQ

Query:  PLSVLEWKWENIAMDFIVRLPRTPRGYTVIWVVVDRLTKSAHFPPRKSTYSVDKWAQLYLKEVTRLHGVPM-----------------------------
        PLSV  WKWE+++MDFI+ LPRT +GYTVIWVVVDRLTK AHF P KSTY+  KW QLY+ E+ +LHGVP+                             
Subjt:  PLSVLEWKWENIAMDFIVRLPRTPRGYTVIWVVVDRLTKSAHFPPRKSTYSVDKWAQLYLKEVTRLHGVPM-----------------------------

Query:  -------DVQTERLNQILEDMLRACMLDFVSSWDTHLHLIEFVYNNSYQA--------------------------------------------------
               D QTERLNQILEDMLRAC+L+F  SWD+HLHL+EF YNNSY+                                                   
Subjt:  -------DVQTERLNQILEDMLRACMLDFVSSWDTHLHLIEFVYNNSYQA--------------------------------------------------

Query:  ---------------------------------------------TLDSSNVIDFKPLRLDKGLNHEEEPVQIVDREVKTLRNREIAFVKVLWRNHQLEE
                                                       D ++V+DF+PL++ + L++EE+PV+I+ REVK LR+REI+ VKVLWRNH +EE
Subjt:  ---------------------------------------------TLDSSNVIDFKPLRLDKGLNHEEEPVQIVDREVKTLRNREIAFVKVLWRNHQLEE

Query:  VTWEREEKMMSKYPELF
         TW+REE M ++YPELF
Subjt:  VTWEREEKMMSKYPELF

KAA0036958.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]1.7e-10035.81Show/hide
Query:  ISNKIDLRSGYHQLRIKESDISKTVFRTRYGHYEFRVMSFGLTNDLAAFMGLMNKVLKEFLDNFVIVFIDDILVYSKTREQHEEHLRKVLTTLRTNKLFA
        + +KIDLRSGYHQLRI++SDI KT FR+RY HYEF VMSFGLTN    FM LMN+V K+FLD+FVIVFIDDI +YS+T  +HEEHL +VL TLR N+L+A
Subjt:  ISNKIDLRSGYHQLRIKESDISKTVFRTRYGHYEFRVMSFGLTNDLAAFMGLMNKVLKEFLDNFVIVFIDDILVYSKTREQHEEHLRKVLTTLRTNKLFA

Query:  KFSKCEFWAEQVGFLGHVVSRMRIIVEPAKIEAVTNWPRQTTMTE-------------------------------------------------------
        KFSK EF  ++V FLGHVVS   + V+PAKIEAV+NWPR +T++E                                                       
Subjt:  KFSKCEFWAEQVGFLGHVVSRMRIIVEPAKIEAVTNWPRQTTMTE-------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------NMKCDVAEFVSKCLVCQQVKVPRQKPAELLQPLSVLEWKWENIAMDFIVRLPRTPRGYTVIWVV
                                            NMK +VA+FVS+CLVCQQVK  RQ+PA LLQ LSV   KWE+++MDFI  LP+T +GYTVIWVV
Subjt:  ------------------------------------NMKCDVAEFVSKCLVCQQVKVPRQKPAELLQPLSVLEWKWENIAMDFIVRLPRTPRGYTVIWVV

Query:  VDRLTKSAHFPPRKSTYSVDKWAQLYLKEVTRLHGVPM------------------------------------DVQTERLNQILEDMLRACMLDFVSSW
        VDRLTKSAHF P KSTY+ +KW QLY+ E+ RLHGVP+                                    D QTERLNQILEDMLRAC+LDF  SW
Subjt:  VDRLTKSAHFPPRKSTYSVDKWAQLYLKEVTRLHGVPM------------------------------------DVQTERLNQILEDMLRACMLDFVSSW

Query:  DTHLHLIEFVYNNSYQATL---------------------------------------------------------------------------------
        D+HLHL+EF YN SYQAT+                                                                                 
Subjt:  DTHLHLIEFVYNNSYQATL---------------------------------------------------------------------------------

Query:  ------------------DSSNVIDFKPLRLDKGLNHEEEPVQIVDREVKTLRNREIAFVKVLWRNHQLEEVTWEREEKMMSKYPEL
                          D ++V+DF+PL++ + L++EE+PV+I+ REVK L +REI+ VKVLWRNH +EEVTWEREE M ++YPEL
Subjt:  ------------------DSSNVIDFKPLRLDKGLNHEEEPVQIVDREVKTLRNREIAFVKVLWRNHQLEEVTWEREEKMMSKYPEL

KAA0051522.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]3.0e-10534.61Show/hide
Query:  DMIDLASGVIRISNKIDLRSGYHQLRIKESDISKTVFRTRYGHYEFRVMSFGLTNDLAAFMGLMNKVLKEFLDNFVIVFIDDILVYSKTREQHEEHLRKV
        D+ D   G   + +KIDLRSGYHQLRIK  D+ KT FR+RYGHYEF VMSFGLTN  A FM LMN+V +EFLD FVIVFIDDIL+YSKT  +HEEHLR V
Subjt:  DMIDLASGVIRISNKIDLRSGYHQLRIKESDISKTVFRTRYGHYEFRVMSFGLTNDLAAFMGLMNKVLKEFLDNFVIVFIDDILVYSKTREQHEEHLRKV

Query:  LTTLRTNKLFAKFSKCEFWAEQVGFLGHVVSRMRIIVEPAKIEAVTNWPRQTTMTE--------------------------------------------
        L TLR NKL+AKFSKCEFW +QV FLGHVVS+  + V+PAKIEAVT W R +T++E                                            
Subjt:  LTTLRTNKLFAKFSKCEFWAEQVGFLGHVVSRMRIIVEPAKIEAVTNWPRQTTMTE--------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------------------------------------------NMKCDVAEFVSKCLVCQQVKVPRQKPAEL
                                                                               NMK +VAEFVS+CLVCQQVK PRQKPA L
Subjt:  -----------------------------------------------------------------------NMKCDVAEFVSKCLVCQQVKVPRQKPAEL

Query:  LQPLSVLEWKWENIAMDFIVRLPRTPRGYTVIWVVVDRLTKSAHFPPRKSTYSVDKWAQLYLKEVTRLHGVPM---------------------------
        LQPLS+ EWKWEN++MDFI  LPRT RG+TVIWVVV+RLTKSAHF P KSTY+  KWAQLY+ E+ RLHGVP+                           
Subjt:  LQPLSVLEWKWENIAMDFIVRLPRTPRGYTVIWVVVDRLTKSAHFPPRKSTYSVDKWAQLYLKEVTRLHGVPM---------------------------

Query:  ---------DVQTERLNQILEDMLRACMLDFVSSWDTHLHLIEFVYNNSYQATL----------------------------------------------
                 D QTERLNQ+LEDMLRAC L+F  SWD+HLHL+EF YNNSYQAT+                                              
Subjt:  ---------DVQTERLNQILEDMLRACMLDFVSSWDTHLHLIEFVYNNSYQATL----------------------------------------------

Query:  ------------------------------------------------------------------------------------------DSSNVIDFKP
                                                                                                  D S+V+D++P
Subjt:  ------------------------------------------------------------------------------------------DSSNVIDFKP

Query:  LRLDKGLNHEEEPVQIVDREVKTLRNREIAFVKVLWRNHQLEEVTWEREEKMMSKYPELF
        L +D+ L++ E+PV ++ REVK LRNREI FVK+LWRNH++EE TWERE+ M S+YPELF
Subjt:  LRLDKGLNHEEEPVQIVDREVKTLRNREIAFVKVLWRNHQLEEVTWEREEKMMSKYPELF

TYK01415.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]1.4e-10234.03Show/hide
Query:  DMIDLASGVIRISNKIDLRSGYHQLRIKESDISKTVFRTRYGHYEFRVMSFGLTNDLAAFMGLMNKVLKEFLDNFVIVFIDDILVYSKTREQHEEHLRKV
        D+ D   G   + +KI LR GYHQLRIK+ D+ KT FR+RYGHYEF VMSFGLTN  A FM LMN+V +EFLD FVIVFIDDIL+YSKT  +HEEHLR V
Subjt:  DMIDLASGVIRISNKIDLRSGYHQLRIKESDISKTVFRTRYGHYEFRVMSFGLTNDLAAFMGLMNKVLKEFLDNFVIVFIDDILVYSKTREQHEEHLRKV

Query:  LTTLRTNKLFAKFSKCEFWAEQVGFLGHVVSRMRIIVEPAKIEAVTNWPRQTTMTE--------------------------------------------
        L TLR NKL+AKFSKCEFW + V FLGHVVS+  + V+P KIEAVT+W R +T++E                                            
Subjt:  LTTLRTNKLFAKFSKCEFWAEQVGFLGHVVSRMRIIVEPAKIEAVTNWPRQTTMTE--------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --NMKCDVAEFVSKCLVCQQVKVPRQKPAELLQPLSVLEWKWENIAMDFIVRLPRTPRGYTVIWVVVDRLTKSAHFPPRKSTYSVDKWAQLYLKEVTRLH
          NMK +VAEFVSKCLVCQQVK PRQKPA LLQ LS+ EWKWEN++MDFI  LPRT RG+TVIWVVVDRLTKSAHF P KSTY+  KWAQLY+ E+ RLH
Subjt:  --NMKCDVAEFVSKCLVCQQVKVPRQKPAELLQPLSVLEWKWENIAMDFIVRLPRTPRGYTVIWVVVDRLTKSAHFPPRKSTYSVDKWAQLYLKEVTRLH

Query:  GVPM------------------------------------DVQTERLNQILEDMLRACMLDFVSSWDTHLHLIEFVYNNSYQATL---------------
        GVP+                                    D QTERLNQ+LEDMLRAC L+F+ SWD+HLHL+EF YNNS+QAT+               
Subjt:  GVPM------------------------------------DVQTERLNQILEDMLRACMLDFVSSWDTHLHLIEFVYNNSYQATL---------------

Query:  -------------------------------------------------------------------------------------------DSSNVIDFK
                                                                                                   D S+V+D++
Subjt:  -------------------------------------------------------------------------------------------DSSNVIDFK

Query:  PLRLDKGLNHEEEPVQIVDREVKTLRNREIAFVKVLWRNHQLEEVTWEREEKMMSKYPELF
        PL +D+ L++ E+PV+++ REVK LRNREIA VKVLWRNH++EE TWE+E+ M S+YPELF
Subjt:  PLRLDKGLNHEEEPVQIVDREVKTLRNREIAFVKVLWRNHQLEEVTWEREEKMMSKYPELF

TYK21249.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]1.7e-10035.81Show/hide
Query:  ISNKIDLRSGYHQLRIKESDISKTVFRTRYGHYEFRVMSFGLTNDLAAFMGLMNKVLKEFLDNFVIVFIDDILVYSKTREQHEEHLRKVLTTLRTNKLFA
        + +KIDLRSGYHQLRI++SDI KT FR+RY HYEF VMSFGLTN    FM LMN+V K+FLD+FVIVFIDDI +YS+T  +HEEHL +VL TLR N+L+A
Subjt:  ISNKIDLRSGYHQLRIKESDISKTVFRTRYGHYEFRVMSFGLTNDLAAFMGLMNKVLKEFLDNFVIVFIDDILVYSKTREQHEEHLRKVLTTLRTNKLFA

Query:  KFSKCEFWAEQVGFLGHVVSRMRIIVEPAKIEAVTNWPRQTTMTE-------------------------------------------------------
        KFSK EF  ++V FLGHVVS   + V+PAKIEAV+NWPR +T++E                                                       
Subjt:  KFSKCEFWAEQVGFLGHVVSRMRIIVEPAKIEAVTNWPRQTTMTE-------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------NMKCDVAEFVSKCLVCQQVKVPRQKPAELLQPLSVLEWKWENIAMDFIVRLPRTPRGYTVIWVV
                                            NMK +VA+FVS+CLVCQQVK  RQ+PA LLQ LSV   KWE+++MDFI  LP+T +GYTVIWVV
Subjt:  ------------------------------------NMKCDVAEFVSKCLVCQQVKVPRQKPAELLQPLSVLEWKWENIAMDFIVRLPRTPRGYTVIWVV

Query:  VDRLTKSAHFPPRKSTYSVDKWAQLYLKEVTRLHGVPM------------------------------------DVQTERLNQILEDMLRACMLDFVSSW
        VDRLTKSAHF P KSTY+ +KW QLY+ E+ RLHGVP+                                    D QTERLNQILEDMLRAC+LDF  SW
Subjt:  VDRLTKSAHFPPRKSTYSVDKWAQLYLKEVTRLHGVPM------------------------------------DVQTERLNQILEDMLRACMLDFVSSW

Query:  DTHLHLIEFVYNNSYQATL---------------------------------------------------------------------------------
        D+HLHL+EF YN SYQAT+                                                                                 
Subjt:  DTHLHLIEFVYNNSYQATL---------------------------------------------------------------------------------

Query:  ------------------DSSNVIDFKPLRLDKGLNHEEEPVQIVDREVKTLRNREIAFVKVLWRNHQLEEVTWEREEKMMSKYPEL
                          D ++V+DF+PL++ + L++EE+PV+I+ REVK L +REI+ VKVLWRNH +EEVTWEREE M ++YPEL
Subjt:  ------------------DSSNVIDFKPLRLDKGLNHEEEPVQIVDREVKTLRNREIAFVKVLWRNHQLEEVTWEREEKMMSKYPEL

TrEMBL top hitse value%identityAlignment
A0A5A7SUC9 Putative Gag-pol protein6.2e-10435.15Show/hide
Query:  DMIDLASGVIRISNKIDLRSGYHQLRIKESDISKTVFRTRYGHYEFRVMSFGLTNDLAAFMGLMNKVLKEFLDNFVIVFIDDILVYSKTREQHEEHLRKV
        D+ D   G   I +KIDLRSGYHQLRI++S I KT FR+RY HYEF VMSFGLTN LA FM LMN+V K+FLD+FVIVFIDDIL+YSKT  +HEEHL +V
Subjt:  DMIDLASGVIRISNKIDLRSGYHQLRIKESDISKTVFRTRYGHYEFRVMSFGLTNDLAAFMGLMNKVLKEFLDNFVIVFIDDILVYSKTREQHEEHLRKV

Query:  LTTLRTNKLFAKFSKCEFWAEQVGFLGHVVSRMRIIVEPAKIEAVTNWPRQTTMTE--------------------------------------------
        L TLR N+L+ KFSKCEFW ++V FLGHVVS   + V+PAKIEA+TNWPR +T++E                                            
Subjt:  LTTLRTNKLFAKFSKCEFWAEQVGFLGHVVSRMRIIVEPAKIEAVTNWPRQTTMTE--------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------NMKCDVAEFVSKCLVCQQVKVPRQKPAELLQ
                                                                             NMK +VA+FVS+CLVCQQVK  RQ+PA LLQ
Subjt:  ---------------------------------------------------------------------NMKCDVAEFVSKCLVCQQVKVPRQKPAELLQ

Query:  PLSVLEWKWENIAMDFIVRLPRTPRGYTVIWVVVDRLTKSAHFPPRKSTYSVDKWAQLYLKEVTRLHGVPM-----------------------------
        PLSV  WKWE+++MDFI+ LPRT +GYTVIWVVVDRLTK AHF P KSTY+  KW QLY+ E+ +LHGVP+                             
Subjt:  PLSVLEWKWENIAMDFIVRLPRTPRGYTVIWVVVDRLTKSAHFPPRKSTYSVDKWAQLYLKEVTRLHGVPM-----------------------------

Query:  -------DVQTERLNQILEDMLRACMLDFVSSWDTHLHLIEFVYNNSYQA--------------------------------------------------
               D QTERLNQILEDMLRAC+L+F  SWD+HLHL+EF YNNSY+                                                   
Subjt:  -------DVQTERLNQILEDMLRACMLDFVSSWDTHLHLIEFVYNNSYQA--------------------------------------------------

Query:  ---------------------------------------------TLDSSNVIDFKPLRLDKGLNHEEEPVQIVDREVKTLRNREIAFVKVLWRNHQLEE
                                                       D ++V+DF+PL++ + L++EE+PV+I+ REVK LR+REI+ VKVLWRNH +EE
Subjt:  ---------------------------------------------TLDSSNVIDFKPLRLDKGLNHEEEPVQIVDREVKTLRNREIAFVKVLWRNHQLEE

Query:  VTWEREEKMMSKYPELF
         TW+REE M ++YPELF
Subjt:  VTWEREEKMMSKYPELF

A0A5A7T2F0 Ty3-gypsy retrotransposon protein8.4e-10135.81Show/hide
Query:  ISNKIDLRSGYHQLRIKESDISKTVFRTRYGHYEFRVMSFGLTNDLAAFMGLMNKVLKEFLDNFVIVFIDDILVYSKTREQHEEHLRKVLTTLRTNKLFA
        + +KIDLRSGYHQLRI++SDI KT FR+RY HYEF VMSFGLTN    FM LMN+V K+FLD+FVIVFIDDI +YS+T  +HEEHL +VL TLR N+L+A
Subjt:  ISNKIDLRSGYHQLRIKESDISKTVFRTRYGHYEFRVMSFGLTNDLAAFMGLMNKVLKEFLDNFVIVFIDDILVYSKTREQHEEHLRKVLTTLRTNKLFA

Query:  KFSKCEFWAEQVGFLGHVVSRMRIIVEPAKIEAVTNWPRQTTMTE-------------------------------------------------------
        KFSK EF  ++V FLGHVVS   + V+PAKIEAV+NWPR +T++E                                                       
Subjt:  KFSKCEFWAEQVGFLGHVVSRMRIIVEPAKIEAVTNWPRQTTMTE-------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------NMKCDVAEFVSKCLVCQQVKVPRQKPAELLQPLSVLEWKWENIAMDFIVRLPRTPRGYTVIWVV
                                            NMK +VA+FVS+CLVCQQVK  RQ+PA LLQ LSV   KWE+++MDFI  LP+T +GYTVIWVV
Subjt:  ------------------------------------NMKCDVAEFVSKCLVCQQVKVPRQKPAELLQPLSVLEWKWENIAMDFIVRLPRTPRGYTVIWVV

Query:  VDRLTKSAHFPPRKSTYSVDKWAQLYLKEVTRLHGVPM------------------------------------DVQTERLNQILEDMLRACMLDFVSSW
        VDRLTKSAHF P KSTY+ +KW QLY+ E+ RLHGVP+                                    D QTERLNQILEDMLRAC+LDF  SW
Subjt:  VDRLTKSAHFPPRKSTYSVDKWAQLYLKEVTRLHGVPM------------------------------------DVQTERLNQILEDMLRACMLDFVSSW

Query:  DTHLHLIEFVYNNSYQATL---------------------------------------------------------------------------------
        D+HLHL+EF YN SYQAT+                                                                                 
Subjt:  DTHLHLIEFVYNNSYQATL---------------------------------------------------------------------------------

Query:  ------------------DSSNVIDFKPLRLDKGLNHEEEPVQIVDREVKTLRNREIAFVKVLWRNHQLEEVTWEREEKMMSKYPEL
                          D ++V+DF+PL++ + L++EE+PV+I+ REVK L +REI+ VKVLWRNH +EEVTWEREE M ++YPEL
Subjt:  ------------------DSSNVIDFKPLRLDKGLNHEEEPVQIVDREVKTLRNREIAFVKVLWRNHQLEEVTWEREEKMMSKYPEL

A0A5A7T690 Pol protein2.1e-9933.12Show/hide
Query:  DMIDLASGVIRISNKIDLRSGYHQLRIKESDISKTVFRTRYGHYEFRVMSFGLTNDLAAFMGLMNKVLKEFLDNFVIVFIDDILVYSKTREQHEEHLRKV
        D+ D   G   + +KIDLRSGYHQLRI++SDI KT FR+RYGHYEF VMSFGLTN  + FM LMNKV K+FLD+F+IVFIDDIL+YSKT  +HEEHL +V
Subjt:  DMIDLASGVIRISNKIDLRSGYHQLRIKESDISKTVFRTRYGHYEFRVMSFGLTNDLAAFMGLMNKVLKEFLDNFVIVFIDDILVYSKTREQHEEHLRKV

Query:  LTTLRTNKLFAKFSKCEFWAEQVGFLGHVVSRMRIIVEPAKIEAVTNWPRQTTMTE--------------------------------------------
        L TLR N+L+AKFSKCEFW ++V FLGHVVS  R+ V+PAKIEAVTNWPR +T++E                                            
Subjt:  LTTLRTNKLFAKFSKCEFWAEQVGFLGHVVSRMRIIVEPAKIEAVTNWPRQTTMTE--------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------NMKCDVAEFVSKCLVCQQVKVPRQKPAELLQPLSVLEWKWENIAMDFIVRLPRTPRGYTVIWVVVDRLTKSAHFPPRKSTYSVDKWA
                     NMK +VA+FVS+CLVCQQVK PRQ+PAELLQP+SV  WKWE+++MDFI  LP+T +GYTVIWVVVDRLTKSAHF P KSTY+  KW 
Subjt:  -------------NMKCDVAEFVSKCLVCQQVKVPRQKPAELLQPLSVLEWKWENIAMDFIVRLPRTPRGYTVIWVVVDRLTKSAHFPPRKSTYSVDKWA

Query:  QLYLKEVTRLHGVPM------------------------------------DVQTERLNQILEDMLRACMLDFVSSWDTHLHLIEFVYNNSYQATL----
        QLY+ E+ RLH VP+                                    D QTERLNQ+LEDMLRACML+F  SWD+HLHL+EF YNNSYQAT+    
Subjt:  QLYLKEVTRLHGVPM------------------------------------DVQTERLNQILEDMLRACMLDFVSSWDTHLHLIEFVYNNSYQATL----

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----DSSNVIDFKPLRLDKGLNHEEEPVQIVDREVKTLRNREIAFVKVLWRNHQLEEVTWEREEKM
             D ++V+DFKPL++ + L++EE+PV+I+ REVK LR+REI+ VKV+WRNH +EE TWEREE M
Subjt:  -----DSSNVIDFKPLRLDKGLNHEEEPVQIVDREVKTLRNREIAFVKVLWRNHQLEEVTWEREEKM

A0A5A7U6Z4 Ty3-gypsy retrotransposon protein1.5e-10534.61Show/hide
Query:  DMIDLASGVIRISNKIDLRSGYHQLRIKESDISKTVFRTRYGHYEFRVMSFGLTNDLAAFMGLMNKVLKEFLDNFVIVFIDDILVYSKTREQHEEHLRKV
        D+ D   G   + +KIDLRSGYHQLRIK  D+ KT FR+RYGHYEF VMSFGLTN  A FM LMN+V +EFLD FVIVFIDDIL+YSKT  +HEEHLR V
Subjt:  DMIDLASGVIRISNKIDLRSGYHQLRIKESDISKTVFRTRYGHYEFRVMSFGLTNDLAAFMGLMNKVLKEFLDNFVIVFIDDILVYSKTREQHEEHLRKV

Query:  LTTLRTNKLFAKFSKCEFWAEQVGFLGHVVSRMRIIVEPAKIEAVTNWPRQTTMTE--------------------------------------------
        L TLR NKL+AKFSKCEFW +QV FLGHVVS+  + V+PAKIEAVT W R +T++E                                            
Subjt:  LTTLRTNKLFAKFSKCEFWAEQVGFLGHVVSRMRIIVEPAKIEAVTNWPRQTTMTE--------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------------------------------------------NMKCDVAEFVSKCLVCQQVKVPRQKPAEL
                                                                               NMK +VAEFVS+CLVCQQVK PRQKPA L
Subjt:  -----------------------------------------------------------------------NMKCDVAEFVSKCLVCQQVKVPRQKPAEL

Query:  LQPLSVLEWKWENIAMDFIVRLPRTPRGYTVIWVVVDRLTKSAHFPPRKSTYSVDKWAQLYLKEVTRLHGVPM---------------------------
        LQPLS+ EWKWEN++MDFI  LPRT RG+TVIWVVV+RLTKSAHF P KSTY+  KWAQLY+ E+ RLHGVP+                           
Subjt:  LQPLSVLEWKWENIAMDFIVRLPRTPRGYTVIWVVVDRLTKSAHFPPRKSTYSVDKWAQLYLKEVTRLHGVPM---------------------------

Query:  ---------DVQTERLNQILEDMLRACMLDFVSSWDTHLHLIEFVYNNSYQATL----------------------------------------------
                 D QTERLNQ+LEDMLRAC L+F  SWD+HLHL+EF YNNSYQAT+                                              
Subjt:  ---------DVQTERLNQILEDMLRACMLDFVSSWDTHLHLIEFVYNNSYQATL----------------------------------------------

Query:  ------------------------------------------------------------------------------------------DSSNVIDFKP
                                                                                                  D S+V+D++P
Subjt:  ------------------------------------------------------------------------------------------DSSNVIDFKP

Query:  LRLDKGLNHEEEPVQIVDREVKTLRNREIAFVKVLWRNHQLEEVTWEREEKMMSKYPELF
        L +D+ L++ E+PV ++ REVK LRNREI FVK+LWRNH++EE TWERE+ M S+YPELF
Subjt:  LRLDKGLNHEEEPVQIVDREVKTLRNREIAFVKVLWRNHQLEEVTWEREEKMMSKYPELF

A0A5D3DDB2 Ty3-gypsy retrotransposon protein8.4e-10135.81Show/hide
Query:  ISNKIDLRSGYHQLRIKESDISKTVFRTRYGHYEFRVMSFGLTNDLAAFMGLMNKVLKEFLDNFVIVFIDDILVYSKTREQHEEHLRKVLTTLRTNKLFA
        + +KIDLRSGYHQLRI++SDI KT FR+RY HYEF VMSFGLTN    FM LMN+V K+FLD+FVIVFIDDI +YS+T  +HEEHL +VL TLR N+L+A
Subjt:  ISNKIDLRSGYHQLRIKESDISKTVFRTRYGHYEFRVMSFGLTNDLAAFMGLMNKVLKEFLDNFVIVFIDDILVYSKTREQHEEHLRKVLTTLRTNKLFA

Query:  KFSKCEFWAEQVGFLGHVVSRMRIIVEPAKIEAVTNWPRQTTMTE-------------------------------------------------------
        KFSK EF  ++V FLGHVVS   + V+PAKIEAV+NWPR +T++E                                                       
Subjt:  KFSKCEFWAEQVGFLGHVVSRMRIIVEPAKIEAVTNWPRQTTMTE-------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------NMKCDVAEFVSKCLVCQQVKVPRQKPAELLQPLSVLEWKWENIAMDFIVRLPRTPRGYTVIWVV
                                            NMK +VA+FVS+CLVCQQVK  RQ+PA LLQ LSV   KWE+++MDFI  LP+T +GYTVIWVV
Subjt:  ------------------------------------NMKCDVAEFVSKCLVCQQVKVPRQKPAELLQPLSVLEWKWENIAMDFIVRLPRTPRGYTVIWVV

Query:  VDRLTKSAHFPPRKSTYSVDKWAQLYLKEVTRLHGVPM------------------------------------DVQTERLNQILEDMLRACMLDFVSSW
        VDRLTKSAHF P KSTY+ +KW QLY+ E+ RLHGVP+                                    D QTERLNQILEDMLRAC+LDF  SW
Subjt:  VDRLTKSAHFPPRKSTYSVDKWAQLYLKEVTRLHGVPM------------------------------------DVQTERLNQILEDMLRACMLDFVSSW

Query:  DTHLHLIEFVYNNSYQATL---------------------------------------------------------------------------------
        D+HLHL+EF YN SYQAT+                                                                                 
Subjt:  DTHLHLIEFVYNNSYQATL---------------------------------------------------------------------------------

Query:  ------------------DSSNVIDFKPLRLDKGLNHEEEPVQIVDREVKTLRNREIAFVKVLWRNHQLEEVTWEREEKMMSKYPEL
                          D ++V+DF+PL++ + L++EE+PV+I+ REVK L +REI+ VKVLWRNH +EEVTWEREE M ++YPEL
Subjt:  ------------------DSSNVIDFKPLRLDKGLNHEEEPVQIVDREVKTLRNREIAFVKVLWRNHQLEEVTWEREEKMMSKYPEL

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.61.5e-2239.01Show/hide
Query:  IDLRSGYHQLRIKESDISKTVFRTRYGHYEFRVMSFGLTNDLAAFMGLMNKVLKEFLDNFVIVFIDDILVYSKTREQHEEHLRKVLTTLRTNKLFAKFSK
        IDL  G+HQ+ +    +SKT F T++GHYE+  M FGL N  A F   MN +L+  L+   +V++DDI+V+S + ++H + L  V   L    L  +  K
Subjt:  IDLRSGYHQLRIKESDISKTVFRTRYGHYEFRVMSFGLTNDLAAFMGLMNKVLKEFLDNFVIVFIDDILVYSKTREQHEEHLRKVLTTLRTNKLFAKFSK

Query:  CEFWAEQVGFLGHVVSRMRIIVEPAKIEAVTNWPRQTTMTE
        CEF  ++  FLGHV++   I   P KIEA+  +P  T   E
Subjt:  CEFWAEQVGFLGHVVSRMRIIVEPAKIEAVTNWPRQTTMTE

P20825 Retrovirus-related Pol polyprotein from transposon 2974.0e-2330.96Show/hide
Query:  KCSCLEIFDQQESLKLDMIDLASGVIRISNKIDLRSGYHQLRIKESDISKTVFRTRYGHYEFRVMSFGLTNDLAAFMGLMNKVLKEFLDNFVIVFIDDIL
        K + + I D+     +D I    G  +    IDL  G+HQ+ + E  ISKT F T+ GHYE+  M FGL N  A F   MN +L+  L+   +V++DDI+
Subjt:  KCSCLEIFDQQESLKLDMIDLASGVIRISNKIDLRSGYHQLRIKESDISKTVFRTRYGHYEFRVMSFGLTNDLAAFMGLMNKVLKEFLDNFVIVFIDDIL

Query:  VYSKTREQHEEHLRKVLTTLRTNKLFAKFSKCEFWAEQVGFLGHVVSRMRIIVEPAKIEAVTNWPRQTTMTENMK---------------CDVAEFVSKC
        ++S +  +H   ++ V T L    L  +  KCEF  ++  FLGH+V+   I   P K++A+ ++P  T   E                   D+A+ ++ C
Subjt:  VYSKTREQHEEHLRKVLTTLRTNKLFAKFSKCEFWAEQVGFLGHVVSRMRIIVEPAKIEAVTNWPRQTTMTENMK---------------CDVAEFVSKC

Query:  LVCQQVKVPRQKPAELLQPLSVLEWKWENIAMDFIVRLP
        L  ++ K+  QK    L+ +   E     I  D I++LP
Subjt:  LVCQQVKVPRQKPAELLQPLSVLEWKWENIAMDFIVRLP

P31843 RNA-directed DNA polymerase homolog1.2e-1957.3Show/hide
Query:  KIDLRSGYHQLRIKESDISKTVFRTRYGHYEFRVMSFGLTNDLAAFMGLMNKVLKEFLDNFVIVFIDDILV---YSKTREQHEEHLRKV
        K+DLRSGY Q+RI + D  KT   TRYG +EFRVM FGLTN LA F  LMN VL E+LD+FV+V++DD++V   YS +  +H +HLR V
Subjt:  KIDLRSGYHQLRIKESDISKTVFRTRYGHYEFRVMSFGLTNDLAAFMGLMNKVLKEFLDNFVIVFIDDILV---YSKTREQHEEHLRKV

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus4.4e-2239.72Show/hide
Query:  IDLRSGYHQLRIKESDISKTVFRTRYGHYEFRVMSFGLTNDLAAFMGLMNKVLKEFLDNFVIVFIDDILVYSKTREQHEEHLRKVLTTLRTNKLFAKFSK
        +DL SG+HQ+ +KESDI KT F T  G YEF  + FGL N  A F  +++ +L+E +     V+IDDI+V+S+  + H ++LR VL +L    L     K
Subjt:  IDLRSGYHQLRIKESDISKTVFRTRYGHYEFRVMSFGLTNDLAAFMGLMNKVLKEFLDNFVIVFIDDILVYSKTREQHEEHLRKVLTTLRTNKLFAKFSK

Query:  CEFWAEQVGFLGHVVSRMRIIVEPAKIEAVTNWPRQTTMTE
          F   QV FLG++V+   I  +P K+ A++  P  T++ E
Subjt:  CEFWAEQVGFLGHVVSRMRIIVEPAKIEAVTNWPRQTTMTE

Q99315 Transposon Ty3-G Gag-Pol polyprotein1.6e-1934.81Show/hide
Query:  KLDMIDLASGVIRISNKIDLRSGYHQLRIKESDISKTVFRTRYGHYEFRVMSFGLTNDLAAFMGLMNKVLKEFLDNFVIVFIDDILVYSKTREQHEEHLR
        ++D +    G  +I   +DL SGYHQ+ ++  D  KT F T  G YE+ VM FGL N  + F   M    ++    FV V++DDIL++S++ E+H +HL 
Subjt:  KLDMIDLASGVIRISNKIDLRSGYHQLRIKESDISKTVFRTRYGHYEFRVMSFGLTNDLAAFMGLMNKVLKEFLDNFVIVFIDDILVYSKTREQHEEHLR

Query:  KVLTTLRTNKLFAKFSKCEFWAEQVGFLGHVVSRMRIIVEPAKIEAVTNWPRQTTMTE
         VL  L+   L  K  KC+F +E+  FLG+ +   +I     K  A+ ++P   T+ +
Subjt:  KVLTTLRTNKLFAKFSKCEFWAEQVGFLGHVVSRMRIIVEPAKIEAVTNWPRQTTMTE

Q99315 Transposon Ty3-G Gag-Pol polyprotein3.4e-1431.54Show/hide
Query:  IIVEPAKIEAVTNWPRQTTMTENMKCDVAEFVSKCLVCQQVKVPRQKPAELLQPLSVLEWKWENIAMDFIVRLPRTPRGYTVIWVVVDRLTKSAHFPPRK
        + V  AKI  +  WP+       ++  + +++  C+ CQ +K  R +   LLQPL + E +W +I+MDF+  LP T     +I VVVDR +K AHF   +
Subjt:  IIVEPAKIEAVTNWPRQTTMTENMKCDVAEFVSKCLVCQQVKVPRQKPAELLQPLSVLEWKWENIAMDFIVRLPRTPRGYTVIWVVVDRLTKSAHFPPRK

Query:  STYSVDKWAQLYLKEVTRLHGVPMDVQTER
         T    +   L  + +   HG P  + ++R
Subjt:  STYSVDKWAQLYLKEVTRLHGVPMDVQTER

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein6.5e-0534.92Show/hide
Query:  HLRKVLTTLRTNKLFAKFSKCEFWAEQVGFLG--HVVSRMRIIVEPAKIEAVTNWPRQTTMTE
        HL  VL     ++ +A   KC F   Q+ +LG  H++S   +  +PAK+EA+  WP     TE
Subjt:  HLRKVLTTLRTNKLFAKFSKCEFWAEQVGFLG--HVVSRMRIIVEPAKIEAVTNWPRQTTMTE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCAATCTTGCTTGTGGAGTGATTCGAATCTCAATCAAGTGTTCTTGTCTTGAGATATTTGATCAACAAGAATCATTGAAGCTAGACATGATCGATCTTGCTTCTGG
AGTGATTCGAATCTCAAACAAAATTGACTTACGGTCAGGCTACCATCAATTGAGGATTAAGGAGAGTGATATCTCAAAGACAGTCTTTCGCACAAGGTATGGACATTACG
AGTTTAGGGTAATGTCGTTTGGCTTGACCAACGATCTAGCAGCATTTATGGGGTTGATGAACAAAGTGCTCAAAGAATTCTTAGACAACTTCGTGATTGTCTTCATCGAT
GATATCCTTGTGTATTCCAAGACTAGGGAACAACACGAGGAACATCTTAGGAAGGTGTTGACCACTCTGAGAACGAACAAGTTGTTTGCAAAGTTCTCCAAGTGTGAGTT
CTGGGCAGAACAAGTAGGATTTCTTGGGCATGTAGTTTCACGAATGAGAATCATCGTAGAGCCAGCAAAGATTGAAGCAGTGACGAATTGGCCTCGCCAGACCACAATGA
CTGAGAATATGAAATGTGATGTAGCTGAGTTTGTGAGCAAGTGTTTGGTTTGTCAACAAGTCAAAGTCCCAAGGCAGAAGCCAGCGGAGCTATTACAACCACTTAGCGTA
CTTGAATGGAAGTGGGAGAACATCGCCATGGACTTTATTGTGAGATTACCGAGGACGCCCAGAGGCTACACAGTTATATGGGTTGTGGTTGATAGGCTCACCAAGTCAGC
TCACTTCCCGCCAAGGAAGTCCACTTATTCCGTGGATAAATGGGCTCAGTTGTACTTGAAGGAAGTAACAAGATTGCATGGGGTACCGATGGATGTGCAAACTGAACGCT
TAAACCAAATTCTGGAGGACATGTTACGAGCTTGTATGTTGGACTTTGTCAGTAGCTGGGACACACACTTGCATCTCATAGAGTTTGTTTATAACAACAGTTATCAAGCG
ACATTGGACTCATCGAATGTGATAGATTTCAAACCTCTACGGTTGGATAAGGGTTTGAACCATGAGGAAGAACCAGTGCAAATTGTGGATAGAGAAGTGAAAACCCTACG
TAACCGAGAGATCGCATTTGTCAAAGTATTATGGCGGAATCACCAGCTAGAAGAAGTCACTTGGGAACGAGAAGAGAAAATGATGAGCAAGTACCCTGAGCTCTTCAGAG
CGGAAGGTTGTGTGCTGTTGTGTAATCTGATTAGGCTGTGTGTTGTGGTAGCAACGATTAAGGGGCTTGTGGTAATAGTTACACGCTGTGTGTTTCGGGTGTTTGTCTGG
ATGATTGTCGGGTTGAACATTGTTACTGCTAAGAATTTTGCTAGTGGCATGGGATTGATTTTGTGTTCGAGTAAGAAGTCCATCCCTAAAACAATGTTGAAGTCATCCAT
TTTTACCATCACAAAGTCAGTTTGCCCAGTCCAAGTCCACAGCTTGATCAGAGTTCTCTTTGCTACTTCTAAGACAGGCCTATACATAAGACCGCGTTCCACTAGCTCTT
CTATCTCTTCCAAGAGAGTGGACAAGTACTTTAGTGCCCTCATTTGGGGGTGCTATGATGGTGTTACTATAACATGTGCGGAAGCAATTTGGATCTTAGGCACTCTAAAC
AACGACGCCCTTCTCCATTTTCTCTCTTCCACACCTTTCATTCTTTCTTCGGATTCCTTCTCCCTTCTATTCTGGCAACGGCAACGACGCCAACCAGCCTTCCCGTTTGG
AAGCGGGCCAACGCTACCCAGCAGCTCTTCTTCCGGAAACGACAACGGTGGAAGTAGGGTTTCTGTAGAACTTGTCGGAGCATCTCATTTCTTTTGTGTTTGTTTTCAAT
TCATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGATCAATCTTGCTTGTGGAGTGATTCGAATCTCAATCAAGTGTTCTTGTCTTGAGATATTTGATCAACAAGAATCATTGAAGCTAGACATGATCGATCTTGCTTCTGG
AGTGATTCGAATCTCAAACAAAATTGACTTACGGTCAGGCTACCATCAATTGAGGATTAAGGAGAGTGATATCTCAAAGACAGTCTTTCGCACAAGGTATGGACATTACG
AGTTTAGGGTAATGTCGTTTGGCTTGACCAACGATCTAGCAGCATTTATGGGGTTGATGAACAAAGTGCTCAAAGAATTCTTAGACAACTTCGTGATTGTCTTCATCGAT
GATATCCTTGTGTATTCCAAGACTAGGGAACAACACGAGGAACATCTTAGGAAGGTGTTGACCACTCTGAGAACGAACAAGTTGTTTGCAAAGTTCTCCAAGTGTGAGTT
CTGGGCAGAACAAGTAGGATTTCTTGGGCATGTAGTTTCACGAATGAGAATCATCGTAGAGCCAGCAAAGATTGAAGCAGTGACGAATTGGCCTCGCCAGACCACAATGA
CTGAGAATATGAAATGTGATGTAGCTGAGTTTGTGAGCAAGTGTTTGGTTTGTCAACAAGTCAAAGTCCCAAGGCAGAAGCCAGCGGAGCTATTACAACCACTTAGCGTA
CTTGAATGGAAGTGGGAGAACATCGCCATGGACTTTATTGTGAGATTACCGAGGACGCCCAGAGGCTACACAGTTATATGGGTTGTGGTTGATAGGCTCACCAAGTCAGC
TCACTTCCCGCCAAGGAAGTCCACTTATTCCGTGGATAAATGGGCTCAGTTGTACTTGAAGGAAGTAACAAGATTGCATGGGGTACCGATGGATGTGCAAACTGAACGCT
TAAACCAAATTCTGGAGGACATGTTACGAGCTTGTATGTTGGACTTTGTCAGTAGCTGGGACACACACTTGCATCTCATAGAGTTTGTTTATAACAACAGTTATCAAGCG
ACATTGGACTCATCGAATGTGATAGATTTCAAACCTCTACGGTTGGATAAGGGTTTGAACCATGAGGAAGAACCAGTGCAAATTGTGGATAGAGAAGTGAAAACCCTACG
TAACCGAGAGATCGCATTTGTCAAAGTATTATGGCGGAATCACCAGCTAGAAGAAGTCACTTGGGAACGAGAAGAGAAAATGATGAGCAAGTACCCTGAGCTCTTCAGAG
CGGAAGGTTGTGTGCTGTTGTGTAATCTGATTAGGCTGTGTGTTGTGGTAGCAACGATTAAGGGGCTTGTGGTAATAGTTACACGCTGTGTGTTTCGGGTGTTTGTCTGG
ATGATTGTCGGGTTGAACATTGTTACTGCTAAGAATTTTGCTAGTGGCATGGGATTGATTTTGTGTTCGAGTAAGAAGTCCATCCCTAAAACAATGTTGAAGTCATCCAT
TTTTACCATCACAAAGTCAGTTTGCCCAGTCCAAGTCCACAGCTTGATCAGAGTTCTCTTTGCTACTTCTAAGACAGGCCTATACATAAGACCGCGTTCCACTAGCTCTT
CTATCTCTTCCAAGAGAGTGGACAAGTACTTTAGTGCCCTCATTTGGGGGTGCTATGATGGTGTTACTATAACATGTGCGGAAGCAATTTGGATCTTAGGCACTCTAAAC
AACGACGCCCTTCTCCATTTTCTCTCTTCCACACCTTTCATTCTTTCTTCGGATTCCTTCTCCCTTCTATTCTGGCAACGGCAACGACGCCAACCAGCCTTCCCGTTTGG
AAGCGGGCCAACGCTACCCAGCAGCTCTTCTTCCGGAAACGACAACGGTGGAAGTAGGGTTTCTGTAGAACTTGTCGGAGCATCTCATTTCTTTTGTGTTTGTTTTCAAT
TCATTTGA
Protein sequenceShow/hide protein sequence
MINLACGVIRISIKCSCLEIFDQQESLKLDMIDLASGVIRISNKIDLRSGYHQLRIKESDISKTVFRTRYGHYEFRVMSFGLTNDLAAFMGLMNKVLKEFLDNFVIVFID
DILVYSKTREQHEEHLRKVLTTLRTNKLFAKFSKCEFWAEQVGFLGHVVSRMRIIVEPAKIEAVTNWPRQTTMTENMKCDVAEFVSKCLVCQQVKVPRQKPAELLQPLSV
LEWKWENIAMDFIVRLPRTPRGYTVIWVVVDRLTKSAHFPPRKSTYSVDKWAQLYLKEVTRLHGVPMDVQTERLNQILEDMLRACMLDFVSSWDTHLHLIEFVYNNSYQA
TLDSSNVIDFKPLRLDKGLNHEEEPVQIVDREVKTLRNREIAFVKVLWRNHQLEEVTWEREEKMMSKYPELFRAEGCVLLCNLIRLCVVVATIKGLVVIVTRCVFRVFVW
MIVGLNIVTAKNFASGMGLILCSSKKSIPKTMLKSSIFTITKSVCPVQVHSLIRVLFATSKTGLYIRPRSTSSSISSKRVDKYFSALIWGCYDGVTITCAEAIWILGTLN
NDALLHFLSSTPFILSSDSFSLLFWQRQRRQPAFPFGSGPTLPSSSSSGNDNGGSRVSVELVGASHFFCVCFQFI