; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0021103 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0021103
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionTy3-gypsy retrotransposon protein
Genome locationchr07:14376775..14379199
RNA-Seq ExpressionPay0021103
SyntenyPay0021103
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032970.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]4.4e-11746.84Show/hide
Query:  MSCGLTNAPTVFMDLMNRVFKDFLDTFLIVFIDDILIYFKTEAEHEEHLHQVLKTLRANKLYAKFSKCEFWLKKVTFLGHMVFSEGVSVDPAKIEVV---
        MS GLTNAP VFMDLMNRVFKDFLDTF+IVFIDDILIY KTEAEH+EHLH+VL+TLRANKLYAKFSKCEFWLKKVTFLGH+VFSEGVS DP KIE V   
Subjt:  MSCGLTNAPTVFMDLMNRVFKDFLDTFLIVFIDDILIYFKTEAEHEEHLHQVLKTLRANKLYAKFSKCEFWLKKVTFLGHMVFSEGVSVDPAKIEVV---

Query:  ---------------TILTVSDGSRSFVIYSDASKKRLGCVFMQQ-------------------------------------------------------
                       T+LTV DGS SFVIYSDASK+ LGCV MQQ                                                       
Subjt:  ---------------TILTVSDGSRSFVIYSDASKKRLGCVFMQQ-------------------------------------------------------

Query:  -----------------------------GKANVVAGALYMKVAHSAALITKQAPLLKDFKRAEIVVSLNDPYLVEKHLLVEAGQGEDFSISSNDGLTFD
                                     GKANVVA AL  KVAHSAA ITK+APLL DF+RAEI VSLNDPYLV K LLV+AGQGEDFSISS+DGLTFD
Subjt:  -----------------------------GKANVVAGALYMKVAHSAALITKQAPLLKDFKRAEIVVSLNDPYLVEKHLLVEAGQGEDFSISSNDGLTFD

Query:  GRLCVPEDSAIKTELLTEAHSSPFTMHPGSTKMYKDLRNAR---------FTSKFWKGLQLALDTSLSF-------------------------------
        GRLCV ED+A+KT+LLTEAHS PFTM+PGS KMY+DLR             T++  KGLQLAL T L F                               
Subjt:  GRLCVPEDSAIKTELLTEAHSSPFTMHPGSTKMYKDLRNAR---------FTSKFWKGLQLALDTSLSF-------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------GVAPSFSAVHDVFIVSVWRKYVSNLTHVVDYEPLQINENLSYEEQ-PVEILAREVKMLRNRVISLVKVLWQNHGVEEATWEREDDMRVQYPELF
               + PSFSAVHDVF VS+ RKYV++ THVVD+EPLQINENLSYE+Q PVEI AREVK+L NR I+LVKVLW+NH VEEATWE E+D+R Q PELF
Subjt:  ------GVAPSFSAVHDVFIVSVWRKYVSNLTHVVDYEPLQINENLSYEEQ-PVEILAREVKMLRNRVISLVKVLWQNHGVEEATWEREDDMRVQYPELF

Query:  ED
        ED
Subjt:  ED

KAA0036823.1 pol protein [Cucumis melo var. makuwa]4.0e-11845.35Show/hide
Query:  MSCGLTNAPTVFMDLMNRVFKDFLDTFLIVFIDDILIYFKTEAEHEEHLHQVLKTLRANKLYAKFSKCEFWLKKVTFLGHMVFSEGVSVDPAKIEVVTIL
        MS GLTNAP VFM+LMNRVFKDF+DTF+IVFIDDILI  KTEAEHEEHL++VL+TL+ANKLYAKFSKCEFWLKKVTFLGH+V SEGVSVDPAKIE VT  
Subjt:  MSCGLTNAPTVFMDLMNRVFKDFLDTFLIVFIDDILIYFKTEAEHEEHLHQVLKTLRANKLYAKFSKCEFWLKKVTFLGHMVFSEGVSVDPAKIEVVTIL

Query:  -----------------------------------------------TVSDGSRSFVIYSDASKKRLGCVFMQQG-------------------------
                                                       T+ DGS SFVIY+DASKK LGCV MQQG                         
Subjt:  -----------------------------------------------TVSDGSRSFVIYSDASKKRLGCVFMQQG-------------------------

Query:  ---------------KANVVAGALYMKVAHSAALITKQAPLLKDFKRAEIVVSLNDPYLVEKHLLVEAGQGEDFSISSNDGLTFDGRLCVPEDSAIKTEL
                       KANVVA AL  KVAHSAALITKQAPLL+DF+RAEI VSL DPYLVEK  L+E GQGEDFSISS+DGL F+GRLCVPEDSA+K EL
Subjt:  ---------------KANVVAGALYMKVAHSAALITKQAPLLKDFKRAEIVVSLNDPYLVEKHLLVEAGQGEDFSISSNDGLTFDGRLCVPEDSAIKTEL

Query:  LTEAHSSPFTMHPGSTKMYKDLR-----------------------------------------------------------------------------
        L EAHSSPFTMHPGSTKMY+DLR                                                                             
Subjt:  LTEAHSSPFTMHPGSTKMYKDLR-----------------------------------------------------------------------------

Query:  ------------NARFTSKFWKGLQLALDTSLSF------------------------------------------------------------------
                    NARFTSKFWKGLQLAL T L F                                                                  
Subjt:  ------------NARFTSKFWKGLQLALDTSLSF------------------------------------------------------------------

Query:  ----------------------------------------GVAPSFSAVHDVFIVSVWRKYVSNLTHVVDYEPLQINENLSYEEQPVEILAREVKMLRNR
                                                 + PS SA+HDVF VS+ RKYV++ THVVD+EPLQ++ENLSYEEQPVEILAREVK LRNR
Subjt:  ----------------------------------------GVAPSFSAVHDVFIVSVWRKYVSNLTHVVDYEPLQINENLSYEEQPVEILAREVKMLRNR

Query:  VISLVKVLWQNHGVEEATWEREDDMRVQYPELFED
         ISLVKVLW+N GVEEAT ERE+DMR QY ELFED
Subjt:  VISLVKVLWQNHGVEEATWEREDDMRVQYPELFED

TYK18857.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]1.0e-12145.33Show/hide
Query:  MSCGLTNAPTVFMDLMNRVFKDFLDTFLIVFIDDILIYFKTEAEHEEHLHQVLKTLRANKLYAKFSKCEFWLKKVTFLGHMVFSEGVSVDPAKIEVVT--
        MS GLTNAP VFMD MNRVFKDFLD+F+IVFIDDILIYFKTEAEHEEHLHQVL+TLRANK+YAKFSKCEFWLKKVTFL H+V SE VSVDPAKIE VT  
Subjt:  MSCGLTNAPTVFMDLMNRVFKDFLDTFLIVFIDDILIYFKTEAEHEEHLHQVLKTLRANKLYAKFSKCEFWLKKVTFLGHMVFSEGVSVDPAKIEVVT--

Query:  ---------------------------------------ILTVSDGSRSFVIYSDASKKRLGCVFMQQ--------------------------------
                                               +LTVSDGS SFVIYSDASKK LGCV MQQ                                
Subjt:  ---------------------------------------ILTVSDGSRSFVIYSDASKKRLGCVFMQQ--------------------------------

Query:  ---------------------GKANVVAGALYMKVAHSAALITKQAPLLKDFKRAEIVVS-------------------------LNDPYLVEKHLLVEA
                             GKANVVA AL  KVAHSAALITKQAPLL+DFKRAEI VS                         LNDPYLVEK  LVE 
Subjt:  ---------------------GKANVVAGALYMKVAHSAALITKQAPLLKDFKRAEIVVS-------------------------LNDPYLVEKHLLVEA

Query:  GQGEDFSISSNDGLTFDGRLCVPEDSAIKTELLTEAHSSPFTMHPGSTKMYKDL----------------------------------------------
        GQGEDFSISSNDGLTF+GRL VP+DSA+K ELLTEAH SPFTMHPGSTKMY+DL                                              
Subjt:  GQGEDFSISSNDGLTFDGRLCVPEDSAIKTELLTEAHSSPFTMHPGSTKMYKDL----------------------------------------------

Query:  --------------------------------------------RNARFTSKFWKGLQLALDTSLSFGVA------------------------------
                                                    R+ARFTSKFWKGLQLAL T L F  A                              
Subjt:  --------------------------------------------RNARFTSKFWKGLQLALDTSLSFGVA------------------------------

Query:  ------------------------------------------------------------------------PSFSAVHDVFIVSVWRKYVSNLTHVVDY
                                                                                PS SAVHDVF VS+ RKYV++LTHVVD+
Subjt:  ------------------------------------------------------------------------PSFSAVHDVFIVSVWRKYVSNLTHVVDY

Query:  EPLQINENLSYEEQPVEILAREVKMLRNRVISLVKVLWQNHGVEEATWEREDDMRVQYPELFED
        EPLQI+ENLSYEEQ VEILAREVK LR+R ISLVKVLW+NHGVEEATWERE+DMR QYPELFED
Subjt:  EPLQINENLSYEEQPVEILAREVKMLRNRVISLVKVLWQNHGVEEATWEREDDMRVQYPELFED

TYK21999.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]4.4e-11746.84Show/hide
Query:  MSCGLTNAPTVFMDLMNRVFKDFLDTFLIVFIDDILIYFKTEAEHEEHLHQVLKTLRANKLYAKFSKCEFWLKKVTFLGHMVFSEGVSVDPAKIEVV---
        MS GLTNAP VFMDLMNRVFKDFLDTF+IVFIDDILIY KTEAEH+EHLH+VL+TLRANKLYAKFSKCEFWLKKVTFLGH+VFSEGVS DP KIE V   
Subjt:  MSCGLTNAPTVFMDLMNRVFKDFLDTFLIVFIDDILIYFKTEAEHEEHLHQVLKTLRANKLYAKFSKCEFWLKKVTFLGHMVFSEGVSVDPAKIEVV---

Query:  ---------------TILTVSDGSRSFVIYSDASKKRLGCVFMQQ-------------------------------------------------------
                       T+LTV DGS SFVIYSDASK+ LGCV MQQ                                                       
Subjt:  ---------------TILTVSDGSRSFVIYSDASKKRLGCVFMQQ-------------------------------------------------------

Query:  -----------------------------GKANVVAGALYMKVAHSAALITKQAPLLKDFKRAEIVVSLNDPYLVEKHLLVEAGQGEDFSISSNDGLTFD
                                     GKANVVA AL  KVAHSAA ITK+APLL DF+RAEI VSLNDPYLV K LLV+AGQGEDFSISS+DGLTFD
Subjt:  -----------------------------GKANVVAGALYMKVAHSAALITKQAPLLKDFKRAEIVVSLNDPYLVEKHLLVEAGQGEDFSISSNDGLTFD

Query:  GRLCVPEDSAIKTELLTEAHSSPFTMHPGSTKMYKDLRNAR---------FTSKFWKGLQLALDTSLSF-------------------------------
        GRLCV ED+A+KT+LLTEAHS PFTM+PGS KMY+DLR             T++  KGLQLAL T L F                               
Subjt:  GRLCVPEDSAIKTELLTEAHSSPFTMHPGSTKMYKDLRNAR---------FTSKFWKGLQLALDTSLSF-------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------GVAPSFSAVHDVFIVSVWRKYVSNLTHVVDYEPLQINENLSYEEQ-PVEILAREVKMLRNRVISLVKVLWQNHGVEEATWEREDDMRVQYPELF
               + PSFSAVHDVF VS+ RKYV++ THVVD+EPLQINENLSYE+Q PVEI AREVK+L NR I+LVKVLW+NH VEEATWE E+D+R Q PELF
Subjt:  ------GVAPSFSAVHDVFIVSVWRKYVSNLTHVVDYEPLQINENLSYEEQ-PVEILAREVKMLRNRVISLVKVLWQNHGVEEATWEREDDMRVQYPELF

Query:  ED
        ED
Subjt:  ED

TYK23820.1 retrotransposon protein, putative, Ty3-gypsy sub-class [Cucumis melo var. makuwa]4.7e-11952.51Show/hide
Query:  MSCGLTNAPTVFMDLMNRVFKDFLDTFLIVFIDDILIYFKTEAEHEEHLHQVLKTLRANKLYAKFSKCEFWLKKVTFLGHMVFSEGVSVDPAKIEV----
        MSCGLTNAPT                              TEAEHEEHLHQVLKTLRANKLYAKFSKCEFWLKKVTFLGHMVFSEGVSVDP KIEV    
Subjt:  MSCGLTNAPTVFMDLMNRVFKDFLDTFLIVFIDDILIYFKTEAEHEEHLHQVLKTLRANKLYAKFSKCEFWLKKVTFLGHMVFSEGVSVDPAKIEV----

Query:  -------------------------------------------------------VTILTVSDGSRSFVIYSDASKKR---------LGC-VFMQQGKAN
                                                               + I T     + F    + + ++           C +    GK N
Subjt:  -------------------------------------------------------VTILTVSDGSRSFVIYSDASKKR---------LGC-VFMQQGKAN

Query:  VVAGALYMKVAHSAALITKQAPLLKDFKRAEIVVSLNDPYLVEKHLLVEAGQGEDFSISSNDGLTFDGRLCVPEDSAIKTELLTEAHSSPFTMHPGSTKM
        VVA ALY KVAHSAALITKQAPLLKDFKR EIVVSLNDPYLVEK LLVEAGQGEDFSISS+DGLTFDGRLCVPEDSAIKTELLTEAHSSPFTMHPGSTKM
Subjt:  VVAGALYMKVAHSAALITKQAPLLKDFKRAEIVVSLNDPYLVEKHLLVEAGQGEDFSISSNDGLTFDGRLCVPEDSAIKTELLTEAHSSPFTMHPGSTKM

Query:  YKDL--------------------------------------------------RNARFTSKFWKGLQLALDTS--LSFGVAP-----------------
        YKDL                                                  RNARFTSKFWKGLQLALD S   + G+ P                 
Subjt:  YKDL--------------------------------------------------RNARFTSKFWKGLQLALDTS--LSFGVAP-----------------

Query:  ---------------------------SFSAVHDVFIVSVWRKYVSNLTHVVDYEPLQINENLSYEEQPVEILAREVKMLRNRVISLVKVLWQNHGVEEA
                                     S +HDV  VSVWRKYV+NLTHVVD EPLQINENL YEEQ VEILAREVKMLRNRVISLVKVLWQNHGVEEA
Subjt:  ---------------------------SFSAVHDVFIVSVWRKYVSNLTHVVDYEPLQINENLSYEEQPVEILAREVKMLRNRVISLVKVLWQNHGVEEA

Query:  TWEREDDMRVQYPELFED
        TWEREDD+RVQYPELFED
Subjt:  TWEREDDMRVQYPELFED

TrEMBL top hitse value%identityAlignment
A0A5A7SPY5 Reverse transcriptase2.1e-11746.84Show/hide
Query:  MSCGLTNAPTVFMDLMNRVFKDFLDTFLIVFIDDILIYFKTEAEHEEHLHQVLKTLRANKLYAKFSKCEFWLKKVTFLGHMVFSEGVSVDPAKIEVV---
        MS GLTNAP VFMDLMNRVFKDFLDTF+IVFIDDILIY KTEAEH+EHLH+VL+TLRANKLYAKFSKCEFWLKKVTFLGH+VFSEGVS DP KIE V   
Subjt:  MSCGLTNAPTVFMDLMNRVFKDFLDTFLIVFIDDILIYFKTEAEHEEHLHQVLKTLRANKLYAKFSKCEFWLKKVTFLGHMVFSEGVSVDPAKIEVV---

Query:  ---------------TILTVSDGSRSFVIYSDASKKRLGCVFMQQ-------------------------------------------------------
                       T+LTV DGS SFVIYSDASK+ LGCV MQQ                                                       
Subjt:  ---------------TILTVSDGSRSFVIYSDASKKRLGCVFMQQ-------------------------------------------------------

Query:  -----------------------------GKANVVAGALYMKVAHSAALITKQAPLLKDFKRAEIVVSLNDPYLVEKHLLVEAGQGEDFSISSNDGLTFD
                                     GKANVVA AL  KVAHSAA ITK+APLL DF+RAEI VSLNDPYLV K LLV+AGQGEDFSISS+DGLTFD
Subjt:  -----------------------------GKANVVAGALYMKVAHSAALITKQAPLLKDFKRAEIVVSLNDPYLVEKHLLVEAGQGEDFSISSNDGLTFD

Query:  GRLCVPEDSAIKTELLTEAHSSPFTMHPGSTKMYKDLRNAR---------FTSKFWKGLQLALDTSLSF-------------------------------
        GRLCV ED+A+KT+LLTEAHS PFTM+PGS KMY+DLR             T++  KGLQLAL T L F                               
Subjt:  GRLCVPEDSAIKTELLTEAHSSPFTMHPGSTKMYKDLRNAR---------FTSKFWKGLQLALDTSLSF-------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------GVAPSFSAVHDVFIVSVWRKYVSNLTHVVDYEPLQINENLSYEEQ-PVEILAREVKMLRNRVISLVKVLWQNHGVEEATWEREDDMRVQYPELF
               + PSFSAVHDVF VS+ RKYV++ THVVD+EPLQINENLSYE+Q PVEI AREVK+L NR I+LVKVLW+NH VEEATWE E+D+R Q PELF
Subjt:  ------GVAPSFSAVHDVFIVSVWRKYVSNLTHVVDYEPLQINENLSYEEQ-PVEILAREVKMLRNRVISLVKVLWQNHGVEEATWEREDDMRVQYPELF

Query:  ED
        ED
Subjt:  ED

A0A5A7T1M8 Pol protein1.9e-11845.35Show/hide
Query:  MSCGLTNAPTVFMDLMNRVFKDFLDTFLIVFIDDILIYFKTEAEHEEHLHQVLKTLRANKLYAKFSKCEFWLKKVTFLGHMVFSEGVSVDPAKIEVVTIL
        MS GLTNAP VFM+LMNRVFKDF+DTF+IVFIDDILI  KTEAEHEEHL++VL+TL+ANKLYAKFSKCEFWLKKVTFLGH+V SEGVSVDPAKIE VT  
Subjt:  MSCGLTNAPTVFMDLMNRVFKDFLDTFLIVFIDDILIYFKTEAEHEEHLHQVLKTLRANKLYAKFSKCEFWLKKVTFLGHMVFSEGVSVDPAKIEVVTIL

Query:  -----------------------------------------------TVSDGSRSFVIYSDASKKRLGCVFMQQG-------------------------
                                                       T+ DGS SFVIY+DASKK LGCV MQQG                         
Subjt:  -----------------------------------------------TVSDGSRSFVIYSDASKKRLGCVFMQQG-------------------------

Query:  ---------------KANVVAGALYMKVAHSAALITKQAPLLKDFKRAEIVVSLNDPYLVEKHLLVEAGQGEDFSISSNDGLTFDGRLCVPEDSAIKTEL
                       KANVVA AL  KVAHSAALITKQAPLL+DF+RAEI VSL DPYLVEK  L+E GQGEDFSISS+DGL F+GRLCVPEDSA+K EL
Subjt:  ---------------KANVVAGALYMKVAHSAALITKQAPLLKDFKRAEIVVSLNDPYLVEKHLLVEAGQGEDFSISSNDGLTFDGRLCVPEDSAIKTEL

Query:  LTEAHSSPFTMHPGSTKMYKDLR-----------------------------------------------------------------------------
        L EAHSSPFTMHPGSTKMY+DLR                                                                             
Subjt:  LTEAHSSPFTMHPGSTKMYKDLR-----------------------------------------------------------------------------

Query:  ------------NARFTSKFWKGLQLALDTSLSF------------------------------------------------------------------
                    NARFTSKFWKGLQLAL T L F                                                                  
Subjt:  ------------NARFTSKFWKGLQLALDTSLSF------------------------------------------------------------------

Query:  ----------------------------------------GVAPSFSAVHDVFIVSVWRKYVSNLTHVVDYEPLQINENLSYEEQPVEILAREVKMLRNR
                                                 + PS SA+HDVF VS+ RKYV++ THVVD+EPLQ++ENLSYEEQPVEILAREVK LRNR
Subjt:  ----------------------------------------GVAPSFSAVHDVFIVSVWRKYVSNLTHVVDYEPLQINENLSYEEQPVEILAREVKMLRNR

Query:  VISLVKVLWQNHGVEEATWEREDDMRVQYPELFED
         ISLVKVLW+N GVEEAT ERE+DMR QY ELFED
Subjt:  VISLVKVLWQNHGVEEATWEREDDMRVQYPELFED

A0A5D3D5M9 Ty3-gypsy retrotransposon protein4.9e-12245.33Show/hide
Query:  MSCGLTNAPTVFMDLMNRVFKDFLDTFLIVFIDDILIYFKTEAEHEEHLHQVLKTLRANKLYAKFSKCEFWLKKVTFLGHMVFSEGVSVDPAKIEVVT--
        MS GLTNAP VFMD MNRVFKDFLD+F+IVFIDDILIYFKTEAEHEEHLHQVL+TLRANK+YAKFSKCEFWLKKVTFL H+V SE VSVDPAKIE VT  
Subjt:  MSCGLTNAPTVFMDLMNRVFKDFLDTFLIVFIDDILIYFKTEAEHEEHLHQVLKTLRANKLYAKFSKCEFWLKKVTFLGHMVFSEGVSVDPAKIEVVT--

Query:  ---------------------------------------ILTVSDGSRSFVIYSDASKKRLGCVFMQQ--------------------------------
                                               +LTVSDGS SFVIYSDASKK LGCV MQQ                                
Subjt:  ---------------------------------------ILTVSDGSRSFVIYSDASKKRLGCVFMQQ--------------------------------

Query:  ---------------------GKANVVAGALYMKVAHSAALITKQAPLLKDFKRAEIVVS-------------------------LNDPYLVEKHLLVEA
                             GKANVVA AL  KVAHSAALITKQAPLL+DFKRAEI VS                         LNDPYLVEK  LVE 
Subjt:  ---------------------GKANVVAGALYMKVAHSAALITKQAPLLKDFKRAEIVVS-------------------------LNDPYLVEKHLLVEA

Query:  GQGEDFSISSNDGLTFDGRLCVPEDSAIKTELLTEAHSSPFTMHPGSTKMYKDL----------------------------------------------
        GQGEDFSISSNDGLTF+GRL VP+DSA+K ELLTEAH SPFTMHPGSTKMY+DL                                              
Subjt:  GQGEDFSISSNDGLTFDGRLCVPEDSAIKTELLTEAHSSPFTMHPGSTKMYKDL----------------------------------------------

Query:  --------------------------------------------RNARFTSKFWKGLQLALDTSLSFGVA------------------------------
                                                    R+ARFTSKFWKGLQLAL T L F  A                              
Subjt:  --------------------------------------------RNARFTSKFWKGLQLALDTSLSFGVA------------------------------

Query:  ------------------------------------------------------------------------PSFSAVHDVFIVSVWRKYVSNLTHVVDY
                                                                                PS SAVHDVF VS+ RKYV++LTHVVD+
Subjt:  ------------------------------------------------------------------------PSFSAVHDVFIVSVWRKYVSNLTHVVDY

Query:  EPLQINENLSYEEQPVEILAREVKMLRNRVISLVKVLWQNHGVEEATWEREDDMRVQYPELFED
        EPLQI+ENLSYEEQ VEILAREVK LR+R ISLVKVLW+NHGVEEATWERE+DMR QYPELFED
Subjt:  EPLQINENLSYEEQPVEILAREVKMLRNRVISLVKVLWQNHGVEEATWEREDDMRVQYPELFED

A0A5D3DF76 Reverse transcriptase2.1e-11746.84Show/hide
Query:  MSCGLTNAPTVFMDLMNRVFKDFLDTFLIVFIDDILIYFKTEAEHEEHLHQVLKTLRANKLYAKFSKCEFWLKKVTFLGHMVFSEGVSVDPAKIEVV---
        MS GLTNAP VFMDLMNRVFKDFLDTF+IVFIDDILIY KTEAEH+EHLH+VL+TLRANKLYAKFSKCEFWLKKVTFLGH+VFSEGVS DP KIE V   
Subjt:  MSCGLTNAPTVFMDLMNRVFKDFLDTFLIVFIDDILIYFKTEAEHEEHLHQVLKTLRANKLYAKFSKCEFWLKKVTFLGHMVFSEGVSVDPAKIEVV---

Query:  ---------------TILTVSDGSRSFVIYSDASKKRLGCVFMQQ-------------------------------------------------------
                       T+LTV DGS SFVIYSDASK+ LGCV MQQ                                                       
Subjt:  ---------------TILTVSDGSRSFVIYSDASKKRLGCVFMQQ-------------------------------------------------------

Query:  -----------------------------GKANVVAGALYMKVAHSAALITKQAPLLKDFKRAEIVVSLNDPYLVEKHLLVEAGQGEDFSISSNDGLTFD
                                     GKANVVA AL  KVAHSAA ITK+APLL DF+RAEI VSLNDPYLV K LLV+AGQGEDFSISS+DGLTFD
Subjt:  -----------------------------GKANVVAGALYMKVAHSAALITKQAPLLKDFKRAEIVVSLNDPYLVEKHLLVEAGQGEDFSISSNDGLTFD

Query:  GRLCVPEDSAIKTELLTEAHSSPFTMHPGSTKMYKDLRNAR---------FTSKFWKGLQLALDTSLSF-------------------------------
        GRLCV ED+A+KT+LLTEAHS PFTM+PGS KMY+DLR             T++  KGLQLAL T L F                               
Subjt:  GRLCVPEDSAIKTELLTEAHSSPFTMHPGSTKMYKDLRNAR---------FTSKFWKGLQLALDTSLSF-------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------GVAPSFSAVHDVFIVSVWRKYVSNLTHVVDYEPLQINENLSYEEQ-PVEILAREVKMLRNRVISLVKVLWQNHGVEEATWEREDDMRVQYPELF
               + PSFSAVHDVF VS+ RKYV++ THVVD+EPLQINENLSYE+Q PVEI AREVK+L NR I+LVKVLW+NH VEEATWE E+D+R Q PELF
Subjt:  ------GVAPSFSAVHDVFIVSVWRKYVSNLTHVVDYEPLQINENLSYEEQ-PVEILAREVKMLRNRVISLVKVLWQNHGVEEATWEREDDMRVQYPELF

Query:  ED
        ED
Subjt:  ED

A0A5D3DK87 Retrotransposon protein, putative, Ty3-gypsy sub-class2.3e-11952.51Show/hide
Query:  MSCGLTNAPTVFMDLMNRVFKDFLDTFLIVFIDDILIYFKTEAEHEEHLHQVLKTLRANKLYAKFSKCEFWLKKVTFLGHMVFSEGVSVDPAKIEV----
        MSCGLTNAPT                              TEAEHEEHLHQVLKTLRANKLYAKFSKCEFWLKKVTFLGHMVFSEGVSVDP KIEV    
Subjt:  MSCGLTNAPTVFMDLMNRVFKDFLDTFLIVFIDDILIYFKTEAEHEEHLHQVLKTLRANKLYAKFSKCEFWLKKVTFLGHMVFSEGVSVDPAKIEV----

Query:  -------------------------------------------------------VTILTVSDGSRSFVIYSDASKKR---------LGC-VFMQQGKAN
                                                               + I T     + F    + + ++           C +    GK N
Subjt:  -------------------------------------------------------VTILTVSDGSRSFVIYSDASKKR---------LGC-VFMQQGKAN

Query:  VVAGALYMKVAHSAALITKQAPLLKDFKRAEIVVSLNDPYLVEKHLLVEAGQGEDFSISSNDGLTFDGRLCVPEDSAIKTELLTEAHSSPFTMHPGSTKM
        VVA ALY KVAHSAALITKQAPLLKDFKR EIVVSLNDPYLVEK LLVEAGQGEDFSISS+DGLTFDGRLCVPEDSAIKTELLTEAHSSPFTMHPGSTKM
Subjt:  VVAGALYMKVAHSAALITKQAPLLKDFKRAEIVVSLNDPYLVEKHLLVEAGQGEDFSISSNDGLTFDGRLCVPEDSAIKTELLTEAHSSPFTMHPGSTKM

Query:  YKDL--------------------------------------------------RNARFTSKFWKGLQLALDTS--LSFGVAP-----------------
        YKDL                                                  RNARFTSKFWKGLQLALD S   + G+ P                 
Subjt:  YKDL--------------------------------------------------RNARFTSKFWKGLQLALDTS--LSFGVAP-----------------

Query:  ---------------------------SFSAVHDVFIVSVWRKYVSNLTHVVDYEPLQINENLSYEEQPVEILAREVKMLRNRVISLVKVLWQNHGVEEA
                                     S +HDV  VSVWRKYV+NLTHVVD EPLQINENL YEEQ VEILAREVKMLRNRVISLVKVLWQNHGVEEA
Subjt:  ---------------------------SFSAVHDVFIVSVWRKYVSNLTHVVDYEPLQINENLSYEEQPVEILAREVKMLRNRVISLVKVLWQNHGVEEA

Query:  TWEREDDMRVQYPELFED
        TWEREDD+RVQYPELFED
Subjt:  TWEREDDMRVQYPELFED

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.61.3e-1035.05Show/hide
Query:  MSCGLTNAPTVFMDLMNRVFKDFLDTFLIVFIDDILIYFKTEAEHEEHLHQVLKTLRANKLYAKFSKCEFWLKKVTFLGHMVFSEGVSVDPAKIEVV
        M  GL NAP  F   MN + +  L+   +V++DDI+++  +  EH + L  V + L    L  +  KCEF  ++ TFLGH++  +G+  +P KIE +
Subjt:  MSCGLTNAPTVFMDLMNRVFKDFLDTFLIVFIDDILIYFKTEAEHEEHLHQVLKTLRANKLYAKFSKCEFWLKKVTFLGHMVFSEGVSVDPAKIEVV

P0CT41 Transposon Tf2-12 polyprotein1.0e-0730.93Show/hide
Query:  MSCGLTNAPTVFMDLMNRVFKDFLDTFLIVFIDDILIYFKTEAEHEEHLHQVLKTLRANKLYAKFSKCEFWLKKVTFLGHMVFSEGVSVDPAKIEVV
        M  G++ AP  F   +N +  +  ++ ++ ++DDILI+ K+E+EH +H+  VL+ L+   L    +KCEF   +V F+G+ +  +G +     I+ V
Subjt:  MSCGLTNAPTVFMDLMNRVFKDFLDTFLIVFIDDILIYFKTEAEHEEHLHQVLKTLRANKLYAKFSKCEFWLKKVTFLGHMVFSEGVSVDPAKIEVV

P20825 Retrovirus-related Pol polyprotein from transposon 2972.8e-1034.02Show/hide
Query:  MSCGLTNAPTVFMDLMNRVFKDFLDTFLIVFIDDILIYFKTEAEHEEHLHQVLKTLRANKLYAKFSKCEFWLKKVTFLGHMVFSEGVSVDPAKIEVV
        M  GL NAP  F   MN + +  L+   +V++DDI+I+  +  EH   +  V   L    L  +  KCEF  K+  FLGH+V  +G+  +P K++ +
Subjt:  MSCGLTNAPTVFMDLMNRVFKDFLDTFLIVFIDDILIYFKTEAEHEEHLHQVLKTLRANKLYAKFSKCEFWLKKVTFLGHMVFSEGVSVDPAKIEVV

Q89703 Putative enzymatic polyprotein1.6e-0836.36Show/hide
Query:  GLTNAPTVFMDLMNRVFKDFLDTFLIVFIDDILIYFKTEAEHEEHLHQVLKTLRANKLYAKFSKCEFWLKKVTFLGHMVFSEGVSVDP
        G  N+P++F   M+R+F+ + D F+IV+IDDIL++ KT  EH+ H+ +      AN L     K E   +K+ FLG  +   G+ + P
Subjt:  GLTNAPTVFMDLMNRVFKDFLDTFLIVFIDDILIYFKTEAEHEEHLHQVLKTLRANKLYAKFSKCEFWLKKVTFLGHMVFSEGVSVDP

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus1.0e-0729.47Show/hide
Query:  GLTNAPTVFMDLMNRVFKDFLDTFLIVFIDDILIYFKTEAEHEEHLHQVLKTLRANKLYAKFSKCEFWLKKVTFLGHMVFSEGVSVDPAKIEVVT
        GL NAP +F  +++ + ++ +     V+IDDI+++ +    H ++L  VL +L    L     K  F   +V FLG++V ++G+  DP K+  ++
Subjt:  GLTNAPTVFMDLMNRVFKDFLDTFLIVFIDDILIYFKTEAEHEEHLHQVLKTLRANKLYAKFSKCEFWLKKVTFLGHMVFSEGVSVDPAKIEVVT

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein4.1e-0440.38Show/hide
Query:  HLHQVLKTLRANKLYAKFSKCEFWLKKVTFLG--HMVFSEGVSVDPAKIEVV
        HL  VL+    ++ YA   KC F   ++ +LG  H++  EGVS DPAK+E +
Subjt:  HLHQVLKTLRANKLYAKFSKCEFWLKKVTFLG--HMVFSEGVSVDPAKIEVV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTGTGGTTTGACTAATGCTCCTACGGTATTCATGGACTTGATGAACAGAGTGTTTAAGGATTTCTTAGACACGTTTTTGATAGTTTTCATTGACGACATTTTGAT
TTACTTCAAGACTGAGGCTGAGCATGAGGAGCATTTGCATCAGGTTTTGAAGACTCTTCGAGCTAATAAGCTGTATGCCAAGTTCTCCAAGTGTGAATTCTGGCTGAAGA
AGGTGACTTTTCTTGGCCATATGGTTTTCAGTGAGGGAGTTTCTGTGGACCCAGCAAAGATCGAAGTGGTTACCATCCTTACAGTGTCGGATGGATCTAGAAGTTTCGTG
ATCTACAGTGATGCCTCAAAGAAACGATTGGGTTGTGTTTTCATGCAGCAAGGTAAGGCAAATGTAGTAGCTGGTGCGCTGTATATGAAGGTTGCACATTCAGCAGCGCT
TATCACCAAGCAAGCTCCCTTGCTCAAAGATTTTAAGAGAGCCGAGATTGTAGTCTCGCTAAACGATCCTTATTTGGTCGAGAAGCATCTATTAGTAGAGGCAGGGCAAG
GTGAGGATTTCTCCATATCCTCCAATGATGGACTTACATTTGATGGACGTTTGTGTGTGCCAGAAGACAGTGCAATCAAGACAGAGCTTTTGACTGAGGCTCACAGTTCT
CCATTTACTATGCACCCTGGAAGTACGAAGATGTACAAAGACTTGAGAAATGCTCGTTTCACATCAAAGTTTTGGAAAGGACTTCAGCTGGCGTTGGACACGAGCTTATC
GTTTGGCGTTGCCCCATCATTTTCTGCAGTGCATGACGTATTCATTGTCTCCGTGTGGAGGAAGTATGTTTCAAACCTGACGCATGTAGTTGACTACGAGCCATTGCAAA
TTAATGAGAACTTGAGCTACGAGGAGCAACCTGTTGAGATTTTGGCAAGGGAGGTCAAGATGCTCCGTAATCGAGTAATTTCACTGGTCAAAGTTCTTTGGCAAAACCAC
GGAGTTGAAGAGGCCACATGGGAGAGAGAAGATGACATGAGAGTCCAGTACCCCGAGCTGTTCGAGGATTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCTTGTGGTTTGACTAATGCTCCTACGGTATTCATGGACTTGATGAACAGAGTGTTTAAGGATTTCTTAGACACGTTTTTGATAGTTTTCATTGACGACATTTTGAT
TTACTTCAAGACTGAGGCTGAGCATGAGGAGCATTTGCATCAGGTTTTGAAGACTCTTCGAGCTAATAAGCTGTATGCCAAGTTCTCCAAGTGTGAATTCTGGCTGAAGA
AGGTGACTTTTCTTGGCCATATGGTTTTCAGTGAGGGAGTTTCTGTGGACCCAGCAAAGATCGAAGTGGTTACCATCCTTACAGTGTCGGATGGATCTAGAAGTTTCGTG
ATCTACAGTGATGCCTCAAAGAAACGATTGGGTTGTGTTTTCATGCAGCAAGGTAAGGCAAATGTAGTAGCTGGTGCGCTGTATATGAAGGTTGCACATTCAGCAGCGCT
TATCACCAAGCAAGCTCCCTTGCTCAAAGATTTTAAGAGAGCCGAGATTGTAGTCTCGCTAAACGATCCTTATTTGGTCGAGAAGCATCTATTAGTAGAGGCAGGGCAAG
GTGAGGATTTCTCCATATCCTCCAATGATGGACTTACATTTGATGGACGTTTGTGTGTGCCAGAAGACAGTGCAATCAAGACAGAGCTTTTGACTGAGGCTCACAGTTCT
CCATTTACTATGCACCCTGGAAGTACGAAGATGTACAAAGACTTGAGAAATGCTCGTTTCACATCAAAGTTTTGGAAAGGACTTCAGCTGGCGTTGGACACGAGCTTATC
GTTTGGCGTTGCCCCATCATTTTCTGCAGTGCATGACGTATTCATTGTCTCCGTGTGGAGGAAGTATGTTTCAAACCTGACGCATGTAGTTGACTACGAGCCATTGCAAA
TTAATGAGAACTTGAGCTACGAGGAGCAACCTGTTGAGATTTTGGCAAGGGAGGTCAAGATGCTCCGTAATCGAGTAATTTCACTGGTCAAAGTTCTTTGGCAAAACCAC
GGAGTTGAAGAGGCCACATGGGAGAGAGAAGATGACATGAGAGTCCAGTACCCCGAGCTGTTCGAGGATTAG
Protein sequenceShow/hide protein sequence
MSCGLTNAPTVFMDLMNRVFKDFLDTFLIVFIDDILIYFKTEAEHEEHLHQVLKTLRANKLYAKFSKCEFWLKKVTFLGHMVFSEGVSVDPAKIEVVTILTVSDGSRSFV
IYSDASKKRLGCVFMQQGKANVVAGALYMKVAHSAALITKQAPLLKDFKRAEIVVSLNDPYLVEKHLLVEAGQGEDFSISSNDGLTFDGRLCVPEDSAIKTELLTEAHSS
PFTMHPGSTKMYKDLRNARFTSKFWKGLQLALDTSLSFGVAPSFSAVHDVFIVSVWRKYVSNLTHVVDYEPLQINENLSYEEQPVEILAREVKMLRNRVISLVKVLWQNH
GVEEATWEREDDMRVQYPELFED