; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh04G020050 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh04G020050
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
DescriptionTransposon Ty3-G Gag-Pol polyprotein
Genome locationCma_Chr04:12388487..12396082
RNA-Seq ExpressionCmaCh04G020050
SyntenyCmaCh04G020050
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR005162 - Retrotransposon gag domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_023520282.1 uncharacterized protein LOC111783592 [Cucurbita pepo subsp. pepo]8.9e-10672.44Show/hide
Query:  PRGPPRADTYEGNNSDHHEDNPHVVGHGLMQGRDHGRRYHNLQQRVPYDDRIDRNVGSIKLKLLKFYGKTDPEEYLQWEKTVESVFNCHNFSDEKKVLLY
        P+  P  DTYEG+NSDHHEDNPH VGHGLM+GRDHGRRYHNLQQRVPYDDRIDRNVGSIKLKL KFYGKTDPEEYLQWEKTVESVFNCHNFSD+KKVLL 
Subjt:  PRGPPRADTYEGNNSDHHEDNPHVVGHGLMQGRDHGRRYHNLQQRVPYDDRIDRNVGSIKLKLLKFYGKTDPEEYLQWEKTVESVFNCHNFSDEKKVLLY

Query:  IAQFKQYAQIWWDKLMSSRRRNLEAPIDSWVEFKESMRKRFVPRYFHRDMAQKLQALKQGHKSVEDYYKEMDTLMDLLELDEDMEALMARYLNGLNTEIT
        IAQFKQYAQIWWDKLMSSRRRNLEAPIDSW EFKESMRKRFVP+YF RDMAQKLQALKQG KSVEDYYKEMDTLMD L        LMAR+LNGLNTEI 
Subjt:  IAQFKQYAQIWWDKLMSSRRRNLEAPIDSWVEFKESMRKRFVPRYFHRDMAQKLQALKQGHKSVEDYYKEMDTLMDLLELDEDMEALMARYLNGLNTEIT

Query:  NKINLQPYSNIEELLHIAIKIESDFVVLLQEFEDLFFEEKPSS-LPPLRGIEHK-----IDFIPGAPIPNRPAYRTNPKEVEE
        +K +LQPYSNIEELLHIAIKIE       Q +    F    S+     + I++K     I+  P A      + RT  ++VE+
Subjt:  NKINLQPYSNIEELLHIAIKIESDFVVLLQEFEDLFFEEKPSS-LPPLRGIEHK-----IDFIPGAPIPNRPAYRTNPKEVEE

XP_023520282.1 uncharacterized protein LOC111783592 [Cucurbita pepo subsp. pepo]2.7e-2291.94Show/hide
Query:  IESDFVVLLQEFEDLFFEEKPSSLPPLRGIEHKIDFIPGAPIPNRPAYRTNPKEVEEIQRQV
        + SDFVVLLQEFEDLF EE PSSLPPLRGIEHKIDFIPGAPIPNRPAYRTNPKE EEIQRQV
Subjt:  IESDFVVLLQEFEDLFFEEKPSSLPPLRGIEHKIDFIPGAPIPNRPAYRTNPKEVEEIQRQV

XP_023520835.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111784339 [Cucurbita pepo subsp. pepo]2.3e-15445.14Show/hide
Query:  PRGPPRADTYEGNNSDHHEDNPHVVGHGLMQGRDHGRRYHNLQQRVPYDDRIDRNVGSIKLKLLKFYGKTDPEEYLQWEKTVESVFNCHNFSDEKKVLLY
        P+  P  DTYEG+NSDHHEDNPH VGHGLM+GRDHGRRYHNLQQRVPYDDRIDRNVGSIKLKL KFYGKTDPEEYLQWEKTVESVFNCHNFSD+KKVLL 
Subjt:  PRGPPRADTYEGNNSDHHEDNPHVVGHGLMQGRDHGRRYHNLQQRVPYDDRIDRNVGSIKLKLLKFYGKTDPEEYLQWEKTVESVFNCHNFSDEKKVLLY

Query:  IAQFKQYAQIWWDKLMSSRRRNLEAPIDSWVEFKESMRKRFVPRYFHRDMAQKLQALKQGHKSVEDYYKEMDTLMDLLELDEDMEALMARYLNGLNTEIT
        IAQFKQYAQIWWDKLMSSRRRNLEAPIDSW EFKESMRKRFVP+YF RDMAQKLQALKQG KSVEDYYKEMDTLMD L+LDEDMEALMAR+LNGLNTEI 
Subjt:  IAQFKQYAQIWWDKLMSSRRRNLEAPIDSWVEFKESMRKRFVPRYFHRDMAQKLQALKQGHKSVEDYYKEMDTLMDLLELDEDMEALMARYLNGLNTEIT

Query:  NKINLQPYSNIEELLHIAIKIE------------------------------------------------------------------------------
        +K +LQPYSNIEELLHIAIKIE                                                                              
Subjt:  NKINLQPYSNIEELLHIAIKIE------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------------------------------SDFVVLLQEFEDLF
                                                                                              SDFVVLLQEFEDLF
Subjt:  --------------------------------------------------------------------------------------SDFVVLLQEFEDLF

Query:  FEEKPSSLPPLRGIEHKIDFIPGAPIPNRPAYRTNPKEVEEIQRQVSELLAKG------------------------------AINKITIKYRHPIPRLD
         EE PSSLPPLRGIEHKIDFIPGAPIPNRPAYRTNPKE EEIQRQVSELLAKG                              AINKITIKYRHPIPRLD
Subjt:  FEEKPSSLPPLRGIEHKIDFIPGAPIPNRPAYRTNPKEVEEIQRQVSELLAKG------------------------------AINKITIKYRHPIPRLD

Query:  DMLDELNGCSLFTKIDLKSGYHQIRMHIGDEWKTTFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHVLREYL
        DMLDEL+GCSLFTKIDLKS YHQIRMHIGDEWKT FKTKYGLYEWLVMPFGLTNAPSTF RLMNHVLREYL
Subjt:  DMLDELNGCSLFTKIDLKSGYHQIRMHIGDEWKTTFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHVLREYL

XP_023541047.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111801285 [Cucurbita pepo subsp. pepo]4.6e-11046.88Show/hide
Query:  PRGPPRADTYEGNNSDHHEDNPHVVGHGLMQGRDHGRRYHNLQQRVPYDDRIDRNVGSIKLKLLKFYGKTDPEEYLQWEKTVESVFNCHNFSDEKKVLLY
        P   P  +TYEG+N DHHEDNPH VGHGLM+GRDHGRRYHNLQQRVPYDDRIDRNVGSIKLKL KFYGKTDP+EY +WEKTV+SVFNCHNFSDEKKVLL 
Subjt:  PRGPPRADTYEGNNSDHHEDNPHVVGHGLMQGRDHGRRYHNLQQRVPYDDRIDRNVGSIKLKLLKFYGKTDPEEYLQWEKTVESVFNCHNFSDEKKVLLY

Query:  IAQFKQYAQIWWDKLMSSRRRNLEAPIDSWVEFKESMRKRFVPRYFHRDMAQKLQALKQGHKSVEDYYKEMDTLMDLLELDEDMEALMARYLNGLNTEIT
        IAQFKQYAQIWWDKLMSSRRRNLEAPIDSWVEFKESMRK FVP+YF RDMAQKLQALKQG KS EDYYKEMDTLMD LELDE+MEALMAR+LNGLNT+I 
Subjt:  IAQFKQYAQIWWDKLMSSRRRNLEAPIDSWVEFKESMRKRFVPRYFHRDMAQKLQALKQGHKSVEDYYKEMDTLMDLLELDEDMEALMARYLNGLNTEIT

Query:  NKINLQPYSNIEELLHIAIKIESDF---------------------------------------------------------------------------
        +K +LQPYSNIEELLHIAIKI+                                                                              
Subjt:  NKINLQPYSNIEELLHIAIKIESDF---------------------------------------------------------------------------

Query:  ----------VVLLQE------------------------------------------------------------------------------------
                  ++ ++E                                                                                    
Subjt:  ----------VVLLQE------------------------------------------------------------------------------------

Query:  -------------------------------------------------------------------FEDLFFEEKPSSLPPLRGIEHKIDFIPGAPIPN
                                                                           FEDLF EE PSSLPPLRGIEHKIDFIPGAPIPN
Subjt:  -------------------------------------------------------------------FEDLFFEEKPSSLPPLRGIEHKIDFIPGAPIPN

Query:  RPAYRTNPKEVEEIQRQVSELLAKGAINK
        RPAYRTNPKE EEIQRQVSELLAKG + +
Subjt:  RPAYRTNPKEVEEIQRQVSELLAKGAINK

XP_023553652.1 uncharacterized protein LOC111811140 [Cucurbita pepo subsp. pepo]6.1e-15545.08Show/hide
Query:  PRGPPRADTYEGNNSDHHEDNPHVVGHGLMQGRDHGRRYHNLQQRVPYDDRIDRNVGSIKLKLLKFYGKTDPEEYLQWEKTVESVFNCHNFSDEKKVLLY
        P   P  DTYEG+NSDHHEDNPH VGHGLM+GRDHGRRYHNLQQRVPYDDRIDRNVGSIKLKL KFYGKTDPEEYL+WEKT+ESVF CHNFSD+KKVLL 
Subjt:  PRGPPRADTYEGNNSDHHEDNPHVVGHGLMQGRDHGRRYHNLQQRVPYDDRIDRNVGSIKLKLLKFYGKTDPEEYLQWEKTVESVFNCHNFSDEKKVLLY

Query:  IAQFKQYAQIWWDKLMSSRRRNLEAPIDSWVEFKESMRKRFVPRYFHRDMAQKLQALKQGHKSVEDYYKEMDTLMDLLELDEDMEALMARYLNGLNTEIT
        IAQFKQYAQIWWDKLMSSRRRNLEAPIDSW EFKESMRKRFVP+YF RDMAQKLQALKQG KSVEDYYKEMDTLMD L+LDEDMEALMAR+LNGLNTEI 
Subjt:  IAQFKQYAQIWWDKLMSSRRRNLEAPIDSWVEFKESMRKRFVPRYFHRDMAQKLQALKQGHKSVEDYYKEMDTLMDLLELDEDMEALMARYLNGLNTEIT

Query:  NKINLQPYSNIEELLHIAIKIE------------------------------------------------------------------------------
        +K +LQPYSNIEELLHIAIKIE                                                                              
Subjt:  NKINLQPYSNIEELLHIAIKIE------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------------------------------SDFVVLLQEFEDLF
                                                                                              SDFVVLLQEFEDLF
Subjt:  --------------------------------------------------------------------------------------SDFVVLLQEFEDLF

Query:  FEEKPSSLPPLRGIEHKIDFIPGAPIPNRPAYRTNPKEVEEIQRQVSELLAKG------------------------------AINKITIKYRHPIPRLD
         EE PSSLPPLRGIEHKIDFIPGAPIPNRPAYRTNPKE EEIQRQVSELLAKG                              AINKITIKYRHPIPRLD
Subjt:  FEEKPSSLPPLRGIEHKIDFIPGAPIPNRPAYRTNPKEVEEIQRQVSELLAKG------------------------------AINKITIKYRHPIPRLD

Query:  DMLDELNGCSLFTKIDLKSGYHQIRMHIGDEWKTTFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHVLREYLG
        DMLDEL+GCSLFTKIDLKSGYHQIRMHIGDEWKT FKTKYGLYEWLVMPFGLTNAPSTFMRLMNHVLREYLG
Subjt:  DMLDELNGCSLFTKIDLKSGYHQIRMHIGDEWKTTFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHVLREYLG

XP_038887118.1 uncharacterized protein K02A2.6-like [Benincasa hispida]6.4e-11244.28Show/hide
Query:  VGSIKLKLLKFYGKTDPEEYLQWEKTVESVFNCHNFSDEKKVLLYIAQFKQYAQIWWDKLMSSRRRNLEAPIDSWVEFKESMRKRFVPRYFHRDMAQKLQ
        +G IK K+ KF+GKTDPEEY++WEK VE+VF CHNFSD++KV   +A+FK YAQ WWDKL + RRRNLEAPI SW EFK+SMRK FVP +F RDMAQ+LQ
Subjt:  VGSIKLKLLKFYGKTDPEEYLQWEKTVESVFNCHNFSDEKKVLLYIAQFKQYAQIWWDKLMSSRRRNLEAPIDSWVEFKESMRKRFVPRYFHRDMAQKLQ

Query:  ALKQGHKSVEDYYKEMDTLMDLLELDEDMEALMARYLNGLNTEITNKINLQPYSNIEELLHIAIKIE----------------------------SDF--
        AL+QG+KSVEDYYKEMD LMD L+LDEDME LMAR+LNGLN EI +K++LQPY +IEE+LH+AIK+E                            SDF  
Subjt:  ALKQGHKSVEDYYKEMDTLMDLLELDEDMEALMARYLNGLNTEITNKINLQPYSNIEELLHIAIKIE----------------------------SDF--

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------------VVLLQEFEDLFFEEKPSSLPPLRGIEHKIDFI
                                                                              LLQEF+D+F E+ PS LPPLRGIEH+IDF+
Subjt:  --------------------------------------------------------------------VVLLQEFEDLFFEEKPSSLPPLRGIEHKIDFI

Query:  PGAPIPNRPAYRTNPKEVEEIQRQVSELLAKG------------------------------AINKITIKYRHPIPRLDDMLDELNGCSLFTKIDLKSGY
         GA IPNRPAYR NP E +EIQ+QV ELLAKG                               INKITIKYRHPIPRLDDMLDEL+   LFTKIDLK GY
Subjt:  PGAPIPNRPAYRTNPKEVEEIQRQVSELLAKG------------------------------AINKITIKYRHPIPRLDDMLDELNGCSLFTKIDLKSGY

Query:  HQIRMHIGDEWKTTFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHVLREYLG
        HQIRM +GDEWK  FKTK+GLYEWLVMPF LTNAPSTFMRLMNHVLREY+G
Subjt:  HQIRMHIGDEWKTTFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHVLREYLG

TrEMBL top hitse value%identityAlignment
A0A2N9FQI2 Reverse transcriptase domain-containing protein1.1e-10444.12Show/hide
Query:  RGPPRADTYEGNNSD--HHEDNPHVVGHGLMQGRDHGRRYHNLQQRVPYDDRIDRNVGSIKLKLLKFYGKTDPEEYLQWEKTVESVFNCHNFSDEKKVLL
        R  PR +    N  D    ED    VG G  +     RR    +      D +DRN+GSIK+K+  F G+TDPE YL+WEK ++ VFNCHN+S+EKKV L
Subjt:  RGPPRADTYEGNNSD--HHEDNPHVVGHGLMQGRDHGRRYHNLQQRVPYDDRIDRNVGSIKLKLLKFYGKTDPEEYLQWEKTVESVFNCHNFSDEKKVLL

Query:  YIAQFKQYAQIWWDKLMSSRRRNLEAPIDSWVEFKESMRKRFVPRYFHRDMAQKLQALKQGHKSVEDYYKEMDTLMDLLELDEDMEALMARYLNGLNTEI
         + +F  YA IWWD+L+++R RN E P+++W E K  MR+RFV  +F+RD+ QKLQ L QG +SVEDYYKEM+  M    ++ED EA MAR+L+GLN +I
Subjt:  YIAQFKQYAQIWWDKLMSSRRRNLEAPIDSWVEFKESMRKRFVPRYFHRDMAQKLQALKQGHKSVEDYYKEMDTLMDLLELDEDMEALMARYLNGLNTEI

Query:  TNKINLQPYSNIEELLHIAI----------------------------KIES------------------------------------------------
         N I LQ Y  IE+++H+ +                            K+ES                                                
Subjt:  TNKINLQPYSNIEELLHIAI----------------------------KIES------------------------------------------------

Query:  ---------DFVVL----------------LQEFEDLFFEEKPSSLPPLRGIEHKIDFIPGAPIPNRPAYRTNPKEVEEIQRQVSELLAKG---------
                 D VV                 ++E+ED+F  + PS LPP+RGIEH+IDF+PGA IPNRP YR+NP+E +E+Q QV EL+AKG         
Subjt:  ---------DFVVL----------------LQEFEDLFFEEKPSSLPPLRGIEHKIDFIPGAPIPNRPAYRTNPKEVEEIQRQVSELLAKG---------

Query:  ---------------------AINKITIKYRHPIPRLDDMLDELNGCSLFTKIDLKSGYHQIRMHIGDEWKTTFKTKYGLYEWLVMPFGLTNAPSTFMRL
                             AI  IT+KYRHPIPRL DMLDEL+G  +FTKIDLKSGYHQIRM  GDEWKT FKTKYGLYEWLVMPFGLTNAPSTFMRL
Subjt:  ---------------------AINKITIKYRHPIPRLDDMLDELNGCSLFTKIDLKSGYHQIRMHIGDEWKTTFKTKYGLYEWLVMPFGLTNAPSTFMRL

Query:  MNHVLREYLG
        MNH LR +LG
Subjt:  MNHVLREYLG

A0A2N9GK81 Reverse transcriptase domain-containing protein9.6e-10643.91Show/hide
Query:  NSDHHEDNPHVVGHGLMQGRDHGRRYHNLQQRVPYDDRIDRNVGSIKLKLLKFYGKTDPEEYLQWEKTVESVFNCHNFSDEKKVLLYIAQFKQYAQIWWD
        +S+  ED    VG G  +   H R    L+  +   D +DRN+GSIK+K+  F G+TDPE YL+WEK ++ VF+CHN+S EKKV L + +F  YA IWWD
Subjt:  NSDHHEDNPHVVGHGLMQGRDHGRRYHNLQQRVPYDDRIDRNVGSIKLKLLKFYGKTDPEEYLQWEKTVESVFNCHNFSDEKKVLLYIAQFKQYAQIWWD

Query:  KLMSSRRRNLEAPIDSWVEFKESMRKRFVPRYFHRDMAQKLQALKQGHKSVEDYYKEMDTLMDLLELDEDMEALMARYLNGLNTEITNKINLQPYSNIEE
        +L+++RRRN E P+++W E K  MR+RFVP +F+RD+ QKLQ L QG +SVEDY+KEM+  M    ++ED EA MAR+L+GLN +I N I LQ Y  IE+
Subjt:  KLMSSRRRNLEAPIDSWVEFKESMRKRFVPRYFHRDMAQKLQALKQGHKSVEDYYKEMDTLMDLLELDEDMEALMARYLNGLNTEITNKINLQPYSNIEE

Query:  LLHIAIK---------------------------------------------------------------------IES----------------DFVVL
         +HIA+K                                                                     IES                D VV 
Subjt:  LLHIAIK---------------------------------------------------------------------IES----------------DFVVL

Query:  -------------------------------------LQEFEDLFFEEKPSSLPPLRGIEHKIDFIPGAPIPNRPAYRTNPKEVEEIQRQVSELLAKG--
                                              +EFED+F EE P+ LPP+RGIEH+IDF+PGA IPNRPAYR+NP+E +E+QRQV +L++KG  
Subjt:  -------------------------------------LQEFEDLFFEEKPSSLPPLRGIEHKIDFIPGAPIPNRPAYRTNPKEVEEIQRQVSELLAKG--

Query:  ----------------------------AINKITIKYRHPIPRLDDMLDELNGCSLFTKIDLKSGYHQIRMHIGDEWKTTFKTKYGLYEWLVMPFGLTNA
                                    AIN IT+KYRHPIPRLDDMLDEL+G  +F+KIDLKSGYHQIRM  GDEWKT FKTKYGLYEWLVMPFGLTNA
Subjt:  ----------------------------AINKITIKYRHPIPRLDDMLDELNGCSLFTKIDLKSGYHQIRMHIGDEWKTTFKTKYGLYEWLVMPFGLTNA

Query:  PSTFMRLMNHVLREYLG
        PSTFMRLMNHVL  ++G
Subjt:  PSTFMRLMNHVLREYLG

A0A2N9GYW5 Uncharacterized protein1.2e-10339.96Show/hide
Query:  GRNAGNP--RGPPRADTYEGNNSDH------HEDNPHVVGHGLMQGRDHGRRYHNLQQRVPYDDRIDRNVGSIKLKLLKFYGKTDPEEYLQWEKTVESVF
        GR  G P  R   R    + ++ DH       ED   + G  + +G   GR +   +  + + D  D N+G+IK+K+  F GK DPE YL+WEK VE +F
Subjt:  GRNAGNP--RGPPRADTYEGNNSDH------HEDNPHVVGHGLMQGRDHGRRYHNLQQRVPYDDRIDRNVGSIKLKLLKFYGKTDPEEYLQWEKTVESVF

Query:  NCHNFSDEKKVLLYIAQFKQYAQIWWDKLMSSRRRNLEAPIDSWVEFKESMRKRFVPRYFHRDMAQKLQALKQGHKSVEDYYKEMDTLMDLLELDEDMEA
         CHN+S+EKKV L + +F  YA IWWD+L+ +RRRN E  I++W E +  MR+RFVP +++RD+ QKLQ+L QG++SV+DYYKEM+  +    ++ED EA
Subjt:  NCHNFSDEKKVLLYIAQFKQYAQIWWDKLMSSRRRNLEAPIDSWVEFKESMRKRFVPRYFHRDMAQKLQALKQGHKSVEDYYKEMDTLMDLLELDEDMEA

Query:  LMARYLNGLNTEITNKINLQPYSNIEELLHIAIKIE----------------------------------------------------------------
         MAR+LNGLN +I N + LQ Y  +E+++H+AIK+E                                                                
Subjt:  LMARYLNGLNTEITNKINLQPYSNIEELLHIAIKIE----------------------------------------------------------------

Query:  ----------------------------------SDFVVLLQ------------------------------------------------EFEDLFFEEK
                                          +D + +L+                                                E+ED+F  + 
Subjt:  ----------------------------------SDFVVLLQ------------------------------------------------EFEDLFFEEK

Query:  PSSLPPLRGIEHKIDFIPGAPIPNRPAYRTNPKEVEEIQRQVSELLAKG------------------------------AINKITIKYRHPIPRLDDMLD
        PS LPP+RGIEH+IDF+PGA IPNRPAYR+NP+E +E+QRQV ELLAKG                              AIN IT+KYRHPIPRLDDMLD
Subjt:  PSSLPPLRGIEHKIDFIPGAPIPNRPAYRTNPKEVEEIQRQVSELLAKG------------------------------AINKITIKYRHPIPRLDDMLD

Query:  ELNGCSLFTKIDLKSGYHQIRMHIGDEWKTTFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHVLREYLG
        EL+G  +FTKIDLKSGYHQIRM  GDEWKT FKTKYGLYEWLVMPFGLTNAPSTFMRLMNH LR +LG
Subjt:  ELNGCSLFTKIDLKSGYHQIRMHIGDEWKTTFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHVLREYLG

A0A2N9HC25 Uncharacterized protein2.7e-10842.61Show/hide
Query:  GRNAGNPRGPP---------RADTYEGNNSDH---HEDNPHVVGHGLMQGRDHGRRYHNLQQRVPYDDRIDRNVGSIKLKLLKFYGKTDPEEYLQWEKTV
        G   G P+G P           D  +G++ D     ED   + G  + +G   GR +   +  + + D  D N+G+IK+K+  F GK DPE YL+WEK V
Subjt:  GRNAGNPRGPP---------RADTYEGNNSDH---HEDNPHVVGHGLMQGRDHGRRYHNLQQRVPYDDRIDRNVGSIKLKLLKFYGKTDPEEYLQWEKTV

Query:  ESVFNCHNFSDEKKVLLYIAQFKQYAQIWWDKLMSSRRRNLEAPIDSWVEFKESMRKRFVPRYFHRDMAQKLQALKQGHKSVEDYYKEMDTLMDLLELDE
        E +F CHN+S+EKKV L + +F  YA IWWD+L+ +RRRN E  I++W E +  MR+RFVP +++RD+ QKLQ+L QG++SV+DYYKEM+  +    ++E
Subjt:  ESVFNCHNFSDEKKVLLYIAQFKQYAQIWWDKLMSSRRRNLEAPIDSWVEFKESMRKRFVPRYFHRDMAQKLQALKQGHKSVEDYYKEMDTLMDLLELDE

Query:  DMEALMARYLNGLNTEITNKINLQPYSNIEELLHIAIKIESDFV--------------------------------------------------------
        D EA MAR+LNGLN +I N + LQ Y  +E+++H+AIK+E                                                            
Subjt:  DMEALMARYLNGLNTEITNKINLQPYSNIEELLHIAIKIESDFV--------------------------------------------------------

Query:  ----------------------------------------------VLLQEFEDLFFEEKPSSLPPLRGIEHKIDFIPGAPIPNRPAYRTNPKEVEEIQR
                                                      +  +E+ED+F  + PS LPP+RGIEH+IDF+PGA IPNRPAYR+NP+E +E+QR
Subjt:  ----------------------------------------------VLLQEFEDLFFEEKPSSLPPLRGIEHKIDFIPGAPIPNRPAYRTNPKEVEEIQR

Query:  QVSELLAKG------------------------------AINKITIKYRHPIPRLDDMLDELNGCSLFTKIDLKSGYHQIRMHIGDEWKTTFKTKYGLYE
        QV ELLAKG                              AIN IT+KYRHPIPRLDDMLDEL+G  +FTKIDLKSGYHQIRM  GDEWKT FKTKYGLYE
Subjt:  QVSELLAKG------------------------------AINKITIKYRHPIPRLDDMLDELNGCSLFTKIDLKSGYHQIRMHIGDEWKTTFKTKYGLYE

Query:  WLVMPFGLTNAPSTFMRLMNHVLREYLG
        WLVMPFGLTNAPSTFMRLMNH LR +LG
Subjt:  WLVMPFGLTNAPSTFMRLMNHVLREYLG

A0A2N9I0N6 Uncharacterized protein1.3e-10541.54Show/hide
Query:  EGNNSDHHEDNPHVVGHGLMQGRDHGRRYHNLQQRVPYDDRIDRNVGSIKLKLLKFYGKTDPEEYLQWEKTVESVFNCHNFSDEKKVLLYIAQFKQYAQI
        EG   D    N   V  G  +GR         +  + + D  D N+G+IK+K+  F GK DPE YL+WEK VE +F CHN+S+EKKV L + +F  YA I
Subjt:  EGNNSDHHEDNPHVVGHGLMQGRDHGRRYHNLQQRVPYDDRIDRNVGSIKLKLLKFYGKTDPEEYLQWEKTVESVFNCHNFSDEKKVLLYIAQFKQYAQI

Query:  WWDKLMSSRRRNLEAPIDSWVEFKESMRKRFVPRYFHRDMAQKLQALKQGHKSVEDYYKEMDTLMDLLELDEDMEALMARYLNGLNTEITNKINLQPYSN
        WWD+L+ +RRRN E  I++W E +  MR+RFVP +++RD+ QKLQ+L QG++SV+DYYKEM+  +    ++ED EA MAR+LNGLN +I N + LQ Y  
Subjt:  WWDKLMSSRRRNLEAPIDSWVEFKESMRKRFVPRYFHRDMAQKLQALKQGHKSVEDYYKEMDTLMDLLELDEDMEALMARYLNGLNTEITNKINLQPYSN

Query:  IEELLHIAIKIE----------------------------------------------------------------------------------------
        +E+++H+AIK+E                                                                                        
Subjt:  IEELLHIAIKIE----------------------------------------------------------------------------------------

Query:  ----------SDFVVLL-------------------------QEFEDLFFEEKPSSLPPLRGIEHKIDFIPGAPIPNRPAYRTNPKEVEEIQRQVSELLA
                  +D + +L                         QE+ED+F  + PS LPP+RGIEH+IDF+PGA IPNRPAYR+NP+E +E+QRQV ELLA
Subjt:  ----------SDFVVLL-------------------------QEFEDLFFEEKPSSLPPLRGIEHKIDFIPGAPIPNRPAYRTNPKEVEEIQRQVSELLA

Query:  KG------------------------------AINKITIKYRHPIPRLDDMLDELNGCSLFTKIDLKSGYHQIRMHIGDEWKTTFKTKYGLYEWLVMPFG
        KG                              AIN IT+KYRHPIPRLDDMLDEL+G  +FTKIDLKSGYHQIRM  GDEWKT FKTKYGLYEWLVMPFG
Subjt:  KG------------------------------AINKITIKYRHPIPRLDDMLDELNGCSLFTKIDLKSGYHQIRMHIGDEWKTTFKTKYGLYEWLVMPFG

Query:  LTNAPSTFMRLMNHVLREYLGD----GFDSRTNLSQEGENDMNH
        LTNAPSTFMRLMNH LR +L       FD     S+  +  +NH
Subjt:  LTNAPSTFMRLMNHVLREYLGD----GFDSRTNLSQEGENDMNH

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.61.1e-2132.86Show/hide
Query:  NKINLQPYSNIEELLHIAIKIESDFVVLLQEFEDLFFEEKPSSLPPLRGIEHKIDFIPGAPIPNRPAYRTNPKEVE-EIQRQVSELLAKGAI--------
        NKI+    S++  L H+  + +     LLQ++ D+ + E    L      +H I+     P+ ++ +Y   P+  E E++ Q+ ++L +G I        
Subjt:  NKINLQPYSNIEELLHIAIKIESDFVVLLQEFEDLFFEEKPSSLPPLRGIEHKIDFIPGAPIPNRPAYRTNPKEVE-EIQRQVSELLAKGAI--------

Query:  ---------------------------NKITIKYRHPIPRLDDMLDELNGCSLFTKIDLKSGYHQIRMHIGDEWKTTFKTKYGLYEWLVMPFGLTNAPST
                                   N+IT+  RHPIP +D++L +L  C+ FT IDL  G+HQI M      KT F TK+G YE+L MPFGL NAP+T
Subjt:  ---------------------------NKITIKYRHPIPRLDDMLDELNGCSLFTKIDLKSGYHQIRMHIGDEWKTTFKTKYGLYEWLVMPFGLTNAPST

Query:  FMRLMNHVLREYL
        F R MN +LR  L
Subjt:  FMRLMNHVLREYL

P20825 Retrovirus-related Pol polyprotein from transposon 2975.9e-2030Show/hide
Query:  KESMRKRFVPRYFHR-----DMAQKLQALKQGHKSVEDYYKEMDTLMD------LLELDEDMEALMARYLNGLNTEITNKINLQPYSNIEELLHIAIKIE
        + S+ K+  P Y HR     DM    + LK   +SV +Y  +  TL D        E + +    + R    + +     I    +S    L H+  +  
Subjt:  KESMRKRFVPRYFHR-----DMAQKLQALKQGHKSVEDYYKEMDTLMD------LLELDEDMEALMARYLNGLNTEITNKINLQPYSNIEELLHIAIKIE

Query:  SDFVVLLQEFEDLFFEEKPSSLPPLRGIEHKIDFIPGAPIPNRPAYRTNPKEVEEIQRQVSELLAKGAI-------------------------------
             LL +F +L ++E    L     I+H ++    +PI ++        E+ E++ QV E+L +G I                               
Subjt:  SDFVVLLQEFEDLFFEEKPSSLPPLRGIEHKIDFIPGAPIPNRPAYRTNPKEVEEIQRQVSELLAKGAI-------------------------------

Query:  ----NKITIKYRHPIPRLDDMLDELNGCSLFTKIDLKSGYHQIRMHIGDEWKTTFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHVLREYL
            N+ITI  R+PIP +D++L +L  C  FT IDL  G+HQI M      KT F TK G YE+L MPFGL NAP+TF R MN++LR  L
Subjt:  ----NKITIKYRHPIPRLDDMLDELNGCSLFTKIDLKSGYHQIRMHIGDEWKTTFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHVLREYL

P31843 RNA-directed DNA polymerase homolog2.0e-2359.09Show/hide
Query:  AINKITIKYRHPIPRLDDMLDELNGCSLFTKIDLKSGYHQIRMHIGDEWKTTFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHVLREYL
        A+ K+TIK ++PIPR+DD+ D L   + FTK+DL+SGY Q+R+  GDE KTT  T+YG +E+ VMPFGLTNA +TF  LMN+VL EYL
Subjt:  AINKITIKYRHPIPRLDDMLDELNGCSLFTKIDLKSGYHQIRMHIGDEWKTTFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHVLREYL

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein7.0e-2129.82Show/hide
Query:  LDEDMEALMARYLNGLNTEITNKINLQPYSNIEELLHIAIKIESDFVVLLQEFEDLFFEEKPSSLPPLRGI--EHKIDFIPGAPIPNRPAYRTNPKEVEE
        L+ED      +Y N ++T  + + N   +SN +    + + ++       Q++ ++   + P     +  I  +H I+  PGA +P    Y    K  +E
Subjt:  LDEDMEALMARYLNGLNTEITNKINLQPYSNIEELLHIAIKIESDFVVLLQEFEDLFFEEKPSSLPPLRGI--EHKIDFIPGAPIPNRPAYRTNPKEVEE

Query:  IQRQVSELLAK------------------------------GAINKITIKYRHPIPRLDDMLDELNGCSLFTKIDLKSGYHQIRMHIGDEWKTTFKTKYG
        I + V +LL                                  +NK TI    P+PR+D++L  +    +FT +DL SGYHQI M   D +KT F T  G
Subjt:  IQRQVSELLAK------------------------------GAINKITIKYRHPIPRLDDMLDELNGCSLFTKIDLKSGYHQIRMHIGDEWKTTFKTKYG

Query:  LYEWLVMPFGLTNAPSTFMRLMNHVLRE
         YE+ VMPFGL NAPSTF R M    R+
Subjt:  LYEWLVMPFGLTNAPSTFMRLMNHVLRE

Q99315 Transposon Ty3-G Gag-Pol polyprotein7.0e-2129.82Show/hide
Query:  LDEDMEALMARYLNGLNTEITNKINLQPYSNIEELLHIAIKIESDFVVLLQEFEDLFFEEKPSSLPPLRGI--EHKIDFIPGAPIPNRPAYRTNPKEVEE
        L+ED      +Y N ++T  + + N   +SN +    + + ++       Q++ ++   + P     +  I  +H I+  PGA +P    Y    K  +E
Subjt:  LDEDMEALMARYLNGLNTEITNKINLQPYSNIEELLHIAIKIESDFVVLLQEFEDLFFEEKPSSLPPLRGI--EHKIDFIPGAPIPNRPAYRTNPKEVEE

Query:  IQRQVSELLAK------------------------------GAINKITIKYRHPIPRLDDMLDELNGCSLFTKIDLKSGYHQIRMHIGDEWKTTFKTKYG
        I + V +LL                                  +NK TI    P+PR+D++L  +    +FT +DL SGYHQI M   D +KT F T  G
Subjt:  IQRQVSELLAK------------------------------GAINKITIKYRHPIPRLDDMLDELNGCSLFTKIDLKSGYHQIRMHIGDEWKTTFKTKYG

Query:  LYEWLVMPFGLTNAPSTFMRLMNHVLRE
         YE+ VMPFGL NAPSTF R M    R+
Subjt:  LYEWLVMPFGLTNAPSTFMRLMNHVLRE

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGGAGGCATTTAACCCAAAAACCTACGAAGAGTTGCTGAGGACGGCCAAGGTCTTGGAGGAACCTCAAGAAGACAAAAACTCAGAGCCAACAGTCGCCGTAGGAAA
GAAATGCCCCATAGTGGTGTCGTCGTACAGGTCACACATAACCCAATTTTGCATTGGAAGGAACGCGGGAAACCCAAGGGGACCCCCTAGAGCCGATACATATGAGGGCA
ACAATTCTGATCACCACGAGGATAATCCACATGTGGTTGGTCATGGCTTGATGCAAGGGAGAGACCATGGAAGAAGGTATCATAATTTACAACAACGAGTTCCTTATGAT
GATAGAATTGATCGTAACGTGGGGAGCATCAAATTAAAACTTCTCAAGTTTTATGGCAAAACCGATCCAGAGGAGTACCTTCAATGGGAGAAAACGGTGGAGTCAGTGTT
CAACTGTCATAATTTTAGTGATGAAAAGAAGGTACTGTTATACATTGCTCAATTCAAACAATACGCTCAAATTTGGTGGGATAAATTGATGTCAAGTAGGAGAAGAAATC
TTGAAGCACCAATTGATTCATGGGTCGAGTTCAAAGAGTCTATGAGGAAGCGTTTTGTCCCACGATATTTTCACCGGGACATGGCGCAAAAGCTTCAAGCATTGAAACAA
GGACACAAATCTGTGGAGGATTATTACAAGGAGATGGATACATTGATGGATCTACTTGAACTCGATGAGGACATGGAGGCTCTCATGGCGCGGTATCTTAATGGGTTAAA
CACAGAGATTACGAACAAGATTAATTTACAGCCTTATTCTAATATTGAGGAGTTGTTGCACATTGCAATTAAGATCGAGAGTGATTTTGTTGTGCTTTTGCAAGAGTTTG
AAGATTTATTTTTCGAGGAGAAGCCTAGTAGTTTGCCACCACTTAGAGGGATTGAACACAAGATTGACTTCATTCCTGGCGCGCCCATTCCAAACCGACCAGCTTATAGG
ACTAATCCAAAGGAGGTTGAAGAGATACAAAGGCAAGTAAGTGAACTCCTTGCTAAAGGGGCTATAAACAAGATAACTATAAAGTATAGGCATCCAATTCCCAGATTAGA
TGATATGCTTGATGAATTGAATGGATGTAGTCTTTTTACTAAGATTGATTTAAAATCGGGTTATCATCAAATTCGCATGCATATTGGGGATGAGTGGAAAACAACTTTTA
AAACCAAGTATGGTCTTTATGAATGGTTGGTTATGCCTTTTGGATTAACTAATGCACCTAGTACATTCATGAGACTAATGAACCATGTCTTACGAGAATACTTAGGTGAT
GGCTTTGATTCGAGGACAAATCTTTCTCAAGAGGGGGAGAATGATATGAACCATGACCAAGAAATTTCCATACCTCAAGGTCCAATTACAAGGACGAGAGCTAAGAAGCT
ACAACAAACCTTATACAGTTATATTCAAGCTATGGTGAGCTCATCAAAGGAAATTCTAGGATATGCTGGAGACCTCCCTTATATGTTGTGCAAAGTTGAGCTTCAAGAAA
GAGATATAGGAGTTAAAGGTTATATTGTAACCCACATTACTATATTCCACCACTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTGGAGGCATTTAACCCAAAAACCTACGAAGAGTTGCTGAGGACGGCCAAGGTCTTGGAGGAACCTCAAGAAGACAAAAACTCAGAGCCAACAGTCGCCGTAGGAAA
GAAATGCCCCATAGTGGTGTCGTCGTACAGGTCACACATAACCCAATTTTGCATTGGAAGGAACGCGGGAAACCCAAGGGGACCCCCTAGAGCCGATACATATGAGGGCA
ACAATTCTGATCACCACGAGGATAATCCACATGTGGTTGGTCATGGCTTGATGCAAGGGAGAGACCATGGAAGAAGGTATCATAATTTACAACAACGAGTTCCTTATGAT
GATAGAATTGATCGTAACGTGGGGAGCATCAAATTAAAACTTCTCAAGTTTTATGGCAAAACCGATCCAGAGGAGTACCTTCAATGGGAGAAAACGGTGGAGTCAGTGTT
CAACTGTCATAATTTTAGTGATGAAAAGAAGGTACTGTTATACATTGCTCAATTCAAACAATACGCTCAAATTTGGTGGGATAAATTGATGTCAAGTAGGAGAAGAAATC
TTGAAGCACCAATTGATTCATGGGTCGAGTTCAAAGAGTCTATGAGGAAGCGTTTTGTCCCACGATATTTTCACCGGGACATGGCGCAAAAGCTTCAAGCATTGAAACAA
GGACACAAATCTGTGGAGGATTATTACAAGGAGATGGATACATTGATGGATCTACTTGAACTCGATGAGGACATGGAGGCTCTCATGGCGCGGTATCTTAATGGGTTAAA
CACAGAGATTACGAACAAGATTAATTTACAGCCTTATTCTAATATTGAGGAGTTGTTGCACATTGCAATTAAGATCGAGAGTGATTTTGTTGTGCTTTTGCAAGAGTTTG
AAGATTTATTTTTCGAGGAGAAGCCTAGTAGTTTGCCACCACTTAGAGGGATTGAACACAAGATTGACTTCATTCCTGGCGCGCCCATTCCAAACCGACCAGCTTATAGG
ACTAATCCAAAGGAGGTTGAAGAGATACAAAGGCAAGTAAGTGAACTCCTTGCTAAAGGGGCTATAAACAAGATAACTATAAAGTATAGGCATCCAATTCCCAGATTAGA
TGATATGCTTGATGAATTGAATGGATGTAGTCTTTTTACTAAGATTGATTTAAAATCGGGTTATCATCAAATTCGCATGCATATTGGGGATGAGTGGAAAACAACTTTTA
AAACCAAGTATGGTCTTTATGAATGGTTGGTTATGCCTTTTGGATTAACTAATGCACCTAGTACATTCATGAGACTAATGAACCATGTCTTACGAGAATACTTAGGTGAT
GGCTTTGATTCGAGGACAAATCTTTCTCAAGAGGGGGAGAATGATATGAACCATGACCAAGAAATTTCCATACCTCAAGGTCCAATTACAAGGACGAGAGCTAAGAAGCT
ACAACAAACCTTATACAGTTATATTCAAGCTATGGTGAGCTCATCAAAGGAAATTCTAGGATATGCTGGAGACCTCCCTTATATGTTGTGCAAAGTTGAGCTTCAAGAAA
GAGATATAGGAGTTAAAGGTTATATTGTAACCCACATTACTATATTCCACCACTAA
Protein sequenceShow/hide protein sequence
MVEAFNPKTYEELLRTAKVLEEPQEDKNSEPTVAVGKKCPIVVSSYRSHITQFCIGRNAGNPRGPPRADTYEGNNSDHHEDNPHVVGHGLMQGRDHGRRYHNLQQRVPYD
DRIDRNVGSIKLKLLKFYGKTDPEEYLQWEKTVESVFNCHNFSDEKKVLLYIAQFKQYAQIWWDKLMSSRRRNLEAPIDSWVEFKESMRKRFVPRYFHRDMAQKLQALKQ
GHKSVEDYYKEMDTLMDLLELDEDMEALMARYLNGLNTEITNKINLQPYSNIEELLHIAIKIESDFVVLLQEFEDLFFEEKPSSLPPLRGIEHKIDFIPGAPIPNRPAYR
TNPKEVEEIQRQVSELLAKGAINKITIKYRHPIPRLDDMLDELNGCSLFTKIDLKSGYHQIRMHIGDEWKTTFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHVLREYLGD
GFDSRTNLSQEGENDMNHDQEISIPQGPITRTRAKKLQQTLYSYIQAMVSSSKEILGYAGDLPYMLCKVELQERDIGVKGYIVTHITIFHH