CuGenDBv2

PI0027651 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0027651
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: Retrotransposon protein
Genome location: chr12:18931975..18934332
RNA-Seq Expression: PI0027651
Synteny: PI0027651
Gene Ontology terms: GO:0046872 - metal ion binding (molecular function)
InterPro domains: IPR027806 - Harbinger transposase-derived nuclease domain
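
The genome location in this record follows a chromosome:start..end pattern. As a rough sketch, the span could be parsed and measured as shown here. The helper name `parse_location` is hypothetical, and treating the coordinates as 1-based and inclusive is an assumption; the record itself does not state the convention.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("chr12:18931975..18934332")
# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
span = end - start + 1
print(chrom, span)
```

If the coordinates were instead 0-based and half-open, the `+ 1` would be dropped.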


Homology
GenBank top hits
ADN34114.1 retrotransposon protein [Cucumis melo subsp. melo]; e value: 3.7e-115; % identity: 42.38
Query:  TIAGLAATEVVDVEEMVAIFLHVLAHDVKNRLVQREFVRSGEIISRHFNLVLLAVVCLHDELLKKPQPITNTCTDPHWQSFENCLGALDDTYIKVNVMAN
        TIAGL +TEVVDVEEMVA+FLH+LAHDVK+R+++REF+RSGE ISRHFN+VLLAV+ LH+ELLKKPQP+ N CTD  W+ FENCLGALD TYIKVNV A+
Subjt:  TIAGLAATEVVDVEEMVAIFLHVLAHDVKNRLVQREFVRSGEIISRHFNLVLLAVVCLHDELLKKPQPITNTCTDPHWQSFENCLGALDDTYIKVNVMAN

Query:  DQPRYRTQKGEVATNILGICDTKGNFVFVLIWWEGSAADSSIVRDVMSRPNGLKVPKGNYYLCDVGYPNVEGFLAPYRGQRYHLQECHGAENAPTTAKEF
        D+ RYRT+KGEVATN+LG+CDTKG+FV+VL  WEGSAADS I+RD +SRPN LKVPKG YYL DVGYPN EGFLAPYRGQRYHLQE  G ENAP+T+KEF
Subjt:  DQPRYRTQKGEVATNILGICDTKGNFVFVLIWWEGSAADSSIVRDVMSRPNGLKVPKGNYYLCDVGYPNVEGFLAPYRGQRYHLQECHGAENAPTTAKEF

Query:  FNKKHSFARK------------------------------------------------------------------------------------------
        FN KH  AR                                                                                           
Subjt:  FNKKHSFARK------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------SHLAAKCLLNKSFPYYDELSYVLKKDHTTGTCTKTFPDVGSNVSGSFQPYPGEDGNDMEIPNMYSQGVPMSPEDIQGTRPDRANECRTASS
                 SH AAK LLNKSF +YDELSYV  KD  TG   ++F D+GSN    +     +   D + P MYS G+ MSP+D+  TR  R +E R  SS
Subjt:  ---------SHLAAKCLLNKSFPYYDELSYVLKKDHTTGTCTKTFPDVGSNVSGSFQPYPGEDGNDMEIPNMYSQGVPMSPEDIQGTRPDRANECRTASS

Query:  GSKRKRGGQNVETVKIIRSAMEYANDQLKAIAEWPQLQQQDKSITCATVVSQLQEIPALSRLDKVCCMRILMQNMDDMKAFLNVSNELKLDYWTVIL
        GSKRKR G   ++  I+R+A+EY N+QL  IAEWP LQ+QD + T   +V  L+ IP L+ +D+   MRILM+N+DDMKAFL V + +K  Y ++IL
Subjt:  GSKRKRGGQNVETVKIIRSAMEYANDQLKAIAEWPQLQQQDKSITCATVVSQLQEIPALSRLDKVCCMRILMQNMDDMKAFLNVSNELKLDYWTVIL

KAA0033290.1 putative nuclease HARBI1 [Cucumis melo var. makuwa]; e value: 7.1e-135; % identity: 52.32
Query:  MDRRTFVILCHLLRTIAGLAATEVVDVEEMVAIFLHVLAHDVKNRLVQREFVRSGEIISRHFNLVLLAVVCLHDELLKKPQPITNTCTDPHWQSFENCLG
        MDRR F ILCHLLRT A L  TEV+DVEEMVA+FLH+LAHD+KNR++QREFVRSGE +SRHFNLVLL+V+ LH+ELLKKPQ +TN+C DP W+ FENCLG
Subjt:  MDRRTFVILCHLLRTIAGLAATEVVDVEEMVAIFLHVLAHDVKNRLVQREFVRSGEIISRHFNLVLLAVVCLHDELLKKPQPITNTCTDPHWQSFENCLG

Query:  ALDDTYIKVNVMANDQPRYRTQKGEVATNILGICDTKGNFVFVLIWWEGSAADSSIVRDVMSRPNGLKVPKGNYYLCDVGYPNVEGFLAPYRGQRYHLQE
        ALDDTYIKVNV A D+PRY T+KGEVA N+LG+CDTKG+FVFVL  WEGSAADS I+RD +SR NGLKVPKG YYLCD GY NVEGFLAPYRG+RYHL E
Subjt:  ALDDTYIKVNVMANDQPRYRTQKGEVATNILGICDTKGNFVFVLIWWEGSAADSSIVRDVMSRPNGLKVPKGNYYLCDVGYPNVEGFLAPYRGQRYHLQE

Query:  CHGAENAPTTAKEFFNKKHSFAR-----------------------------------------------------------------------------
         HG  NAPTTA+EFFN KHS AR                                                                             
Subjt:  CHGAENAPTTAKEFFNKKHSFAR-----------------------------------------------------------------------------

Query:  -----KSHLAAKCLLNKSFPYYDELSYVLKKDHTTGTCTKTFPDVGSNVSGSFQ-PYPGEDGNDMEIPNMYSQGVPMSPEDIQGTRPDRANECRTASSGS
             +  LA K LL+KSFPYYD+L YV  KD  T   ++TF DVGSNV   F    P +D +D +IP MYSQGV MSP+++ G R  +A+E +  SSGS
Subjt:  -----KSHLAAKCLLNKSFPYYDELSYVLKKDHTTGTCTKTFPDVGSNVSGSFQ-PYPGEDGNDMEIPNMYSQGVPMSPEDIQGTRPDRANECRTASSGS

Query:  KRKRGGQNVETVKIIRSAMEYANDQLKAIAEWPQLQQQDKSITCATVVSQLQEIPALSRLDKVCCMRILMQNMDDMKAFLNVSNELKLDYWTVIL
        KRK+G ++ ETV++I+SA+E+ NDQLKAIA WP+ ++  +    A V+ QLQ+IP L   D+   ++IL ++++ ++ FL++  E KL+Y  ++L
Subjt:  KRKRGGQNVETVKIIRSAMEYANDQLKAIAEWPQLQQQDKSITCATVVSQLQEIPALSRLDKVCCMRILMQNMDDMKAFLNVSNELKLDYWTVIL

KAA0034843.1 retrotransposon protein [Cucumis melo var. makuwa]; e value: 3.2e-127; % identity: 46.34
Query:  MDRRTFVILCHLLRTIAGLAATEVVDVEEMVAIFLHVLAHDVKNRLVQREFVRSGEIISRHFNLVLLAVVCLHDELLKKPQPITNTCTDPHWQSFENCLG
        MDRR F ILCHLLRTIAGL +TEVVDVEEMVA+FLH+LAHDVKNR++QREF+RSGE ISRHFN+VLLAV+ LHDELLKKPQP+ N CTD  W+ FENCLG
Subjt:  MDRRTFVILCHLLRTIAGLAATEVVDVEEMVAIFLHVLAHDVKNRLVQREFVRSGEIISRHFNLVLLAVVCLHDELLKKPQPITNTCTDPHWQSFENCLG

Query:  ALDDTYIKVNVMANDQPRYRTQKGEVATNILGICDTKGNFVFVLIWWEGSAADSSIVRDVMSRPNGLKVPKGNYYLCDVGYPNVEGFLAPYRGQRYHLQE
        ALD TYIKVNV A+D+ RYRT+KGEVATN+LG+ DTKG+FV+VL  WEGSAADS I+RD +SRPN LKVPKG YYL D GYPN EGFLAPYRGQRYHLQE
Subjt:  ALDDTYIKVNVMANDQPRYRTQKGEVATNILGICDTKGNFVFVLIWWEGSAADSSIVRDVMSRPNGLKVPKGNYYLCDVGYPNVEGFLAPYRGQRYHLQE

Query:  CHGAENAPTTAKEFFNKKHSFARK----------------------------------------------------------------------------
          G +NAP+T+KEFFN KHS AR                                                                             
Subjt:  CHGAENAPTTAKEFFNKKHSFARK----------------------------------------------------------------------------

Query:  --------------------------------------------------------------------------------------SHLAAKCLLNKSFP
                                                                                              SH AAK LLNKSF 
Subjt:  --------------------------------------------------------------------------------------SHLAAKCLLNKSFP

Query:  YYDELSYVLKKDHTTGTCTKTFPDVGSNVSGSFQPYPGEDGNDMEIPNMYSQGVPMSPEDIQGTRPDRANECRTASSGSKRKRGGQNVETVKIIRSAMEY
        +YDELSYV  KD  TG   ++F D+GSN    +  +  +   D +   MYS G+ MSP+D+  TR  R +E R  SSGSKRKR G   ++  I+R+A+EY
Subjt:  YYDELSYVLKKDHTTGTCTKTFPDVGSNVSGSFQPYPGEDGNDMEIPNMYSQGVPMSPEDIQGTRPDRANECRTASSGSKRKRGGQNVETVKIIRSAMEY

Query:  ANDQLKAIAEWPQLQQQDKSITCATVVSQLQEIPALSRLDKVCCMRILMQNMDDMKAFLNVSNELKLDYWTVIL
         N+QL  IAEWP LQ+QD + T   +V QL+ IP L+ +D+   MRILM+N+DDMKAFL V + +K  Y ++IL
Subjt:  ANDQLKAIAEWPQLQQQDKSITCATVVSQLQEIPALSRLDKVCCMRILMQNMDDMKAFLNVSNELKLDYWTVIL

KAA0062747.1 retrotransposon protein [Cucumis melo var. makuwa]; e value: 3.5e-110; % identity: 42.22
Query:  MDRRTFVILCHLLRTIAGLAATEVVDVEEMVAIFLHVLAHDVKNRLVQREFVRSGEIISRHFNLVLLAVVCLHDELLKKPQPITNTCTDPHWQSFE---N
        MDRR F ILCHLLRT AGL  TEV+DVEEMVA+FLH+LAH VKNR++QREFVRSGE +SRHFN+VLLA   LHDELLKKPQP+TN+CTDP W+ FE   N
Subjt:  MDRRTFVILCHLLRTIAGLAATEVVDVEEMVAIFLHVLAHDVKNRLVQREFVRSGEIISRHFNLVLLAVVCLHDELLKKPQPITNTCTDPHWQSFE---N

Query:  CLGALDDTYIKVNVMANDQPRYRTQKGEVATNILGICDTKGNFVFVLIWWEGSAADSSIVRDVMSRPNGLKVPKGNYYLCDVGYPNVEGFLAPYRGQRYH
        CL + + TYIKVNV A D+PRYRT+KGEVATN+LG CDTKG+FVFVL  WEGSAADS I+RD +SR NGLKVPKG YYLCD GYPN EGFLAPYRG+RYH
Subjt:  CLGALDDTYIKVNVMANDQPRYRTQKGEVATNILGICDTKGNFVFVLIWWEGSAADSSIVRDVMSRPNGLKVPKGNYYLCDVGYPNVEGFLAPYRGQRYH

Query:  LQECHGAENAPTTAKEFFNKKHSFAR--------------------------------------------------------------------------
        L E  G  NAPTT +EFFN KHS +R                                                                          
Subjt:  LQECHGAENAPTTAKEFFNKKHSFAR--------------------------------------------------------------------------

Query:  -----------------------------------------------------------------------------------------------KSHLA
                                                                                                       KSH A
Subjt:  -----------------------------------------------------------------------------------------------KSHLA

Query:  AKCLLNKSFPYYDELSYVLKKDHTTGTCTKTFPDVGSNVSGSF-QPYPGEDGNDMEIPNMYSQGVPMSPEDIQGTRPDRANECRTASSGSKRKRGGQNVE
         K LL+KSFPYYD+LSYV  KD  TG  ++TF DVGSNV   F    P  D +D +IP MYSQGV +SP+++ G R                        
Subjt:  AKCLLNKSFPYYDELSYVLKKDHTTGTCTKTFPDVGSNVSGSF-QPYPGEDGNDMEIPNMYSQGVPMSPEDIQGTRPDRANECRTASSGSKRKRGGQNVE

Query:  TVKIIRSAMEYANDQLKAIAEWPQLQQQDKSITCATVVSQLQEIPALSRLDKVCCMRILMQNMDDMKAFLNVSNELKLDYWTVIL
          ++IRS ME+ N+QLKAIA+W + ++  +    A VV QLQ+IP L    +   M+IL ++++ +  FL++  ELKL+Y  ++L
Subjt:  TVKIIRSAMEYANDQLKAIAEWPQLQQQDKSITCATVVSQLQEIPALSRLDKVCCMRILMQNMDDMKAFLNVSNELKLDYWTVIL

TYK02751.1 putative nuclease HARBI1 [Cucumis melo var. makuwa]; e value: 1.3e-120; % identity: 52.25
Query:  MDRRTFVILCHLLRTIAGLAATEVVDVEEMVAIFLHVLAHDVKNRLVQREFVRSGEIISRHFNLVLLAVVCLHDELLKKPQPITNTCTDPHWQSFENCLG
        MDRR F ILCHLLRT+AGL + EVVDVEEMVA+FLH++AHDVKNR++QREF+RSGE ISRHFN+VLL V+ LHD+LLKKPQP+ N CTD  W+ FENCLG
Subjt:  MDRRTFVILCHLLRTIAGLAATEVVDVEEMVAIFLHVLAHDVKNRLVQREFVRSGEIISRHFNLVLLAVVCLHDELLKKPQPITNTCTDPHWQSFENCLG

Query:  ALDDTYIKVNVMANDQPRYRTQKGEVATNILGICDTKGNFVFVLIWWEGSAADSSIVRDVMSRPNGLKVPKGNYYLCDVGYPNVEGFLAPYRGQRYHLQE
        ALD TYIKVNV A+D+ RYRT KGEVATN+LG+CDTKG+FV+VL  WEGSAADS I+ D +SRPNGLKVPKG YYL D GYPNV+GFLA YRGQRYHLQE
Subjt:  ALDDTYIKVNVMANDQPRYRTQKGEVATNILGICDTKGNFVFVLIWWEGSAADSSIVRDVMSRPNGLKVPKGNYYLCDVGYPNVEGFLAPYRGQRYHLQE

Query:  CHGAENAPTTAKEFFNKKHSFAR-------------------------------------------------------KSHLAAKCLLNKSFPYYDELSY
          G ENAP+T+KEFFN KHS AR                                                       KSH AAK LLNKSF +YDELSY
Subjt:  CHGAENAPTTAKEFFNKKHSFAR-------------------------------------------------------KSHLAAKCLLNKSFPYYDELSY

Query:  VLKKDHTTGTCTKTFPDVGSNVSGSFQPYPGEDGNDMEIPNMYSQGVPMSPEDIQGTRPDRANECRTASSGSKRKRGGQNVETVKIIRSAMEYANDQLKA
        V  KD  T    ++F + GSN    +  +  +   DM+ P MYSQG+ MS +D+ GTR  R +E R  SSGSK+KR G   ++  I+R+A+E        
Subjt:  VLKKDHTTGTCTKTFPDVGSNVSGSFQPYPGEDGNDMEIPNMYSQGVPMSPEDIQGTRPDRANECRTASSGSKRKRGGQNVETVKIIRSAMEYANDQLKA

Query:  IAEWPQLQQQDKSITCATVVSQLQEIPALSRLDKVCCMRILMQNMDDMKAFLNVSNELKLDYWTVIL
                                      RL     MRILM+N+DDMKAFL V + +K  Y ++IL
Subjt:  IAEWPQLQQQDKSITCATVVSQLQEIPALSRLDKVCCMRILMQNMDDMKAFLNVSNELKLDYWTVIL

TrEMBL top hits
A0A5A7SQU2 Putative nuclease HARBI1; e value: 3.4e-135; % identity: 52.32
Query:  MDRRTFVILCHLLRTIAGLAATEVVDVEEMVAIFLHVLAHDVKNRLVQREFVRSGEIISRHFNLVLLAVVCLHDELLKKPQPITNTCTDPHWQSFENCLG
        MDRR F ILCHLLRT A L  TEV+DVEEMVA+FLH+LAHD+KNR++QREFVRSGE +SRHFNLVLL+V+ LH+ELLKKPQ +TN+C DP W+ FENCLG
Subjt:  MDRRTFVILCHLLRTIAGLAATEVVDVEEMVAIFLHVLAHDVKNRLVQREFVRSGEIISRHFNLVLLAVVCLHDELLKKPQPITNTCTDPHWQSFENCLG

Query:  ALDDTYIKVNVMANDQPRYRTQKGEVATNILGICDTKGNFVFVLIWWEGSAADSSIVRDVMSRPNGLKVPKGNYYLCDVGYPNVEGFLAPYRGQRYHLQE
        ALDDTYIKVNV A D+PRY T+KGEVA N+LG+CDTKG+FVFVL  WEGSAADS I+RD +SR NGLKVPKG YYLCD GY NVEGFLAPYRG+RYHL E
Subjt:  ALDDTYIKVNVMANDQPRYRTQKGEVATNILGICDTKGNFVFVLIWWEGSAADSSIVRDVMSRPNGLKVPKGNYYLCDVGYPNVEGFLAPYRGQRYHLQE

Query:  CHGAENAPTTAKEFFNKKHSFAR-----------------------------------------------------------------------------
         HG  NAPTTA+EFFN KHS AR                                                                             
Subjt:  CHGAENAPTTAKEFFNKKHSFAR-----------------------------------------------------------------------------

Query:  -----KSHLAAKCLLNKSFPYYDELSYVLKKDHTTGTCTKTFPDVGSNVSGSFQ-PYPGEDGNDMEIPNMYSQGVPMSPEDIQGTRPDRANECRTASSGS
             +  LA K LL+KSFPYYD+L YV  KD  T   ++TF DVGSNV   F    P +D +D +IP MYSQGV MSP+++ G R  +A+E +  SSGS
Subjt:  -----KSHLAAKCLLNKSFPYYDELSYVLKKDHTTGTCTKTFPDVGSNVSGSFQ-PYPGEDGNDMEIPNMYSQGVPMSPEDIQGTRPDRANECRTASSGS

Query:  KRKRGGQNVETVKIIRSAMEYANDQLKAIAEWPQLQQQDKSITCATVVSQLQEIPALSRLDKVCCMRILMQNMDDMKAFLNVSNELKLDYWTVIL
        KRK+G ++ ETV++I+SA+E+ NDQLKAIA WP+ ++  +    A V+ QLQ+IP L   D+   ++IL ++++ ++ FL++  E KL+Y  ++L
Subjt:  KRKRGGQNVETVKIIRSAMEYANDQLKAIAEWPQLQQQDKSITCATVVSQLQEIPALSRLDKVCCMRILMQNMDDMKAFLNVSNELKLDYWTVIL

A0A5A7SWD8 Retrotransposon protein; e value: 1.5e-127; % identity: 46.34
Query:  MDRRTFVILCHLLRTIAGLAATEVVDVEEMVAIFLHVLAHDVKNRLVQREFVRSGEIISRHFNLVLLAVVCLHDELLKKPQPITNTCTDPHWQSFENCLG
        MDRR F ILCHLLRTIAGL +TEVVDVEEMVA+FLH+LAHDVKNR++QREF+RSGE ISRHFN+VLLAV+ LHDELLKKPQP+ N CTD  W+ FENCLG
Subjt:  MDRRTFVILCHLLRTIAGLAATEVVDVEEMVAIFLHVLAHDVKNRLVQREFVRSGEIISRHFNLVLLAVVCLHDELLKKPQPITNTCTDPHWQSFENCLG

Query:  ALDDTYIKVNVMANDQPRYRTQKGEVATNILGICDTKGNFVFVLIWWEGSAADSSIVRDVMSRPNGLKVPKGNYYLCDVGYPNVEGFLAPYRGQRYHLQE
        ALD TYIKVNV A+D+ RYRT+KGEVATN+LG+ DTKG+FV+VL  WEGSAADS I+RD +SRPN LKVPKG YYL D GYPN EGFLAPYRGQRYHLQE
Subjt:  ALDDTYIKVNVMANDQPRYRTQKGEVATNILGICDTKGNFVFVLIWWEGSAADSSIVRDVMSRPNGLKVPKGNYYLCDVGYPNVEGFLAPYRGQRYHLQE

Query:  CHGAENAPTTAKEFFNKKHSFARK----------------------------------------------------------------------------
          G +NAP+T+KEFFN KHS AR                                                                             
Subjt:  CHGAENAPTTAKEFFNKKHSFARK----------------------------------------------------------------------------

Query:  --------------------------------------------------------------------------------------SHLAAKCLLNKSFP
                                                                                              SH AAK LLNKSF 
Subjt:  --------------------------------------------------------------------------------------SHLAAKCLLNKSFP

Query:  YYDELSYVLKKDHTTGTCTKTFPDVGSNVSGSFQPYPGEDGNDMEIPNMYSQGVPMSPEDIQGTRPDRANECRTASSGSKRKRGGQNVETVKIIRSAMEY
        +YDELSYV  KD  TG   ++F D+GSN    +  +  +   D +   MYS G+ MSP+D+  TR  R +E R  SSGSKRKR G   ++  I+R+A+EY
Subjt:  YYDELSYVLKKDHTTGTCTKTFPDVGSNVSGSFQPYPGEDGNDMEIPNMYSQGVPMSPEDIQGTRPDRANECRTASSGSKRKRGGQNVETVKIIRSAMEY

Query:  ANDQLKAIAEWPQLQQQDKSITCATVVSQLQEIPALSRLDKVCCMRILMQNMDDMKAFLNVSNELKLDYWTVIL
         N+QL  IAEWP LQ+QD + T   +V QL+ IP L+ +D+   MRILM+N+DDMKAFL V + +K  Y ++IL
Subjt:  ANDQLKAIAEWPQLQQQDKSITCATVVSQLQEIPALSRLDKVCCMRILMQNMDDMKAFLNVSNELKLDYWTVIL

A0A5D3BSN2 Putative nuclease HARBI1; e value: 6.3e-121; % identity: 52.25
Query:  MDRRTFVILCHLLRTIAGLAATEVVDVEEMVAIFLHVLAHDVKNRLVQREFVRSGEIISRHFNLVLLAVVCLHDELLKKPQPITNTCTDPHWQSFENCLG
        MDRR F ILCHLLRT+AGL + EVVDVEEMVA+FLH++AHDVKNR++QREF+RSGE ISRHFN+VLL V+ LHD+LLKKPQP+ N CTD  W+ FENCLG
Subjt:  MDRRTFVILCHLLRTIAGLAATEVVDVEEMVAIFLHVLAHDVKNRLVQREFVRSGEIISRHFNLVLLAVVCLHDELLKKPQPITNTCTDPHWQSFENCLG

Query:  ALDDTYIKVNVMANDQPRYRTQKGEVATNILGICDTKGNFVFVLIWWEGSAADSSIVRDVMSRPNGLKVPKGNYYLCDVGYPNVEGFLAPYRGQRYHLQE
        ALD TYIKVNV A+D+ RYRT KGEVATN+LG+CDTKG+FV+VL  WEGSAADS I+ D +SRPNGLKVPKG YYL D GYPNV+GFLA YRGQRYHLQE
Subjt:  ALDDTYIKVNVMANDQPRYRTQKGEVATNILGICDTKGNFVFVLIWWEGSAADSSIVRDVMSRPNGLKVPKGNYYLCDVGYPNVEGFLAPYRGQRYHLQE

Query:  CHGAENAPTTAKEFFNKKHSFAR-------------------------------------------------------KSHLAAKCLLNKSFPYYDELSY
          G ENAP+T+KEFFN KHS AR                                                       KSH AAK LLNKSF +YDELSY
Subjt:  CHGAENAPTTAKEFFNKKHSFAR-------------------------------------------------------KSHLAAKCLLNKSFPYYDELSY

Query:  VLKKDHTTGTCTKTFPDVGSNVSGSFQPYPGEDGNDMEIPNMYSQGVPMSPEDIQGTRPDRANECRTASSGSKRKRGGQNVETVKIIRSAMEYANDQLKA
        V  KD  T    ++F + GSN    +  +  +   DM+ P MYSQG+ MS +D+ GTR  R +E R  SSGSK+KR G   ++  I+R+A+E        
Subjt:  VLKKDHTTGTCTKTFPDVGSNVSGSFQPYPGEDGNDMEIPNMYSQGVPMSPEDIQGTRPDRANECRTASSGSKRKRGGQNVETVKIIRSAMEYANDQLKA

Query:  IAEWPQLQQQDKSITCATVVSQLQEIPALSRLDKVCCMRILMQNMDDMKAFLNVSNELKLDYWTVIL
                                      RL     MRILM+N+DDMKAFL V + +K  Y ++IL
Subjt:  IAEWPQLQQQDKSITCATVVSQLQEIPALSRLDKVCCMRILMQNMDDMKAFLNVSNELKLDYWTVIL

A0A5D3DG22 Retrotransposon protein; e value: 1.7e-110; % identity: 42.22
Query:  MDRRTFVILCHLLRTIAGLAATEVVDVEEMVAIFLHVLAHDVKNRLVQREFVRSGEIISRHFNLVLLAVVCLHDELLKKPQPITNTCTDPHWQSFE---N
        MDRR F ILCHLLRT AGL  TEV+DVEEMVA+FLH+LAH VKNR++QREFVRSGE +SRHFN+VLLA   LHDELLKKPQP+TN+CTDP W+ FE   N
Subjt:  MDRRTFVILCHLLRTIAGLAATEVVDVEEMVAIFLHVLAHDVKNRLVQREFVRSGEIISRHFNLVLLAVVCLHDELLKKPQPITNTCTDPHWQSFE---N

Query:  CLGALDDTYIKVNVMANDQPRYRTQKGEVATNILGICDTKGNFVFVLIWWEGSAADSSIVRDVMSRPNGLKVPKGNYYLCDVGYPNVEGFLAPYRGQRYH
        CL + + TYIKVNV A D+PRYRT+KGEVATN+LG CDTKG+FVFVL  WEGSAADS I+RD +SR NGLKVPKG YYLCD GYPN EGFLAPYRG+RYH
Subjt:  CLGALDDTYIKVNVMANDQPRYRTQKGEVATNILGICDTKGNFVFVLIWWEGSAADSSIVRDVMSRPNGLKVPKGNYYLCDVGYPNVEGFLAPYRGQRYH

Query:  LQECHGAENAPTTAKEFFNKKHSFAR--------------------------------------------------------------------------
        L E  G  NAPTT +EFFN KHS +R                                                                          
Subjt:  LQECHGAENAPTTAKEFFNKKHSFAR--------------------------------------------------------------------------

Query:  -----------------------------------------------------------------------------------------------KSHLA
                                                                                                       KSH A
Subjt:  -----------------------------------------------------------------------------------------------KSHLA

Query:  AKCLLNKSFPYYDELSYVLKKDHTTGTCTKTFPDVGSNVSGSF-QPYPGEDGNDMEIPNMYSQGVPMSPEDIQGTRPDRANECRTASSGSKRKRGGQNVE
         K LL+KSFPYYD+LSYV  KD  TG  ++TF DVGSNV   F    P  D +D +IP MYSQGV +SP+++ G R                        
Subjt:  AKCLLNKSFPYYDELSYVLKKDHTTGTCTKTFPDVGSNVSGSF-QPYPGEDGNDMEIPNMYSQGVPMSPEDIQGTRPDRANECRTASSGSKRKRGGQNVE

Query:  TVKIIRSAMEYANDQLKAIAEWPQLQQQDKSITCATVVSQLQEIPALSRLDKVCCMRILMQNMDDMKAFLNVSNELKLDYWTVIL
          ++IRS ME+ N+QLKAIA+W + ++  +    A VV QLQ+IP L    +   M+IL ++++ +  FL++  ELKL+Y  ++L
Subjt:  TVKIIRSAMEYANDQLKAIAEWPQLQQQDKSITCATVVSQLQEIPALSRLDKVCCMRILMQNMDDMKAFLNVSNELKLDYWTVIL

E5GCB5 Retrotransposon protein; e value: 1.8e-115; % identity: 42.38
Query:  TIAGLAATEVVDVEEMVAIFLHVLAHDVKNRLVQREFVRSGEIISRHFNLVLLAVVCLHDELLKKPQPITNTCTDPHWQSFENCLGALDDTYIKVNVMAN
        TIAGL +TEVVDVEEMVA+FLH+LAHDVK+R+++REF+RSGE ISRHFN+VLLAV+ LH+ELLKKPQP+ N CTD  W+ FENCLGALD TYIKVNV A+
Subjt:  TIAGLAATEVVDVEEMVAIFLHVLAHDVKNRLVQREFVRSGEIISRHFNLVLLAVVCLHDELLKKPQPITNTCTDPHWQSFENCLGALDDTYIKVNVMAN

Query:  DQPRYRTQKGEVATNILGICDTKGNFVFVLIWWEGSAADSSIVRDVMSRPNGLKVPKGNYYLCDVGYPNVEGFLAPYRGQRYHLQECHGAENAPTTAKEF
        D+ RYRT+KGEVATN+LG+CDTKG+FV+VL  WEGSAADS I+RD +SRPN LKVPKG YYL DVGYPN EGFLAPYRGQRYHLQE  G ENAP+T+KEF
Subjt:  DQPRYRTQKGEVATNILGICDTKGNFVFVLIWWEGSAADSSIVRDVMSRPNGLKVPKGNYYLCDVGYPNVEGFLAPYRGQRYHLQECHGAENAPTTAKEF

Query:  FNKKHSFARK------------------------------------------------------------------------------------------
        FN KH  AR                                                                                           
Subjt:  FNKKHSFARK------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------SHLAAKCLLNKSFPYYDELSYVLKKDHTTGTCTKTFPDVGSNVSGSFQPYPGEDGNDMEIPNMYSQGVPMSPEDIQGTRPDRANECRTASS
                 SH AAK LLNKSF +YDELSYV  KD  TG   ++F D+GSN    +     +   D + P MYS G+ MSP+D+  TR  R +E R  SS
Subjt:  ---------SHLAAKCLLNKSFPYYDELSYVLKKDHTTGTCTKTFPDVGSNVSGSFQPYPGEDGNDMEIPNMYSQGVPMSPEDIQGTRPDRANECRTASS

Query:  GSKRKRGGQNVETVKIIRSAMEYANDQLKAIAEWPQLQQQDKSITCATVVSQLQEIPALSRLDKVCCMRILMQNMDDMKAFLNVSNELKLDYWTVIL
        GSKRKR G   ++  I+R+A+EY N+QL  IAEWP LQ+QD + T   +V  L+ IP L+ +D+   MRILM+N+DDMKAFL V + +K  Y ++IL
Subjt:  GSKRKRGGQNVETVKIIRSAMEYANDQLKAIAEWPQLQQQDKSITCATVVSQLQEIPALSRLDKVCCMRILMQNMDDMKAFLNVSNELKLDYWTVIL

SwissProt top hits
No hits found
Arabidopsis top hits
AT1G43722.1 unknown protein; e value: 2.4e-24; % identity: 30.74
Query:  FVILCHLLRTIAGLAATEVVDVEEMVAIFLHVLAHDVKNRLVQREFVRSGEIISRHFNLVLLAVVCLHDELLKKPQ-------PITNTCTDPHWQSFENC
        F  LC++L+T   L  T  + +EE VA+FL +  H+   R V   F R+ E + R F  VL A   L  + ++ P        P        +W  F   
Subjt:  FVILCHLLRTIAGLAATEVVDVEEMVAIFLHVLAHDVKNRLVQREFVRSGEIISRHFNLVLLAVVCLHDELLKKPQ-------PITNTCTDPHWQSFENC

Query:  LGALDDTYIKVNVMANDQPRYRTQKGEVATNILGICDTKGNFVFVLIWWEGSAADSSIVRDVMSRPNGLKVPKG-NYYLCDVGYPNVEGFLAPYRGQ---
        +GA+D T++ V V  + Q  Y  +    + NI+ ICD K  F ++     GS  D+++++      +   +P    YYL D GYPN +G LAPYR     
Subjt:  LGALDDTYIKVNVMANDQPRYRTQKGEVATNILGICDTKGNFVFVLIWWEGSAADSSIVRDVMSRPNGLKVPKG-NYYLCDVGYPNVEGFLAPYRGQ---

Query:  --RYHLQECHGAENAPTTAKEFFNKKHSFAR
          RYH+ + +     P    E FN+ H+  R
Subjt:  --RYHLQECHGAENAPTTAKEFFNKKHSFAR

AT5G28730.1 unknown protein; e value: 1.2e-12; % identity: 27.4
Query:  MDRRTFVILCHLLRTIAGLAATEVVDVEEMVAIFLHVLAHDVKNRLVQREFVRSGEIISRHFNLVL-----LAVVCLHDELLKKPQPITNTCTDP--HWQ
        M    F  LC +L    GL ++  + ++E VAIFL + A +   R +   F  + E I R F+ VL     LAV  +    +++ + I+N   D   +W 
Subjt:  MDRRTFVILCHLLRTIAGLAATEVVDVEEMVAIFLHVLAHDVKNRLVQREFVRSGEIISRHFNLVL-----LAVVCLHDELLKKPQPITNTCTDP--HWQ

Query:  SFENCLGALDDTYIKVNVMANDQPRYRTQKGEVATNILGICDTKGNFVFVLIWWEGSAADSSIVRDVMSRPNGLKV-PKGNYYLCDVGYPNVEGFLAPYR
           + LG                          + N+L ICD    F +  +   GS  D+ ++   +S      V P   YYL D GY N  G+LAPYR
Subjt:  SFENCLGALDDTYIKVNVMANDQPRYRTQKGEVATNILGICDTKGNFVFVLIWWEGSAADSSIVRDVMSRPNGLKV-PKGNYYLCDVGYPNVEGFLAPYR

Query:  GQRYHLQE
         +    Q+
Subjt:  GQRYHLQE

AT5G28950.1 unknown protein; e value: 1.9e-13; % identity: 41.03
Query:  FENCLGALDDTYIKVNVMANDQPRYRTQKGEVATNILGICDTKGNFVFVLIWWEGSAADSSIVRDVMSR-PNGLKVPK
        F++C+GA+DDT+I   V     P +R +KG+++ N+L  C+    F++VL  WEGSA DS ++ D ++R  N L VP+
Subjt:  FENCLGALDDTYIKVNVMANDQPRYRTQKGEVATNILGICDTKGNFVFVLIWWEGSAADSSIVRDVMSR-PNGLKVPK

AT5G35695.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912); e value: 1.2e-12; % identity: 29.95
Query:  FVFVLIWWEGSAADSSIVRDVMSRPNGLKVPKGNYYLCDVGYPNVEGFLAPYRGQRYHLQECHGAENAPTTAKEFFNKKHSFARK------SHLAAKCLL
        F++VL  WEGSA DS ++ D + +          +YL D G+ N   FLAP+RG RYHLQE  G    P T  E FN +H   R           ++  +
Subjt:  FVFVLIWWEGSAADSSIVRDVMSRPNGLKVPKGNYYLCDVGYPNVEGFLAPYRGQRYHLQECHGAENAPTTAKEFFNKKHSFARK------SHLAAKCLL

Query:  NKSFPYYDELSYVLKKDHTTGTCTKTFPDVGSNVSGSFQPYPGEDGNDMEIPNMYSQGVPMSPEDIQGTRPDRANECRTASSGSKRK
         KS P +   SY  K+     TC      +          +P E GN+ ++ N  ++G  M+  +I    P  A +    ++   RK
Subjt:  NKSFPYYDELSYVLKKDHTTGTCTKTFPDVGSNVSGSFQPYPGEDGNDMEIPNMYSQGVPMSPEDIQGTRPDRANECRTASSGSKRK

AT5G41980.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912); e value: 5.6e-37; % identity: 36.36
Query:  MDRRTFVILCHLLRTIAGLAATEVVDVEEMVAIFLHVLAHDVKNRLVQREFVRSGEIISRHFNLVLLAVVCLHDELLKKPQPITNTCT----DPHWQSFE
        MD+  F  LC LL+T   L  T  + +E  +AIFL ++ H+++ R VQ  F  SGE ISRHFN VL AV+ +  +     QP +N+ T    DP+   F+
Subjt:  MDRRTFVILCHLLRTIAGLAATEVVDVEEMVAIFLHVLAHDVKNRLVQREFVRSGEIISRHFNLVLLAVVCLHDELLKKPQPITNTCT----DPHWQSFE

Query:  NCLGALDDTYIKVNVMANDQPRYRTQKGEVATNILGICDTKGNFVFVLIWWEGSAADSSIVRDVMSRPNGLKVPKGNYYLCDVGYPNVEGFLAPYRGQRY
        +C+G +D  +I V V  ++Q  +R   G +  N+L        F +VL  WEGSA+D  ++   ++R N L+VP+G YY+ D  YPN+ GF+APY G   
Subjt:  NCLGALDDTYIKVNVMANDQPRYRTQKGEVATNILGICDTKGNFVFVLIWWEGSAADSSIVRDVMSRPNGLKVPKGNYYLCDVGYPNVEGFLAPYRGQRY

Query:  HLQECHGAENAPTTAKEFFNKKHSFARKSHLAAKCLLNKSFP
               + N+   AKE FN++H    ++       L + FP
Subjt:  HLQECHGAENAPTTAKEFFNKKHSFARKSHLAAKCLLNKSFP


Sequences
CDS sequence
ATGGACAGAAGAACTTTTGTCATTCTGTGCCATTTGCTTAGGACAATCGCTGGATTAGCAGCGACTGAAGTCGTCGATGTTGAGGAGATGGTTGCCATATTCCTACATGT
GCTGGCCCACGATGTGAAGAATCGATTGGTCCAGAGGGAGTTCGTGCGATCCGGTGAGATAATTTCGAGGCATTTTAACCTGGTGTTGTTGGCTGTTGTATGTCTGCATG
ACGAGTTGCTGAAAAAACCACAACCAATAACGAACACGTGCACAGATCCCCACTGGCAAAGTTTCGAGAATTGCCTTGGCGCATTAGACGACACGTACATCAAAGTGAAC
GTAATGGCAAATGATCAGCCAAGATATAGAACGCAAAAGGGAGAAGTTGCGACGAACATCTTGGGAATCTGTGACACGAAAGGAAATTTTGTCTTCGTGCTAATCTGGTG
GGAAGGATCCGCAGCTGACTCGAGCATTGTTCGAGATGTTATGTCAAGACCGAACGGCCTGAAGGTTCCCAAGGGAAACTACTACTTATGTGATGTGGGTTACCCCAACG
TAGAAGGATTCCTGGCTCCGTACAGAGGACAGAGATACCACTTGCAGGAGTGCCATGGAGCGGAAAATGCTCCAACAACAGCGAAAGAATTCTTCAACAAGAAACATTCT
TTCGCACGTAAGAGTCATCTTGCAGCAAAGTGTCTTTTGAACAAATCGTTTCCATACTACGACGAACTGTCCTATGTCTTAAAAAAAGATCACACTACAGGCACATGCAC
GAAGACTTTCCCGGATGTCGGTTCGAATGTGTCGGGCAGTTTCCAGCCGTACCCAGGTGAAGATGGAAACGATATGGAGATCCCAAATATGTACAGCCAGGGAGTTCCAA
TGTCACCCGAAGACATACAAGGAACACGGCCTGATCGGGCGAATGAGTGTAGGACGGCTTCGAGCGGTTCAAAGAGGAAGCGGGGAGGCCAAAATGTGGAAACTGTGAAA
ATCATTCGTAGTGCCATGGAATATGCGAATGACCAACTCAAGGCAATTGCAGAATGGCCTCAGTTACAACAACAAGACAAAAGTATAACCTGTGCGACAGTCGTTAGTCA
GTTACAGGAAATCCCTGCACTATCAAGACTCGACAAGGTGTGTTGTATGCGGATACTGATGCAAAACATGGACGACATGAAAGCATTCTTGAACGTATCCAACGAGCTTA
AGTTGGACTACTGGACGGTCATTCTATAA
mRNA sequence
ATGGACAGAAGAACTTTTGTCATTCTGTGCCATTTGCTTAGGACAATCGCTGGATTAGCAGCGACTGAAGTCGTCGATGTTGAGGAGATGGTTGCCATATTCCTACATGT
GCTGGCCCACGATGTGAAGAATCGATTGGTCCAGAGGGAGTTCGTGCGATCCGGTGAGATAATTTCGAGGCATTTTAACCTGGTGTTGTTGGCTGTTGTATGTCTGCATG
ACGAGTTGCTGAAAAAACCACAACCAATAACGAACACGTGCACAGATCCCCACTGGCAAAGTTTCGAGAATTGCCTTGGCGCATTAGACGACACGTACATCAAAGTGAAC
GTAATGGCAAATGATCAGCCAAGATATAGAACGCAAAAGGGAGAAGTTGCGACGAACATCTTGGGAATCTGTGACACGAAAGGAAATTTTGTCTTCGTGCTAATCTGGTG
GGAAGGATCCGCAGCTGACTCGAGCATTGTTCGAGATGTTATGTCAAGACCGAACGGCCTGAAGGTTCCCAAGGGAAACTACTACTTATGTGATGTGGGTTACCCCAACG
TAGAAGGATTCCTGGCTCCGTACAGAGGACAGAGATACCACTTGCAGGAGTGCCATGGAGCGGAAAATGCTCCAACAACAGCGAAAGAATTCTTCAACAAGAAACATTCT
TTCGCACGTAAGAGTCATCTTGCAGCAAAGTGTCTTTTGAACAAATCGTTTCCATACTACGACGAACTGTCCTATGTCTTAAAAAAAGATCACACTACAGGCACATGCAC
GAAGACTTTCCCGGATGTCGGTTCGAATGTGTCGGGCAGTTTCCAGCCGTACCCAGGTGAAGATGGAAACGATATGGAGATCCCAAATATGTACAGCCAGGGAGTTCCAA
TGTCACCCGAAGACATACAAGGAACACGGCCTGATCGGGCGAATGAGTGTAGGACGGCTTCGAGCGGTTCAAAGAGGAAGCGGGGAGGCCAAAATGTGGAAACTGTGAAA
ATCATTCGTAGTGCCATGGAATATGCGAATGACCAACTCAAGGCAATTGCAGAATGGCCTCAGTTACAACAACAAGACAAAAGTATAACCTGTGCGACAGTCGTTAGTCA
GTTACAGGAAATCCCTGCACTATCAAGACTCGACAAGGTGTGTTGTATGCGGATACTGATGCAAAACATGGACGACATGAAAGCATTCTTGAACGTATCCAACGAGCTTA
AGTTGGACTACTGGACGGTCATTCTATAA
Protein sequence
MDRRTFVILCHLLRTIAGLAATEVVDVEEMVAIFLHVLAHDVKNRLVQREFVRSGEIISRHFNLVLLAVVCLHDELLKKPQPITNTCTDPHWQSFENCLGALDDTYIKVN
VMANDQPRYRTQKGEVATNILGICDTKGNFVFVLIWWEGSAADSSIVRDVMSRPNGLKVPKGNYYLCDVGYPNVEGFLAPYRGQRYHLQECHGAENAPTTAKEFFNKKHS
FARKSHLAAKCLLNKSFPYYDELSYVLKKDHTTGTCTKTFPDVGSNVSGSFQPYPGEDGNDMEIPNMYSQGVPMSPEDIQGTRPDRANECRTASSGSKRKRGGQNVETVK
IIRSAMEYANDQLKAIAEWPQLQQQDKSITCATVVSQLQEIPALSRLDKVCCMRILMQNMDDMKAFLNVSNELKLDYWTVIL
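
The protein sequence in this record should correspond to a codon-by-codon translation of the CDS. The following is a minimal sketch of that check, assuming the standard genetic code (NCBI translation table 1); the function name `translate` is illustrative and not part of CuGenDB. Only the first 30 and last 12 nucleotides of the CDS are used here to keep the example short.

```python
# Standard genetic code (NCBI translation table 1), with codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 30 nt and last 12 nt of the CDS listed in this record.
start_peptide = translate("ATGGACAGAAGAACTTTTGTCATTCTGTGC")
end_peptide = translate("GTCATTCTATAA")
print(start_peptide, end_peptide)
```

The two fragments translate to the first ten residues of the protein (MDRRTFVILC) and to its last three residues followed by the stop codon (VIL*). Passing the full CDS, joined into one string, would allow the whole protein to be compared in the same way.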