CuGenDBv2

Lag0000364 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0000364
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase
Genome location: chr4:5118731..5121222
RNA-Seq Expression: Lag0000364
Synteny: Lag0000364
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR001969 - Aspartic peptidase, active site
IPR021109 - Aspartic peptidase domain superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily
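
The genome location in this record uses a `chromosome:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state its convention), the location can be parsed and its span measured as shown here. The helper names `parse_location` and `span_length` are hypothetical and not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split a string like 'chr4:5118731..5121222' into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1

chrom, start, end = parse_location("chr4:5118731..5121222")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene model spans 2492 bp on chr4.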


Homology
GenBank top hits | e value | %identity (alignment follows each hit)
KAA0054966.1 transposon Ty3-I Gag-Pol polyprotein isoform X1 [Cucumis melo var. makuwa] | 2.2e-157 | 41.49
Query:  KEAESTTVLSPRSTTVRLLSVEQDTKILKEDVGEIKKILEMICEKMGCRTDQQVFDSRTHIAAEKRQQDYRGDDSRTRQWQERQFTEQKMIQEPIPAPRF
        +EA  T +LSPR+++  L SVE         + EI+++L  +  ++     Q   + R       +    RG         +R F E+ + ++    P+ 
Subjt:  KEAESTTVLSPRSTTVRLLSVEQDTKILKEDVGEIKKILEMICEKMGCRTDQQVFDSRTHIAAEKRQQDYRGDDSRTRQWQERQFTEQKMIQEPIPAPRF

Query:  QQDYHSGMQEFKQNPLFRRQPEWNGDSSSE----DEYQELQKEGRRNNGDQHHQESDFKIKIDILTYSGKMEIEAFLEWIRHVENFFNYMNTLENKKVR-
        ++D     Q         R+ E    SSSE    D+  E ++     N  Q  + S++K+KID+ +Y GK  IE FL+W+++ ENFF YM T +NKKV  
Subjt:  QQDYHSGMQEFKQNPLFRRQPEWNGDSSSE----DEYQELQKEGRRNNGDQHHQESDFKIKIDILTYSGKMEIEAFLEWIRHVENFFNYMNTLENKKVR-

Query:  -------------------------------------------------------------------------------------------YKGGLRYDI
                                                                                                   + GGLR+D+
Subjt:  -------------------------------------------------------------------------------------------YKGGLRYDI

Query:  KEQIALQPIGYLNEAISAAATIEEQIANRFKRTYAR------------------RNTSEQGSSFTKPSG------GDKTN--------------------
        KE++ LQP  +L+EAI+ A T+EE I NR K T  R                    TSE+     + SG      G+K                      
Subjt:  KEQIALQPIGYLNEAISAAATIEEQIANRFKRTYAR------------------RNTSEQGSSFTKPSG------GDKTN--------------------

Query:  ------LQTTAAALKDNVDFQKEEIEDDTDDIVDFLEPDEGDPVSLVIQRLLLAPKTDHNYQRHALFKTCCTINGKICNVIIDSGSTENLVASKLVSSLN
               + T A  KDN D     +  + D+  + +E DEGD +S ++QR+L++PK ++  QRH+LFKT CTI GK+CNVIIDSGS+EN V+ KLV++LN
Subjt:  ------LQTTAAALKDNVDFQKEEIEDDTDDIVDFLEPDEGDPVSLVIQRLLLAPKTDHNYQRHALFKTCCTINGKICNVIIDSGSTENLVASKLVSSLN

Query:  LPLHPHPTPYKVSWIKKGEEAQVTHTSTIPLSIGASYKDQIICDVLDMDACHILLGRPWQYDEQATHMGRDNTYEFLWMGKKIVLLPINPAKQLNSNIPS
        L   PH  PYK+ WIKKG E  ++    +PLSIG SYKDQ++CDV++MD CHILLGRPWQ+D Q+ H GR+NTYEF+WM KK++LLP+   K  N    +
Subjt:  LPLHPHPTPYKVSWIKKGEEAQVTHTSTIPLSIGASYKDQIICDVLDMDACHILLGRPWQYDEQATHMGRDNTYEFLWMGKKIVLLPINPAKQLNSNIPS

Query:  SKKGPLFTLTLGKSFYSAKQFPILGLVVKNFSDHDSTDPIPSELHTLLQEFPKIVHNPIDLPPLRDIQHSIDFLPGATLPNLPHYKMSPSEYQILHNQIQ
         KKG LF    GK F   ++  ILG+V+    D    + IP  +  L +++PKI   P  LPPLRDI H+I+ L GA+ P+LPHY MSP+EY+ILH+ I+
Subjt:  SKKGPLFTLTLGKSFYSAKQFPILGLVVKNFSDHDSTDPIPSELHTLLQEFPKIVHNPIDLPPLRDIQHSIDFLPGATLPNLPHYKMSPSEYQILHNQIQ

Query:  ELL-EGHIQPSLSPCAVPALLTPKKDGSWRMCVDSRAINKIIVKYRFPIPCIGDLLDQLGGATIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWLV
        ELL +GHI+PS S C VPALLTPKKDG+WRMCVDSRAINKI VKYRFPIP + DLLDQLGGA IFSKIDL+S YHQIRIRPGDEWKT FKTNEGLFEWLV
Subjt:  ELL-EGHIQPSLSPCAVPALLTPKKDGSWRMCVDSRAINKIIVKYRFPIPCIGDLLDQLGGATIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWLV

Query:  MPFGLSNAPSTFMRLMN
        MPF LSNAPSTFMRLMN
Subjt:  MPFGLSNAPSTFMRLMN

XP_011648447.2 uncharacterized protein LOC105434464 [Cucumis sativus] | 1.4e-140 | 48.43
Query:  WNGDSSSEDEYQELQKEGRRNNGDQHHQESDFKIKIDILTYSGKMEIEAFLEWIRHVENFFNYMNTLENKKVRYKGGLRYDIKEQIALQPIGYLNEAISA
        WN +          + E RR  G+ H    D+K+KID+  Y GK  IEAFL+WI+  ENFFNYM+T E KKV             +AL+           
Subjt:  WNGDSSSEDEYQELQKEGRRNNGDQHHQESDFKIKIDILTYSGKMEIEAFLEWIRHVENFFNYMNTLENKKVRYKGGLRYDIKEQIALQPIGYLNEAISA

Query:  AATIEEQIANR---------FKRTYARRN-----TSEQGSSFTKPSGGDKTNLQTTAAALKDNVDF----------------------------------
        A T+EE IA R         +K T  R N     T++Q S+ TK  G +  N Q      K+   F                                  
Subjt:  AATIEEQIANR---------FKRTYARRN-----TSEQGSSFTKPSGGDKTNLQTTAAALKDNVDF----------------------------------

Query:  --------QKEEIEDDTDDIVDFLEPDEGDPVSLVIQRLLLAPKTDHNYQRHALFKTCCTINGKICNVIIDSGSTENLVASKLVSSLNLPLHPHPTPYKV
                Q  E     +D ++ +E D+G+ VS VIQR+L+ PK +   QRH LFK  CTING++C+VIID+ S++N VA KLV+ LNL    HPT YK+
Subjt:  --------QKEEIEDDTDDIVDFLEPDEGDPVSLVIQRLLLAPKTDHNYQRHALFKTCCTINGKICNVIIDSGSTENLVASKLVSSLNLPLHPHPTPYKV

Query:  SWIKKGEEAQVTHTSTIPLSIGASYKDQIICDVLDMDACHILLGRPWQYDEQATHMGRDNTYEFLWMGKKIVLLPINPAKQLNSNIPSSKKGPLFTLTLG
         W++K  EA V+   T+PLSI  +YKDQI+CDV++MD CH+LLGRPWQYD Q+ H GR+NTYE   MG+K+VLLPI            +K+G        
Subjt:  SWIKKGEEAQVTHTSTIPLSIGASYKDQIICDVLDMDACHILLGRPWQYDEQATHMGRDNTYEFLWMGKKIVLLPINPAKQLNSNIPSSKKGPLFTLTLG

Query:  KSFYSAKQFPILGLVVKNFSDHDSTDPIPSELHTLLQEFPKIVHNPIDLPPLRDIQHSIDFLPGATLPNLPHYKMSPSEYQILHNQIQELL-EGHIQPSL
                                 + I  EL  LL EFP+I   P  LPPLRDIQH ID +PGA+LPNL HY+MSP EY+ LH+ I+ELL +GHI+PSL
Subjt:  KSFYSAKQFPILGLVVKNFSDHDSTDPIPSELHTLLQEFPKIVHNPIDLPPLRDIQHSIDFLPGATLPNLPHYKMSPSEYQILHNQIQELL-EGHIQPSL

Query:  SPCAVPALLTPKKDGSWRMCVDSRAINKIIVKYRFPIPCIGDLLDQLGGATIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTF
        SPCAVPALLT KKDGSWRMCVDSRAIN+I VKYRF IP I DLLDQLG A+IFSKIDLKSGYHQIRIRPGDEWKTTFKT EGLFEW+VMPFGLSNAP+TF
Subjt:  SPCAVPALLTPKKDGSWRMCVDSRAINKIIVKYRFPIPCIGDLLDQLGGATIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTF

Query:  MRLMN
        MRLMN
Subjt:  MRLMN

XP_031744062.1 uncharacterized protein LOC116404773 [Cucumis sativus] | 3.0e-154 | 47.64
Query:  KEGRRN------NGDQHHQESDFKIKIDILTYSGKMEIEAFLEWIRHVENFFNYMNTLENKKV-----RYKGGL-----RYDI-KEQIALQPI-------
        + GR N       G+ H    D+K+KID+  Y GK  IEAFL+WI+  ENFFNYM+T E KKV     + + G      + +I +++   QPI       
Subjt:  KEGRRN------NGDQHHQESDFKIKIDILTYSGKMEIEAFLEWIRHVENFFNYMNTLENKKV-----RYKGGL-----RYDI-KEQIALQPI-------

Query:  -------------------------------GYLNE------------------AISAAATIEEQIANRFKR---------TYARRNTSEQGSSFTKPSG
                                        Y+ E                  A     T+EE IA R K          T  +  T++Q S+ TK  G
Subjt:  -------------------------------GYLNE------------------AISAAATIEEQIANRFKR---------TYARRNTSEQGSSFTKPSG

Query:  GDKTNLQTTAAALKDNV-----------------------------------------DFQKEEIEDDTDDIVDFLEPDEGDPVSLVIQRLLLAPKTDHN
         +  N +      K+                                             Q  E   + ++  + +E D+G+ VS  IQR+L+ PK + N
Subjt:  GDKTNLQTTAAALKDNV-----------------------------------------DFQKEEIEDDTDDIVDFLEPDEGDPVSLVIQRLLLAPKTDHN

Query:  YQRHALFKTCCTINGKICNVIIDSGSTENLVASKLVSSLNLPLHPHPTPYKVSWIKKGEEAQVTHTSTIPLSIGASYKDQIICDVLDMDACHILLGRPWQ
         QRH LFKT CTING++C+VIIDSGS+EN VA KLV  LNL    HPTPYK+ W++KG EA V+   T+PLSIG +YKDQI+CDV++MD CH+LLGRPWQ
Subjt:  YQRHALFKTCCTINGKICNVIIDSGSTENLVASKLVSSLNLPLHPHPTPYKVSWIKKGEEAQVTHTSTIPLSIGASYKDQIICDVLDMDACHILLGRPWQ

Query:  YDEQATHMGRDNTYEFLWMGKKIVLLPINPAKQLNSNIPSSKKGPLFTLTLGKSFYSAKQFPILGLVVKNFSDHDSTDPIPSELHTLLQEFPKIVHNPID
        YD Q+ H GR+NTYEF WMG+K+VLLPI   K++N  +   K+  LF    GK     ++  ILGLVV   +     + I  +L  LL EFP I   P  
Subjt:  YDEQATHMGRDNTYEFLWMGKKIVLLPINPAKQLNSNIPSSKKGPLFTLTLGKSFYSAKQFPILGLVVKNFSDHDSTDPIPSELHTLLQEFPKIVHNPID

Query:  LPPLRDIQHSIDFLPGATLPNLPHYKMSPSEYQILHNQIQELL-EGHIQPSLSPCAVPALLTPKKDGSWRMCVDSRAINKIIVKYRFPIPCIGDLLDQLG
        LPPLRDIQH ID +PGA+LPNL HY+MSP EY+ILH+ I+ELL +GHI+PSLSPCAVPALLTPKKDGSWRMCVDSRAIN+I VKYRFPIP I DLLDQLG
Subjt:  LPPLRDIQHSIDFLPGATLPNLPHYKMSPSEYQILHNQIQELL-EGHIQPSLSPCAVPALLTPKKDGSWRMCVDSRAINKIIVKYRFPIPCIGDLLDQLG

Query:  GATIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMN
         A+IFSKIDLKSGYHQIR+RPGDEWKT FKTNEGLFEW+VMPFGLSNAPSTFMRLMN
Subjt:  GATIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMN

XP_038989925.1 uncharacterized protein LOC120113183 [Phoenix dactylifera] | 5.0e-133 | 41.16
Query:  GDQHHQESDFKIKIDILTYSGKMEIEAFLEWIRHVENFFNYMNTLENKKV--------------------------------------------------
        G +     DF++K+D+  ++G + IE FL+W+  VE FF+YM   + KKV                                                  
Subjt:  GDQHHQESDFKIKIDILTYSGKMEIEAFLEWIRHVENFFNYMNTLENKKV--------------------------------------------------

Query:  ------------------------------------------RYKGGLRYDIKEQIALQPIGYLNEAISAAATIEEQI------------ANRFKRTYAR
                                                  RY GGL+  I++Q+ L P+  L EA S A  +E Q             A+  + T A+
Subjt:  ------------------------------------------RYKGGLRYDIKEQIALQPIGYLNEAISAAATIEEQI------------ANRFKRTYAR

Query:  RNTSEQGSSFTKP----SGGDKTNLQTTAAALKDN---------------------------------VDFQKEE---------IEDDTDD-IVDFLEPD
          T E  ++ ++P    +     N     AA K N                                 V+ + EE          E  TDD  V+   PD
Subjt:  RNTSEQGSSFTKP----SGGDKTNLQTTAAALKDN---------------------------------VDFQKEE---------IEDDTDD-IVDFLEPD

Query:  EGDPVSLVIQRLLLAPKTDHNYQRHALFKTCCTINGKICNVIIDSGSTENLVASKLVSSLNLPLHPHPTPYKVSWIKKGEEAQVTHTSTIPLSIGASYKD
        +G P S V+QR+L APK +   QRH++FKTCCTIN  +CNVIIDSGS+EN+V+S LV ++ L    HP+PYK+ WIKKG E +V +   +PLSIG  YKD
Subjt:  EGDPVSLVIQRLLLAPKTDHNYQRHALFKTCCTINGKICNVIIDSGSTENLVASKLVSSLNLPLHPHPTPYKVSWIKKGEEAQVTHTSTIPLSIGASYKD

Query:  QIICDVLDMDACHILLGRPWQYDEQATHMGRDNTYEFLWMGKKIVLLPINPAKQLNSNIPSSKKGPLFTLTLGKSF--YSAKQF--------PILGLVVK
        +++CDV+DMDACH+LLGRPWQ+D   T  GRDNTY F W G+KI+L+P+             K  P  +   GKSF   S+ QF         ++ L+VK
Subjt:  QIICDVLDMDACHILLGRPWQYDEQATHMGRDNTYEFLWMGKKIVLLPINPAKQLNSNIPSSKKGPLFTLTLGKSF--YSAKQF--------PILGLVVK

Query:  NFSDHDSTDPIPSELHTLLQEFPKI--VHNPIDLPPLRDIQHSIDFLPGATLPNLPHYKMSPSEYQILHNQIQELL-EGHIQPSLSPCAVPALLTPKKDG
          ++      +P E   LL EF  I     P  LPP+RDIQH ID +PGA+LPNLPHY+MSP E +IL  Q+++L+ +G I+ S+SPCAVPALLTPKKDG
Subjt:  NFSDHDSTDPIPSELHTLLQEFPKI--VHNPIDLPPLRDIQHSIDFLPGATLPNLPHYKMSPSEYQILHNQIQELL-EGHIQPSLSPCAVPALLTPKKDG

Query:  SWRMCVDSRAINKIIVKYRFPIPCIGDLLDQLGGATIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMN
        SWRMCVDSRAIN+I VKYRFPIP + D+LD L GA +FSKIDL+SGYHQIRIRPGDEWKT FKT +GL+EW+VMPFGLSNAPSTFMR MN
Subjt:  SWRMCVDSRAINKIIVKYRFPIPCIGDLLDQLGGATIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMN

XP_040994264.1 uncharacterized protein LOC121240799 [Juglans microcarpa x Juglans regia] | 3.8e-133 | 39.89
Query:  SRTRQWQERQFTEQKMIQE--------PIPAPRFQQDYHSGMQEFKQNPLFRRQPEWNGDSSSEDEYQELQKEGRRNNGDQHHQESDFKIKIDILTYSGK
        ++ RQ QER    ++M+QE         +   R Q +   G     + P   +Q     +SSSE+E + L   G +   D +   ++FKIKID+  ++G 
Subjt:  SRTRQWQERQFTEQKMIQE--------PIPAPRFQQDYHSGMQEFKQNPLFRRQPEWNGDSSSEDEYQELQKEGRRNNGDQHHQESDFKIKIDILTYSGK

Query:  MEIEAFLEWIRHVENFFNYMNTLENKKV------------------------------------------------------------------------
        + +E+FL+W+  VENFF+YM   E ++V                                                                        
Subjt:  MEIEAFLEWIRHVENFFNYMNTLENKKV------------------------------------------------------------------------

Query:  --------------------RYKGGLRYDIKEQIALQPIGYLNEAISAAATIEEQI------------------------------ANRFKRT-----YA
                            RY GGLR  I++++ L  +  L+EA++ A  IE Q+                              ++R +RT       
Subjt:  --------------------RYKGGLRYDIKEQIALQPIGYLNEAISAAATIEEQI------------------------------ANRFKRT-----YA

Query:  RRNTSEQG-------SSFTKPSGG-----DKTNLQTTAAALKDNV-----DFQKEEIEDDTDDIVDFLEPDEGDPVSLVIQRLLLAPKTDHNYQRHALFK
        + NT   G       + + KP  G     ++   ++     + +V     D   +E +D++++  +F+E DEGD V+ VIQRLLLAPK + + QRH +FK
Subjt:  RRNTSEQG-------SSFTKPSGG-----DKTNLQTTAAALKDNV-----DFQKEEIEDDTDDIVDFLEPDEGDPVSLVIQRLLLAPKTDHNYQRHALFK

Query:  TCCTINGKICNVIIDSGSTENLVASKLVSSLNLPLHPHPTPYKVSWIKKGEEAQVTHTSTIPLSIGASYKDQIICDVLDMDACHILLGRPWQYDEQATHM
        T CT+N K+CN+IIDSGS EN+V+  LVS+L L    HP PYK++WIKKG E +VT T  IP SIG  Y D + CDV++MDACH+LLGRPWQYD  AT+ 
Subjt:  TCCTINGKICNVIIDSGSTENLVASKLVSSLNLPLHPHPTPYKVSWIKKGEEAQVTHTSTIPLSIGASYKDQIICDVLDMDACHILLGRPWQYDEQATHM

Query:  GRDNTYEFLWMGKKIVLLPINPAKQLNSNIPSSKKGPLFTLTLGKSFYSAKQF-PILGLVVKNFSDHDSTDPIPSELHTLLQEFPKIVHN--PIDLPPLR
        GRDNTY F W  +K+VLLP    ++ +      KK    T++  +    AK+   IL LVVK  ++   ++  P  +  LL+EF  I     P  LPPLR
Subjt:  GRDNTYEFLWMGKKIVLLPINPAKQLNSNIPSSKKGPLFTLTLGKSFYSAKQF-PILGLVVKNFSDHDSTDPIPSELHTLLQEFPKIVHN--PIDLPPLR

Query:  DIQHSIDFLPGATLPNLPHYKMSPSEYQILHNQIQELL-EGHIQPSLSPCAVPALLTPKKDGSWRMCVDSRAINKIIVKYRFPIPCIGDLLDQLGGATIF
        DIQH ID +PG +LPNLPHY+MSP+E++IL +Q+++L+ +G I+ S+SPCAVPALL PKKDGSWRMCVDSRAINKI VKYRFPIP + D+LD L G+ +F
Subjt:  DIQHSIDFLPGATLPNLPHYKMSPSEYQILHNQIQELL-EGHIQPSLSPCAVPALLTPKKDGSWRMCVDSRAINKIIVKYRFPIPCIGDLLDQLGGATIF

Query:  SKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMN
        SK+DL+SGYHQIR+RPGDEWKT FKT EGL+EWLVMPFGLSNAPSTFMR+M+
Subjt:  SKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMN

TrEMBL top hits | e value | %identity (alignment follows each hit)
A0A5B7BER3 Uncharacterized protein | 5.4e-133 | 40.48
Query:  RRQPEWNGDSSSEDEYQELQKEGRRNNGDQHHQESDFKIKIDILTYSGKMEIEAFLEWIRHVENFFNYM-------------------------------
        R  P +  +S S++E+ +   +       +     ++++KID+ +++G + IE+FL+WI  VE FF+ M                               
Subjt:  RRQPEWNGDSSSEDEYQELQKEGRRNNGDQHHQESDFKIKIDILTYSGKMEIEAFLEWIRHVENFFNYM-------------------------------

Query:  ---------------------------------------------------NTL---------ENKKV-RYKGGLRYDIKEQIALQPIGYLNEAISAAAT
                                                           NTL         EN++V RY GGLR  I++Q+ L+ I  LNEA S A  
Subjt:  ---------------------------------------------------NTL---------ENKKV-RYKGGLRYDIKEQIALQPIGYLNEAISAAAT

Query:  IEEQIANR------FKRTYARRNTSEQ-----------------------------------------------GSSFTKPSGGDKTN-------LQTTA
        +E Q + +        R+Y   + ++Q                                               G  F     G ++N       +    
Subjt:  IEEQIANR------FKRTYARRNTSEQ-----------------------------------------------GSSFTKPSGGDKTN-------LQTTA

Query:  AALKDNVDFQKEEIEDDTDDI--VDFLEPDEGDPVSLVIQRLLLAPKTDHNYQRHALFKTCCTINGKICNVIIDSGSTENLVASKLVSSLNLPLHPHPTP
            ++ DF+ EE  +  D+    +  E DEG+ VS V+QRLLL PK + + QRH +F+T CTIN K+C+VIIDSGS+EN+V+  LV +L L    HP P
Subjt:  AALKDNVDFQKEEIEDDTDDI--VDFLEPDEGDPVSLVIQRLLLAPKTDHNYQRHALFKTCCTINGKICNVIIDSGSTENLVASKLVSSLNLPLHPHPTP

Query:  YKVSWIKKGEEAQVTHTSTIPLSIGASYKDQIICDVLDMDACHILLGRPWQYDEQATHMGRDNTYEFLWMGKKIVLLPINPAKQLNSNIPSSKK---GPL
        YK+ WIKKG E +VT    +P SIG  YKD++ CD++DMDACH+LLGRPWQ+D  ATH G+DNTY F W  KK+VL+P     +  SN+P + K     L
Subjt:  YKVSWIKKGEEAQVTHTSTIPLSIGASYKDQIICDVLDMDACHILLGRPWQYDEQATHMGRDNTYEFLWMGKKIVLLPINPAKQLNSNIPSSKK---GPL

Query:  FTLTLGKSFYSAKQF-PILGLVVKNFSDHDSTDPIPSELHTLLQEFPKIVHN--PIDLPPLRDIQHSIDFLPGATLPNLPHYKMSPSEYQILHNQIQELL
         T+   +    AK+   I+ ++VK  +  +  D +P  L  LL EF  I  +  P  LPP+RDIQH ID +PGA+LPNLPHY+MSP E +IL  Q+++L+
Subjt:  FTLTLGKSFYSAKQF-PILGLVVKNFSDHDSTDPIPSELHTLLQEFPKIVHN--PIDLPPLRDIQHSIDFLPGATLPNLPHYKMSPSEYQILHNQIQELL

Query:  -EGHIQPSLSPCAVPALLTPKKDGSWRMCVDSRAINKIIVKYRFPIPCIGDLLDQLGGATIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWLVMPF
         +G IQ S+SPCAVPALLTPKKDGSWRMCVDSRAINKI VKYRFPIP + D+LD L G+ IFSKIDL+SGYHQIRIRPGDEWKT FKT EGL+EWLVMPF
Subjt:  -EGHIQPSLSPCAVPALLTPKKDGSWRMCVDSRAINKIIVKYRFPIPCIGDLLDQLGGATIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWLVMPF

Query:  GLSNAPSTFMRLMN
        GLSNAPSTFMR+MN
Subjt:  GLSNAPSTFMRLMN

A0A5D3DGR0 Reverse transcriptase | 1.1e-157 | 41.49
Query:  KEAESTTVLSPRSTTVRLLSVEQDTKILKEDVGEIKKILEMICEKMGCRTDQQVFDSRTHIAAEKRQQDYRGDDSRTRQWQERQFTEQKMIQEPIPAPRF
        +EA  T +LSPR+++  L SVE         + EI+++L  +  ++     Q   + R       +    RG         +R F E+ + ++    P+ 
Subjt:  KEAESTTVLSPRSTTVRLLSVEQDTKILKEDVGEIKKILEMICEKMGCRTDQQVFDSRTHIAAEKRQQDYRGDDSRTRQWQERQFTEQKMIQEPIPAPRF

Query:  QQDYHSGMQEFKQNPLFRRQPEWNGDSSSE----DEYQELQKEGRRNNGDQHHQESDFKIKIDILTYSGKMEIEAFLEWIRHVENFFNYMNTLENKKVR-
        ++D     Q         R+ E    SSSE    D+  E ++     N  Q  + S++K+KID+ +Y GK  IE FL+W+++ ENFF YM T +NKKV  
Subjt:  QQDYHSGMQEFKQNPLFRRQPEWNGDSSSE----DEYQELQKEGRRNNGDQHHQESDFKIKIDILTYSGKMEIEAFLEWIRHVENFFNYMNTLENKKVR-

Query:  -------------------------------------------------------------------------------------------YKGGLRYDI
                                                                                                   + GGLR+D+
Subjt:  -------------------------------------------------------------------------------------------YKGGLRYDI

Query:  KEQIALQPIGYLNEAISAAATIEEQIANRFKRTYAR------------------RNTSEQGSSFTKPSG------GDKTN--------------------
        KE++ LQP  +L+EAI+ A T+EE I NR K T  R                    TSE+     + SG      G+K                      
Subjt:  KEQIALQPIGYLNEAISAAATIEEQIANRFKRTYAR------------------RNTSEQGSSFTKPSG------GDKTN--------------------

Query:  ------LQTTAAALKDNVDFQKEEIEDDTDDIVDFLEPDEGDPVSLVIQRLLLAPKTDHNYQRHALFKTCCTINGKICNVIIDSGSTENLVASKLVSSLN
               + T A  KDN D     +  + D+  + +E DEGD +S ++QR+L++PK ++  QRH+LFKT CTI GK+CNVIIDSGS+EN V+ KLV++LN
Subjt:  ------LQTTAAALKDNVDFQKEEIEDDTDDIVDFLEPDEGDPVSLVIQRLLLAPKTDHNYQRHALFKTCCTINGKICNVIIDSGSTENLVASKLVSSLN

Query:  LPLHPHPTPYKVSWIKKGEEAQVTHTSTIPLSIGASYKDQIICDVLDMDACHILLGRPWQYDEQATHMGRDNTYEFLWMGKKIVLLPINPAKQLNSNIPS
        L   PH  PYK+ WIKKG E  ++    +PLSIG SYKDQ++CDV++MD CHILLGRPWQ+D Q+ H GR+NTYEF+WM KK++LLP+   K  N    +
Subjt:  LPLHPHPTPYKVSWIKKGEEAQVTHTSTIPLSIGASYKDQIICDVLDMDACHILLGRPWQYDEQATHMGRDNTYEFLWMGKKIVLLPINPAKQLNSNIPS

Query:  SKKGPLFTLTLGKSFYSAKQFPILGLVVKNFSDHDSTDPIPSELHTLLQEFPKIVHNPIDLPPLRDIQHSIDFLPGATLPNLPHYKMSPSEYQILHNQIQ
         KKG LF    GK F   ++  ILG+V+    D    + IP  +  L +++PKI   P  LPPLRDI H+I+ L GA+ P+LPHY MSP+EY+ILH+ I+
Subjt:  SKKGPLFTLTLGKSFYSAKQFPILGLVVKNFSDHDSTDPIPSELHTLLQEFPKIVHNPIDLPPLRDIQHSIDFLPGATLPNLPHYKMSPSEYQILHNQIQ

Query:  ELL-EGHIQPSLSPCAVPALLTPKKDGSWRMCVDSRAINKIIVKYRFPIPCIGDLLDQLGGATIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWLV
        ELL +GHI+PS S C VPALLTPKKDG+WRMCVDSRAINKI VKYRFPIP + DLLDQLGGA IFSKIDL+S YHQIRIRPGDEWKT FKTNEGLFEWLV
Subjt:  ELL-EGHIQPSLSPCAVPALLTPKKDGSWRMCVDSRAINKIIVKYRFPIPCIGDLLDQLGGATIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWLV

Query:  MPFGLSNAPSTFMRLMN
        MPF LSNAPSTFMRLMN
Subjt:  MPFGLSNAPSTFMRLMN

A0A5D3E417 Transposon Ty3-I Gag-Pol polyprotein isoform X1 | 1.7e-131 | 40.03
Query:  IAAEKRQQDYRGDDSRTRQWQERQFTEQKMIQEPIPAPRFQQDYHSGMQEFKQNPLFRRQPEWNGDSSSEDEYQEL--------QKEGRRNNGDQHHQES
        I A  R  + RG     R +  + F  Q+     IP  ++  D      E  QN         + DSS  DE   +          +G R    +     
Subjt:  IAAEKRQQDYRGDDSRTRQWQERQFTEQKMIQEPIPAPRFQQDYHSGMQEFKQNPLFRRQPEWNGDSSSEDEYQEL--------QKEGRRNNGDQHHQES

Query:  DFKIKIDILTYSGKMEIEAFLEWIRHVENFFNYMNTLENKKV----------------------------------------------------------
        D+K+KID+ TY+GK +IE+FL+WI++ ENFF YM   + KKV                                                          
Subjt:  DFKIKIDILTYSGKMEIEAFLEWIRHVENFFNYMNTLENKKV----------------------------------------------------------

Query:  ------RYKGGLRYDIKEQIALQPIGYLNEAISAAATIEEQIANRFKRTYAR--------------RNTSEQGS--------------------------
              R+ GGLR+DIKE++ L     L+EAIS A T+EE +  R K +  R              + T EQ S                          
Subjt:  ------RYKGGLRYDIKEQIALQPIGYLNEAISAAATIEEQIANRFKRTYAR--------------RNTSEQGS--------------------------

Query:  ---SFTKPSGG-----------DKTNLQTTAAALKDNVDFQKEEIEDDTDDIVDFLEPDEGDPVSLVIQRLLLAPKTDHNYQRHALFKTCCTINGKICNV
           ++T+PS G                Q    AL ++ D      +++ ++  + +E D+GD +S ++QR+L+ PK + N Q H+LFKT CTINGK+   
Subjt:  ---SFTKPSGG-----------DKTNLQTTAAALKDNVDFQKEEIEDDTDDIVDFLEPDEGDPVSLVIQRLLLAPKTDHNYQRHALFKTCCTINGKICNV

Query:  IIDSGSTENLVASKLVSSLNLPLHPHPTPYKVSWIKKGEEAQVTHTSTIPLSIGASYKDQIICDVLDMDACHILLGRPWQYDEQATHMGRDNTYEFLWMG
                               +PHP PYK+ W+KKG E  +    TIPLSIG SYKDQI+CDV++MD CH+LLGRPWQ+D Q  H GR+NTYEF WMG
Subjt:  IIDSGSTENLVASKLVSSLNLPLHPHPTPYKVSWIKKGEEAQVTHTSTIPLSIGASYKDQIICDVLDMDACHILLGRPWQYDEQATHMGRDNTYEFLWMG

Query:  KKIVLLPINPAKQLNSNIPSSKKGPLFTLTLGKSFYSAKQFPILGLVVKNFSDHDSTDPIPSELHTLLQEFPKIVHNPIDLPPLRDIQHSIDFLPGATLP
        KK++LLP+  AK+   +I    K  LF    GK+    ++  +LGL+V + S   +++ +   L  L  EFP +   P  LPPLRDIQH ID +P A+LP
Subjt:  KKIVLLPINPAKQLNSNIPSSKKGPLFTLTLGKSFYSAKQFPILGLVVKNFSDHDSTDPIPSELHTLLQEFPKIVHNPIDLPPLRDIQHSIDFLPGATLP

Query:  NLPHYKMSPSEYQILHNQIQELL-EGHIQPSLSPCAVPALLTPKKDGSWRMCVDSRAINKIIVKYRFPIPCIGDLLDQLGGATIFSKIDLKSGYHQIRIR
        NLPHY+MSP EYQ+LH+ I++LL +GHI+PSLSPCAVPALLTP KDGSWRMCVDSRAIN++  KYRFPIP IGDLLDQLG A IFSKIDL++GYHQI+IR
Subjt:  NLPHYKMSPSEYQILHNQIQELL-EGHIQPSLSPCAVPALLTPKKDGSWRMCVDSRAINKIIVKYRFPIPCIGDLLDQLGGATIFSKIDLKSGYHQIRIR

Query:  PGDEWKTTFKTNEGLFE
        PGDEWKT FKTNEGLFE
Subjt:  PGDEWKTTFKTNEGLFE

A0A6P3Z018 uncharacterized protein LOC107405062 | 6.1e-129 | 40.17
Query:  HSGMQEFKQNPLFRRQPEWNGDSSSEDEYQE--LQKEGRRNNGDQH-----HQESDFKIKIDILTYSGKMEIEAFLEWIRHVENFFNYMNTLENKKV---
        HSG+   K        PE +GD S+  +  +  +    R      H     HQ SD++IK+DI  + G + IE FL+W++ VE+FF YM+  E+K+V   
Subjt:  HSGMQEFKQNPLFRRQPEWNGDSSSEDEYQE--LQKEGRRNNGDQH-----HQESDFKIKIDILTYSGKMEIEAFLEWIRHVENFFNYMNTLENKKV---

Query:  -----------------------------------------------------------------------------------------RYKGGLRYDIK
                                                                                                 RY  GL   I+
Subjt:  -----------------------------------------------------------------------------------------RYKGGLRYDIK

Query:  EQIALQPIGYLNEAISAAATIEEQIANRFKRTYAR-------------------------------RNTSEQGSSFTKPSGGDKTNL------------Q
        E+I L P+  L+EA++ A  IE+QI     +T A+                               +NTS+  +   +PS     N             +
Subjt:  EQIALQPIGYLNEAISAAATIEEQIANRFKRTYAR-------------------------------RNTSEQGSSFTKPSGGDKTNL------------Q

Query:  TTAAALKDNVDFQKEEIEDDT-------DDIVDFLEPDEGDPVSLVIQRLLLAPKTDHNYQRHALFKTCCTINGKICNVIIDSGSTENLVASKLVSSLNL
        +    L+  ++    E +DD+        D  + ++ D+G+PV  +IQ+LL +PK     QRH++FKT CTI  K+C VI DSGS+EN+V+  LV +L L
Subjt:  TTAAALKDNVDFQKEEIEDDT-------DDIVDFLEPDEGDPVSLVIQRLLLAPKTDHNYQRHALFKTCCTINGKICNVIIDSGSTENLVASKLVSSLNL

Query:  PLHPHPTPYKVSWIKKGEEAQVTHTSTIPLSIGASYKDQIICDVLDMDACHILLGRPWQYDEQATHMGRDNTYEFLWMGKKIVLLPINPAKQLN------
            HP PYKV WIKKG E +VT    +  SIG  Y D+++CDV+DMDACHILLGRPWQ+D   TH GR NT+ F W GKKIVLLP  P           
Subjt:  PLHPHPTPYKVSWIKKGEEAQVTHTSTIPLSIGASYKDQIICDVLDMDACHILLGRPWQYDEQATHMGRDNTYEFLWMGKKIVLLPINPAKQLN------

Query:  -SNIPSSKKGPLFTLTLGKSF-YSAKQFPI-LGLVVKNFSDHDSTDPIPSELHTLLQEFPKIVHN--PIDLPPLRDIQHSIDFLPGATLPNLPHYKMSPS
         S    S KGP+   T GK F   AK   I  G+V    +  DS  P P  +  LLQEF +I  +  P  LPP+RDIQH+ID LPGA LPNLPHY+M P 
Subjt:  -SNIPSSKKGPLFTLTLGKSF-YSAKQFPI-LGLVVKNFSDHDSTDPIPSELHTLLQEFPKIVHN--PIDLPPLRDIQHSIDFLPGATLPNLPHYKMSPS

Query:  EYQILHNQIQELLEGH-IQPSLSPCAVPALLTPKKDGSWRMCVDSRAINKIIVKYRFPIPCIGDLLDQLGGATIFSKIDLKSGYHQIRIRPGDEWKTTFK
        E QIL   +++LL+ + I+ SLSPCAVPALL PKK+G WRMC+DSRAINKI  KYRFPIP + D+LD+L GA +FSK+DL+SGYHQIRIRPGDEWKT FK
Subjt:  EYQILHNQIQELLEGH-IQPSLSPCAVPALLTPKKDGSWRMCVDSRAINKIIVKYRFPIPCIGDLLDQLGGATIFSKIDLKSGYHQIRIRPGDEWKTTFK

Query:  TNEGLFEWLVMPFGLSNAPSTFMRLMN
        T  GL+EW VMPFGL NAPSTFMRLMN
Subjt:  TNEGLFEWLVMPFGLSNAPSTFMRLMN

A0A6P6GFU0 uncharacterized protein LOC112492819 | 4.7e-129 | 39.25
Query:  AAEKRQQDYRGDDSRTRQWQERQF----TEQKMIQEPIPAPRFQQDYHSGMQEFKQN-PLFRRQPEWNGDSSSEDEYQEL----QKEGRRNNGDQHHQES
        AA     ++ GD S   Q  +           ++Q+  P PR  +  H    + +Q  PL       + DS SEDE + +      +   N   + HQ S
Subjt:  AAEKRQQDYRGDDSRTRQWQERQF----TEQKMIQEPIPAPRFQQDYHSGMQEFKQN-PLFRRQPEWNGDSSSEDEYQEL----QKEGRRNNGDQHHQES

Query:  DFKIKIDILTYSGKMEIEAFLEWIRHVENFFNYMNTLENKKV----------------------------------------------------------
        D++IK+DI  + G + IE FL+W++ VE+FF YM+  E+K+V                                                          
Subjt:  DFKIKIDILTYSGKMEIEAFLEWIRHVENFFNYMNTLENKKV----------------------------------------------------------

Query:  ----------------------------------RYKGGLRYDIKEQIALQPIGYLNEAISAAATIEEQIANRFKRTYAR--------------------
                                          RY  GL   I+E+I L P+  L+EA++ A  IE+QI     +T A+                    
Subjt:  ----------------------------------RYKGGLRYDIKEQIALQPIGYLNEAISAAATIEEQIANRFKRTYAR--------------------

Query:  -----------RNTSEQGSSFTKPSGGDKTNL------------QTTAAALKDNVDFQKEEIEDDT-------DDIVDFLEPDEGDPVSLVIQRLLLAPK
                   +NTS+  +   +PS     N             ++    L+  V+    E +DD+        D  + ++ D+G+PV  +IQ+LL +PK
Subjt:  -----------RNTSEQGSSFTKPSGGDKTNL------------QTTAAALKDNVDFQKEEIEDDT-------DDIVDFLEPDEGDPVSLVIQRLLLAPK

Query:  TDHNYQRHALFKTCCTINGKICNVIIDSGSTENLVASKLVSSLNLPLHPHPTPYKVSWIKKGEEAQVTHTSTIPLSIGASYKDQIICDVLDMDACHILLG
             QRH++FKT CTIN K+C VIIDSGS+EN+V+  LV +L LP   HP PYKV WIKKG E +VT    +  SIG  Y D+++CDV++MDACHILLG
Subjt:  TDHNYQRHALFKTCCTINGKICNVIIDSGSTENLVASKLVSSLNLPLHPHPTPYKVSWIKKGEEAQVTHTSTIPLSIGASYKDQIICDVLDMDACHILLG

Query:  RPWQYDEQATHMGRDNTYEFLWMGKKIVLLPINPAKQLN-------SNIPSSKKGPLFTLTLGKSF-YSAKQFPI-LGLVVKNFSDHDSTDPIPSELHTL
        RPWQ+D   T  GR NT+ F W GKKIVLLP  P            S    S KGP+   T GK F   AK   I  G+V    +  DS  P P  +  L
Subjt:  RPWQYDEQATHMGRDNTYEFLWMGKKIVLLPINPAKQLN-------SNIPSSKKGPLFTLTLGKSF-YSAKQFPI-LGLVVKNFSDHDSTDPIPSELHTL

Query:  LQEFPKIVHN--PIDLPPLRDIQHSIDFLPGATLPNLPHYKMSPSEYQILHNQIQELLEGH-IQPSLSPCAVPALLTPKKDGSWRMCVDSRAINKIIVKY
        LQEF +I  +  P  LPP+RDIQH+ID LPGA LPNLPHY+M P E QIL   +++LL+ + I+ SLSPCAVPALL PKK+G WRMC+DSRAINKI  KY
Subjt:  LQEFPKIVHN--PIDLPPLRDIQHSIDFLPGATLPNLPHYKMSPSEYQILHNQIQELLEGH-IQPSLSPCAVPALLTPKKDGSWRMCVDSRAINKIIVKY

Query:  RFPIPCIGDLLDQLGGATIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMN
        RFPIP + D+LD+L GA +FSK+DL+SGYHQIRIRP DEWKT FKT  GL+EW VMPFGL NAPSTFMRLMN
Subjt:  RFPIPCIGDLLDQLGGATIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMN

SwissProt top hits | e value | %identity (alignment follows each hit)
P0CT34 Transposon Tf2-1 polyprotein | 7.7e-28 | 37.64
Query:  ELHTLLQEFPKIV--HNPIDLP-PLRDIQHSIDFLPGATLPNLPHYKMSPSEYQILHNQI-QELLEGHIQPSLSPCAVPALLTPKKDGSWRMCVDSRAIN
        EL  + +EF  I    N   LP P++ ++  ++         + +Y + P + Q ++++I Q L  G I+ S +  A P +  PKK+G+ RM VD + +N
Subjt:  ELHTLLQEFPKIV--HNPIDLP-PLRDIQHSIDFLPGATLPNLPHYKMSPSEYQILHNQI-QELLEGHIQPSLSPCAVPALLTPKKDGSWRMCVDSRAIN

Query:  KIIVKYRFPIPCIGDLLDQLGGATIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMN
        K +    +P+P I  LL ++ G+TIF+K+DLKS YH IR+R GDE K  F+   G+FE+LVMP+G+S AP+ F   +N
Subjt:  KIIVKYRFPIPCIGDLLDQLGGATIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMN

P0CT35 Transposon Tf2-2 polyprotein | 7.7e-28 | 37.64
Query:  ELHTLLQEFPKIV--HNPIDLP-PLRDIQHSIDFLPGATLPNLPHYKMSPSEYQILHNQI-QELLEGHIQPSLSPCAVPALLTPKKDGSWRMCVDSRAIN
        EL  + +EF  I    N   LP P++ ++  ++         + +Y + P + Q ++++I Q L  G I+ S +  A P +  PKK+G+ RM VD + +N
Subjt:  ELHTLLQEFPKIV--HNPIDLP-PLRDIQHSIDFLPGATLPNLPHYKMSPSEYQILHNQI-QELLEGHIQPSLSPCAVPALLTPKKDGSWRMCVDSRAIN

Query:  KIIVKYRFPIPCIGDLLDQLGGATIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMN
        K +    +P+P I  LL ++ G+TIF+K+DLKS YH IR+R GDE K  F+   G+FE+LVMP+G+S AP+ F   +N
Subjt:  KIIVKYRFPIPCIGDLLDQLGGATIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMN

P0CT41 Transposon Tf2-12 polyprotein | 7.7e-28 | 37.64
Query:  ELHTLLQEFPKIV--HNPIDLP-PLRDIQHSIDFLPGATLPNLPHYKMSPSEYQILHNQI-QELLEGHIQPSLSPCAVPALLTPKKDGSWRMCVDSRAIN
        EL  + +EF  I    N   LP P++ ++  ++         + +Y + P + Q ++++I Q L  G I+ S +  A P +  PKK+G+ RM VD + +N
Subjt:  ELHTLLQEFPKIV--HNPIDLP-PLRDIQHSIDFLPGATLPNLPHYKMSPSEYQILHNQI-QELLEGHIQPSLSPCAVPALLTPKKDGSWRMCVDSRAIN

Query:  KIIVKYRFPIPCIGDLLDQLGGATIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMN
        K +    +P+P I  LL ++ G+TIF+K+DLKS YH IR+R GDE K  F+   G+FE+LVMP+G+S AP+ F   +N
Subjt:  KIIVKYRFPIPCIGDLLDQLGGATIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMN

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein | 1.1e-39 | 43.88
Query:  VVKNFSDHDSTDPIPSELHTLLQEFPKIVHNPIDLPPLR------DIQHSIDFLPGATLPNLPHYKMSPSEYQILHNQIQELLEG-HIQPSLSPCAVPAL
        V  N +DH + D   +    L Q++ +I+ N  DLPP         ++H I+  PGA LP L  Y ++    Q ++  +Q+LL+   I PS SPC+ P +
Subjt:  VVKNFSDHDSTDPIPSELHTLLQEFPKIVHNPIDLPPLR------DIQHSIDFLPGATLPNLPHYKMSPSEYQILHNQIQELLEG-HIQPSLSPCAVPAL

Query:  LTPKKDGSWRMCVDSRAINKIIVKYRFPIPCIGDLLDQLGGATIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLM
        L PKKDG++R+CVD R +NK  +   FP+P I +LL ++G A IF+ +DL SGYHQI + P D +KT F T  G +E+ VMPFGL NAPSTF R M
Subjt:  LTPKKDGSWRMCVDSRAINKIIVKYRFPIPCIGDLLDQLGGATIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLM

Q99315 Transposon Ty3-G Gag-Pol polyprotein | 1.1e-39 | 43.88
Query:  VVKNFSDHDSTDPIPSELHTLLQEFPKIVHNPIDLPPLR------DIQHSIDFLPGATLPNLPHYKMSPSEYQILHNQIQELLEG-HIQPSLSPCAVPAL
        V  N +DH + D   +    L Q++ +I+ N  DLPP         ++H I+  PGA LP L  Y ++    Q ++  +Q+LL+   I PS SPC+ P +
Subjt:  VVKNFSDHDSTDPIPSELHTLLQEFPKIVHNPIDLPPLR------DIQHSIDFLPGATLPNLPHYKMSPSEYQILHNQIQELLEG-HIQPSLSPCAVPAL

Query:  LTPKKDGSWRMCVDSRAINKIIVKYRFPIPCIGDLLDQLGGATIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLM
        L PKKDG++R+CVD R +NK  +   FP+P I +LL ++G A IF+ +DL SGYHQI + P D +KT F T  G +E+ VMPFGL NAPSTF R M
Subjt:  LTPKKDGSWRMCVDSRAINKIIVKYRFPIPCIGDLLDQLGGATIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLM

Arabidopsis top hits | e value | %identity (alignment follows each hit)
AT4G13320.1 unknown protein | 7.2e-13 | 33.33
Query:  LFKTCCTINGKICNVIIDSGSTENLVASKLVSSLNL-PLHPHPTPYKVSWIKKGEEAQVTHTSTIPLSIGASYKDQIICDVLDM--DACHILLGRPWQYD
        +F+T C IN + C +++  G+  N+++  LV  L L  L  +P+   ++   + E+     T  +P+SIG  YKD++ C V++M  +   +L G PW Y 
Subjt:  LFKTCCTINGKICNVIIDSGSTENLVASKLVSSLNL-PLHPHPTPYKVSWIKKGEEAQVTHTSTIPLSIGASYKDQIICDVLDM--DACHILLGRPWQYD

Query:  EQATHMGRDNTYEFLWMGKKIVL
         QATH GRD++   +W    I+L
Subjt:  EQATHMGRDNTYEFLWMGKKIVL
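
In the alignments above, the unlabeled middle line marks each aligned column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. The following is a rough sketch of how a percent identity could be derived from one such block, assuming the middle line follows that common BLAST-style convention. The function name `midline_stats` and the toy strings are hypothetical, and the %identity values reported in this record are computed over the whole alignment rather than a single block.

```python
def midline_stats(query, midline):
    """Return (%identity, %positives) for one aligned block."""
    # Trailing spaces are often stripped from the middle line, so pad it
    # to the length of the query line (which includes gap characters).
    midline = midline.ljust(len(query))
    identical = sum(1 for ch in midline if ch.isalpha())
    positives = identical + midline.count("+")
    length = len(query)
    return 100.0 * identical / length, 100.0 * positives / length

# Toy block (not taken from this record): 7 columns, 4 identical, 1 conservative.
identity, positives = midline_stats("MKT-AYI", "MK  A+I")
print(f"identity={identity:.2f}% positives={positives:.2f}%")
```

To reproduce a whole-alignment figure, the identical-column counts and block lengths would need to be summed across every block of a hit before dividing.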


Sequences
CDS sequence
ATGACGACCAAGAAAGTTTCATCCAATGTAATTGGAGATTCATCCTTGGCGGGAAAAGAGGCGGAAAGCACCACCGTCCTCTCACCACGATCAACAACCGTACGCTTGCT
GTCGGTTGAACAAGATACAAAAATCTTGAAAGAAGATGTGGGTGAGATAAAGAAGATCTTGGAGATGATTTGTGAAAAAATGGGCTGCAGAACGGATCAACAAGTTTTTG
ATTCAAGAACACATATAGCAGCGGAAAAGAGACAGCAAGACTATCGAGGAGACGATTCAAGGACAAGACAATGGCAAGAGAGACAATTTACGGAGCAGAAAATGATACAA
GAACCAATTCCAGCACCAAGATTCCAGCAAGATTACCATTCTGGAATGCAAGAATTTAAACAGAATCCCCTATTTCGAAGACAACCTGAATGGAATGGGGATAGTTCGAG
TGAGGATGAATATCAAGAGCTTCAAAAGGAAGGTCGGAGGAACAATGGAGATCAACATCATCAAGAAAGCGACTTTAAGATAAAGATTGATATCCTGACATACAGTGGAA
AGATGGAAATAGAGGCTTTCTTGGAATGGATTAGACATGTGGAAAATTTTTTCAATTACATGAACACTCTCGAAAACAAGAAAGTAAGGTATAAGGGTGGCCTACGCTAT
GATATTAAAGAACAAATTGCCTTACAACCTATTGGATACTTGAATGAGGCAATTTCGGCAGCAGCAACCATTGAGGAGCAGATTGCAAACCGTTTCAAAAGAACCTATGC
AAGACGAAATACAAGTGAGCAAGGCAGTAGCTTTACCAAACCCTCAGGTGGGGACAAAACCAATCTTCAAACTACAGCTGCAGCTCTTAAAGACAATGTAGACTTTCAAA
AAGAAGAAATTGAAGATGACACCGATGACATTGTCGACTTTCTTGAGCCTGATGAGGGAGATCCGGTATCTTTAGTAATCCAAAGATTACTTCTTGCTCCTAAAACAGAC
CACAACTACCAGCGTCATGCCCTATTCAAGACCTGTTGCACCATCAATGGCAAGATATGCAATGTCATAATTGACAGCGGCAGCACGGAAAATTTGGTGGCAAGTAAACT
TGTCTCTTCCTTAAACCTCCCTTTACATCCTCATCCGACACCATACAAGGTAAGTTGGATCAAGAAAGGCGAGGAAGCACAAGTAACCCATACTAGTACGATTCCTTTAT
CCATTGGGGCGAGTTACAAAGACCAAATCATATGCGACGTATTGGACATGGACGCTTGCCACATTCTTTTGGGACGACCATGGCAATACGATGAGCAAGCAACTCATATG
GGTCGTGACAACACCTACGAGTTCCTTTGGATGGGCAAGAAAATAGTCCTTCTCCCAATCAACCCGGCCAAACAACTTAACAGCAACATTCCTTCATCTAAAAAAGGTCC
GTTATTTACTTTAACCTTAGGGAAATCTTTCTATTCTGCAAAACAATTCCCAATTCTTGGCCTAGTTGTTAAAAATTTCTCTGACCACGACTCTACTGATCCTATTCCTT
CAGAGCTGCACACTTTGCTGCAAGAATTTCCAAAAATAGTGCACAATCCAATTGATCTTCCTCCATTGCGAGATATACAGCACTCAATCGACTTTTTACCCGGCGCAACA
CTTCCTAACTTACCTCATTATAAAATGAGTCCATCTGAGTACCAAATTCTCCACAATCAAATTCAAGAACTGCTAGAAGGACATATTCAGCCAAGTCTAAGTCCTTGTGC
AGTGCCTGCTCTACTTACACCAAAAAAGGATGGCAGTTGGAGAATGTGTGTTGACAGCCGAGCTATCAATAAGATCATAGTCAAATACCGGTTTCCTATCCCGTGCATTG
GTGACTTGTTGGATCAATTAGGTGGCGCCACAATCTTCTCCAAGATTGACTTGAAGAGTGGCTATCACCAAATACGCATACGCCCTGGGGATGAATGGAAAACGACCTTC
AAGACTAACGAGGGCTTATTCGAATGGCTTGTAATGCCGTTTGGTCTGTCCAACGCTCCTAGCACATTCATGCGTTTGATGAACTAG
mRNA sequence
ATGACGACCAAGAAAGTTTCATCCAATGTAATTGGAGATTCATCCTTGGCGGGAAAAGAGGCGGAAAGCACCACCGTCCTCTCACCACGATCAACAACCGTACGCTTGCT
GTCGGTTGAACAAGATACAAAAATCTTGAAAGAAGATGTGGGTGAGATAAAGAAGATCTTGGAGATGATTTGTGAAAAAATGGGCTGCAGAACGGATCAACAAGTTTTTG
ATTCAAGAACACATATAGCAGCGGAAAAGAGACAGCAAGACTATCGAGGAGACGATTCAAGGACAAGACAATGGCAAGAGAGACAATTTACGGAGCAGAAAATGATACAA
GAACCAATTCCAGCACCAAGATTCCAGCAAGATTACCATTCTGGAATGCAAGAATTTAAACAGAATCCCCTATTTCGAAGACAACCTGAATGGAATGGGGATAGTTCGAG
TGAGGATGAATATCAAGAGCTTCAAAAGGAAGGTCGGAGGAACAATGGAGATCAACATCATCAAGAAAGCGACTTTAAGATAAAGATTGATATCCTGACATACAGTGGAA
AGATGGAAATAGAGGCTTTCTTGGAATGGATTAGACATGTGGAAAATTTTTTCAATTACATGAACACTCTCGAAAACAAGAAAGTAAGGTATAAGGGTGGCCTACGCTAT
GATATTAAAGAACAAATTGCCTTACAACCTATTGGATACTTGAATGAGGCAATTTCGGCAGCAGCAACCATTGAGGAGCAGATTGCAAACCGTTTCAAAAGAACCTATGC
AAGACGAAATACAAGTGAGCAAGGCAGTAGCTTTACCAAACCCTCAGGTGGGGACAAAACCAATCTTCAAACTACAGCTGCAGCTCTTAAAGACAATGTAGACTTTCAAA
AAGAAGAAATTGAAGATGACACCGATGACATTGTCGACTTTCTTGAGCCTGATGAGGGAGATCCGGTATCTTTAGTAATCCAAAGATTACTTCTTGCTCCTAAAACAGAC
CACAACTACCAGCGTCATGCCCTATTCAAGACCTGTTGCACCATCAATGGCAAGATATGCAATGTCATAATTGACAGCGGCAGCACGGAAAATTTGGTGGCAAGTAAACT
TGTCTCTTCCTTAAACCTCCCTTTACATCCTCATCCGACACCATACAAGGTAAGTTGGATCAAGAAAGGCGAGGAAGCACAAGTAACCCATACTAGTACGATTCCTTTAT
CCATTGGGGCGAGTTACAAAGACCAAATCATATGCGACGTATTGGACATGGACGCTTGCCACATTCTTTTGGGACGACCATGGCAATACGATGAGCAAGCAACTCATATG
GGTCGTGACAACACCTACGAGTTCCTTTGGATGGGCAAGAAAATAGTCCTTCTCCCAATCAACCCGGCCAAACAACTTAACAGCAACATTCCTTCATCTAAAAAAGGTCC
GTTATTTACTTTAACCTTAGGGAAATCTTTCTATTCTGCAAAACAATTCCCAATTCTTGGCCTAGTTGTTAAAAATTTCTCTGACCACGACTCTACTGATCCTATTCCTT
CAGAGCTGCACACTTTGCTGCAAGAATTTCCAAAAATAGTGCACAATCCAATTGATCTTCCTCCATTGCGAGATATACAGCACTCAATCGACTTTTTACCCGGCGCAACA
CTTCCTAACTTACCTCATTATAAAATGAGTCCATCTGAGTACCAAATTCTCCACAATCAAATTCAAGAACTGCTAGAAGGACATATTCAGCCAAGTCTAAGTCCTTGTGC
AGTGCCTGCTCTACTTACACCAAAAAAGGATGGCAGTTGGAGAATGTGTGTTGACAGCCGAGCTATCAATAAGATCATAGTCAAATACCGGTTTCCTATCCCGTGCATTG
GTGACTTGTTGGATCAATTAGGTGGCGCCACAATCTTCTCCAAGATTGACTTGAAGAGTGGCTATCACCAAATACGCATACGCCCTGGGGATGAATGGAAAACGACCTTC
AAGACTAACGAGGGCTTATTCGAATGGCTTGTAATGCCGTTTGGTCTGTCCAACGCTCCTAGCACATTCATGCGTTTGATGAACTAG
Protein sequence
MTTKKVSSNVIGDSSLAGKEAESTTVLSPRSTTVRLLSVEQDTKILKEDVGEIKKILEMICEKMGCRTDQQVFDSRTHIAAEKRQQDYRGDDSRTRQWQERQFTEQKMIQ
EPIPAPRFQQDYHSGMQEFKQNPLFRRQPEWNGDSSSEDEYQELQKEGRRNNGDQHHQESDFKIKIDILTYSGKMEIEAFLEWIRHVENFFNYMNTLENKKVRYKGGLRY
DIKEQIALQPIGYLNEAISAAATIEEQIANRFKRTYARRNTSEQGSSFTKPSGGDKTNLQTTAAALKDNVDFQKEEIEDDTDDIVDFLEPDEGDPVSLVIQRLLLAPKTD
HNYQRHALFKTCCTINGKICNVIIDSGSTENLVASKLVSSLNLPLHPHPTPYKVSWIKKGEEAQVTHTSTIPLSIGASYKDQIICDVLDMDACHILLGRPWQYDEQATHM
GRDNTYEFLWMGKKIVLLPINPAKQLNSNIPSSKKGPLFTLTLGKSFYSAKQFPILGLVVKNFSDHDSTDPIPSELHTLLQEFPKIVHNPIDLPPLRDIQHSIDFLPGAT
LPNLPHYKMSPSEYQILHNQIQELLEGHIQPSLSPCAVPALLTPKKDGSWRMCVDSRAINKIIVKYRFPIPCIGDLLDQLGGATIFSKIDLKSGYHQIRIRPGDEWKTTF
KTNEGLFEWLVMPFGLSNAPSTFMRLMN
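
The protein sequence corresponds to a codon-by-codon translation of the CDS. As a minimal sketch, assuming the standard genetic code applies (reasonable for a nuclear plant gene, but not stated on the page), the first and last codons of the CDS can be translated and compared with the ends of the protein. The function `translate` is a hypothetical helper written for illustration.

```python
# Build the standard genetic code table from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO_ACIDS
    )
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)

# First 30 and last 12 nucleotides of the CDS listed in this record.
print(translate("ATGACGACCAAGAAAGTTTCATCCAATGTA"))
print(translate("TTGATGAACTAG"))
```

The two fragments should match the start (`MTTKKVSSNV`) and end (`LMN`) of the protein sequence; passing the full CDS with line breaks removed would be expected to yield the complete protein.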