CuGenDBv2

Lag0029724 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0029724
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrotrans_gag domain-containing protein
Genome location: chr8:41551086..41557589
RNA-Seq Expression: Lag0029724
Synteny: Lag0029724
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
                  IPR021109 - Aspartic peptidase domain superfamily
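
The genome location string above can be split into its parts programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this); the function name `parse_location` is hypothetical and not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into its parts.

    Assumes 1-based, inclusive coordinates (an assumption, not stated
    on the page), so the span length is end - start + 1.
    """
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location format: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1

chrom, start, end, length = parse_location("chr8:41551086..41557589")
print(chrom, start, end, length)
```

Under that assumption, the gene spans 6504 bp on chr8.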


Homology
GenBank top hits | e value | % identity | Alignment
XP_022929949.1 uncharacterized protein LOC111436411 [Cucurbita moschata] | e value: 2.7e-108 | % identity: 39.4
Query:  MKHVMFQMLQMVGQFHGLPYEDLHLHLKFYLGV-------SDSFVIQGVSRDALRLTLLFYSLRDGAKAWLNSFTLGSTSTWNELGEKFLIKYFPPIRNA
        +K VMFQMLQ +GQFHGLP ED HLHLK +LGV       SDSF  QGV +D +RL+L  Y LRDGAK+WLN+   G+  +WN L E FLIKYFPP RNA
Subjt:  MKHVMFQMLQMVGQFHGLPYEDLHLHLKFYLGV-------SDSFVIQGVSRDALRLTLLFYSLRDGAKAWLNSFTLGSTSTWNELGEKFLIKYFPPIRNA

Query:  KLKSEIVGFRQNEDETFSEAWEMFKGLLGICPHHGLPHCIQMEIFYNGLNITTQGMVDASTRGGLLAKTFYEAHEILERISTNSCQWSNVRSY-SKKVKA
        + K+EIV F+Q EDET SEA E FK +L  CPHHGLPHCIQME FYNGLNI T+ +VDAS  G +L+KT+ EA+EILERI++N+CQW++VRS   +K + 
Subjt:  KLKSEIVGFRQNEDETFSEAWEMFKGLLGICPHHGLPHCIQMEIFYNGLNITTQGMVDASTRGGLLAKTFYEAHEILERISTNSCQWSNVRSY-SKKVKA

Query:  TMEVDDVSTIRVDIASLGNALKSMTL---------VNTI---------------------QQPSVVESVAVVG---------------------------
         +EVD +S+I   +AS+ N L+++ L         V+T                      Q PS   S+  VG                           
Subjt:  TMEVDDVSTIRVDIASLGNALKSMTL---------VNTI---------------------QQPSVVESVAVVG---------------------------

Query:  -----------------------------VTTPTLRGEGK------------------------------------------------------------
                                      ++  +  +GK                                                            
Subjt:  -----------------------------VTTPTLRGEGK------------------------------------------------------------

Query:  -------------------EVIRKPLMH-----------------------KKRWQISNDLFGKINKDCHIKISSYPV----------------------
                           EV  +P M                        +K+ +   + F  I K+ HI I                           
Subjt:  -------------------EVIRKPLMH-----------------------KKRWQISNDLFGKINKDCHIKISSYPV----------------------

Query:  EFGV-----------KNGLLPKVKDPGSFTIPFSIGGKELGRELCDLGTSINLMPLKIYRKLGIGETRSTIVTLQLADKFITYPEGKIEDILVPVDKFIF
        EF V           KN +  K KDPGSFTIP SIGGKELGR LCDLG +INLMPL IY+KLGIGE R T VTLQLAD+ ITYPEGKIEDIL+ VDKFIF
Subjt:  EFGV-----------KNGLLPKVKDPGSFTIPFSIGGKELGRELCDLGTSINLMPLKIYRKLGIGETRSTIVTLQLADKFITYPEGKIEDILVPVDKFIF

Query:  PVDFIILDYETDKNVYIILGPPFLAAGRSLIDFQKGELTMKVDDQKVKFNVFDTMKYPNDIEDCSCIQVL
          DFIILDYE D +V IILG PFL  GR+L+D  KG +T+++  QKV+FN+ D+MKYP  IE+CS +  L
Subjt:  PVDFIILDYETDKNVYIILGPPFLAAGRSLIDFQKGELTMKVDDQKVKFNVFDTMKYPNDIEDCSCIQVL

XP_023521407.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111785222 [Cucurbita pepo subsp. pepo] | e value: 1.6e-92 | % identity: 53.91
Query:  EDETFSEAWEMFKGLLGICPHHGLPHCIQMEIFYNGLNITTQGMVDASTRGGLLAKTFYEAHEILERISTNSCQWSNVRSY-SKKVKATMEVDDVSTIRV
        EDET SEAWE FK +L  CPHHGLPHCIQME FYNGLNI T+ +VDAS  G +L+KT+ EA+EILERI++N+CQW++VRS   +K +  +EVD +S+I  
Subjt:  EDETFSEAWEMFKGLLGICPHHGLPHCIQMEIFYNGLNITTQGMVDASTRGGLLAKTFYEAHEILERISTNSCQWSNVRSY-SKKVKATMEVDDVSTIRV

Query:  DIASLGNALKSMTLVNTIQQPSVVESVAVVGVTTP---TLRGEGKEVIRKPLMHKKRWQISN-DLFGKINKDCHIKISSYPVEFG--VKNGLLPKVKDPG
         +AS+ N L+++ L       + V + AV+  T        GE     + P      + + N D+     K    K+     E    +KN +  K KDPG
Subjt:  DIASLGNALKSMTLVNTIQQPSVVESVAVVGVTTP---TLRGEGKEVIRKPLMHKKRWQISN-DLFGKINKDCHIKISSYPVEFG--VKNGLLPKVKDPG

Query:  SFTIPFSIGGKELGRELCDLGTSINLMPLKIYRKLGIGETRSTIVTLQLADKFITYPEGKIEDILVPVDKFIFPVDFIILDYETDKNVYIILGPPFLAAG
        SFTIP SIGGK+LGR LCDLG+SINLMPL IY+KLGIGE R T VTLQLAD+  TYPEGKIEDIL+ VDKFIFP DFIILDYE D +V IILG PFL  G
Subjt:  SFTIPFSIGGKELGRELCDLGTSINLMPLKIYRKLGIGETRSTIVTLQLADKFITYPEGKIEDILVPVDKFIFPVDFIILDYETDKNVYIILGPPFLAAG

Query:  RSLIDFQKGELTMKVDDQKVKFNVFDTMKYPNDIEDCSCIQVLDE
        R+L+D  KG +T+++ DQKV+FN+ D+MKYP   E+CS +  L E
Subjt:  RSLIDFQKGELTMKVDDQKVKFNVFDTMKYPNDIEDCSCIQVLDE

XP_024042858.1 uncharacterized protein LOC112099671 [Citrus clementina] | e value: 1.1e-90 | % identity: 36.39
Query:  QFHGLPYEDLHLHLKFYLGVSDSFVIQGVSRDALRLTLLFYSLRDGAKAWLNSFTLGSTSTWNELGEKFLIKYFPPIRNAKLKSEIVGFRQNEDETFSEA
        + +GLP ED HLHLK +L +SD+F   G ++DALRL L  YSLRD A+AWLNS    S +TWNEL +KFL+KYFPP +NAKL++EI  F Q EDE+  + 
Subjt:  QFHGLPYEDLHLHLKFYLGVSDSFVIQGVSRDALRLTLLFYSLRDGAKAWLNSFTLGSTSTWNELGEKFLIKYFPPIRNAKLKSEIVGFRQNEDETFSEA

Query:  WEMFKGLLGICPHHGLPHCIQMEIFYNGLNITTQGMVDASTRGGLLAKTFYEAHEILERISTNSCQWSNVR-SYSKKVKATMEVDDVSTIRVDIASLGNA
        WE FK LL  CPHHG+P CIQ+E FYNGLN +T+ MVDAS    LL K++ EA+EILERI+ N+ QW + R + +++      +D ++T+   + SL N 
Subjt:  WEMFKGLLGICPHHGLPHCIQMEIFYNGLNITTQGMVDASTRGGLLAKTFYEAHEILERISTNSCQWSNVR-SYSKKVKATMEVDDVSTIRVDIASLGNA

Query:  LKSMTLV---------------------------------------------------------------------------NTIQQPS-----------
        +K+MT                                                                             N   QPS           
Subjt:  LKSMTLV---------------------------------------------------------------------------NTIQQPS-----------

Query:  ----------------------VVESVAV--------VG--VTTPTLRGEG------KEVIRKPLMH-------------------KKRWQISND-----
                              +V+S  V        +G   TT + R +G      K+  R+   H                   KKR ++++      
Subjt:  ----------------------VVESVAV--------VG--VTTPTLRGEG------KEVIRKPLMH-------------------KKRWQISND-----

Query:  ----LFGKINKDCHIKISSYPVEFG-------------------VKNGLLPKVKDPGSFTIPFSIGGKELGRELCDLGTSINLMPLKIYRKLGIGETRST
            L    ++D  +   +  +  G                   +++ +  K+KDPGSFTIP SIG +  GR LCDLG +INLM L ++++L + E R T
Subjt:  ----LFGKINKDCHIKISSYPVEFG-------------------VKNGLLPKVKDPGSFTIPFSIGGKELGRELCDLGTSINLMPLKIYRKLGIGETRST

Query:  IVTLQLADKFITYPEGKIEDILVPVDKFIFPVDFIILDYETDKNVYIILGPPFLAAGRSLIDFQKGELTMKVDDQKVKFNVFDTMKYPNDIEDCSCIQVL
         VTLQLA++   YPE KIED+LV VDKFIFPVDFI+LD+E DK V IILG PFLA  ++LID QK ELTM+++DQ+V FNV + MK  ++ +DC+ + V+
Subjt:  IVTLQLADKFITYPEGKIEDILVPVDKFIFPVDFIILDYETDKNVYIILGPPFLAAGRSLIDFQKGELTMKVDDQKVKFNVFDTMKYPNDIEDCSCIQVL

Query:  DEIVEDHIEK
        D +V D + K
Subjt:  DEIVEDHIEK

XP_024965798.1 uncharacterized protein LOC112506000 [Cynara cardunculus var. scolymus] | e value: 2.8e-89 | % identity: 37.33
Query:  VSDSFVIQGVSRDALRLTLLFYSLRDGAKAWLNSFTLGSTSTWNELGEKFLIKYFPPIRNAKLKSEIVGFRQNEDETFSEAWEMFKGLLGICPHHGLPHC
        ++D F I GV+++ALRLTL  Y+L+D A+AWLNS    S  TWN+L EKFL KYFPP RN K+++EI+ FRQ EDE  SEAWE FK LL  C HHG+PHC
Subjt:  VSDSFVIQGVSRDALRLTLLFYSLRDGAKAWLNSFTLGSTSTWNELGEKFLIKYFPPIRNAKLKSEIVGFRQNEDETFSEAWEMFKGLLGICPHHGLPHC

Query:  IQMEIFYNGLNITTQGMVDASTRGGLLAKTFYEAHEILERISTNSCQWSNVRS-YSKKVKATMEVDDVSTIRVDIASLGNALKSMTLVN--------TIQ
        +Q+E FY+ L+   + ++DA+  G   AKT+ E ++ILERIS N+ +WSN R+  SK      E+D +S++   I +L N +K+   +N         + 
Subjt:  IQMEIFYNGLNITTQGMVDASTRGGLLAKTFYEAHEILERISTNSCQWSNVRS-YSKKVKATMEVDDVSTIRVDIASLGNALKSMTLVN--------TIQ

Query:  QPSVVESVAVVG------------------------------------------------------------VTTPTLRGE--------GKEVIRKPL--
          S+ ES    G                                                            V   TLR +         KE   +P+  
Subjt:  QPSVVESVAVVG------------------------------------------------------------VTTPTLRGE--------GKEVIRKPL--

Query:  ----------------------------------------------------------------------------MHKKRWQISNDLFGKINKDCHIKI
                                                                                    M  K   +    F  I K  +I I
Subjt:  ----------------------------------------------------------------------------MHKKRWQISNDLFGKINKDCHIKI

Query:  ---------SSYPVEF--GVKN-----------------------GLLPKVKDPGSFTIPFSIGGKELGRELCDLGTSINLMPLKIYRKLGIGETRSTIV
                 SSY V+F   + N                        + PK+KDPGSFTI  SIGGKE+G  LCDLG SINLMPL I+ +LGIG+ R TIV
Subjt:  ---------SSYPVEF--GVKN-----------------------GLLPKVKDPGSFTIPFSIGGKELGRELCDLGTSINLMPLKIYRKLGIGETRSTIV

Query:  TLQLADKFITYPEGKIEDILVPVDKFIFPVDFIILDYETDKNVYIILGPPFLAAGRSLIDFQKGELTMKVDDQKVKFNVFDTMKYPNDIEDCSCIQVLDE
        TLQLAD+ + YP+ KIEDILV VDKFIFP DF++LDYE +KNV IILG PFLA GR+LID QKGELTM+V+DQ+V FNVF T+K+  +IEDCS I  + E
Subjt:  TLQLADKFITYPEGKIEDILVPVDKFIFPVDFIILDYETDKNVYIILGPPFLAAGRSLIDFQKGELTMKVDDQKVKFNVFDTMKYPNDIEDCSCIQVLDE

XP_030443756.1 uncharacterized protein LOC115666104 [Syzygium oleosum] | e value: 3.5e-87 | % identity: 38.87
Query:  MKHVMFQMLQMVGQFHGLPYEDLHLHLKFYLGVSDSFVIQGVSRDALRLTLLFYSLRDGAKAWLNSFTLGSTSTWNELGEKFLIKYFPPIRNAKLKSEIV
        +K  + QMLQ   QF GLP +D ++HL  +L + D+    GV+ DA+RL L  +SLRD AK WL S   GS +TWN++ +KFL KYFPP ++AK++++I 
Subjt:  MKHVMFQMLQMVGQFHGLPYEDLHLHLKFYLGVSDSFVIQGVSRDALRLTLLFYSLRDGAKAWLNSFTLGSTSTWNELGEKFLIKYFPPIRNAKLKSEIV

Query:  GFRQNEDETFSEAWEMFKGLLGICPHHGLPHCIQMEIFYNGLNITTQGMVDASTRGGLLAKTFYEAHEILERISTNSCQWSNVRSYSKKVKATMEVDDVS
         F Q + E+  EAWE FK LL  CPHHGLP  +Q+  FYNG+    +  +DA+  G L  K+  EA ++LE ++ NS QW   R  ++K     EV   +
Subjt:  GFRQNEDETFSEAWEMFKGLLGICPHHGLPHCIQMEIFYNGLNITTQGMVDASTRGGLLAKTFYEAHEILERISTNSCQWSNVRSYSKKVKATMEVDDVS

Query:  TIRVDIAS---LGNALK-------SMTLVNTIQQ------PS-----------VVESVAVVGVTT---------------PTLRGEGKEVIR--------
        TI  +        N          + +  N  QQ      PS           + E V  +  TT                ++R   K+ +         
Subjt:  TIRVDIAS---LGNALK-------SMTLVNTIQQ------PS-----------VVESVAVVGVTT---------------PTLRGEGKEVIR--------

Query:  -----------KPLMHKKRWQISNDLFGKINKDCHIKISSYPVEFGVKNGLLPKVKDPGSFTIPFSIGGKELGRELCDLGTSINLMPLKIYRKLGIGETR
                   K ++  KR ++ +    K+N++C   +         +N L PK+KDPGSFTIP++IG     + LCDLG SINLMP  ++RKLG+GE +
Subjt:  -----------KPLMHKKRWQISNDLFGKINKDCHIKISSYPVEFGVKNGLLPKVKDPGSFTIPFSIGGKELGRELCDLGTSINLMPLKIYRKLGIGETR

Query:  STIVTLQLADKFITYPEGKIEDILVPVDKFIFPVDFIILDYETDKNVYIILGPPFLAAGRSLIDFQKGELTMKVDDQKVKFNVFDTMKYPNDIEDCSCIQ
        +T V+LQLAD+ I YP+G +ED+LV VDKFIFP DFI+L+ E D  V IILG PFLA GR+LID Q+G+L ++V D +V F+VF  MKYP +  +C  + 
Subjt:  STIVTLQLADKFITYPEGKIEDILVPVDKFIFPVDFIILDYETDKNVYIILGPPFLAAGRSLIDFQKGELTMKVDDQKVKFNVFDTMKYPNDIEDCSCIQ

Query:  VLDEIVEDHIEK
         +D +VE    K
Subjt:  VLDEIVEDHIEK

TrEMBL top hits | e value | % identity | Alignment
A0A5B6ULS1 Retrotrans_gag domain-containing protein | e value: 2.0e-77 | % identity: 43.1
Query:  MKHVMFQMLQMVGQFHGLPYEDLHLHLKFYLGVSDSFVIQGVSRDALRLTLLFYSLRDGAKAWLNSFTLGSTSTWNELGEKFLIKYFPPIRNAKLKSEIV
        +K VM QM Q +G F GLP +D   HL+F+L V                    Y  +D AK  LN+       +WN+  ++FL++Y  P  NAKL++EI 
Subjt:  MKHVMFQMLQMVGQFHGLPYEDLHLHLKFYLGVSDSFVIQGVSRDALRLTLLFYSLRDGAKAWLNSFTLGSTSTWNELGEKFLIKYFPPIRNAKLKSEIV

Query:  GFRQNEDETFSEAWEMFKGLLGICPHHGLPHCIQMEIFYNGLNITTQGMVDASTRGGLLAKTFYEAHEILERISTNSCQWSNVRS-YSKKVKATMEVDDV
         F+Q EDET  EAWE FK LL  CP H   H  QM++FYNGLN  T  +VD S  G  L +++ EA++ILERI+ N  Q+  +R+   ++V  + EV+ +
Subjt:  GFRQNEDETFSEAWEMFKGLLGICPHHGLPHCIQMEIFYNGLNITTQGMVDASTRGGLLAKTFYEAHEILERISTNSCQWSNVRS-YSKKVKATMEVDDV

Query:  STIRVDIASLGNALKSMTLV----NTIQQPSVVESVAVVGVTTPTLRGEGKEVIRKPLMHKKRWQISNDLFGKINKDCHIKISSYPVEFGVKNGLLPKVK
        ++I   ++SL N +K+M LV    + ++Q  +        V  P      K++    L+ K+R  +       + + C   +         KN   P +K
Subjt:  STIRVDIASLGNALKSMTLV----NTIQQPSVVESVAVVGVTTPTLRGEGKEVIRKPLMHKKRWQISNDLFGKINKDCHIKISSYPVEFGVKNGLLPKVK

Query:  DPGSFTIPFSIGGKELGRELCDLGTSINLMPLKIYRKLGIGETRSTIVTLQLADKFITYPEGKIEDILVPVDKFIFPVDFIILDYETDKNVYIILGPPFL
        +PGSFTIP SI  + +G  LCDLG+SINLMP+ +++KLGIGE R TIVTLQLAD+     EGKIED+LV VDKF FPVDFI+LD E DK+V IILG PFL
Subjt:  DPGSFTIPFSIGGKELGRELCDLGTSINLMPLKIYRKLGIGETRSTIVTLQLADKFITYPEGKIEDILVPVDKFIFPVDFIILDYETDKNVYIILGPPFL

Query:  AAGRSLIDFQKGELTMKVDD
        A G+S+ID QK ELTM+V D
Subjt:  AAGRSLIDFQKGELTMKVDD

A0A6J1CPJ3 uncharacterized protein LOC111012947 | e value: 7.8e-77 | % identity: 31.39
Query:  KHVMFQMLQMVGQFHGLPYEDLHLHLKFYLGVSDSFVIQGVSRDALRLTLLFYSLRDGAKAWLNSFTLGSTSTWNELGEKFLIKYFPPIRNAKLKSEIVG
        K +M QML  +GQF GL +ED   HLK ++ V+++F + G+S DALRLTL  +SL   A AWLN+F   +  T +++ +KFL+KYFPP RNA ++ EI+ 
Subjt:  KHVMFQMLQMVGQFHGLPYEDLHLHLKFYLGVSDSFVIQGVSRDALRLTLLFYSLRDGAKAWLNSFTLGSTSTWNELGEKFLIKYFPPIRNAKLKSEIVG

Query:  FRQNEDETFSEAWEMFKGLLGICPHHGLPHCIQMEIFYNGLNITTQGMVDASTRGGLLAKTFYEAHEILERISTNSCQWSNVRSYSKKVKA----TMEVD
        FRQ E+E  + AWE FK L+  CP+ G+P C+Q+E F+   +I T  M++ +  G   +K+F E  EIL+++S ++ QW + +  ++  +A     + +D
Subjt:  FRQNEDETFSEAWEMFKGLLGICPHHGLPHCIQMEIFYNGLNITTQGMVDASTRGGLLAKTFYEAHEILERISTNSCQWSNVRSYSKKVKA----TMEVD

Query:  DVSTIRVDIASLGNALKSM-------TLVNTIQQPSVVESVA-------------VVGVTTP--------TLRGEGK-----------------------
        ++++++  I ++   LK+M        L      PS V  +A                +  P        +  G+G                        
Subjt:  DVSTIRVDIASLGNALKSM-------TLVNTIQQPSVVESVA-------------VVGVTTP--------TLRGEGK-----------------------

Query:  --------------------------EVIRKPLMHK--------------------------------------KRWQISNDL-----------------
                                  E++ K  + K                                      +  Q++N++                 
Subjt:  --------------------------EVIRKPLMHK--------------------------------------KRWQISNDL-----------------

Query:  --------------------------------------------------------FGKINKDCHI--------------------------KISSYPV-
                                                                F  I K  HI                          K+  Y   
Subjt:  --------------------------------------------------------FGKINKDCHI--------------------------KISSYPV-

Query:  ------EFGVKNGLLPKVKDPGSFTIPFSIGGKELGRELCDLGTSINLMPLKIYRKLGIGETRSTIVTLQLADKFITYPEGKIEDILVPVDKFIFPVDFI
                  K+   PK+KDPGSFTI   IGGK++GR LCDLG  INLMPL I++KL IG+   T VTL LAD+ IT PEGKIED+LV VDKFIFP DFI
Subjt:  ------EFGVKNGLLPKVKDPGSFTIPFSIGGKELGRELCDLGTSINLMPLKIYRKLGIGETRSTIVTLQLADKFITYPEGKIEDILVPVDKFIFPVDFI

Query:  ILDYETDKNVYIILGPPFLAAGRSLIDFQKGELTMKVDDQKVKFNVFDTMKYPNDIEDCSCIQV--------LDEIVEDHIEKEL
        ILD E DK+V IILG PFLA G +LID +KGELTM+VDDQKV FN+ D MKYP+D E+C  I +        LD+++   IE EL
Subjt:  ILDYETDKNVYIILGPPFLAAGRSLIDFQKGELTMKVDDQKVKFNVFDTMKYPNDIEDCSCIQV--------LDEIVEDHIEKEL

A0A6J1DY39 uncharacterized protein LOC111025653 | e value: 3.0e-76 | % identity: 30.33
Query:  KHVMFQMLQMVGQFHGLPYEDLHLHLKFYLGVSDSFVIQGVSRDALRLTLLFYSLRDGAKAWLNSFTLGSTSTWNELGEKFLIKYFPPIRNAKLKSEIVG
        K +M QML  +GQF GL +ED   HLK ++ V+++F + G+S DALRLTL  +S+   A AWLN+F   + +TW+++ +KFL+KYFPP RNA ++ EI+ 
Subjt:  KHVMFQMLQMVGQFHGLPYEDLHLHLKFYLGVSDSFVIQGVSRDALRLTLLFYSLRDGAKAWLNSFTLGSTSTWNELGEKFLIKYFPPIRNAKLKSEIVG

Query:  FRQNEDETFSEAWEMFKGLLGICPHHGLPHCIQMEIFYNGLNITTQGMVDASTRGGLLAKTFYEAHEILERISTNSCQWSNVRSYSKKVKA----TMEVD
        FRQ E+E  + AWE FK L+  CP+ G+P C+Q+E F+ G +I T+ M++ +  G   +K+F E  EIL+++S ++ QW + +S ++  +A     + +D
Subjt:  FRQNEDETFSEAWEMFKGLLGICPHHGLPHCIQMEIFYNGLNITTQGMVDASTRGGLLAKTFYEAHEILERISTNSCQWSNVRSYSKKVKA----TMEVD

Query:  DVSTIRVDIASLGNALKSM---------------------------------------------------------------------------------
        ++++++  I ++   LK+M                                                                                 
Subjt:  DVSTIRVDIASLGNALKSM---------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  TLVNTIQQ------PSVVESVAVVG-------VTTPTLRGEG----KEVIRKPLMHKKRW-----------------QISND------------------
         LVN ++       PS  E    +G        T   L+ EG     E    P   K                    Q+SN                   
Subjt:  TLVNTIQQ------PSVVESVAVVG-------VTTPTLRGEG----KEVIRKPLMHKKRW-----------------QISND------------------

Query:  --LFGKINKDCHI--------------------------KISSYPV-------EFGVKNGLLPKVKDPGSFTIPFSIGGKELGRELCDLGTSINLMPLKI
           F  I K  HI                          K+  Y             K+ + PK+KDPGSFTIP  IGGK++GR LCDLG SINLMPL I
Subjt:  --LFGKINKDCHI--------------------------KISSYPV-------EFGVKNGLLPKVKDPGSFTIPFSIGGKELGRELCDLGTSINLMPLKI

Query:  YRKLGIGETRSTIVTLQLADKFITYPEGKIEDILVPVDKFIFPVDFIILDYETDKNVYIILGPPFLAAGRSLIDFQKGELTMKVDDQKVKFNVFDTMKYP
        ++K  IG+   T VTLQLAD+ IT PEGKIED+LV VDKFIFP DFIILD E DK+V IILG PFLA G +LID +KGELTM+VDDQKV FN+ D MKY 
Subjt:  YRKLGIGETRSTIVTLQLADKFITYPEGKIEDILVPVDKFIFPVDFIILDYETDKNVYIILGPPFLAAGRSLIDFQKGELTMKVDDQKVKFNVFDTMKYP

Query:  NDIEDCSCIQV--------LDEIVEDHIEKEL
        +D+E+C+ I +        LD+++   IE EL
Subjt:  NDIEDCSCIQV--------LDEIVEDHIEKEL

A0A6J1EQ90 uncharacterized protein LOC111436411 | e value: 1.3e-108 | % identity: 39.4
Query:  MKHVMFQMLQMVGQFHGLPYEDLHLHLKFYLGV-------SDSFVIQGVSRDALRLTLLFYSLRDGAKAWLNSFTLGSTSTWNELGEKFLIKYFPPIRNA
        +K VMFQMLQ +GQFHGLP ED HLHLK +LGV       SDSF  QGV +D +RL+L  Y LRDGAK+WLN+   G+  +WN L E FLIKYFPP RNA
Subjt:  MKHVMFQMLQMVGQFHGLPYEDLHLHLKFYLGV-------SDSFVIQGVSRDALRLTLLFYSLRDGAKAWLNSFTLGSTSTWNELGEKFLIKYFPPIRNA

Query:  KLKSEIVGFRQNEDETFSEAWEMFKGLLGICPHHGLPHCIQMEIFYNGLNITTQGMVDASTRGGLLAKTFYEAHEILERISTNSCQWSNVRSY-SKKVKA
        + K+EIV F+Q EDET SEA E FK +L  CPHHGLPHCIQME FYNGLNI T+ +VDAS  G +L+KT+ EA+EILERI++N+CQW++VRS   +K + 
Subjt:  KLKSEIVGFRQNEDETFSEAWEMFKGLLGICPHHGLPHCIQMEIFYNGLNITTQGMVDASTRGGLLAKTFYEAHEILERISTNSCQWSNVRSY-SKKVKA

Query:  TMEVDDVSTIRVDIASLGNALKSMTL---------VNTI---------------------QQPSVVESVAVVG---------------------------
         +EVD +S+I   +AS+ N L+++ L         V+T                      Q PS   S+  VG                           
Subjt:  TMEVDDVSTIRVDIASLGNALKSMTL---------VNTI---------------------QQPSVVESVAVVG---------------------------

Query:  -----------------------------VTTPTLRGEGK------------------------------------------------------------
                                      ++  +  +GK                                                            
Subjt:  -----------------------------VTTPTLRGEGK------------------------------------------------------------

Query:  -------------------EVIRKPLMH-----------------------KKRWQISNDLFGKINKDCHIKISSYPV----------------------
                           EV  +P M                        +K+ +   + F  I K+ HI I                           
Subjt:  -------------------EVIRKPLMH-----------------------KKRWQISNDLFGKINKDCHIKISSYPV----------------------

Query:  EFGV-----------KNGLLPKVKDPGSFTIPFSIGGKELGRELCDLGTSINLMPLKIYRKLGIGETRSTIVTLQLADKFITYPEGKIEDILVPVDKFIF
        EF V           KN +  K KDPGSFTIP SIGGKELGR LCDLG +INLMPL IY+KLGIGE R T VTLQLAD+ ITYPEGKIEDIL+ VDKFIF
Subjt:  EFGV-----------KNGLLPKVKDPGSFTIPFSIGGKELGRELCDLGTSINLMPLKIYRKLGIGETRSTIVTLQLADKFITYPEGKIEDILVPVDKFIF

Query:  PVDFIILDYETDKNVYIILGPPFLAAGRSLIDFQKGELTMKVDDQKVKFNVFDTMKYPNDIEDCSCIQVL
          DFIILDYE D +V IILG PFL  GR+L+D  KG +T+++  QKV+FN+ D+MKYP  IE+CS +  L
Subjt:  PVDFIILDYETDKNVYIILGPPFLAAGRSLIDFQKGELTMKVDDQKVKFNVFDTMKYPNDIEDCSCIQVL

A5AZ88 Integrase catalytic domain-containing protein | e value: 3.7e-79 | % identity: 36.42
Query:  VSDSFVIQGVSRDALRLTLLFYSLRDGAKAWLNSFTLGSTSTWNELGEKFLIKYFPPIRNAKLKSEIVGFRQNEDETFSEAWEMFKGLLGICPHHGLPHC
        + D+F   GV+ DA+RL L  +SL + AKAWL S   G+ +TW+ L   FL KYFP  ++ K++++I  F Q + E+  EAWE FK LL  CPHHGLP  
Subjt:  VSDSFVIQGVSRDALRLTLLFYSLRDGAKAWLNSFTLGSTSTWNELGEKFLIKYFPPIRNAKLKSEIVGFRQNEDETFSEAWEMFKGLLGICPHHGLPHC

Query:  IQMEIFYNGLNITTQGMVDASTRGGLLAKTFYEAHEILERISTNSCQWSNVRSYSKKVKATMEVDDVSTIRVDIASLGNALKSMTLVNTIQQPSVVESVA
        +Q ++FYN L+  TQ MVDA++ G  + KT  E +++++ +++N+   S  R+  K+     ++D  + +   +A L N  K + +V  +    V E+ A
Subjt:  IQMEIFYNGLNITTQGMVDASTRGGLLAKTFYEAHEILERISTNSCQWSNVRSYSKKVKATMEVDDVSTIRVDIASLGNALKSMTLVNTIQQPSVVESVA

Query:  ------VVGVTTPTLRGEGKEV----------------------------------IRKP--------------LMHKKRWQISNDLFGKINKDCHIKIS
                 V  P      K+V                                  ++KP               +  +  Q +ND       +   + +
Subjt:  ------VVGVTTPTLRGEGKEV----------------------------------IRKP--------------LMHKKRWQISNDLFGKINKDCHIKIS

Query:  SYPVEFG-----------------VKNGLLPKVKDPGSFTIPFSIGGKELGRELCDLGTSINLMPLKIYRKLGIGETRSTIVTLQLADKFITYPEGKIED
        S P+                    ++  L PK+KDPGSFTIP +IG  +  + LCDLG S+NLMPL I+RKLG+GE + T V LQLAD+ I +P G IED
Subjt:  SYPVEFG-----------------VKNGLLPKVKDPGSFTIPFSIGGKELGRELCDLGTSINLMPLKIYRKLGIGETRSTIVTLQLADKFITYPEGKIED

Query:  ILVPVDKFIFPVDFIILDYETDKNVYIILGPPFLAAGRSLIDFQKGELTMKVDDQKVKFNVFDTMKYPNDIEDCSCIQVLDEIVED
        +LV VDKF+FP+DFI+LD E D++V +ILG PFLA  R+LIDF +G+L ++V D++V FNVF+ MK+P++++ C  I VLD +V +
Subjt:  ILVPVDKFIFPVDFIILDYETDKNVYIILGPPFLAAGRSLIDFQKGELTMKVDDQKVKFNVFDTMKYPNDIEDCSCIQVLDEIVED

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGAAACATGTTATGTTCCAAATGTTGCAAATGGTTGGTCAATTCCATGGCTTACCATATGAGGATCTTCACCTTCATCTTAAGTTTTATCTAGGAGTTAGTGATTCGTT
TGTTATCCAGGGAGTGTCTAGAGATGCTCTTAGATTAACCCTATTGTTTTATTCCCTTAGAGATGGTGCAAAGGCGTGGTTGAATTCTTTTACCCTAGGATCGACAAGTA
CTTGGAATGAGCTAGGAGAGAAATTTCTTATTAAGTATTTTCCACCAATTAGGAATGCCAAGTTGAAGAGTGAGATAGTGGGATTTAGGCAAAATGAGGATGAAACTTTT
AGTGAGGCTTGGGAAATGTTTAAGGGACTTTTGGGAATTTGTCCCCACCACGGTTTACCACATTGTATTCAAATGGAGATATTTTACAATGGGTTAAACATAACAACCCA
AGGAATGGTTGATGCTTCTACGAGAGGTGGCCTTTTGGCAAAAACCTTCTATGAAGCTCATGAGATTTTAGAGAGAATATCAACCAACAGTTGTCAATGGTCAAACGTGA
GAAGTTATAGTAAGAAAGTTAAAGCAACAATGGAAGTTGATGATGTGTCAACCATTAGGGTTGATATTGCATCATTGGGTAATGCTCTTAAAAGTATGACACTTGTTAAC
ACTATTCAGCAACCGTCAGTGGTGGAATCTGTTGCAGTGGTTGGTGTAACCACCCCAACTTTGCGTGGGGAGGGCAAGGAAGTAATTCGCAAGCCCCTCATGCACAAAAA
AAGGTGGCAAATCAGCAATGATTTGTTTGGCAAAATCAATAAGGATTGCCACATCAAAATAAGCAGCTACCCAGTAGAATTCGGTGTAAAGAACGGGCTACTTCCCAAGG
TTAAGGATCCAGGATCATTCACTATTCCTTTCTCTATAGGTGGAAAAGAGTTGGGGAGAGAACTTTGTGATTTAGGCACGAGCATAAACCTTATGCCTCTTAAGATTTAT
CGAAAGCTAGGTATAGGAGAAACTAGGTCTACCATAGTCACACTCCAATTGGCGGATAAGTTTATCACATATCCTGAAGGTAAAATTGAGGATATTTTGGTCCCGGTAGA
TAAATTTATTTTTCCTGTCGATTTTATTATCCTAGATTATGAGACAGATAAGAATGTCTATATTATTCTTGGTCCTCCATTTTTGGCAGCTGGTAGATCATTAATAGATT
TCCAAAAGGGAGAGCTTACAATGAAGGTGGATGACCAAAAGGTGAAATTCAATGTGTTTGATACAATGAAATATCCTAATGATATTGAGGATTGCTCGTGCATTCAGGTG
TTGGATGAAATTGTTGAGGACCACATTGAGAAGGAATTGATGAAAAAGATCAGACTTTTAGGTCTTAAGGACACCGATATCACCCTCCAGCTTGCAGATAGATCAGTTAC
CCACCTGATGGGTAGAGTGGAGGATGTATTGGTGAAAGTCAATAAGTTCATCTTTCCAGTTAACTTTGTGTTTGATGATGAGGAGTCAAATGATTGCTCTGTAGAGTCTG
ATTGGAATGATGAGTCTCATGAGAATATAGAGATTATACCTGATGAATGGATCCACTTAGCAAGCCCCGAAAAATCCGAAGATGCATTTTATCGGCCCCAAAGGATGAGG
ACGATTTTCTCCTGGACTCAAGGAAGTCTCAGTTTCCCTCTTAATGCTTTCTTGTTGTTAAACATTGGGGACAATGTTTAG
mRNA sequence
ATGAAACATGTTATGTTCCAAATGTTGCAAATGGTTGGTCAATTCCATGGCTTACCATATGAGGATCTTCACCTTCATCTTAAGTTTTATCTAGGAGTTAGTGATTCGTT
TGTTATCCAGGGAGTGTCTAGAGATGCTCTTAGATTAACCCTATTGTTTTATTCCCTTAGAGATGGTGCAAAGGCGTGGTTGAATTCTTTTACCCTAGGATCGACAAGTA
CTTGGAATGAGCTAGGAGAGAAATTTCTTATTAAGTATTTTCCACCAATTAGGAATGCCAAGTTGAAGAGTGAGATAGTGGGATTTAGGCAAAATGAGGATGAAACTTTT
AGTGAGGCTTGGGAAATGTTTAAGGGACTTTTGGGAATTTGTCCCCACCACGGTTTACCACATTGTATTCAAATGGAGATATTTTACAATGGGTTAAACATAACAACCCA
AGGAATGGTTGATGCTTCTACGAGAGGTGGCCTTTTGGCAAAAACCTTCTATGAAGCTCATGAGATTTTAGAGAGAATATCAACCAACAGTTGTCAATGGTCAAACGTGA
GAAGTTATAGTAAGAAAGTTAAAGCAACAATGGAAGTTGATGATGTGTCAACCATTAGGGTTGATATTGCATCATTGGGTAATGCTCTTAAAAGTATGACACTTGTTAAC
ACTATTCAGCAACCGTCAGTGGTGGAATCTGTTGCAGTGGTTGGTGTAACCACCCCAACTTTGCGTGGGGAGGGCAAGGAAGTAATTCGCAAGCCCCTCATGCACAAAAA
AAGGTGGCAAATCAGCAATGATTTGTTTGGCAAAATCAATAAGGATTGCCACATCAAAATAAGCAGCTACCCAGTAGAATTCGGTGTAAAGAACGGGCTACTTCCCAAGG
TTAAGGATCCAGGATCATTCACTATTCCTTTCTCTATAGGTGGAAAAGAGTTGGGGAGAGAACTTTGTGATTTAGGCACGAGCATAAACCTTATGCCTCTTAAGATTTAT
CGAAAGCTAGGTATAGGAGAAACTAGGTCTACCATAGTCACACTCCAATTGGCGGATAAGTTTATCACATATCCTGAAGGTAAAATTGAGGATATTTTGGTCCCGGTAGA
TAAATTTATTTTTCCTGTCGATTTTATTATCCTAGATTATGAGACAGATAAGAATGTCTATATTATTCTTGGTCCTCCATTTTTGGCAGCTGGTAGATCATTAATAGATT
TCCAAAAGGGAGAGCTTACAATGAAGGTGGATGACCAAAAGGTGAAATTCAATGTGTTTGATACAATGAAATATCCTAATGATATTGAGGATTGCTCGTGCATTCAGGTG
TTGGATGAAATTGTTGAGGACCACATTGAGAAGGAATTGATGAAAAAGATCAGACTTTTAGGTCTTAAGGACACCGATATCACCCTCCAGCTTGCAGATAGATCAGTTAC
CCACCTGATGGGTAGAGTGGAGGATGTATTGGTGAAAGTCAATAAGTTCATCTTTCCAGTTAACTTTGTGTTTGATGATGAGGAGTCAAATGATTGCTCTGTAGAGTCTG
ATTGGAATGATGAGTCTCATGAGAATATAGAGATTATACCTGATGAATGGATCCACTTAGCAAGCCCCGAAAAATCCGAAGATGCATTTTATCGGCCCCAAAGGATGAGG
ACGATTTTCTCCTGGACTCAAGGAAGTCTCAGTTTCCCTCTTAATGCTTTCTTGTTGTTAAACATTGGGGACAATGTTTAG
Protein sequence
MKHVMFQMLQMVGQFHGLPYEDLHLHLKFYLGVSDSFVIQGVSRDALRLTLLFYSLRDGAKAWLNSFTLGSTSTWNELGEKFLIKYFPPIRNAKLKSEIVGFRQNEDETF
SEAWEMFKGLLGICPHHGLPHCIQMEIFYNGLNITTQGMVDASTRGGLLAKTFYEAHEILERISTNSCQWSNVRSYSKKVKATMEVDDVSTIRVDIASLGNALKSMTLVN
TIQQPSVVESVAVVGVTTPTLRGEGKEVIRKPLMHKKRWQISNDLFGKINKDCHIKISSYPVEFGVKNGLLPKVKDPGSFTIPFSIGGKELGRELCDLGTSINLMPLKIY
RKLGIGETRSTIVTLQLADKFITYPEGKIEDILVPVDKFIFPVDFIILDYETDKNVYIILGPPFLAAGRSLIDFQKGELTMKVDDQKVKFNVFDTMKYPNDIEDCSCIQV
LDEIVEDHIEKELMKKIRLLGLKDTDITLQLADRSVTHLMGRVEDVLVKVNKFIFPVNFVFDDEESNDCSVESDWNDESHENIEIIPDEWIHLASPEKSEDAFYRPQRMR
TIFSWTQGSLSFPLNAFLLLNIGDNV
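
The relationship between the CDS and protein sequences listed above can be checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) and a CDS that starts in frame at its first base; the `translate` helper is hypothetical and not a CuGenDB tool.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in TCAG order: TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the multi-line
    sequence block can be pasted in directly. Codons containing
    characters outside A/C/G/T are translated as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)

# First eight codons of the CDS listed above
print(translate("ATGAAACATGTTATGTTCCAAATG"))
```

Pasting the full CDS block into `translate` and comparing the result with the protein sequence is a quick consistency check on the record.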