; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc10g20210 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc10g20210
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionTy3-gypsy retrotransposon protein
Genome locationchr10:14915341..14918624
RNA-Seq ExpressionMoc10g20210
SyntenyMoc10g20210
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR001969 - Aspartic peptidase, active site
IPR021109 - Aspartic peptidase domain superfamily
IPR041588 - Integrase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039528.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]4.5e-4929.94Show/hide
Query:  REKGLCYRCNEKFAPGHRCKKKELQILVVQKVEAVGDEFQDAVDTIGPETKSDG-----VGLTMLSLNSLVGLTSTKTLKVCGQLDKRGIVALVDSGATH
        +EKGLC+RC+EKF+  HRCK+ EL I+ VQ+ E + +E     D +G E ++ G     + +  LSLNSLVGL+S KT+K+ G++  R IV L+D GATH
Subjt:  REKGLCYRCNEKFAPGHRCKKKELQILVVQKVEAVGDEFQDAVDTIGPETKSDG-----VGLTMLSLNSLVGLTSTKTLKVCGQLDKRGIVALVDSGATH

Query:  NFIALDLVCACQLPISDTKGYEIVLGHYGGKRGS------------------------------------------------------------------
        NFI  ++V   ++ I     Y IVLG  G  R +                                                                  
Subjt:  NFIALDLVCACQLPISDTKGYEIVLGHYGGKRGS------------------------------------------------------------------

Query:  -------------------------TEVRKVLLEFEKVFRPLEGLPPSRQQDHAIELQPSAGPVNHV---------------------------------
                                  E++KVL  F  VF     LPP R  DHAIEL+  A  VN V                                 
Subjt:  -------------------------TEVRKVLLEFEKVFRPLEGLPPSRQQDHAIELQPSAGPVNHV---------------------------------

Query:  --------------------------------------------------IPNKYKKWITKLMGYDFEIEYKLGLENK-DDALSHQSEPVQLNALSLLGG
                                                          I  +Y++WI KL+ YDF IEY  GLENK  DALS      +L  +S++GG
Subjt:  --------------------------------------------------IPNKYKKWITKLMGYDFEIEYKLGLENK-DDALSHQSEPVQLNALSLLGG

Query:  LNISIFAQE------IRDDAELSKVITTLQANQPKMAGYSLRNDTLFSHGR--REVYWLGMRQAVQRFCAGCTICQQAKYLSLAPAGLLQLLPILDRIWE
        LN S+F  +      +RD   +  ++   + +   + G+   +  L ++ R  REVYW GM+  ++ + A C++CQQAKYLSL P+GL Q LPILDR+WE
Subjt:  LNISIFAQE------IRDDAELSKVITTLQANQPKMAGYSLRNDTLFSHGR--REVYWLGMRQAVQRFCAGCTICQQAKYLSLAPAGLLQLLPILDRIWE

Query:  DISMDFVEGLPKSDGVEMLAI
        DIS+DF++GLPKS G +++ +
Subjt:  DISMDFVEGLPKSDGVEMLAI

KAF7828587.1 Retrotransposable element Tf2 [Senna tora]7.4e-5229.29Show/hide
Query:  RKLDVLIFEAENRDGWLHRVARYFKINRLADNEKLEASILCLEG-----------------------TLSDRF------------AAVRQETTVRDYRRR
        RKL++ +F+ +   GWL RV RYF +NR+ D EKLEA  +CLEG                        L  RF             A++Q  TV +YR +
Subjt:  RKLDVLIFEAENRDGWLHRVARYFKINRLADNEKLEASILCLEG-----------------------TLSDRF------------AAVRQETTVRDYRRR

Query:  FEEAGGVVDGFPAFLLEGLFINGLKEDLRNEERVMKPSGLKEMMVMVQLLKDK-----ICKESDPLTSGG-----SNST---------VKSSGP------
        FE     +   P  +L G F NGLKE++R E R++K   L E+M + Q +++K       KE     S       S ST          KSSGP      
Subjt:  FEEAGGVVDGFPAFLLEGLFINGLKEDLRNEERVMKPSGLKEMMVMVQLLKDK-----ICKESDPLTSGG-----SNST---------VKSSGP------

Query:  ----------SQRIGDQKSR------RTTAEQFRRLSDIKLKSKREKGLCYRCNEKFAPGHRCKKKELQILVVQK--VEAVGDEFQDAVDTIGPETKSDG
                  S+  G +KSR       T   +++RL+D ++  KR  G C+ C+EK++  H+CK K L +L++     E   +E  +  +  G   +  G
Subjt:  ----------SQRIGDQKSR------RTTAEQFRRLSDIKLKSKREKGLCYRCNEKFAPGHRCKKKELQILVVQK--VEAVGDEFQDAVDTIGPETKSDG

Query:  VGLTMLSLNSLVGLTSTKTLKVCGQLDKRGIVALVDSGATHNFIALDLVCACQLPISDTKGYEIVLGHYGGKRGSTEVRKVLLEFEKVFRPLEGLPPSRQ
          L  LS+NS+VG+T  +T+K+  ++    ++ ++DSGA+HNFI+  LV   QLP+  T  YE+ +G     +G      VL +F KV +PL+GLPPSR 
Subjt:  VGLTMLSLNSLVGLTSTKTLKVCGQLDKRGIVALVDSGATHNFIALDLVCACQLPISDTKGYEIVLGHYGGKRGSTEVRKVLLEFEKVFRPLEGLPPSRQ

Query:  QDHAIELQPSAGPVN----------------------------------------------------------------------------------HVI
        +DHAI ++  A P N                                                                                   +I
Subjt:  QDHAIELQPSAGPVN----------------------------------------------------------------------------------HVI

Query:  PNKYKKWITKLMGYDFEIEYKLGLENK-DDALSHQSEPVQLNALSLLGGLNISIFAQEIRDDAELSKVITTLQANQPKMAGYSLRNDTLFSHGR
            +KW +KLMGY FEI+YK G+ENK  DALS + E ++L A S+        + +E+++D +L+ ++  +   Q    GY L+N  L   GR
Subjt:  PNKYKKWITKLMGYDFEIEYKLGLENK-DDALSHQSEPVQLNALSLLGGLNISIFAQEIRDDAELSKVITTLQANQPKMAGYSLRNDTLFSHGR

XP_022848903.1 uncharacterized protein LOC111371244 [Olea europaea var. sylvestris]1.4e-6326.04Show/hide
Query:  DLRLRKLDVLIFEAENRDGWLHRVARYFKINRLADNEKLEASILCL-----------------------------------EGTLSDRFAAVRQETTVRD
        D R R+L++ +F  EN DGWL +  RYF IN  ++ EK+EA+ +C                                    EGT  ++F A+RQE TVRD
Subjt:  DLRLRKLDVLIFEAENRDGWLHRVARYFKINRLADNEKLEASILCL-----------------------------------EGTLSDRFAAVRQETTVRD

Query:  YRRRFEEAGGVVDGFPAFLLEGLFINGLKEDLRNEERVMKPSGLKEMMVMVQLLKDK--ICKESDPLTSGGSNSTVKSSGPSQRIGDQKSRRTT------
        Y R FE     +   P  +LEG FINGLK  +R E R++KP GL  +M + Q ++D+  + +++   + G       S  PS   G  K    T      
Subjt:  YRRRFEEAGGVVDGFPAFLLEGLFINGLKEDLRNEERVMKPSGLKEMMVMVQLLKDK--ICKESDPLTSGGSNSTVKSSGPSQRIGDQKSRRTT------

Query:  ------------------------AEQFRRLSDIKLKSKREKGLCYRCNEKFAPGHRCKKKELQILVVQKVEAVGDEFQDAVDTIGPETKSDGVG-LTML
                                A  F++LSD++L+ KRE+GLCYRC+EKF PGH+C+ KEL +LVVQ     G+E  + +  +  +     VG +  L
Subjt:  ------------------------AEQFRRLSDIKLKSKREKGLCYRCNEKFAPGHRCKKKELQILVVQKVEAVGDEFQDAVDTIGPETKSDGVG-LTML

Query:  SLNSLVGLTSTKTLKVCGQLDKRGIVALVDSGATHNFIALDLVCACQLPISDTKGYEIVLGHYGGKRG------------STEV----------------
        S+NS+VGL + K++K+ G +    ++ L+D GATHNFI++DL    Q+P   T GY I++G     RG            S EV                
Subjt:  SLNSLVGLTSTKTLKVCGQLDKRGIVALVDSGATHNFIALDLVCACQLPISDTKGYEIVLGHYGGKRG------------STEV----------------

Query:  --------------------------------------------------------------------------------RKVLLEFEKVFRPLEGLPPS
                                                                                         ++L ++  VF     LPPS
Subjt:  --------------------------------------------------------------------------------RKVLLEFEKVFRPLEGLPPS

Query:  RQQDHAIELQPSAGPVN-----------------------------------------------------------------------------------
        R +DH+I LQ  + PVN                                                                                   
Subjt:  RQQDHAIELQPSAGPVN-----------------------------------------------------------------------------------

Query:  ----------------------------------------------HVIPNKYKKWITKLMGYDFEIEYKLGLENK-DDALSHQSEPVQLNALSLLGGLN
                                                       V+  +Y+KW+TKLM ++FEI+Y+L +EN+  DALS +   +QL ++S    L 
Subjt:  ----------------------------------------------HVIPNKYKKWITKLMGYDFEIEYKLGLENK-DDALSHQSEPVQLNALSLLGGLN

Query:  ISIFAQEIRDDAELSKVITTLQANQPKMAGYSLRNDTLFSHGR------------------------------------REVYWLGMRQAVQRFCAGCTI
        +    + + +D EL +++  +QA       YSL    L   GR                                    REVYW GM++ VQ F A C +
Subjt:  ISIFAQEIRDDAELSKVITTLQANQPKMAGYSLRNDTLFSHGR------------------------------------REVYWLGMRQAVQRFCAGCTI

Query:  CQQAKYLSLAPAGLLQLLPILDRIWEDISMDFVEGLPKSDG
        CQQ KY++L+P GLLQ LPI + IW D+SMDF+EGLPK+ G
Subjt:  CQQAKYLSLAPAGLLQLLPILDRIWEDISMDFVEGLPKSDG

XP_024017591.1 uncharacterized protein LOC112090471 [Morus notabilis]3.8e-4832.47Show/hide
Query:  DLRLRKLDVLIFEAENRDGWLHRVARYFKINRLADNEKLEASILCL-----------------------------------EGTLSDRFAAVRQETTVRD
        + R R++++ +F+ EN DGW+ R  RYF +NRL D EKL+ +++ L                                   EGTL ++F ++ QETTVR+
Subjt:  DLRLRKLDVLIFEAENRDGWLHRVARYFKINRLADNEKLEASILCL-----------------------------------EGTLSDRFAAVRQETTVRD

Query:  YRRRFEEAGGVVDGFPAFLLEGLFINGLKEDLRNEERVMKPSGLKEMMVMVQLLKDKICKESDPLTSGGS-NSTVKSSGPSQRIG---------------
        YRR+FE    ++   P  +LE  F+NGLK ++R E R+MKP+GL  +M   Q ++++      P T   S NS  ++   SQ +G               
Subjt:  YRRRFEEAGGVVDGFPAFLLEGLFINGLKEDLRNEERVMKPSGLKEMMVMVQLLKDKICKESDPLTSGGS-NSTVKSSGPSQRIG---------------

Query:  -----------DQKSRRTTA--EQFRRLSDIKLKSKREKGLCYRCNEKFAPGHRCKKKELQILVVQKVEAVGDEFQDAVDTIGPETKSDGVGLTMLSLNS
                    +K++ +T+    +RRL+D K+++KREKGLCYRC+ K++ G+RC  +ELQ+L+V++ + V +E ++  + +G + +     +  LSLNS
Subjt:  -----------DQKSRRTTA--EQFRRLSDIKLKSKREKGLCYRCNEKFAPGHRCKKKELQILVVQKVEAVGDEFQDAVDTIGPETKSDGVGLTMLSLNS

Query:  LVGLTSTKTLKVCGQLDKRGIVALVDSGATHNFIALDLVCACQLPISDTKGYEIVLGHYGGKRGSTEVRKVLLEFEKV-----FRPLE
        +VG TS KT+K+ G L  + +  L+DSGA HNFI++DLV   +L +  T+ Y +++G     +     R +++  + +     F PLE
Subjt:  LVGLTSTKTLKVCGQLDKRGIVALVDSGATHNFIALDLVCACQLPISDTKGYEIVLGHYGGKRGSTEVRKVLLEFEKV-----FRPLE

XP_034697296.1 uncharacterized protein K02A2.6-like [Vitis riparia]1.4e-4735.62Show/hide
Query:  RLRKLDVLIFEAENRDGWLHRVARYFKINRLADNEKLEASILCLE-----------------------------------GTLSDRFAAVRQETTVRDYR
        R RKL++ +F   N DGW+ +  RYF  NRL + EKLEA+I+  E                                   GTL +++ A+ Q+ +V DYR
Subjt:  RLRKLDVLIFEAENRDGWLHRVARYFKINRLADNEKLEASILCLE-----------------------------------GTLSDRFAAVRQETTVRDYR

Query:  RRFEEAGGVVDGFPAFLLEGLFINGLKEDLRNEERVMKPSGLKEMMVMVQLLKDK--ICKESDP---LTSGGSNSTVKSSGPSQR----IGDQKSRRTTA
        RRF E    ++     +  G FINGLK ++R E R+++PS L   M + Q +++K  I K   P   +T  G+ ++  S  PS+          S    A
Subjt:  RRFEEAGGVVDGFPAFLLEGLFINGLKEDLRNEERVMKPSGLKEMMVMVQLLKDK--ICKESDP---LTSGGSNSTVKSSGPSQR----IGDQKSRRTTA

Query:  EQFRRLSDIKLKSKREKGLCYRCNEKFAPGHRCKKKELQILVVQKVEAVGDEFQDAVDTIGPETKSDGVG-LTMLSLNSLVGLTSTKTLKVCGQLDKRGI
         + RRLSD +L+ KREKGLC+RC+EK+ PGHRCKKKEL +L++   +       D  +    + ++  +  +  +SL+S+VGLT+ KT+K+ G + ++ +
Subjt:  EQFRRLSDIKLKSKREKGLCYRCNEKFAPGHRCKKKELQILVVQKVEAVGDEFQDAVDTIGPETKSDGVG-LTMLSLNSLVGLTSTKTLKVCGQLDKRGI

Query:  VALVDSGATHNFIALDLVCACQLPISDTKGYEIVLGHYGGKRGSTEVRKVLLEFEKVFRPLEGLP
        V L+D GATHNFI+LDLV   QLPI D++ Y + +G     RG    R V L  + +    E LP
Subjt:  VALVDSGATHNFIALDLVCACQLPISDTKGYEIVLGHYGGKRGSTEVRKVLLEFEKVFRPLEGLP

TrEMBL top hitse value%identityAlignment
A0A087HNU1 Uncharacterized protein2.0e-4222.81Show/hide
Query:  RKLDVLIFEAENRDGWLHRVARYFKINRLADNEKLEASILCLEGTLSDRFAAVRQETTVRDYRRRFEEAGGVVDGFPAFLLEGLFINGLKEDLRNEERVM
        R+L++ IF  E  + W+ R+ +YF +    D +KL                   +E TV++Y R F          P F+LE  F+NGL+ ++R      
Subjt:  RKLDVLIFEAENRDGWLHRVARYFKINRLADNEKLEASILCLEGTLSDRFAAVRQETTVRDYRRRFEEAGGVVDGFPAFLLEGLFINGLKEDLRNEERVM

Query:  KPSGLKEMMVMVQLLKDKICKESDPLT-------SGGSNSTVKSSGPS-------------------------QRIGDQKSRRTTAEQ--FRRLSDIKLK
         P  L  MM   + +   +  E+ P          G     +K  GPS                         QR  D+   R    +  FRRL   ++ 
Subjt:  KPSGLKEMMVMVQLLKDKICKESDPLT-------SGGSNSTVKSSGPS-------------------------QRIGDQKSRRTTAEQ--FRRLSDIKLK

Query:  SKREKGLCYRCNEKFAPGHRCKKKELQILVVQKVEAVGDEFQDAVDTIGPETKSDGVGLTMLSLNSLVGLTSTKTLKVCGQLDKRGIVALVDSGATHNFI
         ++ +GLC+RC+EK     +C  KE Q+L+VQK +    E ++A +    ET  + VG+  LSLNS+VG++S +T+K+ G +    +V L+DSGATHNFI
Subjt:  SKREKGLCYRCNEKFAPGHRCKKKELQILVVQKVEAVGDEFQDAVDTIGPETKSDGVGLTMLSLNSLVGLTSTKTLKVCGQLDKRGIVALVDSGATHNFI

Query:  ALDLVCACQLPISDTKGYEIVLG-------------------------------------------------------HYGGKRGS--------TE-VRK
        +  +V   +L   +TKGY +V G                                                        Y G +          TE V+ 
Subjt:  ALDLVCACQLPISDTKGYEIVLG-------------------------------------------------------HYGGKRGS--------TE-VRK

Query:  VLLEFEKVFRPLEGLPPSRQQDHAIELQPSAGPVN-----------------------------------------------------------------
        +L EF++VF   +GLPPSR ++H IEL   A PV+                                                                 
Subjt:  VLLEFEKVFRPLEGLPPSRQQDHAIELQPSAGPVN-----------------------------------------------------------------

Query:  ---------------------------------------------------------HVI--------PNK-----------------------------
                                                                 HVI        P K                             
Subjt:  ---------------------------------------------------------HVI--------PNK-----------------------------

Query:  ------------------------------------------------------------YKKWITKLMGYDFEIEYKLGLENK-DDALSHQSEPVQLNA
                                                                    Y++W+TK++G+DFEI+YK GLENK  DALS      QL A
Subjt:  ------------------------------------------------------------YKKWITKLMGYDFEIEYKLGLENK-DDALSHQSEPVQLNA

Query:  LSLLGGLNISIFAQEIRDDAELSKVITTLQANQPKMAGYSLRNDTLFSHGR------------------------------------REVYWLGMRQAVQ
        LS+   + ++   + +  DAELSK+   +  +      +S+    L   GR                                       +W  M   ++
Subjt:  LSLLGGLNISIFAQEIRDDAELSKVITTLQANQPKMAGYSLRNDTLFSHGR------------------------------------REVYWLGMRQAVQ

Query:  RFCAGCTICQQAKYLSLAPAGLLQLLPILDRIWEDISMDFVEGLPKSDGVEMLAI
        ++ A C +CQ+ KY +LAPAGLLQ LP+  ++WEDISMDFVEGLPKSDG +++ +
Subjt:  RFCAGCTICQQAKYLSLAPAGLLQLLPILDRIWEDISMDFVEGLPKSDGVEMLAI

A0A5A7T217 Putative retroelement pol polyprotein3.6e-4424.14Show/hide
Query:  RLRKLDVLIFEAENRDGWLHRVAR-YFKINRLADNEKLEASILC-----LEGTLSDRFAAVRQETTVRDYRRRFEEAGGVVDGFPAFLLEGLFINGLKED
        + +K+++ +F  E+ D WL R  R   + ++      ++  +L       +GT+S +F  ++QE+TV +Y   F++    V+  P  ++E  F+NGL   
Subjt:  RLRKLDVLIFEAENRDGWLHRVAR-YFKINRLADNEKLEASILC-----LEGTLSDRFAAVRQETTVRDYRRRFEEAGGVVDGFPAFLLEGLFINGLKED

Query:  LRNEERVMKPSGLKEMMVMVQLLKDKICKESDPLTSG----------GSNSTVKSSGPS----------------QRIGDQKSRRTTAEQFRRLSDIKLK
        +R+E    +P GL EMM + Q+++++    ++   SG          G N    S G +                +  G  ++RR     ++RL D + +
Subjt:  LRNEERVMKPSGLKEMMVMVQLLKDKICKESDPLTSG----------GSNSTVKSSGPS----------------QRIGDQKSRRTTAEQFRRLSDIKLK

Query:  SKREKGLCYRCNEKFAPGHRCK---KKELQILVV----QKVEAVGDEFQDAVDTIGP-ETKSDGVGLTMLSLNSLVGLTSTKTLKVCGQLDKRGIVALVD
        +++EKGLC+RCNEK++  H+C+   ++EL++ VV    ++ E V +E    V  +G  E   D   +  LS+NS+VGL    T+KV G+L    ++ L+D
Subjt:  SKREKGLCYRCNEKFAPGHRCK---KKELQILVV----QKVEAVGDEFQDAVDTIGP-ETKSDGVGLTMLSLNSLVGLTSTKTLKVCGQLDKRGIVALVD

Query:  SGATHNFIALDLVCACQLPISDTKGYEIVLGHYG-----GKRGSTEVRKVLLEFEKVFRPL---------------------------------------
         GATHNF++  L     LPI +T  Y ++LG        G  G  EV+    +  + F PL                                       
Subjt:  SGATHNFIALDLVCACQLPISDTKGYEIVLGHYG-----GKRGSTEVRKVLLEFEKVFRPL---------------------------------------

Query:  ----------EGLPPSRQQDHAI-----------------------------------------------------------------------------
                  EG+    ++  AI                                                                             
Subjt:  ----------EGLPPSRQQDHAI-----------------------------------------------------------------------------

Query:  -----------------ELQPSAGPVNH-------------------------VIPNKYKKWITKLMGYDFEIEYKLGLENK-DDALSHQSEPVQLNALS
                         + +P A   +H                         VI  +Y+KWI KL+ Y FE+ YK  +EN+  DALS + + VQL  LS
Subjt:  -----------------ELQPSAGPVNH-------------------------VIPNKYKKWITKLMGYDFEIEYKLGLENK-DDALSHQSEPVQLNALS

Query:  LLGGLNISIFAQEIRDDAELSKVITTL-QANQPKMAGYSLRNDTLFSHGR------------------------------------REVYWLGMRQAVQR
        +   ++  +  +E+  D +  K+I  + Q+ +   + YSL+   L    R                                     E+YW GM+  +++
Subjt:  LLGGLNISIFAQEIRDDAELSKVITTL-QANQPKMAGYSLRNDTLFSHGR------------------------------------REVYWLGMRQAVQR

Query:  FCAGCTICQQAKYLSLAPAGLLQLLPILDRIWEDISMDFVEGLPKSDGVEMLAI
         C  C  CQ++K L+L+PAGLL  L I   IW DISMDFVEGLPKS G E++ +
Subjt:  FCAGCTICQQAKYLSLAPAGLLQLLPILDRIWEDISMDFVEGLPKSDGVEMLAI

A0A5C7IJS7 Uncharacterized protein6.1e-4433.08Show/hide
Query:  VGSSEGSTQLFNALAPVFDLRLRKLDVLIFEAENRDGWLHRVARYFKINRLADNEKLEASIL-----------------------------------CLE
        +G + GS    N      D R RKL++ +F+  N DGW+ +   YF + R  + EKLEAS++                                     E
Subjt:  VGSSEGSTQLFNALAPVFDLRLRKLDVLIFEAENRDGWLHRVARYFKINRLADNEKLEASIL-----------------------------------CLE

Query:  GTLSDRFAAVRQETTVRDYRRRFEEAGGVVDGFPAFLLEGLFINGLKEDLRNEERVMKPSGLKEMMVMVQLLKDKICKESDPLTSGGSNSTVKSSG----
        G+L ++F A+RQ+ TV++YRR+F E    +D     +    FINGL  ++RNE RV+ P  L   M + Q ++ K+        +GG ++T K SG    
Subjt:  GTLSDRFAAVRQETTVRDYRRRFEEAGGVVDGFPAFLLEGLFINGLKEDLRNEERVMKPSGLKEMMVMVQLLKDKICKESDPLTSGGSNSTVKSSG----

Query:  -------PSQRIGDQKSRRTTAEQFRRLSDIKLKSKREKGLCYRCNEKFAPGHRCKKKELQILVV--QKVEAVGDEFQDAVD--TIGPETKSDGVGLTML
               P    G            RRL+D +L+ KR  GLCYRC+EK++PGH+CKKKEL +L+   ++ E   +E    VD   +     S+      +
Subjt:  -------PSQRIGDQKSRRTTAEQFRRLSDIKLKSKREKGLCYRCNEKFAPGHRCKKKELQILVV--QKVEAVGDEFQDAVD--TIGPETKSDGVGLTML

Query:  SLNSLVGLTSTKTLKVCGQLDKRGIVALVDSGATHNFIALDLVCACQLPISDTKGYEIVLGHYGGKRGSTEVRKVLLEFEKVFRPLEGLP
        SLNS+VGLT+ KT+K+ G + ++ +V L+D GATHNFI+ DLV   +LPI+ T+ Y + +G     RG    + V L  + +    E LP
Subjt:  SLNSLVGLTSTKTLKVCGQLDKRGIVALVDSGATHNFIALDLVCACQLPISDTKGYEIVLGHYGGKRGSTEVRKVLLEFEKVFRPLEGLP

A0A5D3CTU6 Ty3-gypsy retrotransposon protein2.2e-4929.94Show/hide
Query:  REKGLCYRCNEKFAPGHRCKKKELQILVVQKVEAVGDEFQDAVDTIGPETKSDG-----VGLTMLSLNSLVGLTSTKTLKVCGQLDKRGIVALVDSGATH
        +EKGLC+RC+EKF+  HRCK+ EL I+ VQ+ E + +E     D +G E ++ G     + +  LSLNSLVGL+S KT+K+ G++  R IV L+D GATH
Subjt:  REKGLCYRCNEKFAPGHRCKKKELQILVVQKVEAVGDEFQDAVDTIGPETKSDG-----VGLTMLSLNSLVGLTSTKTLKVCGQLDKRGIVALVDSGATH

Query:  NFIALDLVCACQLPISDTKGYEIVLGHYGGKRGS------------------------------------------------------------------
        NFI  ++V   ++ I     Y IVLG  G  R +                                                                  
Subjt:  NFIALDLVCACQLPISDTKGYEIVLGHYGGKRGS------------------------------------------------------------------

Query:  -------------------------TEVRKVLLEFEKVFRPLEGLPPSRQQDHAIELQPSAGPVNHV---------------------------------
                                  E++KVL  F  VF     LPP R  DHAIEL+  A  VN V                                 
Subjt:  -------------------------TEVRKVLLEFEKVFRPLEGLPPSRQQDHAIELQPSAGPVNHV---------------------------------

Query:  --------------------------------------------------IPNKYKKWITKLMGYDFEIEYKLGLENK-DDALSHQSEPVQLNALSLLGG
                                                          I  +Y++WI KL+ YDF IEY  GLENK  DALS      +L  +S++GG
Subjt:  --------------------------------------------------IPNKYKKWITKLMGYDFEIEYKLGLENK-DDALSHQSEPVQLNALSLLGG

Query:  LNISIFAQE------IRDDAELSKVITTLQANQPKMAGYSLRNDTLFSHGR--REVYWLGMRQAVQRFCAGCTICQQAKYLSLAPAGLLQLLPILDRIWE
        LN S+F  +      +RD   +  ++   + +   + G+   +  L ++ R  REVYW GM+  ++ + A C++CQQAKYLSL P+GL Q LPILDR+WE
Subjt:  LNISIFAQE------IRDDAELSKVITTLQANQPKMAGYSLRNDTLFSHGR--REVYWLGMRQAVQRFCAGCTICQQAKYLSLAPAGLLQLLPILDRIWE

Query:  DISMDFVEGLPKSDGVEMLAI
        DIS+DF++GLPKS G +++ +
Subjt:  DISMDFVEGLPKSDGVEMLAI

A0A803QDN9 Uncharacterized protein5.5e-5327Show/hide
Query:  SDRFAAVRQETTVRDYRRRFEEAGGVVDGFPAFLLEGLFINGLKEDLRNEERVMKPSGLKEMMVMVQLLKD--------KICKESDPLTSGGSNSTVKS-
        ++   +V Q  TV++YR ++E     V   P  +LEG F+ GLKE+++    +++P GL  +M   Q +++         + K +    S GS+S  +S 
Subjt:  SDRFAAVRQETTVRDYRRRFEEAGGVVDGFPAFLLEGLFINGLKEDLRNEERVMKPSGLKEMMVMVQLLKD--------KICKESDPLTSGGSNSTVKS-

Query:  -----------------------SGPSQRIGDQKSRRTTAEQFRRLSDIKLKSKREKGLCYRCNEKFAPGHRCKKKELQILVVQKVEAVGDEFQDAVDTI
                               S  S     + +R  T   ++RL++ + + K  +GL + C++KF PG  C++K LQIL+    E +  +  D    +
Subjt:  -----------------------SGPSQRIGDQKSRRTTAEQFRRLSDIKLKSKREKGLCYRCNEKFAPGHRCKKKELQILVVQKVEAVGDEFQDAVDTI

Query:  GP---ETKSDGVGLTMLSLNSLVGLTSTKTLKVCGQLDKRGIVALVDSGATHNFIALDLVCACQLPISDTKGYEIVLG----------------------
         P   ET      L  LSLNSLVGL++  T+K+  Q+  + +  L+DSGATHNF+ +++  A  LPIS T  Y I+LG                      
Subjt:  GP---ETKSDGVGLTMLSLNSLVGLTSTKTLKVCGQLDKRGIVALVDSGATHNFIALDLVCACQLPISDTKGYEIVLG----------------------

Query:  --------------------------------------HYGGKRG----STEVRKVLLEFEKVFRPLEGLPPSRQQDHAIELQPSAGPVN----------
                                                GG  G      E+ ++L ++E VF    GLPP RQ+DH I L+P+ GP++          
Subjt:  --------------------------------------HYGGKRG----STEVRKVLLEFEKVFRPLEGLPPSRQQDHAIELQPSAGPVN----------

Query:  -------------------------------------HVI--------PNK----------------------------------YKKWITKLMGYDFEI
                                             HVI        P+K                                  ++KW+TK++GYDF+I
Subjt:  -------------------------------------HVI--------PNK----------------------------------YKKWITKLMGYDFEI

Query:  EYKLGLENKD-DALSHQSEPVQLNALSLLGGLNISIFAQEIRDDAELSKVITTLQANQPKMAGYSLRNDTLFSHGR------------------------
        +YK GLEN+  DALSH S  V L A+S+ G  + +     I  D +L+K+++ +   Q    G+S+ +  L   GR                        
Subjt:  EYKLGLENKD-DALSHQSEPVQLNALSLLGGLNISIFAQEIRDDAELSKVITTLQANQPKMAGYSLRNDTLFSHGR------------------------

Query:  ------------REVYWLGMRQAVQRFCAGCTICQQAKYLSLAPAGLLQLLPILDRIWEDISMDFVEGLPKSDG
                     + YW GMR+ VQ F   C++CQQ KYL+  PAGLLQ LP  +++WEDIS +F+EGLP S+G
Subjt:  ------------REVYWLGMRQAVQRFCAGCTICQQAKYLSLAPAGLLQLLPILDRIWEDISMDFVEGLPKSDG

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein4.4e-0729.85Show/hide
Query:  LSHQSEPVQLNALSLLGGLNISIFAQ-EIRDDAELSKVITTLQANQPKM--AGYSLRNDTLFSHGRREVYWLGMRQAVQRFCAGCTICQQAKYLSLAPAG
        L+++ + V+ N + L  GL I+   Q  + +D +L++ I      + K+   G  L  + +     R   W G+R+ +Q +   C  CQ  K  +  P G
Subjt:  LSHQSEPVQLNALSLLGGLNISIFAQ-EIRDDAELSKVITTLQANQPKM--AGYSLRNDTLFSHGRREVYWLGMRQAVQRFCAGCTICQQAKYLSLAPAG

Query:  LLQLLPILDRIWEDISMDFVEGLPKSDGVEMLAI
         LQ +P  +R WE +SMDF+  LP+S G   L +
Subjt:  LLQLLPILDRIWEDISMDFVEGLPKSDGVEMLAI

P0CT35 Transposon Tf2-2 polyprotein4.4e-0729.85Show/hide
Query:  LSHQSEPVQLNALSLLGGLNISIFAQ-EIRDDAELSKVITTLQANQPKM--AGYSLRNDTLFSHGRREVYWLGMRQAVQRFCAGCTICQQAKYLSLAPAG
        L+++ + V+ N + L  GL I+   Q  + +D +L++ I      + K+   G  L  + +     R   W G+R+ +Q +   C  CQ  K  +  P G
Subjt:  LSHQSEPVQLNALSLLGGLNISIFAQ-EIRDDAELSKVITTLQANQPKM--AGYSLRNDTLFSHGRREVYWLGMRQAVQRFCAGCTICQQAKYLSLAPAG

Query:  LLQLLPILDRIWEDISMDFVEGLPKSDGVEMLAI
         LQ +P  +R WE +SMDF+  LP+S G   L +
Subjt:  LLQLLPILDRIWEDISMDFVEGLPKSDGVEMLAI

P0CT36 Transposon Tf2-3 polyprotein4.4e-0729.85Show/hide
Query:  LSHQSEPVQLNALSLLGGLNISIFAQ-EIRDDAELSKVITTLQANQPKM--AGYSLRNDTLFSHGRREVYWLGMRQAVQRFCAGCTICQQAKYLSLAPAG
        L+++ + V+ N + L  GL I+   Q  + +D +L++ I      + K+   G  L  + +     R   W G+R+ +Q +   C  CQ  K  +  P G
Subjt:  LSHQSEPVQLNALSLLGGLNISIFAQ-EIRDDAELSKVITTLQANQPKM--AGYSLRNDTLFSHGRREVYWLGMRQAVQRFCAGCTICQQAKYLSLAPAG

Query:  LLQLLPILDRIWEDISMDFVEGLPKSDGVEMLAI
         LQ +P  +R WE +SMDF+  LP+S G   L +
Subjt:  LLQLLPILDRIWEDISMDFVEGLPKSDGVEMLAI

P0CT41 Transposon Tf2-12 polyprotein4.4e-0729.85Show/hide
Query:  LSHQSEPVQLNALSLLGGLNISIFAQ-EIRDDAELSKVITTLQANQPKM--AGYSLRNDTLFSHGRREVYWLGMRQAVQRFCAGCTICQQAKYLSLAPAG
        L+++ + V+ N + L  GL I+   Q  + +D +L++ I      + K+   G  L  + +     R   W G+R+ +Q +   C  CQ  K  +  P G
Subjt:  LSHQSEPVQLNALSLLGGLNISIFAQ-EIRDDAELSKVITTLQANQPKM--AGYSLRNDTLFSHGRREVYWLGMRQAVQRFCAGCTICQQAKYLSLAPAG

Query:  LLQLLPILDRIWEDISMDFVEGLPKSDGVEMLAI
         LQ +P  +R WE +SMDF+  LP+S G   L +
Subjt:  LLQLLPILDRIWEDISMDFVEGLPKSDGVEMLAI

Q9UR07 Transposon Tf2-11 polyprotein4.4e-0729.85Show/hide
Query:  LSHQSEPVQLNALSLLGGLNISIFAQ-EIRDDAELSKVITTLQANQPKM--AGYSLRNDTLFSHGRREVYWLGMRQAVQRFCAGCTICQQAKYLSLAPAG
        L+++ + V+ N + L  GL I+   Q  + +D +L++ I      + K+   G  L  + +     R   W G+R+ +Q +   C  CQ  K  +  P G
Subjt:  LSHQSEPVQLNALSLLGGLNISIFAQ-EIRDDAELSKVITTLQANQPKM--AGYSLRNDTLFSHGRREVYWLGMRQAVQRFCAGCTICQQAKYLSLAPAG

Query:  LLQLLPILDRIWEDISMDFVEGLPKSDGVEMLAI
         LQ +P  +R WE +SMDF+  LP+S G   L +
Subjt:  LLQLLPILDRIWEDISMDFVEGLPKSDGVEMLAI

Arabidopsis top hitse value%identityAlignment
AT3G29750.1 Eukaryotic aspartyl protease family protein3.6e-0421.25Show/hide
Query:  FAAVRQETTVRDYRRRFEEAGGVVDGFPAFLLEGLFINGLKEDLRNEERVMKPSGLKEMMVMVQLLKDKICKESDPLTSGGSNSTVKSSGPSQRIGDQKS
        ++ ++QE +VRDYR RFE         P    E +F+ GL+  L+   R +KP+G+                                           S
Subjt:  FAAVRQETTVRDYRRRFEEAGGVVDGFPAFLLEGLFINGLKEDLRNEERVMKPSGLKEMMVMVQLLKDKICKESDPLTSGGSNSTVKSSGPSQRIGDQKS

Query:  RRTTAEQFRRLSDIKLKSKREKGLCYRCNEKFAPGHRCKKKELQILVVQKVEAVGDEFQDAVDTIGPETKSDGVGLTMLSLNSLVGLTSTKTLKVCGQLD
         ++   +   L+ ++ K                           + VV+K + V +E +        E + D   L       ++ LT  K ++  G + 
Subjt:  RRTTAEQFRRLSDIKLKSKREKGLCYRCNEKFAPGHRCKKKELQILVVQKVEAVGDEFQDAVDTIGPETKSDGVGLTMLSLNSLVGLTSTKTLKVCGQLD

Query:  KRGIVALVDSGATHNFIALDLVCACQLPISDTKGYEIVLG
           +V  +DSGAT NFI ++L  + +LP S T    ++LG
Subjt:  KRGIVALVDSGATHNFIALDLVCACQLPISDTKGYEIVLG

AT3G42723.1 aminoacyl-tRNA ligases;ATP binding;nucleotide binding5.0e-0636.67Show/hide
Query:  FAAVRQETTVRDYRRRFEE--AGGVVDGFPAFLLEGLFINGLKEDLRNEERVMKPSGLKEMMVMVQLLKDKICKESDPLTSGGSNSTVKS
        ++ ++QE +VR+YR RFE    G V+   P   LE LF+ GL+  L+   R +KP+G+ +MM   Q L     +ES+ L   GS  +V++
Subjt:  FAAVRQETTVRDYRRRFEE--AGGVVDGFPAFLLEGLFINGLKEDLRNEERVMKPSGLKEMMVMVQLLKDKICKESDPLTSGGSNSTVKS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTGCTGTGGGGTCAAGTGAGGGGTCAACTCAACTCTTCAATGCATTGGCACCGGTATTCGATCTTCGTCTTCGTAAGTTGGATGTGCTGATTTTTGAAGCGGAAAA
TCGAGATGGATGGCTGCACCGGGTCGCACGGTATTTTAAGATCAATCGGTTAGCGGACAATGAAAAATTAGAGGCGTCGATCCTTTGCCTTGAGGGTACCCTGTCTGATC
GATTTGCGGCGGTACGACAAGAAACAACCGTCCGAGATTATCGTCGGCGATTTGAGGAGGCGGGTGGTGTGGTTGATGGATTCCCTGCATTTCTTCTTGAAGGCCTATTC
ATCAATGGGTTGAAGGAGGACTTGAGAAACGAGGAAAGGGTGATGAAGCCCAGTGGGCTCAAAGAGATGATGGTAATGGTCCAATTACTTAAGGATAAAATTTGTAAGGA
ATCAGACCCATTGACATCAGGGGGATCGAACAGCACAGTGAAGTCATCTGGGCCATCCCAAAGAATCGGTGACCAAAAGTCGCGTCGCACAACCGCCGAGCAATTTCGGC
GACTTTCTGATATAAAACTCAAATCCAAACGGGAGAAGGGGCTATGCTATCGTTGTAATGAAAAATTCGCACCGGGTCATCGTTGCAAGAAAAAGGAATTACAGATCCTC
GTGGTTCAAAAAGTTGAAGCGGTGGGCGACGAATTCCAGGATGCTGTGGATACCATCGGCCCGGAAACCAAGTCGGATGGGGTGGGTTTGACGATGCTATCCTTGAATTC
GTTGGTGGGTTTAACTTCTACGAAGACCCTCAAGGTATGTGGGCAATTGGACAAGCGAGGGATTGTGGCTTTGGTGGATAGTGGGGCGACGCACAACTTCATTGCCCTAG
ACTTGGTTTGTGCTTGTCAACTCCCAATCTCGGACACTAAGGGGTACGAAATTGTCTTGGGACACTACGGGGGCAAGAGAGGTTCCACTGAAGTGAGGAAGGTGTTGCTT
GAATTTGAGAAGGTTTTTCGACCTCTTGAGGGACTTCCTCCTTCACGGCAACAAGACCATGCCATAGAGTTGCAACCGTCAGCTGGTCCCGTGAATCACGTAATCCCGAA
TAAATATAAAAAATGGATCACCAAATTGATGGGCTATGATTTCGAGATAGAATATAAACTGGGATTGGAAAACAAAGACGATGCTTTGTCTCACCAGTCGGAACCCGTTC
AACTCAATGCATTATCATTGCTTGGGGGACTCAACATCTCCATTTTTGCCCAAGAAATTCGGGACGATGCTGAATTATCCAAGGTCATCACAACTCTGCAGGCTAATCAA
CCAAAAATGGCAGGATATTCTTTAAGGAATGACACCCTGTTTTCCCATGGCCGCCGGGAGGTTTATTGGCTTGGTATGCGTCAAGCTGTTCAGCGTTTTTGTGCAGGATG
TACAATCTGTCAGCAAGCAAAATACCTTTCCTTAGCCCCTGCTGGGTTACTACAACTTCTACCCATTCTGGACCGAATTTGGGAAGACATTTCGATGGATTTCGTGGAGG
GCCTACCCAAATCCGATGGCGTTGAGATGCTCGCTATTTCCTCAGGTTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGTGCTGTGGGGTCAAGTGAGGGGTCAACTCAACTCTTCAATGCATTGGCACCGGTATTCGATCTTCGTCTTCGTAAGTTGGATGTGCTGATTTTTGAAGCGGAAAA
TCGAGATGGATGGCTGCACCGGGTCGCACGGTATTTTAAGATCAATCGGTTAGCGGACAATGAAAAATTAGAGGCGTCGATCCTTTGCCTTGAGGGTACCCTGTCTGATC
GATTTGCGGCGGTACGACAAGAAACAACCGTCCGAGATTATCGTCGGCGATTTGAGGAGGCGGGTGGTGTGGTTGATGGATTCCCTGCATTTCTTCTTGAAGGCCTATTC
ATCAATGGGTTGAAGGAGGACTTGAGAAACGAGGAAAGGGTGATGAAGCCCAGTGGGCTCAAAGAGATGATGGTAATGGTCCAATTACTTAAGGATAAAATTTGTAAGGA
ATCAGACCCATTGACATCAGGGGGATCGAACAGCACAGTGAAGTCATCTGGGCCATCCCAAAGAATCGGTGACCAAAAGTCGCGTCGCACAACCGCCGAGCAATTTCGGC
GACTTTCTGATATAAAACTCAAATCCAAACGGGAGAAGGGGCTATGCTATCGTTGTAATGAAAAATTCGCACCGGGTCATCGTTGCAAGAAAAAGGAATTACAGATCCTC
GTGGTTCAAAAAGTTGAAGCGGTGGGCGACGAATTCCAGGATGCTGTGGATACCATCGGCCCGGAAACCAAGTCGGATGGGGTGGGTTTGACGATGCTATCCTTGAATTC
GTTGGTGGGTTTAACTTCTACGAAGACCCTCAAGGTATGTGGGCAATTGGACAAGCGAGGGATTGTGGCTTTGGTGGATAGTGGGGCGACGCACAACTTCATTGCCCTAG
ACTTGGTTTGTGCTTGTCAACTCCCAATCTCGGACACTAAGGGGTACGAAATTGTCTTGGGACACTACGGGGGCAAGAGAGGTTCCACTGAAGTGAGGAAGGTGTTGCTT
GAATTTGAGAAGGTTTTTCGACCTCTTGAGGGACTTCCTCCTTCACGGCAACAAGACCATGCCATAGAGTTGCAACCGTCAGCTGGTCCCGTGAATCACGTAATCCCGAA
TAAATATAAAAAATGGATCACCAAATTGATGGGCTATGATTTCGAGATAGAATATAAACTGGGATTGGAAAACAAAGACGATGCTTTGTCTCACCAGTCGGAACCCGTTC
AACTCAATGCATTATCATTGCTTGGGGGACTCAACATCTCCATTTTTGCCCAAGAAATTCGGGACGATGCTGAATTATCCAAGGTCATCACAACTCTGCAGGCTAATCAA
CCAAAAATGGCAGGATATTCTTTAAGGAATGACACCCTGTTTTCCCATGGCCGCCGGGAGGTTTATTGGCTTGGTATGCGTCAAGCTGTTCAGCGTTTTTGTGCAGGATG
TACAATCTGTCAGCAAGCAAAATACCTTTCCTTAGCCCCTGCTGGGTTACTACAACTTCTACCCATTCTGGACCGAATTTGGGAAGACATTTCGATGGATTTCGTGGAGG
GCCTACCCAAATCCGATGGCGTTGAGATGCTCGCTATTTCCTCAGGTTCTTAG
Protein sequenceShow/hide protein sequence
MSAVGSSEGSTQLFNALAPVFDLRLRKLDVLIFEAENRDGWLHRVARYFKINRLADNEKLEASILCLEGTLSDRFAAVRQETTVRDYRRRFEEAGGVVDGFPAFLLEGLF
INGLKEDLRNEERVMKPSGLKEMMVMVQLLKDKICKESDPLTSGGSNSTVKSSGPSQRIGDQKSRRTTAEQFRRLSDIKLKSKREKGLCYRCNEKFAPGHRCKKKELQIL
VVQKVEAVGDEFQDAVDTIGPETKSDGVGLTMLSLNSLVGLTSTKTLKVCGQLDKRGIVALVDSGATHNFIALDLVCACQLPISDTKGYEIVLGHYGGKRGSTEVRKVLL
EFEKVFRPLEGLPPSRQQDHAIELQPSAGPVNHVIPNKYKKWITKLMGYDFEIEYKLGLENKDDALSHQSEPVQLNALSLLGGLNISIFAQEIRDDAELSKVITTLQANQ
PKMAGYSLRNDTLFSHGRREVYWLGMRQAVQRFCAGCTICQQAKYLSLAPAGLLQLLPILDRIWEDISMDFVEGLPKSDGVEMLAISSGS