; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g22060 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g22060
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionTy3/gypsy retrotransposon protein
Genome locationchr7:16207263..16211415
RNA-Seq ExpressionMoc07g22060
SyntenyMoc07g22060
Gene Ontology termsNA
InterPro domainsIPR000953 - Chromo/chromo shadow domain
IPR016197 - Chromo-like domain superfamily
IPR021109 - Aspartic peptidase domain superfamily
IPR023780 - Chromo domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0062868.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]7.0e-7841.72Show/hide
Query:  MTVAVISFDGVALAWYWYMENRNSFVDWDDLKNRL---FKSTST-------------------------------QIADEVLEGAFLNELDPIIRAKVLA
        M V+ I FDG AL WY   E R  FV W +LK RL   F+ST                                  + D V+E  F++ L P IRA+V+ 
Subjt:  MTVAVISFDGVALAWYWYMENRNSFVDWDDLKNRL---FKSTST-------------------------------QIADEVLEGAFLNELDPIIRAKVLA

Query:  MEPKGLDQIMRKAQLIEDIGLADQEAGELNPNQVTKKPDTTIAKTM---------NKAVDPVPTRTVTIASKGTDAFPATRIAPTQRQLTKVEYQKQKEK
          PKGL ++MR AQL+ED  +  + A  LN     K    T   T          NKA  P P RT+T+ S  +     TR   T ++L   E+Q ++EK
Subjt:  MEPKGLDQIMRKAQLIEDIGLADQEAGELNPNQVTKKPDTTIAKTM---------NKAVDPVPTRTVTIASKGTDAFPATRIAPTQRQLTKVEYQKQKEK

Query:  GLCFRCDEKYSIGYKCK---NRELRVYVV-HNDEEVDDSEESE--TVEVVEDISNDKGKAVEFVAFNSIVGLTTPGTMKLKGSIQGKEVIILIDCGATHN
        GLCF+C+EKYS  +KCK    RELR++VV  N+EE++  EE+E  T E+       +  A   ++ NS+VGL  PGTMK++G++QGKEV+ILIDCGATHN
Subjt:  GLCFRCDEKYSIGYKCK---NRELRVYVV-HNDEEVDDSEESE--TVEVVEDISNDKGKAVEFVAFNSIVGLTTPGTMKLKGSIQGKEVIILIDCGATHN

Query:  FVSMRVVEELSIPRTDTSNYGVIMGTGIAVKGNGICRDVVLDLPNLTVREDFLPLELGSLDVVLGMQWLRRIGKMKVDWPALSMSFKQDGKKIQLQGDPT
        FVS ++V  L +P  +T++YGVI+G+G A++G GIC  + + + + TV+EDFLPLELG +DV+LGMQWL  +G    DW  L+++F  D KKI ++GDP+
Subjt:  FVSMRVVEELSIPRTDTSNYGVIMGTGIAVKGNGICRDVVLDLPNLTVREDFLPLELGSLDVVLGMQWLRRIGKMKVDWPALSMSFKQDGKKIQLQGDPT

Query:  LTQIEVFFKRLSRSWDHRDQGFLVELRAL
        LT+  V  K L ++W+  D G+L+E R++
Subjt:  LTQIEVFFKRLSRSWDHRDQGFLVELRAL

KAA0068193.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]5.3e-7841.72Show/hide
Query:  MTVAVISFDGVALAWYWYMENRNSFVDWDDLKNRL---FKSTS-------------------------------TQIADEVLEGAFLNELDPIIRAKVLA
        M V+ I FDG AL WY   E R  FV W +LK RL   F+ST                                + + D V+E  F++ L P IRA+V+ 
Subjt:  MTVAVISFDGVALAWYWYMENRNSFVDWDDLKNRL---FKSTS-------------------------------TQIADEVLEGAFLNELDPIIRAKVLA

Query:  MEPKGLDQIMRKAQLIEDIGLADQEAGELNPNQVTKKPDTTIAKTM---------NKAVDPVPTRTVTIASKGTDAFPATRIAPTQRQLTKVEYQKQKEK
          PKGL ++MR AQL+ED  +  + A  LN     K    T   T          NKA  P P RT+T+ S  +     TR   T ++L   E+Q ++EK
Subjt:  MEPKGLDQIMRKAQLIEDIGLADQEAGELNPNQVTKKPDTTIAKTM---------NKAVDPVPTRTVTIASKGTDAFPATRIAPTQRQLTKVEYQKQKEK

Query:  GLCFRCDEKYSIGYKCK---NRELRVYVV-HNDEEVDDSEESE--TVEVVEDISNDKGKAVEFVAFNSIVGLTTPGTMKLKGSIQGKEVIILIDCGATHN
        GLCF+C+EKYS  +KCK    RELR++VV  N+EE++  EE+E  T E+       +  A   ++ NS+VGL  PGTMK++G++QGKEV+ILIDCGATHN
Subjt:  GLCFRCDEKYSIGYKCK---NRELRVYVV-HNDEEVDDSEESE--TVEVVEDISNDKGKAVEFVAFNSIVGLTTPGTMKLKGSIQGKEVIILIDCGATHN

Query:  FVSMRVVEELSIPRTDTSNYGVIMGTGIAVKGNGICRDVVLDLPNLTVREDFLPLELGSLDVVLGMQWLRRIGKMKVDWPALSMSFKQDGKKIQLQGDPT
        FVS ++V  L +P  +T++YGVI+G+G A++G GIC  + + + + TV+EDFLPLELG +DV+LGMQWL  +G    DW  L+++F  D KKI ++GDP+
Subjt:  FVSMRVVEELSIPRTDTSNYGVIMGTGIAVKGNGICRDVVLDLPNLTVREDFLPLELGSLDVVLGMQWLRRIGKMKVDWPALSMSFKQDGKKIQLQGDPT

Query:  LTQIEVFFKRLSRSWDHRDQGFLVELRAL
        LT+  V  K L ++W+  D G+L+E R++
Subjt:  LTQIEVFFKRLSRSWDHRDQGFLVELRAL

TYJ96875.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]5.3e-7841.72Show/hide
Query:  MTVAVISFDGVALAWYWYMENRNSFVDWDDLKNRL---FKSTS-------------------------------TQIADEVLEGAFLNELDPIIRAKVLA
        M V+ I FDG AL WY   E R  FV W +LK RL   F+ST                                + + D V+E  F++ L P IRA+V+ 
Subjt:  MTVAVISFDGVALAWYWYMENRNSFVDWDDLKNRL---FKSTS-------------------------------TQIADEVLEGAFLNELDPIIRAKVLA

Query:  MEPKGLDQIMRKAQLIEDIGLADQEAGELNPNQVTKKPDTTIAKTM---------NKAVDPVPTRTVTIASKGTDAFPATRIAPTQRQLTKVEYQKQKEK
          PKGL ++MR AQL+ED  +  + A  LN     K    T   T          NKA  P P RT+T+ S  +     TR   T ++L   E+Q ++EK
Subjt:  MEPKGLDQIMRKAQLIEDIGLADQEAGELNPNQVTKKPDTTIAKTM---------NKAVDPVPTRTVTIASKGTDAFPATRIAPTQRQLTKVEYQKQKEK

Query:  GLCFRCDEKYSIGYKCK---NRELRVYVV-HNDEEVDDSEESE--TVEVVEDISNDKGKAVEFVAFNSIVGLTTPGTMKLKGSIQGKEVIILIDCGATHN
        GLCF+C+EKYS  +KCK    RELR++VV  N+EE++  EE+E  T E+       +  A   ++ NS+VGL  PGTMK++G++QGKEV+ILIDCGATHN
Subjt:  GLCFRCDEKYSIGYKCK---NRELRVYVV-HNDEEVDDSEESE--TVEVVEDISNDKGKAVEFVAFNSIVGLTTPGTMKLKGSIQGKEVIILIDCGATHN

Query:  FVSMRVVEELSIPRTDTSNYGVIMGTGIAVKGNGICRDVVLDLPNLTVREDFLPLELGSLDVVLGMQWLRRIGKMKVDWPALSMSFKQDGKKIQLQGDPT
        FVS ++V  L +P  +T++YGVI+G+G A++G GIC  + + + + TV+EDFLPLELG +DV+LGMQWL  +G    DW  L+++F  D KKI ++GDP+
Subjt:  FVSMRVVEELSIPRTDTSNYGVIMGTGIAVKGNGICRDVVLDLPNLTVREDFLPLELGSLDVVLGMQWLRRIGKMKVDWPALSMSFKQDGKKIQLQGDPT

Query:  LTQIEVFFKRLSRSWDHRDQGFLVELRAL
        LT+  V  K L ++W+  D G+L+E R++
Subjt:  LTQIEVFFKRLSRSWDHRDQGFLVELRAL

TYK21115.1 transposon Tf2-1 polyprotein isoform X1 [Cucumis melo var. makuwa]5.3e-7841.72Show/hide
Query:  MTVAVISFDGVALAWYWYMENRNSFVDWDDLKNRL---FKSTS-------------------------------TQIADEVLEGAFLNELDPIIRAKVLA
        M V+ I FDG AL WY   E R  FV W +LK RL   F+ST                                + + D V+E  F++ L P IRA+V+ 
Subjt:  MTVAVISFDGVALAWYWYMENRNSFVDWDDLKNRL---FKSTS-------------------------------TQIADEVLEGAFLNELDPIIRAKVLA

Query:  MEPKGLDQIMRKAQLIEDIGLADQEAGELNPNQVTKKPDTTIAKTM---------NKAVDPVPTRTVTIASKGTDAFPATRIAPTQRQLTKVEYQKQKEK
          PKGL ++MR AQL+ED  +  + A  LN     K    T   T          NKA  P P RT+T+ S  +     TR   T ++L   E+Q ++EK
Subjt:  MEPKGLDQIMRKAQLIEDIGLADQEAGELNPNQVTKKPDTTIAKTM---------NKAVDPVPTRTVTIASKGTDAFPATRIAPTQRQLTKVEYQKQKEK

Query:  GLCFRCDEKYSIGYKCK---NRELRVYVV-HNDEEVDDSEESE--TVEVVEDISNDKGKAVEFVAFNSIVGLTTPGTMKLKGSIQGKEVIILIDCGATHN
        GLCF+C+EKYS  +KCK    RELR++VV  N+EE++  EE+E  T E+       +  A   ++ NS+VGL  PGTMK++G++QGKEV+ILIDCGATHN
Subjt:  GLCFRCDEKYSIGYKCK---NRELRVYVV-HNDEEVDDSEESE--TVEVVEDISNDKGKAVEFVAFNSIVGLTTPGTMKLKGSIQGKEVIILIDCGATHN

Query:  FVSMRVVEELSIPRTDTSNYGVIMGTGIAVKGNGICRDVVLDLPNLTVREDFLPLELGSLDVVLGMQWLRRIGKMKVDWPALSMSFKQDGKKIQLQGDPT
        FVS ++V  L +P  +T++YGVI+G+G A++G GIC  + + + + TV+EDFLPLELG +DV+LGMQWL  +G    DW  L+++F  D KKI ++GDP+
Subjt:  FVSMRVVEELSIPRTDTSNYGVIMGTGIAVKGNGICRDVVLDLPNLTVREDFLPLELGSLDVVLGMQWLRRIGKMKVDWPALSMSFKQDGKKIQLQGDPT

Query:  LTQIEVFFKRLSRSWDHRDQGFLVELRAL
        LT+  V  K L ++W+  D G+L+E R++
Subjt:  LTQIEVFFKRLSRSWDHRDQGFLVELRAL

XP_022154744.1 uncharacterized protein LOC111021922 [Momordica charantia]3.0e-8945.91Show/hide
Query:  MTVAVISFDGVALAWYWYMENRNSFVDWDDLKNRLFKSTSTQIADEVLEGAFLNELDPIIRAKVLAMEPKGLDQIMRK-----AQLIEDIGLADQEAGEL
        MTV +ISF+G A+ WY + ENR  F DW++LK R+F+              F +  D  + ++ L+++ +G     R+     +  + DI     E    
Subjt:  MTVAVISFDGVALAWYWYMENRNSFVDWDDLKNRLFKSTSTQIADEVLEGAFLNELDPIIRAKVLAMEPKGLDQIMRK-----AQLIEDIGLADQEAGEL

Query:  NPNQVTKKPDT--TIAKTMNKAVDPVPTRTVTIASKGTDAFPATRIAPTQRQLTKVEYQKQKEKGLCFRCDEKYSIGYKCKNRELRVYVVHNDE--EVDD
             ++ P T     KT  K  + V TRTVT++ K        R   TQ++LT+ EYQ++K+KGLCFR +EKYSIG++CKN+EL+V+VVH+DE  E+D 
Subjt:  NPNQVTKKPDT--TIAKTMNKAVDPVPTRTVTIASKGTDAFPATRIAPTQRQLTKVEYQKQKEKGLCFRCDEKYSIGYKCKNRELRVYVVHNDE--EVDD

Query:  SE-----ESETVEVVEDISNDKGKAVEFVAFNSIVGLTTPGTMKLKGSIQGKEVIILIDCGATHNFVSMRVVEELSIPRTDTSNYGVIMGTGIAVKGNGI
         E     E     +VE+++         +A N++VG +TPGTMKL+G I+ KEV+ILIDCGATHNF+S ++V+  ++P  +TSNYGVIMGTG+ V+G GI
Subjt:  SE-----ESETVEVVEDISNDKGKAVEFVAFNSIVGLTTPGTMKLKGSIQGKEVIILIDCGATHNFVSMRVVEELSIPRTDTSNYGVIMGTGIAVKGNGI

Query:  CRDVVLDLPNLTVREDFLPLELGSLDVVLGMQWLRRIGKMKVDWPALSMSFKQDGKKIQLQGDPTLTQIEVFFKRLSRSWDHRDQGFLVELRALLTATEE
        C+ ++L LP LT+RE+FLPLELG+LDVVLGMQWL   G M+VDW AL+MSF+    +I L+GDPTL ++EV  K+L+R+W+  DQGFLVELRA  +A  E
Subjt:  CRDVVLDLPNLTVREDFLPLELGSLDVVLGMQWLRRIGKMKVDWPALSMSFKQDGKKIQLQGDPTLTQIEVFFKRLSRSWDHRDQGFLVELRALLTATEE

Query:  GME
        G E
Subjt:  GME

TrEMBL top hitse value%identityAlignment
A0A5A7V5H5 Ty3/gypsy retrotransposon protein3.4e-7841.72Show/hide
Query:  MTVAVISFDGVALAWYWYMENRNSFVDWDDLKNRL---FKSTST-------------------------------QIADEVLEGAFLNELDPIIRAKVLA
        M V+ I FDG AL WY   E R  FV W +LK RL   F+ST                                  + D V+E  F++ L P IRA+V+ 
Subjt:  MTVAVISFDGVALAWYWYMENRNSFVDWDDLKNRL---FKSTST-------------------------------QIADEVLEGAFLNELDPIIRAKVLA

Query:  MEPKGLDQIMRKAQLIEDIGLADQEAGELNPNQVTKKPDTTIAKTM---------NKAVDPVPTRTVTIASKGTDAFPATRIAPTQRQLTKVEYQKQKEK
          PKGL ++MR AQL+ED  +  + A  LN     K    T   T          NKA  P P RT+T+ S  +     TR   T ++L   E+Q ++EK
Subjt:  MEPKGLDQIMRKAQLIEDIGLADQEAGELNPNQVTKKPDTTIAKTM---------NKAVDPVPTRTVTIASKGTDAFPATRIAPTQRQLTKVEYQKQKEK

Query:  GLCFRCDEKYSIGYKCK---NRELRVYVV-HNDEEVDDSEESE--TVEVVEDISNDKGKAVEFVAFNSIVGLTTPGTMKLKGSIQGKEVIILIDCGATHN
        GLCF+C+EKYS  +KCK    RELR++VV  N+EE++  EE+E  T E+       +  A   ++ NS+VGL  PGTMK++G++QGKEV+ILIDCGATHN
Subjt:  GLCFRCDEKYSIGYKCK---NRELRVYVV-HNDEEVDDSEESE--TVEVVEDISNDKGKAVEFVAFNSIVGLTTPGTMKLKGSIQGKEVIILIDCGATHN

Query:  FVSMRVVEELSIPRTDTSNYGVIMGTGIAVKGNGICRDVVLDLPNLTVREDFLPLELGSLDVVLGMQWLRRIGKMKVDWPALSMSFKQDGKKIQLQGDPT
        FVS ++V  L +P  +T++YGVI+G+G A++G GIC  + + + + TV+EDFLPLELG +DV+LGMQWL  +G    DW  L+++F  D KKI ++GDP+
Subjt:  FVSMRVVEELSIPRTDTSNYGVIMGTGIAVKGNGICRDVVLDLPNLTVREDFLPLELGSLDVVLGMQWLRRIGKMKVDWPALSMSFKQDGKKIQLQGDPT

Query:  LTQIEVFFKRLSRSWDHRDQGFLVELRAL
        LT+  V  K L ++W+  D G+L+E R++
Subjt:  LTQIEVFFKRLSRSWDHRDQGFLVELRAL

A0A5A7VJA0 Ty3/gypsy retrotransposon protein2.6e-7841.72Show/hide
Query:  MTVAVISFDGVALAWYWYMENRNSFVDWDDLKNRL---FKSTS-------------------------------TQIADEVLEGAFLNELDPIIRAKVLA
        M V+ I FDG AL WY   E R  FV W +LK RL   F+ST                                + + D V+E  F++ L P IRA+V+ 
Subjt:  MTVAVISFDGVALAWYWYMENRNSFVDWDDLKNRL---FKSTS-------------------------------TQIADEVLEGAFLNELDPIIRAKVLA

Query:  MEPKGLDQIMRKAQLIEDIGLADQEAGELNPNQVTKKPDTTIAKTM---------NKAVDPVPTRTVTIASKGTDAFPATRIAPTQRQLTKVEYQKQKEK
          PKGL ++MR AQL+ED  +  + A  LN     K    T   T          NKA  P P RT+T+ S  +     TR   T ++L   E+Q ++EK
Subjt:  MEPKGLDQIMRKAQLIEDIGLADQEAGELNPNQVTKKPDTTIAKTM---------NKAVDPVPTRTVTIASKGTDAFPATRIAPTQRQLTKVEYQKQKEK

Query:  GLCFRCDEKYSIGYKCK---NRELRVYVV-HNDEEVDDSEESE--TVEVVEDISNDKGKAVEFVAFNSIVGLTTPGTMKLKGSIQGKEVIILIDCGATHN
        GLCF+C+EKYS  +KCK    RELR++VV  N+EE++  EE+E  T E+       +  A   ++ NS+VGL  PGTMK++G++QGKEV+ILIDCGATHN
Subjt:  GLCFRCDEKYSIGYKCK---NRELRVYVV-HNDEEVDDSEESE--TVEVVEDISNDKGKAVEFVAFNSIVGLTTPGTMKLKGSIQGKEVIILIDCGATHN

Query:  FVSMRVVEELSIPRTDTSNYGVIMGTGIAVKGNGICRDVVLDLPNLTVREDFLPLELGSLDVVLGMQWLRRIGKMKVDWPALSMSFKQDGKKIQLQGDPT
        FVS ++V  L +P  +T++YGVI+G+G A++G GIC  + + + + TV+EDFLPLELG +DV+LGMQWL  +G    DW  L+++F  D KKI ++GDP+
Subjt:  FVSMRVVEELSIPRTDTSNYGVIMGTGIAVKGNGICRDVVLDLPNLTVREDFLPLELGSLDVVLGMQWLRRIGKMKVDWPALSMSFKQDGKKIQLQGDPT

Query:  LTQIEVFFKRLSRSWDHRDQGFLVELRAL
        LT+  V  K L ++W+  D G+L+E R++
Subjt:  LTQIEVFFKRLSRSWDHRDQGFLVELRAL

A0A5D3BEL2 Ty3/gypsy retrotransposon protein2.6e-7841.72Show/hide
Query:  MTVAVISFDGVALAWYWYMENRNSFVDWDDLKNRL---FKSTS-------------------------------TQIADEVLEGAFLNELDPIIRAKVLA
        M V+ I FDG AL WY   E R  FV W +LK RL   F+ST                                + + D V+E  F++ L P IRA+V+ 
Subjt:  MTVAVISFDGVALAWYWYMENRNSFVDWDDLKNRL---FKSTS-------------------------------TQIADEVLEGAFLNELDPIIRAKVLA

Query:  MEPKGLDQIMRKAQLIEDIGLADQEAGELNPNQVTKKPDTTIAKTM---------NKAVDPVPTRTVTIASKGTDAFPATRIAPTQRQLTKVEYQKQKEK
          PKGL ++MR AQL+ED  +  + A  LN     K    T   T          NKA  P P RT+T+ S  +     TR   T ++L   E+Q ++EK
Subjt:  MEPKGLDQIMRKAQLIEDIGLADQEAGELNPNQVTKKPDTTIAKTM---------NKAVDPVPTRTVTIASKGTDAFPATRIAPTQRQLTKVEYQKQKEK

Query:  GLCFRCDEKYSIGYKCK---NRELRVYVV-HNDEEVDDSEESE--TVEVVEDISNDKGKAVEFVAFNSIVGLTTPGTMKLKGSIQGKEVIILIDCGATHN
        GLCF+C+EKYS  +KCK    RELR++VV  N+EE++  EE+E  T E+       +  A   ++ NS+VGL  PGTMK++G++QGKEV+ILIDCGATHN
Subjt:  GLCFRCDEKYSIGYKCK---NRELRVYVV-HNDEEVDDSEESE--TVEVVEDISNDKGKAVEFVAFNSIVGLTTPGTMKLKGSIQGKEVIILIDCGATHN

Query:  FVSMRVVEELSIPRTDTSNYGVIMGTGIAVKGNGICRDVVLDLPNLTVREDFLPLELGSLDVVLGMQWLRRIGKMKVDWPALSMSFKQDGKKIQLQGDPT
        FVS ++V  L +P  +T++YGVI+G+G A++G GIC  + + + + TV+EDFLPLELG +DV+LGMQWL  +G    DW  L+++F  D KKI ++GDP+
Subjt:  FVSMRVVEELSIPRTDTSNYGVIMGTGIAVKGNGICRDVVLDLPNLTVREDFLPLELGSLDVVLGMQWLRRIGKMKVDWPALSMSFKQDGKKIQLQGDPT

Query:  LTQIEVFFKRLSRSWDHRDQGFLVELRAL
        LT+  V  K L ++W+  D G+L+E R++
Subjt:  LTQIEVFFKRLSRSWDHRDQGFLVELRAL

A0A5D3DC20 Transposon Tf2-1 polyprotein isoform X12.6e-7841.72Show/hide
Query:  MTVAVISFDGVALAWYWYMENRNSFVDWDDLKNRL---FKSTS-------------------------------TQIADEVLEGAFLNELDPIIRAKVLA
        M V+ I FDG AL WY   E R  FV W +LK RL   F+ST                                + + D V+E  F++ L P IRA+V+ 
Subjt:  MTVAVISFDGVALAWYWYMENRNSFVDWDDLKNRL---FKSTS-------------------------------TQIADEVLEGAFLNELDPIIRAKVLA

Query:  MEPKGLDQIMRKAQLIEDIGLADQEAGELNPNQVTKKPDTTIAKTM---------NKAVDPVPTRTVTIASKGTDAFPATRIAPTQRQLTKVEYQKQKEK
          PKGL ++MR AQL+ED  +  + A  LN     K    T   T          NKA  P P RT+T+ S  +     TR   T ++L   E+Q ++EK
Subjt:  MEPKGLDQIMRKAQLIEDIGLADQEAGELNPNQVTKKPDTTIAKTM---------NKAVDPVPTRTVTIASKGTDAFPATRIAPTQRQLTKVEYQKQKEK

Query:  GLCFRCDEKYSIGYKCK---NRELRVYVV-HNDEEVDDSEESE--TVEVVEDISNDKGKAVEFVAFNSIVGLTTPGTMKLKGSIQGKEVIILIDCGATHN
        GLCF+C+EKYS  +KCK    RELR++VV  N+EE++  EE+E  T E+       +  A   ++ NS+VGL  PGTMK++G++QGKEV+ILIDCGATHN
Subjt:  GLCFRCDEKYSIGYKCK---NRELRVYVV-HNDEEVDDSEESE--TVEVVEDISNDKGKAVEFVAFNSIVGLTTPGTMKLKGSIQGKEVIILIDCGATHN

Query:  FVSMRVVEELSIPRTDTSNYGVIMGTGIAVKGNGICRDVVLDLPNLTVREDFLPLELGSLDVVLGMQWLRRIGKMKVDWPALSMSFKQDGKKIQLQGDPT
        FVS ++V  L +P  +T++YGVI+G+G A++G GIC  + + + + TV+EDFLPLELG +DV+LGMQWL  +G    DW  L+++F  D KKI ++GDP+
Subjt:  FVSMRVVEELSIPRTDTSNYGVIMGTGIAVKGNGICRDVVLDLPNLTVREDFLPLELGSLDVVLGMQWLRRIGKMKVDWPALSMSFKQDGKKIQLQGDPT

Query:  LTQIEVFFKRLSRSWDHRDQGFLVELRAL
        LT+  V  K L ++W+  D G+L+E R++
Subjt:  LTQIEVFFKRLSRSWDHRDQGFLVELRAL

A0A6J1DN22 Reverse transcriptase1.5e-8945.91Show/hide
Query:  MTVAVISFDGVALAWYWYMENRNSFVDWDDLKNRLFKSTSTQIADEVLEGAFLNELDPIIRAKVLAMEPKGLDQIMRK-----AQLIEDIGLADQEAGEL
        MTV +ISF+G A+ WY + ENR  F DW++LK R+F+              F +  D  + ++ L+++ +G     R+     +  + DI     E    
Subjt:  MTVAVISFDGVALAWYWYMENRNSFVDWDDLKNRLFKSTSTQIADEVLEGAFLNELDPIIRAKVLAMEPKGLDQIMRK-----AQLIEDIGLADQEAGEL

Query:  NPNQVTKKPDT--TIAKTMNKAVDPVPTRTVTIASKGTDAFPATRIAPTQRQLTKVEYQKQKEKGLCFRCDEKYSIGYKCKNRELRVYVVHNDE--EVDD
             ++ P T     KT  K  + V TRTVT++ K        R   TQ++LT+ EYQ++K+KGLCFR +EKYSIG++CKN+EL+V+VVH+DE  E+D 
Subjt:  NPNQVTKKPDT--TIAKTMNKAVDPVPTRTVTIASKGTDAFPATRIAPTQRQLTKVEYQKQKEKGLCFRCDEKYSIGYKCKNRELRVYVVHNDE--EVDD

Query:  SE-----ESETVEVVEDISNDKGKAVEFVAFNSIVGLTTPGTMKLKGSIQGKEVIILIDCGATHNFVSMRVVEELSIPRTDTSNYGVIMGTGIAVKGNGI
         E     E     +VE+++         +A N++VG +TPGTMKL+G I+ KEV+ILIDCGATHNF+S ++V+  ++P  +TSNYGVIMGTG+ V+G GI
Subjt:  SE-----ESETVEVVEDISNDKGKAVEFVAFNSIVGLTTPGTMKLKGSIQGKEVIILIDCGATHNFVSMRVVEELSIPRTDTSNYGVIMGTGIAVKGNGI

Query:  CRDVVLDLPNLTVREDFLPLELGSLDVVLGMQWLRRIGKMKVDWPALSMSFKQDGKKIQLQGDPTLTQIEVFFKRLSRSWDHRDQGFLVELRALLTATEE
        C+ ++L LP LT+RE+FLPLELG+LDVVLGMQWL   G M+VDW AL+MSF+    +I L+GDPTL ++EV  K+L+R+W+  DQGFLVELRA  +A  E
Subjt:  CRDVVLDLPNLTVREDFLPLELGSLDVVLGMQWLRRIGKMKVDWPALSMSFKQDGKKIQLQGDPTLTQIEVFFKRLSRSWDHRDQGFLVELRALLTATEE

Query:  GME
        G E
Subjt:  GME

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G29750.1 Eukaryotic aspartyl protease family protein9.9e-1430.23Show/hide
Query:  IVGLTTPGTMKLKGSIQGKEVIILIDCGATHNFVSMRVVEELSIPRTDTSNYGVIMGTGIAVKGNGICRDVVLDLPNLTVREDFLPLELG--SLDVVLGM
        ++ LT    M+  G I   +V++ ID GAT NF+ + +   L +P + T+   V++G    ++  G C  + L +  + + E+FL L+L    +DV+LG 
Subjt:  IVGLTTPGTMKLKGSIQGKEVIILIDCGATHNFVSMRVVEELSIPRTDTSNYGVIMGTGIAVKGNGICRDVVLDLPNLTVREDFLPLELG--SLDVVLGM

Query:  QWLRRIGKMKVDWPALSMSFKQDGKKIQL
        +WL ++G+  V+W     SF  + + I L
Subjt:  QWLRRIGKMKVDWPALSMSFKQDGKKIQL

AT3G30770.1 Eukaryotic aspartyl protease family protein2.5e-0925.54Show/hide
Query:  KAVEFVAFNSIVGLTTPGTMKLKGSIQGKEVIILIDCGATHNFVSMRVVEELSIPRTDTSNYGVIMGTGIAVKGNGICRDVVLDLPNLTVREDFLPLEL-
        K +  V   S    T    M+  G I   +V+++ID GAT+NF+S  +   L +P + T+   V++G    ++  G C  + L +  + + E+FL L+L 
Subjt:  KAVEFVAFNSIVGLTTPGTMKLKGSIQGKEVIILIDCGATHNFVSMRVVEELSIPRTDTSNYGVIMGTGIAVKGNGICRDVVLDLPNLTVREDFLPLEL-

Query:  -GSLDVVLGMQWLRRIGKMKVDWPALSMSFKQDGKKIQL-QGDPTLTQIEVFFKRLSRSWDHRDQGFLVELRALLTATEEGMEG
           +DV+LG    + + +  + W     SF  + + + L   D  L Q+    K  S     +   +L E + +L    + M G
Subjt:  -GSLDVVLGMQWLRRIGKMKVDWPALSMSFKQDGKKIQL-QGDPTLTQIEVFFKRLSRSWDHRDQGFLVELRALLTATEEGMEG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTGTAGCAGTTATTAGCTTCGATGGTGTCGCGTTGGCGTGGTACTGGTATATGGAGAATCGAAATAGCTTTGTGGATTGGGACGATTTGAAGAATCGTCTGTTTAA
GAGCACCTCTACTCAGATCGCTGATGAGGTTTTGGAGGGAGCGTTCTTGAACGAATTGGATCCGATCATTCGGGCCAAGGTTTTGGCTATGGAACCTAAAGGTTTGGATC
AAATCATGCGAAAGGCCCAATTGATTGAGGATATCGGTCTGGCAGACCAAGAGGCCGGAGAATTGAACCCGAACCAGGTGACAAAAAAACCGGATACCACCATCGCGAAG
ACTATGAATAAAGCAGTGGATCCAGTACCCACGCGTACTGTTACGATCGCAAGTAAGGGGACCGACGCATTTCCGGCTACTCGAATAGCGCCAACACAACGACAATTGAC
GAAGGTGGAATATCAGAAACAGAAGGAGAAAGGATTATGTTTCCGGTGTGACGAGAAATACTCCATCGGCTATAAATGCAAGAATCGAGAGTTGCGAGTCTACGTTGTAC
ACAACGATGAGGAAGTAGACGATTCAGAGGAGAGCGAAACCGTGGAAGTCGTTGAGGACATCAGCAACGACAAAGGGAAGGCGGTGGAATTTGTGGCGTTTAACTCGATA
GTCGGATTGACAACGCCTGGGACCATGAAATTGAAGGGTTCGATCCAAGGGAAAGAGGTAATCATTCTAATTGACTGTGGAGCAACGCATAATTTCGTTTCTATGCGCGT
GGTTGAAGAACTGAGTATTCCCCGTACGGATACATCCAATTATGGCGTGATCATGGGAACGGGTATCGCGGTGAAGGGGAACGGAATCTGTAGGGATGTAGTCCTCGATT
TGCCTAATCTGACTGTGCGTGAGGATTTCTTGCCTCTGGAACTAGGGAGTTTAGATGTAGTCCTAGGTATGCAGTGGTTGAGGCGGATAGGGAAGATGAAGGTGGATTGG
CCTGCACTCAGCATGAGTTTCAAACAGGATGGAAAGAAAATACAGCTGCAGGGGGATCCGACATTGACGCAAATAGAAGTGTTCTTCAAACGATTATCACGATCGTGGGA
TCATCGTGACCAAGGATTTTTAGTGGAGCTTCGTGCCTTATTGACTGCCACTGAGGAGGGTATGGAAGGGAATGGAGTTAATCCGGAACCTTTGCCTGGCCGGGAACTAG
AATGGATAGTGCAACCTGCGAAGGTCGTTGCAACGCAAGTGAATTTGGATACGGGCAAGGAAGAAGCTTTGGTTAGCTGGGTGAATTTACCTGAAGAAGAAGCGACTTGG
GAAGTGATAGATGACCTGAAGCTGCAGTTTCCAGACTTTTATGCGACTTTTTTTCCTAATACTCACCTTGGGGACAAGGTGAATGTTGGGAGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGACTGTAGCAGTTATTAGCTTCGATGGTGTCGCGTTGGCGTGGTACTGGTATATGGAGAATCGAAATAGCTTTGTGGATTGGGACGATTTGAAGAATCGTCTGTTTAA
GAGCACCTCTACTCAGATCGCTGATGAGGTTTTGGAGGGAGCGTTCTTGAACGAATTGGATCCGATCATTCGGGCCAAGGTTTTGGCTATGGAACCTAAAGGTTTGGATC
AAATCATGCGAAAGGCCCAATTGATTGAGGATATCGGTCTGGCAGACCAAGAGGCCGGAGAATTGAACCCGAACCAGGTGACAAAAAAACCGGATACCACCATCGCGAAG
ACTATGAATAAAGCAGTGGATCCAGTACCCACGCGTACTGTTACGATCGCAAGTAAGGGGACCGACGCATTTCCGGCTACTCGAATAGCGCCAACACAACGACAATTGAC
GAAGGTGGAATATCAGAAACAGAAGGAGAAAGGATTATGTTTCCGGTGTGACGAGAAATACTCCATCGGCTATAAATGCAAGAATCGAGAGTTGCGAGTCTACGTTGTAC
ACAACGATGAGGAAGTAGACGATTCAGAGGAGAGCGAAACCGTGGAAGTCGTTGAGGACATCAGCAACGACAAAGGGAAGGCGGTGGAATTTGTGGCGTTTAACTCGATA
GTCGGATTGACAACGCCTGGGACCATGAAATTGAAGGGTTCGATCCAAGGGAAAGAGGTAATCATTCTAATTGACTGTGGAGCAACGCATAATTTCGTTTCTATGCGCGT
GGTTGAAGAACTGAGTATTCCCCGTACGGATACATCCAATTATGGCGTGATCATGGGAACGGGTATCGCGGTGAAGGGGAACGGAATCTGTAGGGATGTAGTCCTCGATT
TGCCTAATCTGACTGTGCGTGAGGATTTCTTGCCTCTGGAACTAGGGAGTTTAGATGTAGTCCTAGGTATGCAGTGGTTGAGGCGGATAGGGAAGATGAAGGTGGATTGG
CCTGCACTCAGCATGAGTTTCAAACAGGATGGAAAGAAAATACAGCTGCAGGGGGATCCGACATTGACGCAAATAGAAGTGTTCTTCAAACGATTATCACGATCGTGGGA
TCATCGTGACCAAGGATTTTTAGTGGAGCTTCGTGCCTTATTGACTGCCACTGAGGAGGGTATGGAAGGGAATGGAGTTAATCCGGAACCTTTGCCTGGCCGGGAACTAG
AATGGATAGTGCAACCTGCGAAGGTCGTTGCAACGCAAGTGAATTTGGATACGGGCAAGGAAGAAGCTTTGGTTAGCTGGGTGAATTTACCTGAAGAAGAAGCGACTTGG
GAAGTGATAGATGACCTGAAGCTGCAGTTTCCAGACTTTTATGCGACTTTTTTTCCTAATACTCACCTTGGGGACAAGGTGAATGTTGGGAGTTAG
Protein sequenceShow/hide protein sequence
MTVAVISFDGVALAWYWYMENRNSFVDWDDLKNRLFKSTSTQIADEVLEGAFLNELDPIIRAKVLAMEPKGLDQIMRKAQLIEDIGLADQEAGELNPNQVTKKPDTTIAK
TMNKAVDPVPTRTVTIASKGTDAFPATRIAPTQRQLTKVEYQKQKEKGLCFRCDEKYSIGYKCKNRELRVYVVHNDEEVDDSEESETVEVVEDISNDKGKAVEFVAFNSI
VGLTTPGTMKLKGSIQGKEVIILIDCGATHNFVSMRVVEELSIPRTDTSNYGVIMGTGIAVKGNGICRDVVLDLPNLTVREDFLPLELGSLDVVLGMQWLRRIGKMKVDW
PALSMSFKQDGKKIQLQGDPTLTQIEVFFKRLSRSWDHRDQGFLVELRALLTATEEGMEGNGVNPEPLPGRELEWIVQPAKVVATQVNLDTGKEEALVSWVNLPEEEATW
EVIDDLKLQFPDFYATFFPNTHLGDKVNVGS