; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g07620 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g07620
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionTy3/gypsy retrotransposon protein
Genome locationchr4:5431114..5442942
RNA-Seq ExpressionMoc04g07620
SyntenyMoc04g07620
Gene Ontology termsNA
InterPro domainsIPR016197 - Chromo-like domain superfamily
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032325.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]2.9e-7136.29Show/hide
Query:  KEGSLCPRFLAIHQTRTVAEFRRTFEELSAPVQGLSDEVLEGTFVNGLHPLIRAEILSKETIGLQNVMCTAQLIEDRIVARAYNEGTKIPSPNPTQSTLP
        + G+ C RFLAI Q  +V E+ + FEELSA +  + ++VL GTF NGL P+IR ++ +   +GL+++M  A+L E+++     ++G       PTQ   P
Subjt:  KEGSLCPRFLAIHQTRTVAEFRRTFEELSAPVQGLSDEVLEGTFVNGLHPLIRAEILSKETIGLQNVMCTAQLIEDRIVARAYNEGTKIPSPNPTQSTLP

Query:  SRNKAAQSPLTRTVTLASR---------NPMFSTASNLNRKDPAFKRLSEMVYNACKAKGLCFRCNERYTIGHRCKNHKLRVLLAHDDDPRNMEIDSEIS
           K ++ P T+ +TLA R         NP    A N  R++  F+R ++    A + KGLC+RC+E ++ GHRCKN +L + +  DD     +I+ E S
Subjt:  SRNKAAQSPLTRTVTLASR---------NPMFSTASNLNRKDPAFKRLSEMVYNACKAKGLCFRCNERYTIGHRCKNHKLRVLLAHDDDPRNMEIDSEIS

Query:  VEQASVKKVREVVELSLNMVVGFSTPGTMKLHGEINQSGVVVLIDCGATHNFISQKLVEALQIPTAETSNYGVVMGIGVAVKGKGICKGVILKLPELTIV
          +  + +V  VVELSLN VVG +  GT K+ G ++    V+++DCGATHNFISQKLVE+L +P AET+NYGV+MG G AV+G+ +CKG+ + LP LTIV
Subjt:  VEQASVKKVREVVELSLNMVVGFSTPGTMKLHGEINQSGVVVLIDCGATHNFISQKLVEALQIPTAETSNYGVVMGIGVAVKGKGICKGVILKLPELTIV

Query:  ENFLPLELGGINVVL----------------------------------------EVTLKRLARSLDAHDQGFLMELCALTGYQA--EAPNAAPPMELPR
        E+FLPLELG +++VL                                        E++LK L ++    DQGFL+   A+   +A  E        +   
Subjt:  ENFLPLELGGINVVL----------------------------------------EVTLKRLARSLDAHDQGFLMELCALTGYQA--EAPNAAPPMELPR

Query:  SIDRLLTECQVVFHLPEGLPPTREVDHRILLTEG------HPPNHPPTFIESYEWVAEPLDIMGICRSPDTNEWE---LLLQWKD
          ++L  E   +F +P+ LPP R++DHRI L EG       P  +P       E +   +   GI R P TN +    +L++ KD
Subjt:  SIDRLLTECQVVFHLPEGLPPTREVDHRILLTEG------HPPNHPPTFIESYEWVAEPLDIMGICRSPDTNEWE---LLLQWKD

TYJ99421.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]2.9e-7136.29Show/hide
Query:  KEGSLCPRFLAIHQTRTVAEFRRTFEELSAPVQGLSDEVLEGTFVNGLHPLIRAEILSKETIGLQNVMCTAQLIEDRIVARAYNEGTKIPSPNPTQSTLP
        + G+ C RFLAI Q  +V E+ + FEELSA +  + ++VL GTF NGL P+IR ++ +   +GL+++M  A+L E+++     ++G       PTQ   P
Subjt:  KEGSLCPRFLAIHQTRTVAEFRRTFEELSAPVQGLSDEVLEGTFVNGLHPLIRAEILSKETIGLQNVMCTAQLIEDRIVARAYNEGTKIPSPNPTQSTLP

Query:  SRNKAAQSPLTRTVTLASR---------NPMFSTASNLNRKDPAFKRLSEMVYNACKAKGLCFRCNERYTIGHRCKNHKLRVLLAHDDDPRNMEIDSEIS
           K ++ P T+ +TLA R         NP    A N  R++  F+R ++    A + KGLC+RC+E ++ GHRCKN +L + +  DD     +I+ E S
Subjt:  SRNKAAQSPLTRTVTLASR---------NPMFSTASNLNRKDPAFKRLSEMVYNACKAKGLCFRCNERYTIGHRCKNHKLRVLLAHDDDPRNMEIDSEIS

Query:  VEQASVKKVREVVELSLNMVVGFSTPGTMKLHGEINQSGVVVLIDCGATHNFISQKLVEALQIPTAETSNYGVVMGIGVAVKGKGICKGVILKLPELTIV
          +  + +V  VVELSLN VVG +  GT K+ G ++    V+++DCGATHNFISQKLVE+L +P AET+NYGV+MG G AV+G+ +CKG+ + LP LTIV
Subjt:  VEQASVKKVREVVELSLNMVVGFSTPGTMKLHGEINQSGVVVLIDCGATHNFISQKLVEALQIPTAETSNYGVVMGIGVAVKGKGICKGVILKLPELTIV

Query:  ENFLPLELGGINVVL----------------------------------------EVTLKRLARSLDAHDQGFLMELCALTGYQA--EAPNAAPPMELPR
        E+FLPLELG +++VL                                        E++LK L ++    DQGFL+   A+   +A  E        +   
Subjt:  ENFLPLELGGINVVL----------------------------------------EVTLKRLARSLDAHDQGFLMELCALTGYQA--EAPNAAPPMELPR

Query:  SIDRLLTECQVVFHLPEGLPPTREVDHRILLTEG------HPPNHPPTFIESYEWVAEPLDIMGICRSPDTNEWE---LLLQWKD
          ++L  E   +F +P+ LPP R++DHRI L EG       P  +P       E +   +   GI R P TN +    +L++ KD
Subjt:  SIDRLLTECQVVFHLPEGLPPTREVDHRILLTEG------HPPNHPPTFIESYEWVAEPLDIMGICRSPDTNEWE---LLLQWKD

TYK17386.1 gypsy/ty3 element polyprotein [Cucumis melo var. makuwa]4.5e-7239.08Show/hide
Query:  GSLCPRFLAIHQTRTVAEFRRTFEELSAPVQGLSDEVLEGTFVNGLHPLIRAEILSKETIGLQNVMCTAQLIEDRIVARAYNEGTKIPSPNPTQSTLPSR
        G+ C RFLAI Q  +V E+ + FEELSAP+  ++++VL G F NGL P+IR E+ +   +GL+N+M  A+L E+++     + G         Q   P  
Subjt:  GSLCPRFLAIHQTRTVAEFRRTFEELSAPVQGLSDEVLEGTFVNGLHPLIRAEILSKETIGLQNVMCTAQLIEDRIVARAYNEGTKIPSPNPTQSTLPSR

Query:  NKAAQSPLTRTVTLASRNPMFSTASNLN---------RKDPAFKRLSEMVYNACKAKGLCFRCNERYTIGHRCKNHKLRVLLAHDDDPRNMEIDSEISVE
         K  ++P T+ VTLA R P     +N +         R+D  F+R ++    A + KGLC+RC E ++ GHRCKN +LR+ +  DD      +DS    E
Subjt:  NKAAQSPLTRTVTLASRNPMFSTASNLN---------RKDPAFKRLSEMVYNACKAKGLCFRCNERYTIGHRCKNHKLRVLLAHDDDPRNMEIDSEISVE

Query:  QASVKKVREVVELSLNMVVGFSTPGTMKLHGEINQSGVVVLIDCGATHNFISQKLVEALQIPTAETSNYGVVMGIGVAVKGKGICKGVILKLPELTIVEN
             +V  VVELSLN VVG + PGT KL G +    +V+++DCGATHNFIS KLVE L++P AET+NYGV+MG G AV+G+GICKG+ + LP ++IVE+
Subjt:  QASVKKVREVVELSLNMVVGFSTPGTMKLHGEINQSGVVVLIDCGATHNFISQKLVEALQIPTAETSNYGVVMGIGVAVKGKGICKGVILKLPELTIVEN

Query:  FLPLELGGINVVL----------------------------------------EVTLKRLARSLDAHDQGFLMELCALTGYQAEAPNAAPPM--ELPRSI
        FLPLELG I++VL                                        E++LK L ++    DQGFL+   A+   +A+          E     
Subjt:  FLPLELGGINVVL----------------------------------------EVTLKRLARSLDAHDQGFLMELCALTGYQAEAPNAAPPM--ELPRSI

Query:  DRLLTECQVVFHLPEGLPPTREVDHRILLTEGHPP
         +L  E   VF +P+GLPP R +DHRI L EG  P
Subjt:  DRLLTECQVVFHLPEGLPPTREVDHRILLTEGHPP

XP_022154744.1 uncharacterized protein LOC111021922 [Momordica charantia]5.9e-7241.63Show/hide
Query:  KEGSLCPRFLAIHQTRTVAEFRRTFEELSAPVQGLSDEVLEGTFVNGLHPLIRAEILSKETIGLQNVMCTAQLIEDRIVARAYNEGTKIPSPNPTQSTLP
        K+ +L  RFL+I +  TV E+R+TFE  SA +  ++++V+E                                           +G+    P    +T  
Subjt:  KEGSLCPRFLAIHQTRTVAEFRRTFEELSAPVQGLSDEVLEGTFVNGLHPLIRAEILSKETIGLQNVMCTAQLIEDRIVARAYNEGTKIPSPNPTQSTLP

Query:  SRNKAAQSPLTRTVTLASRNPMFSTASNLNRKDPAFKRLSEMVYNACKAKGLCFRCNERYTIGHRCKNHKLRVLLAHDDDPRNMEIDSE---ISVEQASV
        +  K +++  TRTVTL+ ++  + TA  L RK+   K+L+E  Y   K KGLCFR  E+Y+IGHRCKN +L+V + HDD+   ME+D E   +S E    
Subjt:  SRNKAAQSPLTRTVTLASRNPMFSTASNLNRKDPAFKRLSEMVYNACKAKGLCFRCNERYTIGHRCKNHKLRVLLAHDDDPRNMEIDSE---ISVEQASV

Query:  KKVREVVELSLNMVVGFSTPGTMKLHGEINQSGVVVLIDCGATHNFISQKLVEALQIPTAETSNYGVVMGIGVAVKGKGICKGVILKLPELTIVENFLPL
          V EV EL+LN VVGFSTPGTMKL G I    VV+LIDCGATHNFISQKLV+A  +P  ETSNYGV+MG GV V+GKGICKG+IL LPELTI ENFLPL
Subjt:  KKVREVVELSLNMVVGFSTPGTMKLHGEINQSGVVVLIDCGATHNFISQKLVEALQIPTAETSNYGVVMGIGVAVKGKGICKGVILKLPELTIVENFLPL

Query:  ELGGINVVL----------------------------------------EVTLKRLARSLDAHDQGFLMELCALTGYQ--AEAPNAAPPMELPRSIDRLL
        ELG ++VVL                                        EVTLK+LAR+ + HDQGFL+EL A +      EA  A  P    + I+ L+
Subjt:  ELGGINVVL----------------------------------------EVTLKRLARSLDAHDQGFLMELCALTGYQ--AEAPNAAPPMELPRSIDRLL

Query:  TECQVVFHLPEGLPPTREVDHRILLTEGHP
             VF  P  LPP R +DH+I L +G P
Subjt:  TECQVVFHLPEGLPPTREVDHRILLTEGHP

XP_022154744.1 uncharacterized protein LOC111021922 [Momordica charantia]7.3e-0639.13Show/hide
Query:  PTFIESYEWVAEPLDIMGICRSPDTNEWELLLQWKDLPEHDVTWETVTTIRRQFPHLHLEDKGVFLPRG
        P  ++ +E + EP ++  +  + DT E E L+ WK   E D TWE + T + QFP+ HLEDK    P G
Subjt:  PTFIESYEWVAEPLDIMGICRSPDTNEWELLLQWKDLPEHDVTWETVTTIRRQFPHLHLEDKGVFLPRG

XP_022154744.1 uncharacterized protein LOC111021922 [Momordica charantia]2.2e-7138.85Show/hide
Query:  GSLCPRFLAIHQTRTVAEFRRTFEELSAPVQGLSDEVLEGTFVNGLHPLIRAEILSKETIGLQNVMCTAQLIEDRIVARAYNEGTKIPSPNPTQSTLPSR
        G+ C RFLAI Q  +V E+ + FEELSAP+  ++++VL G F NGL P+IR E+ +   +GL+++M  A+L E+++     + G         Q   P  
Subjt:  GSLCPRFLAIHQTRTVAEFRRTFEELSAPVQGLSDEVLEGTFVNGLHPLIRAEILSKETIGLQNVMCTAQLIEDRIVARAYNEGTKIPSPNPTQSTLPSR

Query:  NKAAQSPLTRTVTLASRNPMFSTASNLN---------RKDPAFKRLSEMVYNACKAKGLCFRCNERYTIGHRCKNHKLRVLLAHDDDPRNMEIDSEISVE
         K  ++P T+ VTLA R P     +N +         R+D  F+R ++    A + KGLC+RC E ++ GHRCKN +LR+ +  DD      +DS    E
Subjt:  NKAAQSPLTRTVTLASRNPMFSTASNLN---------RKDPAFKRLSEMVYNACKAKGLCFRCNERYTIGHRCKNHKLRVLLAHDDDPRNMEIDSEISVE

Query:  QASVKKVREVVELSLNMVVGFSTPGTMKLHGEINQSGVVVLIDCGATHNFISQKLVEALQIPTAETSNYGVVMGIGVAVKGKGICKGVILKLPELTIVEN
             +V  VVELSLN VVG + PGT KL G +    +V+++DCGATHNFIS KLVE L++P AET+NYGV+MG G AV+G+GICKG+ + LP ++IVE+
Subjt:  QASVKKVREVVELSLNMVVGFSTPGTMKLHGEINQSGVVVLIDCGATHNFISQKLVEALQIPTAETSNYGVVMGIGVAVKGKGICKGVILKLPELTIVEN

Query:  FLPLELGGINVVL----------------------------------------EVTLKRLARSLDAHDQGFLMELCALTGYQAEAPNAA--PPMELPRSI
        FLPLELG I++VL                                        E++LK L ++    DQGFL+   A+   +A+          E     
Subjt:  FLPLELGGINVVL----------------------------------------EVTLKRLARSLDAHDQGFLMELCALTGYQAEAPNAA--PPMELPRSI

Query:  DRLLTECQVVFHLPEGLPPTREVDHRILLTEGHPP
         +L  E   VF +P+GLPP R +DHRI L EG  P
Subjt:  DRLLTECQVVFHLPEGLPPTREVDHRILLTEGHPP

TrEMBL top hitse value%identityAlignment
A0A5A7SSY7 Ty3/gypsy retrotransposon protein1.4e-7136.29Show/hide
Query:  KEGSLCPRFLAIHQTRTVAEFRRTFEELSAPVQGLSDEVLEGTFVNGLHPLIRAEILSKETIGLQNVMCTAQLIEDRIVARAYNEGTKIPSPNPTQSTLP
        + G+ C RFLAI Q  +V E+ + FEELSA +  + ++VL GTF NGL P+IR ++ +   +GL+++M  A+L E+++     ++G       PTQ   P
Subjt:  KEGSLCPRFLAIHQTRTVAEFRRTFEELSAPVQGLSDEVLEGTFVNGLHPLIRAEILSKETIGLQNVMCTAQLIEDRIVARAYNEGTKIPSPNPTQSTLP

Query:  SRNKAAQSPLTRTVTLASR---------NPMFSTASNLNRKDPAFKRLSEMVYNACKAKGLCFRCNERYTIGHRCKNHKLRVLLAHDDDPRNMEIDSEIS
           K ++ P T+ +TLA R         NP    A N  R++  F+R ++    A + KGLC+RC+E ++ GHRCKN +L + +  DD     +I+ E S
Subjt:  SRNKAAQSPLTRTVTLASR---------NPMFSTASNLNRKDPAFKRLSEMVYNACKAKGLCFRCNERYTIGHRCKNHKLRVLLAHDDDPRNMEIDSEIS

Query:  VEQASVKKVREVVELSLNMVVGFSTPGTMKLHGEINQSGVVVLIDCGATHNFISQKLVEALQIPTAETSNYGVVMGIGVAVKGKGICKGVILKLPELTIV
          +  + +V  VVELSLN VVG +  GT K+ G ++    V+++DCGATHNFISQKLVE+L +P AET+NYGV+MG G AV+G+ +CKG+ + LP LTIV
Subjt:  VEQASVKKVREVVELSLNMVVGFSTPGTMKLHGEINQSGVVVLIDCGATHNFISQKLVEALQIPTAETSNYGVVMGIGVAVKGKGICKGVILKLPELTIV

Query:  ENFLPLELGGINVVL----------------------------------------EVTLKRLARSLDAHDQGFLMELCALTGYQA--EAPNAAPPMELPR
        E+FLPLELG +++VL                                        E++LK L ++    DQGFL+   A+   +A  E        +   
Subjt:  ENFLPLELGGINVVL----------------------------------------EVTLKRLARSLDAHDQGFLMELCALTGYQA--EAPNAAPPMELPR

Query:  SIDRLLTECQVVFHLPEGLPPTREVDHRILLTEG------HPPNHPPTFIESYEWVAEPLDIMGICRSPDTNEWE---LLLQWKD
          ++L  E   +F +P+ LPP R++DHRI L EG       P  +P       E +   +   GI R P TN +    +L++ KD
Subjt:  SIDRLLTECQVVFHLPEGLPPTREVDHRILLTEG------HPPNHPPTFIESYEWVAEPLDIMGICRSPDTNEWE---LLLQWKD

A0A5D3BL18 Ty3/gypsy retrotransposon protein1.4e-7136.29Show/hide
Query:  KEGSLCPRFLAIHQTRTVAEFRRTFEELSAPVQGLSDEVLEGTFVNGLHPLIRAEILSKETIGLQNVMCTAQLIEDRIVARAYNEGTKIPSPNPTQSTLP
        + G+ C RFLAI Q  +V E+ + FEELSA +  + ++VL GTF NGL P+IR ++ +   +GL+++M  A+L E+++     ++G       PTQ   P
Subjt:  KEGSLCPRFLAIHQTRTVAEFRRTFEELSAPVQGLSDEVLEGTFVNGLHPLIRAEILSKETIGLQNVMCTAQLIEDRIVARAYNEGTKIPSPNPTQSTLP

Query:  SRNKAAQSPLTRTVTLASR---------NPMFSTASNLNRKDPAFKRLSEMVYNACKAKGLCFRCNERYTIGHRCKNHKLRVLLAHDDDPRNMEIDSEIS
           K ++ P T+ +TLA R         NP    A N  R++  F+R ++    A + KGLC+RC+E ++ GHRCKN +L + +  DD     +I+ E S
Subjt:  SRNKAAQSPLTRTVTLASR---------NPMFSTASNLNRKDPAFKRLSEMVYNACKAKGLCFRCNERYTIGHRCKNHKLRVLLAHDDDPRNMEIDSEIS

Query:  VEQASVKKVREVVELSLNMVVGFSTPGTMKLHGEINQSGVVVLIDCGATHNFISQKLVEALQIPTAETSNYGVVMGIGVAVKGKGICKGVILKLPELTIV
          +  + +V  VVELSLN VVG +  GT K+ G ++    V+++DCGATHNFISQKLVE+L +P AET+NYGV+MG G AV+G+ +CKG+ + LP LTIV
Subjt:  VEQASVKKVREVVELSLNMVVGFSTPGTMKLHGEINQSGVVVLIDCGATHNFISQKLVEALQIPTAETSNYGVVMGIGVAVKGKGICKGVILKLPELTIV

Query:  ENFLPLELGGINVVL----------------------------------------EVTLKRLARSLDAHDQGFLMELCALTGYQA--EAPNAAPPMELPR
        E+FLPLELG +++VL                                        E++LK L ++    DQGFL+   A+   +A  E        +   
Subjt:  ENFLPLELGGINVVL----------------------------------------EVTLKRLARSLDAHDQGFLMELCALTGYQA--EAPNAAPPMELPR

Query:  SIDRLLTECQVVFHLPEGLPPTREVDHRILLTEG------HPPNHPPTFIESYEWVAEPLDIMGICRSPDTNEWE---LLLQWKD
          ++L  E   +F +P+ LPP R++DHRI L EG       P  +P       E +   +   GI R P TN +    +L++ KD
Subjt:  SIDRLLTECQVVFHLPEGLPPTREVDHRILLTEG------HPPNHPPTFIESYEWVAEPLDIMGICRSPDTNEWE---LLLQWKD

A0A5D3D378 Gypsy/ty3 element polyprotein2.2e-7239.08Show/hide
Query:  GSLCPRFLAIHQTRTVAEFRRTFEELSAPVQGLSDEVLEGTFVNGLHPLIRAEILSKETIGLQNVMCTAQLIEDRIVARAYNEGTKIPSPNPTQSTLPSR
        G+ C RFLAI Q  +V E+ + FEELSAP+  ++++VL G F NGL P+IR E+ +   +GL+N+M  A+L E+++     + G         Q   P  
Subjt:  GSLCPRFLAIHQTRTVAEFRRTFEELSAPVQGLSDEVLEGTFVNGLHPLIRAEILSKETIGLQNVMCTAQLIEDRIVARAYNEGTKIPSPNPTQSTLPSR

Query:  NKAAQSPLTRTVTLASRNPMFSTASNLN---------RKDPAFKRLSEMVYNACKAKGLCFRCNERYTIGHRCKNHKLRVLLAHDDDPRNMEIDSEISVE
         K  ++P T+ VTLA R P     +N +         R+D  F+R ++    A + KGLC+RC E ++ GHRCKN +LR+ +  DD      +DS    E
Subjt:  NKAAQSPLTRTVTLASRNPMFSTASNLN---------RKDPAFKRLSEMVYNACKAKGLCFRCNERYTIGHRCKNHKLRVLLAHDDDPRNMEIDSEISVE

Query:  QASVKKVREVVELSLNMVVGFSTPGTMKLHGEINQSGVVVLIDCGATHNFISQKLVEALQIPTAETSNYGVVMGIGVAVKGKGICKGVILKLPELTIVEN
             +V  VVELSLN VVG + PGT KL G +    +V+++DCGATHNFIS KLVE L++P AET+NYGV+MG G AV+G+GICKG+ + LP ++IVE+
Subjt:  QASVKKVREVVELSLNMVVGFSTPGTMKLHGEINQSGVVVLIDCGATHNFISQKLVEALQIPTAETSNYGVVMGIGVAVKGKGICKGVILKLPELTIVEN

Query:  FLPLELGGINVVL----------------------------------------EVTLKRLARSLDAHDQGFLMELCALTGYQAEAPNAAPPM--ELPRSI
        FLPLELG I++VL                                        E++LK L ++    DQGFL+   A+   +A+          E     
Subjt:  FLPLELGGINVVL----------------------------------------EVTLKRLARSLDAHDQGFLMELCALTGYQAEAPNAAPPM--ELPRSI

Query:  DRLLTECQVVFHLPEGLPPTREVDHRILLTEGHPP
         +L  E   VF +P+GLPP R +DHRI L EG  P
Subjt:  DRLLTECQVVFHLPEGLPPTREVDHRILLTEGHPP

A0A6J1DN22 Reverse transcriptase2.9e-7241.63Show/hide
Query:  KEGSLCPRFLAIHQTRTVAEFRRTFEELSAPVQGLSDEVLEGTFVNGLHPLIRAEILSKETIGLQNVMCTAQLIEDRIVARAYNEGTKIPSPNPTQSTLP
        K+ +L  RFL+I +  TV E+R+TFE  SA +  ++++V+E                                           +G+    P    +T  
Subjt:  KEGSLCPRFLAIHQTRTVAEFRRTFEELSAPVQGLSDEVLEGTFVNGLHPLIRAEILSKETIGLQNVMCTAQLIEDRIVARAYNEGTKIPSPNPTQSTLP

Query:  SRNKAAQSPLTRTVTLASRNPMFSTASNLNRKDPAFKRLSEMVYNACKAKGLCFRCNERYTIGHRCKNHKLRVLLAHDDDPRNMEIDSE---ISVEQASV
        +  K +++  TRTVTL+ ++  + TA  L RK+   K+L+E  Y   K KGLCFR  E+Y+IGHRCKN +L+V + HDD+   ME+D E   +S E    
Subjt:  SRNKAAQSPLTRTVTLASRNPMFSTASNLNRKDPAFKRLSEMVYNACKAKGLCFRCNERYTIGHRCKNHKLRVLLAHDDDPRNMEIDSE---ISVEQASV

Query:  KKVREVVELSLNMVVGFSTPGTMKLHGEINQSGVVVLIDCGATHNFISQKLVEALQIPTAETSNYGVVMGIGVAVKGKGICKGVILKLPELTIVENFLPL
          V EV EL+LN VVGFSTPGTMKL G I    VV+LIDCGATHNFISQKLV+A  +P  ETSNYGV+MG GV V+GKGICKG+IL LPELTI ENFLPL
Subjt:  KKVREVVELSLNMVVGFSTPGTMKLHGEINQSGVVVLIDCGATHNFISQKLVEALQIPTAETSNYGVVMGIGVAVKGKGICKGVILKLPELTIVENFLPL

Query:  ELGGINVVL----------------------------------------EVTLKRLARSLDAHDQGFLMELCALTGYQ--AEAPNAAPPMELPRSIDRLL
        ELG ++VVL                                        EVTLK+LAR+ + HDQGFL+EL A +      EA  A  P    + I+ L+
Subjt:  ELGGINVVL----------------------------------------EVTLKRLARSLDAHDQGFLMELCALTGYQ--AEAPNAAPPMELPRSIDRLL

Query:  TECQVVFHLPEGLPPTREVDHRILLTEGHP
             VF  P  LPP R +DH+I L +G P
Subjt:  TECQVVFHLPEGLPPTREVDHRILLTEGHP

A0A6J1DN22 Reverse transcriptase3.5e-0639.13Show/hide
Query:  PTFIESYEWVAEPLDIMGICRSPDTNEWELLLQWKDLPEHDVTWETVTTIRRQFPHLHLEDKGVFLPRG
        P  ++ +E + EP ++  +  + DT E E L+ WK   E D TWE + T + QFP+ HLEDK    P G
Subjt:  PTFIESYEWVAEPLDIMGICRSPDTNEWELLLQWKDLPEHDVTWETVTTIRRQFPHLHLEDKGVFLPRG

A0A6J1DN22 Reverse transcriptase1.1e-7138.85Show/hide
Query:  GSLCPRFLAIHQTRTVAEFRRTFEELSAPVQGLSDEVLEGTFVNGLHPLIRAEILSKETIGLQNVMCTAQLIEDRIVARAYNEGTKIPSPNPTQSTLPSR
        G+ C RFLAI Q  +V E+ + FEELSAP+  ++++VL G F NGL P+IR E+ +   +GL+++M  A+L E+++     + G         Q   P  
Subjt:  GSLCPRFLAIHQTRTVAEFRRTFEELSAPVQGLSDEVLEGTFVNGLHPLIRAEILSKETIGLQNVMCTAQLIEDRIVARAYNEGTKIPSPNPTQSTLPSR

Query:  NKAAQSPLTRTVTLASRNPMFSTASNLN---------RKDPAFKRLSEMVYNACKAKGLCFRCNERYTIGHRCKNHKLRVLLAHDDDPRNMEIDSEISVE
         K  ++P T+ VTLA R P     +N +         R+D  F+R ++    A + KGLC+RC E ++ GHRCKN +LR+ +  DD      +DS    E
Subjt:  NKAAQSPLTRTVTLASRNPMFSTASNLN---------RKDPAFKRLSEMVYNACKAKGLCFRCNERYTIGHRCKNHKLRVLLAHDDDPRNMEIDSEISVE

Query:  QASVKKVREVVELSLNMVVGFSTPGTMKLHGEINQSGVVVLIDCGATHNFISQKLVEALQIPTAETSNYGVVMGIGVAVKGKGICKGVILKLPELTIVEN
             +V  VVELSLN VVG + PGT KL G +    +V+++DCGATHNFIS KLVE L++P AET+NYGV+MG G AV+G+GICKG+ + LP ++IVE+
Subjt:  QASVKKVREVVELSLNMVVGFSTPGTMKLHGEINQSGVVVLIDCGATHNFISQKLVEALQIPTAETSNYGVVMGIGVAVKGKGICKGVILKLPELTIVEN

Query:  FLPLELGGINVVL----------------------------------------EVTLKRLARSLDAHDQGFLMELCALTGYQAEAPNAA--PPMELPRSI
        FLPLELG I++VL                                        E++LK L ++    DQGFL+   A+   +A+          E     
Subjt:  FLPLELGGINVVL----------------------------------------EVTLKRLARSLDAHDQGFLMELCALTGYQAEAPNAA--PPMELPRSI

Query:  DRLLTECQVVFHLPEGLPPTREVDHRILLTEGHPP
         +L  E   VF +P+GLPP R +DHRI L EG  P
Subjt:  DRLLTECQVVFHLPEGLPPTREVDHRILLTEGHPP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G29750.1 Eukaryotic aspartyl protease family protein5.8e-0936.84Show/hide
Query:  MVVGFSTPGTMKLHGEINQSGVVVLIDCGATHNFISQKLVEALQIPTAETSNYGVVMGIGVAVKGKGICKGVILKLPELTIVENFLPLELGGINV
        +V+  +    M+ +G I    VVV ID GAT NFI  +L  +L++PT+ T+   V++G    ++  G C G+ L + E+ I ENFL L+L   +V
Subjt:  MVVGFSTPGTMKLHGEINQSGVVVLIDCGATHNFISQKLVEALQIPTAETSNYGVVMGIGVAVKGKGICKGVILKLPELTIVENFLPLELGGINV

AT3G30770.1 Eukaryotic aspartyl protease family protein1.5e-0938.24Show/hide
Query:  KKVREVVELSLNMVVGFSTPGTMKLHGEINQSGVVVLIDCGATHNFISQKLVEALQIPTAETSNYGVVMGIGVAVKGKGICKGVILKLPELTIVENFLPL
        K +R+V   S      F+    M+ +G I+   VVV+ID GAT+NFIS +L   L++PT+ T+   V++G    ++  G C G+ L + E+ I ENFL L
Subjt:  KKVREVVELSLNMVVGFSTPGTMKLHGEINQSGVVVLIDCGATHNFISQKLVEALQIPTAETSNYGVVMGIGVAVKGKGICKGVILKLPELTIVENFLPL

Query:  EL
        +L
Subjt:  EL

AT3G42723.1 aminoacyl-tRNA ligases;ATP binding;nucleotide binding9.6e-0429.35Show/hide
Query:  PRFLAIHQTRTVAEFRRTFEELSAPVQGLSDEVLEGTFVNGLHPLIRAEILSKETIGLQNVMCTAQLIEDRIVARAYNEGTKI---PSPNPT
        P +  I Q  +V E+R  FE L      L  + LE  F+ GL P ++  +   +  G+  +M TAQ +E+      Y  G  +   P   PT
Subjt:  PRFLAIHQTRTVAEFRRTFEELSAPVQGLSDEVLEGTFVNGLHPLIRAEILSKETIGLQNVMCTAQLIEDRIVARAYNEGTKI---PSPNPT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGTGGTTGTGGCGGTTTGCTTTAGAGGAGAAGTCCTTGTGGAGAAGAGTGATCGCTGGAATCTATGGTGTGGACCCTTTCGGGTGGTTTACTAGGAGTAAAATTGG
CTCCTCGATTGGTCGGTCGTGGATCGATATCAAAAGCACTCGAGAAGCTTTTCTAAACTTTACGCTTTTTAAGGTCTTCTCGGGGAATCATGTTCCGTTTTGGGAGGATG
GTTGGGCAGATTCCTCCCCTCTTGCCAGTTTATTTCCAGAGATGTATAGAATGTCAAACAAGAAGGGGGTTTCGATAGCAGAGTGCTGGAATTCAGATTCTCTTACATGG
GATTTCGCCTTTAGAAAAAATTTAAATGACAGAGAATTAGAAAGATGGGCTACGTTTACTGAAAATATTGATGGAGTTTTGTTAGGACAGGGGCAAGACATATTATGGTG
GAAAGTGGATTCAAACGGGATTTTTACAACAAAAACAGCTTTCTTGGCCTTGACCTCCCCTTCTCCAAAGATGAGTACCTCCACGTCTTATCAAATCTGGAATTTTAAAG
TTCCTAAGAAAGATCGGTCCTCGGGGAAGACGTTTGAATGTGCTAGGATGCTGGATGGCCTCTACTACTTTGATGAACCTTCTGGGGGAGAATTACTACGAAATGATCAC
ATTGGTCCAAATCCTGAGTTTAAGTTTTATACTAGAAGAAAACTCAATCAAACGAATGAGGACCAGATAGCTGACTCATTACAGACCCAATCTGAAACTCCAAGGGATAA
TTCTGAAACTCCAGTAAGTAAGGAGATACCTTACAAATTGGTATCAGAGCGTTCAATCTTGGGGAGAACGAGAAGTATGGCACAGAAACAACTCGAGGAACGTTGTGAGA
TCAATGAGAAAGAGATGGTGAGCCTCAAAGAAAGTGTGGATAGGTTGACGAACCGAATGGAGGAACAAGGAAAGGTTGTAGCCATGCTGGTCACAGAGATAACGAGTTCA
AAAGCTAAGGAAGGGACTTACGCCGGTCTAGGCACCCCCGGTACGAAACGGAAACAGTCGGAAGGGGCTGAGGGAGCAGAGTCGATGCCGGATATTCTTGAAGGAGTGGG
TCAACAGGGGAAGGAGGGCAGCCTCTGCCCTCGGTTTCTGGCAATCCATCAGACACGGACAGTGGCAGAATTTCGTCGAACTTTTGAGGAACTATCCGCGCCCGTACAAG
GTTTGTCAGACGAGGTTTTGGAGGGAACATTTGTCAACGGGCTTCACCCTCTCATTCGGGCCGAAATATTGAGCAAGGAAACAATTGGGCTTCAGAATGTCATGTGTACG
GCCCAACTAATAGAGGACCGGATCGTGGCTAGGGCTTATAACGAGGGCACTAAAATTCCGTCACCCAACCCAACCCAATCTACACTACCGTCTAGAAATAAGGCCGCCCA
ATCACCCTTGACACGTACGGTCACACTGGCAAGTCGAAATCCCATGTTTTCCACGGCAAGTAACCTTAATCGAAAAGATCCTGCCTTTAAGCGTTTATCAGAGATGGTGT
ACAACGCATGCAAAGCTAAGGGCCTCTGTTTTCGTTGCAACGAGCGGTACACCATTGGTCACCGATGCAAGAATCACAAACTGAGAGTCCTTTTGGCCCACGACGACGAT
CCAAGAAACATGGAAATAGACAGCGAGATCAGTGTCGAACAAGCTTCGGTGAAGAAAGTCAGAGAAGTGGTCGAGTTATCACTCAACATGGTGGTGGGATTCTCCACTCC
AGGTACGATGAAGCTCCACGGGGAGATCAACCAATCAGGAGTCGTGGTACTGATTGACTGTGGGGCTACCCACAACTTTATCTCCCAAAAGTTGGTCGAAGCATTGCAGA
TCCCCACAGCAGAGACATCCAACTACGGTGTAGTAATGGGTATCGGTGTTGCTGTCAAAGGGAAAGGTATTTGTAAAGGGGTAATTTTAAAACTCCCAGAACTAACCATT
GTGGAAAATTTTCTTCCCTTGGAACTAGGAGGAATCAATGTTGTTTTAGAGGTGACGCTCAAGCGTCTGGCACGCTCGTTGGATGCCCATGATCAGGGCTTTCTAATGGA
GTTATGTGCGCTGACGGGCTACCAAGCGGAAGCTCCGAATGCCGCCCCTCCCATGGAACTACCCCGATCCATTGACAGATTGCTAACCGAATGCCAAGTTGTTTTCCATC
TGCCTGAAGGTTTGCCCCCCACACGGGAAGTTGATCACCGCATACTCCTTACCGAAGGCCACCCACCCAATCATCCTCCAACGTTTATCGAAAGCTACGAATGGGTTGCT
GAACCCTTAGACATTATGGGTATCTGCCGGAGCCCCGATACCAACGAATGGGAACTCCTTCTGCAATGGAAGGACTTACCGGAGCATGATGTCACGTGGGAAACGGTCAC
TACTATCCGCCGCCAATTTCCTCATTTACACCTTGAGGACAAGGGTGTCTTTCTCCCTCGGGGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAGTGGTTGTGGCGGTTTGCTTTAGAGGAGAAGTCCTTGTGGAGAAGAGTGATCGCTGGAATCTATGGTGTGGACCCTTTCGGGTGGTTTACTAGGAGTAAAATTGG
CTCCTCGATTGGTCGGTCGTGGATCGATATCAAAAGCACTCGAGAAGCTTTTCTAAACTTTACGCTTTTTAAGGTCTTCTCGGGGAATCATGTTCCGTTTTGGGAGGATG
GTTGGGCAGATTCCTCCCCTCTTGCCAGTTTATTTCCAGAGATGTATAGAATGTCAAACAAGAAGGGGGTTTCGATAGCAGAGTGCTGGAATTCAGATTCTCTTACATGG
GATTTCGCCTTTAGAAAAAATTTAAATGACAGAGAATTAGAAAGATGGGCTACGTTTACTGAAAATATTGATGGAGTTTTGTTAGGACAGGGGCAAGACATATTATGGTG
GAAAGTGGATTCAAACGGGATTTTTACAACAAAAACAGCTTTCTTGGCCTTGACCTCCCCTTCTCCAAAGATGAGTACCTCCACGTCTTATCAAATCTGGAATTTTAAAG
TTCCTAAGAAAGATCGGTCCTCGGGGAAGACGTTTGAATGTGCTAGGATGCTGGATGGCCTCTACTACTTTGATGAACCTTCTGGGGGAGAATTACTACGAAATGATCAC
ATTGGTCCAAATCCTGAGTTTAAGTTTTATACTAGAAGAAAACTCAATCAAACGAATGAGGACCAGATAGCTGACTCATTACAGACCCAATCTGAAACTCCAAGGGATAA
TTCTGAAACTCCAGTAAGTAAGGAGATACCTTACAAATTGGTATCAGAGCGTTCAATCTTGGGGAGAACGAGAAGTATGGCACAGAAACAACTCGAGGAACGTTGTGAGA
TCAATGAGAAAGAGATGGTGAGCCTCAAAGAAAGTGTGGATAGGTTGACGAACCGAATGGAGGAACAAGGAAAGGTTGTAGCCATGCTGGTCACAGAGATAACGAGTTCA
AAAGCTAAGGAAGGGACTTACGCCGGTCTAGGCACCCCCGGTACGAAACGGAAACAGTCGGAAGGGGCTGAGGGAGCAGAGTCGATGCCGGATATTCTTGAAGGAGTGGG
TCAACAGGGGAAGGAGGGCAGCCTCTGCCCTCGGTTTCTGGCAATCCATCAGACACGGACAGTGGCAGAATTTCGTCGAACTTTTGAGGAACTATCCGCGCCCGTACAAG
GTTTGTCAGACGAGGTTTTGGAGGGAACATTTGTCAACGGGCTTCACCCTCTCATTCGGGCCGAAATATTGAGCAAGGAAACAATTGGGCTTCAGAATGTCATGTGTACG
GCCCAACTAATAGAGGACCGGATCGTGGCTAGGGCTTATAACGAGGGCACTAAAATTCCGTCACCCAACCCAACCCAATCTACACTACCGTCTAGAAATAAGGCCGCCCA
ATCACCCTTGACACGTACGGTCACACTGGCAAGTCGAAATCCCATGTTTTCCACGGCAAGTAACCTTAATCGAAAAGATCCTGCCTTTAAGCGTTTATCAGAGATGGTGT
ACAACGCATGCAAAGCTAAGGGCCTCTGTTTTCGTTGCAACGAGCGGTACACCATTGGTCACCGATGCAAGAATCACAAACTGAGAGTCCTTTTGGCCCACGACGACGAT
CCAAGAAACATGGAAATAGACAGCGAGATCAGTGTCGAACAAGCTTCGGTGAAGAAAGTCAGAGAAGTGGTCGAGTTATCACTCAACATGGTGGTGGGATTCTCCACTCC
AGGTACGATGAAGCTCCACGGGGAGATCAACCAATCAGGAGTCGTGGTACTGATTGACTGTGGGGCTACCCACAACTTTATCTCCCAAAAGTTGGTCGAAGCATTGCAGA
TCCCCACAGCAGAGACATCCAACTACGGTGTAGTAATGGGTATCGGTGTTGCTGTCAAAGGGAAAGGTATTTGTAAAGGGGTAATTTTAAAACTCCCAGAACTAACCATT
GTGGAAAATTTTCTTCCCTTGGAACTAGGAGGAATCAATGTTGTTTTAGAGGTGACGCTCAAGCGTCTGGCACGCTCGTTGGATGCCCATGATCAGGGCTTTCTAATGGA
GTTATGTGCGCTGACGGGCTACCAAGCGGAAGCTCCGAATGCCGCCCCTCCCATGGAACTACCCCGATCCATTGACAGATTGCTAACCGAATGCCAAGTTGTTTTCCATC
TGCCTGAAGGTTTGCCCCCCACACGGGAAGTTGATCACCGCATACTCCTTACCGAAGGCCACCCACCCAATCATCCTCCAACGTTTATCGAAAGCTACGAATGGGTTGCT
GAACCCTTAGACATTATGGGTATCTGCCGGAGCCCCGATACCAACGAATGGGAACTCCTTCTGCAATGGAAGGACTTACCGGAGCATGATGTCACGTGGGAAACGGTCAC
TACTATCCGCCGCCAATTTCCTCATTTACACCTTGAGGACAAGGGTGTCTTTCTCCCTCGGGGGTAA
Protein sequenceShow/hide protein sequence
MKWLWRFALEEKSLWRRVIAGIYGVDPFGWFTRSKIGSSIGRSWIDIKSTREAFLNFTLFKVFSGNHVPFWEDGWADSSPLASLFPEMYRMSNKKGVSIAECWNSDSLTW
DFAFRKNLNDRELERWATFTENIDGVLLGQGQDILWWKVDSNGIFTTKTAFLALTSPSPKMSTSTSYQIWNFKVPKKDRSSGKTFECARMLDGLYYFDEPSGGELLRNDH
IGPNPEFKFYTRRKLNQTNEDQIADSLQTQSETPRDNSETPVSKEIPYKLVSERSILGRTRSMAQKQLEERCEINEKEMVSLKESVDRLTNRMEEQGKVVAMLVTEITSS
KAKEGTYAGLGTPGTKRKQSEGAEGAESMPDILEGVGQQGKEGSLCPRFLAIHQTRTVAEFRRTFEELSAPVQGLSDEVLEGTFVNGLHPLIRAEILSKETIGLQNVMCT
AQLIEDRIVARAYNEGTKIPSPNPTQSTLPSRNKAAQSPLTRTVTLASRNPMFSTASNLNRKDPAFKRLSEMVYNACKAKGLCFRCNERYTIGHRCKNHKLRVLLAHDDD
PRNMEIDSEISVEQASVKKVREVVELSLNMVVGFSTPGTMKLHGEINQSGVVVLIDCGATHNFISQKLVEALQIPTAETSNYGVVMGIGVAVKGKGICKGVILKLPELTI
VENFLPLELGGINVVLEVTLKRLARSLDAHDQGFLMELCALTGYQAEAPNAAPPMELPRSIDRLLTECQVVFHLPEGLPPTREVDHRILLTEGHPPNHPPTFIESYEWVA
EPLDIMGICRSPDTNEWELLLQWKDLPEHDVTWETVTTIRRQFPHLHLEDKGVFLPRG