CuGenDBv2

Moc02g14980 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc02g14980
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: chr2:11124837..11129598
RNA-Seq Expression: Moc02g14980
Synteny: Moc02g14980
Gene Ontology terms: NA
InterPro domains: NA
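
The genome location in the summary above can be split into its parts for downstream use. The following is a minimal sketch, assuming the string follows a `<sequence>:<start>..<end>` layout with 1-based, inclusive coordinates; the record itself does not state the coordinate convention, and the names `parse_location` and `span_length` are hypothetical helpers introduced here for illustration.

```python
import re

# Assumed layout: "<sequence>:<start>..<end>" (1-based, inclusive coordinates).
LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split a location string into (seqid, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return match["seqid"], start, end


def span_length(start, end):
    """Length of the interval, counting both end points (inclusive assumption)."""
    return end - start + 1


seqid, start, end = parse_location("chr2:11124837..11129598")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive assumption the locus spans 4762 bp, which is longer than the CDS listed on this page.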


Homology
GenBank top hits
TXG55646.1 hypothetical protein EZV62_020902 [Acer yangbiense] (e value: 3.7e-97; % identity: 41.43)
Query:  TTEETENSSVPPQVVTNVAVPTPNPSPQFNTS--FGHPLGTVLTVKLDDKNYSLWRGMVLAVLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQV-NP
        TT + +++  P    T         S   N S  FG+ L     +KLD +N+ LW+ MV  +++G + DG++  T   PP+FL SP T G SD     NP
Subjt:  TTEETENSSVPPQVVTNVAVPTPNPSPQFNTS--FGHPLGTVLTVKLDDKNYSLWRGMVLAVLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQV-NP

Query:  EYVEWQAVDQALLGWLFGSMTPSIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASESLKLAGEPVAFNYLMSCV
        EY +W   DQ L+GWL+ SMT ++A  V+   ++  +WKALE+L+GA SK++ N +R  +Q T+K S  M EYL  MK  ++SL +AG+P   N L + +
Subjt:  EYVEWQAVDQALLGWLFGSMTPSIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASESLKLAGEPVAFNYLMSCV

Query:  LSGLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNIVSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRG
        L+GL++EY+PIV  IE ++  +WQE++ TL+++++ L  +N VS A    +S  SA+   +K N+  N     +Q    QG     +        GR RG
Subjt:  LSGLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNIVSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRG

Query:  RFSPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNN---LSSSNNNRNSAYMAIPEIVAEPSWLADSGATDHVTSDLSNLNVKSDYNGKGTLTVGNGNR
        R    R NNS+P+CQ+CGK+GH A+VCY R+D+N+      ++SN N  S ++A PE V + +W ADSGAT+HVT+D  NL++KS+Y G  +L VGNG +
Subjt:  RFSPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNN---LSSSNNNRNSAYMAIPEIVAEPSWLADSGATDHVTSDLSNLNVKSDYNGKGTLTVGNGNR

Query:  LEISHIGHTCLQTKPITSGNLQLSNILHVPKIKRNLLSIAKLTADNNCFVEFHPTCCFVKDKETKKVVLHGVLKDELYQVKLPLQTSNQN
        L+ISH+G   L +  +T  ++ L  +LHVP+I++NLLS+++L  DN+ F+EFH  CCFVKDK T+  VL G LK+ LYQ+++P   S  N
Subjt:  LEISHIGHTCLQTKPITSGNLQLSNILHVPKIKRNLLSIAKLTADNNCFVEFHPTCCFVKDKETKKVVLHGVLKDELYQVKLPLQTSNQN

TXG67243.1 hypothetical protein EZV62_008518 [Acer yangbiense] (e value: 3.4e-95; % identity: 40.72)
Query:  MTTEETENSSVPPQVVTNVAVPT----PNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRGMVLAVLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQ-
        M+T   + S++ P   ++ A PT     + S   ++ FG+ L     +KLD +N+ LW+ MV  +++G + DG++  T   PP+FL SP T G       
Subjt:  MTTEETENSSVPPQVVTNVAVPT----PNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRGMVLAVLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQ-

Query:  --------VNPEYVEWQAVDQALLGWLFGSMTPSIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASESLKLAGE
                 NPEY +W   DQ L+GWL+ SMT ++A  V+   ++  +WKALE+L+GA SK++ N +R  +Q T+K S  M EYL  MK  ++SL +AG+
Subjt:  --------VNPEYVEWQAVDQALLGWLFGSMTPSIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASESLKLAGE

Query:  PVAFNYLMSCVLSGLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNIVSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSND
        P   N L +  L+GL++EY+PIV  IE ++  +WQE++ TL+++++ L  +N VS A    +S  SA+   +K N+  N     +Q    QG     +  
Subjt:  PVAFNYLMSCVLSGLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNIVSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSND

Query:  AKNNVRGRGRGRFSPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNN---LSSSNNNRNSAYMAIPEIVAEPSWLADSGATDHVTSDLSNLNVKSDYNG
              GR RGR    R NNS+P+CQ+CGK+GH A+VCY R+D+N+      ++SN N  S ++A PE V + +W ADSGATDHVT+D  NL++KSDY G
Subjt:  AKNNVRGRGRGRFSPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNN---LSSSNNNRNSAYMAIPEIVAEPSWLADSGATDHVTSDLSNLNVKSDYNG

Query:  KGTLTVGNGNRLEISHIGHTCLQTKPITSGNLQLSNILHVPKIKRNLLSIAKLTADNNCFVEFHPTCCFVKDKETKKVVLHGVLKDELYQVKLPLQTSNQ
          +L VGNG +L+ISH+G   L +  +T  ++ L  +LHVP+I++NLLS+++L  DN+ F+EFH  CCFVKDK T   VL G LK+ LYQ+++P   S  
Subjt:  KGTLTVGNGNRLEISHIGHTCLQTKPITSGNLQLSNILHVPKIKRNLLSIAKLTADNNCFVEFHPTCCFVKDKETKKVVLHGVLKDELYQVKLPLQTSNQ

Query:  N
        N
Subjt:  N

TXG69253.1 hypothetical protein EZV62_004188 [Acer yangbiense] (e value: 1.3e-94; % identity: 40.52)
Query:  MTTEETENSSVPPQVVTNVAVPT----PNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRGMVLAVLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQ-
        M+T   + S++ P   ++ A PT     + S   ++ FG+ L     +KLD +N+ LW+ MV  +++G + DG++  T   PP+FL SP T G       
Subjt:  MTTEETENSSVPPQVVTNVAVPT----PNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRGMVLAVLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQ-

Query:  --------VNPEYVEWQAVDQALLGWLFGSMTPSIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASESLKLAGE
                 NPEY +W   DQ L+GWL+ SMT ++A  V+   ++  +WKALE+L+GA SK++ N +R  +Q T+K S  M EYL  MK  ++SL +AG+
Subjt:  --------VNPEYVEWQAVDQALLGWLFGSMTPSIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASESLKLAGE

Query:  PVAFNYLMSCVLSGLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNIVSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSND
        P   N L +  L+GL++EY+PIV  IE ++  +WQE++ TL+++++ L  +N VS A    +S  SA+   +K N+  N     +Q    QG     +  
Subjt:  PVAFNYLMSCVLSGLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNIVSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSND

Query:  AKNNVRGRGRGRFSPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNN---LSSSNNNRNSAYMAIPEIVAEPSWLADSGATDHVTSDLSNLNVKSDYNG
              GR RGR    R NNS+P+CQ+CGK+GH A+VCY R+D+N+      ++SN N  S ++A PE V + +W ADSGAT+HVT+D  NL++KSDY G
Subjt:  AKNNVRGRGRGRFSPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNN---LSSSNNNRNSAYMAIPEIVAEPSWLADSGATDHVTSDLSNLNVKSDYNG

Query:  KGTLTVGNGNRLEISHIGHTCLQTKPITSGNLQLSNILHVPKIKRNLLSIAKLTADNNCFVEFHPTCCFVKDKETKKVVLHGVLKDELYQVKLPLQTSNQ
          +L VGNG +L+ISH+G   L +  +T  ++ L  +LHVP+I++NLLS+++L  DN+ F+EFH  CCFVKDK T   VL G LK+ LYQ+++P   S  
Subjt:  KGTLTVGNGNRLEISHIGHTCLQTKPITSGNLQLSNILHVPKIKRNLLSIAKLTADNNCFVEFHPTCCFVKDKETKKVVLHGVLKDELYQVKLPLQTSNQ

Query:  N
        N
Subjt:  N

XP_022157748.1 uncharacterized protein LOC111024384 isoform X1 [Momordica charantia] (e value: 2.1e-217; % identity: 99.74)
Query:  MTTEETENSSVPPQVVTNVAVPTPNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRGMVLAVLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEY
        MTTEETENSSVPPQVVTNVAVPTPNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRGMVLAVLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEY
Subjt:  MTTEETENSSVPPQVVTNVAVPTPNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRGMVLAVLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEY

Query:  VEWQAVDQALLGWLFGSMTPSIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASESLKLAGEPVAFNYLMSCVLS
        VEWQAVDQALLGWLFGSMTPSIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASESLKLAGEPVAFNYLMSCVLS
Subjt:  VEWQAVDQALLGWLFGSMTPSIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASESLKLAGEPVAFNYLMSCVLS

Query:  GLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNIVSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRGRF
        GLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNIVSTATAEGISDGS NYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRGRF
Subjt:  GLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNIVSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRGRF

Query:  SPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNNLSSSNNNRNSAYMAIPEIVAEPSWLADSGATDHVTSDLSNLNVKSDYNGKG
        SPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNNLSSSNNNRNSAYMAIPEIVAEPSWLADSGATDHVTSDLSNLNVKSDYNGKG
Subjt:  SPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNNLSSSNNNRNSAYMAIPEIVAEPSWLADSGATDHVTSDLSNLNVKSDYNGKG

XP_022157750.1 uncharacterized protein LOC111024384 isoform X2 [Momordica charantia] (e value: 6.1e-217; % identity: 99.48)
Query:  MTTEETENSSVPPQVVTNVAVPTPNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRGMVLAVLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEY
        MTTEETENSSVPPQVVTNVAVPTPNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRGMVLAVLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEY
Subjt:  MTTEETENSSVPPQVVTNVAVPTPNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRGMVLAVLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEY

Query:  VEWQAVDQALLGWLFGSMTPSIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASESLKLAGEPVAFNYLMSCVLS
        VEWQAVDQALLGWLFGSMTPSIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASESLKLAGEPVAFNYLMSCVLS
Subjt:  VEWQAVDQALLGWLFGSMTPSIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASESLKLAGEPVAFNYLMSCVLS

Query:  GLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNIVSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRGRF
        GLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNIVSTATAEGISDGS NYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRGRF
Subjt:  GLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNIVSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRGRF

Query:  SPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNNLSSSNNNRNSAYMAIPEIVAEPSWLADSGATDHVTSDLSNLNVKSDYNGKG
        SPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNNLSSSNNNRNSAYMAIPEIVAEPSWLADSGATDHVTSDLSNLNVKSDYNG+G
Subjt:  SPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNNLSSSNNNRNSAYMAIPEIVAEPSWLADSGATDHVTSDLSNLNVKSDYNGKG

TrEMBL top hits
A0A5C7HHE9 Uncharacterized protein (e value: 1.8e-97; % identity: 41.43)
Query:  TTEETENSSVPPQVVTNVAVPTPNPSPQFNTS--FGHPLGTVLTVKLDDKNYSLWRGMVLAVLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQV-NP
        TT + +++  P    T         S   N S  FG+ L     +KLD +N+ LW+ MV  +++G + DG++  T   PP+FL SP T G SD     NP
Subjt:  TTEETENSSVPPQVVTNVAVPTPNPSPQFNTS--FGHPLGTVLTVKLDDKNYSLWRGMVLAVLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQV-NP

Query:  EYVEWQAVDQALLGWLFGSMTPSIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASESLKLAGEPVAFNYLMSCV
        EY +W   DQ L+GWL+ SMT ++A  V+   ++  +WKALE+L+GA SK++ N +R  +Q T+K S  M EYL  MK  ++SL +AG+P   N L + +
Subjt:  EYVEWQAVDQALLGWLFGSMTPSIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASESLKLAGEPVAFNYLMSCV

Query:  LSGLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNIVSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRG
        L+GL++EY+PIV  IE ++  +WQE++ TL+++++ L  +N VS A    +S  SA+   +K N+  N     +Q    QG     +        GR RG
Subjt:  LSGLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNIVSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRG

Query:  RFSPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNN---LSSSNNNRNSAYMAIPEIVAEPSWLADSGATDHVTSDLSNLNVKSDYNGKGTLTVGNGNR
        R    R NNS+P+CQ+CGK+GH A+VCY R+D+N+      ++SN N  S ++A PE V + +W ADSGAT+HVT+D  NL++KS+Y G  +L VGNG +
Subjt:  RFSPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNN---LSSSNNNRNSAYMAIPEIVAEPSWLADSGATDHVTSDLSNLNVKSDYNGKGTLTVGNGNR

Query:  LEISHIGHTCLQTKPITSGNLQLSNILHVPKIKRNLLSIAKLTADNNCFVEFHPTCCFVKDKETKKVVLHGVLKDELYQVKLPLQTSNQN
        L+ISH+G   L +  +T  ++ L  +LHVP+I++NLLS+++L  DN+ F+EFH  CCFVKDK T+  VL G LK+ LYQ+++P   S  N
Subjt:  LEISHIGHTCLQTKPITSGNLQLSNILHVPKIKRNLLSIAKLTADNNCFVEFHPTCCFVKDKETKKVVLHGVLKDELYQVKLPLQTSNQN

A0A5C7ID32 Uncharacterized protein (e value: 1.7e-95; % identity: 40.72)
Query:  MTTEETENSSVPPQVVTNVAVPT----PNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRGMVLAVLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQ-
        M+T   + S++ P   ++ A PT     + S   ++ FG+ L     +KLD +N+ LW+ MV  +++G + DG++  T   PP+FL SP T G       
Subjt:  MTTEETENSSVPPQVVTNVAVPT----PNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRGMVLAVLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQ-

Query:  --------VNPEYVEWQAVDQALLGWLFGSMTPSIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASESLKLAGE
                 NPEY +W   DQ L+GWL+ SMT ++A  V+   ++  +WKALE+L+GA SK++ N +R  +Q T+K S  M EYL  MK  ++SL +AG+
Subjt:  --------VNPEYVEWQAVDQALLGWLFGSMTPSIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASESLKLAGE

Query:  PVAFNYLMSCVLSGLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNIVSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSND
        P   N L +  L+GL++EY+PIV  IE ++  +WQE++ TL+++++ L  +N VS A    +S  SA+   +K N+  N     +Q    QG     +  
Subjt:  PVAFNYLMSCVLSGLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNIVSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSND

Query:  AKNNVRGRGRGRFSPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNN---LSSSNNNRNSAYMAIPEIVAEPSWLADSGATDHVTSDLSNLNVKSDYNG
              GR RGR    R NNS+P+CQ+CGK+GH A+VCY R+D+N+      ++SN N  S ++A PE V + +W ADSGATDHVT+D  NL++KSDY G
Subjt:  AKNNVRGRGRGRFSPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNN---LSSSNNNRNSAYMAIPEIVAEPSWLADSGATDHVTSDLSNLNVKSDYNG

Query:  KGTLTVGNGNRLEISHIGHTCLQTKPITSGNLQLSNILHVPKIKRNLLSIAKLTADNNCFVEFHPTCCFVKDKETKKVVLHGVLKDELYQVKLPLQTSNQ
          +L VGNG +L+ISH+G   L +  +T  ++ L  +LHVP+I++NLLS+++L  DN+ F+EFH  CCFVKDK T   VL G LK+ LYQ+++P   S  
Subjt:  KGTLTVGNGNRLEISHIGHTCLQTKPITSGNLQLSNILHVPKIKRNLLSIAKLTADNNCFVEFHPTCCFVKDKETKKVVLHGVLKDELYQVKLPLQTSNQ

Query:  N
        N
Subjt:  N

A0A6J1DTZ7 uncharacterized protein LOC111024384 isoform X2 (e value: 2.9e-217; % identity: 99.48)
Query:  MTTEETENSSVPPQVVTNVAVPTPNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRGMVLAVLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEY
        MTTEETENSSVPPQVVTNVAVPTPNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRGMVLAVLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEY
Subjt:  MTTEETENSSVPPQVVTNVAVPTPNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRGMVLAVLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEY

Query:  VEWQAVDQALLGWLFGSMTPSIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASESLKLAGEPVAFNYLMSCVLS
        VEWQAVDQALLGWLFGSMTPSIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASESLKLAGEPVAFNYLMSCVLS
Subjt:  VEWQAVDQALLGWLFGSMTPSIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASESLKLAGEPVAFNYLMSCVLS

Query:  GLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNIVSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRGRF
        GLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNIVSTATAEGISDGS NYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRGRF
Subjt:  GLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNIVSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRGRF

Query:  SPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNNLSSSNNNRNSAYMAIPEIVAEPSWLADSGATDHVTSDLSNLNVKSDYNGKG
        SPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNNLSSSNNNRNSAYMAIPEIVAEPSWLADSGATDHVTSDLSNLNVKSDYNG+G
Subjt:  SPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNNLSSSNNNRNSAYMAIPEIVAEPSWLADSGATDHVTSDLSNLNVKSDYNGKG

A0A6J1DU77 uncharacterized protein LOC111024384 isoform X1 (e value: 1.0e-217; % identity: 99.74)
Query:  MTTEETENSSVPPQVVTNVAVPTPNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRGMVLAVLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEY
        MTTEETENSSVPPQVVTNVAVPTPNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRGMVLAVLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEY
Subjt:  MTTEETENSSVPPQVVTNVAVPTPNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRGMVLAVLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEY

Query:  VEWQAVDQALLGWLFGSMTPSIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASESLKLAGEPVAFNYLMSCVLS
        VEWQAVDQALLGWLFGSMTPSIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASESLKLAGEPVAFNYLMSCVLS
Subjt:  VEWQAVDQALLGWLFGSMTPSIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASESLKLAGEPVAFNYLMSCVLS

Query:  GLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNIVSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRGRF
        GLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNIVSTATAEGISDGS NYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRGRF
Subjt:  GLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNIVSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRGRF

Query:  SPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNNLSSSNNNRNSAYMAIPEIVAEPSWLADSGATDHVTSDLSNLNVKSDYNGKG
        SPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNNLSSSNNNRNSAYMAIPEIVAEPSWLADSGATDHVTSDLSNLNVKSDYNGKG
Subjt:  SPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNNLSSSNNNRNSAYMAIPEIVAEPSWLADSGATDHVTSDLSNLNVKSDYNGKG

A0A803PEH4 Uncharacterized protein (e value: 1.3e-95; % identity: 42.22)
Query:  TTEETENSSVPPQVVTNVAVPTPNPSPQFNTSFGHP-LGTVLTVKLDDKNYSLWRGMVLAVLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEY
        T     NSSV     +N      N + Q   +F  P L    ++KLD  NY+LW+ MV  ++RG +  GY+ GTL  PP+F++  +T+ T      NPEY
Subjt:  TTEETENSSVPPQVVTNVAVPTPNPSPQFNTSFGHP-LGTVLTVKLDDKNYSLWRGMVLAVLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEY

Query:  VEWQAVDQALLGWLFGSMTPSIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASESLKLAGEPVAFNYLMSCVLS
          W   DQ L+GWL+ SMT  IA +V+   S+  + + LE LYGA SK++++  R ++Q T+K S  MSEYL   K  S  L LAG+P    +L++ VL 
Subjt:  VEWQAVDQALLGWLFGSMTPSIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASESLKLAGEPVAFNYLMSCVLS

Query:  GLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNIVSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSY-NSNDAKNNVRGRGRGR
        GL+AEYL IV QIE + +T+WQEL   L++F++ + RL  ++  + +  S      + +K N+ G  +  QSQ+      G + NS    N  RGRGRG 
Subjt:  GLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNIVSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSY-NSNDAKNNVRGRGRGR

Query:  FSPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENF----------NNLSSSNNNRNSAYMAIPEIVAEPSWLADSGATDHVTSDLSNLNVKSDYNGKGTLT
             G+ S+P+CQ+ GKYGH AAVCY RFDE++           N +   NN +SA++A PE++   +W ADSGA++H+TSD +NL  K DYNGK ++ 
Subjt:  FSPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENF----------NNLSSSNNNRNSAYMAIPEIVAEPSWLADSGATDHVTSDLSNLNVKSDYNGKGTLT

Query:  VGNGNRLEISHIGHTCLQTKPITSGN-LQLSNILHVPKIKRNLLSIAKLTADNNCFVEFHPTCCFVKDKETKKVVLHGVLKDELYQVKLPLQTSNQNQNQ
        VGNG++L I+HIG+  L    I SGN L L ++L VPKI +NL+S++KL  DNN  +EF+   C VKDK TKKV+LHGVLKDELYQ+  P   S+    Q
Subjt:  VGNGNRLEISHIGHTCLQTKPITSGN-LQLSNILHVPKIKRNLLSIAKLTADNNCFVEFHPTCCFVKDKETKKVVLHGVLKDELYQVKLPLQTSNQNQNQ

Query:  QRSMSSVQQCLASN
           +S+    + SN
Subjt:  QRSMSSVQQCLASN

SwissProt top hits
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 1.7e-44; % identity: 31.24)
Query:  KLDDKNYSLWRGMVLAVLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEYVEWQAVDQALLGWLFGSMTPSIACDVVDFRSSREVWKALEDLYG
        KL   NY +W   V A+  G +  G++ G+   P      P T GT    +VNP+Y  W+  D+ +   + G+++ S+   V    ++ ++W+ L  +Y 
Subjt:  KLDDKNYSLWRGMVLAVLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEYVEWQAVDQALLGWLFGSMTPSIACDVVDFRSSREVWKALEDLYG

Query:  ATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASESLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDS-TSWQELFATLVTFENTLMRLNIVST
          S   + QLR  L+   K +  + +Y+  +    + L L G+P+  +  +  VL  L  EY P++ QI  KD+  +  E+   L+  E+ ++    VS+
Subjt:  ATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASESLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDS-TSWQELFATLVTFENTLMRLNIVST

Query:  ATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVR--GRGRGRFSPYRGNNSKP---SCQLCGKYGHIAAVCYKRFDENFNNLSS
        AT   I   +AN V  +  +  N       +  G     Y++ +  NN +   +    F P   N SKP    CQ+CG  GH A    KR  +  + LSS
Subjt:  ATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVR--GRGRGRFSPYRGNNSKP---SCQLCGKYGHIAAVCYKRFDENFNNLSS

Query:  SNNN---------RNSAYMAIPEIVAEPSWLADSGATDHVTSDLSNLNVKSDYNGKGTLTVGNGNRLEISHIGHTCLQTKPITSGNLQLSNILHVPKIKR
         N+          +  A +A+    +  +WL DSGAT H+TSD +NL++   Y G   + V +G+ + ISH G T L TK   S  L L NIL+VP I +
Subjt:  SNNN---------RNSAYMAIPEIVAEPSWLADSGATDHVTSDLSNLNVKSDYNGKGTLTVGNGNRLEISHIGHTCLQTKPITSGNLQLSNILHVPKIKR

Query:  NLLSIAKLTADNNCFVEFHPTCCFVKDKETKKVVLHGVLKDELYQ
        NL+S+ +L   N   VEF P    VKD  T   +L G  KDELY+
Subjt:  NLLSIAKLTADNNCFVEFHPTCCFVKDKETKKVVLHGVLKDELYQ

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 3.6e-39; % identity: 28.34)
Query:  KLDDKNYSLWRGMVLAVLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEYVEWQAVDQALLGWLFGSMTPSIACDVVDFRSSREVWKALEDLYG
        KL   NY +W   V A+  G +  G++ G+   P      P T GT    +VNP+Y  W+  D+ +   + G+++ S+   V    ++ ++W+ L  +Y 
Subjt:  KLDDKNYSLWRGMVLAVLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEYVEWQAVDQALLGWLFGSMTPSIACDVVDFRSSREVWKALEDLYG

Query:  ATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASESLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDS-TSWQELFATLVTFENTLMRLNIVST
          S   + QLR + +                    + L L G+P+  +  +  VL  L  +Y P++ QI  KD+  S  E+   L+  E+ L+ LN    
Subjt:  ATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASESLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDS-TSWQELFATLVTFENTLMRLNIVST

Query:  ATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRGRFSPYRGNNSKPS-----CQLCGKYGHIAAVC--YKRFDENFNNL
                 +AN V  +     N   +++Q+ +G  R   N+N+  N+ +    G     R +N +P      CQ+C   GH A  C    +F    N  
Subjt:  ATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRGRFSPYRGNNSKPS-----CQLCGKYGHIAAVC--YKRFDENFNNL

Query:  SSSNNN---RNSAYMAIPEIVAEPSWLADSGATDHVTSDLSNLNVKSDYNGKGTLTVGNGNRLEISHIGHTCLQTKPITSGNLQLSNILHVPKIKRNLLS
         S++     +  A +A+       +WL DSGAT H+TSD +NL+    Y G   + + +G+ + I+H G   L   P +S +L L+ +L+VP I +NL+S
Subjt:  SSSNNN---RNSAYMAIPEIVAEPSWLADSGATDHVTSDLSNLNVKSDYNGKGTLTVGNGNRLEISHIGHTCLQTKPITSGNLQLSNILHVPKIKRNLLS

Query:  IAKLTADNNCFVEFHPTCCFVKDKETKKVVLHGVLKDELYQ
        + +L   N   VEF P    VKD  T   +L G  KDELY+
Subjt:  IAKLTADNNCFVEFHPTCCFVKDKETKKVVLHGVLKDELYQ

Arabidopsis top hits
AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) (e value: 5.3e-09; % identity: 23.79)
Query:  LTVKLDDKNYSLWRGMVLAVLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEYVEWQAVDQALLGWLFGSMTPSIACDVVDFR-SSREVWKALE
        +T+ L+  NY +WR +   +       G++ G+         +P TE              W+  D  +  W++G++T S+   ++    ++R++W +LE
Subjt:  LTVKLDDKNYSLWRGMVLAVLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEYVEWQAVDQALLGWLFGSMTPSIACDVVDFR-SSREVWKALE

Query:  DLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASESLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDS-TSWQELFATLVTFENTLMRLN
        +L+    +AR  Q  N L+ T  + L + EY   +K  S+ L     P++   L+  +L+GL  +Y  I+  I+ K    S+ E  + L+  E+ L   +
Subjt:  DLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASESLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDS-TSWQELFATLVTFENTLMRLN

Query:  IVSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRGRFSPYRGNNS
          S +     S  +  +   +Q     +++H + S  G+GR     +  KN   G   GR   Y  NN+
Subjt:  IVSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRGRFSPYRGNNS
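
The alignments above use the BLAST pairwise layout, in which the unlabeled middle line repeats a residue where query and subject match exactly, shows `+` for a conservative substitution, and is blank otherwise. The sketch below shows how identity could be tallied from a query line and its middle line. The function name `alignment_stats` is hypothetical, and the percentage it returns is taken over the alignment columns passed in, which is an assumption about how the reported %identity is normalised.

```python
def alignment_stats(query, midline):
    """Return (identities, positives, columns, percent_identity).

    `query` is the aligned query text (gaps as '-') and `midline` is the
    match line printed beneath it, both without the "Query:" prefix.
    """
    columns = len(query)
    # Scraped pages often lose trailing blanks, so pad the match line back out.
    midline = midline.ljust(columns)[:columns]
    # A letter on the match line marks an exact match in that column.
    identities = sum(1 for symbol in midline if symbol.isalpha())
    # '+' marks a conservative substitution; positives include identities.
    positives = identities + midline.count("+")
    percent = 100.0 * identities / columns if columns else 0.0
    return identities, positives, columns, percent


# Toy example, not taken from the hits above.
print(alignment_stats("ACDE-F", "A+D  F"))
```

For a hit that is printed as several blocks, the query lines and the match lines would each be concatenated before calling the function.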


Sequences
CDS sequence
ATGACGACCGAAGAAACCGAAAATTCATCTGTTCCTCCACAAGTGGTCACAAATGTGGCTGTCCCAACACCAAATCCTTCACCACAATTTAATACCTCCTTTGGTCATCC
CCTGGGCACTGTTTTAACAGTAAAGTTGGATGACAAAAATTATTCTCTTTGGAGAGGAATGGTGCTCGCTGTCTTAAGGGGTCAAAAATTTGATGGGTATGTGCTGGGAA
CCTTGGCCAAACCACCACAGTTTCTTGTCTCACCAGAAACTGAAGGAACTTCAGACCATCTTCAAGTGAATCCTGAATATGTGGAGTGGCAAGCAGTTGATCAAGCTCTA
CTTGGTTGGCTTTTTGGATCAATGACTCCTTCTATTGCCTGCGATGTCGTTGACTTCAGAAGTTCAAGAGAAGTATGGAAAGCTCTTGAGGATCTCTATGGAGCAACAAG
TAAGGCACGCATAAATCAGTTGCGGAATGTTCTTCAAAATACCAAGAAAAACTCTCTGAAGATGTCAGAATATCTTGGACTTATGAAACAAGCCTCTGAAAGTCTCAAAT
TAGCAGGTGAGCCTGTTGCTTTTAATTATTTAATGTCTTGTGTACTCTCAGGTTTAGAGGCAGAATATCTTCCAATTGTCTGTCAAATTGAAGGGAAAGATTCAACTTCA
TGGCAAGAGTTGTTTGCTACACTAGTGACGTTTGAAAACACTTTAATGAGGCTAAATATTGTTTCTACCGCTACTGCTGAGGGCATCTCTGATGGGAGTGCTAATTATGT
ACATTCAAAGCAAAATTCAGTTGGGAATAGACAGTTCCATCAGTCTCAATCAGGACAAGGACAAGGAAGAGGCAGTTACAACTCAAATGATGCTAAAAACAACGTGAGAG
GAAGAGGTCGTGGCAGATTCAGTCCTTATAGAGGAAATAACTCTAAACCAAGTTGTCAACTATGTGGCAAATATGGGCATATAGCAGCTGTTTGTTACAAAAGGTTTGAT
GAAAACTTCAATAATTTGTCTAGCTCCAACAACAACCGTAATTCTGCATATATGGCTATCCCAGAGATTGTTGCTGAACCTAGTTGGTTAGCAGATAGTGGGGCTACAGA
TCATGTCACTTCAGACCTCTCAAACTTGAATGTTAAGTCTGATTACAATGGTAAAGGTACATTAACTGTTGGTAATGGTAATAGGCTAGAAATTTCACATATTGGGCACA
CTTGTTTGCAAACCAAACCTATTACTTCTGGCAATTTACAACTCAGCAATATACTTCATGTTCCAAAAATTAAAAGAAACCTCTTGAGTATTGCCAAACTCACTGCTGAT
AATAATTGTTTTGTTGAATTTCATCCGACTTGTTGTTTTGTGAAGGACAAGGAAACAAAGAAGGTGGTGCTGCACGGAGTTCTCAAAGATGAACTATACCAAGTCAAGTT
ACCTCTCCAAACCAGCAATCAAAATCAAAACCAGCAGCGTTCAATGTCTTCTGTTCAACAATGTTTAGCTAGCAACAATCTGTCTTTGTCTACTAGCAATAGCACCTTCA
GAACCACTATCCTCTTTGTTGCCAAGTCCAGAGCATACAATTGGGAGCCAAAGTCAAGGCCACGTTTTACGAATGGTTGGATGATCGATAGGCCAAGGTCGGGTGTCTTG
TCAGATTTCATCCCTATAAATAGGGATGCATGCCCCTTGTGCAAGTTACGCAAATCCATTTGCATTCTGAGAGTTAGATAG
mRNA sequence
ATGACGACCGAAGAAACCGAAAATTCATCTGTTCCTCCACAAGTGGTCACAAATGTGGCTGTCCCAACACCAAATCCTTCACCACAATTTAATACCTCCTTTGGTCATCC
CCTGGGCACTGTTTTAACAGTAAAGTTGGATGACAAAAATTATTCTCTTTGGAGAGGAATGGTGCTCGCTGTCTTAAGGGGTCAAAAATTTGATGGGTATGTGCTGGGAA
CCTTGGCCAAACCACCACAGTTTCTTGTCTCACCAGAAACTGAAGGAACTTCAGACCATCTTCAAGTGAATCCTGAATATGTGGAGTGGCAAGCAGTTGATCAAGCTCTA
CTTGGTTGGCTTTTTGGATCAATGACTCCTTCTATTGCCTGCGATGTCGTTGACTTCAGAAGTTCAAGAGAAGTATGGAAAGCTCTTGAGGATCTCTATGGAGCAACAAG
TAAGGCACGCATAAATCAGTTGCGGAATGTTCTTCAAAATACCAAGAAAAACTCTCTGAAGATGTCAGAATATCTTGGACTTATGAAACAAGCCTCTGAAAGTCTCAAAT
TAGCAGGTGAGCCTGTTGCTTTTAATTATTTAATGTCTTGTGTACTCTCAGGTTTAGAGGCAGAATATCTTCCAATTGTCTGTCAAATTGAAGGGAAAGATTCAACTTCA
TGGCAAGAGTTGTTTGCTACACTAGTGACGTTTGAAAACACTTTAATGAGGCTAAATATTGTTTCTACCGCTACTGCTGAGGGCATCTCTGATGGGAGTGCTAATTATGT
ACATTCAAAGCAAAATTCAGTTGGGAATAGACAGTTCCATCAGTCTCAATCAGGACAAGGACAAGGAAGAGGCAGTTACAACTCAAATGATGCTAAAAACAACGTGAGAG
GAAGAGGTCGTGGCAGATTCAGTCCTTATAGAGGAAATAACTCTAAACCAAGTTGTCAACTATGTGGCAAATATGGGCATATAGCAGCTGTTTGTTACAAAAGGTTTGAT
GAAAACTTCAATAATTTGTCTAGCTCCAACAACAACCGTAATTCTGCATATATGGCTATCCCAGAGATTGTTGCTGAACCTAGTTGGTTAGCAGATAGTGGGGCTACAGA
TCATGTCACTTCAGACCTCTCAAACTTGAATGTTAAGTCTGATTACAATGGTAAAGGTACATTAACTGTTGGTAATGGTAATAGGCTAGAAATTTCACATATTGGGCACA
CTTGTTTGCAAACCAAACCTATTACTTCTGGCAATTTACAACTCAGCAATATACTTCATGTTCCAAAAATTAAAAGAAACCTCTTGAGTATTGCCAAACTCACTGCTGAT
AATAATTGTTTTGTTGAATTTCATCCGACTTGTTGTTTTGTGAAGGACAAGGAAACAAAGAAGGTGGTGCTGCACGGAGTTCTCAAAGATGAACTATACCAAGTCAAGTT
ACCTCTCCAAACCAGCAATCAAAATCAAAACCAGCAGCGTTCAATGTCTTCTGTTCAACAATGTTTAGCTAGCAACAATCTGTCTTTGTCTACTAGCAATAGCACCTTCA
GAACCACTATCCTCTTTGTTGCCAAGTCCAGAGCATACAATTGGGAGCCAAAGTCAAGGCCACGTTTTACGAATGGTTGGATGATCGATAGGCCAAGGTCGGGTGTCTTG
TCAGATTTCATCCCTATAAATAGGGATGCATGCCCCTTGTGCAAGTTACGCAAATCCATTTGCATTCTGAGAGTTAGATAG
Protein sequence
MTTEETENSSVPPQVVTNVAVPTPNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRGMVLAVLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEYVEWQAVDQAL
LGWLFGSMTPSIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASESLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDSTS
WQELFATLVTFENTLMRLNIVSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRGRFSPYRGNNSKPSCQLCGKYGHIAAVCYKRFD
ENFNNLSSSNNNRNSAYMAIPEIVAEPSWLADSGATDHVTSDLSNLNVKSDYNGKGTLTVGNGNRLEISHIGHTCLQTKPITSGNLQLSNILHVPKIKRNLLSIAKLTAD
NNCFVEFHPTCCFVKDKETKKVVLHGVLKDELYQVKLPLQTSNQNQNQQRSMSSVQQCLASNNLSLSTSNSTFRTTILFVAKSRAYNWEPKSRPRFTNGWMIDRPRSGVL
SDFIPINRDACPLCKLRKSICILRVR
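
The protein sequence above appears to be the conceptual translation of the CDS under the standard genetic code. The sketch below translates a CDS string so that the relationship can be checked; the function name `translate` and the choice to stop at the first stop codon are illustrative assumptions, not something the record specifies.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequence
    if len(cds) % 3:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS listed above.
print(translate("ATGACGACCGAAGAAACCGAAAAT"))  # MTTEETEN
```

Passing the full CDS with its line breaks intact works as well, because whitespace is removed before translation.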