; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g08870 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g08870
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr9:7282372..7285163
RNA-Seq ExpressionMoc09g08870
SyntenyMoc09g08870
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]1.3e-5737.2Show/hide
Query:  SSSTSSKLFHPGNKITTVKLDEENFLLWKLQILTTLRGYGLEDYVNSNAIVPSQLLPS--SGDTLATPILNTEYSHWIRQDSLITAWLLGSMSNSLFSEM
        +SS  +++F  GNKI+ VKL+++ FLLWK QILT L  Y LE+++ S +  PS+ L S  S    AT   N  Y  W RQD LI++WLLGSMS  + ++M
Subjt:  SSSTSSKLFHPGNKITTVKLDEENFLLWKLQILTTLRGYGLEDYVNSNAIVPSQLLPS--SGDTLATPILNTEYSHWIRQDSLITAWLLGSMSNSLFSEM

Query:  LDCKTAREVWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFQKIKNLVDALAAAGLKISHEDHVLHILQGLGSEYDSVVSVITDKDISPSLQKVY
        L CK+A+E+W+ L   FSSR +A+ M  K+KL   KKG + L+EYF KI   VDALA+    +S +DH+L+IL GLGS+Y S++SVI+ +  SPS+Q+V 
Subjt:  LDCKTAREVWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFQKIKNLVDALAAAGLKISHEDHVLHILQGLGSEYDSVVSVITDKDISPSLQKVY

Query:  SLLLAPENRIERHSTINPDGSLRSVNLTTHNPVKQSSTV----------NNDPNRRGKIRDRSSIIDDSGTTMVGPNVKICGRFGHTA------------
        SLLL  E++ E  S +  + +L SVN+ T    K + +           N+  N+RG   +  S     G     P  +IC + G++A            
Subjt:  SLLLAPENRIERHSTINPDGSLRSVNLTTHNPVKQSSTV----------NNDPNRRGKIRDRSSIIDDSGTTMVGPNVKICGRFGHTA------------

Query:  -------AH-----------------ATLDLSEIFKWFPDSGASNHVTNDFGNLTIGSEYLGDNKVLVGNGADHPSLSSPPLINEPSPLGVAPLSSSNVS
               +H                 A LDL+    W+PDSGA+NH+T+   NL+IGSEY G N++   NG+  P      +    S L     + +N+ 
Subjt:  -------AH-----------------ATLDLSEIFKWFPDSGASNHVTNDFGNLTIGSEYLGDNKVLVGNGADHPSLSSPPLINEPSPLGVAPLSSSNVS

Query:  LAPSSSHQTSHVSR
          PS +     VS+
Subjt:  LAPSSSHQTSHVSR

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]1.3e-5737.2Show/hide
Query:  SSSTSSKLFHPGNKITTVKLDEENFLLWKLQILTTLRGYGLEDYVNSNAIVPSQLLPS--SGDTLATPILNTEYSHWIRQDSLITAWLLGSMSNSLFSEM
        +SS  +++F  GNKI+ VKL+++ FLLWK QILT L  Y LE+++ S +  PS+ L S  S    AT   N  Y  W RQD LI++WLLGSMS  + ++M
Subjt:  SSSTSSKLFHPGNKITTVKLDEENFLLWKLQILTTLRGYGLEDYVNSNAIVPSQLLPS--SGDTLATPILNTEYSHWIRQDSLITAWLLGSMSNSLFSEM

Query:  LDCKTAREVWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFQKIKNLVDALAAAGLKISHEDHVLHILQGLGSEYDSVVSVITDKDISPSLQKVY
        L CK+A+E+W+ L   FSSR +A+ M  K+KL   KKG + L+EYF KI   VDALA+    +S +DH+L+IL GLGS+Y S++SVI+ +  SPS+Q+V 
Subjt:  LDCKTAREVWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFQKIKNLVDALAAAGLKISHEDHVLHILQGLGSEYDSVVSVITDKDISPSLQKVY

Query:  SLLLAPENRIERHSTINPDGSLRSVNLTTHNPVKQSSTV----------NNDPNRRGKIRDRSSIIDDSGTTMVGPNVKICGRFGHTA------------
        SLLL  E++ E  S +  + +L SVN+ T    K + +           N+  N+RG   +  S     G     P  +IC + G++A            
Subjt:  SLLLAPENRIERHSTINPDGSLRSVNLTTHNPVKQSSTV----------NNDPNRRGKIRDRSSIIDDSGTTMVGPNVKICGRFGHTA------------

Query:  -------AH-----------------ATLDLSEIFKWFPDSGASNHVTNDFGNLTIGSEYLGDNKVLVGNGADHPSLSSPPLINEPSPLGVAPLSSSNVS
               +H                 A LDL+    W+PDSGA+NH+T+   NL+IGSEY G N++   NG+  P      +    S L     + +N+ 
Subjt:  -------AH-----------------ATLDLSEIFKWFPDSGASNHVTNDFGNLTIGSEYLGDNKVLVGNGADHPSLSSPPLINEPSPLGVAPLSSSNVS

Query:  LAPSSSHQTSHVSR
          PS +     VS+
Subjt:  LAPSSSHQTSHVSR

XP_022136882.1 dr1-associated corepressor homolog isoform X1 [Momordica charantia]3.7e-4940.19Show/hide
Query:  IRQDSLITAWLLGSMSNSLFSEMLDCKTAREVWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFQKIKNLVDALAAAGLKISHEDHVLHILQGLG
        ++QD LIT+WL  SM   +  EM+ C TAREVWQ+L   ++SRN+AR+M LKSKLE  KKG L L++YFQK+K LVD+LAAAG K++ EDH++HIL GL 
Subjt:  IRQDSLITAWLLGSMSNSLFSEMLDCKTAREVWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFQKIKNLVDALAAAGLKISHEDHVLHILQGLG

Query:  SEYDSVVSVITDKDISPSLQKVYSLLLAPENRIERHSTINPDGSLRSVNLTTHNPVKQSSTVNNDPNRRGKIRDRSSIIDDSG--------TTMVGPNVK
        SE++S VSVI+ +  + +LQ+VYSLLL+ E R ER+S IN DG+L SVNLT     K S++  +   +R  +++  S   +SG         +   P  +
Subjt:  SEYDSVVSVITDKDISPSLQKVYSLLLAPENRIERHSTINPDGSLRSVNLTTHNPVKQSSTVNNDPNRRGKIRDRSSIIDDSG--------TTMVGPNVK

Query:  ICGRFGHTAAHATLDLSEIF------------------------------------------------------------KWFPDSGASNHVTNDFGNLT
        I G+FGHTA    L   + F                                                             W+PDSGA+NHVT++F NL 
Subjt:  ICGRFGHTAAHATLDLSEIF------------------------------------------------------------KWFPDSGASNHVTNDFGNLT

Query:  IGSEYLGDNKVLVGNG
          +EY GDN+V +GNG
Subjt:  IGSEYLGDNKVLVGNG

XP_022154487.1 uncharacterized protein LOC111021757 [Momordica charantia]9.0e-7241.12Show/hide
Query:  SSDLISSSTSSKLFHPGNKITTVKLDEENFLLWKLQILTTLRGYGLEDYVNSNAIVPSQLLPSSGDTLATPIL--NTEYSHWIRQDSLITAWLLGSMSNS
        +SD      +SK  +PG+K++ V+L+++N LLWK QI T L+G GLE Y++SN   P+Q + ++ D  ++  L  N  Y  WI+QD LI+AWLLGSM+  
Subjt:  SSDLISSSTSSKLFHPGNKITTVKLDEENFLLWKLQILTTLRGYGLEDYVNSNAIVPSQLLPSSGDTLATPIL--NTEYSHWIRQDSLITAWLLGSMSNS

Query:  LFSEMLDCKTAREVWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFQKIKNLVDALAAAGLKISHEDHVLHILQGLGSEYDSVVSVITDKDISPS
        + S+MLDCK+ARE+W VL   F+SR +AR+M LK KLE  KKG L L++YF KIKNLVD+LA AG K+S EDH++HIL GLG E+D+++SVIT +++  +
Subjt:  LFSEMLDCKTAREVWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFQKIKNLVDALAAAGLKISHEDHVLHILQGLGSEYDSVVSVITDKDISPS

Query:  LQKVYSLLLAPENRIERHSTINPDGSLRSVNLTTH-----NPVKQSSTVN---NDPNRRGKIRDRSSIIDDSGTTMVGPNVKICGRFGHTAAHATL----
        LQ+V SLLL  E R ER + IN DGSL SVNLT +     N + QS   N   ++ ++RG+  +  S    + T    P  +ICGRFGHTA    +    
Subjt:  LQKVYSLLLAPENRIERHSTINPDGSLRSVNLTTH-----NPVKQSSTVN---NDPNRRGKIRDRSSIIDDSGTTMVGPNVKICGRFGHTAAHATL----

Query:  ----------------------------------------------------------DLSEIFKWFPDSGASNHVTNDFGNLTIGSEYLGDNKVLVGNG
                                                                  D +    W+ DSG +NHVTN+FGN ++GSEY GD K+ VGNG
Subjt:  ----------------------------------------------------------DLSEIFKWFPDSGASNHVTNDFGNLTIGSEYLGDNKVLVGNG

Query:  ADH---PSLSS
          +   P+ SS
Subjt:  ADH---PSLSS

XP_022156747.1 uncharacterized protein LOC111023586 [Momordica charantia]3.3e-5038.33Show/hide
Query:  KLQILTTLRGYGLEDYVNSNAIVPSQLLPSSGD---TLATPILNTEYSHWIRQDSLITAWLLGSMSNSLFSEMLDCKTAREVWQVLNARFSSRNMARLMD
        K Q+LT ++G+GLE Y++S+   PS+ +  +GD   +  T   N EY HWI+QD LI+ WLLGSMS  + S+MLDC+  +E+W +L   F+SRN+AR+M 
Subjt:  KLQILTTLRGYGLEDYVNSNAIVPSQLLPSSGD---TLATPILNTEYSHWIRQDSLITAWLLGSMSNSLFSEMLDCKTAREVWQVLNARFSSRNMARLMD

Query:  LKSKLETTKKGGLKLEEYFQKIKNLVDALAAAGLKISHEDHVLHILQGLGSEYDSVVSVITDKDISPSLQKVYSLLLAPENRIERHSTINPDGSLRSVNL
        LKSKLE  KKG + L+ YF KIKNLVD+LA AG ++  +DH++HIL  LG E+DS+VSVI+ +    S+Q+       P +    H              
Subjt:  LKSKLETTKKGGLKLEEYFQKIKNLVDALAAAGLKISHEDHVLHILQGLGSEYDSVVSVITDKDISPSLQKVYSLLLAPENRIERHSTINPDGSLRSVNL

Query:  TTHNPVKQSSTVNNDPNRRGKIRDRSSIIDDSGTTMVGPNVKICGRFGHT----AAHATLDLSEIFKWFPDSGASNHVTNDFGNLTIGSEYLGDNKVLVG
            P  QSST  +                 S +T    N  + G  G T    A     D +    W+PDSGA+NHVTNDFGN ++GS+Y G+ K+ VG
Subjt:  TTHNPVKQSSTVNNDPNRRGKIRDRSSIIDDSGTTMVGPNVKICGRFGHT----AAHATLDLSEIFKWFPDSGASNHVTNDFGNLTIGSEYLGDNKVLVG

Query:  NGAD--------------HPSLSSPPLINEPSPLGVAPLSSSNVSLA
        NG +                S SS P+ +  + L V  ++ + +SL+
Subjt:  NGAD--------------HPSLSSPPLINEPSPLGVAPLSSSNVSLA

TrEMBL top hitse value%identityAlignment
A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-946.1e-5837.2Show/hide
Query:  SSSTSSKLFHPGNKITTVKLDEENFLLWKLQILTTLRGYGLEDYVNSNAIVPSQLLPS--SGDTLATPILNTEYSHWIRQDSLITAWLLGSMSNSLFSEM
        +SS  +++F  GNKI+ VKL+++ FLLWK QILT L  Y LE+++ S +  PS+ L S  S    AT   N  Y  W RQD LI++WLLGSMS  + ++M
Subjt:  SSSTSSKLFHPGNKITTVKLDEENFLLWKLQILTTLRGYGLEDYVNSNAIVPSQLLPS--SGDTLATPILNTEYSHWIRQDSLITAWLLGSMSNSLFSEM

Query:  LDCKTAREVWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFQKIKNLVDALAAAGLKISHEDHVLHILQGLGSEYDSVVSVITDKDISPSLQKVY
        L CK+A+E+W+ L   FSSR +A+ M  K+KL   KKG + L+EYF KI   VDALA+    +S +DH+L+IL GLGS+Y S++SVI+ +  SPS+Q+V 
Subjt:  LDCKTAREVWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFQKIKNLVDALAAAGLKISHEDHVLHILQGLGSEYDSVVSVITDKDISPSLQKVY

Query:  SLLLAPENRIERHSTINPDGSLRSVNLTTHNPVKQSSTV----------NNDPNRRGKIRDRSSIIDDSGTTMVGPNVKICGRFGHTA------------
        SLLL  E++ E  S +  + +L SVN+ T    K + +           N+  N+RG   +  S     G     P  +IC + G++A            
Subjt:  SLLLAPENRIERHSTINPDGSLRSVNLTTHNPVKQSSTV----------NNDPNRRGKIRDRSSIIDDSGTTMVGPNVKICGRFGHTA------------

Query:  -------AH-----------------ATLDLSEIFKWFPDSGASNHVTNDFGNLTIGSEYLGDNKVLVGNGADHPSLSSPPLINEPSPLGVAPLSSSNVS
               +H                 A LDL+    W+PDSGA+NH+T+   NL+IGSEY G N++   NG+  P      +    S L     + +N+ 
Subjt:  -------AH-----------------ATLDLSEIFKWFPDSGASNHVTNDFGNLTIGSEYLGDNKVLVGNGADHPSLSSPPLINEPSPLGVAPLSSSNVS

Query:  LAPSSSHQTSHVSR
          PS +     VS+
Subjt:  LAPSSSHQTSHVSR

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-946.1e-5837.2Show/hide
Query:  SSSTSSKLFHPGNKITTVKLDEENFLLWKLQILTTLRGYGLEDYVNSNAIVPSQLLPS--SGDTLATPILNTEYSHWIRQDSLITAWLLGSMSNSLFSEM
        +SS  +++F  GNKI+ VKL+++ FLLWK QILT L  Y LE+++ S +  PS+ L S  S    AT   N  Y  W RQD LI++WLLGSMS  + ++M
Subjt:  SSSTSSKLFHPGNKITTVKLDEENFLLWKLQILTTLRGYGLEDYVNSNAIVPSQLLPS--SGDTLATPILNTEYSHWIRQDSLITAWLLGSMSNSLFSEM

Query:  LDCKTAREVWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFQKIKNLVDALAAAGLKISHEDHVLHILQGLGSEYDSVVSVITDKDISPSLQKVY
        L CK+A+E+W+ L   FSSR +A+ M  K+KL   KKG + L+EYF KI   VDALA+    +S +DH+L+IL GLGS+Y S++SVI+ +  SPS+Q+V 
Subjt:  LDCKTAREVWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFQKIKNLVDALAAAGLKISHEDHVLHILQGLGSEYDSVVSVITDKDISPSLQKVY

Query:  SLLLAPENRIERHSTINPDGSLRSVNLTTHNPVKQSSTV----------NNDPNRRGKIRDRSSIIDDSGTTMVGPNVKICGRFGHTA------------
        SLLL  E++ E  S +  + +L SVN+ T    K + +           N+  N+RG   +  S     G     P  +IC + G++A            
Subjt:  SLLLAPENRIERHSTINPDGSLRSVNLTTHNPVKQSSTV----------NNDPNRRGKIRDRSSIIDDSGTTMVGPNVKICGRFGHTA------------

Query:  -------AH-----------------ATLDLSEIFKWFPDSGASNHVTNDFGNLTIGSEYLGDNKVLVGNGADHPSLSSPPLINEPSPLGVAPLSSSNVS
               +H                 A LDL+    W+PDSGA+NH+T+   NL+IGSEY G N++   NG+  P      +    S L     + +N+ 
Subjt:  -------AH-----------------ATLDLSEIFKWFPDSGASNHVTNDFGNLTIGSEYLGDNKVLVGNGADHPSLSSPPLINEPSPLGVAPLSSSNVS

Query:  LAPSSSHQTSHVSR
          PS +     VS+
Subjt:  LAPSSSHQTSHVSR

A0A6J1C8R2 dr1-associated corepressor homolog isoform X21.8e-4940.19Show/hide
Query:  IRQDSLITAWLLGSMSNSLFSEMLDCKTAREVWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFQKIKNLVDALAAAGLKISHEDHVLHILQGLG
        ++QD LIT+WL  SM   +  EM+ C TAREVWQ+L   ++SRN+AR+M LKSKLE  KKG L L++YFQK+K LVD+LAAAG K++ EDH++HIL GL 
Subjt:  IRQDSLITAWLLGSMSNSLFSEMLDCKTAREVWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFQKIKNLVDALAAAGLKISHEDHVLHILQGLG

Query:  SEYDSVVSVITDKDISPSLQKVYSLLLAPENRIERHSTINPDGSLRSVNLTTHNPVKQSSTVNNDPNRRGKIRDRSSIIDDSG--------TTMVGPNVK
        SE++S VSVI+ +  + +LQ+VYSLLL+ E R ER+S IN DG+L SVNLT     K S++  +   +R  +++  S   +SG         +   P  +
Subjt:  SEYDSVVSVITDKDISPSLQKVYSLLLAPENRIERHSTINPDGSLRSVNLTTHNPVKQSSTVNNDPNRRGKIRDRSSIIDDSG--------TTMVGPNVK

Query:  ICGRFGHTAAHATLDLSEIF------------------------------------------------------------KWFPDSGASNHVTNDFGNLT
        I G+FGHTA    L   + F                                                             W+PDSGA+NHVT++F NL 
Subjt:  ICGRFGHTAAHATLDLSEIF------------------------------------------------------------KWFPDSGASNHVTNDFGNLT

Query:  IGSEYLGDNKVLVGNG
          +EY GDN+V +GNG
Subjt:  IGSEYLGDNKVLVGNG

A0A6J1DLT9 uncharacterized protein LOC1110217574.3e-7241.12Show/hide
Query:  SSDLISSSTSSKLFHPGNKITTVKLDEENFLLWKLQILTTLRGYGLEDYVNSNAIVPSQLLPSSGDTLATPIL--NTEYSHWIRQDSLITAWLLGSMSNS
        +SD      +SK  +PG+K++ V+L+++N LLWK QI T L+G GLE Y++SN   P+Q + ++ D  ++  L  N  Y  WI+QD LI+AWLLGSM+  
Subjt:  SSDLISSSTSSKLFHPGNKITTVKLDEENFLLWKLQILTTLRGYGLEDYVNSNAIVPSQLLPSSGDTLATPIL--NTEYSHWIRQDSLITAWLLGSMSNS

Query:  LFSEMLDCKTAREVWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFQKIKNLVDALAAAGLKISHEDHVLHILQGLGSEYDSVVSVITDKDISPS
        + S+MLDCK+ARE+W VL   F+SR +AR+M LK KLE  KKG L L++YF KIKNLVD+LA AG K+S EDH++HIL GLG E+D+++SVIT +++  +
Subjt:  LFSEMLDCKTAREVWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFQKIKNLVDALAAAGLKISHEDHVLHILQGLGSEYDSVVSVITDKDISPS

Query:  LQKVYSLLLAPENRIERHSTINPDGSLRSVNLTTH-----NPVKQSSTVN---NDPNRRGKIRDRSSIIDDSGTTMVGPNVKICGRFGHTAAHATL----
        LQ+V SLLL  E R ER + IN DGSL SVNLT +     N + QS   N   ++ ++RG+  +  S    + T    P  +ICGRFGHTA    +    
Subjt:  LQKVYSLLLAPENRIERHSTINPDGSLRSVNLTTH-----NPVKQSSTVN---NDPNRRGKIRDRSSIIDDSGTTMVGPNVKICGRFGHTAAHATL----

Query:  ----------------------------------------------------------DLSEIFKWFPDSGASNHVTNDFGNLTIGSEYLGDNKVLVGNG
                                                                  D +    W+ DSG +NHVTN+FGN ++GSEY GD K+ VGNG
Subjt:  ----------------------------------------------------------DLSEIFKWFPDSGASNHVTNDFGNLTIGSEYLGDNKVLVGNG

Query:  ADH---PSLSS
          +   P+ SS
Subjt:  ADH---PSLSS

A0A6J1DSS1 uncharacterized protein LOC1110235861.6e-5038.33Show/hide
Query:  KLQILTTLRGYGLEDYVNSNAIVPSQLLPSSGD---TLATPILNTEYSHWIRQDSLITAWLLGSMSNSLFSEMLDCKTAREVWQVLNARFSSRNMARLMD
        K Q+LT ++G+GLE Y++S+   PS+ +  +GD   +  T   N EY HWI+QD LI+ WLLGSMS  + S+MLDC+  +E+W +L   F+SRN+AR+M 
Subjt:  KLQILTTLRGYGLEDYVNSNAIVPSQLLPSSGD---TLATPILNTEYSHWIRQDSLITAWLLGSMSNSLFSEMLDCKTAREVWQVLNARFSSRNMARLMD

Query:  LKSKLETTKKGGLKLEEYFQKIKNLVDALAAAGLKISHEDHVLHILQGLGSEYDSVVSVITDKDISPSLQKVYSLLLAPENRIERHSTINPDGSLRSVNL
        LKSKLE  KKG + L+ YF KIKNLVD+LA AG ++  +DH++HIL  LG E+DS+VSVI+ +    S+Q+       P +    H              
Subjt:  LKSKLETTKKGGLKLEEYFQKIKNLVDALAAAGLKISHEDHVLHILQGLGSEYDSVVSVITDKDISPSLQKVYSLLLAPENRIERHSTINPDGSLRSVNL

Query:  TTHNPVKQSSTVNNDPNRRGKIRDRSSIIDDSGTTMVGPNVKICGRFGHT----AAHATLDLSEIFKWFPDSGASNHVTNDFGNLTIGSEYLGDNKVLVG
            P  QSST  +                 S +T    N  + G  G T    A     D +    W+PDSGA+NHVTNDFGN ++GS+Y G+ K+ VG
Subjt:  TTHNPVKQSSTVNNDPNRRGKIRDRSSIIDDSGTTMVGPNVKICGRFGHT----AAHATLDLSEIFKWFPDSGASNHVTNDFGNLTIGSEYLGDNKVLVG

Query:  NGAD--------------HPSLSSPPLINEPSPLGVAPLSSSNVSLA
        NG +                S SS P+ +  + L V  ++ + +SL+
Subjt:  NGAD--------------HPSLSSPPLINEPSPLGVAPLSSSNVSLA

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.2e-0522.9Show/hide
Query:  GNKITTVKLDEEN-FLLWKLQILTTLRGYGLEDYVNSNAIVPSQLLPSSGDTLATPILNTEYSHWIRQDSLITAWLLGSMSNSLFSEMLDCKTAREVWQV
        G K    K + +N F  W+ ++   L   GL   ++ ++  P        DT+       +   W   D    + +   +S+ + + ++D  TAR +W  
Subjt:  GNKITTVKLDEEN-FLLWKLQILTTLRGYGLEDYVNSNAIVPSQLLPSSGDTLATPILNTEYSHWIRQDSLITAWLLGSMSNSLFSEMLDCKTAREVWQV

Query:  LNARFSSRNMARLMDLKSKLETTKKG-GLKLEEYFQKIKNLVDALAAAGLKISHEDHVLHILQGLGSEYDSVVSVITDKDISPSLQKVYSLLL-------
        L + + S+ +   + LK +L       G     +      L+  LA  G+KI  ED  + +L  L S YD++ + I     +  L+ V S LL       
Subjt:  LNARFSSRNMARLMDLKSKLETTKKG-GLKLEEYFQKIKNLVDALAAAGLKISHEDHVLHILQGLGSEYDSVVSVITDKDISPSLQKVYSLLL-------

Query:  APENRIERHSTINPDGSLRSVNLTTHN----------PVKQSSTVNNDPN--------------RRGKIRDRSSIIDDSGTTMV--GPNVKICGRFGHTA
         PEN+ +   T   +G  RS   +++N            +  S V N  N              R+GK        DD+   MV    NV +        
Subjt:  APENRIERHSTINPDGSLRSVNLTTHN----------PVKQSSTVNNDPN--------------RRGKIRDRSSIIDDSGTTMV--GPNVKICGRFGHTA

Query:  AHATLDLSEIFKWFPDSGASNHVT-----------NDFGNLTIGS
         H +   SE   W  D+ AS+H T            DFG + +G+
Subjt:  AHATLDLSEIFKWFPDSGASNHVT-----------NDFGNLTIGS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.5e-2926.39Show/hide
Query:  NKITTVKLDEENFLLWKLQILTTLRGYGLEDYVNSNAIVPSQLLPSSGDTLATPILNTEYSHWIRQDSLITAWLLGSMSNSLFSEMLDCKTAREVWQVLN
        N     KL   N+L+W  Q+     GY L  +++ +  +P    P++  T A P +N +Y+ W RQD LI + +LG++S S+   +    TA ++W+ L 
Subjt:  NKITTVKLDEENFLLWKLQILTTLRGYGLEDYVNSNAIVPSQLLPSSGDTLATPILNTEYSHWIRQDSLITAWLLGSMSNSLFSEMLDCKTAREVWQVLN

Query:  ARFSSRNMARLMDLKSKLETTKKGGLKLEEYFQKIKNLVDALAAAGLKISHEDHVLHILQGLGSEYDSVVSVITDKDISPSLQKVYSLLLAPENRI--ER
          +++ +   +  L+++L+   KG   +++Y Q +    D LA  G  + H++ V  +L+ L  EY  V+  I  KD  P+L +++  LL  E++I    
Subjt:  ARFSSRNMARLMDLKSKLETTKKGGLKLEEYFQKIKNLVDALAAAGLKISHEDHVLHILQGLGSEYDSVVSVITDKDISPSLQKVYSLLLAPENRI--ER

Query:  HSTINPDGSLRSVNLTTHNPVKQSSTVNNDPNRRGKIRDRSSIID----DSGTTMVGPN----------VKICGRFGHTA--------------------
         +T+ P     + N  +H     ++  NN+ NR  +  +R++  +       +T   PN           +ICG  GH+A                    
Subjt:  HSTINPDGSLRSVNLTTHNPVKQSSTVNNDPNRRGKIRDRSSIID----DSGTTMVGPN----------VKICGRFGHTA--------------------

Query:  ------AHATLDLSEIF---KWFPDSGASNHVTNDFGNLTIGSEYLGDNKVLVGNGADHP
                A L L   +    W  DSGA++H+T+DF NL++   Y G + V+V +G+  P
Subjt:  ------AHATLDLSEIF---KWFPDSGASNHVTNDFGNLTIGSEYLGDNKVLVGNGADHP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE26.8e-2224.23Show/hide
Query:  NKITTVKLDEENFLLWKLQILTTLRGYGLEDYVNSNAIVPSQLLPSSGDTLATPILNTEYSHWIRQDSLITAWLLGSMSNSLFSEMLDCKTAREVWQVLN
        N     KL   N+L+W  Q+     GY L  +++ +  +P    P++  T A P +N +Y+ W RQD LI + +LG++S S+   +    TA ++W+ L 
Subjt:  NKITTVKLDEENFLLWKLQILTTLRGYGLEDYVNSNAIVPSQLLPSSGDTLATPILNTEYSHWIRQDSLITAWLLGSMSNSLFSEMLDCKTAREVWQVLN

Query:  ARFSSRNMARLMDLKSKLETTKKGGLKLEEYFQKIKNLVDALAAAGLKISHEDHVLHILQGLGSEYDSVVSVITDKDISPSLQKVYSLLLAPENRIERHS
          +++ +   +  L                   +     D LA  G  + H++ V  +L+ L  +Y  V+  I  KD  PSL +++      E  I R S
Subjt:  ARFSSRNMARLMDLKSKLETTKKGGLKLEEYFQKIKNLVDALAAAGLKISHEDHVLHILQGLGSEYDSVVSVITDKDISPSLQKVYSLLLAPENRIERHS

Query:  TINPDGSLRSVNLTTHNPVKQSSTVNNDPNRRGKIR------DRSSIIDDSGTTMVGPN---------VKICGRFGHTA---------------------
         +    S   V +T +    +++  N + N RG  R      +RS+    S +     N          +IC   GH+A                     
Subjt:  TINPDGSLRSVNLTTHNPVKQSSTVNNDPNRRGKIR------DRSSIIDDSGTTMVGPN---------VKICGRFGHTA---------------------

Query:  -----AHATLDLSEIF---KWFPDSGASNHVTNDFGNLTIGSEYLGDNKVLVGNGADHP
               A L ++  +    W  DSGA++H+T+DF NL+    Y G + V++ +G+  P
Subjt:  -----AHATLDLSEIF---KWFPDSGASNHVTNDFGNLTIGSEYLGDNKVLVGNGADHP

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).5.2e-0923.61Show/hide
Query:  LFHPGN-KITTVKLDEENFLLWKLQILTTLRGYGLEDYVNSNAIVPSQLLPSSGDTLATPILNTEYSHWIRQDSLITAWLLGSMSNSLFSEMLDCKTARE
        + HP +  I  +  DE+N++ WK++  + LR      +++     P    P              Y  W + ++++  WL+ SM++ L   ++  +TA +
Subjt:  LFHPGN-KITTVKLDEENFLLWKLQILTTLRGYGLEDYVNSNAIVPSQLLPSSGDTLATPILNTEYSHWIRQDSLITAWLLGSMSNSLFSEMLDCKTARE

Query:  VWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFQKIKNL
        +W+ L   F      ++  L+ +L T ++GG  +EEYF K+  +
Subjt:  VWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFQKIKNL

AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)2.2e-1523.29Show/hide
Query:  LDEENFLLWKLQILTTLRGYGLEDYVNSNAIVPSQLLPSSGDTLATPILNTEYSHWIRQDSLITAWLLGSMSNSLF-SEMLDCKTAREVWQVLNARFSSR
        ++E N+  W+   LT    + +  +++        LLP++ + +          +W ++D ++   L G+++   F    +   T+R++W  +  +F + 
Subjt:  LDEENFLLWKLQILTTLRGYGLEDYVNSNAIVPSQLLPSSGDTLATPILNTEYSHWIRQDSLITAWLLGSMSNSLF-SEMLDCKTAREVWQVLNARFSSR

Query:  NMARLMDLKSKLETTKKGGLKLEEYFQKIKNLVDALAAAGLKISHEDHVLHILQGLGSEYDSVVSVITDKDISPSLQKVYSLLLAPENRIERHSTINPDG
          AR + L S+L T   G +++ +Y++K+K L D+L    + ++  + V+++L GL  ++D++++VI  +   PS     ++L   E+R++R    NP  
Subjt:  NMARLMDLKSKLETTKKGGLKLEEYFQKIKNLVDALAAAGLKISHEDHVLHILQGLGSEYDSVVSVITDKDISPSLQKVYSLLLAPENRIERHSTINPDG

Query:  SLRSVNLTTHNPVKQSSTV
                TH     SSTV
Subjt:  SLRSVNLTTHNPVKQSSTV

AT3G21000.1 Gag-Pol-related retrotransposon family protein3.8e-0423.3Show/hide
Query:  DEENFLLWKLQILTTLRGYGLEDYVNSNAIVPSQLLPSSGDTLATPILNTEYSHW---IRQDSLITAWLLGSMSNSLFSEMLDCKTAREVWQVLNARFSS
        D+ ++ +W     +TL   GL D V +   VP    PS    LA  I   E S W   + +D+     L  S+++S+F + L   +A++VW +L      
Subjt:  DEENFLLWKLQILTTLRGYGLEDYVNSNAIVPSQLLPSSGDTLATPILNTEYSHW---IRQDSLITAWLLGSMSNSLFSEMLDCKTAREVWQVLNARFSS

Query:  RNMARLMDLKSKLETTKKGGLKL------EEYFQKIKNLVDALAAAGLKISHEDHVLHILQGLGSEYDSVVSVITD
          + RL  +  +    +   LK+        Y  K   +++ L  A L+ S  +   ++   L   +D + S++ +
Subjt:  RNMARLMDLKSKLETTKKGGLKL------EEYFQKIKNLVDALAAAGLKISHEDHVLHILQGLGSEYDSVVSVITD

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)1.0e-1729.44Show/hide
Query:  TVKLDEENFLLWKLQILTTLRGYGLEDYVNSNAIVPSQLLPSSGDTLATPILNTEYSHWIRQDSLITAWLLGSMSNSLFSEMLDCK-TAREVWQVLNARF
        T+ L++ N+ +W+    T    +G+  ++               D  +TP   TE   W  +D L+  W+ G++++SL   ++    TAR++W  L   F
Subjt:  TVKLDEENFLLWKLQILTTLRGYGLEDYVNSNAIVPSQLLPSSGDTLATPILNTEYSHWIRQDSLITAWLLGSMSNSLFSEMLDCK-TAREVWQVLNARF

Query:  SSRNMARLMDLKSKLETTKKGGLKLEEYFQKIKNLVDALAAAGLKISHEDHVLHILQGLGSEYDSVVSVITDKDISPSLQKVYSLLLAPENRIERHS
             AR +  +++L TT    L + EY QK+K+L D L      IS    V+H+L GL  +YD +++VI  K   PS  +  S+LL  E+R+   S
Subjt:  SSRNMARLMDLKSKLETTKKGGLKLEEYFQKIKNLVDALAAAGLKISHEDHVLHILQGLGSEYDSVVSVITDKDISPSLQKVYSLLLAPENRIERHS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTACTCGTGAAGAAACTTCTTCAGATTTGATTTCTTCATCGACATCGTCCAAACTCTTTCATCCAGGGAATAAAATCACCACAGTAAAACTTGATGAGGAGAATTT
TCTTCTCTGGAAACTTCAAATTCTTACCACTCTCAGAGGCTATGGCTTGGAGGATTATGTCAATTCGAATGCAATTGTTCCGTCACAACTCCTTCCTTCTTCGGGTGATA
CACTGGCAACTCCGATTCTCAATACTGAGTACTCTCACTGGATTCGTCAAGATAGTTTGATTACGGCGTGGCTCTTGGGTTCTATGTCCAATTCTCTCTTCTCAGAAATG
TTAGACTGCAAGACTGCTAGGGAGGTATGGCAAGTTTTGAATGCTCGGTTTTCTTCACGAAATATGGCGCGATTAATGGATTTGAAATCCAAACTCGAAACAACGAAGAA
AGGAGGTCTGAAACTTGAAGAGTATTTTCAAAAGATTAAAAATCTGGTTGATGCCTTGGCTGCAGCTGGACTCAAAATTTCGCATGAGGATCATGTATTACATATTTTGC
AAGGCTTAGGTTCTGAGTATGATTCTGTAGTCTCTGTAATTACTGATAAAGATATCTCTCCCTCCTTACAGAAAGTTTATTCACTTTTGCTGGCTCCAGAAAATAGAATT
GAACGTCACTCTACCATCAATCCCGATGGTTCTCTGCGTTCAGTAAACCTTACCACTCACAATCCAGTCAAACAGTCATCTACAGTGAACAATGATCCAAATCGAAGAGG
GAAAATTCGGGACAGAAGTTCAATAATCGACGATTCTGGAACAACAATGGTAGGGCCCAATGTCAAAATATGTGGACGCTTTGGCCACACTGCTGCACATGCTACTTTAG
ATTTGAGCGAAATTTTCAAGTGGTTCCCAGATTCTGGTGCGTCGAACCATGTTACGAATGATTTTGGCAATTTAACAATTGGATCTGAGTATCTTGGAGATAACAAAGTT
TTGGTCGGCAATGGTGCAGACCACCCATCACTTTCATCTCCGCCTTTAATAAATGAACCTTCACCATTGGGAGTTGCTCCTTTGTCCTCTTCTAATGTTTCTTTAGCTCC
TTCTTCAAGTCATCAGACTTCACATGTTTCACGTGATATTGCCCAATCTTCACCTGAAATTTCACCTTGCTTGGGTCATATATTACAAACATCTCTTCCTGCATCTGTCT
ATTCTCCATCTGCACAAGATTCGTTGCCATCTCCCATTCTTTCTCCTCCACTAGCTGCTTCGACATCTGCTTCTCCAACAAATGGTGGTAACTAG
mRNA sequenceShow/hide mRNA sequence
ATGACTACTCGTGAAGAAACTTCTTCAGATTTGATTTCTTCATCGACATCGTCCAAACTCTTTCATCCAGGGAATAAAATCACCACAGTAAAACTTGATGAGGAGAATTT
TCTTCTCTGGAAACTTCAAATTCTTACCACTCTCAGAGGCTATGGCTTGGAGGATTATGTCAATTCGAATGCAATTGTTCCGTCACAACTCCTTCCTTCTTCGGGTGATA
CACTGGCAACTCCGATTCTCAATACTGAGTACTCTCACTGGATTCGTCAAGATAGTTTGATTACGGCGTGGCTCTTGGGTTCTATGTCCAATTCTCTCTTCTCAGAAATG
TTAGACTGCAAGACTGCTAGGGAGGTATGGCAAGTTTTGAATGCTCGGTTTTCTTCACGAAATATGGCGCGATTAATGGATTTGAAATCCAAACTCGAAACAACGAAGAA
AGGAGGTCTGAAACTTGAAGAGTATTTTCAAAAGATTAAAAATCTGGTTGATGCCTTGGCTGCAGCTGGACTCAAAATTTCGCATGAGGATCATGTATTACATATTTTGC
AAGGCTTAGGTTCTGAGTATGATTCTGTAGTCTCTGTAATTACTGATAAAGATATCTCTCCCTCCTTACAGAAAGTTTATTCACTTTTGCTGGCTCCAGAAAATAGAATT
GAACGTCACTCTACCATCAATCCCGATGGTTCTCTGCGTTCAGTAAACCTTACCACTCACAATCCAGTCAAACAGTCATCTACAGTGAACAATGATCCAAATCGAAGAGG
GAAAATTCGGGACAGAAGTTCAATAATCGACGATTCTGGAACAACAATGGTAGGGCCCAATGTCAAAATATGTGGACGCTTTGGCCACACTGCTGCACATGCTACTTTAG
ATTTGAGCGAAATTTTCAAGTGGTTCCCAGATTCTGGTGCGTCGAACCATGTTACGAATGATTTTGGCAATTTAACAATTGGATCTGAGTATCTTGGAGATAACAAAGTT
TTGGTCGGCAATGGTGCAGACCACCCATCACTTTCATCTCCGCCTTTAATAAATGAACCTTCACCATTGGGAGTTGCTCCTTTGTCCTCTTCTAATGTTTCTTTAGCTCC
TTCTTCAAGTCATCAGACTTCACATGTTTCACGTGATATTGCCCAATCTTCACCTGAAATTTCACCTTGCTTGGGTCATATATTACAAACATCTCTTCCTGCATCTGTCT
ATTCTCCATCTGCACAAGATTCGTTGCCATCTCCCATTCTTTCTCCTCCACTAGCTGCTTCGACATCTGCTTCTCCAACAAATGGTGGTAACTAG
Protein sequenceShow/hide protein sequence
MTTREETSSDLISSSTSSKLFHPGNKITTVKLDEENFLLWKLQILTTLRGYGLEDYVNSNAIVPSQLLPSSGDTLATPILNTEYSHWIRQDSLITAWLLGSMSNSLFSEM
LDCKTAREVWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFQKIKNLVDALAAAGLKISHEDHVLHILQGLGSEYDSVVSVITDKDISPSLQKVYSLLLAPENRI
ERHSTINPDGSLRSVNLTTHNPVKQSSTVNNDPNRRGKIRDRSSIIDDSGTTMVGPNVKICGRFGHTAAHATLDLSEIFKWFPDSGASNHVTNDFGNLTIGSEYLGDNKV
LVGNGADHPSLSSPPLINEPSPLGVAPLSSSNVSLAPSSSHQTSHVSRDIAQSSPEISPCLGHILQTSLPASVYSPSAQDSLPSPILSPPLAASTSASPTNGGN