; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g14030 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g14030
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase
Genome locationchr9:11967039..11975442
RNA-Seq ExpressionMoc09g14030
SyntenyMoc09g14030
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR005162 - Retrotransposon gag domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022155000.1 uncharacterized protein LOC111022144 [Momordica charantia]1.9e-15590.79Show/hide
Query:  LHTPQSEARFIKDFKRYEPPTFDSESERATAAEEWIRELEALYAYLGCEDQFKVKGAVFMLRGEALNWWDSIAAAEDHANVPIPWARFKDLLYDYYYPET
        LH  +SEA FIKDFKRY PPTFD ESERATAAEEWIRELEA YAYLGCEDQFKVKGAVFMLRGEALNWWDSIAAAEDHANV IPWARFKDLLYDYYY ET
Subjt:  LHTPQSEARFIKDFKRYEPPTFDSESERATAAEEWIRELEALYAYLGCEDQFKVKGAVFMLRGEALNWWDSIAAAEDHANVPIPWARFKDLLYDYYYPET

Query:  VKDMKEAEFLHLVQGTLSVAQYERKFAELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGVKRK
        VKDMKEAEFLHLVQGTLSVAQYERKF ELSRFALELI   A+KIKRFVKGL KGIRGPVDLQRP +YAEAVRGAL+MDKDVSNKA  LPEVGSSSGVKRK
Subjt:  VKDMKEAEFLHLVQGTLSVAQYERKFAELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGVKRK

Query:  FPPTYADPVLRAPQRQAQHQDMPPVCPTCQKRHTGQCWTGSKGCFRCGRERHFARECPMSAVNTQRLGQRISPSVSTQGNNQRARVFALTRKEAADAETV
        F PTYADP LRAPQ QAQH+ MPPVCPTCQKRH GQCWTGSKGCFRCGRERHFARECPMSA NTQRLGQRISP+VSTQGNNQRARVFALTRKEAADAETV
Subjt:  FPPTYADPVLRAPQRQAQHQDMPPVCPTCQKRHTGQCWTGSKGCFRCGRERHFARECPMSAVNTQRLGQRISPSVSTQGNNQRARVFALTRKEAADAETV

Query:  VTGN
        VTGN
Subjt:  VTGN

XP_022156326.1 uncharacterized protein LOC111023247 [Momordica charantia]8.2e-18375.77Show/hide
Query:  MPPRRSMRLRADADPAPGGENGADPPPPPVGNQAGVVPPFPPPAAAQERADPPVPPAVPQVNPQLVLLVEALQAVINNAAGVGGVQAPPPQHLHTPQSEA
        MPPR SMRLRADADPAPG                                                              GVGGVQAPPPQHLHTPQSEA
Subjt:  MPPRRSMRLRADADPAPGGENGADPPPPPVGNQAGVVPPFPPPAAAQERADPPVPPAVPQVNPQLVLLVEALQAVINNAAGVGGVQAPPPQHLHTPQSEA

Query:  RFIKDFKRYEPPTFDSESERATAAEEWIRELEALYAYLGCEDQFKVKGAVFMLRGEALNWWDSIAAAEDHANVPIPWARFKDLLYDYYYPETVKDMKEAE
        RFIKDFKRY PPTFD ESERATA EEWIRELEALYAYLGCEDQFKVKGAVFMLRGEALNWWDS+AAAED+ANVPIPWARFK+LLYDYYYPETVKDMKEAE
Subjt:  RFIKDFKRYEPPTFDSESERATAAEEWIRELEALYAYLGCEDQFKVKGAVFMLRGEALNWWDSIAAAEDHANVPIPWARFKDLLYDYYYPETVKDMKEAE

Query:  FLHLVQGTLSVAQYERKFAELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGVKRKFPPTYADP
        FLHLVQGTLSVAQYERKF ELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGVKRKFP TYAD 
Subjt:  FLHLVQGTLSVAQYERKFAELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGVKRKFPPTYADP

Query:  VLRAPQRQAQHQDMPPVCPTCQKRHTGQCWTGSKGCFRCGRERHFARECPMSAVNTQRLGQRISPSVSTQGNNQRARVFALTRKEAADAETVVTG-----
        VLRAPQRQAQHQ MPPVCPTCQKRHTGQCWTGSKGCFRCGRE HFARECPMSA NTQRLGQRI P VSTQGNNQRARVFALTRKEAADAETVVTG     
Subjt:  VLRAPQRQAQHQDMPPVCPTCQKRHTGQCWTGSKGCFRCGRERHFARECPMSAVNTQRLGQRISPSVSTQGNNQRARVFALTRKEAADAETVVTG-----

Query:  -------------------------NLELEPLGFLLSVSTPSGSVLIGSQKVRA
                                  LELEPLGFLLSVSTPSGS+LI SQKVRA
Subjt:  -------------------------NLELEPLGFLLSVSTPSGSVLIGSQKVRA

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia]8.5e-14852.71Show/hide
Query:  RRSMRLRADADPAPGGENGADPPPPPVGNQAGVVPPFPPPAAAQERADPPVPPAVPQVNPQLVLLVEALQAVINNAAGVGGVQAPPPQHLHTPQSEARFI
        RR+ R     DP   GEN ADP   PV    GVVPP P  A          P  VPQVNPQ+ LL EALQ +++NA G GG Q   P+    PQ E +FI
Subjt:  RRSMRLRADADPAPGGENGADPPPPPVGNQAGVVPPFPPPAAAQERADPPVPPAVPQVNPQLVLLVEALQAVINNAAGVGGVQAPPPQHLHTPQSEARFI

Query:  KDFKRYEPPTFDSESERATAAEEWIRELEALYAYLGCEDQFKVKGAVFMLRGEALNWWDSIAAAEDHANVPIPWARFKDLLYDYYYPETVKDMKEAEFLH
        +DFK + PP F+  SER TAAEEW+RELEALY YLGC D FKV+GAVFMLRGEA+NWW+S+AAAEDHANVP+ WARFKDLLY+YY+P   ++ K  EFL 
Subjt:  KDFKRYEPPTFDSESERATAAEEWIRELEALYAYLGCEDQFKVKGAVFMLRGEALNWWDSIAAAEDHANVPIPWARFKDLLYDYYYPETVKDMKEAEFLH

Query:  LVQGTLSVAQYERKFAELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGVKRKFPPTYADPVLR
        L QG+L+VAQYERKF ELSRF  + +PTE LKI +F+ GLR+ I+G + L+ PTTYA AVR ALVMDK +    S    +GS+SGVKRKF    A    R
Subjt:  LVQGTLSVAQYERKFAELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGVKRKFPPTYADPVLR

Query:  APQRQAQHQDMPPVCPTCQKRHTGQCWTGSKGCFRCGRERHFARECPMSAVNTQRLGQRISPSVSTQGNNQRARVFALTRKEAADAETVVTG--------
          Q  AQ Q  PPVCP+C+K H   CW G K CF+C +E HF REC M+  NTQ L Q+   + +TQG  Q ARVFALTR +   AE VVTG        
Subjt:  APQRQAQHQDMPPVCPTCQKRHTGQCWTGSKGCFRCGRERHFARECPMSAVNTQRLGQRISPSVSTQGNNQRARVFALTRKEAADAETVVTG--------

Query:  ----------------------NLELEPLGFLLSVSTPSGSVLIGSQKVRA-------------------------------ATNQANINCSRREVSFQL
                              +LELE  GF LSVSTPSGSVL+ SQ V+                                A N+ANINCS++EVSF L
Subjt:  ----------------------NLELEPLGFLLSVSTPSGSVLIGSQKVRA-------------------------------ATNQANINCSRREVSFQL

Query:  PSAPNFTFKGVTGRVPRTVSALKAKRLLQNGAWGYLANVVDISKTPPSIDSVHVVREFPDVFPDDLPGLSP
         S  NFTFKGV   VPR VSALKA  LLQ G W YLA+VVD  K  PSI+ V VV EF DVFP+DLPGL P
Subjt:  PSAPNFTFKGVTGRVPRTVSALKAKRLLQNGAWGYLANVVDISKTPPSIDSVHVVREFPDVFPDDLPGLSP

XP_022158750.1 uncharacterized protein LOC111025215 [Momordica charantia]7.0e-13450.82Show/hide
Query:  RRSMRLRADADPAPGGENGADPPPPPVGNQAGVVPPFPPPAAAQERADPPVPPAVPQVNPQLVLLVEALQAVINNAAGVGGVQAPPPQHLHTPQSEARFI
        RR+ R     DP P GE  ADP  PP     GV PP P  A+            VPQVNPQ+ LL EALQ +++NA G GG Q   P+    PQ E    
Subjt:  RRSMRLRADADPAPGGENGADPPPPPVGNQAGVVPPFPPPAAAQERADPPVPPAVPQVNPQLVLLVEALQAVINNAAGVGGVQAPPPQHLHTPQSEARFI

Query:  KDFKRYEPPTFDSESERATAAEEWIRELEALYAYLGCEDQFKVKGAVFMLRGEALNWWDSIAAAEDHANVPIPWARFKDLLYDYYYPETVKDMKEAEFLH
                      SER TAAEEW+RELEALY YLGC D FKV+GAVFMLRGEA+NWW+S+AAAEDHANVP+ WARFKDLLY+YY+P TV++ K  EFL 
Subjt:  KDFKRYEPPTFDSESERATAAEEWIRELEALYAYLGCEDQFKVKGAVFMLRGEALNWWDSIAAAEDHANVPIPWARFKDLLYDYYYPETVKDMKEAEFLH

Query:  LVQGTLSVAQYERKFAELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGVKRKFPPTYADPVLR
        L QG+L+VA+YERKF ELSRF ++ IPT+ LKI +F+ GLR+ I+G + L+ PTTYA AVR ALVMDK +    S    +GSSSGVKRKF    +    R
Subjt:  LVQGTLSVAQYERKFAELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGVKRKFPPTYADPVLR

Query:  APQRQAQHQDMPPVCPTCQKRHTGQCWTGSKGCFRCGRERHFARECPMSAVNTQRLGQRISPSVSTQGNNQRARVFALTRKEAADAETVVT---------
          Q   Q Q  PPVCP+C+K H G CW G + C+RC +E HFARECPM+  NTQ LGQRI  + + QG   RARVFALTR +   AE VVT         
Subjt:  APQRQAQHQDMPPVCPTCQKRHTGQCWTGSKGCFRCGRERHFARECPMSAVNTQRLGQRISPSVSTQGNNQRARVFALTRKEAADAETVVT---------

Query:  ---------------------GNLELEPLGFLLSVSTPSGSVLIGSQKVRA-------------------------------ATNQANINCSRREVSFQL
                              +LELE LGFLLSVSTPSGSVL+ SQ V+                                A N+ANI+CS+++VSF+L
Subjt:  ---------------------GNLELEPLGFLLSVSTPSGSVLIGSQKVRA-------------------------------ATNQANINCSRREVSFQL

Query:  PSAPNFTFKGVTGRVPRTVSALKAKRLLQNGAWGYLANVVDISKTPPSIDS
        PS  NFTFKGV   VPR V ALKA  LLQ GAW YLA+VVD  K  PSI++
Subjt:  PSAPNFTFKGVTGRVPRTVSALKAKRLLQNGAWGYLANVVDISKTPPSIDS

XP_022159077.1 uncharacterized protein LOC111025517 [Momordica charantia]9.4e-15569.23Show/hide
Query:  YLGCEDQFKVKGAVFMLRGEALNWWDSIAAAEDHANVPIPWARFKDLLYDYYYPETVKDMKEAEFLHLVQGTLSVAQYERKFAELSRFALELIPTEALKI
        YL CE+QFKVKG VFMLRGEALNWWDS+A AEDHANVPI WARFKDLLYDYYYP+T+KDMKEAEFLH   GTL+VAQYERKF ELS FA ELIPTEA+KI
Subjt:  YLGCEDQFKVKGAVFMLRGEALNWWDSIAAAEDHANVPIPWARFKDLLYDYYYPETVKDMKEAEFLHLVQGTLSVAQYERKFAELSRFALELIPTEALKI

Query:  KRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGVKRKFPPTYADPVLRAPQRQAQHQDMPPVCPTCQKRHTGQCWTGSKGC
        KRFVKGLRKGIRGPVDLQRP TYAEAVRG L+MD DVSN   PL EVGSSSGVKRK  P YAD   RAPQR AQ Q +PPVCP+CQKR  GQCWTG++GC
Subjt:  KRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGVKRKFPPTYADPVLRAPQRQAQHQDMPPVCPTCQKRHTGQCWTGSKGC

Query:  FRCGRERHFARECPMSAVNTQRLGQRISPSVSTQGNNQRA-RVFALTRKEAADAETVVT------GNLELEPLGFLLSVSTPSGSVLIGSQKVRA-----
        FRCGRE HFAREC M+A NTQRLGQR +P+VSTQG       V A    +   + T ++        LELEPLGFLLSVSTPSGSVLI SQ VRA     
Subjt:  FRCGRERHFARECPMSAVNTQRLGQRISPSVSTQGNNQRA-RVFALTRKEAADAETVVT------GNLELEPLGFLLSVSTPSGSVLIGSQKVRA-----

Query:  --------------------------ATNQANINCSRREVSFQLPSAPNFTFKGVTGRVPRTVSALKAKRLLQNGAWGYLANVVDISKTPPSIDSVHVVR
                                  ATNQANINCS+REVSFQLPS  +FTFKGV+G VPR VSALKA+RLL NGAW YLA+VVDIS TPPSIDS HVV+
Subjt:  --------------------------ATNQANINCSRREVSFQLPSAPNFTFKGVTGRVPRTVSALKAKRLLQNGAWGYLANVVDISKTPPSIDSVHVVR

Query:  EFPDVFPDDLPGLSPV
         F DVFP+DL GL P+
Subjt:  EFPDVFPDDLPGLSPV

TrEMBL top hitse value%identityAlignment
A0A6J1DL73 uncharacterized protein LOC1110221449.2e-15690.79Show/hide
Query:  LHTPQSEARFIKDFKRYEPPTFDSESERATAAEEWIRELEALYAYLGCEDQFKVKGAVFMLRGEALNWWDSIAAAEDHANVPIPWARFKDLLYDYYYPET
        LH  +SEA FIKDFKRY PPTFD ESERATAAEEWIRELEA YAYLGCEDQFKVKGAVFMLRGEALNWWDSIAAAEDHANV IPWARFKDLLYDYYY ET
Subjt:  LHTPQSEARFIKDFKRYEPPTFDSESERATAAEEWIRELEALYAYLGCEDQFKVKGAVFMLRGEALNWWDSIAAAEDHANVPIPWARFKDLLYDYYYPET

Query:  VKDMKEAEFLHLVQGTLSVAQYERKFAELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGVKRK
        VKDMKEAEFLHLVQGTLSVAQYERKF ELSRFALELI   A+KIKRFVKGL KGIRGPVDLQRP +YAEAVRGAL+MDKDVSNKA  LPEVGSSSGVKRK
Subjt:  VKDMKEAEFLHLVQGTLSVAQYERKFAELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGVKRK

Query:  FPPTYADPVLRAPQRQAQHQDMPPVCPTCQKRHTGQCWTGSKGCFRCGRERHFARECPMSAVNTQRLGQRISPSVSTQGNNQRARVFALTRKEAADAETV
        F PTYADP LRAPQ QAQH+ MPPVCPTCQKRH GQCWTGSKGCFRCGRERHFARECPMSA NTQRLGQRISP+VSTQGNNQRARVFALTRKEAADAETV
Subjt:  FPPTYADPVLRAPQRQAQHQDMPPVCPTCQKRHTGQCWTGSKGCFRCGRERHFARECPMSAVNTQRLGQRISPSVSTQGNNQRARVFALTRKEAADAETV

Query:  VTGN
        VTGN
Subjt:  VTGN

A0A6J1DQB9 Reverse transcriptase4.1e-14852.71Show/hide
Query:  RRSMRLRADADPAPGGENGADPPPPPVGNQAGVVPPFPPPAAAQERADPPVPPAVPQVNPQLVLLVEALQAVINNAAGVGGVQAPPPQHLHTPQSEARFI
        RR+ R     DP   GEN ADP   PV    GVVPP P  A          P  VPQVNPQ+ LL EALQ +++NA G GG Q   P+    PQ E +FI
Subjt:  RRSMRLRADADPAPGGENGADPPPPPVGNQAGVVPPFPPPAAAQERADPPVPPAVPQVNPQLVLLVEALQAVINNAAGVGGVQAPPPQHLHTPQSEARFI

Query:  KDFKRYEPPTFDSESERATAAEEWIRELEALYAYLGCEDQFKVKGAVFMLRGEALNWWDSIAAAEDHANVPIPWARFKDLLYDYYYPETVKDMKEAEFLH
        +DFK + PP F+  SER TAAEEW+RELEALY YLGC D FKV+GAVFMLRGEA+NWW+S+AAAEDHANVP+ WARFKDLLY+YY+P   ++ K  EFL 
Subjt:  KDFKRYEPPTFDSESERATAAEEWIRELEALYAYLGCEDQFKVKGAVFMLRGEALNWWDSIAAAEDHANVPIPWARFKDLLYDYYYPETVKDMKEAEFLH

Query:  LVQGTLSVAQYERKFAELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGVKRKFPPTYADPVLR
        L QG+L+VAQYERKF ELSRF  + +PTE LKI +F+ GLR+ I+G + L+ PTTYA AVR ALVMDK +    S    +GS+SGVKRKF    A    R
Subjt:  LVQGTLSVAQYERKFAELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGVKRKFPPTYADPVLR

Query:  APQRQAQHQDMPPVCPTCQKRHTGQCWTGSKGCFRCGRERHFARECPMSAVNTQRLGQRISPSVSTQGNNQRARVFALTRKEAADAETVVTG--------
          Q  AQ Q  PPVCP+C+K H   CW G K CF+C +E HF REC M+  NTQ L Q+   + +TQG  Q ARVFALTR +   AE VVTG        
Subjt:  APQRQAQHQDMPPVCPTCQKRHTGQCWTGSKGCFRCGRERHFARECPMSAVNTQRLGQRISPSVSTQGNNQRARVFALTRKEAADAETVVTG--------

Query:  ----------------------NLELEPLGFLLSVSTPSGSVLIGSQKVRA-------------------------------ATNQANINCSRREVSFQL
                              +LELE  GF LSVSTPSGSVL+ SQ V+                                A N+ANINCS++EVSF L
Subjt:  ----------------------NLELEPLGFLLSVSTPSGSVLIGSQKVRA-------------------------------ATNQANINCSRREVSFQL

Query:  PSAPNFTFKGVTGRVPRTVSALKAKRLLQNGAWGYLANVVDISKTPPSIDSVHVVREFPDVFPDDLPGLSP
         S  NFTFKGV   VPR VSALKA  LLQ G W YLA+VVD  K  PSI+ V VV EF DVFP+DLPGL P
Subjt:  PSAPNFTFKGVTGRVPRTVSALKAKRLLQNGAWGYLANVVDISKTPPSIDSVHVVREFPDVFPDDLPGLSP

A0A6J1DUM2 uncharacterized protein LOC1110232474.0e-18375.77Show/hide
Query:  MPPRRSMRLRADADPAPGGENGADPPPPPVGNQAGVVPPFPPPAAAQERADPPVPPAVPQVNPQLVLLVEALQAVINNAAGVGGVQAPPPQHLHTPQSEA
        MPPR SMRLRADADPAPG                                                              GVGGVQAPPPQHLHTPQSEA
Subjt:  MPPRRSMRLRADADPAPGGENGADPPPPPVGNQAGVVPPFPPPAAAQERADPPVPPAVPQVNPQLVLLVEALQAVINNAAGVGGVQAPPPQHLHTPQSEA

Query:  RFIKDFKRYEPPTFDSESERATAAEEWIRELEALYAYLGCEDQFKVKGAVFMLRGEALNWWDSIAAAEDHANVPIPWARFKDLLYDYYYPETVKDMKEAE
        RFIKDFKRY PPTFD ESERATA EEWIRELEALYAYLGCEDQFKVKGAVFMLRGEALNWWDS+AAAED+ANVPIPWARFK+LLYDYYYPETVKDMKEAE
Subjt:  RFIKDFKRYEPPTFDSESERATAAEEWIRELEALYAYLGCEDQFKVKGAVFMLRGEALNWWDSIAAAEDHANVPIPWARFKDLLYDYYYPETVKDMKEAE

Query:  FLHLVQGTLSVAQYERKFAELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGVKRKFPPTYADP
        FLHLVQGTLSVAQYERKF ELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGVKRKFP TYAD 
Subjt:  FLHLVQGTLSVAQYERKFAELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGVKRKFPPTYADP

Query:  VLRAPQRQAQHQDMPPVCPTCQKRHTGQCWTGSKGCFRCGRERHFARECPMSAVNTQRLGQRISPSVSTQGNNQRARVFALTRKEAADAETVVTG-----
        VLRAPQRQAQHQ MPPVCPTCQKRHTGQCWTGSKGCFRCGRE HFARECPMSA NTQRLGQRI P VSTQGNNQRARVFALTRKEAADAETVVTG     
Subjt:  VLRAPQRQAQHQDMPPVCPTCQKRHTGQCWTGSKGCFRCGRERHFARECPMSAVNTQRLGQRISPSVSTQGNNQRARVFALTRKEAADAETVVTG-----

Query:  -------------------------NLELEPLGFLLSVSTPSGSVLIGSQKVRA
                                  LELEPLGFLLSVSTPSGS+LI SQKVRA
Subjt:  -------------------------NLELEPLGFLLSVSTPSGSVLIGSQKVRA

A0A6J1DWP4 uncharacterized protein LOC1110252153.4e-13450.82Show/hide
Query:  RRSMRLRADADPAPGGENGADPPPPPVGNQAGVVPPFPPPAAAQERADPPVPPAVPQVNPQLVLLVEALQAVINNAAGVGGVQAPPPQHLHTPQSEARFI
        RR+ R     DP P GE  ADP  PP     GV PP P  A+            VPQVNPQ+ LL EALQ +++NA G GG Q   P+    PQ E    
Subjt:  RRSMRLRADADPAPGGENGADPPPPPVGNQAGVVPPFPPPAAAQERADPPVPPAVPQVNPQLVLLVEALQAVINNAAGVGGVQAPPPQHLHTPQSEARFI

Query:  KDFKRYEPPTFDSESERATAAEEWIRELEALYAYLGCEDQFKVKGAVFMLRGEALNWWDSIAAAEDHANVPIPWARFKDLLYDYYYPETVKDMKEAEFLH
                      SER TAAEEW+RELEALY YLGC D FKV+GAVFMLRGEA+NWW+S+AAAEDHANVP+ WARFKDLLY+YY+P TV++ K  EFL 
Subjt:  KDFKRYEPPTFDSESERATAAEEWIRELEALYAYLGCEDQFKVKGAVFMLRGEALNWWDSIAAAEDHANVPIPWARFKDLLYDYYYPETVKDMKEAEFLH

Query:  LVQGTLSVAQYERKFAELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGVKRKFPPTYADPVLR
        L QG+L+VA+YERKF ELSRF ++ IPT+ LKI +F+ GLR+ I+G + L+ PTTYA AVR ALVMDK +    S    +GSSSGVKRKF    +    R
Subjt:  LVQGTLSVAQYERKFAELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGVKRKFPPTYADPVLR

Query:  APQRQAQHQDMPPVCPTCQKRHTGQCWTGSKGCFRCGRERHFARECPMSAVNTQRLGQRISPSVSTQGNNQRARVFALTRKEAADAETVVT---------
          Q   Q Q  PPVCP+C+K H G CW G + C+RC +E HFARECPM+  NTQ LGQRI  + + QG   RARVFALTR +   AE VVT         
Subjt:  APQRQAQHQDMPPVCPTCQKRHTGQCWTGSKGCFRCGRERHFARECPMSAVNTQRLGQRISPSVSTQGNNQRARVFALTRKEAADAETVVT---------

Query:  ---------------------GNLELEPLGFLLSVSTPSGSVLIGSQKVRA-------------------------------ATNQANINCSRREVSFQL
                              +LELE LGFLLSVSTPSGSVL+ SQ V+                                A N+ANI+CS+++VSF+L
Subjt:  ---------------------GNLELEPLGFLLSVSTPSGSVLIGSQKVRA-------------------------------ATNQANINCSRREVSFQL

Query:  PSAPNFTFKGVTGRVPRTVSALKAKRLLQNGAWGYLANVVDISKTPPSIDS
        PS  NFTFKGV   VPR V ALKA  LLQ GAW YLA+VVD  K  PSI++
Subjt:  PSAPNFTFKGVTGRVPRTVSALKAKRLLQNGAWGYLANVVDISKTPPSIDS

A0A6J1DYU5 uncharacterized protein LOC1110255174.6e-15569.23Show/hide
Query:  YLGCEDQFKVKGAVFMLRGEALNWWDSIAAAEDHANVPIPWARFKDLLYDYYYPETVKDMKEAEFLHLVQGTLSVAQYERKFAELSRFALELIPTEALKI
        YL CE+QFKVKG VFMLRGEALNWWDS+A AEDHANVPI WARFKDLLYDYYYP+T+KDMKEAEFLH   GTL+VAQYERKF ELS FA ELIPTEA+KI
Subjt:  YLGCEDQFKVKGAVFMLRGEALNWWDSIAAAEDHANVPIPWARFKDLLYDYYYPETVKDMKEAEFLHLVQGTLSVAQYERKFAELSRFALELIPTEALKI

Query:  KRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGVKRKFPPTYADPVLRAPQRQAQHQDMPPVCPTCQKRHTGQCWTGSKGC
        KRFVKGLRKGIRGPVDLQRP TYAEAVRG L+MD DVSN   PL EVGSSSGVKRK  P YAD   RAPQR AQ Q +PPVCP+CQKR  GQCWTG++GC
Subjt:  KRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGVKRKFPPTYADPVLRAPQRQAQHQDMPPVCPTCQKRHTGQCWTGSKGC

Query:  FRCGRERHFARECPMSAVNTQRLGQRISPSVSTQGNNQRA-RVFALTRKEAADAETVVT------GNLELEPLGFLLSVSTPSGSVLIGSQKVRA-----
        FRCGRE HFAREC M+A NTQRLGQR +P+VSTQG       V A    +   + T ++        LELEPLGFLLSVSTPSGSVLI SQ VRA     
Subjt:  FRCGRERHFARECPMSAVNTQRLGQRISPSVSTQGNNQRA-RVFALTRKEAADAETVVT------GNLELEPLGFLLSVSTPSGSVLIGSQKVRA-----

Query:  --------------------------ATNQANINCSRREVSFQLPSAPNFTFKGVTGRVPRTVSALKAKRLLQNGAWGYLANVVDISKTPPSIDSVHVVR
                                  ATNQANINCS+REVSFQLPS  +FTFKGV+G VPR VSALKA+RLL NGAW YLA+VVDIS TPPSIDS HVV+
Subjt:  --------------------------ATNQANINCSRREVSFQLPSAPNFTFKGVTGRVPRTVSALKAKRLLQNGAWGYLANVVDISKTPPSIDSVHVVR

Query:  EFPDVFPDDLPGLSPV
         F DVFP+DL GL P+
Subjt:  EFPDVFPDDLPGLSPV

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.62.4e-0442.42Show/hide
Query:  MTILLKKGMTWMWSKESQDT-FEDLKAAMLKGSVLGLANVTRPFKVETDASDYALGGVLLQDGHPI
        MT  LKK M    +    D+ F+ LK  + +  +L + + T+ F + TDASD ALG VL QDGHP+
Subjt:  MTILLKKGMTWMWSKESQDT-FEDLKAAMLKGSVLGLANVTRPFKVETDASDYALGGVLLQDGHPI

P10273 Gag-Pol polyprotein7.1e-0438.6Show/hide
Query:  LLKKGMTWMWSKESQDTFEDLKAAMLKGSVLGLANVTRPFKVETDASDYALGGVLLQ
        L + G  + W  E Q  FED+K A+L    LGL ++T+PF++  D +     GVL+Q
Subjt:  LLKKGMTWMWSKESQDTFEDLKAAMLKGSVLGLANVTRPFKVETDASDYALGGVLLQ

P20825 Retrovirus-related Pol polyprotein from transposon 2975.4e-0443.94Show/hide
Query:  MTILLKKGMTWMWSK-ESQDTFEDLKAAMLKGSVLGLANVTRPFKVETDASDYALGGVLLQDGHPI
        MT  LKK       K E  + FE LKA +++  +L L +  + F + TDAS+ ALG VL Q+GHPI
Subjt:  MTILLKKGMTWMWSK-ESQDTFEDLKAAMLKGSVLGLANVTRPFKVETDASDYALGGVLLQDGHPI

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAATATTACTGAAGAAAGGAATGACCTGGATGTGGTCAAAGGAAAGTCAGGACACCTTCGAAGATCTAAAGGCGGCCATGTTGAAAGGCTCAGTACTTGGGCTAGC
CAACGTGACGAGACCGTTCAAAGTAGAAACGGATGCCTCAGACTACGCTTTAGGTGGCGTTCTTCTCCAAGATGGCCACCCCATTTGTAACGCCCCGCATTACTCAGGGT
CGACTGCGGTGATGACGTGGAGAGGGCTTGACCGAGGAGGGATCACTAGTCGAGTTAGTGAGGATAGAGGGAAGGGTCGTTTTGTAGGAGTAGCGTTTGGATTTATTTTA
TGGCTAGGTGTACGCTTGTTTTATTTATTTAAGAATTTAGCCACTGAGGCAGTCGAAGCTTTATTTTATTTAAATTATTTCCGGGGAGTTGGTTATGTCGTGTTTAGAGT
TAAGGCTAGACAGACTGAGTTAGTAACTAGCATACTTAGGAGTAGTGATCTGGGTCGACCTCCTTGTCATCACCAGGAACGCTTCATGCTTGTCCTTGAGAGCTTATACA
TTCCCCATGATTTACTTACAATGCCTCCCCGTCGTAGTATGAGATTGCGAGCTGACGCCGACCCCGCTCCCGGAGGTGAGAACGGAGCGGATCCACCGCCCCCTCCTGTT
GGGAATCAGGCAGGAGTAGTCCCTCCATTTCCTCCACCAGCAGCAGCTCAAGAGCGGGCAGATCCTCCAGTTCCCCCAGCAGTTCCTCAGGTGAACCCCCAATTGGTGTT
GCTTGTGGAGGCATTGCAAGCAGTGATCAATAACGCAGCAGGGGTGGGCGGAGTCCAAGCTCCGCCACCCCAACACCTTCATACTCCGCAGAGCGAGGCTCGCTTCATCA
AGGATTTCAAGCGTTACGAACCCCCAACATTTGACAGTGAGAGTGAAAGAGCGACTGCAGCGGAAGAGTGGATCAGAGAGTTGGAAGCCCTTTACGCGTATCTTGGTTGC
GAGGACCAATTCAAGGTGAAGGGTGCGGTTTTTATGTTGAGGGGCGAGGCCCTGAACTGGTGGGACTCAATAGCAGCGGCAGAGGATCATGCAAATGTACCAATTCCGTG
GGCGAGGTTCAAGGACTTGTTGTACGACTACTACTATCCAGAGACTGTGAAAGACATGAAGGAGGCGGAATTCCTGCATCTAGTCCAAGGAACCTTATCGGTGGCACAAT
ATGAGAGGAAGTTCGCGGAACTCTCCCGTTTCGCTCTAGAGTTGATTCCCACTGAGGCATTGAAGATCAAGAGGTTTGTTAAAGGCTTGCGCAAGGGAATCAGAGGCCCG
GTGGACCTCCAGCGACCCACCACCTATGCTGAAGCGGTTAGGGGCGCCTTGGTTATGGATAAGGATGTCTCCAACAAGGCCTCACCCCTGCCAGAGGTCGGATCATCTTC
AGGTGTGAAAAGGAAGTTCCCTCCGACTTATGCCGACCCGGTATTGAGAGCACCCCAGCGCCAGGCTCAGCACCAGGACATGCCGCCAGTATGCCCCACCTGCCAGAAAA
GACATACGGGGCAGTGCTGGACGGGAAGTAAGGGTTGTTTCAGGTGCGGAAGAGAGAGGCATTTTGCAAGGGAATGTCCCATGTCGGCCGTAAATACACAGAGGTTAGGC
CAGAGGATTTCCCCATCAGTTTCGACGCAGGGAAATAACCAAAGGGCTCGTGTCTTCGCACTTACTCGCAAGGAAGCGGCGGATGCCGAAACCGTGGTCACAGGCAACCT
CGAATTAGAGCCGTTAGGGTTTTTGTTGTCGGTTTCTACACCATCAGGGTCAGTTTTGATTGGTAGTCAAAAGGTGAGGGCAGCTACCAACCAAGCCAACATTAATTGCT
CGAGAAGAGAAGTCTCGTTCCAACTACCTTCGGCTCCGAATTTTACATTTAAAGGAGTTACGGGTAGAGTCCCAAGGACAGTATCGGCGTTAAAGGCAAAACGCCTGTTA
CAGAATGGAGCTTGGGGATATTTGGCCAACGTCGTCGACATTAGTAAGACTCCACCTAGTATCGACTCCGTTCACGTGGTCAGAGAGTTCCCAGACGTGTTCCCCGATGA
CCTTCCGGGGCTATCCCCTGTTGTAGGGGAATGCCGGGGCAGAGAGGTGCCGAAACCGGATTCCGAATCCTGGGCCTGGGGCGTTACAGATTGTATCAGAGCGGAACCTC
TCCCACCAGGTCGCAGGGGAATGCCGGGGAAGAGAGGTGCCGAAACTGGATTCCGAATCCTGGGCCTGGGGCGTTACACCATTGCTTATGAAAGTCGTCCTGGAGGTAGT
ACGGTCCTGTGCATGTTAGCCCACATCCACGCCAGCAAGGTGGACGGGTCGATCAGCGACCTCATTAGAGAATATCTCCAAAGAGACCCCTCCGCTCGAACTGTGGTCGA
GCTGGCTAAGACTGGGAAGACCTGCCAGTTCTGGGTCGAAGAGGACTTATTATTTGTAAAAGGAAACAAACTGCTGGCCACTCTAGATAAGGTAGAGAGGACGAAAGTAG
CAGGGTTGCTTGAACCGCTACCGATTCCTACGAGGCCGTGGGAGAGTGTAACGCTCGACTTCATCTCCCACTTGCCCAAGACGAGCTCTTTAATAGGGAAGAGTTCCTTT
GAGATTGTGTGCGGAAGACAACTGTCAATGCCGCATATCCTCGACCACCCATATGCTGGGAAAAGTCCTCAAGCCCACAACTTCACAAAGGAATGGAAGCAGACGACAGA
AATTGCTCGAGCATATTTGGAAAAAGCTTCCAAGCACATGAAGAAATGGGTCGACCGCAAGCGAAGCCCTCTCGAGTTTCGAGCAGGTAACAAAGTTCTTATCAAGCTGA
AGCCAGAACAGATTCGTTTCCAAGGGCGCAAAGATCAGAGACTCGTGAGAACGTACGAAAGACCAGTAGAAGTTGTAAAGATCCACCCGATGGAATTTAGCCGAGCAGCT
TTGGAACTAGGTGTGTATTGGTGCGATTGTGACACTTGGCACACACGTGCCACCACCAAGGCCGCACAACTCTCCTCTGTGCGAAGTGAGTGCGACTCAATTGGCCGCGG
TGACCCTTCTCAACGCAAGGAGCGCGACCTCCATGTAATCCTTTCTCCGTGTAAAGCGAGTGCGACCCAAGCGGCACGACGCCCCTTCTCCATGCAAGGGAGCGCGACTG
TCCTTTGTGGTCATCGCAACCACCTTGGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGACAATATTACTGAAGAAAGGAATGACCTGGATGTGGTCAAAGGAAAGTCAGGACACCTTCGAAGATCTAAAGGCGGCCATGTTGAAAGGCTCAGTACTTGGGCTAGC
CAACGTGACGAGACCGTTCAAAGTAGAAACGGATGCCTCAGACTACGCTTTAGGTGGCGTTCTTCTCCAAGATGGCCACCCCATTTGTAACGCCCCGCATTACTCAGGGT
CGACTGCGGTGATGACGTGGAGAGGGCTTGACCGAGGAGGGATCACTAGTCGAGTTAGTGAGGATAGAGGGAAGGGTCGTTTTGTAGGAGTAGCGTTTGGATTTATTTTA
TGGCTAGGTGTACGCTTGTTTTATTTATTTAAGAATTTAGCCACTGAGGCAGTCGAAGCTTTATTTTATTTAAATTATTTCCGGGGAGTTGGTTATGTCGTGTTTAGAGT
TAAGGCTAGACAGACTGAGTTAGTAACTAGCATACTTAGGAGTAGTGATCTGGGTCGACCTCCTTGTCATCACCAGGAACGCTTCATGCTTGTCCTTGAGAGCTTATACA
TTCCCCATGATTTACTTACAATGCCTCCCCGTCGTAGTATGAGATTGCGAGCTGACGCCGACCCCGCTCCCGGAGGTGAGAACGGAGCGGATCCACCGCCCCCTCCTGTT
GGGAATCAGGCAGGAGTAGTCCCTCCATTTCCTCCACCAGCAGCAGCTCAAGAGCGGGCAGATCCTCCAGTTCCCCCAGCAGTTCCTCAGGTGAACCCCCAATTGGTGTT
GCTTGTGGAGGCATTGCAAGCAGTGATCAATAACGCAGCAGGGGTGGGCGGAGTCCAAGCTCCGCCACCCCAACACCTTCATACTCCGCAGAGCGAGGCTCGCTTCATCA
AGGATTTCAAGCGTTACGAACCCCCAACATTTGACAGTGAGAGTGAAAGAGCGACTGCAGCGGAAGAGTGGATCAGAGAGTTGGAAGCCCTTTACGCGTATCTTGGTTGC
GAGGACCAATTCAAGGTGAAGGGTGCGGTTTTTATGTTGAGGGGCGAGGCCCTGAACTGGTGGGACTCAATAGCAGCGGCAGAGGATCATGCAAATGTACCAATTCCGTG
GGCGAGGTTCAAGGACTTGTTGTACGACTACTACTATCCAGAGACTGTGAAAGACATGAAGGAGGCGGAATTCCTGCATCTAGTCCAAGGAACCTTATCGGTGGCACAAT
ATGAGAGGAAGTTCGCGGAACTCTCCCGTTTCGCTCTAGAGTTGATTCCCACTGAGGCATTGAAGATCAAGAGGTTTGTTAAAGGCTTGCGCAAGGGAATCAGAGGCCCG
GTGGACCTCCAGCGACCCACCACCTATGCTGAAGCGGTTAGGGGCGCCTTGGTTATGGATAAGGATGTCTCCAACAAGGCCTCACCCCTGCCAGAGGTCGGATCATCTTC
AGGTGTGAAAAGGAAGTTCCCTCCGACTTATGCCGACCCGGTATTGAGAGCACCCCAGCGCCAGGCTCAGCACCAGGACATGCCGCCAGTATGCCCCACCTGCCAGAAAA
GACATACGGGGCAGTGCTGGACGGGAAGTAAGGGTTGTTTCAGGTGCGGAAGAGAGAGGCATTTTGCAAGGGAATGTCCCATGTCGGCCGTAAATACACAGAGGTTAGGC
CAGAGGATTTCCCCATCAGTTTCGACGCAGGGAAATAACCAAAGGGCTCGTGTCTTCGCACTTACTCGCAAGGAAGCGGCGGATGCCGAAACCGTGGTCACAGGCAACCT
CGAATTAGAGCCGTTAGGGTTTTTGTTGTCGGTTTCTACACCATCAGGGTCAGTTTTGATTGGTAGTCAAAAGGTGAGGGCAGCTACCAACCAAGCCAACATTAATTGCT
CGAGAAGAGAAGTCTCGTTCCAACTACCTTCGGCTCCGAATTTTACATTTAAAGGAGTTACGGGTAGAGTCCCAAGGACAGTATCGGCGTTAAAGGCAAAACGCCTGTTA
CAGAATGGAGCTTGGGGATATTTGGCCAACGTCGTCGACATTAGTAAGACTCCACCTAGTATCGACTCCGTTCACGTGGTCAGAGAGTTCCCAGACGTGTTCCCCGATGA
CCTTCCGGGGCTATCCCCTGTTGTAGGGGAATGCCGGGGCAGAGAGGTGCCGAAACCGGATTCCGAATCCTGGGCCTGGGGCGTTACAGATTGTATCAGAGCGGAACCTC
TCCCACCAGGTCGCAGGGGAATGCCGGGGAAGAGAGGTGCCGAAACTGGATTCCGAATCCTGGGCCTGGGGCGTTACACCATTGCTTATGAAAGTCGTCCTGGAGGTAGT
ACGGTCCTGTGCATGTTAGCCCACATCCACGCCAGCAAGGTGGACGGGTCGATCAGCGACCTCATTAGAGAATATCTCCAAAGAGACCCCTCCGCTCGAACTGTGGTCGA
GCTGGCTAAGACTGGGAAGACCTGCCAGTTCTGGGTCGAAGAGGACTTATTATTTGTAAAAGGAAACAAACTGCTGGCCACTCTAGATAAGGTAGAGAGGACGAAAGTAG
CAGGGTTGCTTGAACCGCTACCGATTCCTACGAGGCCGTGGGAGAGTGTAACGCTCGACTTCATCTCCCACTTGCCCAAGACGAGCTCTTTAATAGGGAAGAGTTCCTTT
GAGATTGTGTGCGGAAGACAACTGTCAATGCCGCATATCCTCGACCACCCATATGCTGGGAAAAGTCCTCAAGCCCACAACTTCACAAAGGAATGGAAGCAGACGACAGA
AATTGCTCGAGCATATTTGGAAAAAGCTTCCAAGCACATGAAGAAATGGGTCGACCGCAAGCGAAGCCCTCTCGAGTTTCGAGCAGGTAACAAAGTTCTTATCAAGCTGA
AGCCAGAACAGATTCGTTTCCAAGGGCGCAAAGATCAGAGACTCGTGAGAACGTACGAAAGACCAGTAGAAGTTGTAAAGATCCACCCGATGGAATTTAGCCGAGCAGCT
TTGGAACTAGGTGTGTATTGGTGCGATTGTGACACTTGGCACACACGTGCCACCACCAAGGCCGCACAACTCTCCTCTGTGCGAAGTGAGTGCGACTCAATTGGCCGCGG
TGACCCTTCTCAACGCAAGGAGCGCGACCTCCATGTAATCCTTTCTCCGTGTAAAGCGAGTGCGACCCAAGCGGCACGACGCCCCTTCTCCATGCAAGGGAGCGCGACTG
TCCTTTGTGGTCATCGCAACCACCTTGGTTAG
Protein sequenceShow/hide protein sequence
MTILLKKGMTWMWSKESQDTFEDLKAAMLKGSVLGLANVTRPFKVETDASDYALGGVLLQDGHPICNAPHYSGSTAVMTWRGLDRGGITSRVSEDRGKGRFVGVAFGFIL
WLGVRLFYLFKNLATEAVEALFYLNYFRGVGYVVFRVKARQTELVTSILRSSDLGRPPCHHQERFMLVLESLYIPHDLLTMPPRRSMRLRADADPAPGGENGADPPPPPV
GNQAGVVPPFPPPAAAQERADPPVPPAVPQVNPQLVLLVEALQAVINNAAGVGGVQAPPPQHLHTPQSEARFIKDFKRYEPPTFDSESERATAAEEWIRELEALYAYLGC
EDQFKVKGAVFMLRGEALNWWDSIAAAEDHANVPIPWARFKDLLYDYYYPETVKDMKEAEFLHLVQGTLSVAQYERKFAELSRFALELIPTEALKIKRFVKGLRKGIRGP
VDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGVKRKFPPTYADPVLRAPQRQAQHQDMPPVCPTCQKRHTGQCWTGSKGCFRCGRERHFARECPMSAVNTQRLG
QRISPSVSTQGNNQRARVFALTRKEAADAETVVTGNLELEPLGFLLSVSTPSGSVLIGSQKVRAATNQANINCSRREVSFQLPSAPNFTFKGVTGRVPRTVSALKAKRLL
QNGAWGYLANVVDISKTPPSIDSVHVVREFPDVFPDDLPGLSPVVGECRGREVPKPDSESWAWGVTDCIRAEPLPPGRRGMPGKRGAETGFRILGLGRYTIAYESRPGGS
TVLCMLAHIHASKVDGSISDLIREYLQRDPSARTVVELAKTGKTCQFWVEEDLLFVKGNKLLATLDKVERTKVAGLLEPLPIPTRPWESVTLDFISHLPKTSSLIGKSSF
EIVCGRQLSMPHILDHPYAGKSPQAHNFTKEWKQTTEIARAYLEKASKHMKKWVDRKRSPLEFRAGNKVLIKLKPEQIRFQGRKDQRLVRTYERPVEVVKIHPMEFSRAA
LELGVYWCDCDTWHTRATTKAAQLSSVRSECDSIGRGDPSQRKERDLHVILSPCKASATQAARRPFSMQGSATVLCGHRNHLG