CuGenDBv2

Moc02g17450 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc02g17450
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Reverse transcriptase Ty1/copia-type domain-containing protein
Genome location: chr2:13074137..13075063
RNA-Seq Expression: Moc02g17450
Synteny: Moc02g17450
Gene Ontology terms: GO:0016740 - transferase activity (molecular function)
InterPro domains: IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
                  IPR043502 - DNA/RNA polymerase superfamily
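
The genome location above packs chromosome, start and end into one string. A minimal sketch of splitting it and deriving the span length follows; the function name `parse_location` is hypothetical, and the code assumes the coordinates are 1-based and inclusive, which the page does not state explicitly.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


chrom, start, end = parse_location("chr2:13074137..13075063")
# Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)  # chr2 13074137 13075063 927
```

Under that assumption the span is 927 bp, which is consistent with a 308-residue protein plus a stop codon (309 x 3 = 927).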


Homology
GenBank top hits | e value | %identity | Alignment
KAA0057165.1 putative mitochondrial protein [Cucumis melo var. makuwa] | e value: 9.8e-81 | %identity: 56
Query:  MQQPRGFVNSTYPNHVCQLLKSLYGLKQAPRVWFECFTSHLLSLDFVAFTADSSLFIRHTCDTITYLLLYVDDIILTGNSPFYISALIAQLRTKFDMSDL
        M QP GF + T PNHVC L KSLYGLKQAPR WFE F S+L +LDFVA TAD SLFIR    ++TYLLLYV +II+ G    YI  L  QL  +F +SDL
Subjt:  MQQPRGFVNSTYPNHVCQLLKSLYGLKQAPRVWFECFTSHLLSLDFVAFTADSSLFIRHTCDTITYLLLYVDDIILTGNSPFYISALIAQLRTKFDMSDL

Query:  GSLKYFLGLEV-SHTAGISVSQTKYAKDILARFGMAVAKYCNTPSALSSAVNGAECPCSDEDAKSFRSIVGAFQYLTFTRPDIAFCVSKLSQSMHSPSAA
        G LKYFLGLE+ S   GI V+Q KY  D+L    M  AK C TP  +S++++         D   +R +VG+ QYLTFTRPDIAF ++++SQ MH P+  
Subjt:  GSLKYFLGLEV-SHTAGISVSQTKYAKDILARFGMAVAKYCNTPSALSSAVNGAECPCSDEDAKSFRSIVGAFQYLTFTRPDIAFCVSKLSQSMHSPSAA

Query:  SLIDAKRVLLYVNGTLSSGLLFRKGHSNCFSLTAFSDSDWVGDSLDRRSTTGFIIFLGHNPISWAAKKQSTVSRSSTEAEYRALASTAAELFWLRQLLRD
             KR+L Y+ GT + G+ FRKG    FSL  F DS+W GD+ D+RST+ FI FLG NPISW++KKQSTVSRSSTEAEYR LA+T A+L+W+RQLL D
Subjt:  SLIDAKRVLLYVNGTLSSGLLFRKGHSNCFSLTAFSDSDWVGDSLDRRSTTGFIIFLGHNPISWAAKKQSTVSRSSTEAEYRALASTAAELFWLRQLLRD

XP_020415542.1 uncharacterized protein LOC109948051 [Prunus persica] | e value: 1.7e-85 | %identity: 57.1
Query:  MQQPRGFVNSTYPNHVCQLLKSLYGLKQAPRVWFECFTSHLLSLDFVAFTADSSLFIRHTCDTITYLLLYVDDIILTGNSPFYISALIAQLRTKFDMSDL
        MQQP GF +S +P+HVC+LLKSLYGLKQAPR W E FT+HLL+L FV   AD+SLFIR +   +  LLLYVDDIILTGN+   I   I  L  +FDM DL
Subjt:  MQQPRGFVNSTYPNHVCQLLKSLYGLKQAPRVWFECFTSHLLSLDFVAFTADSSLFIRHTCDTITYLLLYVDDIILTGNSPFYISALIAQLRTKFDMSDL

Query:  GSLKYFLGLEVSHTA-GISVSQTKYAKDILARFGMAVAKYCNTPSALSSAVNGAE-CPCSDEDAKSFRSIVGAFQYLTFTRPDIAFCVSKLSQSMHSPSA
        G L YFLGL+V +T+ GI V+Q+KYA D+L + GM   K C+TP   ++ +   +  P SD     +RS+VG  QYLTF+RPDIAF V+ L Q MH+P+ 
Subjt:  GSLKYFLGLEVSHTA-GISVSQTKYAKDILARFGMAVAKYCNTPSALSSAVNGAE-CPCSDEDAKSFRSIVGAFQYLTFTRPDIAFCVSKLSQSMHSPSA

Query:  ASLIDAKRVLLYVNGTLSSGLLFRKGHSNCFSLTAFSDSDWVGDSLDRRSTTGFIIFLGHNPISWAAKKQSTVSRSSTEAEYRALASTAAELFWLRQLLR
        +     KRVL Y+ GTL+ G+ F  G  +   L A+SD+DW GD  DRRSTTG ++FLG+NPISW AKKQ+TVSRSSTEAEYRALA T+AEL WL QLLR
Subjt:  ASLIDAKRVLLYVNGTLSSGLLFRKGHSNCFSLTAFSDSDWVGDSLDRRSTTGFIIFLGHNPISWAAKKQSTVSRSSTEAEYRALASTAAELFWLRQLLR

Query:  DHCIFGSQPP
        D  +F  +PP
Subjt:  DHCIFGSQPP

XP_022143489.1 uncharacterized protein LOC111013365 [Momordica charantia] | e value: 9.1e-103 | %identity: 62.14
Query:  MQQPRGFVNSTYPNHVCQLLKSLYGLKQAPRVWFECFTSHLLSLDFVAFTADSSLFIRHTCDTITYLLLYVDDIILTGNSPFYISALIAQLRTKFDMSDL
        MQQP+GFV+ T P++VC+L KSLYG KQAPR WFECFT+HLL L FVA + DSSLF+R    + TYLLLYVDDI++TG+ P YI++LI QL+ +FDM+DL
Subjt:  MQQPRGFVNSTYPNHVCQLLKSLYGLKQAPRVWFECFTSHLLSLDFVAFTADSSLFIRHTCDTITYLLLYVDDIILTGNSPFYISALIAQLRTKFDMSDL

Query:  GSLKYFLGLEVSHT-AGISVSQTKYAKDILARFGMAVAKYCNTPSALSSAVNGAECPCSDEDAKSFRSIVGAFQYLTFTRPDIAFCVSKLSQSMHSPSAA
          LKYFLGLE+S+T  G+S+SQ KY +D+L RFG+  AK C TP +LSS  +    PCS ED +++RS++G+  YLTFTRPDIAF V KLSQ MH+P  +
Subjt:  GSLKYFLGLEVSHT-AGISVSQTKYAKDILARFGMAVAKYCNTPSALSSAVNGAECPCSDEDAKSFRSIVGAFQYLTFTRPDIAFCVSKLSQSMHSPSAA

Query:  SLIDAKRVLLYVNGTLSSGLLFRKGHSNCFSLTAFSDSDWVGDSLDRRSTTGFIIFLGHNPISWAAKKQSTVSRSSTEAEYRALASTAAELFWLRQLLRD
         L+ AKR+L Y++G+L   ++F++  SN   L AFSDSDW GDS DRRST+GF+IFLG NPIS  +KKQSTVSRSSTEAEYR LASTAAELFW+RQLL+D
Subjt:  SLIDAKRVLLYVNGTLSSGLLFRKGHSNCFSLTAFSDSDWVGDSLDRRSTTGFIIFLGHNPISWAAKKQSTVSRSSTEAEYRALASTAAELFWLRQLLRD

Query:  HCIFGSQPP
          +F +Q P
Subjt:  HCIFGSQPP

XP_022151604.1 uncharacterized protein LOC111019517 [Momordica charantia] | e value: 1.2e-81 | %identity: 60.75
Query:  MQQPRGFVNSTYPNHVCQLLKSLYGLKQAPRVWFECFTSHLLSLDFVAFTADSSLFIRHTCDTITYLLLYVDDIILTGNSPFYISALIAQLRTKFDMSDL
        MQQP+GF++   P  VC+L+KSLY LKQAPR WF+CF SHLL+L F A  ADSSLF+R T D++TYLLLYVDDI LT     YI  LI+QL+ +FDM+DL
Subjt:  MQQPRGFVNSTYPNHVCQLLKSLYGLKQAPRVWFECFTSHLLSLDFVAFTADSSLFIRHTCDTITYLLLYVDDIILTGNSPFYISALIAQLRTKFDMSDL

Query:  GSLKYFLGLEVSHTA-GISVSQTKYAKDILARFGMAVAKYCNTPSALSSAVNGAECPCSDEDAKSFRSIVGAFQYLTFTRPDIAFCVSKLSQSMHSPSAA
        G L++FLGLE+++++ GISV Q+KY KD+L RFGM  AK C+TP ALSSA   +   CSD DAK++RS+VGA  YLTF+RP+IAF VSKLSQ +HSP+  
Subjt:  GSLKYFLGLEVSHTA-GISVSQTKYAKDILARFGMAVAKYCNTPSALSSAVNGAECPCSDEDAKSFRSIVGAFQYLTFTRPDIAFCVSKLSQSMHSPSAA

Query:  SLIDAKRVLLYVNGTLSSGLLFRKG--HSNCFSLTAFSDSDWVGDSLDRRSTTGFIIFLGHNPIS
         L+ AKRVL Y+ G++  GL+F+KG    +   LTA+SDSDW GDS DRRST GF+IFLG +PIS
Subjt:  SLIDAKRVLLYVNGTLSSGLLFRKG--HSNCFSLTAFSDSDWVGDSLDRRSTTGFIIFLGHNPIS

XP_022152156.1 uncharacterized protein LOC111019945 [Momordica charantia] | e value: 4.0e-82 | %identity: 62.69
Query:  TADSSLFIRHTCDTITYLLLYVDDIILTGNSPFYISALIAQLRTKFDMSDLGSLKYFLGLEVSHTA-GISVSQTKYAKDILARFGMAVAKYCNTPSALSS
        T  SSLF+R+T  ++TYLLLYVDDII+TGNS  YI  LI+ L+ +FDM+DLG+L YFLGLE+++T+ GI V+Q KY +DIL RFGM  AK C+TP AL +
Subjt:  TADSSLFIRHTCDTITYLLLYVDDIILTGNSPFYISALIAQLRTKFDMSDLGSLKYFLGLEVSHTA-GISVSQTKYAKDILARFGMAVAKYCNTPSALSS

Query:  AVNGAECPCSDEDAKSFRSIVGAFQYLTFTRPDIAFCVSKLSQSMHSPSAASLIDAKRVLLYVNGTLSSGLLFRKGHSNCFSLTAFSDSDWVGDSLDRRS
        + +  E PCS EDA+++RS++GA  YLTF+RPDIAF VSKLSQ MH P    L  AKR+L Y+NGT+  GL+FR+  +    L AFSDSDW GD++DRRS
Subjt:  AVNGAECPCSDEDAKSFRSIVGAFQYLTFTRPDIAFCVSKLSQSMHSPSAASLIDAKRVLLYVNGTLSSGLLFRKGHSNCFSLTAFSDSDWVGDSLDRRS

Query:  TTGFIIFLGHNPISWAAKKQSTVSRSSTEAEYRALASTAAELFWLRQLLRDHCIFGSQPP
        TTG +IFLG NPISW AKKQ+TVSRSSTEAEYRALAST AEL WLRQ+L+D  +F  Q P
Subjt:  TTGFIIFLGHNPISWAAKKQSTVSRSSTEAEYRALASTAAELFWLRQLLRDHCIFGSQPP

TrEMBL top hits | e value | %identity | Alignment
A0A2N9FT75 Reverse transcriptase Ty1/copia-type domain-containing protein | e value: 4.3e-90 | %identity: 56.45
Query:  MQQPRGFVNSTYPNHVCQLLKSLYGLKQAPRVWFECFTSHLLSLDFVAFTADSSLFIRHTCDTITYLLLYVDDIILTGNSPFYISALIAQLRTKFDMSDL
        MQQP GF++  YP HVCQL K+LYGLKQAPR WFE FTSHLL++ F    AD SLF+     T+ YLLLYVDDII+TGN P  + +LI +L  +FD+ DL
Subjt:  MQQPRGFVNSTYPNHVCQLLKSLYGLKQAPRVWFECFTSHLLSLDFVAFTADSSLFIRHTCDTITYLLLYVDDIILTGNSPFYISALIAQLRTKFDMSDL

Query:  GSLKYFLGLEVSH-TAGISVSQTKYAKDILARFGMAVAKYCNTPSALSSAVNGAE-CPCSDEDAKSFRSIVGAFQYLTFTRPDIAFCVSKLSQSMHSPSA
        G LK+FLGL++ + ++G  V Q KYA D+LA+F M+  K C+TP    S +   +  P S  D   FRS+VG  QYLTFTRPD+++ V+ + Q MH P+ 
Subjt:  GSLKYFLGLEVSH-TAGISVSQTKYAKDILARFGMAVAKYCNTPSALSSAVNGAE-CPCSDEDAKSFRSIVGAFQYLTFTRPDIAFCVSKLSQSMHSPSA

Query:  ASLIDAKRVLLYVNGTLSSGLLFRKGHSNCFSLTAFSDSDWVGDSLDRRSTTGFIIFLGHNPISWAAKKQSTVSRSSTEAEYRALASTAAELFWLRQLLR
          L+ AKR+L YV GTL  GL FR G     SLTAF+DSDW GD +DRRSTTG I+FLGHNPI+W +KKQ TV+RSSTEAEYRALA+ AA+L W+R +L+
Subjt:  ASLIDAKRVLLYVNGTLSSGLLFRKGHSNCFSLTAFSDSDWVGDSLDRRSTTGFIIFLGHNPISWAAKKQSTVSRSSTEAEYRALASTAAELFWLRQLLR

Query:  DHCIFGSQPP
        D  IF   PP
Subjt:  DHCIFGSQPP

A0A2N9HXK0 Uncharacterized protein | e value: 4.3e-90 | %identity: 56.45
Query:  MQQPRGFVNSTYPNHVCQLLKSLYGLKQAPRVWFECFTSHLLSLDFVAFTADSSLFIRHTCDTITYLLLYVDDIILTGNSPFYISALIAQLRTKFDMSDL
        MQQP GF++  YP HVCQL K+LYGLKQAPR WFE FTSHLL++ F    AD SLF+     T+ YLLLYVDDII+TGN P  + +LI +L  +FD+ DL
Subjt:  MQQPRGFVNSTYPNHVCQLLKSLYGLKQAPRVWFECFTSHLLSLDFVAFTADSSLFIRHTCDTITYLLLYVDDIILTGNSPFYISALIAQLRTKFDMSDL

Query:  GSLKYFLGLEVSH-TAGISVSQTKYAKDILARFGMAVAKYCNTPSALSSAVNGAE-CPCSDEDAKSFRSIVGAFQYLTFTRPDIAFCVSKLSQSMHSPSA
        G LK+FLGL++ + ++G  V Q KYA D+LA+F M+  K C+TP    S +   +  P S  D   FRS+VG  QYLTFTRPD+++ V+ + Q MH P+ 
Subjt:  GSLKYFLGLEVSH-TAGISVSQTKYAKDILARFGMAVAKYCNTPSALSSAVNGAE-CPCSDEDAKSFRSIVGAFQYLTFTRPDIAFCVSKLSQSMHSPSA

Query:  ASLIDAKRVLLYVNGTLSSGLLFRKGHSNCFSLTAFSDSDWVGDSLDRRSTTGFIIFLGHNPISWAAKKQSTVSRSSTEAEYRALASTAAELFWLRQLLR
          L+ AKR+L YV GTL  GL FR G     SLTAF+DSDW GD +DRRSTTG I+FLGHNPI+W +KKQ TV+RSSTEAEYRALA+ AA+L W+R +L+
Subjt:  ASLIDAKRVLLYVNGTLSSGLLFRKGHSNCFSLTAFSDSDWVGDSLDRRSTTGFIIFLGHNPISWAAKKQSTVSRSSTEAEYRALASTAAELFWLRQLLR

Query:  DHCIFGSQPP
        D  IF   PP
Subjt:  DHCIFGSQPP

A0A2N9IEP2 Uncharacterized protein | e value: 4.3e-90 | %identity: 56.41
Query:  MQQPRGFVNSTYPNHVCQLLKSLYGLKQAPRVWFECFTSHLLSLDFVAFTADSSLFIRHTCDTITYLLLYVDDIILTGNSPFYISALIAQLRTKFDMSDL
        MQQP GF++  YP HVCQL K+LYGLKQAPR WFE FTSHLL++ F    AD SLF+     T+ YLLLYVDDII+TGN P  I +LI +L  +FD+ DL
Subjt:  MQQPRGFVNSTYPNHVCQLLKSLYGLKQAPRVWFECFTSHLLSLDFVAFTADSSLFIRHTCDTITYLLLYVDDIILTGNSPFYISALIAQLRTKFDMSDL

Query:  GSLKYFLGLEVSH-TAGISVSQTKYAKDILARFGMAVAKYCNTPSALSSAV---NGAECPCSDEDAKSFRSIVGAFQYLTFTRPDIAFCVSKLSQSMHSP
        G LK+FLGL++ + ++G  V Q KYA D+LA+F M+  K C+TP    S +   +G   P    D   FRS+VG  QYLTFTRPD+++ V+ + Q MH P
Subjt:  GSLKYFLGLEVSH-TAGISVSQTKYAKDILARFGMAVAKYCNTPSALSSAV---NGAECPCSDEDAKSFRSIVGAFQYLTFTRPDIAFCVSKLSQSMHSP

Query:  SAASLIDAKRVLLYVNGTLSSGLLFRKGHSNCFSLTAFSDSDWVGDSLDRRSTTGFIIFLGHNPISWAAKKQSTVSRSSTEAEYRALASTAAELFWLRQL
        +   L+ AKR+L YV GTL  GL FR G     SLTAF+DSDW GD +DRRSTTG I+FLGHNPI+W +KKQ TV+RSSTEAEYRALA+ AA+L W+R +
Subjt:  SAASLIDAKRVLLYVNGTLSSGLLFRKGHSNCFSLTAFSDSDWVGDSLDRRSTTGFIIFLGHNPISWAAKKQSTVSRSSTEAEYRALASTAAELFWLRQL

Query:  LRDHCIFGSQPP
        L+D  IF   PP
Subjt:  LRDHCIFGSQPP

A0A2N9IXX5 CCHC-type domain-containing protein | e value: 4.3e-90 | %identity: 56.45
Query:  MQQPRGFVNSTYPNHVCQLLKSLYGLKQAPRVWFECFTSHLLSLDFVAFTADSSLFIRHTCDTITYLLLYVDDIILTGNSPFYISALIAQLRTKFDMSDL
        MQQP GF++  YP HVCQL K+LYGLKQAPR WFE FTSHLL++ F    AD SLF+     T+ YLLLYVDDII+TGN P  + +LI +L  +FD+ DL
Subjt:  MQQPRGFVNSTYPNHVCQLLKSLYGLKQAPRVWFECFTSHLLSLDFVAFTADSSLFIRHTCDTITYLLLYVDDIILTGNSPFYISALIAQLRTKFDMSDL

Query:  GSLKYFLGLEVSH-TAGISVSQTKYAKDILARFGMAVAKYCNTPSALSSAVNGAE-CPCSDEDAKSFRSIVGAFQYLTFTRPDIAFCVSKLSQSMHSPSA
        G LK+FLGL++ + ++G  V Q KYA D+LA+F M+  K C+TP    S +   +  P S  D   FRS+VG  QYLTFTRPD+++ V+ + Q MH P+ 
Subjt:  GSLKYFLGLEVSH-TAGISVSQTKYAKDILARFGMAVAKYCNTPSALSSAVNGAE-CPCSDEDAKSFRSIVGAFQYLTFTRPDIAFCVSKLSQSMHSPSA

Query:  ASLIDAKRVLLYVNGTLSSGLLFRKGHSNCFSLTAFSDSDWVGDSLDRRSTTGFIIFLGHNPISWAAKKQSTVSRSSTEAEYRALASTAAELFWLRQLLR
          L+ AKR+L YV GTL  GL FR G     SLTAF+DSDW GD +DRRSTTG I+FLGHNPI+W +KKQ TV+RSSTEAEYRALA+ AA+L W+R +L+
Subjt:  ASLIDAKRVLLYVNGTLSSGLLFRKGHSNCFSLTAFSDSDWVGDSLDRRSTTGFIIFLGHNPISWAAKKQSTVSRSSTEAEYRALASTAAELFWLRQLLR

Query:  DHCIFGSQPP
        D  IF   PP
Subjt:  DHCIFGSQPP

A0A6J1CPG5 uncharacterized protein LOC111013365 | e value: 4.4e-103 | %identity: 62.14
Query:  MQQPRGFVNSTYPNHVCQLLKSLYGLKQAPRVWFECFTSHLLSLDFVAFTADSSLFIRHTCDTITYLLLYVDDIILTGNSPFYISALIAQLRTKFDMSDL
        MQQP+GFV+ T P++VC+L KSLYG KQAPR WFECFT+HLL L FVA + DSSLF+R    + TYLLLYVDDI++TG+ P YI++LI QL+ +FDM+DL
Subjt:  MQQPRGFVNSTYPNHVCQLLKSLYGLKQAPRVWFECFTSHLLSLDFVAFTADSSLFIRHTCDTITYLLLYVDDIILTGNSPFYISALIAQLRTKFDMSDL

Query:  GSLKYFLGLEVSHT-AGISVSQTKYAKDILARFGMAVAKYCNTPSALSSAVNGAECPCSDEDAKSFRSIVGAFQYLTFTRPDIAFCVSKLSQSMHSPSAA
          LKYFLGLE+S+T  G+S+SQ KY +D+L RFG+  AK C TP +LSS  +    PCS ED +++RS++G+  YLTFTRPDIAF V KLSQ MH+P  +
Subjt:  GSLKYFLGLEVSHT-AGISVSQTKYAKDILARFGMAVAKYCNTPSALSSAVNGAECPCSDEDAKSFRSIVGAFQYLTFTRPDIAFCVSKLSQSMHSPSAA

Query:  SLIDAKRVLLYVNGTLSSGLLFRKGHSNCFSLTAFSDSDWVGDSLDRRSTTGFIIFLGHNPISWAAKKQSTVSRSSTEAEYRALASTAAELFWLRQLLRD
         L+ AKR+L Y++G+L   ++F++  SN   L AFSDSDW GDS DRRST+GF+IFLG NPIS  +KKQSTVSRSSTEAEYR LASTAAELFW+RQLL+D
Subjt:  SLIDAKRVLLYVNGTLSSGLLFRKGHSNCFSLTAFSDSDWVGDSLDRRSTTGFIIFLGHNPISWAAKKQSTVSRSSTEAEYRALASTAAELFWLRQLLRD

Query:  HCIFGSQPP
          +F +Q P
Subjt:  HCIFGSQPP

SwissProt top hits | e value | %identity | Alignment
P04146 Copia protein | e value: 1.9e-42 | %identity: 36.63
Query:  MQQPRGFVNSTYPNHVCQLLKSLYGLKQAPRVWFECFTSHLLSLDFVAFTADSSLFI--RHTCDTITYLLLYVDDIILTGNSPFYISALIAQLRTKFDMS
        M+ P+G   S   ++VC+L K++YGLKQA R WFE F   L   +FV  + D  ++I  +   +   Y+LLYVDD+++       ++     L  KF M+
Subjt:  MQQPRGFVNSTYPNHVCQLLKSLYGLKQAPRVWFECFTSHLLSLDFVAFTADSSLFI--RHTCDTITYLLLYVDDIILTGNSPFYISALIAQLRTKFDMS

Query:  DLGSLKYFLGLEVS-HTAGISVSQTKYAKDILARFGMAVAKYCNTPSALSSAVNGAECPCSDEDAKS-FRSIVGAFQYLTF-TRPDIAFCVSKLSQSMHS
        DL  +K+F+G+ +      I +SQ+ Y K IL++F M      +TP  L S +N  E   SDED  +  RS++G   Y+   TRPD+   V+ LS+    
Subjt:  DLGSLKYFLGLEVS-HTAGISVSQTKYAKDILARFGMAVAKYCNTPSALSSAVNGAECPCSDEDAKS-FRSIVGAFQYLTF-TRPDIAFCVSKLSQSMHS

Query:  PSAASLIDAKRVLLYVNGTLSSGLLFRKGHSNCFSLTAFSDSDWVGDSLDRRSTTGFII-FLGHNPISWAAKKQSTVSRSSTEAEYRALASTAAELFWLR
         ++    + KRVL Y+ GT+   L+F+K  +    +  + DSDW G  +DR+STTG++      N I W  K+Q++V+ SSTEAEY AL     E  WL+
Subjt:  PSAASLIDAKRVLLYVNGTLSSGLLFRKGHSNCFSLTAFSDSDWVGDSLDRRSTTGFII-FLGHNPISWAAKKQSTVSRSSTEAEYRALASTAAELFWLR

Query:  QLL
         LL
Subjt:  QLL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 1.0e-40 | %identity: 35.6
Query:  MQQPRGFVNSTYPNHVCQLLKSLYGLKQAPRVWFECFTSHLLSLDFVAFTADSSL-FIRHTCDTITYLLLYVDDIILTGNSPFYISALIAQLRTKFDMSD
        M+QP GF  +   + VC+L KSLYGLKQAPR W+  F S + S  ++   +D  + F R + +    LLLYVDD+++ G     I+ L   L   FDM D
Subjt:  MQQPRGFVNSTYPNHVCQLLKSLYGLKQAPRVWFECFTSHLLSLDFVAFTADSSL-FIRHTCDTITYLLLYVDDIILTGNSPFYISALIAQLRTKFDMSD

Query:  LGSLKYFLGLEVSH---TAGISVSQTKYAKDILARFGMAVAKYCNTPSALSSAVNGAECPCSDEDAKS-----FRSIVGAFQY-LTFTRPDIAFCVSKLS
        LG  +  LG+++     +  + +SQ KY + +L RF M  AK  +TP A    ++   CP + E+  +     + S VG+  Y +  TRPDIA  V  +S
Subjt:  LGSLKYFLGLEVSH---TAGISVSQTKYAKDILARFGMAVAKYCNTPSALSSAVNGAECPCSDEDAKS-----FRSIVGAFQY-LTFTRPDIAFCVSKLS

Query:  QSMHSPSAASLIDAKRVLLYVNGTLSSGLLFRKGHSNCFSLTAFSDSDWVGDSLDRRSTTGFIIFLGHNPISWAAKKQSTVSRSSTEAEYRALASTAAEL
        + + +P        K +L Y+ GT    L F  G S+   L  ++D+D  GD  +R+S+TG++       ISW +K Q  V+ S+TEAEY A   T  E+
Subjt:  QSMHSPSAASLIDAKRVLLYVNGTLSSGLLFRKGHSNCFSLTAFSDSDWVGDSLDRRSTTGFIIFLGHNPISWAAKKQSTVSRSSTEAEYRALASTAAEL

Query:  FWLRQLLRD
         WL++ L++
Subjt:  FWLRQLLRD

P92519 Uncharacterized mitochondrial protein AtMg00810 | e value: 1.6e-54 | %identity: 53.91
Query:  YLLLYVDDIILTGNSPFYISALIAQLRTKFDMSDLGSLKYFLGLEV-SHTAGISVSQTKYAKDILARFGMAVAKYCNT--PSALSSAVNGAECPCSDEDA
        YLLLYVDDI+LTG+S   ++ LI QL + F M DLG + YFLG+++ +H +G+ +SQTKYA+ IL   GM   K  +T  P  L+S+V+ A+ P    D 
Subjt:  YLLLYVDDIILTGNSPFYISALIAQLRTKFDMSDLGSLKYFLGLEV-SHTAGISVSQTKYAKDILARFGMAVAKYCNT--PSALSSAVNGAECPCSDEDA

Query:  KSFRSIVGAFQYLTFTRPDIAFCVSKLSQSMHSPSAASLIDAKRVLLYVNGTLSSGLLFRKGHSNCFSLTAFSDSDWVGDSLDRRSTTGFIIFLGHNPIS
          FRSIVGA QYLT TRPDI++ V+ + Q MH P+ A     KRVL YV GT+  GL   K  ++  ++ AF DSDW G +  RRSTTGF  FLG N IS
Subjt:  KSFRSIVGAFQYLTFTRPDIAFCVSKLSQSMHSPSAASLIDAKRVLLYVNGTLSSGLLFRKGHSNCFSLTAFSDSDWVGDSLDRRSTTGFIIFLGHNPIS

Query:  WAAKKQSTVSRSSTEAEYRALASTAAELFW
        W+AK+Q TVSRSSTE EYRALA TAAEL W
Subjt:  WAAKKQSTVSRSSTEAEYRALASTAAELFW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 1.6e-70 | %identity: 45.51
Query:  MQQPRGFVNSTYPNHVCQLLKSLYGLKQAPRVWFECFTSHLLSLDFVAFTADSSLFIRHTCDTITYLLLYVDDIILTGNSPFYISALIAQLRTKFDMSDL
        M QP GF++   PN+VC+L K+LYGLKQAPR W+    ++LL++ FV   +D+SLF+     +I Y+L+YVDDI++TGN P  +   +  L  +F + D 
Subjt:  MQQPRGFVNSTYPNHVCQLLKSLYGLKQAPRVWFECFTSHLLSLDFVAFTADSSLFIRHTCDTITYLLLYVDDIILTGNSPFYISALIAQLRTKFDMSDL

Query:  GSLKYFLGLEVSHT-AGISVSQTKYAKDILARFGMAVAKYCNTPSALS---SAVNGAECPCSDEDAKSFRSIVGAFQYLTFTRPDIAFCVSKLSQSMHSP
          L YFLG+E      G+ +SQ +Y  D+LAR  M  AK   TP A S   S  +G +      D   +R IVG+ QYL FTRPDI++ V++LSQ MH P
Subjt:  GSLKYFLGLEVSHT-AGISVSQTKYAKDILARFGMAVAKYCNTPSALS---SAVNGAECPCSDEDAKSFRSIVGAFQYLTFTRPDIAFCVSKLSQSMHSP

Query:  SAASLIDAKRVLLYVNGTLSSGLLFRKGHSNCFSLTAFSDSDWVGDSLDRRSTTGFIIFLGHNPISWAAKKQSTVSRSSTEAEYRALASTAAELFWLRQL
        +   L   KR+L Y+ GT + G+  +KG  N  SL A+SD+DW GD  D  ST G+I++LGH+PISW++KKQ  V RSSTEAEYR++A+T++E+ W+  L
Subjt:  SAASLIDAKRVLLYVNGTLSSGLLFRKGHSNCFSLTAFSDSDWVGDSLDRRSTTGFIIFLGHNPISWAAKKQSTVSRSSTEAEYRALASTAAELFWLRQL

Query:  LRDHCIFGSQPP
        L +  I  ++PP
Subjt:  LRDHCIFGSQPP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e value: 4.4e-68 | %identity: 44.87
Query:  MQQPRGFVNSTYPNHVCQLLKSLYGLKQAPRVWFECFTSHLLSLDFVAFTADSSLFIRHTCDTITYLLLYVDDIILTGNSPFYISALIAQLRTKFDMSDL
        M QP GFV+   P++VC+L K++YGLKQAPR W+    ++LL++ FV   +D+SLF+     +I Y+L+YVDDI++TGN    +   +  L  +F + + 
Subjt:  MQQPRGFVNSTYPNHVCQLLKSLYGLKQAPRVWFECFTSHLLSLDFVAFTADSSLFIRHTCDTITYLLLYVDDIILTGNSPFYISALIAQLRTKFDMSDL

Query:  GSLKYFLGLEVSHT-AGISVSQTKYAKDILARFGMAVAKYCNTPSALSSAV---NGAECPCSDEDAKSFRSIVGAFQYLTFTRPDIAFCVSKLSQSMHSP
          L YFLG+E      G+ +SQ +Y  D+LAR  M  AK   TP A S  +   +G + P    D   +R IVG+ QYL FTRPD+++ V++LSQ MH P
Subjt:  GSLKYFLGLEVSHT-AGISVSQTKYAKDILARFGMAVAKYCNTPSALSSAV---NGAECPCSDEDAKSFRSIVGAFQYLTFTRPDIAFCVSKLSQSMHSP

Query:  SAASLIDAKRVLLYVNGTLSSGLLFRKGHSNCFSLTAFSDSDWVGDSLDRRSTTGFIIFLGHNPISWAAKKQSTVSRSSTEAEYRALASTAAELFWLRQL
        +       KRVL Y+ GT   G+  +KG  N  SL A+SD+DW GD+ D  ST G+I++LGH+PISW++KKQ  V RSSTEAEYR++A+T++EL W+  L
Subjt:  SAASLIDAKRVLLYVNGTLSSGLLFRKGHSNCFSLTAFSDSDWVGDSLDRRSTTGFIIFLGHNPISWAAKKQSTVSRSSTEAEYRALASTAAELFWLRQL

Query:  LRDHCIFGSQPP
        L +  I  S PP
Subjt:  LRDHCIFGSQPP

Arabidopsis top hits | e value | %identity | Alignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | e value: 8.3e-54 | %identity: 40.72
Query:  MQQPRGFV----NSTYPNHVCQLLKSLYGLKQAPRVWFECFTSHLLSLDFVAFTADSSLFIRHTCDTITYLLLYVDDIILTGNSPFYISALIAQLRTKFD
        M+ P G+     +S  PN VC L KS+YGLKQA R WF  F+  L+   FV   +D + F++ T      +L+YVDDII+  N+   +  L +QL++ F 
Subjt:  MQQPRGFV----NSTYPNHVCQLLKSLYGLKQAPRVWFECFTSHLLSLDFVAFTADSSLFIRHTCDTITYLLLYVDDIILTGNSPFYISALIAQLRTKFD

Query:  MSDLGSLKYFLGLEVSHT-AGISVSQTKYAKDILARFGMAVAKYCNT---PSALSSAVNGAECPCSDEDAKSFRSIVGAFQYLTFTRPDIAFCVSKLSQS
        + DLG LKYFLGLE++ + AGI++ Q KYA D+L   G+   K  +    PS   SA +G +      DAK++R ++G   YL  TR DI+F V+KLSQ 
Subjt:  MSDLGSLKYFLGLEVSHT-AGISVSQTKYAKDILARFGMAVAKYCNT---PSALSSAVNGAECPCSDEDAKSFRSIVGAFQYLTFTRPDIAFCVSKLSQS

Query:  MHSPSAASLIDAKRVLLYVNGTLSSGLLFRKGHSNCFSLTAFSDSDWVGDSLDRRSTTGFIIFLGHNPISWAAKKQSTVSRSSTEAEYRALASTAAELFW
          +P  A      ++L Y+ GT+  GL +         L  FSD+ +      RRST G+ +FLG + ISW +KKQ  VS+SS EAEYRAL+    E+ W
Subjt:  MHSPSAASLIDAKRVLLYVNGTLSSGLLFRKGHSNCFSLTAFSDSDWVGDSLDRRSTTGFIIFLGHNPISWAAKKQSTVSRSSTEAEYRALASTAAELFW

Query:  LRQLLRD
        L Q  R+
Subjt:  LRQLLRD

ATMG00240.1 Gag-Pol-related retrotransposon family protein | e value: 9.7e-10 | %identity: 43.04
Query:  YLTFTRPDIAFCVSKLSQSMHSPSAASLIDAKRVLLYVNGTLSSGLLFRKGHSNCFSLTAFSDSDWVGDSLDRRSTTGF
        YLT TRPD+ F V++LSQ   +   A +    +VL YV GT+  GL +    ++   L AF+DSDW      RRS TGF
Subjt:  YLTFTRPDIAFCVSKLSQSMHSPSAASLIDAKRVLLYVNGTLSSGLLFRKGHSNCFSLTAFSDSDWVGDSLDRRSTTGF

ATMG00810.1 DNA/RNA polymerases superfamily protein | e value: 1.2e-55 | %identity: 53.91
Query:  YLLLYVDDIILTGNSPFYISALIAQLRTKFDMSDLGSLKYFLGLEV-SHTAGISVSQTKYAKDILARFGMAVAKYCNT--PSALSSAVNGAECPCSDEDA
        YLLLYVDDI+LTG+S   ++ LI QL + F M DLG + YFLG+++ +H +G+ +SQTKYA+ IL   GM   K  +T  P  L+S+V+ A+ P    D 
Subjt:  YLLLYVDDIILTGNSPFYISALIAQLRTKFDMSDLGSLKYFLGLEV-SHTAGISVSQTKYAKDILARFGMAVAKYCNT--PSALSSAVNGAECPCSDEDA

Query:  KSFRSIVGAFQYLTFTRPDIAFCVSKLSQSMHSPSAASLIDAKRVLLYVNGTLSSGLLFRKGHSNCFSLTAFSDSDWVGDSLDRRSTTGFIIFLGHNPIS
          FRSIVGA QYLT TRPDI++ V+ + Q MH P+ A     KRVL YV GT+  GL   K  ++  ++ AF DSDW G +  RRSTTGF  FLG N IS
Subjt:  KSFRSIVGAFQYLTFTRPDIAFCVSKLSQSMHSPSAASLIDAKRVLLYVNGTLSSGLLFRKGHSNCFSLTAFSDSDWVGDSLDRRSTTGFIIFLGHNPIS

Query:  WAAKKQSTVSRSSTEAEYRALASTAAELFW
        W+AK+Q TVSRSSTE EYRALA TAAEL W
Subjt:  WAAKKQSTVSRSSTEAEYRALASTAAELFW
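
Each alignment above is printed as a Query line, an unlabeled middle line, and a Subjt line. Assuming the middle line follows the usual BLAST convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap), identities and positives can be counted from it directly. The sketch below is illustrative only: `midline_stats` is a hypothetical helper, and the input is the short final block of the Prunus persica hit with its leading label padding removed.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identities and positives in one BLAST-style alignment block.

    Assumption: letters in the midline mark identical residues and '+'
    marks conservative substitutions; everything else is a mismatch or gap.
    """
    # Trailing spaces are often trimmed from the midline; pad it back out.
    midline = midline.ljust(len(query))
    identical = sum(ch.isalpha() for ch in midline)
    positives = identical + midline.count("+")
    length = len(query)
    return {
        "length": length,
        "identical": identical,
        "positives": positives,
        "pct_identity": round(100.0 * identical / length, 2),
    }


stats = midline_stats("DHCIFGSQPP", "D  +F  +PP")
print(stats)
```

A single block like this is not expected to reproduce the %identity reported for a hit, which is computed over the full alignment; summing `identical` and `length` across all blocks of a hit would be the closer comparison.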


Sequences
CDS sequence
ATGCAACAACCTCGAGGCTTTGTCAATTCTACCTATCCTAATCATGTTTGTCAGTTGCTTAAGTCTCTTTATGGTCTAAAGCAAGCACCACGAGTCTGGTTTGAG
TGCTTTACATCTCATCTCTTGTCTCTTGATTTTGTTGCATTCACAGCAGATTCATCTTTGTTTATTCGGCATACTTGTGACACTATTACTTATCTGCTCTTGTAT
GTGGACGATATCATCCTTACTGGTAATAGTCCCTTTTATATTTCTGCTTTGATTGCTCAATTACGAACTAAATTTGATATGAGTGATCTTGGAAGTCTCAAGTAT
TTTTTGGGCCTTGAGGTCTCCCATACTGCTGGAATTAGTGTATCTCAAACCAAGTATGCAAAAGATATTCTTGCAAGGTTTGGTATGGCTGTTGCTAAATATTGT
AACACTCCCAGTGCTTTGTCCTCTGCTGTAAATGGTGCTGAATGTCCTTGTTCTGATGAGGATGCAAAATCTTTTCGTTCTATTGTTGGTGCCTTTCAGTATTTA
ACGTTTACTCGCCCAGATATTGCATTCTGTGTGAGTAAATTGTCTCAATCCATGCATTCTCCATCTGCTGCCAGCTTAATTGATGCCAAACGAGTCTTACTTTAT
GTAAATGGCACCTTATCATCTGGACTGCTTTTTAGAAAGGGTCATTCGAATTGTTTTAGTCTCACTGCTTTTTCTGATTCTGATTGGGTCGGGGATTCTCTTGAT
CGGCGCTCTACTACTGGTTTTATCATTTTTTTGGGTCATAATCCCATATCTTGGGCAGCCAAGAAGCAATCTACAGTCTCGAGGAGCTCTACTGAAGCTGAATAT
CGAGCGCTTGCTTCCACTGCAGCTGAGTTGTTTTGGCTCCGACAGCTTCTGCGTGATCATTGTATTTTTGGATCACAACCTCCTTGA
mRNA sequence
ATGCAACAACCTCGAGGCTTTGTCAATTCTACCTATCCTAATCATGTTTGTCAGTTGCTTAAGTCTCTTTATGGTCTAAAGCAAGCACCACGAGTCTGGTTTGAG
TGCTTTACATCTCATCTCTTGTCTCTTGATTTTGTTGCATTCACAGCAGATTCATCTTTGTTTATTCGGCATACTTGTGACACTATTACTTATCTGCTCTTGTAT
GTGGACGATATCATCCTTACTGGTAATAGTCCCTTTTATATTTCTGCTTTGATTGCTCAATTACGAACTAAATTTGATATGAGTGATCTTGGAAGTCTCAAGTAT
TTTTTGGGCCTTGAGGTCTCCCATACTGCTGGAATTAGTGTATCTCAAACCAAGTATGCAAAAGATATTCTTGCAAGGTTTGGTATGGCTGTTGCTAAATATTGT
AACACTCCCAGTGCTTTGTCCTCTGCTGTAAATGGTGCTGAATGTCCTTGTTCTGATGAGGATGCAAAATCTTTTCGTTCTATTGTTGGTGCCTTTCAGTATTTA
ACGTTTACTCGCCCAGATATTGCATTCTGTGTGAGTAAATTGTCTCAATCCATGCATTCTCCATCTGCTGCCAGCTTAATTGATGCCAAACGAGTCTTACTTTAT
GTAAATGGCACCTTATCATCTGGACTGCTTTTTAGAAAGGGTCATTCGAATTGTTTTAGTCTCACTGCTTTTTCTGATTCTGATTGGGTCGGGGATTCTCTTGAT
CGGCGCTCTACTACTGGTTTTATCATTTTTTTGGGTCATAATCCCATATCTTGGGCAGCCAAGAAGCAATCTACAGTCTCGAGGAGCTCTACTGAAGCTGAATAT
CGAGCGCTTGCTTCCACTGCAGCTGAGTTGTTTTGGCTCCGACAGCTTCTGCGTGATCATTGTATTTTTGGATCACAACCTCCTTGA
Protein sequence
MQQPRGFVNSTYPNHVCQLLKSLYGLKQAPRVWFECFTSHLLSLDFVAFTADSSLFIRHTCDTITYLLLYVDDIILTGNSPFYISALIAQLRTKFDMSDLGSLKY
FLGLEVSHTAGISVSQTKYAKDILARFGMAVAKYCNTPSALSSAVNGAECPCSDEDAKSFRSIVGAFQYLTFTRPDIAFCVSKLSQSMHSPSAASLIDAKRVLLY
VNGTLSSGLLFRKGHSNCFSLTAFSDSDWVGDSLDRRSTTGFIIFLGHNPISWAAKKQSTVSRSSTEAEYRALASTAAELFWLRQLLRDHCIFGSQPP
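
The protein sequence can be cross-checked against the CDS by translating it. A minimal sketch follows, assuming the standard genetic code (NCBI translation table 1); the helper name `translate` is hypothetical, and only the first 105-nt line of the CDS is used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a coding sequence in frame 1; unknown codons become 'X'."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break  # stop codon ends the protein
        protein.append(aa)
    return "".join(protein)


# First line of the CDS sequence listed above (105 nt, 35 codons).
first_line = (
    "ATGCAACAACCTCGAGGCTTTGTCAATTCTACCTATCCTAATCATGTTTGTCAGTTGCTT"
    "AAGTCTCTTTATGGTCTAAAGCAAGCACCACGAGTCTGGTTTGAG"
)
print(translate(first_line))  # MQQPRGFVNSTYPNHVCQLLKSLYGLKQAPRVWFE
```

The output matches the first 35 residues of the protein sequence. Passing the full CDS should stop at the terminal TGA codon.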