; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc05g12980 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc05g12980
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr5:10018781..10025432
RNA-Seq ExpressionMoc05g12980
SyntenyMoc05g12980
Gene Ontology termsNA
InterPro domainsIPR025724 - GAG-pre-integrase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0065480.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Cucumis melo var. makuwa]2.5e-5735.25Show/hide
Query:  VNQLKRDLATITQEQQSVSLYFTKLKTSWDELHQYRPICTCKCNCGGAKSASDFLQLEYMMNFLMGLNKSYASTRAQLLLMDPPPSHST-----------
        + QL+++  T+ Q   ++  Y+TKLKT W  L++YR   T  C CGG K   D L+ EY+M FLMGLN SYA+ RAQ+LLM P PS +T           
Subjt:  VNQLKRDLATITQEQQSVSLYFTKLKTSWDELHQYRPICTCKCNCGGAKSASDFLQLEYMMNFLMGLNKSYASTRAQLLLMDPPPSHST-----------

Query:  -----FAASHSPVAAK--SSKANISSMTKGQTRPVCSYCRIVGHTADRCYKLHGYPPGYRNQNQRSSSASSSASTLVPSKPVDSSAMASLHSRPDVVN--
                   PVA    SS A  +   + + RP CSYC I GH AD+CYK HGYPPGY+ +N  S + +   S    +  V ++  A+ +  PD  +  
Subjt:  -----FAASHSPVAAK--SSKANISSMTKGQTRPVCSYCRIVGHTADRCYKLHGYPPGYRNQNQRSSSASSSASTLVPSKPVDSSAMASLHSRPDVVN--

Query:  ---QCQQLLHLLQSQFSVGKSVTTEKDTYVSYTAG----------TSQSWILDSGASAHICCSHPLFDTFSKTTPVYVNLPNKSRFLVEYSGSISFSENF
           Q  QL+ LL +      +      T +++T+G          +   WI+DSGAS HIC    LF  +S T  ++V LPN  R  V+  G I  + + 
Subjt:  ---QCQQLLHLLQSQFSVGKSVTTEKDTYVSYTAG----------TSQSWILDSGASAHICCSHPLFDTFSKTTPVYVNLPNKSRFLVEYSGSISFSENF

Query:  RLRGVLYIPEFHFNLISISMLLRDIPSLSVEFSNNCCMLKDKSISRMIGKGSLCHGLYLFDRQPATAHSICAFTSAHVSFEVWHSRFGHPSFSRLQVLKD
         L+ VL++ +F +NLIS+S LL    ++S++F + CC+++D S   MIGK S  +GLY+ +++  T           +S + WH R GH S   L  L  
Subjt:  RLRGVLYIPEFHFNLISISMLLRDIPSLSVEFSNNCCMLKDKSISRMIGKGSLCHGLYLFDRQPATAHSICAFTSAHVSFEVWHSRFGHPSFSRLQVLKD

Query:  VLSLDAYFLKQSAVDAC
         L L  + +  S+   C
Subjt:  VLSLDAYFLKQSAVDAC

XP_012836458.1 PREDICTED: uncharacterized protein LOC105957084 [Erythranthe guttata]8.8e-5536.69Show/hide
Query:  NTRLVLVN-----QLKRDLATITQEQQSVSLYFTKLKTSWDELHQYRPICTC-KCNCGGAKSASDFLQLEYMMNFLMGLNKSYASTRAQLLLMDP-PPSH
        NTR   +N     QL+R+L  +TQ+ QSV++YFTKLK  WDEL  +RP CTC  C CGG +  ++   LE++M FLMGLN+S  STR Q+LLMDP PP +
Subjt:  NTRLVLVN-----QLKRDLATITQEQQSVSLYFTKLKTSWDELHQYRPICTC-KCNCGGAKSASDFLQLEYMMNFLMGLNKSYASTRAQLLLMDP-PPSH

Query:  STFA------------ASHSPV---------------AAKSSKANISSMTKGQTRPVCSYCRIVGHTADRCYKLHGYPPGYRNQNQRSSSASSSASTLVP
          FA            +SH+ V                +  ++A  +S  K + R  C++C I GHT D+CYKLHGYPPGY+ + +  S   S  S    
Subjt:  STFA------------ASHSPV---------------AAKSSKANISSMTKGQTRPVCSYCRIVGHTADRCYKLHGYPPGYRNQNQRSSSASSSASTLVP

Query:  S---KPVDSSAMASLHSRPDVVN-----------QCQQLLHLLQSQFSVGKSVTTEKD-------TYVSYTAGTSQS-----------WILDSGASAHIC
        S    P+D ++  S  S+P +V+           QCQQL+    +Q +  K V  ++         ++S  +G   +           WI+DSGAS HIC
Subjt:  S---KPVDSSAMASLHSRPDVVN-----------QCQQLLHLLQSQFSVGKSVTTEKD-------TYVSYTAGTSQS-----------WILDSGASAHIC

Query:  CSHPLFDTFSKTTPVYVNLPNKSRFLVEYSGSISFSENFRLRGVLYIPEFHFNLISISMLLRDIPSLSVEFSNNCCMLKDKSISRMIGKGSLCHGLYLFD
            LF +  K     V LP+ S  +VEY G +  S++  L+ V Y+P F FNLIS+S LL ++P  +V F +N  +++DK + + IGKGS   GLY+  
Subjt:  CSHPLFDTFSKTTPVYVNLPNKSRFLVEYSGSISFSENFRLRGVLYIPEFHFNLISISMLLRDIPSLSVEFSNNCCMLKDKSISRMIGKGSLCHGLYLFD

Query:  RQPATAHSICAFTSAHV
            +    C   SA V
Subjt:  RQPATAHSICAFTSAHV

XP_022143573.1 uncharacterized protein LOC111013441 [Momordica charantia]5.6e-11848.99Show/hide
Query:  EQQSVSLYFTKLKTSWDELHQYRPICTCKCNCGGAKSASDFLQLEYMMNFLMGLNKSYASTRAQLLLMDPPPSHSTFAASHSPVAAKSSKANISSMTKGQ
        EQQSVSLYFTKLKT WDELHQ+RP+CTC C CGGAKS S+FLQLEY++N LMGL++ Y STRA+LLLMDPPPS +    + S V     + +I +     
Subjt:  EQQSVSLYFTKLKTSWDELHQYRPICTCKCNCGGAKSASDFLQLEYMMNFLMGLNKSYASTRAQLLLMDPPPSHSTFAASHSPVAAKSSKANISSMTKGQ

Query:  TRPVCSYCRIVGHTADR----------CYKLHGYPPGYRNQNQRSSSASSSASTLVPSKPVDSSAMASLHSRPDVVNQCQQLLHLLQSQFSVGKSVTTEK
        T P  S+  +  H+A +            KLHGYPPGYR+  QR S  S +      SK  + SA+A+ H+    ++  QQL  LLQSQ S GK V  + 
Subjt:  TRPVCSYCRIVGHTADR----------CYKLHGYPPGYRNQNQRSSSASSSASTLVPSKPVDSSAMASLHSRPDVVNQCQQLLHLLQSQFSVGKSVTTEK

Query:  DTYVSYTAGT-SQSWILDSGASAHICCSHPLFDTFSKTTPVYVNLPNKSRFLVEYSGSISFSENFRLRGVLYIPEFHFNLISISMLLRDIPSLSVEFSNN
        DT  SYT  T +QS ILD GASAHIC    LFD   K +PV+VNLPNK RF+VEYSG +  S++  + GVLYIPEF+FNLIS+++LLRD+PSLSVEF+N+
Subjt:  DTYVSYTAGT-SQSWILDSGASAHICCSHPLFDTFSKTTPVYVNLPNKSRFLVEYSGSISFSENFRLRGVLYIPEFHFNLISISMLLRDIPSLSVEFSNN

Query:  CCMLKDKSISRMIGKGSLCHGLYLFDRQ---PATAHSICAFTSAHVSFEVWHSRFGHPSFSRLQVLKDVLSLDAYFLKQSAVDAC---------SDFSVD
         C+++DKSIS+ I KG LCHGLYL D      +T  SIC      VSF+VWH+R GHPSF+RL  LK VL +D   L+    D+C         +D SV+
Subjt:  CCMLKDKSISRMIGKGSLCHGLYLFDRQ---PATAHSICAFTSAHVSFEVWHSRFGHPSFSRLQVLKDVLSLDAYFLKQSAVDAC---------SDFSVD

Query:  PFPDLVLPCAVDFQTTTSPSTIIEATSSCDTYVPLVVHSDPIVEEPPI---------TSSTSTSMPLTTDLAP-TSPAGGDLNLISNTNDRASTIVFRRS
        PFPDLVLP  +DFQ    P+  I+ T++ D  +P VV +  I    PI         ++  S+S P+ ++  P T+P+       + ++     IV RRS
Subjt:  PFPDLVLPCAVDFQTTTSPSTIIEATSSCDTYVPLVVHSDPIVEEPPI---------TSSTSTSMPLTTDLAP-TSPAGGDLNLISNTNDRASTIVFRRS

Query:  SRPTKMSSYLRDFHCNLMACTPLPIASTKHSLQQYLSYSRLSPAY
        +RP+KM SYL+DFHC+L+  +    AST+H LQQYLSYSRLS A+
Subjt:  SRPTKMSSYLRDFHCNLMACTPLPIASTKHSLQQYLSYSRLSPAY

XP_022856063.1 uncharacterized protein LOC111377235, partial [Olea europaea var. sylvestris]2.2e-6137.73Show/hide
Query:  QLKRDLATITQEQQSVSLYFTKLKTSWDELHQYRPICTC-KCNCGGAKSASDFLQLEYMMNFLMGLNKSYASTRAQLLLMDPPPSHSTFAASHSPVAAKS
        QL+R+L  +TQ Q SV +YFTKLKT W+EL  YRPIC+C KC+CG  K  S+  Q+EY+M+FLMGLN ++A  R QLLLMDP PS +      S V+ + 
Subjt:  QLKRDLATITQEQQSVSLYFTKLKTSWDELHQYRPICTC-KCNCGGAKSASDFLQLEYMMNFLMGLNKSYASTRAQLLLMDPPPSHSTFAASHSPVAAKS

Query:  SKANIS----------SMT--------KG---------QTRPVCSYCRIVGHTADRCYKLHGYPPGYRNQ----------NQRSSSASSSASTLVPSKPV
         + NIS          SM         KG         + +P+C+YC++VGH+ D+CYKLHGYPPGY+ +          NQ    A++       S+  
Subjt:  SKANIS----------SMT--------KG---------QTRPVCSYCRIVGHTADRCYKLHGYPPGYRNQ----------NQRSSSASSSASTLVPSKPV

Query:  DSSAMASLHSRPDVVNQCQQLLHLLQSQFSVGKSVTTEKDTYVSYTAGTSQS------------WILDSGASAHICCSHPLFDTFSKTTPVYVNLPNKSR
         ++A   +HS      Q QQL+++L +  +  K+  T  D + +  +GT  S            W++DSGA++HIC S   F +       YV LPN  R
Subjt:  DSSAMASLHSRPDVVNQCQQLLHLLQSQFSVGKSVTTEKDTYVSYTAGTSQS------------WILDSGASAHICCSHPLFDTFSKTTPVYVNLPNKSR

Query:  FLVEYSGSISFSENFRLRGVLYIPEFHFNLISISMLLRDIPSLSVEFSNNCCMLKDKSISRMIGKGSLCHGLYL-----FDRQPATAHS-----ICAFTS
          V + G + FS N  L  VLY+P F FNL+S+S L++D  ++ V F  N C ++D    +MIGKG    GLY+      D  P  + +     +C   S
Subjt:  FLVEYSGSISFSENFRLRGVLYIPEFHFNLISISMLLRDIPSLSVEFSNNCCMLKDKSISRMIGKGSLCHGLYL-----FDRQPATAHS-----ICAFTS

Query:  A---HVSFEVWHSRFGHPSFSRLQVLKDVLSL
        +    V+   WH R GH SF +L  LKD L L
Subjt:  A---HVSFEVWHSRFGHPSFSRLQVLKDVLSL

XP_030941568.1 uncharacterized protein LOC115966477 [Quercus lobata]1.5e-5430.5Show/hide
Query:  VNQLKRDLATITQEQQSVSLYFTKLKTSWDELHQYRPI--CTC-KCNCGGAKSASDFLQLEYMMNFLMGLNKSYASTRAQLLLMDPPPS-----------
        V  L++D+A + Q + S++ +FT+LK  WD+L    P   CTC +C C   K  +D    E +M FLMGLN S++  R+Q+LLMDP PS           
Subjt:  VNQLKRDLATITQEQQSVSLYFTKLKTSWDELHQYRPI--CTC-KCNCGGAKSASDFLQLEYMMNFLMGLNKSYASTRAQLLLMDPPPS-----------

Query:  -------HSTFA-ASHSPVAAKSSKANISSMT----KGQTRPVCSYCRIVGHTADRCYKLHGYPPGYRNQN------QRSSSAS----------------
               +++FA    + +AAK S  ++ S      KG+ RP C++    GHT ++CYK HG+PPG++ +N      Q SS AS                
Subjt:  -------HSTFA-ASHSPVAAKSSKANISSMT----KGQTRPVCSYCRIVGHTADRCYKLHGYPPGYRNQN------QRSSSAS----------------

Query:  ----------SSASTLVPSKPVDSSAMASLHSRPDVVNQCQQLLHLLQSQFSVGKSVTTEKDTYVSYTAGTSQSWILDSGASAHICCSHPLFDTFSKTTP
                  +SAS+++PS PV  +++A++ S   + +     + L  S FS   +    +  Y  +T      W+LD+GA+ H  CS  L  + + T  
Subjt:  ----------SSASTLVPSKPVDSSAMASLHSRPDVVNQCQQLLHLLQSQFSVGKSVTTEKDTYVSYTAGTSQSWILDSGASAHICCSHPLFDTFSKTTP

Query:  VYVNLPNKSRFLVEYSGSISFSENFRLRGVLYIPEFHFNLISISMLLRDIPSLSVEFSNNCCMLKDKSISRMIGKGSLCHGLYLF--DRQPATAHSICAF
          V+LPN     V + G++  S +  L  VL +P F FNL+S+S L    P   V F +  C ++D    + IG G    GLYL   D      HS  A 
Subjt:  VYVNLPNKSRFLVEYSGSISFSENFRLRGVLYIPEFHFNLISISMLLRDIPSLSVEFSNNCCMLKDKSISRMIGKGSLCHGLYLF--DRQPATAHSICAF

Query:  ------------------TSAHVSFEVWHSRFGHPSFSRLQVLKD------VLSLDAYFLKQSAVDACSDFSVDPFPDLVLPCAVDFQTTTSPSTIIEAT
                          T++  S  +WH+R GHPS  +L+VL         + LD  F +       S     PF  L LPC        SP + +   
Subjt:  ------------------TSAHVSFEVWHSRFGHPSFSRLQVLKD------VLSLDAYFLKQSAVDACSDFSVDPFPDLVLPCAVDFQTTTSPSTIIEAT

Query:  SSCDTYVPLVVHSDPIVEEPPITSSTS---TSMPLTTDLAPTSPAGGDLNLISNTNDRASTIVFRRSSRPTKMSSYLRDFHCNLMACTPLPIAS----TK
                   H DP++ + P  +  S       +  DL    P        ++ +  A+ +  RRS+R     SYL+DFHCN  A +  P++S    T 
Subjt:  SSCDTYVPLVVHSDPIVEEPPITSSTS---TSMPLTTDLAPTSPAGGDLNLISNTNDRASTIVFRRSSRPTKMSSYLRDFHCNLMACTPLPIAS----TK

Query:  HSLQQYLSYSRLSPAYYSFALNVSASYEPQFYHQAV
        H L  ++SY  LSP+Y ++   +S+  EPQFYHQAV
Subjt:  HSLQQYLSYSRLSPAYYSFALNVSASYEPQFYHQAV

TrEMBL top hitse value%identityAlignment
A0A2Z7C0E8 Integrase catalytic domain-containing protein1.5e-5234.29Show/hide
Query:  AASTQSGNEGATNPYYLHHTDNTRLVLVNQLKRDLATITQEQQSVSLYFTKLKTSWDELHQYRPICTCKCNCGGAKSASDFLQLEYMMNFLMGLNKSYAS
        A S  +  E   +P    H  N   +   Q+KR LA + Q    ++ Y+TK++T WDEL  ++PI  C+  CG  K   D+   E  M FLMGLN+SYA 
Subjt:  AASTQSGNEGATNPYYLHHTDNTRLVLVNQLKRDLATITQEQQSVSLYFTKLKTSWDELHQYRPICTCKCNCGGAKSASDFLQLEYMMNFLMGLNKSYAS

Query:  TRAQLLLMDPPPSHSTF----------AASHSPVAAK--------SSKANISSMTKGQTRP--------VCSYCRIVGHTADRCYKLHGYPPGYRNQNQR
         RAQ+LLMDP P  S             + H  V  K        +  AN++++ KG   P         C++C +  HT D+CYKLHGYPPG+     +
Subjt:  TRAQLLLMDPPPSHSTF----------AASHSPVAAK--------SSKANISSMTKGQTRP--------VCSYCRIVGHTADRCYKLHGYPPGYRNQNQR

Query:  SSSASSSASTLVPSKPVDSSAMASLHSRPDVVNQCQQLLHLLQSQFSVGKSVTTE---------------KDTY---VSYTAGTSQSWILDSGASAHICC
         S   S      P     +S +  +  +P+    C+QL+  L SQ  +G   T                  DTY    S+TA  + SWI+D+GA+ HICC
Subjt:  SSSASSSASTLVPSKPVDSSAMASLHSRPDVVNQCQQLLHLLQSQFSVGKSVTTE---------------KDTY---VSYTAGTSQSWILDSGASAHICC

Query:  SHPLFDTFSKTTPVYVNLPNKSRFLVEYSGSISFSENFRLRGVLYIPEFHFNLISISMLLRDIPSLSVEFSNNCCMLKDKSISRMIGKGSLCHGLYLFDR
        S   F +F K     V LPN     V + GS+       L+ VL++P+F FNL+SIS L + IP  SV FS+  C ++  + +R IG G     LYL   
Subjt:  SHPLFDTFSKTTPVYVNLPNKSRFLVEYSGSISFSENFRLRGVLYIPEFHFNLISISMLLRDIPSLSVEFSNNCCMLKDKSISRMIGKGSLCHGLYLFDR

Query:  QPATAHSICAFTSAHVSFEVWHSRFGHPSFSRLQVLKDVLSLDAYFLKQSAVDAC
         P++   +CA  + H   ++WH R GH S  RL +L DV  +   F    A+ AC
Subjt:  QPATAHSICAFTSAHVSFEVWHSRFGHPSFSRLQVLKDVLSLDAYFLKQSAVDAC

A0A438JAT7 Retrovirus-related Pol polyprotein from transposon TNT 1-942.8e-5436.74Show/hide
Query:  QLKRDLATITQEQQSVSLYFTKLKTSWDELHQYRPICTCKCNCGGAKSASDFLQLEYMMNFLMGLNKSYASTRAQLLLMDPPPS-------------HST
        Q+K+ L+ + Q    V+ YFTKLK  WDEL +++P+    C+CGG +  +D+   EY++ FLMGLN SYA  R Q+L+MDP P+             H T
Subjt:  QLKRDLATITQEQQSVSLYFTKLKTSWDELHQYRPICTCKCNCGGAKSASDFLQLEYMMNFLMGLNKSYASTRAQLLLMDPPPS-------------HST

Query:  ----FAASH-SPVAAKSSKANISSMTKG-----QTRPVCSYCRIVGHTADRCYKLHGYPPG--YRNQNQRSSSASSSASTLVPSKPVDSSAMASLHSRPD
            ++ SH S      S +N  + + G     + R  CSYC   GH  D+CYKL GYPPG  ++N+   SSS ++++  L       S ++ S  +   
Subjt:  ----FAASH-SPVAAKSSKANISSMTKG-----QTRPVCSYCRIVGHTADRCYKLHGYPPG--YRNQNQRSSSASSSASTLVPSKPVDSSAMASLHSRPD

Query:  VVNQCQQLLHLLQSQFSVGKSVTTEKDT---YVSYTAGT-----SQSWILDSGASAHICCSHPLFDTFSKTTPVYVNLPNKSRFLVEYSGSISFSENFRL
           QCQQL+ LL +Q S   S +TE  +    VS  AG      ++ WI+DSGA+ H+C    LFD+      V V LP      ++  GS+  S++ +L
Subjt:  VVNQCQQLLHLLQSQFSVGKSVTTEKDT---YVSYTAGT-----SQSWILDSGASAHICCSHPLFDTFSKTTPVYVNLPNKSRFLVEYSGSISFSENFRL

Query:  RGVLYIPEFHFNLISISMLLRDIPSLSVEFSNNCCMLKDKSISRMIGKGSLCHGLYLFDRQPATAHSICAFTSA-----HVSFEVWHSRFGHPSFSRLQV
          VL++P F +NL+S+S    D  SLS+ F+ + C+++  S  +MIGKGS    LY  D     A+   AF +A          +WHSR GHPSFSRL+ 
Subjt:  RGVLYIPEFHFNLISISMLLRDIPSLSVEFSNNCCMLKDKSISRMIGKGSLCHGLYLFDRQPATAHSICAFTSA-----HVSFEVWHSRFGHPSFSRLQV

Query:  LKDVLSLDAYF
        L+ VL  D+ F
Subjt:  LKDVLSLDAYF

A0A5A7VE66 Cysteine-rich RLK (Receptor-like protein kinase) 81.2e-5735.25Show/hide
Query:  VNQLKRDLATITQEQQSVSLYFTKLKTSWDELHQYRPICTCKCNCGGAKSASDFLQLEYMMNFLMGLNKSYASTRAQLLLMDPPPSHST-----------
        + QL+++  T+ Q   ++  Y+TKLKT W  L++YR   T  C CGG K   D L+ EY+M FLMGLN SYA+ RAQ+LLM P PS +T           
Subjt:  VNQLKRDLATITQEQQSVSLYFTKLKTSWDELHQYRPICTCKCNCGGAKSASDFLQLEYMMNFLMGLNKSYASTRAQLLLMDPPPSHST-----------

Query:  -----FAASHSPVAAK--SSKANISSMTKGQTRPVCSYCRIVGHTADRCYKLHGYPPGYRNQNQRSSSASSSASTLVPSKPVDSSAMASLHSRPDVVN--
                   PVA    SS A  +   + + RP CSYC I GH AD+CYK HGYPPGY+ +N  S + +   S    +  V ++  A+ +  PD  +  
Subjt:  -----FAASHSPVAAK--SSKANISSMTKGQTRPVCSYCRIVGHTADRCYKLHGYPPGYRNQNQRSSSASSSASTLVPSKPVDSSAMASLHSRPDVVN--

Query:  ---QCQQLLHLLQSQFSVGKSVTTEKDTYVSYTAG----------TSQSWILDSGASAHICCSHPLFDTFSKTTPVYVNLPNKSRFLVEYSGSISFSENF
           Q  QL+ LL +      +      T +++T+G          +   WI+DSGAS HIC    LF  +S T  ++V LPN  R  V+  G I  + + 
Subjt:  ---QCQQLLHLLQSQFSVGKSVTTEKDTYVSYTAG----------TSQSWILDSGASAHICCSHPLFDTFSKTTPVYVNLPNKSRFLVEYSGSISFSENF

Query:  RLRGVLYIPEFHFNLISISMLLRDIPSLSVEFSNNCCMLKDKSISRMIGKGSLCHGLYLFDRQPATAHSICAFTSAHVSFEVWHSRFGHPSFSRLQVLKD
         L+ VL++ +F +NLIS+S LL    ++S++F + CC+++D S   MIGK S  +GLY+ +++  T           +S + WH R GH S   L  L  
Subjt:  RLRGVLYIPEFHFNLISISMLLRDIPSLSVEFSNNCCMLKDKSISRMIGKGSLCHGLYLFDRQPATAHSICAFTSAHVSFEVWHSRFGHPSFSRLQVLKD

Query:  VLSLDAYFLKQSAVDAC
         L L  + +  S+   C
Subjt:  VLSLDAYFLKQSAVDAC

A0A6J1CR17 uncharacterized protein LOC1110134412.7e-11848.99Show/hide
Query:  EQQSVSLYFTKLKTSWDELHQYRPICTCKCNCGGAKSASDFLQLEYMMNFLMGLNKSYASTRAQLLLMDPPPSHSTFAASHSPVAAKSSKANISSMTKGQ
        EQQSVSLYFTKLKT WDELHQ+RP+CTC C CGGAKS S+FLQLEY++N LMGL++ Y STRA+LLLMDPPPS +    + S V     + +I +     
Subjt:  EQQSVSLYFTKLKTSWDELHQYRPICTCKCNCGGAKSASDFLQLEYMMNFLMGLNKSYASTRAQLLLMDPPPSHSTFAASHSPVAAKSSKANISSMTKGQ

Query:  TRPVCSYCRIVGHTADR----------CYKLHGYPPGYRNQNQRSSSASSSASTLVPSKPVDSSAMASLHSRPDVVNQCQQLLHLLQSQFSVGKSVTTEK
        T P  S+  +  H+A +            KLHGYPPGYR+  QR S  S +      SK  + SA+A+ H+    ++  QQL  LLQSQ S GK V  + 
Subjt:  TRPVCSYCRIVGHTADR----------CYKLHGYPPGYRNQNQRSSSASSSASTLVPSKPVDSSAMASLHSRPDVVNQCQQLLHLLQSQFSVGKSVTTEK

Query:  DTYVSYTAGT-SQSWILDSGASAHICCSHPLFDTFSKTTPVYVNLPNKSRFLVEYSGSISFSENFRLRGVLYIPEFHFNLISISMLLRDIPSLSVEFSNN
        DT  SYT  T +QS ILD GASAHIC    LFD   K +PV+VNLPNK RF+VEYSG +  S++  + GVLYIPEF+FNLIS+++LLRD+PSLSVEF+N+
Subjt:  DTYVSYTAGT-SQSWILDSGASAHICCSHPLFDTFSKTTPVYVNLPNKSRFLVEYSGSISFSENFRLRGVLYIPEFHFNLISISMLLRDIPSLSVEFSNN

Query:  CCMLKDKSISRMIGKGSLCHGLYLFDRQ---PATAHSICAFTSAHVSFEVWHSRFGHPSFSRLQVLKDVLSLDAYFLKQSAVDAC---------SDFSVD
         C+++DKSIS+ I KG LCHGLYL D      +T  SIC      VSF+VWH+R GHPSF+RL  LK VL +D   L+    D+C         +D SV+
Subjt:  CCMLKDKSISRMIGKGSLCHGLYLFDRQ---PATAHSICAFTSAHVSFEVWHSRFGHPSFSRLQVLKDVLSLDAYFLKQSAVDAC---------SDFSVD

Query:  PFPDLVLPCAVDFQTTTSPSTIIEATSSCDTYVPLVVHSDPIVEEPPI---------TSSTSTSMPLTTDLAP-TSPAGGDLNLISNTNDRASTIVFRRS
        PFPDLVLP  +DFQ    P+  I+ T++ D  +P VV +  I    PI         ++  S+S P+ ++  P T+P+       + ++     IV RRS
Subjt:  PFPDLVLPCAVDFQTTTSPSTIIEATSSCDTYVPLVVHSDPIVEEPPI---------TSSTSTSMPLTTDLAP-TSPAGGDLNLISNTNDRASTIVFRRS

Query:  SRPTKMSSYLRDFHCNLMACTPLPIASTKHSLQQYLSYSRLSPAY
        +RP+KM SYL+DFHC+L+  +    AST+H LQQYLSYSRLS A+
Subjt:  SRPTKMSSYLRDFHCNLMACTPLPIASTKHSLQQYLSYSRLSPAY

A0A6J1DNP7 uncharacterized protein LOC1110220652.0e-5230Show/hide
Query:  QLKRDLATITQEQQSVSLYFTKLKTSWDELHQYRPICTC-KCNCGGAKSASDFLQLEYMMNFLMGLNKSYASTRAQLLLMDPPPS---------------
        QL+R+L+ +TQ+Q SV+ YFT+LKT W EL  YRP C+C +C+ GG KS     Q EY+M FLMGLN S++  RAQLLLM+P P+               
Subjt:  QLKRDLATITQEQQSVSLYFTKLKTSWDELHQYRPICTC-KCNCGGAKSASDFLQLEYMMNFLMGLNKSYASTRAQLLLMDPPPS---------------

Query:  ----HSTFAASHSPVAAKSSKAN------ISSMTKGQTRPVCSYCRIVGHTADRCYKLHGYPPGYRNQNQR--SSSASSSASTLVPSKPVDSSAMASLHS
             S  + + S V A S+ +N       +S  K + + +C++C I GHT D+CYKLH YPPGYR+   +  SS+A+SS S   PSK V ++     +S
Subjt:  ----HSTFAASHSPVAAKSSKAN------ISSMTKGQTRPVCSYCRIVGHTADRCYKLHGYPPGYRNQNQR--SSSASSSASTLVPSKPVDSSAMASLHS

Query:  RPDV-VNQCQQLLHLLQSQFSVGKSVTTEKDTYVSYTAGTSQSWILDSGASAHICCSHPLFDTFSKTTPVYVNLPNKSRFLVEYSGSISFSENFRLRGVL
           +  +QCQ+LL LLQS  +  K+ +                   DSG       SH     F KT           RF  + +  +SF E F  +GVL
Subjt:  RPDV-VNQCQQLLHLLQSQFSVGKSVTTEKDTYVSYTAGTSQSWILDSGASAHICCSHPLFDTFSKTTPVYVNLPNKSRFLVEYSGSISFSENFRLRGVL

Query:  Y-------------IPEFHFNLISISMLLRDIPSLSVEFSNNCCMLK--------------DKSISRMIGKGSLCHGLYLFDRQPATAHSICAFTSAHVS
        +             +   H +L++++  L     +   F   C +                +   +R+ G  +    L +F         +C  +++ V+
Subjt:  Y-------------IPEFHFNLISISMLLRDIPSLSVEFSNNCCMLK--------------DKSISRMIGKGSLCHGLYLFDRQPATAHSICAFTSAHVS

Query:  FEVWHSR------FGHPSFSRLQVLKDVLSLDAYFLKQSAVDACSDFS----------VDPFPDLVLPCAVDFQTTTS----------PSTIIEATSSCD
           +H R       G+P   +   L D+ +   +F+ +  +   S F           VDPFP +V+P + D   T+S            + +  TS+  
Subjt:  FEVWHSR------FGHPSFSRLQVLKDVLSLDAYFLKQSAVDACSDFS----------VDPFPDLVLPCAVDFQTTTS----------PSTIIEATSSCD

Query:  TYVPLVVHSDPIV--------EEPPITSSTSTSMPLTTDLAPTSP-----AGGDLNLISNTNDRASTIVFRRSSRPTKMSSYLRDFHCNLMACTPLPIAS
        +   ++    PI+          P + ++ S  MP   + +  +P     A  D  ++   +D + ++  RRSSR  +  SYLRD+HC L+  T    +S
Subjt:  TYVPLVVHSDPIV--------EEPPITSSTSTSMPLTTDLAPTSP-----AGGDLNLISNTNDRASTIVFRRSSRPTKMSSYLRDFHCNLMACTPLPIAS

Query:  TKHSLQQYLSYSRLSPAYYSFALNVSASYEPQFYHQAVGRTLDFEPDDEAEPHSRHAMTA
          + LQ+YL Y+ LS +Y  F L+VS  YEPQFYHQAV     F    EA     HAM A
Subjt:  TKHSLQQYLSYSRLSPAYYSFALNVSASYEPQFYHQAVGRTLDFEPDDEAEPHSRHAMTA

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE17.1e-0724.32Show/hide
Query:  NPYYLHHTDNTRLVLVNQLKRDLATITQEQQSVSLYFTKLKTSWDELHQYRPICTCKCNCGGAKSASDFLQLEYMMNFLMGLNKSYASTRAQLLLMDPPP
        NP Y H         V QL+  L   T+  +++  Y   L T +D+L               A         E +   L  L + Y     Q+   D PP
Subjt:  NPYYLHHTDNTRLVLVNQLKRDLATITQEQQSVSLYFTKLKTSWDELHQYRPICTCKCNCGGAKSASDFLQLEYMMNFLMGLNKSYASTRAQLLLMDPPP

Query:  S----HSTFAASHSPVAAKSSKANISSMTKGQTRPVCSYCRIVGHTADRCYKLHGYPPGYRNQNQRSSSASSSASTLVPSKPVDSSAMASLH---SRPDV
        +    H       S + A SS A +  +T        +      +  +R  +        RN N  S     S++   P+       +        +   
Subjt:  S----HSTFAASHSPVAAKSSKANISSMTKGQTRPVCSYCRIVGHTADRCYKLHGYPPGYRNQNQRSSSASSSASTLVPSKPVDSSAMASLH---SRPDV

Query:  VNQCQQLLHLLQSQFS-VGKSVTTEKDTYVSYTAG---TSQSWILDSGASAHICCSHPLFDTFSKTTPVY----VNLPNKSRFLVEYSGSISFSENFR--
          +C QL H L S  S    S  T      +   G   +S +W+LDSGA+ HI      F+  S   P      V + + S   + ++GS S S   R  
Subjt:  VNQCQQLLHLLQSQFS-VGKSVTTEKDTYVSYTAG---TSQSWILDSGASAHICCSHPLFDTFSKTTPVY----VNLPNKSRFLVEYSGSISFSENFR--

Query:  -LRGVLYIPEFHFNLISISMLLRDIPSLSVEFSNNCCMLKDKSISRMIGKGSLCHGLYLFDRQPATAHSICAFTSAHVSFEVWHSRFGHPSFSRLQVLKD
         L  +LY+P  H NLIS+  L  +   +SVEF      +KD +    + +G     LY +    +   S+ A  S+  +   WH+R GHP+ S   +L  
Subjt:  -LRGVLYIPEFHFNLISISMLLRDIPSLSVEFSNNCCMLKDKSISRMIGKGSLCHGLYLFDRQPATAHSICAFTSAHVSFEVWHSRFGHPSFSRLQVLKD

Query:  VLS
        V+S
Subjt:  VLS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.9e-0826.59Show/hide
Query:  EYMMNFLMGLNKSYASTRAQLLLMDPPPSHSTFAASHSPVAAKSSKANISSMTKGQTRPVCSYCRIVGH---TADRCYKLHGYPPGYRNQNQRSSS--AS
        E +   L  L   Y     Q+   D PPS       H  +  + SK  + ++   +  P+ +   +V H     +R     G    Y N N RS+S   S
Subjt:  EYMMNFLMGLNKSYASTRAQLLLMDPPPSHSTFAASHSPVAAKSSKANISSMTKGQTRPVCSYCRIVGH---TADRCYKLHGYPPGYRNQNQRSSS--AS

Query:  SSASTLVPSKPVDSSAMASLHS-RPDVVNQCQQLLHLLQSQFSVGKSVT-----TEKDTYVSYTAGTSQSWILDSGASAHICCSHPLFDTFSKTTPVY--
        SS S     +P        + S +     +C Q LH  QS  +  +S +       +      +   + +W+LDSGA+ HI      F+  S   P    
Subjt:  SSASTLVPSKPVDSSAMASLHS-RPDVVNQCQQLLHLLQSQFSVGKSVT-----TEKDTYVSYTAGTSQSWILDSGASAHICCSHPLFDTFSKTTPVY--

Query:  --VNLPNKSRFLVEYSGSISF---SENFRLRGVLYIPEFHFNLISISMLLRDIPSLSVEFSNNCCMLKDKSISRMIGKGSLCHGLYLFDRQPATAHSICA
          V + + S   + ++GS S    S +  L  VLY+P  H NLIS+  L  +   +SVEF      +KD +    + +G     LY +    + A S+ A
Subjt:  --VNLPNKSRFLVEYSGSISF---SENFRLRGVLYIPEFHFNLISISMLLRDIPSLSVEFSNNCCMLKDKSISRMIGKGSLCHGLYLFDRQPATAHSICA

Query:  FTSAHVSFEVWHSRFGHPSFSRLQVLKDVLS
           +  +   WHSR GHPS   L +L  V+S
Subjt:  FTSAHVSFEVWHSRFGHPSFSRLQVLKDVLS

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).4.9e-1141.24Show/hide
Query:  VNQLKRDLATITQEQQSVSLYFTKLKTSWDELHQYRPICTCK---CNCGGAKSASDFLQLEYMMNFLMG--LNKSYASTRAQLLLMDPPPS-HSTFA
        + QL+R LAT+ Q   SV  YF KL   W EL +Y PI  CK   CNC   K A +  + E    FLMG  LN+ + +   +++   PPPS H  FA
Subjt:  VNQLKRDLATITQEQQSVSLYFTKLKTSWDELHQYRPICTCK---CNCGGAKSASDFLQLEYMMNFLMG--LNKSYASTRAQLLLMDPPPS-HSTFA

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.3e-0530.43Show/hide
Query:  APTSPAGGDLNLISNTNDRASTIVFRRSSRPTKMSSYLRDFHCNLMACTPLPIASTKHSLQQYLSYSRLSPAYYSFALNVSASYEPQFYHQA
        A TS +  D+   +N  +         S R T+  +YL+D++C+ +A      + T H + Q+LSY ++SP Y+SF + ++ + EP  Y++A
Subjt:  APTSPAGGDLNLISNTNDRASTIVFRRSSRPTKMSSYLRDFHCNLMACTPLPIASTKHSLQQYLSYSRLSPAYYSFALNVSASYEPQFYHQA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGACTTGATTCCTCCAATTTCTCCTATTTCCCCTCCAATTTTACCCGATCTTCCTTATGATTCTCCAGATTTGTCTGCTGCTTCTACTCAATCTGGAAATGAAGG
CGCAACGAATCCGTATTATCTTCACCATACCGATAATACCAGATTAGTTCTCGTCAATCAGCTTAAACGGGACCTTGCTACCATTACTCAAGAGCAGCAGTCTGTAAGCC
TTTATTTCACCAAATTGAAGACATCCTGGGATGAGTTGCATCAGTATCGCCCCATTTGCACTTGCAAATGTAATTGTGGTGGTGCAAAATCTGCTTCTGATTTTCTTCAA
CTGGAGTACATGATGAATTTTCTCATGGGACTAAATAAGTCATATGCATCCACGCGTGCTCAGCTTTTGCTAATGGATCCACCACCATCGCACTCTACCTTTGCTGCTTC
CCACTCCCCAGTTGCTGCCAAGTCCTCCAAGGCTAATATTTCTTCAATGACAAAAGGACAGACTCGTCCTGTTTGTTCTTATTGCAGAATAGTTGGTCATACAGCAGACC
GGTGTTATAAACTTCATGGCTACCCACCTGGGTATCGCAATCAGAATCAGCGTTCCTCATCTGCTTCATCGTCTGCATCCACATTGGTTCCCTCCAAGCCTGTTGATTCT
TCAGCTATGGCATCTTTGCATTCTAGACCTGATGTGGTAAATCAGTGTCAGCAACTTCTGCACCTATTACAGTCACAATTTTCTGTCGGCAAGTCTGTTACCACCGAGAA
AGATACGTATGTTTCTTATACAGCAGGTACTTCTCAGTCTTGGATTCTTGATTCGGGTGCTTCTGCACACATTTGTTGTTCACACCCATTATTTGATACTTTTTCCAAGA
CTACTCCTGTATATGTGAATCTGCCAAATAAGTCTCGTTTTCTAGTTGAATATTCTGGTTCCATCAGTTTTTCAGAGAATTTTAGACTCAGAGGGGTGTTATATATTCCT
GAGTTTCACTTCAACCTAATATCCATCAGCATGTTATTACGGGATATACCTTCTCTGTCTGTGGAGTTCTCTAATAATTGTTGCATGCTTAAGGACAAGTCCATTTCGAG
GATGATTGGCAAGGGTAGTCTATGTCATGGCTTATATCTATTTGACAGACAACCGGCTACTGCACATTCTATTTGTGCCTTTACTTCTGCTCATGTTTCTTTTGAGGTTT
GGCATAGTAGATTTGGCCACCCTTCGTTTAGTAGATTACAAGTTTTGAAAGATGTTTTGTCATTGGATGCTTATTTTTTGAAACAATCTGCTGTTGATGCATGTAGTGAC
TTTTCTGTAGATCCTTTTCCGGATCTTGTTCTTCCATGTGCTGTTGATTTTCAGACTACAACAAGTCCTTCCACTATTATTGAAGCTACTTCTTCGTGTGATACATATGT
ACCACTTGTTGTTCATTCTGATCCTATTGTTGAGGAACCTCCCATCACTTCATCCACAAGTACTTCCATGCCTCTCACCACTGATTTGGCCCCTACATCACCTGCAGGTG
GTGATTTGAATCTTATTTCTAATACTAATGATAGGGCATCGACCATTGTTTTTCGCCGTTCTTCTCGGCCCACTAAAATGTCATCCTATTTGCGAGATTTTCATTGTAAT
CTCATGGCTTGTACTCCTCTGCCTATTGCCTCTACTAAACATTCACTACAGCAGTACCTTTCTTATTCGCGGCTTTCCCCGGCCTATTATTCGTTTGCTTTAAATGTTTC
TGCTTCATATGAGCCACAATTCTATCATCAAGCTGTTGGCCGGACTTTGGACTTTGAGCCCGATGATGAGGCTGAGCCACATAGCAGGCACGCCATGACAGCCACACCCA
AGTGTCCATATACCTCGGGGCACCACCGCGGTCCCTTACGGTTGATGAGGAACTATAATGACAAGTATAGACTGGCTCCTCAAGTGGCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCAGACTTGATTCCTCCAATTTCTCCTATTTCCCCTCCAATTTTACCCGATCTTCCTTATGATTCTCCAGATTTGTCTGCTGCTTCTACTCAATCTGGAAATGAAGG
CGCAACGAATCCGTATTATCTTCACCATACCGATAATACCAGATTAGTTCTCGTCAATCAGCTTAAACGGGACCTTGCTACCATTACTCAAGAGCAGCAGTCTGTAAGCC
TTTATTTCACCAAATTGAAGACATCCTGGGATGAGTTGCATCAGTATCGCCCCATTTGCACTTGCAAATGTAATTGTGGTGGTGCAAAATCTGCTTCTGATTTTCTTCAA
CTGGAGTACATGATGAATTTTCTCATGGGACTAAATAAGTCATATGCATCCACGCGTGCTCAGCTTTTGCTAATGGATCCACCACCATCGCACTCTACCTTTGCTGCTTC
CCACTCCCCAGTTGCTGCCAAGTCCTCCAAGGCTAATATTTCTTCAATGACAAAAGGACAGACTCGTCCTGTTTGTTCTTATTGCAGAATAGTTGGTCATACAGCAGACC
GGTGTTATAAACTTCATGGCTACCCACCTGGGTATCGCAATCAGAATCAGCGTTCCTCATCTGCTTCATCGTCTGCATCCACATTGGTTCCCTCCAAGCCTGTTGATTCT
TCAGCTATGGCATCTTTGCATTCTAGACCTGATGTGGTAAATCAGTGTCAGCAACTTCTGCACCTATTACAGTCACAATTTTCTGTCGGCAAGTCTGTTACCACCGAGAA
AGATACGTATGTTTCTTATACAGCAGGTACTTCTCAGTCTTGGATTCTTGATTCGGGTGCTTCTGCACACATTTGTTGTTCACACCCATTATTTGATACTTTTTCCAAGA
CTACTCCTGTATATGTGAATCTGCCAAATAAGTCTCGTTTTCTAGTTGAATATTCTGGTTCCATCAGTTTTTCAGAGAATTTTAGACTCAGAGGGGTGTTATATATTCCT
GAGTTTCACTTCAACCTAATATCCATCAGCATGTTATTACGGGATATACCTTCTCTGTCTGTGGAGTTCTCTAATAATTGTTGCATGCTTAAGGACAAGTCCATTTCGAG
GATGATTGGCAAGGGTAGTCTATGTCATGGCTTATATCTATTTGACAGACAACCGGCTACTGCACATTCTATTTGTGCCTTTACTTCTGCTCATGTTTCTTTTGAGGTTT
GGCATAGTAGATTTGGCCACCCTTCGTTTAGTAGATTACAAGTTTTGAAAGATGTTTTGTCATTGGATGCTTATTTTTTGAAACAATCTGCTGTTGATGCATGTAGTGAC
TTTTCTGTAGATCCTTTTCCGGATCTTGTTCTTCCATGTGCTGTTGATTTTCAGACTACAACAAGTCCTTCCACTATTATTGAAGCTACTTCTTCGTGTGATACATATGT
ACCACTTGTTGTTCATTCTGATCCTATTGTTGAGGAACCTCCCATCACTTCATCCACAAGTACTTCCATGCCTCTCACCACTGATTTGGCCCCTACATCACCTGCAGGTG
GTGATTTGAATCTTATTTCTAATACTAATGATAGGGCATCGACCATTGTTTTTCGCCGTTCTTCTCGGCCCACTAAAATGTCATCCTATTTGCGAGATTTTCATTGTAAT
CTCATGGCTTGTACTCCTCTGCCTATTGCCTCTACTAAACATTCACTACAGCAGTACCTTTCTTATTCGCGGCTTTCCCCGGCCTATTATTCGTTTGCTTTAAATGTTTC
TGCTTCATATGAGCCACAATTCTATCATCAAGCTGTTGGCCGGACTTTGGACTTTGAGCCCGATGATGAGGCTGAGCCACATAGCAGGCACGCCATGACAGCCACACCCA
AGTGTCCATATACCTCGGGGCACCACCGCGGTCCCTTACGGTTGATGAGGAACTATAATGACAAGTATAGACTGGCTCCTCAAGTGGCTTAG
Protein sequenceShow/hide protein sequence
MADLIPPISPISPPILPDLPYDSPDLSAASTQSGNEGATNPYYLHHTDNTRLVLVNQLKRDLATITQEQQSVSLYFTKLKTSWDELHQYRPICTCKCNCGGAKSASDFLQ
LEYMMNFLMGLNKSYASTRAQLLLMDPPPSHSTFAASHSPVAAKSSKANISSMTKGQTRPVCSYCRIVGHTADRCYKLHGYPPGYRNQNQRSSSASSSASTLVPSKPVDS
SAMASLHSRPDVVNQCQQLLHLLQSQFSVGKSVTTEKDTYVSYTAGTSQSWILDSGASAHICCSHPLFDTFSKTTPVYVNLPNKSRFLVEYSGSISFSENFRLRGVLYIP
EFHFNLISISMLLRDIPSLSVEFSNNCCMLKDKSISRMIGKGSLCHGLYLFDRQPATAHSICAFTSAHVSFEVWHSRFGHPSFSRLQVLKDVLSLDAYFLKQSAVDACSD
FSVDPFPDLVLPCAVDFQTTTSPSTIIEATSSCDTYVPLVVHSDPIVEEPPITSSTSTSMPLTTDLAPTSPAGGDLNLISNTNDRASTIVFRRSSRPTKMSSYLRDFHCN
LMACTPLPIASTKHSLQQYLSYSRLSPAYYSFALNVSASYEPQFYHQAVGRTLDFEPDDEAEPHSRHAMTATPKCPYTSGHHRGPLRLMRNYNDKYRLAPQVA