; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g28010 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g28010
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr9:21063835..21065192
RNA-Seq ExpressionMoc09g28010
SyntenyMoc09g28010
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN79148.1 hypothetical protein VITISV_004343 [Vitis vinifera]6.0e-5936.6Show/hide
Query:  TLPQPLNVKLNDNNFLLWKNQLLNAVIANGLRGYLDGTIMPPPQFLDHHQLQPNPAYYAWERYNRLLMCWIYSSLSEEKMGEVVSLETTHDIWSSLTRVY
        +L   L +KL+ +N++LWK Q+ N V ANG   Y++GT   PP+ L    L  NP +  W R++R+++ W+YS+L+ + MG++V  +T+H+ W +L +++
Subjt:  TLPQPLNVKLNDNNFLLWKNQLLNAVIANGLRGYLDGTIMPPPQFLDHHQLQPNPAYYAWERYNRLLMCWIYSSLSEEKMGEVVSLETTHDIWSSLTRVY

Query:  DSKTTARIMGLKTELQNLRKDGSSVSQYLAKIKEIADKFAAVGEPLSYRDHLAHVLDGLGSEYNAFVTSIHNRADSPSLEDVRSLLLAYEARLDKQNTVD
         + + ARIM L+ E Q  +K G ++  Y+ K+K I+D  AAVGEP+  RDH+  +L GLG +YN+ V S+  R D  SL  V S+LL +E RL  Q++  
Subjt:  DSKTTARIMGLKTELQNLRKDGSSVSQYLAKIKEIADKFAAVGEPLSYRDHLAHVLDGLGSEYNAFVTSIHNRADSPSLEDVRSLLLAYEARLDKQNTVD

Query:  QLNIAQANLVNLSLQHNSKRPPPKFSFPNHYKHSFPNSPISAAQSQSIL-------GKPQSVHKWP-----PKPSSSKIQCQICGKLGHSAAVCYHRTNI
          +    +  +  +     R P +   P HY H  P+ P   A S S          +P++ H  P     P   S++ QCQ+CGK GH+A  CYHR +I
Subjt:  QLNIAQANLVNLSLQHNSKRPPPKFSFPNHYKHSFPNSPISAAQSQSIL-------GKPQSVHKWP-----PKPSSSKIQCQICGKLGHSAAVCYHRTNI

Query:  AY--HNASP--QALYHHVQ--PSPTHPSSGHEFQHPDESWFMDSGATHHMTPDSSILCNPTPYSGGEQVTVGNGSSV
         Y  +N  P  QA + H     +P H           +SWF D+GATHH++  +  L    PYSG +QVT+G+G+S+
Subjt:  AY--HNASP--QALYHHVQ--PSPTHPSSGHEFQHPDESWFMDSGATHHMTPDSSILCNPTPYSGGEQVTVGNGSSV

GFZ12741.1 UBX domain-containing protein [Actinidia rufa]8.7e-6638.75Show/hide
Query:  PPPTPNFLAQ----PPNPFSANP-----FPTLPQPLNVKLNDNNFLLWKNQLLNAVIANGLRGYLDGTIMPPPQFLDHHQLQPNPAYYAWERYNRLLMCW
        PPPT N L       PNP   N       P++ QPL VKL+D+N+++WK QLLN VIANGL  +LDG+ + PP+FLD  Q Q NP +++W+RYNRL+M W
Subjt:  PPPTPNFLAQ----PPNPFSANP-----FPTLPQPLNVKLNDNNFLLWKNQLLNAVIANGLRGYLDGTIMPPPQFLDHHQLQPNPAYYAWERYNRLLMCW

Query:  IYSSLSEEKMGEVVSLETTHDIWSSLTRVYDSKTTARIMGLKTELQNLRKDGSSVSQYLAKIKEIADKFAAVGEPLSYRDHLAHVLDGLGSEYNAFVTSI
        IY+S++E  +G++V   +   IW +L R+Y + + A +  L+T LQ ++K+G +   Y+ K + + +  A++GEP++Y DHL + L GLG +YN FVTSI
Subjt:  IYSSLSEEKMGEVVSLETTHDIWSSLTRVYDSKTTARIMGLKTELQNLRKDGSSVSQYLAKIKEIADKFAAVGEPLSYRDHLAHVLDGLGSEYNAFVTSI

Query:  HNRADSPSLEDVRSLLLAYEARLDKQNTVDQLNIAQANLVNLSLQHNSKRPPPKFSFPNHYKHSFPNSPISAAQSQSILGKPQSVHKWP---PKPSSSK-
         ++A  PS+E+  S                                 S    PKF  P+   +SFPNS        +    P+  ++ P   P PSS K 
Subjt:  HNRADSPSLEDVRSLLLAYEARLDKQNTVDQLNIAQANLVNLSLQHNSKRPPPKFSFPNHYKHSFPNSPISAAQSQSILGKPQSVHKWP---PKPSSSK-

Query:  -IQCQICGKLGHSAAVCYHRTNIAYH--NASPQALYHHVQPSPTHPSSGHE-----FQHPDESWFMDSGATHHMTPDSSILCNPTPYSGGEQVTVGNGSS
          +CQIC K GH+A  CYH TN+ Y      P+    ++ P+P+  +S ++        PD SW+MDSGA+HH TPD ++L + +PY+G +QVTVGNG +
Subjt:  -IQCQICGKLGHSAAVCYHRTNIAYH--NASPQALYHHVQPSPTHPSSGHE-----FQHPDESWFMDSGATHHMTPDSSILCNPTPYSGGEQVTVGNGSS

RVW53406.1 Retrovirus-related Pol polyprotein from transposon RE2 [Vitis vinifera]1.2e-5936.79Show/hide
Query:  PNPFSANPFP-TLPQPLNVKLNDNNFLLWKNQLLNAVIANGLRGYLDGTIMPPPQFLDHHQLQPNPAYYAWERYNRLLMCWIYSSLSEEKMGEVVSLETT
        P+  +++P   +L   L +KL+ NN++LWK Q+ N V ANG   Y+D T   PPQ L  H  + NP +  W R++R+++ W+YS+L+ + MG++V  +T+
Subjt:  PNPFSANPFP-TLPQPLNVKLNDNNFLLWKNQLLNAVIANGLRGYLDGTIMPPPQFLDHHQLQPNPAYYAWERYNRLLMCWIYSSLSEEKMGEVVSLETT

Query:  HDIWSSLTRVYDSKTTARIMGLKTELQNLRKDGSSVSQYLAKIKEIADKFAAVGEPLSYRDHLAHVLDGLGSEYNAFVTSIHNRADSPSLEDVRSLLLAY
        HD W +L +++ + + ARI+ L+ E Q  +K    + +Y+ KIK I+D  AA+GEP+   DH+  +L GLGSEYN+ V S+  R D  SL  V S+LL +
Subjt:  HDIWSSLTRVYDSKTTARIMGLKTELQNLRKDGSSVSQYLAKIKEIADKFAAVGEPLSYRDHLAHVLDGLGSEYNAFVTSIHNRADSPSLEDVRSLLLAY

Query:  EARLDKQNT--VDQLNIAQANLVNLSLQHNSKRPP----PKFSFPNHY---------KHSFP-NSPISAAQSQSILGKPQSVHKWPPKPSSSKIQCQICG
        E RL+ Q+T   D   +A   +V  S QH     P    P+  F +HY          H+ P N P +   ++S    P      P +P     QCQ+CG
Subjt:  EARLDKQNT--VDQLNIAQANLVNLSLQHNSKRPP----PKFSFPNHY---------KHSFP-NSPISAAQSQSILGKPQSVHKWPPKPSSSKIQCQICG

Query:  KLGHSAAVCYHRTNIAYHNASPQALYHHVQPSPTHPSSGHEFQHPDESWFMDSGATHHMTPDSSILCNPTPYSGGEQVTVGNGSSV
        K GH+   CYHR +I Y   S  +      P     ++    Q   +SWF D+GATHH++  +  L N  PYSG +QVT+G+G S+
Subjt:  KLGHSAAVCYHRTNIAYHNASPQALYHHVQPSPTHPSSGHEFQHPDESWFMDSGATHHMTPDSSILCNPTPYSGGEQVTVGNGSSV

RVW69807.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]1.5e-6238.42Show/hide
Query:  PPTPNFLAQPPNPFSANPFP-----TLPQP-----LNVKLNDNNFLLWKNQLLNAVIANGLRGYLDGTIMPPPQFLDHHQLQPNPAYYAWERYNRLLMCW
        PPTP   +      + NP P     TLP P     L++KL++ N LL K+QLLN +IANGL  ++D     PP++LD    Q NP +  W+R N+L+M W
Subjt:  PPTPNFLAQPPNPFSANPFP-----TLPQP-----LNVKLNDNNFLLWKNQLLNAVIANGLRGYLDGTIMPPPQFLDHHQLQPNPAYYAWERYNRLLMCW

Query:  IYSSLSEEKMGEVVSLETTHDIWSSLTRVYDSKTTARIMGLKTELQNLRKDGSSVSQYLAKIKEIADKFAAVGEPLSYRDHLAHVLDGLGSEYNAFVTSI
        IYSSL+   +G++V   T  DIW+SL   Y+S + A +M L ++LQ ++K    +S+YL+++K + D+FA +GEPLSYRD L  +L+GL  EY+ FVTSI
Subjt:  IYSSLSEEKMGEVVSLETTHDIWSSLTRVYDSKTTARIMGLKTELQNLRKDGSSVSQYLAKIKEIADKFAAVGEPLSYRDHLAHVLDGLGSEYNAFVTSI

Query:  HNRADSPSLEDVRSLLLAYEARLDKQNTVDQLNIAQANLVNLSLQHNSKRPPPKFSFPNHYKHSFPNSPISAAQSQSILGKPQSVHKWPPKPSSSKIQCQ
        HNR+D PSL++V SLL  YE RL +++    LN  QAN           R P        Y +S P                               QCQ
Subjt:  HNRADSPSLEDVRSLLLAYEARLDKQNTVDQLNIAQANLVNLSLQHNSKRPPPKFSFPNHYKHSFPNSPISAAQSQSILGKPQSVHKWPPKPSSSKIQCQ

Query:  ICGKLGHSAAVCYHRTNIAYH-----------------NASPQALYHHVQPSPTHPSSGHEFQHPDESWFMDSGATHHMTPDSSILCNPTPYSGGEQVTV
        ICGK GH A   YHRTN+ YH                  +SP +       +PT  SS    Q  D SW+MDSGATHH TP+   + +   YS G+   V
Subjt:  ICGKLGHSAAVCYHRTNIAYH-----------------NASPQALYHHVQPSPTHPSSGHEFQHPDESWFMDSGATHHMTPDSSILCNPTPYSGGEQVTV

Query:  GNGSSV
        GN   +
Subjt:  GNGSSV

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia]8.9e-228100Show/hide
Query:  MQFPPPTPNFLAQPPNPFSANPFPTLPQPLNVKLNDNNFLLWKNQLLNAVIANGLRGYLDGTIMPPPQFLDHHQLQPNPAYYAWERYNRLLMCWIYSSLS
        MQFPPPTPNFLAQPPNPFSANPFPTLPQPLNVKLNDNNFLLWKNQLLNAVIANGLRGYLDGTIMPPPQFLDHHQLQPNPAYYAWERYNRLLMCWIYSSLS
Subjt:  MQFPPPTPNFLAQPPNPFSANPFPTLPQPLNVKLNDNNFLLWKNQLLNAVIANGLRGYLDGTIMPPPQFLDHHQLQPNPAYYAWERYNRLLMCWIYSSLS

Query:  EEKMGEVVSLETTHDIWSSLTRVYDSKTTARIMGLKTELQNLRKDGSSVSQYLAKIKEIADKFAAVGEPLSYRDHLAHVLDGLGSEYNAFVTSIHNRADS
        EEKMGEVVSLETTHDIWSSLTRVYDSKTTARIMGLKTELQNLRKDGSSVSQYLAKIKEIADKFAAVGEPLSYRDHLAHVLDGLGSEYNAFVTSIHNRADS
Subjt:  EEKMGEVVSLETTHDIWSSLTRVYDSKTTARIMGLKTELQNLRKDGSSVSQYLAKIKEIADKFAAVGEPLSYRDHLAHVLDGLGSEYNAFVTSIHNRADS

Query:  PSLEDVRSLLLAYEARLDKQNTVDQLNIAQANLVNLSLQHNSKRPPPKFSFPNHYKHSFPNSPISAAQSQSILGKPQSVHKWPPKPSSSKIQCQICGKLG
        PSLEDVRSLLLAYEARLDKQNTVDQLNIAQANLVNLSLQHNSKRPPPKFSFPNHYKHSFPNSPISAAQSQSILGKPQSVHKWPPKPSSSKIQCQICGKLG
Subjt:  PSLEDVRSLLLAYEARLDKQNTVDQLNIAQANLVNLSLQHNSKRPPPKFSFPNHYKHSFPNSPISAAQSQSILGKPQSVHKWPPKPSSSKIQCQICGKLG

Query:  HSAAVCYHRTNIAYHNASPQALYHHVQPSPTHPSSGHEFQHPDESWFMDSGATHHMTPDSSILCNPTPYSGGEQVTVGNGSSVQDGSSSGIA
        HSAAVCYHRTNIAYHNASPQALYHHVQPSPTHPSSGHEFQHPDESWFMDSGATHHMTPDSSILCNPTPYSGGEQVTVGNGSSVQDGSSSGIA
Subjt:  HSAAVCYHRTNIAYHNASPQALYHHVQPSPTHPSSGHEFQHPDESWFMDSGATHHMTPDSSILCNPTPYSGGEQVTVGNGSSVQDGSSSGIA

TrEMBL top hitse value%identityAlignment
A0A438F0E0 Retrovirus-related Pol polyprotein from transposon RE25.9e-6036.79Show/hide
Query:  PNPFSANPFP-TLPQPLNVKLNDNNFLLWKNQLLNAVIANGLRGYLDGTIMPPPQFLDHHQLQPNPAYYAWERYNRLLMCWIYSSLSEEKMGEVVSLETT
        P+  +++P   +L   L +KL+ NN++LWK Q+ N V ANG   Y+D T   PPQ L  H  + NP +  W R++R+++ W+YS+L+ + MG++V  +T+
Subjt:  PNPFSANPFP-TLPQPLNVKLNDNNFLLWKNQLLNAVIANGLRGYLDGTIMPPPQFLDHHQLQPNPAYYAWERYNRLLMCWIYSSLSEEKMGEVVSLETT

Query:  HDIWSSLTRVYDSKTTARIMGLKTELQNLRKDGSSVSQYLAKIKEIADKFAAVGEPLSYRDHLAHVLDGLGSEYNAFVTSIHNRADSPSLEDVRSLLLAY
        HD W +L +++ + + ARI+ L+ E Q  +K    + +Y+ KIK I+D  AA+GEP+   DH+  +L GLGSEYN+ V S+  R D  SL  V S+LL +
Subjt:  HDIWSSLTRVYDSKTTARIMGLKTELQNLRKDGSSVSQYLAKIKEIADKFAAVGEPLSYRDHLAHVLDGLGSEYNAFVTSIHNRADSPSLEDVRSLLLAY

Query:  EARLDKQNT--VDQLNIAQANLVNLSLQHNSKRPP----PKFSFPNHY---------KHSFP-NSPISAAQSQSILGKPQSVHKWPPKPSSSKIQCQICG
        E RL+ Q+T   D   +A   +V  S QH     P    P+  F +HY          H+ P N P +   ++S    P      P +P     QCQ+CG
Subjt:  EARLDKQNT--VDQLNIAQANLVNLSLQHNSKRPP----PKFSFPNHY---------KHSFP-NSPISAAQSQSILGKPQSVHKWPPKPSSSKIQCQICG

Query:  KLGHSAAVCYHRTNIAYHNASPQALYHHVQPSPTHPSSGHEFQHPDESWFMDSGATHHMTPDSSILCNPTPYSGGEQVTVGNGSSV
        K GH+   CYHR +I Y   S  +      P     ++    Q   +SWF D+GATHH++  +  L N  PYSG +QVT+G+G S+
Subjt:  KLGHSAAVCYHRTNIAYHNASPQALYHHVQPSPTHPSSGHEFQHPDESWFMDSGATHHMTPDSSILCNPTPYSGGEQVTVGNGSSV

A0A438GC62 Retrovirus-related Pol polyprotein from transposon RE17.4e-6338.42Show/hide
Query:  PPTPNFLAQPPNPFSANPFP-----TLPQP-----LNVKLNDNNFLLWKNQLLNAVIANGLRGYLDGTIMPPPQFLDHHQLQPNPAYYAWERYNRLLMCW
        PPTP   +      + NP P     TLP P     L++KL++ N LL K+QLLN +IANGL  ++D     PP++LD    Q NP +  W+R N+L+M W
Subjt:  PPTPNFLAQPPNPFSANPFP-----TLPQP-----LNVKLNDNNFLLWKNQLLNAVIANGLRGYLDGTIMPPPQFLDHHQLQPNPAYYAWERYNRLLMCW

Query:  IYSSLSEEKMGEVVSLETTHDIWSSLTRVYDSKTTARIMGLKTELQNLRKDGSSVSQYLAKIKEIADKFAAVGEPLSYRDHLAHVLDGLGSEYNAFVTSI
        IYSSL+   +G++V   T  DIW+SL   Y+S + A +M L ++LQ ++K    +S+YL+++K + D+FA +GEPLSYRD L  +L+GL  EY+ FVTSI
Subjt:  IYSSLSEEKMGEVVSLETTHDIWSSLTRVYDSKTTARIMGLKTELQNLRKDGSSVSQYLAKIKEIADKFAAVGEPLSYRDHLAHVLDGLGSEYNAFVTSI

Query:  HNRADSPSLEDVRSLLLAYEARLDKQNTVDQLNIAQANLVNLSLQHNSKRPPPKFSFPNHYKHSFPNSPISAAQSQSILGKPQSVHKWPPKPSSSKIQCQ
        HNR+D PSL++V SLL  YE RL +++    LN  QAN           R P        Y +S P                               QCQ
Subjt:  HNRADSPSLEDVRSLLLAYEARLDKQNTVDQLNIAQANLVNLSLQHNSKRPPPKFSFPNHYKHSFPNSPISAAQSQSILGKPQSVHKWPPKPSSSKIQCQ

Query:  ICGKLGHSAAVCYHRTNIAYH-----------------NASPQALYHHVQPSPTHPSSGHEFQHPDESWFMDSGATHHMTPDSSILCNPTPYSGGEQVTV
        ICGK GH A   YHRTN+ YH                  +SP +       +PT  SS    Q  D SW+MDSGATHH TP+   + +   YS G+   V
Subjt:  ICGKLGHSAAVCYHRTNIAYH-----------------NASPQALYHHVQPSPTHPSSGHEFQHPDESWFMDSGATHHMTPDSSILCNPTPYSGGEQVTV

Query:  GNGSSV
        GN   +
Subjt:  GNGSSV

A0A6J1DQX7 uncharacterized protein LOC1110223154.3e-228100Show/hide
Query:  MQFPPPTPNFLAQPPNPFSANPFPTLPQPLNVKLNDNNFLLWKNQLLNAVIANGLRGYLDGTIMPPPQFLDHHQLQPNPAYYAWERYNRLLMCWIYSSLS
        MQFPPPTPNFLAQPPNPFSANPFPTLPQPLNVKLNDNNFLLWKNQLLNAVIANGLRGYLDGTIMPPPQFLDHHQLQPNPAYYAWERYNRLLMCWIYSSLS
Subjt:  MQFPPPTPNFLAQPPNPFSANPFPTLPQPLNVKLNDNNFLLWKNQLLNAVIANGLRGYLDGTIMPPPQFLDHHQLQPNPAYYAWERYNRLLMCWIYSSLS

Query:  EEKMGEVVSLETTHDIWSSLTRVYDSKTTARIMGLKTELQNLRKDGSSVSQYLAKIKEIADKFAAVGEPLSYRDHLAHVLDGLGSEYNAFVTSIHNRADS
        EEKMGEVVSLETTHDIWSSLTRVYDSKTTARIMGLKTELQNLRKDGSSVSQYLAKIKEIADKFAAVGEPLSYRDHLAHVLDGLGSEYNAFVTSIHNRADS
Subjt:  EEKMGEVVSLETTHDIWSSLTRVYDSKTTARIMGLKTELQNLRKDGSSVSQYLAKIKEIADKFAAVGEPLSYRDHLAHVLDGLGSEYNAFVTSIHNRADS

Query:  PSLEDVRSLLLAYEARLDKQNTVDQLNIAQANLVNLSLQHNSKRPPPKFSFPNHYKHSFPNSPISAAQSQSILGKPQSVHKWPPKPSSSKIQCQICGKLG
        PSLEDVRSLLLAYEARLDKQNTVDQLNIAQANLVNLSLQHNSKRPPPKFSFPNHYKHSFPNSPISAAQSQSILGKPQSVHKWPPKPSSSKIQCQICGKLG
Subjt:  PSLEDVRSLLLAYEARLDKQNTVDQLNIAQANLVNLSLQHNSKRPPPKFSFPNHYKHSFPNSPISAAQSQSILGKPQSVHKWPPKPSSSKIQCQICGKLG

Query:  HSAAVCYHRTNIAYHNASPQALYHHVQPSPTHPSSGHEFQHPDESWFMDSGATHHMTPDSSILCNPTPYSGGEQVTVGNGSSVQDGSSSGIA
        HSAAVCYHRTNIAYHNASPQALYHHVQPSPTHPSSGHEFQHPDESWFMDSGATHHMTPDSSILCNPTPYSGGEQVTVGNGSSVQDGSSSGIA
Subjt:  HSAAVCYHRTNIAYHNASPQALYHHVQPSPTHPSSGHEFQHPDESWFMDSGATHHMTPDSSILCNPTPYSGGEQVTVGNGSSVQDGSSSGIA

A0A7J0GPN0 UBX domain-containing protein4.2e-6638.75Show/hide
Query:  PPPTPNFLAQ----PPNPFSANP-----FPTLPQPLNVKLNDNNFLLWKNQLLNAVIANGLRGYLDGTIMPPPQFLDHHQLQPNPAYYAWERYNRLLMCW
        PPPT N L       PNP   N       P++ QPL VKL+D+N+++WK QLLN VIANGL  +LDG+ + PP+FLD  Q Q NP +++W+RYNRL+M W
Subjt:  PPPTPNFLAQ----PPNPFSANP-----FPTLPQPLNVKLNDNNFLLWKNQLLNAVIANGLRGYLDGTIMPPPQFLDHHQLQPNPAYYAWERYNRLLMCW

Query:  IYSSLSEEKMGEVVSLETTHDIWSSLTRVYDSKTTARIMGLKTELQNLRKDGSSVSQYLAKIKEIADKFAAVGEPLSYRDHLAHVLDGLGSEYNAFVTSI
        IY+S++E  +G++V   +   IW +L R+Y + + A +  L+T LQ ++K+G +   Y+ K + + +  A++GEP++Y DHL + L GLG +YN FVTSI
Subjt:  IYSSLSEEKMGEVVSLETTHDIWSSLTRVYDSKTTARIMGLKTELQNLRKDGSSVSQYLAKIKEIADKFAAVGEPLSYRDHLAHVLDGLGSEYNAFVTSI

Query:  HNRADSPSLEDVRSLLLAYEARLDKQNTVDQLNIAQANLVNLSLQHNSKRPPPKFSFPNHYKHSFPNSPISAAQSQSILGKPQSVHKWP---PKPSSSK-
         ++A  PS+E+  S                                 S    PKF  P+   +SFPNS        +    P+  ++ P   P PSS K 
Subjt:  HNRADSPSLEDVRSLLLAYEARLDKQNTVDQLNIAQANLVNLSLQHNSKRPPPKFSFPNHYKHSFPNSPISAAQSQSILGKPQSVHKWP---PKPSSSK-

Query:  -IQCQICGKLGHSAAVCYHRTNIAYH--NASPQALYHHVQPSPTHPSSGHE-----FQHPDESWFMDSGATHHMTPDSSILCNPTPYSGGEQVTVGNGSS
          +CQIC K GH+A  CYH TN+ Y      P+    ++ P+P+  +S ++        PD SW+MDSGA+HH TPD ++L + +PY+G +QVTVGNG +
Subjt:  -IQCQICGKLGHSAAVCYHRTNIAYH--NASPQALYHHVQPSPTHPSSGHE-----FQHPDESWFMDSGATHHMTPDSSILCNPTPYSGGEQVTVGNGSS

A0A803NL56 Uncharacterized protein1.0e-5937.4Show/hide
Query:  PNFLAQPPNPFSANPFPTLP-------QPLNVKLNDNNFLLWKNQLLNAVIANGLRGYLDGTIMPPPQFLDHHQLQPNPAYYAWERYNRLLMCWIYSSLS
        PN  A PP   S++  P+LP       Q ++VKL+D N+L+W+ Q+ N +IANGL GY+DGT+    QF +    Q +PA+  W RYN+LLM W+Y+SLS
Subjt:  PNFLAQPPNPFSANPFPTLP-------QPLNVKLNDNNFLLWKNQLLNAVIANGLRGYLDGTIMPPPQFLDHHQLQPNPAYYAWERYNRLLMCWIYSSLS

Query:  EEKMGEVVSLETTHDIWSSLTRVYDSKTTARIMGLKTELQNLRKDGSSVSQYLAKIKEIADKFAAVGEPLSYRDHLAHVLDGLGSEYNAFVTSIHNRADS
        +  +G++V   T  +IW SL R Y + + AR    +  LQNL+KD  + S YL K+K + +  A+VG+P+S ++HL ++L+GLG EYNAFVT I  R   
Subjt:  EEKMGEVVSLETTHDIWSSLTRVYDSKTTARIMGLKTELQNLRKDGSSVSQYLAKIKEIADKFAAVGEPLSYRDHLAHVLDGLGSEYNAFVTSIHNRADS

Query:  PSLEDVRSLLLAYEARLDKQNTVDQLNIAQANLVNLSL-----QHNSKRPPPKFSFPNHYKHSFPNSPISAAQSQS-ILGKPQSVHKWPPKPSSSKIQCQ
        P++E+V +LLL+YEARL++QN     +  QAN  NLS      + +S++P  +  FP+H + + P +  S  Q ++     P+S+  + P P        
Subjt:  PSLEDVRSLLLAYEARLDKQNTVDQLNIAQANLVNLSL-----QHNSKRPPPKFSFPNHYKHSFPNSPISAAQSQS-ILGKPQSVHKWPPKPSSSKIQCQ

Query:  ICGKLGHSAAVCYHRTNIAYHNASPQALYHHVQPS--------PTHPSSGHEFQHPDESWFMDSGATHHMTPDSSILCNPTPYSGGEQVTVGN
                        N  +    PQ     +  S        PT PS+G      D +W+MDSGA+HH T D ++L + TPY G + +T+GN
Subjt:  ICGKLGHSAAVCYHRTNIAYHNASPQALYHHVQPS--------PTHPSSGHEFQHPDESWFMDSGATHHMTPDSSILCNPTPYSGGEQVTVGN

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.7e-2726.89Show/hide
Query:  KLNDNNFLLWKNQLLNAVIANGLRGYLDG-TIMPPPQFLDHHQLQPNPAYYAWERYNRLLMCWIYSSLSEEKMGEVVSLETTHDIWSSLTRVYDSKTTAR
        KL   N+L+W  Q+        L G+LDG T MPP         + NP Y  W+R ++L+   +  ++S      V    T   IW +L ++Y + +   
Subjt:  KLNDNNFLLWKNQLLNAVIANGLRGYLDG-TIMPPPQFLDHHQLQPNPAYYAWERYNRLLMCWIYSSLSEEKMGEVVSLETTHDIWSSLTRVYDSKTTAR

Query:  IMGLKTELQNLRKDGSSVSQYLAKIKEIADKFAAVGEPLSYRDHLAHVLDGLGSEYNAFVTSIHNRADSPSLEDVRSLLLAYEARLDKQNTVDQLNIAQA
        +  L+T+L+   K   ++  Y+  +    D+ A +G+P+ + + +  VL+ L  EY   +  I  +   P+L ++   LL +E+++   ++   + I  A
Subjt:  IMGLKTELQNLRKDGSSVSQYLAKIKEIADKFAAVGEPLSYRDHLAHVLDGLGSEYNAFVTSIHNRADSPSLEDVRSLLLAYEARLDKQNTVDQLNIAQA

Query:  NLV---NLSLQHNSKRPPPKFSFPNHYKHSFPNSPISAAQSQSILGKPQSVHKWPPKPSSSKIQCQICGKLGHSAAVCYHRTNIAYHNASPQALYHHVQP
        N V   N +  +N+       +  N Y +   N+     Q  S    P +      KP   K  CQICG  GHSA  C   + + +  +S  +       
Subjt:  NLV---NLSLQHNSKRPPPKFSFPNHYKHSFPNSPISAAQSQSILGKPQSVHKWPPKPSSSKIQCQICGKLGHSAAVCYHRTNIAYHNASPQALYHHVQP

Query:  SPTHPSSGHEFQHP--DESWFMDSGATHHMTPDSSILCNPTPYSGGEQVTVGNGSSV
        +P  P +      P    +W +DSGATHH+T D + L    PY+GG+ V V +GS++
Subjt:  SPTHPSSGHEFQHP--DESWFMDSGATHHMTPDSSILCNPTPYSGGEQVTVGNGSSV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE26.1e-2225.75Show/hide
Query:  KLNDNNFLLWKNQLLNAVIANGLRGYLDG-TIMPPPQFLDHHQLQPNPAYYAWERYNRLLMCWIYSSLSEEKMGEVVSLETTHDIWSSLTRVYDSKTTAR
        KL   N+L+W  Q+        L G+LDG T MPP         + NP Y  W R ++L+   I  ++S      V    T   IW +L ++Y + +   
Subjt:  KLNDNNFLLWKNQLLNAVIANGLRGYLDG-TIMPPPQFLDHHQLQPNPAYYAWERYNRLLMCWIYSSLSEEKMGEVVSLETTHDIWSSLTRVYDSKTTAR

Query:  IMGLKTELQNLRKDGSSVSQYLAKIKEIADKFAAVGEPLSYRDHLAHVLDGLGSEYNAFVTSIHNRADSPSLEDVRSLLLAYEARLDKQNTVDQLNIAQA
        +    T+L+ + +                D+ A +G+P+ + + +  VL+ L  +Y   +  I  +   PSL ++   L+  E++L   N+ + + I  A
Subjt:  IMGLKTELQNLRKDGSSVSQYLAKIKEIADKFAAVGEPLSYRDHLAHVLDGLGSEYNAFVTSIHNRADSPSLEDVRSLLLAYEARLDKQNTVDQLNIAQA

Query:  NLV-----NLSLQHNSKRPPPKFSFPNHYKHSF-PNSPISAAQSQSILGKPQSVHKWPPKPSSSKIQCQICGKLGHSAAVC-----YHRTNIAYHNASPQ
        N+V     N +   N++     ++  N+  +S+ P+S  S + ++             PKP   +  CQIC   GHSA  C     +  T     + SP 
Subjt:  NLV-----NLSLQHNSKRPPPKFSFPNHYKHSF-PNSPISAAQSQSILGKPQSVHKWPPKPSSSKIQCQICGKLGHSAAVC-----YHRTNIAYHNASPQ

Query:  ALYHHVQPSPTHPSSGHEFQHP--DESWFMDSGATHHMTPDSSILCNPTPYSGGEQVTVGNGSSV
                +P  P +      P    +W +DSGATHH+T D + L    PY+GG+ V + +GS++
Subjt:  ALYHHVQPSPTHPSSGHEFQHP--DESWFMDSGATHHMTPDSSILCNPTPYSGGEQVTVGNGSSV

Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)3.3e-1523.96Show/hide
Query:  PLNVKLNDNNFLLWKNQLLNAVIANGLRGYLDGTIMPPPQFLDHHQLQPNPAYYAWERYNRLLMCWIYSSLSEEK-MGEVVSLETTHDIWSSLTRVYDSK
        P+ + + ++N+  W+   L   ++  + G++DGT++P            N     W++ + ++   +Y +L+ ++  G  V+  T+ DIW  +   + + 
Subjt:  PLNVKLNDNNFLLWKNQLLNAVIANGLRGYLDGTIMPPPQFLDHHQLQPNPAYYAWERYNRLLMCWIYSSLSEEK-MGEVVSLETTHDIWSSLTRVYDSK

Query:  TTARIMGLKTELQNLRKDGSSVSQYLAKIKEIADKFAAVGEPLSYRDHLAHVLDGLGSEYNAFVTSIHNRADSPSLEDVRSLLLAYEARLDK
          AR + L +EL+        V+ Y  K+K++AD    V  P++ R+ + +VL+GL  +++  +  I +R   PS +D  ++L   E RL +
Subjt:  TTARIMGLKTELQNLRKDGSSVSQYLAKIKEIADKFAAVGEPLSYRDHLAHVLDGLGSEYNAFVTSIHNRADSPSLEDVRSLLLAYEARLDK

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)2.7e-1722.74Show/hide
Query:  LNVKLNDNNFLLWKNQLLNAVIANGLRGYLDGTIMPPPQFLDHHQLQPNPAYYAWERYNRLLMCWIYSSLSEEKMGEVVSLE-TTHDIWSSLTRVYDSKT
        + + LN  N+ +W+       ++ G+ G++DG+  P P                W+  + L+  WIY ++++  +  ++ +  T  D+W SL  ++    
Subjt:  LNVKLNDNNFLLWKNQLLNAVIANGLRGYLDGTIMPPPQFLDHHQLQPNPAYYAWERYNRLLMCWIYSSLSEEKMGEVVSLE-TTHDIWSSLTRVYDSKT

Query:  TARIMGLKTELQNLRKDGSSVSQYLAKIKEIADKFAAVGEPLSYRDHLAHVLDGLGSEYNAFVTSIHNRADSPSLEDVRSLLLAYEARLDKQNTVDQLNI
         AR +  + EL+    D  SV +Y  K+K ++D    V  P+S R  + H+L+GL  +Y+  +  I +++  PS  + RS+LL  E+RL  ++     + 
Subjt:  TARIMGLKTELQNLRKDGSSVSQYLAKIKEIADKFAAVGEPLSYRDHLAHVLDGLGSEYNAFVTSIHNRADSPSLEDVRSLLLAYEARLDKQNTVDQLNI

Query:  AQANLVNLSLQHNSKRPPPKFSFPNHYKHSFPNSPISAAQSQSILGKPQSVHKWPPKPSSSKIQCQICGKLGHSAAVCYHRTNIAYHNASPQALYHHVQP
           +L N+        P  +  +P  Y ++  N           +G+ +S  K                + G S+   Y+  N    N  P  +Y   Q 
Subjt:  AQANLVNLSLQHNSKRPPPKFSFPNHYKHSFPNSPISAAQSQSILGKPQSVHKWPPKPSSSKIQCQICGKLGHSAAVCYHRTNIAYHNASPQALYHHVQP

Query:  SPTHPSSGHEFQHPDESWFMDS-----GATHHMTPDSSILCNPTPYSGGEQVTVGNGSSVQDGSS
           +P  G +F H  +++F          T H    +SIL    PY  G+       +S+ D S+
Subjt:  SPTHPSSGHEFQHPDESWFMDS-----GATHHMTPDSSILCNPTPYSGGEQVTVGNGSSVQDGSS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGTTTCCGCCTCCGACACCAAATTTCTTAGCCCAGCCTCCAAATCCGTTTTCGGCAAATCCCTTTCCCACTCTGCCACAACCTCTAAATGTTAAACTTAATGACAA
CAATTTTCTTCTCTGGAAAAACCAGCTACTCAATGCTGTTATTGCCAATGGCCTTCGGGGATATCTTGATGGAACGATAATGCCTCCGCCACAATTTTTGGATCACCATC
AGCTTCAACCAAATCCTGCCTATTATGCTTGGGAAAGGTACAATCGCCTTTTAATGTGTTGGATTTATTCTTCTCTATCTGAAGAGAAAATGGGTGAAGTTGTATCTCTT
GAAACAACGCATGACATTTGGTCTTCTCTAACTAGAGTTTATGATTCTAAGACCACAGCTAGAATTATGGGCTTAAAAACAGAGTTACAAAATCTGAGGAAAGATGGATC
ATCTGTTAGTCAATATCTAGCTAAGATTAAAGAGATTGCTGATAAATTTGCTGCTGTTGGTGAACCTCTTTCCTATCGTGATCATTTAGCTCATGTCCTAGATGGTCTAG
GGAGTGAATACAATGCCTTTGTTACATCTATTCATAATAGGGCTGATTCCCCTTCTTTAGAAGATGTTCGCAGTCTTCTTTTAGCCTATGAAGCACGTTTGGACAAACAG
AACACGGTCGACCAGCTTAATATTGCTCAGGCCAACCTTGTCAATCTTTCCCTTCAACACAACAGTAAGCGCCCTCCTCCAAAGTTCTCATTCCCCAACCATTACAAACA
CTCTTTTCCTAATTCCCCTATTTCCGCTGCTCAATCTCAAAGCATACTCGGTAAGCCACAAAGTGTTCACAAATGGCCTCCCAAGCCCTCTAGTTCCAAAATACAATGTC
AAATTTGTGGTAAACTTGGTCATTCTGCTGCTGTGTGTTATCATAGAACCAATATTGCTTATCACAATGCTTCCCCTCAAGCTCTTTATCATCATGTTCAACCTTCACCC
ACCCATCCATCTTCTGGGCATGAATTTCAACACCCTGATGAAAGCTGGTTTATGGATTCCGGTGCCACTCACCATATGACTCCGGACTCCTCCATTCTTTGCAATCCAAC
CCCTTATAGTGGTGGTGAACAAGTCACAGTTGGAAATGGATCTTCGGTCCAAGATGGTTCTTCTTCAGGGATCGCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCAGTTTCCGCCTCCGACACCAAATTTCTTAGCCCAGCCTCCAAATCCGTTTTCGGCAAATCCCTTTCCCACTCTGCCACAACCTCTAAATGTTAAACTTAATGACAA
CAATTTTCTTCTCTGGAAAAACCAGCTACTCAATGCTGTTATTGCCAATGGCCTTCGGGGATATCTTGATGGAACGATAATGCCTCCGCCACAATTTTTGGATCACCATC
AGCTTCAACCAAATCCTGCCTATTATGCTTGGGAAAGGTACAATCGCCTTTTAATGTGTTGGATTTATTCTTCTCTATCTGAAGAGAAAATGGGTGAAGTTGTATCTCTT
GAAACAACGCATGACATTTGGTCTTCTCTAACTAGAGTTTATGATTCTAAGACCACAGCTAGAATTATGGGCTTAAAAACAGAGTTACAAAATCTGAGGAAAGATGGATC
ATCTGTTAGTCAATATCTAGCTAAGATTAAAGAGATTGCTGATAAATTTGCTGCTGTTGGTGAACCTCTTTCCTATCGTGATCATTTAGCTCATGTCCTAGATGGTCTAG
GGAGTGAATACAATGCCTTTGTTACATCTATTCATAATAGGGCTGATTCCCCTTCTTTAGAAGATGTTCGCAGTCTTCTTTTAGCCTATGAAGCACGTTTGGACAAACAG
AACACGGTCGACCAGCTTAATATTGCTCAGGCCAACCTTGTCAATCTTTCCCTTCAACACAACAGTAAGCGCCCTCCTCCAAAGTTCTCATTCCCCAACCATTACAAACA
CTCTTTTCCTAATTCCCCTATTTCCGCTGCTCAATCTCAAAGCATACTCGGTAAGCCACAAAGTGTTCACAAATGGCCTCCCAAGCCCTCTAGTTCCAAAATACAATGTC
AAATTTGTGGTAAACTTGGTCATTCTGCTGCTGTGTGTTATCATAGAACCAATATTGCTTATCACAATGCTTCCCCTCAAGCTCTTTATCATCATGTTCAACCTTCACCC
ACCCATCCATCTTCTGGGCATGAATTTCAACACCCTGATGAAAGCTGGTTTATGGATTCCGGTGCCACTCACCATATGACTCCGGACTCCTCCATTCTTTGCAATCCAAC
CCCTTATAGTGGTGGTGAACAAGTCACAGTTGGAAATGGATCTTCGGTCCAAGATGGTTCTTCTTCAGGGATCGCTTGA
Protein sequenceShow/hide protein sequence
MQFPPPTPNFLAQPPNPFSANPFPTLPQPLNVKLNDNNFLLWKNQLLNAVIANGLRGYLDGTIMPPPQFLDHHQLQPNPAYYAWERYNRLLMCWIYSSLSEEKMGEVVSL
ETTHDIWSSLTRVYDSKTTARIMGLKTELQNLRKDGSSVSQYLAKIKEIADKFAAVGEPLSYRDHLAHVLDGLGSEYNAFVTSIHNRADSPSLEDVRSLLLAYEARLDKQ
NTVDQLNIAQANLVNLSLQHNSKRPPPKFSFPNHYKHSFPNSPISAAQSQSILGKPQSVHKWPPKPSSSKIQCQICGKLGHSAAVCYHRTNIAYHNASPQALYHHVQPSP
THPSSGHEFQHPDESWFMDSGATHHMTPDSSILCNPTPYSGGEQVTVGNGSSVQDGSSSGIA