CuGenDBv2

Moc09g14400 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc09g14400
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: chr9:12390056..12391650
RNA-Seq Expression: Moc09g14400
Synteny: Moc09g14400
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits | e value | % identity
KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | 1.4e-52 | 39.28
Query:  KNTSEISITNQHIQVINPGNKISTVKLTDDNFLLWRLHVLTA-QGHGLEDFIDPE---------------------------------------------
        +NT   S  N   Q+   GNKIS VKL DD FLLW+  +LTA + + LE+F++ E                                             
Subjt:  KNTSEISITNQHIQVINPGNKISTVKLTDDNFLLWRLHVLTA-QGHGLEDFIDPE---------------------------------------------

Query:  -----------------------------ARVMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISARVDP
                                     A+ M+ K+KL N+KKGS+ LKEYF K  Q VDAL + +KP+S  DHI+++LAGLG+++ S +SVISAR D 
Subjt:  -----------------------------ARVMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISARVDP

Query:  PTIQETYSLLLAQEGRNERNAINTEVSLPSVNLTTQEQSKKGHQANSTDTRGNWNNN-----RGRRGN-RSNRGRNWNNNFRIQCQLCGRFGHTASRCYQ
        P++QE  SLLL QE +NE   I +E +LPSVN+ TQ  ++KG ++     + N++NN     RG RGN RSNRGR  N N + QCQ+C + G++A RC+ 
Subjt:  PTIQETYSLLLAQEGRNERNAINTEVSLPSVNLTTQEQSKKGHQANSTDTRGNWNNN-----RGRRGN-RSNRGRNWNNNFRIQCQLCGRFGHTASRCYQ

Query:  RFDRSFQGPHSSAYSFGFHPTPSYGSASNPNQPQMNAFTLSQELNRDTNWYPDSGASHHVTNDLGNLSIGAESHGNNRVLVGNDSGL
        R+       +SS YS   H T SY + +  N PQM+A   + +LN D+NWYPDSGA++H+T+ L NLSIG+E  G N++   N SGL
Subjt:  RFDRSFQGPHSSAYSFGFHPTPSYGSASNPNQPQMNAFTLSQELNRDTNWYPDSGASHHVTNDLGNLSIGAESHGNNRVLVGNDSGL

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | 1.4e-52 | 39.28
Query:  KNTSEISITNQHIQVINPGNKISTVKLTDDNFLLWRLHVLTA-QGHGLEDFIDPE---------------------------------------------
        +NT   S  N   Q+   GNKIS VKL DD FLLW+  +LTA + + LE+F++ E                                             
Subjt:  KNTSEISITNQHIQVINPGNKISTVKLTDDNFLLWRLHVLTA-QGHGLEDFIDPE---------------------------------------------

Query:  -----------------------------ARVMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISARVDP
                                     A+ M+ K+KL N+KKGS+ LKEYF K  Q VDAL + +KP+S  DHI+++LAGLG+++ S +SVISAR D 
Subjt:  -----------------------------ARVMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISARVDP

Query:  PTIQETYSLLLAQEGRNERNAINTEVSLPSVNLTTQEQSKKGHQANSTDTRGNWNNN-----RGRRGN-RSNRGRNWNNNFRIQCQLCGRFGHTASRCYQ
        P++QE  SLLL QE +NE   I +E +LPSVN+ TQ  ++KG ++     + N++NN     RG RGN RSNRGR  N N + QCQ+C + G++A RC+ 
Subjt:  PTIQETYSLLLAQEGRNERNAINTEVSLPSVNLTTQEQSKKGHQANSTDTRGNWNNN-----RGRRGN-RSNRGRNWNNNFRIQCQLCGRFGHTASRCYQ

Query:  RFDRSFQGPHSSAYSFGFHPTPSYGSASNPNQPQMNAFTLSQELNRDTNWYPDSGASHHVTNDLGNLSIGAESHGNNRVLVGNDSGL
        R+       +SS YS   H T SY + +  N PQM+A   + +LN D+NWYPDSGA++H+T+ L NLSIG+E  G N++   N SGL
Subjt:  RFDRSFQGPHSSAYSFGFHPTPSYGSASNPNQPQMNAFTLSQELNRDTNWYPDSGASHHVTNDLGNLSIGAESHGNNRVLVGNDSGL

XP_022136882.1 dr1-associated corepressor homolog isoform X1 [Momordica charantia] | 1.1e-57 | 50.74
Query:  ARVMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISARVDPPTIQETYSLLLAQEGRNERNAINTEVSLP
        ARVM+LKSKLEN+KKG+L LK+YF K K +VD+L AA K ++  DHIMH+L GL +EF+STVSVISAR    T+QE YSLLL+ EGRNERN+INT+ +LP
Subjt:  ARVMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISARVDPPTIQETYSLLLAQEGRNERNAINTEVSLP

Query:  SVNLTTQEQSKKGHQANSTD-TRGNWNNNRGRRGNRSNRGRNWNNNFRIQCQLCGRFGHTASRCYQRFDRSFQGPHSSA-----YSFGFHPTPSYGSASN
        SVNLT  +Q+K  + A S D  R    NNR +     N  RNWN+N R QCQ+ G+FGHTA RCY RF+++F GP+  +     +S G + + +    + 
Subjt:  SVNLTTQEQSKKGHQANSTD-TRGNWNNNRGRRGNRSNRGRNWNNNFRIQCQLCGRFGHTASRCYQRFDRSFQGPHSSA-----YSFGFHPTPSYGSASN

Query:  PNQPQ-------------MNAFTLSQELNRDTNWYPDSGASHHVTNDLGNLSIGAESHGNNRVLVGNDSG
         NQ Q             M AF   Q+ NRDTNWYPDSGA++HVT++  NL+   E  G+N+V +GN +G
Subjt:  PNQPQ-------------MNAFTLSQELNRDTNWYPDSGASHHVTNDLGNLSIGAESHGNNRVLVGNDSG

XP_022136883.1 dr1-associated corepressor homolog isoform X2 [Momordica charantia] | 4.3e-57 | 50.94
Query:  ARVMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISARVDPPTIQETYSLLLAQEGRNERNAINTEVSLP
        ARVM+LKSKLEN+KKG+L LK+YF K K +VD+L AA K ++  DHIMH+L GL +EF+STVSVISAR    T+QE YSLLL+ EGRNERN+INT+ +LP
Subjt:  ARVMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISARVDPPTIQETYSLLLAQEGRNERNAINTEVSLP

Query:  SVNLTTQEQSKKGHQANSTD-TRGNWNNNRGRRGNRSNRGRNWNNNFRIQCQLCGRFGHTASRCYQRFDRSFQGPHSSA-----YSFGFHPTPSYGSASN
        SVNLT  +Q+K  + A S D  R    NNR +     N  RNWN+N R QCQ+ G+FGHTA RCY RF+++F GP+  +     +S G + + +    + 
Subjt:  SVNLTTQEQSKKGHQANSTD-TRGNWNNNRGRRGNRSNRGRNWNNNFRIQCQLCGRFGHTASRCYQRFDRSFQGPHSSA-----YSFGFHPTPSYGSASN

Query:  PNQPQ-------------MNAFTLSQELNRDTNWYPDSGASHHVTNDLGNLSIGAESHGNNRVLVGN
         NQ Q             M AF   Q+ NRDTNWYPDSGA++HVT++  NL+   E  G+N+V +GN
Subjt:  PNQPQ-------------MNAFTLSQELNRDTNWYPDSGASHHVTNDLGNLSIGAESHGNNRVLVGN

XP_022154487.1 uncharacterized protein LOC111021757 [Momordica charantia] | 2.5e-65 | 41.73
Query:  SSKNTSEISITNQHIQVINPGNKISTVKLTDDNFLLWRLHVLTA-QGHGLEDFIDPE-------------------------------------------
        SS   S+ +   Q  + INPG+K+S V+L DDN LLW+  + TA QG+GLE +ID                                             
Subjt:  SSKNTSEISITNQHIQVINPGNKISTVKLTDDNFLLWRLHVLTA-QGHGLEDFIDPE-------------------------------------------

Query:  -------------------------------ARVMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISARV
                                       ARVM+LK KLEN KKG+LSLK+YF K K +VD+L  A K +S  DHIMH+LAGLG EFD+ +SVI+AR 
Subjt:  -------------------------------ARVMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISARV

Query:  DPPTIQETYSLLLAQEGRNERNAINTEVSLPSVNLTTQEQSKKG--HQANSTDTRGNWNNNRGR-RGNRSNRGRNWNNNFRIQCQLCGRFGHTASRCYQR
         P T+QE  SLLL QEGRNERN IN++ SLPSVNLT  + SKK   HQ+   +   +  + RGR   NRS+  RNW  N + QCQ+CGRFGHTA RCY R
Subjt:  DPPTIQETYSLLLAQEGRNERNAINTEVSLPSVNLTTQEQSKKG--HQANSTDTRGNWNNNRGR-RGNRSNRGRNWNNNFRIQCQLCGRFGHTASRCYQR

Query:  FDRSFQGPHSSAYSF---GFHP-----TPSYGSASNP------------NQPQMNAFTLSQELNRDTNWYPDSGASHHVTNDLGNLSIGAESHGNNRVLV
        F+R+F GP+ +  +F   GF       TPS+ + S+P            +  QM A  ++Q+ NRD+NWY DSG ++HVTN+ GN S+G+E HG+ ++ V
Subjt:  FDRSFQGPHSSAYSF---GFHP-----TPSYGSASNP------------NQPQMNAFTLSQELNRDTNWYPDSGASHHVTNDLGNLSIGAESHGNNRVLV

Query:  GNDSG
        GN +G
Subjt:  GNDSG

TrEMBL top hits | e value | % identity
A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 6.9e-53 | 39.28
Query:  KNTSEISITNQHIQVINPGNKISTVKLTDDNFLLWRLHVLTA-QGHGLEDFIDPE---------------------------------------------
        +NT   S  N   Q+   GNKIS VKL DD FLLW+  +LTA + + LE+F++ E                                             
Subjt:  KNTSEISITNQHIQVINPGNKISTVKLTDDNFLLWRLHVLTA-QGHGLEDFIDPE---------------------------------------------

Query:  -----------------------------ARVMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISARVDP
                                     A+ M+ K+KL N+KKGS+ LKEYF K  Q VDAL + +KP+S  DHI+++LAGLG+++ S +SVISAR D 
Subjt:  -----------------------------ARVMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISARVDP

Query:  PTIQETYSLLLAQEGRNERNAINTEVSLPSVNLTTQEQSKKGHQANSTDTRGNWNNN-----RGRRGN-RSNRGRNWNNNFRIQCQLCGRFGHTASRCYQ
        P++QE  SLLL QE +NE   I +E +LPSVN+ TQ  ++KG ++     + N++NN     RG RGN RSNRGR  N N + QCQ+C + G++A RC+ 
Subjt:  PTIQETYSLLLAQEGRNERNAINTEVSLPSVNLTTQEQSKKGHQANSTDTRGNWNNN-----RGRRGN-RSNRGRNWNNNFRIQCQLCGRFGHTASRCYQ

Query:  RFDRSFQGPHSSAYSFGFHPTPSYGSASNPNQPQMNAFTLSQELNRDTNWYPDSGASHHVTNDLGNLSIGAESHGNNRVLVGNDSGL
        R+       +SS YS   H T SY + +  N PQM+A   + +LN D+NWYPDSGA++H+T+ L NLSIG+E  G N++   N SGL
Subjt:  RFDRSFQGPHSSAYSFGFHPTPSYGSASNPNQPQMNAFTLSQELNRDTNWYPDSGASHHVTNDLGNLSIGAESHGNNRVLVGNDSGL

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 6.9e-53 | 39.28
Query:  KNTSEISITNQHIQVINPGNKISTVKLTDDNFLLWRLHVLTA-QGHGLEDFIDPE---------------------------------------------
        +NT   S  N   Q+   GNKIS VKL DD FLLW+  +LTA + + LE+F++ E                                             
Subjt:  KNTSEISITNQHIQVINPGNKISTVKLTDDNFLLWRLHVLTA-QGHGLEDFIDPE---------------------------------------------

Query:  -----------------------------ARVMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISARVDP
                                     A+ M+ K+KL N+KKGS+ LKEYF K  Q VDAL + +KP+S  DHI+++LAGLG+++ S +SVISAR D 
Subjt:  -----------------------------ARVMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISARVDP

Query:  PTIQETYSLLLAQEGRNERNAINTEVSLPSVNLTTQEQSKKGHQANSTDTRGNWNNN-----RGRRGN-RSNRGRNWNNNFRIQCQLCGRFGHTASRCYQ
        P++QE  SLLL QE +NE   I +E +LPSVN+ TQ  ++KG ++     + N++NN     RG RGN RSNRGR  N N + QCQ+C + G++A RC+ 
Subjt:  PTIQETYSLLLAQEGRNERNAINTEVSLPSVNLTTQEQSKKGHQANSTDTRGNWNNN-----RGRRGN-RSNRGRNWNNNFRIQCQLCGRFGHTASRCYQ

Query:  RFDRSFQGPHSSAYSFGFHPTPSYGSASNPNQPQMNAFTLSQELNRDTNWYPDSGASHHVTNDLGNLSIGAESHGNNRVLVGNDSGL
        R+       +SS YS   H T SY + +  N PQM+A   + +LN D+NWYPDSGA++H+T+ L NLSIG+E  G N++   N SGL
Subjt:  RFDRSFQGPHSSAYSFGFHPTPSYGSASNPNQPQMNAFTLSQELNRDTNWYPDSGASHHVTNDLGNLSIGAESHGNNRVLVGNDSGL

A0A6J1C6N9 dr1-associated corepressor homolog isoform X1 | 5.4e-58 | 50.74
Query:  ARVMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISARVDPPTIQETYSLLLAQEGRNERNAINTEVSLP
        ARVM+LKSKLEN+KKG+L LK+YF K K +VD+L AA K ++  DHIMH+L GL +EF+STVSVISAR    T+QE YSLLL+ EGRNERN+INT+ +LP
Subjt:  ARVMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISARVDPPTIQETYSLLLAQEGRNERNAINTEVSLP

Query:  SVNLTTQEQSKKGHQANSTD-TRGNWNNNRGRRGNRSNRGRNWNNNFRIQCQLCGRFGHTASRCYQRFDRSFQGPHSSA-----YSFGFHPTPSYGSASN
        SVNLT  +Q+K  + A S D  R    NNR +     N  RNWN+N R QCQ+ G+FGHTA RCY RF+++F GP+  +     +S G + + +    + 
Subjt:  SVNLTTQEQSKKGHQANSTD-TRGNWNNNRGRRGNRSNRGRNWNNNFRIQCQLCGRFGHTASRCYQRFDRSFQGPHSSA-----YSFGFHPTPSYGSASN

Query:  PNQPQ-------------MNAFTLSQELNRDTNWYPDSGASHHVTNDLGNLSIGAESHGNNRVLVGNDSG
         NQ Q             M AF   Q+ NRDTNWYPDSGA++HVT++  NL+   E  G+N+V +GN +G
Subjt:  PNQPQ-------------MNAFTLSQELNRDTNWYPDSGASHHVTNDLGNLSIGAESHGNNRVLVGNDSG

A0A6J1C8R2 dr1-associated corepressor homolog isoform X2 | 2.1e-57 | 50.94
Query:  ARVMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISARVDPPTIQETYSLLLAQEGRNERNAINTEVSLP
        ARVM+LKSKLEN+KKG+L LK+YF K K +VD+L AA K ++  DHIMH+L GL +EF+STVSVISAR    T+QE YSLLL+ EGRNERN+INT+ +LP
Subjt:  ARVMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISARVDPPTIQETYSLLLAQEGRNERNAINTEVSLP

Query:  SVNLTTQEQSKKGHQANSTD-TRGNWNNNRGRRGNRSNRGRNWNNNFRIQCQLCGRFGHTASRCYQRFDRSFQGPHSSA-----YSFGFHPTPSYGSASN
        SVNLT  +Q+K  + A S D  R    NNR +     N  RNWN+N R QCQ+ G+FGHTA RCY RF+++F GP+  +     +S G + + +    + 
Subjt:  SVNLTTQEQSKKGHQANSTD-TRGNWNNNRGRRGNRSNRGRNWNNNFRIQCQLCGRFGHTASRCYQRFDRSFQGPHSSA-----YSFGFHPTPSYGSASN

Query:  PNQPQ-------------MNAFTLSQELNRDTNWYPDSGASHHVTNDLGNLSIGAESHGNNRVLVGN
         NQ Q             M AF   Q+ NRDTNWYPDSGA++HVT++  NL+   E  G+N+V +GN
Subjt:  PNQPQ-------------MNAFTLSQELNRDTNWYPDSGASHHVTNDLGNLSIGAESHGNNRVLVGN

A0A6J1DLT9 uncharacterized protein LOC111021757 | 1.2e-65 | 41.73
Query:  SSKNTSEISITNQHIQVINPGNKISTVKLTDDNFLLWRLHVLTA-QGHGLEDFIDPE-------------------------------------------
        SS   S+ +   Q  + INPG+K+S V+L DDN LLW+  + TA QG+GLE +ID                                             
Subjt:  SSKNTSEISITNQHIQVINPGNKISTVKLTDDNFLLWRLHVLTA-QGHGLEDFIDPE-------------------------------------------

Query:  -------------------------------ARVMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISARV
                                       ARVM+LK KLEN KKG+LSLK+YF K K +VD+L  A K +S  DHIMH+LAGLG EFD+ +SVI+AR 
Subjt:  -------------------------------ARVMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISARV

Query:  DPPTIQETYSLLLAQEGRNERNAINTEVSLPSVNLTTQEQSKKG--HQANSTDTRGNWNNNRGR-RGNRSNRGRNWNNNFRIQCQLCGRFGHTASRCYQR
         P T+QE  SLLL QEGRNERN IN++ SLPSVNLT  + SKK   HQ+   +   +  + RGR   NRS+  RNW  N + QCQ+CGRFGHTA RCY R
Subjt:  DPPTIQETYSLLLAQEGRNERNAINTEVSLPSVNLTTQEQSKKG--HQANSTDTRGNWNNNRGR-RGNRSNRGRNWNNNFRIQCQLCGRFGHTASRCYQR

Query:  FDRSFQGPHSSAYSF---GFHP-----TPSYGSASNP------------NQPQMNAFTLSQELNRDTNWYPDSGASHHVTNDLGNLSIGAESHGNNRVLV
        F+R+F GP+ +  +F   GF       TPS+ + S+P            +  QM A  ++Q+ NRD+NWY DSG ++HVTN+ GN S+G+E HG+ ++ V
Subjt:  FDRSFQGPHSSAYSF---GFHP-----TPSYGSASNP------------NQPQMNAFTLSQELNRDTNWYPDSGASHHVTNDLGNLSIGAESHGNNRVLV

Query:  GNDSG
        GN +G
Subjt:  GNDSG

SwissProt top hits | e value | % identity
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 3.3e-12 | 28.4
Query:  VMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISARVDPPTIQETYSLLLAQEGRNERNAINTEVSLPSV
        V +L+++L+   KG+ ++ +Y        D L    KP+   + +  +L  L  E+   +  I+A+  PPT+ E +  LL  E  ++  A+++   +P  
Subjt:  VMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISARVDPPTIQETYSLLLAQEGRNERNAINTEVSLPSV

Query:  -------NLTTQEQSKKGHQANSTDTRGNWNNNRGRRGNRSNRGRNWNNN--FRIQCQLCGRFGHTASRCYQRFDRSFQGPHSSAYSFGFHPTPSYGSAS
               N TT   +  G++ N  D R N NN++  + + +N   N N +  +  +CQ+CG  GH+A RC Q   + F    +S       P+P      
Subjt:  -------NLTTQEQSKKGHQANSTDTRGNWNNNRGRRGNRSNRGRNWNNN--FRIQCQLCGRFGHTASRCYQRFDRSFQGPHSSAYSFGFHPTPSYGSAS

Query:  NPNQPQMNAFTLSQELNRDTNWYPDSGASHHVTNDLGNLSIGAESHGNNRVLVGNDS
         P QP+ N    S       NW  DSGA+HH+T+D  NLS+     G + V+V + S
Subjt:  NPNQPQMNAFTLSQELNRDTNWYPDSGASHHVTNDLGNLSIGAESHGNNRVLVGNDS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 2.2e-08 | 26.99
Query:  DALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISARVDPPTIQETYSLLLAQEGRNERNAINTEVSLP-SVNLTTQEQSKKGHQANSTDTRGNWNNNRG
        D L    KP+   + +  +L  L  ++   +  I+A+  PP++ E +  L+ +E  ++  A+N+   +P + N+ T   +      N+     N+NNN  
Subjt:  DALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISARVDPPTIQETYSLLLAQEGRNERNAINTEVSLP-SVNLTTQEQSKKGHQANSTDTRGNWNNNRG

Query:  RRGN--RSNRGRNWNNN----FRIQCQLCGRFGHTASRCYQRFDRSFQGPHSSAYSFGFHPTPSYGSASNPNQPQMNAFTLSQELNRDTNWYPDSGASHH
        R  +   S+ G   +N     +  +CQ+C   GH+A RC Q     FQ   +   S          S   P QP+ N   ++   N + NW  DSGA+HH
Subjt:  RRGN--RSNRGRNWNNN----FRIQCQLCGRFGHTASRCYQRFDRSFQGPHSSAYSFGFHPTPSYGSASNPNQPQMNAFTLSQELNRDTNWYPDSGASHH

Query:  VTNDLGNLSIGAESHGNNRVLVGNDS
        +T+D  NLS      G + V++ + S
Subjt:  VTNDLGNLSIGAESHGNNRVLVGNDS

Arabidopsis top hits | e value | % identity
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) | 1.4e-05 | 29.45
Query:  EARVMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISARVDPPTIQETYSLLLAQEGRNER----NAINT
        +AR + L S+L     G + + +Y+ K K++ D+L     P++  + +M++L GL  +FD+ ++VI  R   P+  +  ++L  +E R +R    N  + 
Subjt:  EARVMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISARVDPPTIQETYSLLLAQEGRNER----NAINT

Query:  EVSLPSVNLTTQEQSKKGHQANSTDTRGNWNNNRGR-RGNRSNRGR
        + S  S  L   E        N   + GN    RGR RGN   RGR
Subjt:  EVSLPSVNLTTQEQSKKGHQANSTDTRGNWNNNRGR-RGNRSNRGR


Sequences
CDS sequence
ATGACGTTAGAATCGTCGAAAAACACCTCTGAAATATCGATCACAAATCAGCATATTCAGGTTATTAATCCTGGTAACAAGATCTCTACAGTCAAATTGACTGAT
GATAATTTCTTGTTGTGGCGATTGCACGTTCTGACTGCTCAAGGCCATGGACTGGAGGACTTCATCGATCCTGAAGCTCGTGTTATGGAGTTAAAATCGAAGCTT
GAAAACCTAAAGAAAGGGAGCCTCAGTCTAAAGGAGTATTTTGCAAAGGCAAAGCAAATTGTTGATGCCCTAACTGCTGCTAGTAAACCAATTTCGAAGACTGAT
CATATAATGCATCTATTAGCCGGTCTAGGAACCGAATTCGATTCAACTGTGTCGGTAATTTCTGCGCGTGTTGATCCTCCAACAATTCAAGAAACGTATTCATTA
CTACTTGCTCAAGAAGGAAGGAACGAGAGGAATGCTATCAATACTGAGGTATCACTACCATCAGTGAATTTAACAACTCAAGAACAATCGAAGAAGGGACATCAA
GCTAATTCTACAGATACTAGAGGAAATTGGAACAATAACAGAGGAAGAAGAGGCAATCGATCAAACCGTGGGCGAAATTGGAATAACAATTTCAGAATTCAGTGT
CAACTTTGCGGTCGATTTGGCCATACTGCCTCGAGGTGTTATCAACGCTTTGATCGGAGTTTTCAGGGCCCTCATTCGTCGGCTTATTCGTTCGGATTTCATCCG
ACCCCTTCGTATGGTTCAGCATCAAATCCTAATCAACCTCAGATGAATGCTTTTACTCTTTCTCAGGAGCTCAATCGGGACACTAACTGGTATCCAGATTCTGGT
GCTTCACATCACGTCACAAATGATCTTGGAAATTTGTCTATTGGAGCTGAAAGTCATGGCAATAACAGAGTTCTTGTAGGCAACGACTCAGGTTTGAACGGCAAA
GTTTCTGATGGGCTGTACACATTCTCTTTGGACAAGGCTAAGTCTTCTACATCATTCCCTTCCACCATTCTTTCTCATGGCTCTTCTTCTTCAACCATCACTCTT
CAGGTTCTTCATACATTAGCTTCTCCTGCTACTTCTTTTCCATGTGATACTGATTCTGCTGAACAATTTACACCTTTCACACAATGTAAGCCTTCTGTTTTAGAT
ATATGGCACTGA
mRNA sequence
ATGACGTTAGAATCGTCGAAAAACACCTCTGAAATATCGATCACAAATCAGCATATTCAGGTTATTAATCCTGGTAACAAGATCTCTACAGTCAAATTGACTGAT
GATAATTTCTTGTTGTGGCGATTGCACGTTCTGACTGCTCAAGGCCATGGACTGGAGGACTTCATCGATCCTGAAGCTCGTGTTATGGAGTTAAAATCGAAGCTT
GAAAACCTAAAGAAAGGGAGCCTCAGTCTAAAGGAGTATTTTGCAAAGGCAAAGCAAATTGTTGATGCCCTAACTGCTGCTAGTAAACCAATTTCGAAGACTGAT
CATATAATGCATCTATTAGCCGGTCTAGGAACCGAATTCGATTCAACTGTGTCGGTAATTTCTGCGCGTGTTGATCCTCCAACAATTCAAGAAACGTATTCATTA
CTACTTGCTCAAGAAGGAAGGAACGAGAGGAATGCTATCAATACTGAGGTATCACTACCATCAGTGAATTTAACAACTCAAGAACAATCGAAGAAGGGACATCAA
GCTAATTCTACAGATACTAGAGGAAATTGGAACAATAACAGAGGAAGAAGAGGCAATCGATCAAACCGTGGGCGAAATTGGAATAACAATTTCAGAATTCAGTGT
CAACTTTGCGGTCGATTTGGCCATACTGCCTCGAGGTGTTATCAACGCTTTGATCGGAGTTTTCAGGGCCCTCATTCGTCGGCTTATTCGTTCGGATTTCATCCG
ACCCCTTCGTATGGTTCAGCATCAAATCCTAATCAACCTCAGATGAATGCTTTTACTCTTTCTCAGGAGCTCAATCGGGACACTAACTGGTATCCAGATTCTGGT
GCTTCACATCACGTCACAAATGATCTTGGAAATTTGTCTATTGGAGCTGAAAGTCATGGCAATAACAGAGTTCTTGTAGGCAACGACTCAGGTTTGAACGGCAAA
GTTTCTGATGGGCTGTACACATTCTCTTTGGACAAGGCTAAGTCTTCTACATCATTCCCTTCCACCATTCTTTCTCATGGCTCTTCTTCTTCAACCATCACTCTT
CAGGTTCTTCATACATTAGCTTCTCCTGCTACTTCTTTTCCATGTGATACTGATTCTGCTGAACAATTTACACCTTTCACACAATGTAAGCCTTCTGTTTTAGAT
ATATGGCACTGA
Protein sequence
MTLESSKNTSEISITNQHIQVINPGNKISTVKLTDDNFLLWRLHVLTAQGHGLEDFIDPEARVMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTD
HIMHLLAGLGTEFDSTVSVISARVDPPTIQETYSLLLAQEGRNERNAINTEVSLPSVNLTTQEQSKKGHQANSTDTRGNWNNNRGRRGNRSNRGRNWNNNFRIQC
QLCGRFGHTASRCYQRFDRSFQGPHSSAYSFGFHPTPSYGSASNPNQPQMNAFTLSQELNRDTNWYPDSGASHHVTNDLGNLSIGAESHGNNRVLVGNDSGLNGK
VSDGLYTFSLDKAKSSTSFPSTILSHGSSSSTITLQVLHTLASPATSFPCDTDSAEQFTPFTQCKPSVLDIWH