; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g02810 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g02810
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr7:2491190..2492697
RNA-Seq ExpressionMoc07g02810
SyntenyMoc07g02810
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN76196.1 hypothetical protein VITISV_041073 [Vitis vinifera]2.3e-7237.89Show/hide
Query:  TSLSLKTLRSSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPSF-------------------SVKGSLKCAHYGFSDFNNSAQLF
        +S    T  SS Q+T  KLNGKN+L+W++S  L I GR +LG++NG +++P   DP+                    + K   +     +SD  NS+Q+F
Subjt:  TSLSLKTLRSSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPSF-------------------SVKGSLKCAHYGFSDFNNSAQLF

Query:  ELRNKARSIQQGESDVTQYYSSLRRLWAKLDLCLNLEWENSKDVERFRKHVEKKQIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVM
        +L++K    +QG+ +VT YY+ +  LW +LDLC   EW+   D  R +K  E  ++Y FLA L   LD+VRGR+L  KP+P+I E+F+EVR E +R++VM
Subjt:  ELRNKARSIQQGESDVTQYYSSLRRLWAKLDLCLNLEWENSKDVERFRKHVEKKQIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVM

Query:  MGDTPTKPLSLSLESSALAAR-----GDYRPPP---------------------PTNTPPSRTSSS-SYQVGPSVSNSQDLATSLPPFSKAQLEQLYGLL
        + D P    +  +ESSAL ++     GD R  P                     P N      S   ++Q   + S    + +  P F+K QL  LY L 
Subjt:  MGDTPTKPLSLSLESSALAAR-----GDYRPPP---------------------PTNTPPSRTSSS-SYQVGPSVSNSQDLATSLPPFSKAQLEQLYGLL

Query:  TPPVESTPSSSFMAQRGIFNAALTSQQHSDQ--WILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHVSKLCCNL
          P  S PS S   Q     AAL+S + +    WI+DSGATDHMT    +F+ Y P      +K+ DGS + I G GSV +SP++TLH+VLHV  L CNL
Subjt:  TPPVESTPSSSFMAQRGIFNAALTSQQHSDQ--WILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHVSKLCCNL

Query:  ISVQKLTHDLKCQALFTDSKCLFQDLITGTTISNADGFEGLYYFRGPSLRNKQV
        +S+ K+T D +CQA F  S C FQ+L +G TI NA    GLY+F   S   K +
Subjt:  ISVQKLTHDLKCQALFTDSKCLFQDLITGTTISNADGFEGLYYFRGPSLRNKQV

KAE8658400.1 60S ribosomal protein L38 [Hibiscus syriacus]1.1e-6937.74Show/hide
Query:  SLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPS--FSVKGSL---------------------------KCAHYGFSDFNNSAQLF
        ++ ITS KLNG NFLQWS+S  L IRGR + GY++G   +P E + S  +  + S+                              +  +SD  N+ Q F
Subjt:  SLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPS--FSVKGSL---------------------------KCAHYGFSDFNNSAQLF

Query:  ELRNKARSIQQGESDVTQYYSSLRRLWAKLDLCLNLEWENSKDVERFRKHVEKKQIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVM
        EL+ +   ++QGE  VTQYY+ L+ LW ++D+  + EW  +KD   F+K VEK+++++FL GL  ELD+VRGR+L  +P+P+  E+F+EVR E SR+ VM
Subjt:  ELRNKARSIQQGESDVTQYYSSLRRLWAKLDLCLNLEWENSKDVERFRKHVEKKQIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVM

Query:  M-GDTPTKPLSLSLESSALAARGDYR-------------------------PPPPTNTPPSRTSSSS---YQVGPSVSNSQDLATSLPPFSKAQLEQLYG
        + G +P    S +L S     R D R                           P  N   +R S ++   +      S     A+ L  FSK QLEQLY 
Subjt:  M-GDTPTKPLSLSLESSALAARGDYR-------------------------PPPPTNTPPSRTSSSS---YQVGPSVSNSQDLATSLPPFSKAQLEQLYG

Query:  LL-------TPPVESTPSSSFMAQRGIFNAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHV
        L+       TP   S  +SS +AQ+G +  A  +   S+ WI+DSGATDHMT    +F+ Y P      VK+ADGS   I G GS+I+SP++TL +VLHV
Subjt:  LL-------TPPVESTPSSSFMAQRGIFNAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHV

Query:  SKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTISNADGFEGLYYFRGPSLRNKQV
         KL CNLISV ++ HD KC A  T +   FQD  +G  I NA   +GLY+    +  NKQV
Subjt:  SKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTISNADGFEGLYYFRGPSLRNKQV

RVW98618.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]1.8e-7237.42Show/hide
Query:  TSLSLKTLRSSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPSF------------------------------SVKGSLKCAHYG
        +S    T  SS Q+T  KLNGKN+L+W++S  L I GR +LG++NG +++P   DP+                               + K   +     
Subjt:  TSLSLKTLRSSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPSF------------------------------SVKGSLKCAHYG

Query:  FSDFNNSAQLFELRNKARSIQQGESDVTQYYSSLRRLWAKLDLCLNLEWENSKDVERFRKHVEKKQIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAE
        +SD  NS+Q+F+L++K    +QG+ +VT YY+ +  LW +LDLC   EW+   D  R +K  E  ++Y FLAGL   LD+VRGR+L  KP+P+I E+F+E
Subjt:  FSDFNNSAQLFELRNKARSIQQGESDVTQYYSSLRRLWAKLDLCLNLEWENSKDVERFRKHVEKKQIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAE

Query:  VRWESSRKRVMMGDTPTKPLSLSLESSALAAR-----GDYRPPP---------------------PTNTPPSRTSSS-SYQVGPSVSNSQDLATSLPPFS
        VR E +R++VM+ D P    +  +ESSAL ++     GD R  P                     P N      S   ++Q   + S    + +  P F+
Subjt:  VRWESSRKRVMMGDTPTKPLSLSLESSALAAR-----GDYRPPP---------------------PTNTPPSRTSSS-SYQVGPSVSNSQDLATSLPPFS

Query:  KAQLEQLYGLLTPPVESTPSSSFMAQRGIFNAALTSQQHSDQ--WILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHS
        K QL  LY L   P  S PS S   Q     AAL+S + +    WI+DSGATDHMT    +F+ Y P      +K+ADGS + I G GSV +SP++TLH+
Subjt:  KAQLEQLYGLLTPPVESTPSSSFMAQRGIFNAALTSQQHSDQ--WILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHS

Query:  VLHVSKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTISNADGFEGLYYFRGPSLRNKQV
        VLHV  L CNL+S+ K+T D +CQA F  S C FQ+L +G TI NA    GLY+F   S   K +
Subjt:  VLHVSKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTISNADGFEGLYYFRGPSLRNKQV

XP_022159153.1 uncharacterized protein LOC111025577 [Momordica charantia]2.3e-17377.34Show/hide
Query:  SSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPSFSV------------------------------KGSLKCAHYGFSDFNNSAQ
        +SLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPSFSV                              K         FSDF+NSAQ
Subjt:  SSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPSFSV------------------------------KGSLKCAHYGFSDFNNSAQ

Query:  LFELRNKARSIQQGESDVTQYYSSLRRLWAKLDLCLNLEWENSKDVERFRKHVEKKQIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKR
        LFEL NKARS++QGESDVTQYYSSLRRLWA+LDLCLNLEWENSKD ERFRKHVEK++IYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEV WESSRKR
Subjt:  LFELRNKARSIQQGESDVTQYYSSLRRLWAKLDLCLNLEWENSKDVERFRKHVEKKQIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKR

Query:  VMMGDTPTKPLSLSLESSALAARGDYRPPPPTNTPPSRTSSSSYQVGPSVSNSQDLATSLPPFSKAQLEQLYGLLTPPVESTPSSSFMAQRGIFNAALTS
        VMMGDT TKPLSLSLESSALAARG    PPP+  P                              AQLEQLY LLT PVESTPSSSF+AQRGI +AALT 
Subjt:  VMMGDTPTKPLSLSLESSALAARGDYRPPPPTNTPPSRTSSSSYQVGPSVSNSQDLATSLPPFSKAQLEQLYGLLTPPVESTPSSSFMAQRGIFNAALTS

Query:  QQHSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHVSKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLI
        QQHSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVL V KLCCNLISVQKLTHDLKCQALFTDSKCLFQDLI
Subjt:  QQHSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHVSKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLI

Query:  TGTTISNADGFEGLYYFRGPSLRNKQVL
        TGTTI +ADGFEGLYYFRGPSLRNKQVL
Subjt:  TGTTISNADGFEGLYYFRGPSLRNKQVL

XP_024044152.1 uncharacterized protein LOC18046468 isoform X2 [Citrus clementina]3.0e-6738.31Show/hide
Query:  ITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPSF------------------------------SVKGSLKCAHYGFSDFNNSAQLFEL
        IT+ KLNG+NFLQWS+S  + IRG+ ++GY+ G+I EP E DP F                              + K         +SD  NSAQ+++L
Subjt:  ITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPSF------------------------------SVKGSLKCAHYGFSDFNNSAQLFEL

Query:  RNKARSIQQGESDVTQYYSSLRRLWAKLDLCLNLEWENSKDVERFRKHVEKKQIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMG
        + + R  +QG   VT+YY+ L+ LW +LD   + EWE + D  +++K +EK+++++FLAGL  +LD+VRGR+L  +P+P+  E+F+ VR E SRK VMMG
Subjt:  RNKARSIQQGESDVTQYYSSLRRLWAKLDLCLNLEWENSKDVERFRKHVEKKQIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMG

Query:  D-----------TPTKPL---SLSLESSALAAR--GDY----------------RPPPPTNTPPSRTSSSSYQ-VGPSVSNSQDLATSLPPFSKAQLEQL
                    TP  PL   + +L+ S    R   DY                +PP   N   S   S  +Q VG +   +    T    F+K QLEQL
Subjt:  D-----------TPTKPL---SLSLESSALAAR--GDY----------------RPPPPTNTPPSRTSSSSYQ-VGPSVSNSQDLATSLPPFSKAQLEQL

Query:  YGLLTPPVESTPSSSF--MAQRG-IFNAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHVSK
        Y  L         SSF  +AQ+G  F A     +  D WI+DSGATDHMT+   +F+ Y P      +K+ADGS + + G GS+ +S N+ L SVLHV  
Subjt:  YGLLTPPVESTPSSSF--MAQRG-IFNAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHVSK

Query:  LCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTISNADGFEGLYYF
        L CNL+SV K+T DL C A F+ S C FQDL +G  I +A   +GLYYF
Subjt:  LCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTISNADGFEGLYYF

TrEMBL top hitse value%identityAlignment
A0A438IPH4 Retrovirus-related Pol polyprotein from transposon RE18.7e-7337.42Show/hide
Query:  TSLSLKTLRSSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPSF------------------------------SVKGSLKCAHYG
        +S    T  SS Q+T  KLNGKN+L+W++S  L I GR +LG++NG +++P   DP+                               + K   +     
Subjt:  TSLSLKTLRSSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPSF------------------------------SVKGSLKCAHYG

Query:  FSDFNNSAQLFELRNKARSIQQGESDVTQYYSSLRRLWAKLDLCLNLEWENSKDVERFRKHVEKKQIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAE
        +SD  NS+Q+F+L++K    +QG+ +VT YY+ +  LW +LDLC   EW+   D  R +K  E  ++Y FLAGL   LD+VRGR+L  KP+P+I E+F+E
Subjt:  FSDFNNSAQLFELRNKARSIQQGESDVTQYYSSLRRLWAKLDLCLNLEWENSKDVERFRKHVEKKQIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAE

Query:  VRWESSRKRVMMGDTPTKPLSLSLESSALAAR-----GDYRPPP---------------------PTNTPPSRTSSS-SYQVGPSVSNSQDLATSLPPFS
        VR E +R++VM+ D P    +  +ESSAL ++     GD R  P                     P N      S   ++Q   + S    + +  P F+
Subjt:  VRWESSRKRVMMGDTPTKPLSLSLESSALAAR-----GDYRPPP---------------------PTNTPPSRTSSS-SYQVGPSVSNSQDLATSLPPFS

Query:  KAQLEQLYGLLTPPVESTPSSSFMAQRGIFNAALTSQQHSDQ--WILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHS
        K QL  LY L   P  S PS S   Q     AAL+S + +    WI+DSGATDHMT    +F+ Y P      +K+ADGS + I G GSV +SP++TLH+
Subjt:  KAQLEQLYGLLTPPVESTPSSSFMAQRGIFNAALTSQQHSDQ--WILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHS

Query:  VLHVSKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTISNADGFEGLYYFRGPSLRNKQV
        VLHV  L CNL+S+ K+T D +CQA F  S C FQ+L +G TI NA    GLY+F   S   K +
Subjt:  VLHVSKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTISNADGFEGLYYFRGPSLRNKQV

A0A6A2WU09 60S ribosomal protein L385.3e-7037.74Show/hide
Query:  SLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPS--FSVKGSL---------------------------KCAHYGFSDFNNSAQLF
        ++ ITS KLNG NFLQWS+S  L IRGR + GY++G   +P E + S  +  + S+                              +  +SD  N+ Q F
Subjt:  SLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPS--FSVKGSL---------------------------KCAHYGFSDFNNSAQLF

Query:  ELRNKARSIQQGESDVTQYYSSLRRLWAKLDLCLNLEWENSKDVERFRKHVEKKQIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVM
        EL+ +   ++QGE  VTQYY+ L+ LW ++D+  + EW  +KD   F+K VEK+++++FL GL  ELD+VRGR+L  +P+P+  E+F+EVR E SR+ VM
Subjt:  ELRNKARSIQQGESDVTQYYSSLRRLWAKLDLCLNLEWENSKDVERFRKHVEKKQIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVM

Query:  M-GDTPTKPLSLSLESSALAARGDYR-------------------------PPPPTNTPPSRTSSSS---YQVGPSVSNSQDLATSLPPFSKAQLEQLYG
        + G +P    S +L S     R D R                           P  N   +R S ++   +      S     A+ L  FSK QLEQLY 
Subjt:  M-GDTPTKPLSLSLESSALAARGDYR-------------------------PPPPTNTPPSRTSSSS---YQVGPSVSNSQDLATSLPPFSKAQLEQLYG

Query:  LL-------TPPVESTPSSSFMAQRGIFNAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHV
        L+       TP   S  +SS +AQ+G +  A  +   S+ WI+DSGATDHMT    +F+ Y P      VK+ADGS   I G GS+I+SP++TL +VLHV
Subjt:  LL-------TPPVESTPSSSFMAQRGIFNAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHV

Query:  SKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTISNADGFEGLYYFRGPSLRNKQV
         KL CNLISV ++ HD KC A  T +   FQD  +G  I NA   +GLY+    +  NKQV
Subjt:  SKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTISNADGFEGLYYFRGPSLRNKQV

A0A6J1DY12 uncharacterized protein LOC1110255771.1e-17377.34Show/hide
Query:  SSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPSFSV------------------------------KGSLKCAHYGFSDFNNSAQ
        +SLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPSFSV                              K         FSDF+NSAQ
Subjt:  SSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPSFSV------------------------------KGSLKCAHYGFSDFNNSAQ

Query:  LFELRNKARSIQQGESDVTQYYSSLRRLWAKLDLCLNLEWENSKDVERFRKHVEKKQIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKR
        LFEL NKARS++QGESDVTQYYSSLRRLWA+LDLCLNLEWENSKD ERFRKHVEK++IYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEV WESSRKR
Subjt:  LFELRNKARSIQQGESDVTQYYSSLRRLWAKLDLCLNLEWENSKDVERFRKHVEKKQIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKR

Query:  VMMGDTPTKPLSLSLESSALAARGDYRPPPPTNTPPSRTSSSSYQVGPSVSNSQDLATSLPPFSKAQLEQLYGLLTPPVESTPSSSFMAQRGIFNAALTS
        VMMGDT TKPLSLSLESSALAARG    PPP+  P                              AQLEQLY LLT PVESTPSSSF+AQRGI +AALT 
Subjt:  VMMGDTPTKPLSLSLESSALAARGDYRPPPPTNTPPSRTSSSSYQVGPSVSNSQDLATSLPPFSKAQLEQLYGLLTPPVESTPSSSFMAQRGIFNAALTS

Query:  QQHSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHVSKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLI
        QQHSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVL V KLCCNLISVQKLTHDLKCQALFTDSKCLFQDLI
Subjt:  QQHSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHVSKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLI

Query:  TGTTISNADGFEGLYYFRGPSLRNKQVL
        TGTTI +ADGFEGLYYFRGPSLRNKQVL
Subjt:  TGTTISNADGFEGLYYFRGPSLRNKQVL

A0A7J0DNJ6 Uncharacterized protein2.4e-6737.19Show/hide
Query:  QITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPSFSV------------------------------KGSLKCAHYGFSDFNNSAQLFE
        QITS KL+G+N+LQWSRS  L+IRG  R GY++G+I +P   DPSF +                              K         +SD  +S+Q+F+
Subjt:  QITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPSFSV------------------------------KGSLKCAHYGFSDFNNSAQLFE

Query:  LRNKARSIQQGESDVTQYYSSLRRLWAKLDLCLNLEWENSKDVERFRKHVEKKQIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMM
        LR+++R+++Q E  VTQY+SSL +LW +LDL     W  + + E +RK + K++ Y+FL GL P LDDVRGR+L+ KP+P++D IF+EVR E  R+R+M+
Subjt:  LRNKARSIQQGESDVTQYYSSLRRLWAKLDLCLNLEWENSKDVERFRKHVEKKQIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMM

Query:  GDTP-TKPLSLSLESSALAAR-GDYRPP------------------------PPTNTPPSRTSSSSYQVGPSVSNSQDLATSLPPFSKAQLEQLYGLLTP
        G  P     S++ ++SA+AAR  D R P                         P +  P     S     P  + S   ++S   F++AQL+Q+  L + 
Subjt:  GDTP-TKPLSLSLESSALAAR-GDYRPP------------------------PPTNTPPSRTSSSSYQVGPSVSNSQDLATSLPPFSKAQLEQLYGLLTP

Query:  PVESTPSSSFMAQRGIFNAALTSQQHSDQ-WILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHVSKLCCNLISV
              SS+ +   GI ++ +     S + WI+DSGA++HM++   +F+ YS       V LADGS + + G G+V LSPN+ LH+VL V KL  NL+S+
Subjt:  PVESTPSSSFMAQRGIFNAALTSQQHSDQ-WILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHVSKLCCNLISV

Query:  QKLTHDLKCQALFTDSKCLFQDLITGTTISNADGFEGLYYF
         KLT D  C A  + + C+FQD  +G TI  A    GLYYF
Subjt:  QKLTHDLKCQALFTDSKCLFQDLITGTTISNADGFEGLYYF

A5AYJ3 Integrase catalytic domain-containing protein1.1e-7237.89Show/hide
Query:  TSLSLKTLRSSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPSF-------------------SVKGSLKCAHYGFSDFNNSAQLF
        +S    T  SS Q+T  KLNGKN+L+W++S  L I GR +LG++NG +++P   DP+                    + K   +     +SD  NS+Q+F
Subjt:  TSLSLKTLRSSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPSF-------------------SVKGSLKCAHYGFSDFNNSAQLF

Query:  ELRNKARSIQQGESDVTQYYSSLRRLWAKLDLCLNLEWENSKDVERFRKHVEKKQIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVM
        +L++K    +QG+ +VT YY+ +  LW +LDLC   EW+   D  R +K  E  ++Y FLA L   LD+VRGR+L  KP+P+I E+F+EVR E +R++VM
Subjt:  ELRNKARSIQQGESDVTQYYSSLRRLWAKLDLCLNLEWENSKDVERFRKHVEKKQIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVM

Query:  MGDTPTKPLSLSLESSALAAR-----GDYRPPP---------------------PTNTPPSRTSSS-SYQVGPSVSNSQDLATSLPPFSKAQLEQLYGLL
        + D P    +  +ESSAL ++     GD R  P                     P N      S   ++Q   + S    + +  P F+K QL  LY L 
Subjt:  MGDTPTKPLSLSLESSALAAR-----GDYRPPP---------------------PTNTPPSRTSSS-SYQVGPSVSNSQDLATSLPPFSKAQLEQLYGLL

Query:  TPPVESTPSSSFMAQRGIFNAALTSQQHSDQ--WILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHVSKLCCNL
          P  S PS S   Q     AAL+S + +    WI+DSGATDHMT    +F+ Y P      +K+ DGS + I G GSV +SP++TLH+VLHV  L CNL
Subjt:  TPPVESTPSSSFMAQRGIFNAALTSQQHSDQ--WILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHVSKLCCNL

Query:  ISVQKLTHDLKCQALFTDSKCLFQDLITGTTISNADGFEGLYYFRGPSLRNKQV
        +S+ K+T D +CQA F  S C FQ+L +G TI NA    GLY+F   S   K +
Subjt:  ISVQKLTHDLKCQALFTDSKCLFQDLITGTTISNADGFEGLYYFRGPSLRNKQV

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.2e-0824.22Show/hide
Query:  ELRNKARSIQQGESDVTQYYSSLRRLWAKLDLCLNLEWENSKDVERFRKHV--EKKQIYDFLAG--LRPELDDVRGRLL--ATKPIPAIDEIFAEVRWES
        +LR + +   +G   +  Y   L   + +L L L    ++ + VER  +++  E K + D +A     P L ++  RLL   +K +         +   +
Subjt:  ELRNKARSIQQGESDVTQYYSSLRRLWAKLDLCLNLEWENSKDVERFRKHV--EKKQIYDFLAG--LRPELDDVRGRLL--ATKPIPAIDEIFAEVRWES

Query:  SRKRVMMGDTPTKPLSLSLESSALAARGDYRPPPPTNTPPSRTSSSSYQVGPSVSNSQDLATSLPPFSKAQLEQLYGLLTPPVESTPSSSFMAQRGIFNA
           R     T T   +    ++    R +     P     +    ++ Q  P +   Q     +   S  +  QL   L+      P S F   +   N 
Subjt:  SRKRVMMGDTPTKPLSLSLESSALAARGDYRPPPPTNTPPSRTSSSSYQVGPSVSNSQDLATSLPPFSKAQLEQLYGLLTPPVESTPSSSFMAQRGIFNA

Query:  ALTSQQHSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPN---ITLHSVLHVSKLCCNLISVQKLTHDLKCQALFTDSK
        AL S   S+ W+LDSGAT H+T+  +  +++ P      V +ADGS+  I   GS  LS     + LH++L+V  +  NLISV +L +       F  + 
Subjt:  ALTSQQHSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPN---ITLHSVLHVSKLCCNLISVQKLTHDLKCQALFTDSK

Query:  CLFQDLITGTTISNADGFEGLY
           +DL TG  +      + LY
Subjt:  CLFQDLITGTTISNADGFEGLY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE25.0e-0922.77Show/hide
Query:  KLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPSFSVKGSLKCAHYGFSDFNNSAQLFE---LRNKARSIQQGESDVT---QYYSSLRRLWAK-
        KL   N+L WSR    +  G    G+++G+   P    P+     ++   +  ++ +    +L     L   + S+Q   S  T   Q + +LR+++A  
Subjt:  KLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPSFSVKGSLKCAHYGFSDFNNSAQLFE---LRNKARSIQQGESDVT---QYYSSLRRLWAK-

Query:  -----LDLCLNLEWENSKDVERFRKHVEKKQIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTPTKPLSLSLESSALA-----
               L     ++    + +   H E  Q+   L  L  +   V  ++ A    P++ EI   +    S K + +      P++ ++ +         
Subjt:  -----LDLCLNLEWENSKDVERFRKHVEKKQIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTPTKPLSLSLESSALA-----

Query:  --ARGDYRPPPPTNT-----PPSRTSSSSYQVGPSVSNSQDLATSLPPFSKAQLEQLYGLLTPPVESTPSSSFMAQRGIFNAALTSQQHSDQWILDSGAT
           RGD R     N       PS + S S    P     +    S+   S  +  QL+   +   +   +S F   +   N A+ S  +++ W+LDSGAT
Subjt:  --ARGDYRPPPPTNT-----PPSRTSSSSYQVGPSVSNSQDLATSLPPFSKAQLEQLYGLLTPPVESTPSSSFMAQRGIFNAALTSQQHSDQWILDSGAT

Query:  DHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVIL---SPNITLHSVLHVSKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTISNADGF
         H+T+  +  + + P      V +ADGS+  I   GS  L   S ++ L+ VL+V  +  NLISV +L +  +    F  +    +DL TG  +      
Subjt:  DHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVIL---SPNITLHSVLHVSKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTISNADGF

Query:  EGLY
        + LY
Subjt:  EGLY

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAGTTCCGACGGATCGAATACGACCTCAATTTCTCTCCCTAAAAGCACATCCACGACACTTCACATCTCTATCTCTGAAAACACTTCGCTCTTCGCTCCAAATTAC
CTCTCCAAAACTCAACGGAAAAAATTTTCTGCAATGGTCTCGATCGGCCCTCCTAGTTATTCGTGGCCGCAGTCGACTCGGGTACATCAATGGCACGATAGCAGAACCAG
ATGAAGCTGACCCTTCTTTTTCCGTCAAAGGATCTTTGAAATGCGCTCACTATGGCTTTTCTGATTTTAATAATTCGGCTCAATTGTTTGAATTACGCAATAAGGCACGT
TCCATACAACAAGGTGAATCCGATGTCACCCAATACTACAGCTCATTACGTAGGTTGTGGGCGAAACTTGATTTATGTCTCAATCTCGAATGGGAGAACTCGAAGGATGT
TGAACGCTTCCGCAAACACGTCGAGAAGAAACAAATTTATGATTTTCTTGCAGGTCTTCGTCCGGAATTAGATGATGTGCGTGGCCGCTTACTTGCCACAAAGCCAATCC
CAGCCATTGACGAAATCTTCGCAGAAGTTCGCTGGGAGTCAAGCCGTAAACGTGTGATGATGGGTGATACACCCACAAAACCTCTGTCCCTCTCACTGGAATCATCAGCT
CTGGCGGCACGAGGGGATTATCGGCCACCTCCACCTACAAATACTCCACCTTCTCGAACCAGCTCCTCCAGCTATCAAGTGGGCCCTAGTGTGTCAAACTCCCAAGATTT
GGCCACCTCTCTCCCTCCATTTTCGAAAGCACAACTTGAACAGCTCTATGGCCTCTTAACACCGCCGGTTGAGTCTACTCCTTCATCAAGTTTTATGGCACAACGAGGTA
TTTTTAATGCAGCTTTAACAAGTCAGCAGCATTCCGATCAGTGGATCTTAGATTCGGGTGCAACTGATCATATGACCGCTTTTCATGATATGTTTACCATGTACTCACCC
AACCCGATTCAGACACATGTCAAGCTTGCAGATGGGTCATCGGCCATTATTAAGGGCTTTGGTTCTGTTATTCTTAGCCCAAACATTACATTGCACTCAGTGCTCCATGT
GTCTAAATTATGTTGCAATCTAATTTCTGTTCAGAAGTTGACTCATGATTTAAAGTGTCAAGCCCTATTCACTGACTCTAAGTGTTTGTTTCAGGACTTGATAACGGGAA
CGACGATTAGCAATGCTGATGGCTTTGAAGGGCTCTACTACTTCAGAGGACCAAGTCTAAGAAATAAACAAGTTCTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGAGTTCCGACGGATCGAATACGACCTCAATTTCTCTCCCTAAAAGCACATCCACGACACTTCACATCTCTATCTCTGAAAACACTTCGCTCTTCGCTCCAAATTAC
CTCTCCAAAACTCAACGGAAAAAATTTTCTGCAATGGTCTCGATCGGCCCTCCTAGTTATTCGTGGCCGCAGTCGACTCGGGTACATCAATGGCACGATAGCAGAACCAG
ATGAAGCTGACCCTTCTTTTTCCGTCAAAGGATCTTTGAAATGCGCTCACTATGGCTTTTCTGATTTTAATAATTCGGCTCAATTGTTTGAATTACGCAATAAGGCACGT
TCCATACAACAAGGTGAATCCGATGTCACCCAATACTACAGCTCATTACGTAGGTTGTGGGCGAAACTTGATTTATGTCTCAATCTCGAATGGGAGAACTCGAAGGATGT
TGAACGCTTCCGCAAACACGTCGAGAAGAAACAAATTTATGATTTTCTTGCAGGTCTTCGTCCGGAATTAGATGATGTGCGTGGCCGCTTACTTGCCACAAAGCCAATCC
CAGCCATTGACGAAATCTTCGCAGAAGTTCGCTGGGAGTCAAGCCGTAAACGTGTGATGATGGGTGATACACCCACAAAACCTCTGTCCCTCTCACTGGAATCATCAGCT
CTGGCGGCACGAGGGGATTATCGGCCACCTCCACCTACAAATACTCCACCTTCTCGAACCAGCTCCTCCAGCTATCAAGTGGGCCCTAGTGTGTCAAACTCCCAAGATTT
GGCCACCTCTCTCCCTCCATTTTCGAAAGCACAACTTGAACAGCTCTATGGCCTCTTAACACCGCCGGTTGAGTCTACTCCTTCATCAAGTTTTATGGCACAACGAGGTA
TTTTTAATGCAGCTTTAACAAGTCAGCAGCATTCCGATCAGTGGATCTTAGATTCGGGTGCAACTGATCATATGACCGCTTTTCATGATATGTTTACCATGTACTCACCC
AACCCGATTCAGACACATGTCAAGCTTGCAGATGGGTCATCGGCCATTATTAAGGGCTTTGGTTCTGTTATTCTTAGCCCAAACATTACATTGCACTCAGTGCTCCATGT
GTCTAAATTATGTTGCAATCTAATTTCTGTTCAGAAGTTGACTCATGATTTAAAGTGTCAAGCCCTATTCACTGACTCTAAGTGTTTGTTTCAGGACTTGATAACGGGAA
CGACGATTAGCAATGCTGATGGCTTTGAAGGGCTCTACTACTTCAGAGGACCAAGTCTAAGAAATAAACAAGTTCTCTAG
Protein sequenceShow/hide protein sequence
MRVPTDRIRPQFLSLKAHPRHFTSLSLKTLRSSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPSFSVKGSLKCAHYGFSDFNNSAQLFELRNKAR
SIQQGESDVTQYYSSLRRLWAKLDLCLNLEWENSKDVERFRKHVEKKQIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTPTKPLSLSLESSA
LAARGDYRPPPPTNTPPSRTSSSSYQVGPSVSNSQDLATSLPPFSKAQLEQLYGLLTPPVESTPSSSFMAQRGIFNAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSP
NPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHVSKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTISNADGFEGLYYFRGPSLRNKQVL