; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g30860 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g30860
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr6:23247281..23249681
RNA-Seq ExpressionMoc06g30860
SyntenyMoc06g30860
Gene Ontology termsGO:0034641 - cellular nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN76196.1 hypothetical protein VITISV_041073 [Vitis vinifera]5.0e-7542.56Show/hide
Query:  VFSDFDNSAQLFELRNKAHSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGIRPELDDVRGRLLATKPIPAIDEIFA
        ++SD +NS+Q+F+L++K    RQG+ +VT YY+ +  LW ELDLC   EW+   D+ R +K  E +R+Y FLA +   LD+VRGR+L  KP+P+I E+F+
Subjt:  VFSDFDNSAQLFELRNKAHSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGIRPELDDVRGRLLATKPIPAIDEIFA

Query:  EVRWESSRKRVMMGDTHTKPLS-LSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDQCWEFHGRPQRKNLSGDYRPPPPTNTPSSRTSSSGYQV
        EVR E +R++VM+  T  +P+S   +ESSAL ++G         RR   WCDHCK+  HTK  CW+ HG+PQ      +++    ++  + +T S+  Q 
Subjt:  EVRWESSRKRVMMGDTHTKPLS-LSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDQCWEFHGRPQRKNLSGDYRPPPPTNTPSSRTSSSGYQV

Query:  GPSMSNSQDLATSLPPFSKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQ--WILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSA
        GP +++ +      P F+K QL  LY+L   P  S PS S   Q     AAL+S + +    WI+DSGATDHMT    +F+ Y P      +K+ DGS +
Subjt:  GPSMSNSQDLATSLPPFSKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQ--WILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSA

Query:  IIKGFGSVILSPNITLHSVLHVPKLYCNLIFVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYFRGPSLRNKQV
         I G GSV +SP++TLH+VLHVP L CNL+ + K+T D +CQA F  S C FQ+L +G TIGNA    GLY+F   S   K +
Subjt:  IIKGFGSVILSPNITLHSVLHVPKLYCNLIFVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYFRGPSLRNKQV

KAE8658400.1 60S ribosomal protein L38 [Hibiscus syriacus]2.2e-7541.69Show/hide
Query:  FSDFDNSAQLFELRNKAHSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGIRPELDDVRGRLLATKPIPAIDEIFAE
        +SD  N+ Q FEL+ +   L+QGE  VTQYY+ L+ LW E+D+  + EW  +KD   F+K VEKER+++FL G+  ELD+VRGR+L  +P+P+  E+F+E
Subjt:  FSDFDNSAQLFELRNKAHSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGIRPELDDVRGRLLATKPIPAIDEIFAE

Query:  VRWESSRKRVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDQCWEFHGRPQRKNLSGDYRPPPPTNTPSSRTSSSGYQVGP
        VR E SR+ VM+G     P     ESSAL +  PP  + R   R+   C+HC +  HTK +CW+ HG+P  +N S +          +SR +   +    
Subjt:  VRWESSRKRVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDQCWEFHGRPQRKNLSGDYRPPPPTNTPSSRTSSSGYQVGP

Query:  SMSNSQDLATSLPPFSKAQLEQLYRLL-------TPPVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADG
          S     A+ L  FSK QLEQLY+L+       TP   S  +SS +AQ+G +  A  +   S+ WI+DSGATDHMT    +F+ Y P      VK+ADG
Subjt:  SMSNSQDLATSLPPFSKAQLEQLYRLL-------TPPVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADG

Query:  SSAIIKGFGSVILSPNITLHSVLHVPKLYCNLIFVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYFRGPSLRNKQVLQMIKSTQIEVTSN
        S   I G GS+I+SP++TL +VLHVPKL CNLI V ++ HD KC A  T +   FQD  +G  IGNA   +GLY+    +  NKQV ++   T +   + 
Subjt:  SSAIIKGFGSVILSPNITLHSVLHVPKLYCNLIFVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYFRGPSLRNKQVLQMIKSTQIEVTSN

Query:  LRL
        + L
Subjt:  LRL

RVW98618.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]8.2e-7842.75Show/hide
Query:  VFSDFDNSAQLFELRNKAHSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGIRPELDDVRGRLLATKPIPAIDEIFA
        ++SD +NS+Q+F+L++K    RQG+ +VT YY+ +  LW ELDLC   EW+   D+ R +K  E +R+Y FLAG+   LD+VRGR+L  KP+P+I E+F+
Subjt:  VFSDFDNSAQLFELRNKAHSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGIRPELDDVRGRLLATKPIPAIDEIFA

Query:  EVRWESSRKRVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDQCWEFHGRPQRKNLSGDYRPPPPTNTPSSRTSSSGYQVG
        EVR E +R++VM+ D   K  +  +ESSAL ++G     S   RR   WCDHCK+  HTK+ CW+ HG+PQ      +++    ++  + +T S+  Q G
Subjt:  EVRWESSRKRVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDQCWEFHGRPQRKNLSGDYRPPPPTNTPSSRTSSSGYQVG

Query:  PSMSNSQDLATSLPPFSKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQ--WILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAI
        P +++ +      P F+K QL  LY+L   P  S PS S   Q     AAL+S + +    WI+DSGATDHMT    +F+ Y P      +K+ADGS + 
Subjt:  PSMSNSQDLATSLPPFSKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQ--WILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAI

Query:  IKGFGSVILSPNITLHSVLHVPKLYCNLIFVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYFRGPSLRNKQVLQMI
        I G GSV +SP++TLH+VLHVP L CNL+ + K+T D +CQA F  S C FQ+L +G TIGNA    GLY+F   S   K + +++
Subjt:  IKGFGSVILSPNITLHSVLHVPKLYCNLIFVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYFRGPSLRNKQVLQMI

XP_022154801.1 uncharacterized protein LOC111021967 [Momordica charantia]7.4e-7999.34Show/hide
Query:  MVFSDFDNSAQLFELRNKAHSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGIRPELDDVRGRLLATKPIPAIDEIF
        MVFSDFDNSAQLFELRNKAHSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGIRPELDDVRGRLLATKPIPAIDEIF
Subjt:  MVFSDFDNSAQLFELRNKAHSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGIRPELDDVRGRLLATKPIPAIDEIF

Query:  AEVRWESSRKRVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCD
        AEVRWESSRKRVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLW D
Subjt:  AEVRWESSRKRVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCD

XP_022159153.1 uncharacterized protein LOC111025577 [Momordica charantia]3.6e-15076.24Show/hide
Query:  MVFSDFDNSAQLFELRNKAHSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGIRPELDDVRGRLLATKPIPAIDEIF
        M FSDFDNSAQLFEL NKA SLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAG+RPELDDVRGRLLATKPIPAIDEIF
Subjt:  MVFSDFDNSAQLFELRNKAHSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGIRPELDDVRGRLLATKPIPAIDEIF

Query:  AEVRWESSRKRVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDQCWEFHGRPQRKNLSGDYRPPPPTNTPSSRTSSSGYQV
        AEV WESSRKRVMMGDTHTKPLSLSLESSALAARGPPP                                                              
Subjt:  AEVRWESSRKRVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDQCWEFHGRPQRKNLSGDYRPPPPTNTPSSRTSSSGYQV

Query:  GPSMSNSQDLATSLPPFSKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAII
                    S  P   AQLEQLYRLLT PVESTPSSSFVAQRGI SAALT QQHSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAII
Subjt:  GPSMSNSQDLATSLPPFSKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAII

Query:  KGFGSVILSPNITLHSVLHVPKLYCNLIFVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYFRGPSLRNKQVLQ
        KGFGSVILSPNITLHSVL VPKL CNLI VQKLTHDLKCQALFTDSKCLFQDLITGTTIG+ADGFEGLYYFRGPSLRNKQVLQ
Subjt:  KGFGSVILSPNITLHSVLHVPKLYCNLIFVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYFRGPSLRNKQVLQ

TrEMBL top hitse value%identityAlignment
A0A438IPH4 Retrovirus-related Pol polyprotein from transposon RE14.0e-7842.75Show/hide
Query:  VFSDFDNSAQLFELRNKAHSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGIRPELDDVRGRLLATKPIPAIDEIFA
        ++SD +NS+Q+F+L++K    RQG+ +VT YY+ +  LW ELDLC   EW+   D+ R +K  E +R+Y FLAG+   LD+VRGR+L  KP+P+I E+F+
Subjt:  VFSDFDNSAQLFELRNKAHSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGIRPELDDVRGRLLATKPIPAIDEIFA

Query:  EVRWESSRKRVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDQCWEFHGRPQRKNLSGDYRPPPPTNTPSSRTSSSGYQVG
        EVR E +R++VM+ D   K  +  +ESSAL ++G     S   RR   WCDHCK+  HTK+ CW+ HG+PQ      +++    ++  + +T S+  Q G
Subjt:  EVRWESSRKRVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDQCWEFHGRPQRKNLSGDYRPPPPTNTPSSRTSSSGYQVG

Query:  PSMSNSQDLATSLPPFSKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQ--WILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAI
        P +++ +      P F+K QL  LY+L   P  S PS S   Q     AAL+S + +    WI+DSGATDHMT    +F+ Y P      +K+ADGS + 
Subjt:  PSMSNSQDLATSLPPFSKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQ--WILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAI

Query:  IKGFGSVILSPNITLHSVLHVPKLYCNLIFVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYFRGPSLRNKQVLQMI
        I G GSV +SP++TLH+VLHVP L CNL+ + K+T D +CQA F  S C FQ+L +G TIGNA    GLY+F   S   K + +++
Subjt:  IKGFGSVILSPNITLHSVLHVPKLYCNLIFVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYFRGPSLRNKQVLQMI

A0A6A2WU09 60S ribosomal protein L381.1e-7541.69Show/hide
Query:  FSDFDNSAQLFELRNKAHSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGIRPELDDVRGRLLATKPIPAIDEIFAE
        +SD  N+ Q FEL+ +   L+QGE  VTQYY+ L+ LW E+D+  + EW  +KD   F+K VEKER+++FL G+  ELD+VRGR+L  +P+P+  E+F+E
Subjt:  FSDFDNSAQLFELRNKAHSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGIRPELDDVRGRLLATKPIPAIDEIFAE

Query:  VRWESSRKRVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDQCWEFHGRPQRKNLSGDYRPPPPTNTPSSRTSSSGYQVGP
        VR E SR+ VM+G     P     ESSAL +  PP  + R   R+   C+HC +  HTK +CW+ HG+P  +N S +          +SR +   +    
Subjt:  VRWESSRKRVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDQCWEFHGRPQRKNLSGDYRPPPPTNTPSSRTSSSGYQVGP

Query:  SMSNSQDLATSLPPFSKAQLEQLYRLL-------TPPVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADG
          S     A+ L  FSK QLEQLY+L+       TP   S  +SS +AQ+G +  A  +   S+ WI+DSGATDHMT    +F+ Y P      VK+ADG
Subjt:  SMSNSQDLATSLPPFSKAQLEQLYRLL-------TPPVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADG

Query:  SSAIIKGFGSVILSPNITLHSVLHVPKLYCNLIFVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYFRGPSLRNKQVLQMIKSTQIEVTSN
        S   I G GS+I+SP++TL +VLHVPKL CNLI V ++ HD KC A  T +   FQD  +G  IGNA   +GLY+    +  NKQV ++   T +   + 
Subjt:  SSAIIKGFGSVILSPNITLHSVLHVPKLYCNLIFVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYFRGPSLRNKQVLQMIKSTQIEVTSN

Query:  LRL
        + L
Subjt:  LRL

A0A6J1DPT5 uncharacterized protein LOC1110219673.6e-7999.34Show/hide
Query:  MVFSDFDNSAQLFELRNKAHSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGIRPELDDVRGRLLATKPIPAIDEIF
        MVFSDFDNSAQLFELRNKAHSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGIRPELDDVRGRLLATKPIPAIDEIF
Subjt:  MVFSDFDNSAQLFELRNKAHSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGIRPELDDVRGRLLATKPIPAIDEIF

Query:  AEVRWESSRKRVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCD
        AEVRWESSRKRVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLW D
Subjt:  AEVRWESSRKRVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCD

A0A6J1DY12 uncharacterized protein LOC1110255771.8e-15076.24Show/hide
Query:  MVFSDFDNSAQLFELRNKAHSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGIRPELDDVRGRLLATKPIPAIDEIF
        M FSDFDNSAQLFEL NKA SLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAG+RPELDDVRGRLLATKPIPAIDEIF
Subjt:  MVFSDFDNSAQLFELRNKAHSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGIRPELDDVRGRLLATKPIPAIDEIF

Query:  AEVRWESSRKRVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDQCWEFHGRPQRKNLSGDYRPPPPTNTPSSRTSSSGYQV
        AEV WESSRKRVMMGDTHTKPLSLSLESSALAARGPPP                                                              
Subjt:  AEVRWESSRKRVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDQCWEFHGRPQRKNLSGDYRPPPPTNTPSSRTSSSGYQV

Query:  GPSMSNSQDLATSLPPFSKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAII
                    S  P   AQLEQLYRLLT PVESTPSSSFVAQRGI SAALT QQHSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAII
Subjt:  GPSMSNSQDLATSLPPFSKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAII

Query:  KGFGSVILSPNITLHSVLHVPKLYCNLIFVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYFRGPSLRNKQVLQ
        KGFGSVILSPNITLHSVL VPKL CNLI VQKLTHDLKCQALFTDSKCLFQDLITGTTIG+ADGFEGLYYFRGPSLRNKQVLQ
Subjt:  KGFGSVILSPNITLHSVLHVPKLYCNLIFVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYFRGPSLRNKQVLQ

A5AYJ3 Integrase catalytic domain-containing protein2.4e-7542.56Show/hide
Query:  VFSDFDNSAQLFELRNKAHSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGIRPELDDVRGRLLATKPIPAIDEIFA
        ++SD +NS+Q+F+L++K    RQG+ +VT YY+ +  LW ELDLC   EW+   D+ R +K  E +R+Y FLA +   LD+VRGR+L  KP+P+I E+F+
Subjt:  VFSDFDNSAQLFELRNKAHSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGIRPELDDVRGRLLATKPIPAIDEIFA

Query:  EVRWESSRKRVMMGDTHTKPLS-LSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDQCWEFHGRPQRKNLSGDYRPPPPTNTPSSRTSSSGYQV
        EVR E +R++VM+  T  +P+S   +ESSAL ++G         RR   WCDHCK+  HTK  CW+ HG+PQ      +++    ++  + +T S+  Q 
Subjt:  EVRWESSRKRVMMGDTHTKPLS-LSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDQCWEFHGRPQRKNLSGDYRPPPPTNTPSSRTSSSGYQV

Query:  GPSMSNSQDLATSLPPFSKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQ--WILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSA
        GP +++ +      P F+K QL  LY+L   P  S PS S   Q     AAL+S + +    WI+DSGATDHMT    +F+ Y P      +K+ DGS +
Subjt:  GPSMSNSQDLATSLPPFSKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQ--WILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSA

Query:  IIKGFGSVILSPNITLHSVLHVPKLYCNLIFVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYFRGPSLRNKQV
         I G GSV +SP++TLH+VLHVP L CNL+ + K+T D +CQA F  S C FQ+L +G TIGNA    GLY+F   S   K +
Subjt:  IIKGFGSVILSPNITLHSVLHVPKLYCNLIFVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYFRGPSLRNKQV

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.5e-1030.97Show/hide
Query:  SKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPN---ITL
        S  +  QL   L+      P S F   +   + AL S   S+ W+LDSGAT H+T+  +  +++ P      V +ADGS+  I   GS  LS     + L
Subjt:  SKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPN---ITL

Query:  HSVLHVPKLYCNLIFVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLY
        H++L+VP ++ NLI V +L +       F  +    +DL TG  +      + LY
Subjt:  HSVLHVPKLYCNLIFVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.1e-0725.69Show/hide
Query:  KRTNHTKDQCWEFHGRPQRKNLSGDYRPPPPTNTPSSRTSSSGYQVGPSMSNSQDLATSLPPFSKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTS
        + TN  ++Q    + R   +N + +          SS + S   Q  P +   Q    S+   S  +  QL++  +   +   +S F   +   + A+ S
Subjt:  KRTNHTKDQCWEFHGRPQRKNLSGDYRPPPPTNTPSSRTSSSGYQVGPSMSNSQDLATSLPPFSKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTS

Query:  QQHSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVIL---SPNITLHSVLHVPKLYCNLIFVQKLTHDLKCQALFTDSKCLFQ
          +++ W+LDSGAT H+T+  +  + + P      V +ADGS+  I   GS  L   S ++ L+ VL+VP ++ NLI V +L +  +    F  +    +
Subjt:  QQHSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVIL---SPNITLHSVLHVPKLYCNLIFVQKLTHDLKCQALFTDSKCLFQ

Query:  DLITGTTIGNADGFEGLY
        DL TG  +      + LY
Subjt:  DLITGTTIGNADGFEGLY

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).4.7e-0731.73Show/hide
Query:  QLFELRNKAHSLRQGESDVTQYYSSLRRLWAELD--------LCLNLEWENSKDAERFRKHVEKERIYDFLAGIR--PELDDVRGRLLATKPIPAIDEIF
        ++++LR +  +LRQG   V +Y+  L ++W EL          C     E +K AE  R   EKE+ Y+FL G++     + V  +++  KP P++ E F
Subjt:  QLFELRNKAHSLRQGESDVTQYYSSLRRLWAELD--------LCLNLEWENSKDAERFRKHVEKERIYDFLAGIR--PELDDVRGRLLATKPIPAIDEIF

Query:  AEVR
        A V+
Subjt:  AEVR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTTTTTCTGATTTTGATAACTCGGCTCAATTGTTTGAATTACGCAATAAGGCACATTCCTTACGACAAGGTGAATCCGATGTCACCCAATACTACAGCTCATTACG
TAGGTTGTGGGCTGAACTTGATTTATGTCTCAATCTCGAATGGGAGAACTCGAAGGATGCTGAACGCTTCCGCAAACATGTCGAGAAGGAACGAATTTATGATTTTCTTG
CAGGTATTCGTCCGGAATTAGATGATGTGCGTGGCCGCTTACTTGCCACAAAGCCAATCCCAGCCATTGACGAAATCTTCGCAGAAGTTCGCTGGGAGTCAAGCCGTAAA
CGTGTGATGATGGGTGATACACATACAAAACCTTTGTCCCTCTCACTGGAATCATCAGCTCTGGCGGCACGAGGTCCACCACCACCTTCATCCCGATCTACTCGTCGGAA
CAACCTATGGTGTGATCATTGTAAGCGCACAAACCATACAAAAGATCAGTGTTGGGAATTCCATGGTCGTCCTCAACGGAAGAATTTATCAGGGGATTATCGGCCACCTC
CACCTACAAATACTCCATCTTCTCGAACTAGCTCCTCCGGCTATCAAGTGGGCCCTAGTATGTCAAACTCCCAAGATTTGGCCACCTCTCTCCCTCCATTTTCGAAGGCA
CAACTTGAACAGCTCTATCGCCTCTTAACACCGCCGGTTGAGTCTACTCCTTCGTCAAGTTTTGTGGCACAACGAGGTATTTTTAGTGCAGCTTTAACAAGTCAGCAGCA
TTCCGATCAGTGGATCTTAGATTCAGGTGCAACTGATCATATGACCGCTTTTCATGATATGTTTACCATGTACTCACCCAACCCGATTCAGACACATGTCAAGCTTGCAG
ATGGGTCATCGGCCATTATTAAGGGCTTTGGTTCTGTTATTCTTAGCCCAAACATTACATTGCACTCAGTGCTCCATGTGCCTAAATTATATTGCAATCTAATTTTTGTT
CAGAAGTTGACTCATGATTTAAAGTGTCAAGCCCTGTTCACTGACTCTAAGTGTTTGTTTCAGGACTTGATAACGGGAACGACGATTGGCAATGCTGATGGCTTTGAAGG
GCTCTACTACTTCAGAGGACCAAGTCTAAGAAATAAACAAGTTCTTCAGATGATCAAGTCAACCCAAATCGAAGTGACAAGCAACCTGAGACTCTTGTTTGTTCTCGGTG
ACAAACGGTTCAAAGAGGAGTGGAGCCACCACAGCCTCAACAGCAAAGTCATGAATACATCTCGTCCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTTTTTTCTGATTTTGATAACTCGGCTCAATTGTTTGAATTACGCAATAAGGCACATTCCTTACGACAAGGTGAATCCGATGTCACCCAATACTACAGCTCATTACG
TAGGTTGTGGGCTGAACTTGATTTATGTCTCAATCTCGAATGGGAGAACTCGAAGGATGCTGAACGCTTCCGCAAACATGTCGAGAAGGAACGAATTTATGATTTTCTTG
CAGGTATTCGTCCGGAATTAGATGATGTGCGTGGCCGCTTACTTGCCACAAAGCCAATCCCAGCCATTGACGAAATCTTCGCAGAAGTTCGCTGGGAGTCAAGCCGTAAA
CGTGTGATGATGGGTGATACACATACAAAACCTTTGTCCCTCTCACTGGAATCATCAGCTCTGGCGGCACGAGGTCCACCACCACCTTCATCCCGATCTACTCGTCGGAA
CAACCTATGGTGTGATCATTGTAAGCGCACAAACCATACAAAAGATCAGTGTTGGGAATTCCATGGTCGTCCTCAACGGAAGAATTTATCAGGGGATTATCGGCCACCTC
CACCTACAAATACTCCATCTTCTCGAACTAGCTCCTCCGGCTATCAAGTGGGCCCTAGTATGTCAAACTCCCAAGATTTGGCCACCTCTCTCCCTCCATTTTCGAAGGCA
CAACTTGAACAGCTCTATCGCCTCTTAACACCGCCGGTTGAGTCTACTCCTTCGTCAAGTTTTGTGGCACAACGAGGTATTTTTAGTGCAGCTTTAACAAGTCAGCAGCA
TTCCGATCAGTGGATCTTAGATTCAGGTGCAACTGATCATATGACCGCTTTTCATGATATGTTTACCATGTACTCACCCAACCCGATTCAGACACATGTCAAGCTTGCAG
ATGGGTCATCGGCCATTATTAAGGGCTTTGGTTCTGTTATTCTTAGCCCAAACATTACATTGCACTCAGTGCTCCATGTGCCTAAATTATATTGCAATCTAATTTTTGTT
CAGAAGTTGACTCATGATTTAAAGTGTCAAGCCCTGTTCACTGACTCTAAGTGTTTGTTTCAGGACTTGATAACGGGAACGACGATTGGCAATGCTGATGGCTTTGAAGG
GCTCTACTACTTCAGAGGACCAAGTCTAAGAAATAAACAAGTTCTTCAGATGATCAAGTCAACCCAAATCGAAGTGACAAGCAACCTGAGACTCTTGTTTGTTCTCGGTG
ACAAACGGTTCAAAGAGGAGTGGAGCCACCACAGCCTCAACAGCAAAGTCATGAATACATCTCGTCCTTAG
Protein sequenceShow/hide protein sequence
MVFSDFDNSAQLFELRNKAHSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGIRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRK
RVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDQCWEFHGRPQRKNLSGDYRPPPPTNTPSSRTSSSGYQVGPSMSNSQDLATSLPPFSKA
QLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHVPKLYCNLIFV
QKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYFRGPSLRNKQVLQMIKSTQIEVTSNLRLLFVLGDKRFKEEWSHHSLNSKVMNTSRP