CuGenDBv2

Moc06g26320 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc06g26320
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: chr6:19863463..19864341
RNA-Seq Expression: Moc06g26320
Synteny: Moc06g26320
Gene Ontology terms: NA
InterPro domains: IPR025724 - GAG-pre-integrase domain


Homology
GenBank top hits (e value, % identity, alignment)
RVW13866.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] (e value: 1.7e-10; % identity: 30.4)
Query:  MNYALQGHHPPSQLAAMAAVSNTASGASSFWIVDNGCKSHVT----HLNLTNNYNGEEVVAVAIDGFYPVQASANGSSVMTNNFGILN--KSSMPMSAFI
        MNYA QG HPP+QLAAM A SN A      W  D+G   H+T    HL L   Y G+E VAV          +  G S+      I +  ++ + +   +
Subjt:  MNYALQGHHPPSQLAAMAAVSNTASGASSFWIVDNGCKSHVT----HLNLTNNYNGEEVVAVAIDGFYPVQASANGSSVMTNNFGILN--KSSMPMSAFI

Query:  CTDDANIFAVGNISDAKNIGQFSIDENIF-----ATEIISEVQN---------------ISLNSVATSASFATTANISAPKESFDIWHFRLGHPSPVVLH
            A+       ++  +I QF +D N           + ++Q                I L S++ + S A +A +   K S  IWH RLGH S  ++ 
Subjt:  CTDDANIFAVGNISDAKNIGQFSIDENIF-----ATEIISEVQN---------------ISLNSVATSASFATTANISAPKESFDIWHFRLGHPSPVVLH

Query:  KLLTRHSI-VAGSTPTTRECIGCLKGK
        +LL +HS+ V GS      C  C  GK
Subjt:  KLLTRHSI-VAGSTPTTRECIGCLKGK

RVW37054.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] (e value: 1.4e-09; % identity: 31.94)
Query:  MNYALQGHHPPSQLAAMAAVSNTASGASSFWIVDNGCKSHVT----HLNLTNNYNGEEVVAV--------AIDG---FYPVQASANGSSVMTNNFGILNK
        M+YA QG HP +QLAAM A SN A      W  DNG   H+T    HL L   Y G+E VAV        A  G   F+  +A  N   V+  +    + 
Subjt:  MNYALQGHHPPSQLAAMAAVSNTASGASSFWIVDNGCKSHVT----HLNLTNNYNGEEVVAV--------AIDG---FYPVQASANGSSVMTNNFGILNK

Query:  SSMPMSAFICTDDANIFAVGNISDAKNIGQFSIDENIFATEIISEVQNISLNSVATSASFATTANISAPKESFDIWHFRLGHPSPVVLHKLLTRHSI-VA
        +S+ ++ F   ++      G     K+I        +        +  I L S++ + S A +A I   K S  +WH RLGH S  ++ +LL +HS+ V 
Subjt:  SSMPMSAFICTDDANIFAVGNISDAKNIGQFSIDENIFATEIISEVQNISLNSVATSASFATTANISAPKESFDIWHFRLGHPSPVVLHKLLTRHSI-VA

Query:  GSTPTTRECIGCLKGK
        GS      C  C  GK
Subjt:  GSTPTTRECIGCLKGK

RVW58434.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] (e value: 1.4e-09; % identity: 30.73)
Query:  MNYALQGHHPPSQLAAMAAVSNTASGASSFWIVDNGCKSHVT----HLNLTNNYNGEEVVAVAIDGFYPVQASANGSSVMTNNFGILNKSSMPMSAFICT
        M+YA QG HPP+QLAAM A SN A      W  D+G   H+T    HL L   Y G+E VAV            NG  +   + G         +    T
Subjt:  MNYALQGHHPPSQLAAMAAVSNTASGASSFWIVDNGCKSHVT----HLNLTNNYNGEEVVAVAIDGFYPVQASANGSSVMTNNFGILNKSSMPMSAFICT

Query:  DDANIFAVGNISDAKNIGQFSIDENIFATEIISEVQNISLNSVATSASFATTANISAPKESFDIWHFRLGHPSPVVLHKLLTRHSI-VAGSTPTTRECIG
         +A +    +I     + +   +  ++          I L S++ + S A +A +   K S  +WH RLGH S  ++ +LL +HS+ V GS      C  
Subjt:  DDANIFAVGNISDAKNIGQFSIDENIFATEIISEVQNISLNSVATSASFATTANISAPKESFDIWHFRLGHPSPVVLHKLLTRHSI-VAGSTPTTRECIG

Query:  CLKGK
        C  GK
Subjt:  CLKGK

RVW70405.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] (e value: 1.1e-09; % identity: 29.07)
Query:  MNYALQGHHPPSQLAAMAAVSNTASGASSFWIVDNGCKSHVT----HLNLTNNYNGEEVVAVAIDGFYPVQASANGSSVMTNNFGILN--KSSMPMSAFI
        M+YA QG HPP+QLAAM A SN A      W  D+G   H+T    HL L   Y G+E V V          +  G S+   +  I +  ++ + +   +
Subjt:  MNYALQGHHPPSQLAAMAAVSNTASGASSFWIVDNGCKSHVT----HLNLTNNYNGEEVVAVAIDGFYPVQASANGSSVMTNNFGILN--KSSMPMSAFI

Query:  CTDDANIFAVGNISDAKNIGQFSIDENIF-----ATEIISEVQN---------------ISLNSVATSASFATTANISAPKESFDIWHFRLGHPSPVVLH
            A+       ++  +I QF +D N           + ++Q                I L S++ + S A +A +   K S  +WH RLGH S  ++ 
Subjt:  CTDDANIFAVGNISDAKNIGQFSIDENIF-----ATEIISEVQN---------------ISLNSVATSASFATTANISAPKESFDIWHFRLGHPSPVVLH

Query:  KLLTRHSI-VAGSTPTTRECIGCLKGK
        +LL +HS+ V GS      C  C  GK
Subjt:  KLLTRHSI-VAGSTPTTRECIGCLKGK

RVW73890.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] (e value: 2.9e-10; % identity: 28.88)
Query:  MNYALQGHHPPSQLAAMAAVSNTASGASSFWIVDNGCKSHVT----HLNLTNNYNGEEVVAVAIDGFYPVQASANGSSVMTNNFGILNKSSMPMSAFICT
        M+YA QG HPP+QL AM A SN A      W  D+G   H+T    HL L   Y G+E V V          +  G S+      I +     ++     
Subjt:  MNYALQGHHPPSQLAAMAAVSNTASGASSFWIVDNGCKSHVT----HLNLTNNYNGEEVVAVAIDGFYPVQASANGSSVMTNNFGILNKSSMPMSAFICT

Query:  DDANIFAVGNISDAKNIGQFSIDENIF-----ATEIISEVQN---------------ISLNSVATSASFATTANISAPKESFDIWHFRLGHPSPVVLHKL
             FA     +  +I QF +D N           + ++Q                I L S++ + S A +A +   K S  +WH RLGH S  ++ +L
Subjt:  DDANIFAVGNISDAKNIGQFSIDENIF-----ATEIISEVQN---------------ISLNSVATSASFATTANISAPKESFDIWHFRLGHPSPVVLHKL

Query:  LTRHSI-VAGSTPTTRECIGCLKGKMPKISFR
        L +HS+ V GS      C  C  GK  ++ F+
Subjt:  LTRHSI-VAGSTPTTRECIGCLKGKMPKISFR

TrEMBL top hits (e value, % identity, alignment)
A0A2N9FKJ8 Uncharacterized protein (e value: 8.7e-13; % identity: 31.05)
Query:  MNYALQGHHPPSQLAAMAAVSNTASGASSFWIVDNGCKSHVT----HLNLTNNYNGEEVVAVAIDGFYPVQASANGSSVMTNNFGILNKSSMPMSAFICT
        M++A QG HPP++LAAMA+ SN + G  + W+ D G   H+T    +L + N Y G + VAV            NG S+  NN G ++K        +CT
Subjt:  MNYALQGHHPPSQLAAMAAVSNTASGASSFWIVDNGCKSHVT----HLNLTNNYNGEEVVAVAIDGFYPVQASANGSSVMTNNFGILNKSSMPMSAFICT

Query:  D-------DANIFAVGNISDAKNIGQFSIDENIFATEIISEVQNISLNSVATSASFATTANISAPKESFDIWHFRLGHPSPVVLHKLLTRHSIVAG--ST
        D       D+N F + ++   K + +      ++    I      S+++ + +AS + +A +S+ K  + +WH RLGHPS  VL   L   S      + 
Subjt:  D-------DANIFAVGNISDAKNIGQFSIDENIFATEIISEVQNISLNSVATSASFATTANISAPKESFDIWHFRLGHPSPVVLHKLLTRHSIVAG--ST

Query:  PTTRECIGCLKGKMPKISF
             C  CL GKM K+ F
Subjt:  PTTRECIGCLKGKMPKISF

A0A2N9GCR2 Uncharacterized protein (e value: 1.3e-13; % identity: 30.66)
Query:  MNYALQGHHPPSQLAAMAAVSNTASGASSFWIVDNGCKSHVT----HLNLTNNYNGEEVVAVAIDGFYPVQASANGSSVMTNNFGILNKSSMPMSAFICT
        M++A QG HPP++LAAMA+ SN A    S W+ D G   H+T    +LN+   Y G + VAV            NG S+  NN G ++K     +   C 
Subjt:  MNYALQGHHPPSQLAAMAAVSNTASGASSFWIVDNGCKSHVT----HLNLTNNYNGEEVVAVAIDGFYPVQASANGSSVMTNNFGILNKSSMPMSAFICT

Query:  DDANIFAVGNISDAKNIGQFSIDENIFATEIISEVQNISLNSVATSASFATTANISAPKESFDIWHFRLGHPSPVVLHKLLTRHS--IVAGSTPTTRECI
         D+N F + ++   K + +   +  ++         + +  SV T+ +  + +   + K  + +WH RLGHPS  VL   L   S  I   +      C 
Subjt:  DDANIFAVGNISDAKNIGQFSIDENIFATEIISEVQNISLNSVATSASFATTANISAPKESFDIWHFRLGHPSPVVLHKLLTRHS--IVAGSTPTTRECI

Query:  GCLKGKMPKISF
         CL GKM K+ F
Subjt:  GCLKGKMPKISF

A0A2N9I1S1 Uncharacterized protein (e value: 6.3e-11; % identity: 29.52)
Query:  MNYALQGHHPPSQLAAMAAVSNTASGASSFWIVDNGCKSHVT----HLNLTNNYNGEEVVAVAIDGFYPVQASANGS-SVMTNNFGILN-------KSSM
        M++A QG HPP++LAAMA+ SN + G  + W+ D G   H+T    +L + + Y G + VAV      P+  + NG       NF + N        S++
Subjt:  MNYALQGHHPPSQLAAMAAVSNTASGASSFWIVDNGCKSHVT----HLNLTNNYNGEEVVAVAIDGFYPVQASANGS-SVMTNNFGILN-------KSSM

Query:  PMSAFICTD-------DANIFAVGNISDAKNIGQFSIDENIFATEIISEVQNISLNSVATSASFATTANISAPKESFDIWHFRLGHPSPVVLHKLLTRHS
             +CTD       D+N F + ++   K + +      ++    I      S++  + +AS + +A +S+ K  + +WH RLGHPS  VL   L   S
Subjt:  PMSAFICTD-------DANIFAVGNISDAKNIGQFSIDENIFATEIISEVQNISLNSVATSASFATTANISAPKESFDIWHFRLGHPSPVVLHKLLTRHS

Query:  --IVAGSTPTTRECIGCLKGKMPKISF
          +   +      C  CL GKM K+ F
Subjt:  --IVAGSTPTTRECIGCLKGKMPKISF

A0A2N9I8B6 CCHC-type domain-containing protein (e value: 4.3e-12; % identity: 30.14)
Query:  MNYALQGHHPPSQLAAMAAVSNTASGASSFWIVDNGCKSHVT----HLNLTNNYNGEEVVAVAIDGFYPVQASANGSSVMTNNFGILNKSSMPMSAFICT
        M++A QG HPP++LAAMA++SN + G  + W+ D G   H+T    +L +   Y G + VAV            NG S+  NN G ++K        +CT
Subjt:  MNYALQGHHPPSQLAAMAAVSNTASGASSFWIVDNGCKSHVT----HLNLTNNYNGEEVVAVAIDGFYPVQASANGSSVMTNNFGILNKSSMPMSAFICT

Query:  D-------DANIFAVGNISDAKNIGQFSIDENIFATEIISEVQNISLNSVATSASFATTANISAPKESFDIWHFRLGHPSPVVLHKLLTRHS--IVAGST
        D       D+N F + ++   K + +      ++    I    + S ++ +   S + +A +S+ K  + +WH RLGHPS  VL   L   S  +   + 
Subjt:  D-------DANIFAVGNISDAKNIGQFSIDENIFATEIISEVQNISLNSVATSASFATTANISAPKESFDIWHFRLGHPSPVVLHKLLTRHS--IVAGST

Query:  PTTRECIGCLKGKMPKISF
             C  CL GKM K+ F
Subjt:  PTTRECIGCLKGKMPKISF

A0A438BSD5 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 8.2e-11; % identity: 30.4)
Query:  MNYALQGHHPPSQLAAMAAVSNTASGASSFWIVDNGCKSHVT----HLNLTNNYNGEEVVAVAIDGFYPVQASANGSSVMTNNFGILN--KSSMPMSAFI
        MNYA QG HPP+QLAAM A SN A      W  D+G   H+T    HL L   Y G+E VAV          +  G S+      I +  ++ + +   +
Subjt:  MNYALQGHHPPSQLAAMAAVSNTASGASSFWIVDNGCKSHVT----HLNLTNNYNGEEVVAVAIDGFYPVQASANGSSVMTNNFGILN--KSSMPMSAFI

Query:  CTDDANIFAVGNISDAKNIGQFSIDENIF-----ATEIISEVQN---------------ISLNSVATSASFATTANISAPKESFDIWHFRLGHPSPVVLH
            A+       ++  +I QF +D N           + ++Q                I L S++ + S A +A +   K S  IWH RLGH S  ++ 
Subjt:  CTDDANIFAVGNISDAKNIGQFSIDENIF-----ATEIISEVQN---------------ISLNSVATSASFATTANISAPKESFDIWHFRLGHPSPVVLH

Query:  KLLTRHSI-VAGSTPTTRECIGCLKGK
        +LL +HS+ V GS      C  C  GK
Subjt:  KLLTRHSI-VAGSTPTTRECIGCLKGK

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGAATTACGCTCTTCAAGGACATCATCCTCCCTCTCAGCTTGCGGCAATGGCCGCCGTCTCCAATACTGCCTCGGGTGCTTCTTCATTTTGGATAGTTGATAATGGTTG
TAAGTCCCATGTTACTCATCTCAACCTAACTAACAATTATAATGGGGAGGAGGTCGTTGCTGTTGCCATTGATGGCTTTTACCCCGTTCAAGCTTCTGCAAATGGTTCAA
GTGTTATGACAAATAATTTTGGTATTTTAAATAAATCTAGCATGCCAATGTCTGCTTTCATATGTACTGATGATGCAAATATTTTTGCTGTTGGGAACATTTCTGATGCT
AAAAATATTGGACAATTTTCTATTGATGAGAATATTTTTGCTACTGAGATCATTTCTGAAGTTCAGAATATTTCTCTTAATTCTGTTGCCACATCTGCATCTTTTGCTAC
TACTGCTAATATTTCTGCTCCTAAAGAATCTTTTGATATTTGGCATTTTAGACTTGGTCACCCTTCTCCTGTTGTTTTACATAAACTTTTAACTCGTCATTCTATTGTTG
CTGGTTCTACTCCTACAACTAGGGAATGCATTGGCTGTTTGAAAGGGAAAATGCCTAAAATTTCATTCCGTTGTCCGCATCTGTTTCTGTTGCACCACTCGCCCTAG
mRNA sequence
ATGAATTACGCTCTTCAAGGACATCATCCTCCCTCTCAGCTTGCGGCAATGGCCGCCGTCTCCAATACTGCCTCGGGTGCTTCTTCATTTTGGATAGTTGATAATGGTTG
TAAGTCCCATGTTACTCATCTCAACCTAACTAACAATTATAATGGGGAGGAGGTCGTTGCTGTTGCCATTGATGGCTTTTACCCCGTTCAAGCTTCTGCAAATGGTTCAA
GTGTTATGACAAATAATTTTGGTATTTTAAATAAATCTAGCATGCCAATGTCTGCTTTCATATGTACTGATGATGCAAATATTTTTGCTGTTGGGAACATTTCTGATGCT
AAAAATATTGGACAATTTTCTATTGATGAGAATATTTTTGCTACTGAGATCATTTCTGAAGTTCAGAATATTTCTCTTAATTCTGTTGCCACATCTGCATCTTTTGCTAC
TACTGCTAATATTTCTGCTCCTAAAGAATCTTTTGATATTTGGCATTTTAGACTTGGTCACCCTTCTCCTGTTGTTTTACATAAACTTTTAACTCGTCATTCTATTGTTG
CTGGTTCTACTCCTACAACTAGGGAATGCATTGGCTGTTTGAAAGGGAAAATGCCTAAAATTTCATTCCGTTGTCCGCATCTGTTTCTGTTGCACCACTCGCCCTAG
Protein sequence
MNYALQGHHPPSQLAAMAAVSNTASGASSFWIVDNGCKSHVTHLNLTNNYNGEEVVAVAIDGFYPVQASANGSSVMTNNFGILNKSSMPMSAFICTDDANIFAVGNISDA
KNIGQFSIDENIFATEIISEVQNISLNSVATSASFATTANISAPKESFDIWHFRLGHPSPVVLHKLLTRHSIVAGSTPTTRECIGCLKGKMPKISFRCPHLFLLHHSP