; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0012387 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0012387
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr1:40638981..40639708
RNA-Seq ExpressionLag0012387
SyntenyLag0012387
Gene Ontology termsNA
InterPro domainsIPR025724 - GAG-pre-integrase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA8548185.1 hypothetical protein F0562_004554 [Nyssa sinensis]3.5e-1435.92Show/hide
Query:  MNYSFQGRHPPAKLAAMAASSTSNNFASGVGSVGCSNSAFPQDSQVWLSDTGCNAHLASDLANLSISNAYNGEENITVGQNYGQTSVPRPSVNGLYPISA
        MN++++GR P  KL AMAA ++SN                   S  W+SD+G + H+ +DL NL+I N Y G++++ VG   G+ +    S +GLYP   
Subjt:  MNYSFQGRHPPAKLAAMAASSTSNNFASGVGSVGCSNSAFPQDSQVWLSDTGCNAHLASDLANLSISNAYNGEENITVGQNYGQTSVPRPSVNGLYPISA

Query:  -SSMSTAQVSLTANVGTKNSYSVWHDRLGHPSTSVLQHLLNS
         + +ST      A VG + S  +WH RLGHP+ + L HL+++
Subjt:  -SSMSTAQVSLTANVGTKNSYSVWHDRLGHPSTSVLQHLLNS

RVW31473.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]1.6e-1438.93Show/hide
Query:  MNYSFQGRHPPAKLAAMAASSTSNNFASGVGSVGCSNSAFPQDSQVWLSDTGCNAHLASDLANLSISNAYNGEENITVGQ-NYGQTSVPRPSVNGLYPIS
        MNY+FQ RHPP +LAAM A                +N+ +    Q W  D+G N H+  D ANL+IS  Y G +  TVG    G T +  PS  GLYPI+
Subjt:  MNYSFQGRHPPAKLAAMAASSTSNNFASGVGSVGCSNSAFPQDSQVWLSDTGCNAHLASDLANLSISNAYNGEENITVGQ-NYGQTSVPRPSVNGLYPIS

Query:  ASSMSTAQV-SLTANVGTKNSYSVWHDRLGHPSTSVLQHLLNSIDCSVS
           +S+++  + +  VG K S + WH RLGHPS+S L ++L++    +S
Subjt:  ASSMSTAQV-SLTANVGTKNSYSVWHDRLGHPSTSVLQHLLNSIDCSVS

RVW58434.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]2.1e-1433.33Show/hide
Query:  MNYSFQGRHPPAKLAAMAASSTSNNFASGVGSVGCSNSAFPQDSQVWLSDTGCNAHLASDLANLSISNAYNGEENITVGQNY------------------
        M+Y++QGRHPP +LAAM A S +                  Q+ + W +D+G N H+ ++L +L++   Y G+EN+ VG                     
Subjt:  MNYSFQGRHPPAKLAAMAASSTSNNFASGVGSVGCSNSAFPQDSQVWLSDTGCNAHLASDLANLSISNAYNGEENITVGQNY------------------

Query:  ---------GQTSVPRPSVNGLYPISASSMS-TAQVSLTANVGTKNSYSVWHDRLGHPSTSVLQHLLNS----IDCSVSK
                 G T +   S  GLYPI   SMS     +L+A VG K S SVWH RLGH S  ++  LLN     ++ SV+K
Subjt:  ---------GQTSVPRPSVNGLYPISASSMS-TAQVSLTANVGTKNSYSVWHDRLGHPSTSVLQHLLNS----IDCSVSK

RVW82846.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]4.6e-1438.26Show/hide
Query:  MNYSFQGRHPPAKLAAMAASSTSNNFASGVGSVGCSNSAFPQDSQVWLSDTGCNAHLASDLANLSISNAYNGEENITVGQ-NYGQTSVPRPSVNGLYPIS
        MNY+FQ RHPP +LAAM                  +N+ +    Q W  D+G N H+  D ANL+IS  Y G +  TVG    G T +  PS  GLYPI+
Subjt:  MNYSFQGRHPPAKLAAMAASSTSNNFASGVGSVGCSNSAFPQDSQVWLSDTGCNAHLASDLANLSISNAYNGEENITVGQ-NYGQTSVPRPSVNGLYPIS

Query:  ASSMSTAQV-SLTANVGTKNSYSVWHDRLGHPSTSVLQHLLNSIDCSVS
           +S+++  + +  VG K S + WH RLGHPS+S L ++L++    +S
Subjt:  ASSMSTAQV-SLTANVGTKNSYSVWHDRLGHPSTSVLQHLLNSIDCSVS

XP_022147875.1 mitotic checkpoint serine/threonine-protein kinase BUB1 isoform X1 [Momordica charantia]2.7e-1441.38Show/hide
Query:  MNYSFQGRHPPAKLAAMAASSTSNNFASGVGSVGCSNSAFPQDSQVWLSDTGCNAHLASDLANLSISNAYNGEENITVGQ---NYGQTSVPR--------
        MNYSFQG HPP +L AM ASS S                      VWL+D+GC+ HL  DLANL+ISN YNG       Q   ++G +   R        
Subjt:  MNYSFQGRHPPAKLAAMAASSTSNNFASGVGSVGCSNSAFPQDSQVWLSDTGCNAHLASDLANLSISNAYNGEENITVGQ---NYGQTSVPR--------

Query:  -PSVNGLYPISASSMSTAQVSLTANVGTKNSYSVWHDRLGHPSTS
         P +NGLYPI+A+    A+    A++  K S+ +WH+RLGHPS S
Subjt:  -PSVNGLYPISASSMSTAQVSLTANVGTKNSYSVWHDRLGHPSTS

TrEMBL top hitse value%identityAlignment
A0A2N9F9F8 Uncharacterized protein7.4e-1840Show/hide
Query:  MNYSFQGRHPPAKLAAMAASSTSNNFASGVGSVGCSNSAFPQDSQVWLSDTGCNAHLASDLANLSISNAYNGEENITVGQNYGQTSVPRPSVNGLYPI--
        M++++QGRHPPAKLAAMA  STSN                 Q    WL+DTG   HL ++L NL  +  Y G E        G+      S NGLYPI  
Subjt:  MNYSFQGRHPPAKLAAMAASSTSNNFASGVGSVGCSNSAFPQDSQVWLSDTGCNAHLASDLANLSISNAYNGEENITVGQNYGQTSVPRPSVNGLYPI--

Query:  --SASSMSTAQVS--LTANVGTKNSYSVWHDRLGHPSTSVLQHLLNSIDCSVS---KHTLLIGKHCLDGKMAKWP
          S+SSMS +  S  ++A + +KN + +WH RLGHPS  VL   + ++   +S   KH     KHCL GKM + P
Subjt:  --SASSMSTAQVS--LTANVGTKNSYSVWHDRLGHPSTSVLQHLLNSIDCSVS---KHTLLIGKHCLDGKMAKWP

A0A2N9FKJ8 Uncharacterized protein8.8e-1936.02Show/hide
Query:  MNYSFQGRHPPAKLAAMAASSTSNNFASGVGSVGCSNSAFPQDSQVWLSDTGCNAHLASDLANLSISNAYNGEENITVGQNY------------------
        M++++QGRHPPAKLAAMA  STSNN                Q  + WL+DTG   HL ++L NL++ N Y G + + VG                     
Subjt:  MNYSFQGRHPPAKLAAMAASSTSNNFASGVGSVGCSNSAFPQDSQVWLSDTGCNAHLASDLANLSISNAYNGEENITVGQNY------------------

Query:  -----------------GQTSVPRPSVNGLYPISA---SSMSTAQV----SLTANVGTKNSYSVWHDRLGHPSTSVLQHLLNSIDCSVS---KHTLLIGK
                         G+      S NGLYPI     SS+ST  V    S++A + +KN + +WH RLGHPS  VL   L S+   +S   KH     K
Subjt:  -----------------GQTSVPRPSVNGLYPISA---SSMSTAQV----SLTANVGTKNSYSVWHDRLGHPSTSVLQHLLNSIDCSVS---KHTLLIGK

Query:  HCLDGKMAKWP
        HCL GKM K P
Subjt:  HCLDGKMAKWP

A0A2N9GCR2 Uncharacterized protein2.2e-1734.13Show/hide
Query:  MNYSFQGRHPPAKLAAMAASSTSNNFASGVGSVGCSNSAFPQDSQVWLSDTGCNAHLASDLANLSISNAYNGEENITVGQNY------------------
        M++++QGRHPPAKLAAMA  STSN   +G               + WL+DTG   HL +++ NL++   Y G + + VG                     
Subjt:  MNYSFQGRHPPAKLAAMAASSTSNNFASGVGSVGCSNSAFPQDSQVWLSDTGCNAHLASDLANLSISNAYNGEENITVGQNY------------------

Query:  -----------------GQTSVPRPSVNGLYPISASSMS----TAQVSLTANVGTKNSYSVWHDRLGHPSTSVLQHLLNSIDCSVS---KHTLLIGKHCL
                         G+      S NGLYPI  +  S    TA  S++A + +KN + +WH RLGHPS  VL   L S+   +S   KH     KHCL
Subjt:  -----------------GQTSVPRPSVNGLYPISASSMS----TAQVSLTANVGTKNSYSVWHDRLGHPSTSVLQHLLNSIDCSVS---KHTLLIGKHCL

Query:  DGKMAKWP
         GKM K P
Subjt:  DGKMAKWP

A0A2N9I8B6 CCHC-type domain-containing protein1.4e-1633.18Show/hide
Query:  MNYSFQGRHPPAKLAAMAASSTSNNFASGVGSVGCSNSAFPQDSQVWLSDTGCNAHLASDLANLSISNAYNGEENITVGQNY------------------
        M++++QGRHPPAKLAAMA+ S  +                 Q  + WL+DTG   H+ ++L NL++   Y G + + VG                     
Subjt:  MNYSFQGRHPPAKLAAMAASSTSNNFASGVGSVGCSNSAFPQDSQVWLSDTGCNAHLASDLANLSISNAYNGEENITVGQNY------------------

Query:  -----------------GQTSVPRPSVNGLYPISA---SSMSTAQV----SLTANVGTKNSYSVWHDRLGHPSTSVLQHLLNSIDCSVS---KHTLLIGK
                         G+      S NGLYPI     SS ST  V    S++A + +KN + +WH RLGHPS  VL   L S+   +S   KH     K
Subjt:  -----------------GQTSVPRPSVNGLYPISA---SSMSTAQV----SLTANVGTKNSYSVWHDRLGHPSTSVLQHLLNSIDCSVS---KHTLLIGK

Query:  HCLDGKMAKWP
        HCL GKM K P
Subjt:  HCLDGKMAKWP

A0A2N9J6E3 Uncharacterized protein2.1e-2042.05Show/hide
Query:  MNYSFQGRHPPAKLAAMAASSTSNNFASGVGSVGCSNSAFPQDSQVWLSDTGCNAHLASDLANLSISNAYNGEENITVGQNYGQTSVPRPSVNGLYPISA
        M++++QGRHPPAKLAAMA  STSNN                Q  + WL+DTG   HL ++L NL++ N Y G +        G+      S NGLYPI  
Subjt:  MNYSFQGRHPPAKLAAMAASSTSNNFASGVGSVGCSNSAFPQDSQVWLSDTGCNAHLASDLANLSISNAYNGEENITVGQNYGQTSVPRPSVNGLYPISA

Query:  ---SSMSTAQV----SLTANVGTKNSYSVWHDRLGHPSTSVLQHLLNSIDCSVS---KHTLLIGKHCLDGKMAKWP
           SS+ST  V    S++A + +KN + +WH RLGHPS  VL   L S+   +S   KH     KHCL GKM K P
Subjt:  ---SSMSTAQV----SLTANVGTKNSYSVWHDRLGHPSTSVLQHLLNSIDCSVS---KHTLLIGKHCLDGKMAKWP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACTACTCCTTCCAAGGTCGACATCCGCCAGCGAAGCTTGCTGCTATGGCTGCGTCCTCTACGTCGAACAATTTTGCCTCTGGTGTTGGGAGTGTTGGATGT
TCGAACAGTGCATTTCCTCAAGATTCTCAAGTCTGGTTGTCTGATACCGGCTGCAACGCTCACCTGGCAAGTGATCTGGCTAATCTCAGCATCTCCAATGCCTAC
AATGGAGAAGAAAATATTACAGTGGGACAAAACTACGGGCAAACCTCTGTTCCAAGGCCTAGCGTGAACGGCCTCTACCCTATCTCTGCCTCGTCCATGTCTACT
GCACAAGTAAGCCTTACTGCTAATGTAGGTACCAAGAATTCATACAGTGTGTGGCACGATAGGTTAGGTCATCCATCTACTTCTGTTCTTCAACACCTGTTAAAC
TCGATTGATTGTTCTGTTTCTAAACATACTTTGCTTATCGGTAAGCATTGCCTTGATGGAAAAATGGCCAAATGGCCAAATTACCCTTTCCCTTATCTACTACTT
CTACTGTTGCACCATTAG
mRNA sequenceShow/hide mRNA sequence
ATGAACTACTCCTTCCAAGGTCGACATCCGCCAGCGAAGCTTGCTGCTATGGCTGCGTCCTCTACGTCGAACAATTTTGCCTCTGGTGTTGGGAGTGTTGGATGT
TCGAACAGTGCATTTCCTCAAGATTCTCAAGTCTGGTTGTCTGATACCGGCTGCAACGCTCACCTGGCAAGTGATCTGGCTAATCTCAGCATCTCCAATGCCTAC
AATGGAGAAGAAAATATTACAGTGGGACAAAACTACGGGCAAACCTCTGTTCCAAGGCCTAGCGTGAACGGCCTCTACCCTATCTCTGCCTCGTCCATGTCTACT
GCACAAGTAAGCCTTACTGCTAATGTAGGTACCAAGAATTCATACAGTGTGTGGCACGATAGGTTAGGTCATCCATCTACTTCTGTTCTTCAACACCTGTTAAAC
TCGATTGATTGTTCTGTTTCTAAACATACTTTGCTTATCGGTAAGCATTGCCTTGATGGAAAAATGGCCAAATGGCCAAATTACCCTTTCCCTTATCTACTACTT
CTACTGTTGCACCATTAG
Protein sequenceShow/hide protein sequence
MNYSFQGRHPPAKLAAMAASSTSNNFASGVGSVGCSNSAFPQDSQVWLSDTGCNAHLASDLANLSISNAYNGEENITVGQNYGQTSVPRPSVNGLYPISASSMST
AQVSLTANVGTKNSYSVWHDRLGHPSTSVLQHLLNSIDCSVSKHTLLIGKHCLDGKMAKWPNYPFPYLLLLLLHH