CuGenDBv2

Lag0027968 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0027968
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr8:8928587..8929306
RNA-Seq Expression: Lag0027968
Synteny: Lag0027968
Gene Ontology terms: NA
InterPro domains: NA
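
The genome location above is written as `chromosome:start..end`. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state its convention), the string can be split into its parts and the span length computed. The names `Location` and `parse_location` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based inclusive coordinates (not confirmed by the source page).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'chrom:start..end' string into chromosome, start and end."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return Location(chrom, start, end)


loc = parse_location("chr8:8928587..8929306")
print(loc.chrom, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, this record spans 720 bp on chr8.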


Homology
GenBank top hits (e value, % identity, alignment)
TYK05754.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]; e value 2.2e-19; identity 42.13%
Query:  VATQVMGFENAQDLWAAVQELFGVQSRAEEDY----------------------------LCQAGSPITTRSLVSQFLLGLEE-YNPVVAMIQGRICVTW
        +A Q+MGF NA+DLW A Q+LFGVQSRAEED+                            L QAGSP+  R+ +SQ LLGL+E YNPV+A+IQG+  ++W
Subjt:  VATQVMGFENAQDLWAAVQELFGVQSRAEEDY----------------------------LCQAGSPITTRSLVSQFLLGLEE-YNPVVAMIQGRICVTW

Query:  SELQTELLVFEKRLELQNNLKSLLTLNATTSVNMASSKDAGTQRN-QNSSTNGRSNNFGRGNQLGGGQRGRGRARGYG
         ++Q+ELL FEKRLE Q+  K+   +     VN+A ++++   R   N   +G + N  +G Q GG   GRGR +G G
Subjt:  SELQTELLVFEKRLELQNNLKSLLTLNATTSVNMASSKDAGTQRN-QNSSTNGRSNNFGRGNQLGGGQRGRGRARGYG

XP_022151683.1 uncharacterized protein LOC111019598 [Momordica charantia]; e value 1.3e-22; identity 44.32%
Query:  MSLEVATQVMGFENAQDLWAAVQELFGVQSRAEEDYLCQ----------------------------AGSPITTRSLVSQFLLGL-EEYNPVVAMIQGRI
        M+ +VA QVMGF  +++LW AVQELFGVQSRAE DYL Q                            AGS ++ R LVSQ L GL EEYNP+V  +QG++
Subjt:  MSLEVATQVMGFENAQDLWAAVQELFGVQSRAEEDYLCQ----------------------------AGSPITTRSLVSQFLLGL-EEYNPVVAMIQGRI

Query:  CVTWSELQTELLVFEKRLELQNNLKSLLTLN--ATTSVNMASSKDAGTQRNQNSSTNGRSNNFGRGNQLGGGQRG----RGRARG
         ++WSE+  ELL +EKRLE QN+LKS + +N   T SVN    +   T +  N+  N   +N  RG   GG QRG    R R RG
Subjt:  CVTWSELQTELLVFEKRLELQNNLKSLLTLN--ATTSVNMASSKDAGTQRNQNSSTNGRSNNFGRGNQLGGGQRG----RGRARG

XP_038896600.1 uncharacterized protein LOC120084860 [Benincasa hispida]; e value 1.5e-20; identity 46.81%
Query:  MSLEVATQVMGFENAQDLWAAVQELFGVQSRAEEDY----------------------------LCQAGSPITTRSLVSQFLLGL-EEYNPVVAMIQGRI
        M+L+VA Q+MG+E A++LW A+QELFGVQSRAEEDY                            L Q  SP++TR+L+SQ LLGL EEYN VV  IQG+ 
Subjt:  MSLEVATQVMGFENAQDLWAAVQELFGVQSRAEEDY----------------------------LCQAGSPITTRSLVSQFLLGL-EEYNPVVAMIQGRI

Query:  CVTWSELQTELLVFEKRLELQNNLKSLLTLNATTSVNMASS
         ++W ++Q+ELL +EKRLE QN++K   T      VN+A S
Subjt:  CVTWSELQTELLVFEKRLELQNNLKSLLTLNATTSVNMASS

XP_038905161.1 uncharacterized protein LOC120091275 isoform X1 [Benincasa hispida]; e value 6.1e-25; identity 46.41%
Query:  MSLEVATQVMGFENAQDLWAAVQELFGVQSRAEEDYLC----------------------------QAGSPITTRSLVSQFLLGL-EEYNPVVAMIQGRI
        M+ EVA QVMG E A+DLW ++ +LFGVQSR EEDYL                             QAGSP+  R+LVSQ LLGL EEYN +VAMIQGR+
Subjt:  MSLEVATQVMGFENAQDLWAAVQELFGVQSRAEEDYLC----------------------------QAGSPITTRSLVSQFLLGL-EEYNPVVAMIQGRI

Query:  CVTWSELQTELLVFEKRLELQNNLKSLLTLN--ATTSVNMASSKDAGTQRNQNSSTNGRSNNFGRGNQLGGGQRGRGRARG
         ++W ++Q+ELL++E+RLE Q+N K+ +  N  +  SVNM +++      NQN+ TN  + + G G Q GGG  GRGR RG
Subjt:  CVTWSELQTELLVFEKRLELQNNLKSLLTLN--ATTSVNMASSKDAGTQRNQNSSTNGRSNNFGRGNQLGGGQRGRGRARG

XP_038905164.1 uncharacterized protein LOC120091275 isoform X4 [Benincasa hispida]; e value 6.1e-25; identity 46.41%
Query:  MSLEVATQVMGFENAQDLWAAVQELFGVQSRAEEDYLC----------------------------QAGSPITTRSLVSQFLLGL-EEYNPVVAMIQGRI
        M+ EVA QVMG E A+DLW ++ +LFGVQSR EEDYL                             QAGSP+  R+LVSQ LLGL EEYN +VAMIQGR+
Subjt:  MSLEVATQVMGFENAQDLWAAVQELFGVQSRAEEDYLC----------------------------QAGSPITTRSLVSQFLLGL-EEYNPVVAMIQGRI

Query:  CVTWSELQTELLVFEKRLELQNNLKSLLTLN--ATTSVNMASSKDAGTQRNQNSSTNGRSNNFGRGNQLGGGQRGRGRARG
         ++W ++Q+ELL++E+RLE Q+N K+ +  N  +  SVNM +++      NQN+ TN  + + G G Q GGG  GRGR RG
Subjt:  CVTWSELQTELLVFEKRLELQNNLKSLLTLN--ATTSVNMASSKDAGTQRNQNSSTNGRSNNFGRGNQLGGGQRGRGRARG

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LXB7 Uncharacterized protein; e value 9.1e-19; identity 37.5%
Query:  EVATQVMGFENAQDLWAAVQELFGVQSRAEEDY----------------------------LCQAGSPITTRSLVSQFLLGLEE-YNPVVAMIQGRICVT
        EV  Q++GF NA+D+W A  + FGV+SRAEED+                            L QA SPI  R+L+SQ LLGL+E YNPV+ +IQG+  ++
Subjt:  EVATQVMGFENAQDLWAAVQELFGVQSRAEEDY----------------------------LCQAGSPITTRSLVSQFLLGLEE-YNPVVAMIQGRICVT

Query:  WSELQTELLVFEKRLELQNNLKSLLTLNATTSVNMASSKDAGTQR-NQNSSTNGRSNNFGRGNQLGGGQRGRGRARGYGFSNFNWNKLVLFQ
        W ++Q++LL+FEKRL+ QN+ K++  +    ++NMA S++    R ++N   +G + N  +G++ GG    RG  RG G  +   N+  L Q
Subjt:  WSELQTELLVFEKRLELQNNLKSLLTLNATTSVNMASSKDAGTQR-NQNSSTNGRSNNFGRGNQLGGGQRGRGRARGYGFSNFNWNKLVLFQ

A0A5A7SIT7 Uncharacterized protein; e value 6.1e-15; identity 37.5%
Query:  MSLEVATQVMGFENAQDLWAAVQELFGVQSRAEEDY----------------------------LCQAGSPITTRSLVSQFLLGLEE-YNPVVAMIQGRI
        M+ +VA Q+MGF N +DLW A Q+ FGVQSRAEED+                            L Q GSP+  R+L+SQ LLGL+E YN V+ +IQG+ 
Subjt:  MSLEVATQVMGFENAQDLWAAVQELFGVQSRAEEDY----------------------------LCQAGSPITTRSLVSQFLLGLEE-YNPVVAMIQGRI

Query:  CVTWSELQTELLVFEKRLELQNNLKSLL---TLNATTSVNMASSKDAGTQRNQNSSTNGRSNNFGRGNQLGGGQRG
         ++W ++Q++LL+FEK L+ QN  K       +  + ++NMA       QRN ++        +G   Q   GQRG
Subjt:  CVTWSELQTELLVFEKRLELQNNLKSLL---TLNATTSVNMASSKDAGTQRNQNSSTNGRSNNFGRGNQLGGGQRG

A0A5D3C373 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value 1.1e-19; identity 42.13%
Query:  VATQVMGFENAQDLWAAVQELFGVQSRAEEDY----------------------------LCQAGSPITTRSLVSQFLLGLEE-YNPVVAMIQGRICVTW
        +A Q+MGF NA+DLW A Q+LFGVQSRAEED+                            L QAGSP+  R+ +SQ LLGL+E YNPV+A+IQG+  ++W
Subjt:  VATQVMGFENAQDLWAAVQELFGVQSRAEEDY----------------------------LCQAGSPITTRSLVSQFLLGLEE-YNPVVAMIQGRICVTW

Query:  SELQTELLVFEKRLELQNNLKSLLTLNATTSVNMASSKDAGTQRN-QNSSTNGRSNNFGRGNQLGGGQRGRGRARGYG
         ++Q+ELL FEKRLE Q+  K+   +     VN+A ++++   R   N   +G + N  +G Q GG   GRGR +G G
Subjt:  SELQTELLVFEKRLELQNNLKSLLTLNATTSVNMASSKDAGTQRN-QNSSTNGRSNNFGRGNQLGGGQRGRGRARGYG

A0A6J1D5J0 uncharacterized protein LOC111017501; e value 1.4e-19; identity 55.45%
Query:  MSLEVATQVMGFENAQDLWAAVQELFGVQSRAEEDY----------------------------LCQAGSPITTRSLVSQFLLGL-EEYNPVVAMIQGRI
        M+ EVATQVMG+ENA DLWAA+QELFGVQS+AEEDY                            L QAGSP+ TRSL+SQ LLGL EEYNPVVA IQG+ 
Subjt:  MSLEVATQVMGFENAQDLWAAVQELFGVQSRAEEDY----------------------------LCQAGSPITTRSLVSQFLLGL-EEYNPVVAMIQGRI

Query:  CVTWSELQTE
         ++W E+Q E
Subjt:  CVTWSELQTE

A0A6J1DCW4 uncharacterized protein LOC111019598; e value 6.1e-23; identity 44.32%
Query:  MSLEVATQVMGFENAQDLWAAVQELFGVQSRAEEDYLCQ----------------------------AGSPITTRSLVSQFLLGL-EEYNPVVAMIQGRI
        M+ +VA QVMGF  +++LW AVQELFGVQSRAE DYL Q                            AGS ++ R LVSQ L GL EEYNP+V  +QG++
Subjt:  MSLEVATQVMGFENAQDLWAAVQELFGVQSRAEEDYLCQ----------------------------AGSPITTRSLVSQFLLGL-EEYNPVVAMIQGRI

Query:  CVTWSELQTELLVFEKRLELQNNLKSLLTLN--ATTSVNMASSKDAGTQRNQNSSTNGRSNNFGRGNQLGGGQRG----RGRARG
         ++WSE+  ELL +EKRLE QN+LKS + +N   T SVN    +   T +  N+  N   +N  RG   GG QRG    R R RG
Subjt:  CVTWSELQTELLVFEKRLELQNNLKSLLTLN--ATTSVNMASSKDAGTQRNQNSSTNGRSNNFGRGNQLGGGQRG----RGRARG

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGTCTCTCGAAGTAGCCACCCAAGTAATGGGATTCGAGAATGCTCAAGATCTGTGGGCAGCGGTACAAGAGCTGTTTGGTGTTCAATCTCGAGCAGAGGAAGACTACTT
ATGCCAGGCTGGGAGTCCGATTACCACTCGGTCTCTAGTGTCTCAATTTCTTTTAGGGTTAGAAGAGTACAACCCCGTTGTGGCGATGATTCAAGGCAGAATATGTGTGA
CTTGGTCTGAGCTACAAACTGAACTTCTCGTGTTTGAGAAGAGACTTGAATTGCAGAACAACTTAAAGAGCTTGTTGACACTCAATGCCACCACCTCAGTGAACATGGCT
AGCTCTAAAGATGCTGGTACACAGAGGAATCAAAACTCATCTACCAATGGAAGGTCAAACAATTTTGGCCGTGGAAATCAGCTAGGAGGAGGACAGCGTGGCAGAGGTCG
AGCTCGGGGTTATGGTTTCTCAAACTTTAACTGGAACAAGCTCGTCCTGTTCCAACTCAAAACAGAGGGGATTCTACTCGACAGAACTACAATGGTTATTCAAATCAGTC
GGCTGACTTTGTAG
mRNA sequence
ATGTCTCTCGAAGTAGCCACCCAAGTAATGGGATTCGAGAATGCTCAAGATCTGTGGGCAGCGGTACAAGAGCTGTTTGGTGTTCAATCTCGAGCAGAGGAAGACTACTT
ATGCCAGGCTGGGAGTCCGATTACCACTCGGTCTCTAGTGTCTCAATTTCTTTTAGGGTTAGAAGAGTACAACCCCGTTGTGGCGATGATTCAAGGCAGAATATGTGTGA
CTTGGTCTGAGCTACAAACTGAACTTCTCGTGTTTGAGAAGAGACTTGAATTGCAGAACAACTTAAAGAGCTTGTTGACACTCAATGCCACCACCTCAGTGAACATGGCT
AGCTCTAAAGATGCTGGTACACAGAGGAATCAAAACTCATCTACCAATGGAAGGTCAAACAATTTTGGCCGTGGAAATCAGCTAGGAGGAGGACAGCGTGGCAGAGGTCG
AGCTCGGGGTTATGGTTTCTCAAACTTTAACTGGAACAAGCTCGTCCTGTTCCAACTCAAAACAGAGGGGATTCTACTCGACAGAACTACAATGGTTATTCAAATCAGTC
GGCTGACTTTGTAG
Protein sequence
MSLEVATQVMGFENAQDLWAAVQELFGVQSRAEEDYLCQAGSPITTRSLVSQFLLGLEEYNPVVAMIQGRICVTWSELQTELLVFEKRLELQNNLKSLLTLNATTSVNMA
SSKDAGTQRNQNSSTNGRSNNFGRGNQLGGGQRGRGRARGYGFSNFNWNKLVLFQLKTEGILLDRTTMVIQISRLTL
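
The protein sequence above corresponds to a translation of the CDS. The following is a minimal sketch of that step, assuming the standard genetic code (NCBI translation table 1) and a CDS that starts in frame; the page does not say which tool or table produced its protein sequence, and the function name `translate` is chosen here for illustration only.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    # Remove line breaks and other whitespace so wrapped sequences can be pasted in.
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        # Codons containing characters other than T, C, A, G map to 'X'.
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First six codons of the CDS listed above.
print(translate("ATGTCTCTCGAAGTAGCC"))  # prints "MSLEVA"
```

Passing the full CDS listed above to `translate` gives a sequence that can be compared directly with the protein sequence on this page.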