CuGenDBv2

Lag0017922 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0017922
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retroelement pol polyprotein-like
Genome location: chr5:11736203..11736863
RNA-Seq Expression: Lag0017922
Synteny: Lag0017922
Gene Ontology terms: NA
InterPro domains: NA
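The genome location above can be turned into a span length with a few lines of code. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention); the function names `parse_location` and `span_length` are hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


def span_length(loc):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    _, start, end = parse_location(loc)
    return end - start + 1


print(span_length("chr5:11736203..11736863"))  # 661
```

Under that assumption the gene spans 661 bp on chr5.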


Homology
GenBank top hits (e value, % identity, alignment)
XP_030477908.1 uncharacterized protein LOC115694945 [Cannabis sativa] (e value 8.0e-22, 43.09% identity)
Query:  ITNALKNVTVVSHVQQPLVVEPAAVVNQVTEESCVYCGEEHNYEFCPSNPAYVFFVGVTTLTL-RGEGKEVIHKSLKH-----------SKRWQPSQDLL
        +TN LKN++    +     ++PAA + Q  + SCV+CGE H +E CPSNP  V ++G        G      ++S K+           S    P+Q   
Subjt:  ITNALKNVTVVSHVQQPLVVEPAAVVNQVTEESCVYCGEEHNYEFCPSNPAYVFFVGVTTLTL-RGEGKEVIHKSLKH-----------SKRWQPSQDLL

Query:  NRRECMPQQNKQALPQQNSE-SSLETLMKEYMARIDAAIQSNQASLRTLELQMGQLANELKARPQGKLLADTEHPRREGKE
               QQ + +   QN++ SSLE+LM++YMA+ DA IQS  ASLR LELQ+G LANELKARPQG L +DTE+PRR+GKE
Subjt:  NRRECMPQQNKQALPQQNSE-SSLETLMKEYMARIDAAIQSNQASLRTLELQMGQLANELKARPQGKLLADTEHPRREGKE

XP_030487620.1 uncharacterized protein LOC115704559 [Cannabis sativa] (e value 2.7e-22, 43.89% identity)
Query:  ITNALKNVTVVSHVQQPLVVEPAAVVNQVTEESCVYCGEEHNYEFCPSNPAYVFFVGVTTLTLRGEG-KEVIHKSLKH-----------SKRWQPSQDLL
        +TN LKN+ +   VQ      PAA + Q  E SCVYCG+ H +E CPSNPA V +VG               +   KH           S      Q   
Subjt:  ITNALKNVTVVSHVQQPLVVEPAAVVNQVTEESCVYCGEEHNYEFCPSNPAYVFFVGVTTLTLRGEG-KEVIHKSLKH-----------SKRWQPSQDLL

Query:  NRRECMPQQNKQALPQQNSESSLETLMKEYMARIDAAIQSNQASLRTLELQMGQLANELKARPQGKLLADTEHPRREGKE
        +      QQ +   PQ +  SSLE+LM++YMA+ DA IQS  ASLR LE+Q+GQLAN+LK RPQG L +DTE+PRR+GKE
Subjt:  NRRECMPQQNKQALPQQNSESSLETLMKEYMARIDAAIQSNQASLRTLELQMGQLANELKARPQGKLLADTEHPRREGKE

XP_030494802.1 uncharacterized protein LOC115710583 [Cannabis sativa] (e value 2.2e-24, 46.75% identity)
Query:  ITNALKNVTVVSHVQQPLVVEPAAVVNQVTEESCVYCGEEHNYEFCPSNPAYVFFVGVTTL-TLRGEGKEVIHKSLKHSKRWQPSQDLLNRRECMPQQNK
        +TN LKN+ +   VQ      PAA + Q  E SCVYCG+ H +E CPSNPA V +VG ++    + +GK+          R              PQQ  
Subjt:  ITNALKNVTVVSHVQQPLVVEPAAVVNQVTEESCVYCGEEHNYEFCPSNPAYVFFVGVTTL-TLRGEGKEVIHKSLKHSKRWQPSQDLLNRRECMPQQNK

Query:  QALPQQNSESSLETLMKEYMARIDAAIQSNQASLRTLELQMGQLANELKARPQGKLLADTEHPRREGKE
        Q  PQ +  SSLE+LM++YMA+ DA IQS  ASL+ LE+Q+GQLAN+LK RPQG L +DTE+PRR+GKE
Subjt:  QALPQQNSESSLETLMKEYMARIDAAIQSNQASLRTLELQMGQLANELKARPQGKLLADTEHPRREGKE

XP_030498047.1 uncharacterized protein LOC115713707 [Cannabis sativa] (e value 4.7e-22, 42.62% identity)
Query:  ITNALKNVTVVSHVQQPLVVEPAAVVNQVTEESCVYCGEEHNYEFCPSNPAYVFFVGVTTLTLRGEG-KEVIHKSLKHSKRW--------------QPSQ
        +TN LKN+ +   VQ      PAA + Q  E SCVYCG+ H +E CPSNPA V +VG               + + KH   +              Q  Q
Subjt:  ITNALKNVTVVSHVQQPLVVEPAAVVNQVTEESCVYCGEEHNYEFCPSNPAYVFFVGVTTLTLRGEG-KEVIHKSLKHSKRW--------------QPSQ

Query:  DLLNRRECMPQQNKQALPQQNSESSLETLMKEYMARIDAAIQSNQASLRTLELQMGQLANELKARPQGKLLADTEHPRREGKE
                 P+  +   PQ +  SSLE+LM++YMA+ DA IQS  ASLR LE+Q+GQLAN LK RPQG L +DTE+PRR+GKE
Subjt:  DLLNRRECMPQQNKQALPQQNSESSLETLMKEYMARIDAAIQSNQASLRTLELQMGQLANELKARPQGKLLADTEHPRREGKE

XP_030509259.1 uncharacterized protein LOC115723937 [Cannabis sativa] (e value 1.6e-22, 42.93% identity)
Query:  ITNALKNVTVVSHVQQPLVVEPAAVVNQVTEESCVYCGEEHNYEFCPSNPAYVFFVGVTTLTLRGEG-KEVIHKSLKHSKRW---------QPSQDLLNR
        +TN LKN+ +   VQ      PAA + Q  E SCVYCG+ H +E CPSNPA V +VG               + + KH   +           +Q    +
Subjt:  ITNALKNVTVVSHVQQPLVVEPAAVVNQVTEESCVYCGEEHNYEFCPSNPAYVFFVGVTTLTLRGEG-KEVIHKSLKHSKRW---------QPSQDLLNR

Query:  RECMPQQNKQALPQQ------NSESSLETLMKEYMARIDAAIQSNQASLRTLELQMGQLANELKARPQGKLLADTEHPRREGKE
        +   P  ++Q  PQQ      +  SSLE+LM++YMA+ DA IQS  ASLR LE+Q+GQLAN+LK RPQG L +DTE+PRR+GKE
Subjt:  RECMPQQNKQALPQQ------NSESSLETLMKEYMARIDAAIQSNQASLRTLELQMGQLANELKARPQGKLLADTEHPRREGKE

TrEMBL top hits (e value, % identity, alignment)
A0A5B6VWJ0 Retroelement pol polyprotein-like (e value 1.9e-13, 35.98% identity)
Query:  ITNALKNVTVVSHVQQPLVVE-----PAAVVNQVTEESCVYCGEEHNYEFCPSNPAYVFFVGVTTLTLRGEGKEVIHKSLKHSKRWQPSQDL--------
        IT+    V+ +S + + L         A   NQ    + VYCGE H  E CPSNP  V+++G      +  G++ +  +  +S  W+   D         
Subjt:  ITNALKNVTVVSHVQQPLVVE-----PAAVVNQVTEESCVYCGEEHNYEFCPSNPAYVFFVGVTTLTLRGEGKEVIHKSLKHSKRWQPSQDL--------

Query:  ------LNRRECMPQ--QNKQALPQQNSESSLETLMKEYMARIDAAIQSNQASLRTLELQMGQLANELKARPQGKLLADTEHPRREGKE
                R   +P   Q  Q L Q  + +SLE+L+K YMA+ DA IQS  A+L+ LE Q+GQLA EL+ R QG L +DTE+PR  GKE
Subjt:  ------LNRRECMPQ--QNKQALPQQNSESSLETLMKEYMARIDAAIQSNQASLRTLELQMGQLANELKARPQGKLLADTEHPRREGKE

A0A6J1DYG0 uncharacterized protein LOC111025764 (e value 1.2e-18, 40.24% identity)
Query:  AVVNQVTEESCVYCGEEHNYEFCPSNPAYVFFVGVTTLTLRGEGKEVIHKSLKH-----------SKRWQPSQDLLNRRECMP-------------QQNK
        A V QV +  C +C E H Y+ CP NPA VF+VG              +   +H           S  +   Q   N++  +P              Q  
Subjt:  AVVNQVTEESCVYCGEEHNYEFCPSNPAYVFFVGVTTLTLRGEGKEVIHKSLKH-----------SKRWQPSQDLLNRRECMP-------------QQNK

Query:  QALPQQNSESSLETLMKEYMARIDAAIQSNQASLRTLELQMGQLANELKARPQGKLLADTEHPRREGKE
        Q  P QN+ S+LE +MKEYMAR DA IQS  AS+R  E Q+GQLANELK RPQG     TE P+REGKE
Subjt:  QALPQQNSESSLETLMKEYMARIDAAIQSNQASLRTLELQMGQLANELKARPQGKLLADTEHPRREGKE

A0A6J1DZC3 uncharacterized protein LOC111024449 (e value 1.5e-13, 34.76% identity)
Query:  AVVNQVTEESCVYCGEEHNYEFCPSNPAYVFFVGVTTLTLRGEGKEVIHKSL-------------KHSKRWQPSQDLLNRRECMPQQNKQALPQQNSESS
        A V  + E  C YCG+ HN E CPSNP  + +VG  +    G+ +   +K+              +H+++   SQ +     CM    K+ +     E +
Subjt:  AVVNQVTEESCVYCGEEHNYEFCPSNPAYVFFVGVTTLTLRGEGKEVIHKSL-------------KHSKRWQPSQDLLNRRECMPQQNKQALPQQNSESS

Query:  L--ETLMKEYMARIDAAIQS----NQASLRTLELQMGQLANELKARPQGKLLADTEHPRREGKE
        +  +T M+E+  R D AI+     N A++R LE QMGQLA+ELK RP+G L + TE P+ EG+E
Subjt:  L--ETLMKEYMARIDAAIQS----NQASLRTLELQMGQLANELKARPQGKLLADTEHPRREGKE

A0A6J1EQ90 uncharacterized protein LOC111436411 (e value 6.0e-15, 34.03% identity)
Query:  ITNALKNVTVVSHVQQPLVVEPAAVVNQVTEESCVYCGEEHNYEFCPSNPAYVFFVGVTTL---------------------TLRGEGKEVIHKSLKHSK
        +TN L+N+ +         V  AA +NQ   ESCVYCGEEH ++ CPSNPA +F+VG                             +G+ + ++ +    
Subjt:  ITNALKNVTVVSHVQQPLVVEPAAVVNQVTEESCVYCGEEHNYEFCPSNPAYVFFVGVTTL---------------------TLRGEGKEVIHKSLKHSK

Query:  RWQPSQDLLNRRECMPQQ-NKQ----ALPQQNSESSLETLMKEYMARIDAAIQSNQASLRTLELQMGQLANELKARPQGKLLADTEHPRRE
         +     L N+     QQ N Q       Q  SE+S+E+L+KEYMA+ DA IQS QASLR LE+Q+G   N  +     +  ADT+    E
Subjt:  RWQPSQDLLNRRECMPQQ-NKQ----ALPQQNSESSLETLMKEYMARIDAAIQSNQASLRTLELQMGQLANELKARPQGKLLADTEHPRRE

A0A6J1G7Q6 uncharacterized protein LOC111451598 (e value 2.8e-20, 37.76% identity)
Query:  ITNALKNVTVVSHVQQPLVVEPAAVVNQVTEESCVYCGEEHNYEFCPSNPAYVFFVG-----------------------VTTLTLRGEGKEVIHKSLKH
        +TN L+N+              A V+ Q   ESCVYCGE+H ++ CPSNPA +F+VG                             +G+G    ++ +  
Subjt:  ITNALKNVTVVSHVQQPLVVEPAAVVNQVTEESCVYCGEEHNYEFCPSNPAYVFFVG-----------------------VTTLTLRGEGKEVIHKSLKH

Query:  SKRWQPSQDLLNR-----RECMPQQNKQALPQQNSESSLETLMKEYMARIDAAIQSNQASLRTLELQMGQLANELKARPQGKLLADTEHPRREGKE
           + P   L N+     ++   Q    +  Q  S + LE+L+KEYMAR DA IQS Q SLR LE+Q+GQLANEL+ RP GKL  DTE P+REG E
Subjt:  SKRWQPSQDLLNR-----RECMPQQNKQALPQQNSESSLETLMKEYMARIDAAIQSNQASLRTLELQMGQLANELKARPQGKLLADTEHPRREGKE

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGCTGCCGAAATTTTCGGCTAGAGGTTTGAAAACCTGGGGCATTACAAATGCTCTTAAAAATGTAACTGTGGTTAGCCATGTTCAGCAACCGCTAGTGGTGGAG
CCTGCTGCAGTTGTAAACCAAGTTACAGAAGAATCATGTGTCTATTGTGGTGAAGAGCATAATTATGAGTTTTGCCCCAGCAATCCAGCCTATGTGTTTTTTGTT
GGCGTAACCACCTTAACTTTGCGTGGGGAGGGCAAGGAAGTAATTCACAAGTCCCTCAAGCATAGCAAAAGGTGGCAACCCAGTCAGGATTTGCTAAATCGCAGG
GAATGCATGCCCCAGCAAAATAAGCAGGCATTACCCCAGCAAAATTCAGAGAGTTCTCTAGAGACTTTGATGAAAGAATATATGGCTCGTATTGATGCTGCAATT
CAAAGTAATCAAGCTTCATTGAGAACTCTGGAATTGCAAATGGGTCAGCTAGCTAATGAGCTGAAGGCACGACCTCAAGGAAAACTTCTTGCTGATACTGAACAC
CCTAGAAGGGAAGGTAAGGAGTGA
mRNA sequence
ATGCTGCCGAAATTTTCGGCTAGAGGTTTGAAAACCTGGGGCATTACAAATGCTCTTAAAAATGTAACTGTGGTTAGCCATGTTCAGCAACCGCTAGTGGTGGAG
CCTGCTGCAGTTGTAAACCAAGTTACAGAAGAATCATGTGTCTATTGTGGTGAAGAGCATAATTATGAGTTTTGCCCCAGCAATCCAGCCTATGTGTTTTTTGTT
GGCGTAACCACCTTAACTTTGCGTGGGGAGGGCAAGGAAGTAATTCACAAGTCCCTCAAGCATAGCAAAAGGTGGCAACCCAGTCAGGATTTGCTAAATCGCAGG
GAATGCATGCCCCAGCAAAATAAGCAGGCATTACCCCAGCAAAATTCAGAGAGTTCTCTAGAGACTTTGATGAAAGAATATATGGCTCGTATTGATGCTGCAATT
CAAAGTAATCAAGCTTCATTGAGAACTCTGGAATTGCAAATGGGTCAGCTAGCTAATGAGCTGAAGGCACGACCTCAAGGAAAACTTCTTGCTGATACTGAACAC
CCTAGAAGGGAAGGTAAGGAGTGA
Protein sequence
MLPKFSARGLKTWGITNALKNVTVVSHVQQPLVVEPAAVVNQVTEESCVYCGEEHNYEFCPSNPAYVFFVGVTTLTLRGEGKEVIHKSLKHSKRWQPSQDLLNRR
ECMPQQNKQALPQQNSESSLETLMKEYMARIDAAIQSNQASLRTLELQMGQLANELKARPQGKLLADTEHPRREGKE
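
The relationship between the CDS and the protein sequence listed above can be checked by translating the CDS. The following is a minimal sketch, assuming the standard genetic code applies; `translate` and `CODON_TABLE` are hypothetical names written for this illustration, and only the first 105 nucleotides of the CDS are included to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First line (105 nt) of the CDS sequence shown in this record.
first_line = (
    "ATGCTGCCGAAATTTTCGGCTAGAGGTTTGAAAACCTGGGGCATTACAAATGCTCTTAAAAATG"
    "TAACTGTGGTTAGCCATGTTCAGCAACCGCTAGTGGTGGAG"
)
print(translate(first_line))  # MLPKFSARGLKTWGITNALKNVTVVSHVQQPLVVE
```

The printed residues match the start of the protein sequence in this record. Passing the full CDS in the same way should give the complete protein, with the terminal TGA read as the stop codon.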