CuGenDBv2

Lag0036150 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0036150
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon opus
Genome location: chr3:40502662..40504535
RNA-Seq Expression: Lag0036150
Synteny: Lag0036150
Gene Ontology terms: NA
InterPro domains: NA
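
The genome location field can be split into chromosome, start, and end to get the span of the gene on the chromosome. The following is a minimal sketch, not part of the CuGenDB record: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re


def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end, span).

    Assumption: coordinates are 1-based and inclusive, so the span is
    end - start + 1. The record does not state its coordinate convention.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


if __name__ == "__main__":
    print(parse_location("chr3:40502662..40504535"))
```

Under that assumption the locus spans 1874 bp, which is longer than the 738 nt CDS listed under Sequences.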


Homology
GenBank top hits | e value | % identity
XP_030494802.1 uncharacterized protein LOC115710583 [Cannabis sativa] | 4.8e-30 | 39.68
Query:  EEFTSCIRRHDPGGVHAKHFYNGLTQAFKAVVNASTNGSLLKKTFNEANEILDAIVANHSQWGEEETP-SRSSVKVQEINHLETLLEQMLTMNDMLMNLT
        E    C     P  +  + FYNGL  A + V++AS NG++L K++NEA EIL+ I +N+ QW     P SR    V E++ L  L  QM +M ++L N+ 
Subjt:  EEFTSCIRRHDPGGVHAKHFYNGLTQAFKAVVNASTNGSLLKKTFNEANEILDAIVANHSQWGEEETP-SRSSVKVQEINHLETLLEQMLTMNDMLMNLT

Query:  LGNQVQAAAD---EDNTLTYLGR----KNASTSNATLFKQGGQLEFNSQTHMKIAQP--------LQQHMQVQPQSTTNLENLMRKMMTKNDTVIQSQTA
        +G  VQ AA     + +  Y G     +N  ++ A++   G       Q   K + P         QQ  Q Q   T++LE+LMR  M KND VIQSQ A
Subjt:  LGNQVQAAAD---EDNTLTYLGR----KNASTSNATLFKQGGQLEFNSQTHMKIAQP--------LQQHMQVQPQSTTNLENLMRKMMTKNDTVIQSQTA

Query:  SLRSLESQIGQLAKEMRNRPLGTLPSNTENPKREGLGKDKASIVPSG
        SL++LE Q+GQLA +++NRP GTLPS+TENP+R+G    KA  + SG
Subjt:  SLRSLESQIGQLAKEMRNRPLGTLPSNTENPKREGLGKDKASIVPSG

XP_030497803.1 uncharacterized protein LOC115713460 [Cannabis sativa] | 5.3e-29 | 37.08
Query:  EEFTSCIRRHDPGGVHAKHFYNGLTQAFKAVVNASTNGSLLKKTFNEANEILDAIVANHSQWGEEET-PSRSSVKVQEINHLETLLEQMLTMNDMLMNLT
        E    C     P  +  + FYNGL  A + V++AS NG++L K++NEA EIL+ I +N+ QW       SR    V E++ L  L  QM +M ++L N+ 
Subjt:  EEFTSCIRRHDPGGVHAKHFYNGLTQAFKAVVNASTNGSLLKKTFNEANEILDAIVANHSQWGEEET-PSRSSVKVQEINHLETLLEQMLTMNDMLMNLT

Query:  LGNQVQAAADEDN----------------------TLTYLGRKNASTSNATL-------------FKQGGQLEFNSQTHMKIAQPLQQHMQVQPQSTTNL
        +G  VQ AA                          ++ Y+G +N + +N                F  GGQ + +           QQ  Q Q   T++L
Subjt:  LGNQVQAAADEDN----------------------TLTYLGRKNASTSNATL-------------FKQGGQLEFNSQTHMKIAQPLQQHMQVQPQSTTNL

Query:  ENLMRKMMTKNDTVIQSQTASLRSLESQIGQLAKEMRNRPLGTLPSNTENPKREGLGKDKASIVPSG
        E+LMR  M KNDTVIQSQ ASLR+LE Q+GQLA +++NRP GTLPS+TENP+R+G    KA  + SG
Subjt:  ENLMRKMMTKNDTVIQSQTASLRSLESQIGQLAKEMRNRPLGTLPSNTENPKREGLGKDKASIVPSG

XP_030503898.1 uncharacterized protein LOC115719117 [Cannabis sativa] | 9.7e-31 | 39
Query:  EEFTSCIRRHDPGGVHAKHFYNGLTQAFKAVVNASTNGSLLKKTFNEANEILDAIVANHSQWGEEETP-SRSSVKVQEINHLETLLEQMLTMNDMLMNLT
        E    C     P  +  + FYNGL  A + V++AS NG++L K++NEA EIL+ I +N+ QW     P SR    V E++ L  L  QM +M ++L N+ 
Subjt:  EEFTSCIRRHDPGGVHAKHFYNGLTQAFKAVVNASTNGSLLKKTFNEANEILDAIVANHSQWGEEETP-SRSSVKVQEINHLETLLEQMLTMNDMLMNLT

Query:  LGNQVQAA----ADEDNTLTYLGRKNASTSNATL-------------FKQGGQLEFNSQTHMKIAQPL----------QQHMQVQPQSTTNLENLMRKMM
        +G  VQ A     +  + L Y+G +N + +N                F  GGQ   +S    +  Q            QQ  Q Q   T++LE+LMR  M
Subjt:  LGNQVQAA----ADEDNTLTYLGRKNASTSNATL-------------FKQGGQLEFNSQTHMKIAQPL----------QQHMQVQPQSTTNLENLMRKMM

Query:  TKNDTVIQSQTASLRSLESQIGQLAKEMRNRPLGTLPSNTENPKREGLGKDKASIVPSG
         KND VIQSQ ASLR+LE Q+GQLA +++NRP GTLPS+TENP+R+G    KA  + SG
Subjt:  TKNDTVIQSQTASLRSLESQIGQLAKEMRNRPLGTLPSNTENPKREGLGKDKASIVPSG

XP_030509259.1 uncharacterized protein LOC115723937 [Cannabis sativa] | 1.2e-28 | 37.05
Query:  EEFTSCIRRHDPGGVHAKHFYNGLTQAFKAVVNASTNGSLLKKTFNEANEILDAIVANHSQWGEEETP-SRSSVKVQEINHLETLLEQMLTMNDMLMNLT
        E    C     P  +  + FYNGL  A + V++AS NG++L K++NEA EIL+ I +N+ QW     P SR    V E++ L  L  QM +M ++L N+ 
Subjt:  EEFTSCIRRHDPGGVHAKHFYNGLTQAFKAVVNASTNGSLLKKTFNEANEILDAIVANHSQWGEEETP-SRSSVKVQEINHLETLLEQMLTMNDMLMNLT

Query:  LGNQVQAAA----------------------DEDNTLTYLGRKNASTSNATL-------------FKQGGQLEFNS---QTHMKIAQP--------LQQH
        +G  VQ AA                          ++ Y+G +N + +N                F  GGQ   +S   Q   K + P         QQ 
Subjt:  LGNQVQAAA----------------------DEDNTLTYLGRKNASTSNATL-------------FKQGGQLEFNS---QTHMKIAQP--------LQQH

Query:  MQVQPQSTTNLENLMRKMMTKNDTVIQSQTASLRSLESQIGQLAKEMRNRPLGTLPSNTENPKREGLGKDKASIVPSG
         Q Q   T++LE+LMR  M KND VIQSQ ASLR+LE Q+GQLA +++NRP GTLPS+TENP+R+G    KA  + SG
Subjt:  MQVQPQSTTNLENLMRKMMTKNDTVIQSQTASLRSLESQIGQLAKEMRNRPLGTLPSNTENPKREGLGKDKASIVPSG

XP_030510138.1 uncharacterized protein LOC115724905 [Cannabis sativa] | 2.6e-28 | 36.76
Query:  EEFTSCIRRHDPGGVHAKHFYNGLTQAFKAVVNASTNGSLLKKTFNEANEILDAIVANHSQWGEEETP-SRSSVKVQEINHLETLLEQMLTMNDMLMNLT
        E    C     P  +  + FYNGL  A + V++AS NG++L K++NEA EIL+ I +N+ QW     P SR    V E++ L  L  QM +M ++L N+ 
Subjt:  EEFTSCIRRHDPGGVHAKHFYNGLTQAFKAVVNASTNGSLLKKTFNEANEILDAIVANHSQWGEEETP-SRSSVKVQEINHLETLLEQMLTMNDMLMNLT

Query:  LGNQVQAAA----------------------DEDNTLTYLGRKNASTSNATL-------------FKQGGQLEFNSQTHMKIAQPL-----QQHMQVQPQ
        +G  VQ AA                          ++ Y+G +N + +N                F  GGQ   +S    +  Q       QQ  Q Q  
Subjt:  LGNQVQAAA----------------------DEDNTLTYLGRKNASTSNATL-------------FKQGGQLEFNSQTHMKIAQPL-----QQHMQVQPQ

Query:  STTNLENLMRKMMTKNDTVIQSQTASLRSLESQIGQLAKEMRNRPLGTLPSNTENPKREGLGKDKASIVPSG
         T++LE+LMR  M KND VIQSQ ASLR+LE Q+GQLA +++NRP GTLPS+TENP+R+     KA  + SG
Subjt:  STTNLENLMRKMMTKNDTVIQSQTASLRSLESQIGQLAKEMRNRPLGTLPSNTENPKREGLGKDKASIVPSG

TrEMBL top hits | e value | % identity
A0A5B6VWJ0 Retroelement pol polyprotein-like | 4.9e-20 | 31.87
Query:  EEFTSCIRRHDPGGVHAKHFYNGLTQAFKAVVNASTNGSLLKKTFNEANEILDAIVANHSQWGEEETPS-RSSVKVQEINHLETLLEQMLTMNDMLMNLT
        E    C     P  +  + FYNGL    + VV+AS NG+LL K++NEA EI++ I +N+ QW      S R    + E++ + +L  Q+ +++ M  NLT
Subjt:  EEFTSCIRRHDPGGVHAKHFYNGLTQAFKAVVNASTNGSLLKKTFNEANEILDAIVANHSQWGEEETPS-RSSVKVQEINHLETLLEQMLTMNDMLMNLT

Query:  LGNQVQAAADEDN------------------------TLTYLGRKNASTSNATLFKQGGQLEF-NS--QTHMKIAQ----------------------PL
               AA   N                        ++ Y+G +N +       +QG Q  F NS  + H+  +                       P 
Subjt:  LGNQVQAAADEDN------------------------TLTYLGRKNASTSNATLFKQGGQLEF-NS--QTHMKIAQ----------------------PL

Query:  QQHMQVQPQSTTNLENLMRKMMTKNDTVIQSQTASLRSLESQIGQLAKEMRNRPLGTLPSNTENPKREGLGKD
        Q    VQ +++ +LE+L++  M KND +IQSQ A+L++LE+Q+GQLA E+RNR  G LPS+TENP+   LGK+
Subjt:  QQHMQVQPQSTTNLENLMRKMMTKNDTVIQSQTASLRSLESQIGQLAKEMRNRPLGTLPSNTENPKREGLGKD

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC110412945 | 6.1e-15 | 29.26
Query:  EEFTSCIRR---HD-PGGVHAKHFYNGLTQAFKAVVNASTNGSLLKKTFNEANEILDAIVANHSQWGEEETPSRSSVKVQEINHLETLLEQMLTMNDMLM
        E F   +RR   H  P  +  + FYNGL  + K +++A+  G+L+ K   +A  +L+ + +N+ QW  E + SR +V   EI+ L TL  Q+  ++  L 
Subjt:  EEFTSCIRR---HD-PGGVHAKHFYNGLTQAFKAVVNASTNGSLLKKTFNEANEILDAIVANHSQWGEEETPSRSSVKVQEINHLETLLEQMLTMNDMLM

Query:  NLTL---------------GNQVQAAADEDNTLTYLGRKNASTSN--ATLFKQGGQLEFN-------SQTHMKIAQP--LQQHMQVQ-PQSTTNLENLMR
         L +                +          ++ ++G  N   +N  +  +  G +   N         ++ K   P   QQ  + Q P+  + LE L+ 
Subjt:  NLTL---------------GNQVQAAADEDNTLTYLGRKNASTSN--ATLFKQGGQLEFN-------SQTHMKIAQP--LQQHMQVQ-PQSTTNLENLMR

Query:  KMMTKNDTVIQSQTASLRSLESQIGQLAKEMRNRPLGTLPSNTE-NPKREGLGKDKASIVPSGVPEKSEG
        + ++K D +IQSQ ASLR+LE+Q+GQLA  + NRP G+LPS+T+ NPK    GK++   +     ++ EG
Subjt:  KMMTKNDTVIQSQTASLRSLESQIGQLAKEMRNRPLGTLPSNTE-NPKREGLGKDKASIVPSGVPEKSEG

A0A6J1DW02 uncharacterized protein LOC111024897 | 7.0e-19 | 31.27
Query:  MNEKPEEFTSCIRRHDPGGVHA----KHFYNGLTQAFKAVVNASTNGSLLKKTFNEANEILDAIVANHSQWGEEETPSRSSVKVQE------INHLETLL
        ++E  E F   IR+    G+ A    +HF+ GL    K ++N + NG+  KKTFNE  +IL+ + +++  W  +   SR++ K Q+      ++   ++ 
Subjt:  MNEKPEEFTSCIRRHDPGGVHA----KHFYNGLTQAFKAVVNASTNGSLLKKTFNEANEILDAIVANHSQWGEEETPSRSSVKVQE------INHLETLL

Query:  EQMLTMNDMLMNLTLG------NQVQAAADEDNTLTYLGRKN-----ASTSNATLFKQGGQLEFN----SQTHMKIAQPLQQHM-----------QVQP-
        ++M+TMN  L  + LG        +Q    +  T   + + N         N +   QGG   FN     Q       P QQH+           Q  P 
Subjt:  EQMLTMNDMLMNLTLG------NQVQAAADEDNTLTYLGRKN-----ASTSNATLFKQGGQLEFN----SQTHMKIAQPLQQHM-----------QVQP-

Query:  -QSTTNLENLMRKMMTKNDTVIQSQTASLRSLESQIGQLAKEMRNRPLGTLPSNTENPKREGLGKDKASIVPSGV
          + +NLEN+M++ M + D VIQSQ AS+R+  +Q+G LA E++NRP G+ P +TE P+REG  + KA  + SG+
Subjt:  -QSTTNLENLMRKMMTKNDTVIQSQTASLRSLESQIGQLAKEMRNRPLGTLPSNTENPKREGLGKDKASIVPSGV

A0A6J1DWK1 uncharacterized protein LOC111025053 | 7.8e-18 | 31.78
Query:  FTSCIRRHDPGGVHAKHFYNGLTQAFKAVVNASTNGSLLKKTFNEANEILDAIVANHSQWGE----EETPSRSSVKVQEINHLETLLEQMLTMNDMLMNL
        F  C     PG +  + +Y GL  A + V++AS NG+LL K + +A  IL+ I +++  W +    E   S+  V+ +    L + +E +  +N+   N 
Subjt:  FTSCIRRHDPGGVHAKHFYNGLTQAFKAVVNASTNGSLLKKTFNEANEILDAIVANHSQWGE----EETPSRSSVKVQEINHLETLLEQMLTMNDMLMNL

Query:  TLGNQVQAAADEDNTLTYLGRK---NASTSNATLFKQGGQLEFNSQTHMKIAQPLQQHMQVQPQSTTNLENLMRKMMTKNDTVIQSQTASLRSLESQIGQ
        +  N             + G +   N   SNA  F+Q    + +          + +H Q    S T+LEN+M++ M  ND  +QSQ ASLR+LE Q+GQ
Subjt:  TLGNQVQAAADEDNTLTYLGRK---NASTSNATLFKQGGQLEFNSQTHMKIAQPLQQHMQVQPQSTTNLENLMRKMMTKNDTVIQSQTASLRSLESQIGQ

Query:  LAKEMRNRPLGTLPSNTENPKREGLGKDKASIVPSG
        LA ++++RP+G LPS+TE PKR+   +  A  + SG
Subjt:  LAKEMRNRPLGTLPSNTENPKREGLGKDKASIVPSG

A0A6J1G7Q6 uncharacterized protein LOC111451598 | 5.5e-24 | 33.8
Query:  EEFTSCIRRHDPGGVHAKHFYNGLTQAFKAVVNASTNGSLLKKTFNEANEILDAIVANHSQWGE-EETPSRSSVKVQEINHLETLLEQMLTMNDMLMNLT
        E    C     P  +  + FYNGL  A K VV+AS NG +L KT+NEA EIL+ I +N+ QW +    P + + +V E++ L ++  Q+ +M ++L NL 
Subjt:  EEFTSCIRRHDPGGVHAKHFYNGLTQAFKAVVNASTNGSLLKKTFNEANEILDAIVANHSQWGE-EETPSRSSVKVQEINHLETLLEQMLTMNDMLMNLT

Query:  L--GNQVQAAADEDNTL--------TYLGRKN----ASTSNATLFKQG------------------------------GQLEFNSQTHMKIAQP----LQ
           G+ ++A A     +         Y G K+      ++ A++F  G                              GQ  +N Q   K   P    LQ
Subjt:  L--GNQVQAAADEDNTL--------TYLGRKN----ASTSNATLFKQG------------------------------GQLEFNSQTHMKIAQP----LQ

Query:  QHM---------------QVQPQSTTNLENLMRKMMTKNDTVIQSQTASLRSLESQIGQLAKEMRNRPLGTLPSNTENPKREGL
          +               Q Q  S T LE+L+++ M +ND VIQSQ  SLR+LE Q+GQLA E+RNRPLG LP++TE PKREG+
Subjt:  QHM---------------QVQPQSTTNLENLMRKMMTKNDTVIQSQTASLRSLESQIGQLAKEMRNRPLGTLPSNTENPKREGL

SwissProt top hits | e value | % identity
No hits found
Arabidopsis top hits | e value | % identity
No hits found

Sequences
CDS sequence
ATGAACGAAAAACCAGAAGAATTCACATCGTGTATTCGAAGGCACGATCCGGGTGGTGTCCATGCAAAGCATTTTTACAATGGATTGACTCAAGCCTTTAAAGCGGTAGT
CAATGCATCGACTAATGGCTCTTTGCTTAAAAAGACCTTCAATGAGGCAAATGAAATTTTAGATGCTATTGTTGCAAACCACAGTCAATGGGGAGAAGAAGAAACACCTT
CAAGGAGCAGTGTCAAGGTCCAAGAAATTAATCATCTCGAGACACTCTTAGAACAGATGTTAACTATGAACGATATGCTTATGAATCTGACTTTGGGAAATCAAGTTCAA
GCTGCAGCAGATGAAGACAACACCCTAACTTATCTTGGGAGGAAAAATGCAAGCACATCAAATGCAACCCTATTCAAGCAGGGAGGACAACTCGAATTTAATAGTCAGAC
TCACATGAAGATTGCACAGCCTTTGCAGCAGCATATGCAAGTGCAACCCCAATCCACAACAAACTTGGAGAATCTCATGAGGAAAATGATGACCAAGAATGACACTGTAA
TTCAAAGTCAAACAGCTTCCTTAAGGAGTCTTGAATCTCAGATTGGGCAGCTGGCTAAGGAAATGAGGAACAGACCATTAGGGACCCTACCGAGCAATACAGAGAATCCT
AAAAGAGAAGGCCTAGGCAAGGATAAGGCATCCATTGTCCCAAGCGGAGTTCCAGAAAAATCTGAAGGTCCAGAGTGA
mRNA sequence
ATGAACGAAAAACCAGAAGAATTCACATCGTGTATTCGAAGGCACGATCCGGGTGGTGTCCATGCAAAGCATTTTTACAATGGATTGACTCAAGCCTTTAAAGCGGTAGT
CAATGCATCGACTAATGGCTCTTTGCTTAAAAAGACCTTCAATGAGGCAAATGAAATTTTAGATGCTATTGTTGCAAACCACAGTCAATGGGGAGAAGAAGAAACACCTT
CAAGGAGCAGTGTCAAGGTCCAAGAAATTAATCATCTCGAGACACTCTTAGAACAGATGTTAACTATGAACGATATGCTTATGAATCTGACTTTGGGAAATCAAGTTCAA
GCTGCAGCAGATGAAGACAACACCCTAACTTATCTTGGGAGGAAAAATGCAAGCACATCAAATGCAACCCTATTCAAGCAGGGAGGACAACTCGAATTTAATAGTCAGAC
TCACATGAAGATTGCACAGCCTTTGCAGCAGCATATGCAAGTGCAACCCCAATCCACAACAAACTTGGAGAATCTCATGAGGAAAATGATGACCAAGAATGACACTGTAA
TTCAAAGTCAAACAGCTTCCTTAAGGAGTCTTGAATCTCAGATTGGGCAGCTGGCTAAGGAAATGAGGAACAGACCATTAGGGACCCTACCGAGCAATACAGAGAATCCT
AAAAGAGAAGGCCTAGGCAAGGATAAGGCATCCATTGTCCCAAGCGGAGTTCCAGAAAAATCTGAAGGTCCAGAGTGA
Protein sequence
MNEKPEEFTSCIRRHDPGGVHAKHFYNGLTQAFKAVVNASTNGSLLKKTFNEANEILDAIVANHSQWGEEETPSRSSVKVQEINHLETLLEQMLTMNDMLMNLTLGNQVQ
AAADEDNTLTYLGRKNASTSNATLFKQGGQLEFNSQTHMKIAQPLQQHMQVQPQSTTNLENLMRKMMTKNDTVIQSQTASLRSLESQIGQLAKEMRNRPLGTLPSNTENP
KREGLGKDKASIVPSGVPEKSEGPE
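
The CDS and protein sequences above can be cross-checked by translating the CDS codon by codon. The following is a minimal sketch, not a CuGenDB tool: the function name `translate` is hypothetical, and it assumes the standard nuclear genetic code (NCBI translation table 1), with translation starting at the first base and stopping at the first in-frame stop codon.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G at each position: TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a CDS string into a protein string.

    Whitespace and line breaks are ignored, so the multi-line sequence
    from the record can be pasted in directly. Unknown codons (for
    example those containing N) become 'X'; translation stops at the
    first stop codon, which is not included in the result.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First seven and last four codons of the CDS listed above.
    print(translate("ATGAACGAAAAACCAGAAGAA"))
    print(translate("GGTCCAGAGTGA"))
```

The two fragments translate to MNEKPEE and GPE, matching the start and end of the listed protein sequence. The full 738 nt CDS corresponds to 245 residues plus the TGA stop codon.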