; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0025391 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0025391
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationchr10:12355053..12356918
RNA-Seq ExpressionLag0025391
SyntenyLag0025391
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6588985.1 Retrovirus-related Pol polyprotein from transposon RE1, partial [Cucurbita argyrosperma subsp. sororia]3.4e-1932.73Show/hide
Query:  HHIVLYIDETTTEAPPAPVSSHINPLYEEWVAKDQALMTLINVTLSPATLAYVLVAHHPNKPGRFLKTL--------LFELKNQHRQFEVRSS-------
        H +  +ID T     P P+S   NP Y++W AKDQALMT+IN TLSP  LAYV+ +    +    L  L        +  LK+  +    +S        
Subjt:  HHIVLYIDETTTEAPPAPVSSHINPLYEEWVAKDQALMTLINVTLSPATLAYVLVAHHPNKPGRFLKTL--------LFELKNQHRQFEVRSS-------

Query:  ----------------INNQEVGGYSY--LCLKWLTF---------------------SEEAAIEKQSKQDDALTQPSAMFASQTTPNLSQRSNSSGNFN
                        +N++++  Y+   L  ++ TF                     +EE+A+ KQSK+DD   QP+A+ AS  +  +S  S  + NF 
Subjt:  ----------------INNQEVGGYSY--LCLKWLTF---------------------SEEAAIEKQSKQDDALTQPSAMFASQTTPNLSQRSNSSGNFN

Query:  RGRSSGRGR--------NQGRGRG----------------------------------FNPSGHHPPTQLAAMVASQNVAYCNTASSSVNN---GAEYAG
        RGR  GRG          QGRG G                                  +N  G HPP  LAAMVASQN A+ +  +S +N     + Y G
Subjt:  RGRSSGRGR--------NQGRGRG----------------------------------FNPSGHHPPTQLAAMVASQNVAYCNTASSSVNN---GAEYAG

Query:  DDQVSVGSGQSLPISHNGSGQMYGQGFVPK
        ++QV VGSGQSLPISH  SGQ   Q FVPK
Subjt:  DDQVSVGSGQSLPISHNGSGQMYGQGFVPK

KAG7015254.1 hypothetical protein SDJN02_22888, partial [Cucurbita argyrosperma subsp. argyrosperma]1.4e-1731.87Show/hide
Query:  HHIVLYIDETTTEAPPAPVSSHINPLYEEWVAKDQALMTLINVTLSPATLAYVLVAHHPNKPGRFLKTL--------LFELKNQHRQFEVRSS-------
        H +  +ID T     P P+S   NP Y++W AKDQALMT+IN TLSP  LAYV+ +    +    L  L        +  LK+  +    +S        
Subjt:  HHIVLYIDETTTEAPPAPVSSHINPLYEEWVAKDQALMTLINVTLSPATLAYVLVAHHPNKPGRFLKTL--------LFELKNQHRQFEVRSS-------

Query:  ----------------INNQEVGGYSY--LCLKWLTF---------------------SEEAAIEKQSKQDDALTQPSAMFASQTTPNLSQRSNSSGNFN
                        +N++++  Y+   L  ++ TF                     +EE+A+ KQSK+DD   QP+A+ AS  +  +S  S  + NF 
Subjt:  ----------------INNQEVGGYSY--LCLKWLTF---------------------SEEAAIEKQSKQDDALTQPSAMFASQTTPNLSQRSNSSGNFN

Query:  RGRSSGRGR--------NQGRGRG----------------------------------FNPSGHHPPTQLAAMVASQNVAYCNTASSSVNN---GAEYAG
        RGR  GRG          QGRG G                                  +N  G HPP  LAAMVASQN A+ +  +S +N     + Y G
Subjt:  RGRSSGRGR--------NQGRGRG----------------------------------FNPSGHHPPTQLAAMVASQNVAYCNTASSSVNN---GAEYAG

Query:  DDQVSVGSGQSLPISHNGSG
        ++QV VGSGQSLPISH+G G
Subjt:  DDQVSVGSGQSLPISHNGSG

XP_008448008.1 PREDICTED: uncharacterized protein LOC103490319 isoform X3 [Cucumis melo]5.1e-1529.17Show/hide
Query:  TTTEAPPAPVSSHINPLYEEWVAKDQALMTLINVTLSPATLAYVLVAHHP------------------------------NKPGRFLKTLLFELKN-QHR
        T   +  + V    NP YE+W+AKDQALMT+IN TLSP  LAYV+ +                                  KP   +   +  +K  + +
Subjt:  TTTEAPPAPVSSHINPLYEEWVAKDQALMTLINVTLSPATLAYVLVAHHP------------------------------NKPGRFLKTLLFELKN-QHR

Query:  QFEVRSSINNQEVGGYSY--LCLKWLTF---------------------SEEAAIEKQSKQDDALTQPSAMFASQTTPNLSQRSNSSGNFNRGRSSGRGR
           V + IN +++  Y+   L  ++ TF                     +EE+A+ KQSK DD+  QP+ + +S  +  LS       NF RG  +G G+
Subjt:  QFEVRSSINNQEVGGYSY--LCLKWLTF---------------------SEEAAIEKQSKQDDALTQPSAMFASQTTPNLSQRSNSSGNFNRGRSSGRGR

Query:  NQGRGR-------------------------------------------GFNPSGHHPPTQLAAMVASQNVAY-------------CNTASSS----VNN
        + G GR                                            +N  G HPP QLAAMVASQN A+             CNT  +S    V+ 
Subjt:  NQGRGR-------------------------------------------GFNPSGHHPPTQLAAMVASQNVAY-------------CNTASSS----VNN

Query:  GAEYAGDDQVSVGSGQSLPISHNGSGQMYGQGFVPK
          EY G++QV +G+GQ+ P+SH  SGQ++G+ FVPK
Subjt:  GAEYAGDDQVSVGSGQSLPISHNGSGQMYGQGFVPK

XP_011658579.1 uncharacterized protein LOC105436058 [Cucumis sativus]4.2e-1730Show/hide
Query:  HHIVLYIDET-----TTEAPPAPVSSHINPLYEEWVAKDQALMTLINVTLSPATLAYVLVAHHP------------------------------NKPGRF
        H +  ++D T     T+ +  + V    NPLYE+W+AKDQALMT+IN TLSP  LAYV+ +                                  KP   
Subjt:  HHIVLYIDET-----TTEAPPAPVSSHINPLYEEWVAKDQALMTLINVTLSPATLAYVLVAHHP------------------------------NKPGRF

Query:  LKTLLFELKN-QHRQFEVRSSINNQEVGGYSY--LCLKWLTF---------------------SEEAAIEKQSKQDDALTQPSAMFASQTTPNLSQRSNS
        +   +  +K  + +   V + IN +++  Y+   L  ++ TF                     +EE+A+ KQSK DD+  QP+ + +S  +  LS     
Subjt:  LKTLLFELKN-QHRQFEVRSSINNQEVGGYSY--LCLKWLTF---------------------SEEAAIEKQSKQDDALTQPSAMFASQTTPNLSQRSNS

Query:  SGNFNRGRSSGRGRNQGRGR-------------------------------------------GFNPSGHHPPTQLAAMVASQNVAY-------------
        + NF RG  +G G+N G GR                                            +N  G HPP QLAAMVASQN A+             
Subjt:  SGNFNRGRSSGRGRNQGRGR-------------------------------------------GFNPSGHHPPTQLAAMVASQNVAY-------------

Query:  CNTASSS----VNNGAEYAGDDQVSVGSGQSLPISHNGSGQMYGQGFVPK
        CNT  +S    V+   EY G++QV VG+GQ+ PISH  SGQ++G+ FVPK
Subjt:  CNTASSS----VNNGAEYAGDDQVSVGSGQSLPISHNGSGQMYGQGFVPK

XP_022150845.1 uncharacterized protein LOC111018892 [Momordica charantia]6.9e-2029.75Show/hide
Query:  IDGSIIHHIVLYIDETTTEAPPAPVSS--HINPLYEEWVAKDQALMTLINVTLSPATLAYV-----------LVAHHPNKPGR-----------------
        IDGS+          + TE+ P   +S   INP +E+W+AKDQALMTLIN TLS   LAYV           ++  H +   R                 
Subjt:  IDGSIIHHIVLYIDETTTEAPPAPVSS--HINPLYEEWVAKDQALMTLINVTLSPATLAYV-----------LVAHHPNKPGR-----------------

Query:  ------FLK--------------------TLLFELKNQHRQFEVRSSINNQEVGGYSYLCLKWLTFSEEAAIEKQSKQDDALTQPSAMFASQTTPNLSQR
              ++K                     L++ L     ++   S+         S+  L     SEE+AIEKQ K++D +TQP+A+FAS  +P    R
Subjt:  ------FLK--------------------TLLFELKNQHRQFEVRSSINNQEVGGYSYLCLKWLTFSEEAAIEKQSKQDDALTQPSAMFASQTTPNLSQR

Query:  S-----NSSGNFNRGRSSGRGR--------NQGRGR----------------------------------GFNPSGHHPPTQLAAMVASQNVAY------
        +     N S +  RG+++GRG+        NQGRGR                                   F+  G HPP QLAAMVA QN +Y      
Subjt:  S-----NSSGNFNRGRSSGRGR--------NQGRGR----------------------------------GFNPSGHHPPTQLAAMVASQNVAY------

Query:  ----------CNT-ASSSVNN------GAEYAGDDQVSVGSGQSLPISHNGSGQMYGQGFVPK
                  CNT  ++ ++N       ++Y G++ +SVGSGQS PI+H G GQ++G  +VP+
Subjt:  ----------CNT-ASSSVNN------GAEYAGDDQVSVGSGQSLPISHNGSGQMYGQGFVPK

TrEMBL top hitse value%identityAlignment
A0A1S3BI58 uncharacterized protein LOC103490319 isoform X21.5e-1228.09Show/hide
Query:  TTTEAPPAPVSSHINPLYEEWVAKDQALMTLINVTLSPATLAYVLVAHHP------------------------------NKPGRFLKTLLFELKN-QHR
        T   +  + V    NP YE+W+AKDQALMT+IN TLSP  LAYV+ +                                  KP   +   +  +K  + +
Subjt:  TTTEAPPAPVSSHINPLYEEWVAKDQALMTLINVTLSPATLAYVLVAHHP------------------------------NKPGRFLKTLLFELKN-QHR

Query:  QFEVRSSINNQEVGGYSY--LCLKWLTF---------------------SEEAAIEKQSKQDDALTQPSAMFASQTTPNLSQRSNSSGNFNRGRSSGRGR
           V + IN +++  Y+   L  ++ TF                     +EE+A+ KQSK DD+  QP+ + +S  +  LS       NF RG  +G G+
Subjt:  QFEVRSSINNQEVGGYSY--LCLKWLTF---------------------SEEAAIEKQSKQDDALTQPSAMFASQTTPNLSQRSNSSGNFNRGRSSGRGR

Query:  NQGRGR-------------------------------------------GFNPSGHHPPTQLAAMVASQNVAY-------------CNTASSS----VNN
        + G GR                                            +N  G HPP QLAAMVASQN A+             CNT  +S    V+ 
Subjt:  NQGRGR-------------------------------------------GFNPSGHHPPTQLAAMVASQNVAY-------------CNTASSS----VNN

Query:  GAEYAGDDQVSVGSGQSLPISHNG
          EY G++QV +G+GQ+ P+SH+G
Subjt:  GAEYAGDDQVSVGSGQSLPISHNG

A0A1S3BIR3 uncharacterized protein LOC103490319 isoform X32.5e-1529.17Show/hide
Query:  TTTEAPPAPVSSHINPLYEEWVAKDQALMTLINVTLSPATLAYVLVAHHP------------------------------NKPGRFLKTLLFELKN-QHR
        T   +  + V    NP YE+W+AKDQALMT+IN TLSP  LAYV+ +                                  KP   +   +  +K  + +
Subjt:  TTTEAPPAPVSSHINPLYEEWVAKDQALMTLINVTLSPATLAYVLVAHHP------------------------------NKPGRFLKTLLFELKN-QHR

Query:  QFEVRSSINNQEVGGYSY--LCLKWLTF---------------------SEEAAIEKQSKQDDALTQPSAMFASQTTPNLSQRSNSSGNFNRGRSSGRGR
           V + IN +++  Y+   L  ++ TF                     +EE+A+ KQSK DD+  QP+ + +S  +  LS       NF RG  +G G+
Subjt:  QFEVRSSINNQEVGGYSY--LCLKWLTF---------------------SEEAAIEKQSKQDDALTQPSAMFASQTTPNLSQRSNSSGNFNRGRSSGRGR

Query:  NQGRGR-------------------------------------------GFNPSGHHPPTQLAAMVASQNVAY-------------CNTASSS----VNN
        + G GR                                            +N  G HPP QLAAMVASQN A+             CNT  +S    V+ 
Subjt:  NQGRGR-------------------------------------------GFNPSGHHPPTQLAAMVASQNVAY-------------CNTASSS----VNN

Query:  GAEYAGDDQVSVGSGQSLPISHNGSGQMYGQGFVPK
          EY G++QV +G+GQ+ P+SH  SGQ++G+ FVPK
Subjt:  GAEYAGDDQVSVGSGQSLPISHNGSGQMYGQGFVPK

A0A1S4DWT9 uncharacterized protein LOC103490319 isoform X11.5e-1228.09Show/hide
Query:  TTTEAPPAPVSSHINPLYEEWVAKDQALMTLINVTLSPATLAYVLVAHHP------------------------------NKPGRFLKTLLFELKN-QHR
        T   +  + V    NP YE+W+AKDQALMT+IN TLSP  LAYV+ +                                  KP   +   +  +K  + +
Subjt:  TTTEAPPAPVSSHINPLYEEWVAKDQALMTLINVTLSPATLAYVLVAHHP------------------------------NKPGRFLKTLLFELKN-QHR

Query:  QFEVRSSINNQEVGGYSY--LCLKWLTF---------------------SEEAAIEKQSKQDDALTQPSAMFASQTTPNLSQRSNSSGNFNRGRSSGRGR
           V + IN +++  Y+   L  ++ TF                     +EE+A+ KQSK DD+  QP+ + +S  +  LS       NF RG  +G G+
Subjt:  QFEVRSSINNQEVGGYSY--LCLKWLTF---------------------SEEAAIEKQSKQDDALTQPSAMFASQTTPNLSQRSNSSGNFNRGRSSGRGR

Query:  NQGRGR-------------------------------------------GFNPSGHHPPTQLAAMVASQNVAY-------------CNTASSS----VNN
        + G GR                                            +N  G HPP QLAAMVASQN A+             CNT  +S    V+ 
Subjt:  NQGRGR-------------------------------------------GFNPSGHHPPTQLAAMVASQNVAY-------------CNTASSS----VNN

Query:  GAEYAGDDQVSVGSGQSLPISHNG
          EY G++QV +G+GQ+ P+SH+G
Subjt:  GAEYAGDDQVSVGSGQSLPISHNG

A0A5D3CLI6 T4.55.7e-1227.86Show/hide
Query:  TTTEAPPAPVSSHINPLYEEWVAKDQALMTLINVTLSPATLAYVLVAHHP------------------------------NKPGRFLKTLLFELKN-QHR
        T   +  + V    NP YE+W+AKDQALMT+IN TLSP  LAYV+ +                                  KP   +   +  +K  + +
Subjt:  TTTEAPPAPVSSHINPLYEEWVAKDQALMTLINVTLSPATLAYVLVAHHP------------------------------NKPGRFLKTLLFELKN-QHR

Query:  QFEVRSSINNQEVGGYSY--LCLKWLTF---------------------SEEAAIEKQSKQDDALTQPSAMFASQTTPNLSQRSNSSGNFNRGRSSGRGR
           V + IN +++  Y+   L  ++ TF                     +EE+A+ KQSK DD+  QP+ + +S  +  LS       NF RG  +G G+
Subjt:  QFEVRSSINNQEVGGYSY--LCLKWLTF---------------------SEEAAIEKQSKQDDALTQPSAMFASQTTPNLSQRSNSSGNFNRGRSSGRGR

Query:  NQGRGR-------------------------------------------GFNPSGHHPPTQLAAMVASQNVAY-------------CNTASSS----VNN
        + G GR                                            +N  G HPP QLAAMVASQN A+             CNT  +S    V+ 
Subjt:  NQGRGR-------------------------------------------GFNPSGHHPPTQLAAMVASQNVAY-------------CNTASSS----VNN

Query:  GAEYAGDDQVSVGSGQSLPISHN
          EY G++QV +G+GQ+ P+SH+
Subjt:  GAEYAGDDQVSVGSGQSLPISHN

A0A6J1D9L6 uncharacterized protein LOC1110188923.3e-2029.75Show/hide
Query:  IDGSIIHHIVLYIDETTTEAPPAPVSS--HINPLYEEWVAKDQALMTLINVTLSPATLAYV-----------LVAHHPNKPGR-----------------
        IDGS+          + TE+ P   +S   INP +E+W+AKDQALMTLIN TLS   LAYV           ++  H +   R                 
Subjt:  IDGSIIHHIVLYIDETTTEAPPAPVSS--HINPLYEEWVAKDQALMTLINVTLSPATLAYV-----------LVAHHPNKPGR-----------------

Query:  ------FLK--------------------TLLFELKNQHRQFEVRSSINNQEVGGYSYLCLKWLTFSEEAAIEKQSKQDDALTQPSAMFASQTTPNLSQR
              ++K                     L++ L     ++   S+         S+  L     SEE+AIEKQ K++D +TQP+A+FAS  +P    R
Subjt:  ------FLK--------------------TLLFELKNQHRQFEVRSSINNQEVGGYSYLCLKWLTFSEEAAIEKQSKQDDALTQPSAMFASQTTPNLSQR

Query:  S-----NSSGNFNRGRSSGRGR--------NQGRGR----------------------------------GFNPSGHHPPTQLAAMVASQNVAY------
        +     N S +  RG+++GRG+        NQGRGR                                   F+  G HPP QLAAMVA QN +Y      
Subjt:  S-----NSSGNFNRGRSSGRGR--------NQGRGR----------------------------------GFNPSGHHPPTQLAAMVASQNVAY------

Query:  ----------CNT-ASSSVNN------GAEYAGDDQVSVGSGQSLPISHNGSGQMYGQGFVPK
                  CNT  ++ ++N       ++Y G++ +SVGSGQS PI+H G GQ++G  +VP+
Subjt:  ----------CNT-ASSSVNN------GAEYAGDDQVSVGSGQSLPISHNGSGQMYGQGFVPK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTCAAATGCTCACAAAACTTCCAAGAACAAAATCTCATCTCAAGTCACAAAACCGACAAAATCGACTGACCGACCATGTATTTGGTCAGTTGATTTTAGAGAGTGT
TCAAAAACTTCCAGCTACCGACCGACCGAAATCGATAGACGGTTCCATCATCCACCACATTGTCCTCTACATCGACGAGACTACAACTGAAGCACCTCCTGCTCCAGTAT
CTTCTCATATTAATCCACTTTATGAGGAGTGGGTTGCCAAAGATCAAGCTCTAATGACATTGATCAATGTCACTCTGTCGCCGGCAACCTTAGCCTATGTGTTGGTTGCA
CATCATCCAAACAAGCCTGGGAGGTTCTTGAAAACACTACTCTTCGAGCTCAAGAACCAACATCGTCAATTTGAAGTCAGATCTTCAATCAATAACCAAGAAGTCGGAGG
ATATTCTTATCTATGCCTTAAATGGCTTACCTTCTCAGAGGAGGCTGCTATTGAGAAACAATCAAAGCAAGATGATGCCCTAACTCAGCCTTCTGCCATGTTTGCGTCTC
AAACAACGCCCAATCTCTCTCAGCGTTCCAACTCCTCTGGAAATTTTAATCGAGGAAGGTCATCTGGCCGTGGACGCAATCAAGGTCGTGGTCGTGGATTCAATCCTTCT
GGGCATCATCCACCTACGCAACTAGCAGCCATGGTGGCCTCTCAAAATGTTGCTTACTGCAACACAGCATCAAGCAGTGTGAATAATGGAGCTGAATATGCAGGTGATGA
TCAAGTATCAGTAGGCAGTGGCCAATCCCTCCCAATTTCTCACAATGGCTCAGGACAAATGTACGGGCAAGGTTTTGTTCCAAAGACCTAG
mRNA sequenceShow/hide mRNA sequence
ATGCTTCAAATGCTCACAAAACTTCCAAGAACAAAATCTCATCTCAAGTCACAAAACCGACAAAATCGACTGACCGACCATGTATTTGGTCAGTTGATTTTAGAGAGTGT
TCAAAAACTTCCAGCTACCGACCGACCGAAATCGATAGACGGTTCCATCATCCACCACATTGTCCTCTACATCGACGAGACTACAACTGAAGCACCTCCTGCTCCAGTAT
CTTCTCATATTAATCCACTTTATGAGGAGTGGGTTGCCAAAGATCAAGCTCTAATGACATTGATCAATGTCACTCTGTCGCCGGCAACCTTAGCCTATGTGTTGGTTGCA
CATCATCCAAACAAGCCTGGGAGGTTCTTGAAAACACTACTCTTCGAGCTCAAGAACCAACATCGTCAATTTGAAGTCAGATCTTCAATCAATAACCAAGAAGTCGGAGG
ATATTCTTATCTATGCCTTAAATGGCTTACCTTCTCAGAGGAGGCTGCTATTGAGAAACAATCAAAGCAAGATGATGCCCTAACTCAGCCTTCTGCCATGTTTGCGTCTC
AAACAACGCCCAATCTCTCTCAGCGTTCCAACTCCTCTGGAAATTTTAATCGAGGAAGGTCATCTGGCCGTGGACGCAATCAAGGTCGTGGTCGTGGATTCAATCCTTCT
GGGCATCATCCACCTACGCAACTAGCAGCCATGGTGGCCTCTCAAAATGTTGCTTACTGCAACACAGCATCAAGCAGTGTGAATAATGGAGCTGAATATGCAGGTGATGA
TCAAGTATCAGTAGGCAGTGGCCAATCCCTCCCAATTTCTCACAATGGCTCAGGACAAATGTACGGGCAAGGTTTTGTTCCAAAGACCTAG
Protein sequenceShow/hide protein sequence
MLQMLTKLPRTKSHLKSQNRQNRLTDHVFGQLILESVQKLPATDRPKSIDGSIIHHIVLYIDETTTEAPPAPVSSHINPLYEEWVAKDQALMTLINVTLSPATLAYVLVA
HHPNKPGRFLKTLLFELKNQHRQFEVRSSINNQEVGGYSYLCLKWLTFSEEAAIEKQSKQDDALTQPSAMFASQTTPNLSQRSNSSGNFNRGRSSGRGRNQGRGRGFNPS
GHHPPTQLAAMVASQNVAYCNTASSSVNNGAEYAGDDQVSVGSGQSLPISHNGSGQMYGQGFVPKT