; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0004647 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0004647
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr6:5753586..5755509
RNA-Seq ExpressionLag0004647
SyntenyLag0004647
Gene Ontology termsGO:0005488 - binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GFY82848.1 hypothetical protein Acr_02g0010880 [Actinidia rufa]4.0e-3444.65Show/hide
Query:  LKTQLQRIKKDGLSVSQYLSQIKEITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPALEEVRSLLLAYEARLEKQSSVDQLSLAQANFA
        L+T LQ IKKDGL+   Y+ + + + +  ++IGEP++Y DHL + L GLG +YN FVTSIQ+++  P++EEV SLLL+Y+ARLE+QS+ D LS  QAN A
Subjt:  LKTQLQRIKKDGLSVSQYLSQIKEITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPALEEVRSLLLAYEARLEKQSSVDQLSLAQANFA

Query:  NLNVH-----SNSRRSVSRSPTNFSPAGFNHN-HYKNDP------PQCQICGKFGHTALICHHRTNLAYQTPPP-----QAMLT---TAAHSFSSPDSVS
        NL        + S  S   S +   P G N N  Y  +P      P+CQIC K GHTA  C+HRTNL YQ PPP      A +T   + + S + P S S
Subjt:  NLNVH-----SNSRRSVSRSPTNFSPAGFNHN-HYKNDP------PQCQICGKFGHTALICHHRTNLAYQTPPP-----QAMLT---TAAHSFSSPDSVS

Query:  TTSMSSYHPDENWSH
        +   SS++ D   SH
Subjt:  TTSMSSYHPDENWSH

RVW69807.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]2.8e-3535.19Show/hide
Query:  PYPTLPQPLSVKLTDTNFFAMEEPAL---------------ECAPSKFLDDQCSQPNPEFLTWER-----------------------------------
        P P+L Q LS+KL +TN    +   L               + +P K+LD  C Q NPEF+ W+R                                   
Subjt:  PYPTLPQPLSVKLTDTNFFAMEEPAL---------------ECAPSKFLDDQCSQPNPEFLTWER-----------------------------------

Query:  ----------IMGLKTQLQRIKKDGLSVSQYLSQIKEITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPALEEVRSLLLAYEARLEKQS
                  +M L +QLQRIKK  + +S+YLS++K + D+F+ IGEP+SYRD L  IL+GL  EY+ FVTSI NRSD P+L+EV SLL  YE RL ++S
Subjt:  ----------IMGLKTQLQRIKKDGLSVSQYLSQIKEITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPALEEVRSLLLAYEARLEKQS

Query:  SVDQLSLAQANFANLNVHSNSRRSVSRSPTNFSPAGFNHNHYKNDPPQCQICGKFGHTALICHHRTNLAYQTP------------------PPQAMLTTA
            L+  QAN               R P            Y N  PQCQICGK GH AL  +HRTNL Y  P                  P  AMLTT+
Subjt:  SVDQLSLAQANFANLNVHSNSRRSVSRSPTNFSPAGFNHNHYKNDPPQCQICGKFGHTALICHHRTNLAYQTP------------------PPQAMLTTA

Query:  AHSFSSPDSVSTTSMSSYHPDENW
        A         + T +SS   D +W
Subjt:  AHSFSSPDSVSTTSMSSYHPDENW

XP_022148871.1 uncharacterized protein LOC111017438 [Momordica charantia]2.1e-3852.81Show/hide
Query:  QLQRIKKDGLSVSQYLSQIKEITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPALEEVRSLLLAYEARLEKQSSVDQLSLAQANFANLN
        ++Q++KKDGLSVSQYL++IKEIT K S+IGEPIS +DH+++I++GLG EYNAFVTSIQNRSD   LE+VR+LLLAY+ RLEKQ+SVDQL++ QAN ANL 
Subjt:  QLQRIKKDGLSVSQYLSQIKEITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPALEEVRSLLLAYEARLEKQSSVDQLSLAQANFANLN

Query:  VHSNSRRSVS-RSPTNFSPAGFNHN---------HYKNDPP-------------QCQICGKFGHTALICHHRTNLAYQ
        ++  S  + + R P+  SP  FN           +  + PP             QCQIC K GHT   C+HR NL Y+
Subjt:  VHSNSRRSVS-RSPTNFSPAGFNHN---------HYKNDPP-------------QCQICGKFGHTALICHHRTNLAYQ

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia]3.5e-6243.82Show/hide
Query:  FFPPQVN-PSQPSTPFAPNPYPTLPQPLSVKLTDTNFFAMEEPALECA---------------PSKFLDDQCSQPNPEFLTWE-----------------
        F PP  N  +QP  PF+ NP+PTLPQPL+VKL D NF   +   L                  P +FLD    QPNP +  WE                 
Subjt:  FFPPQVN-PSQPSTPFAPNPYPTLPQPLSVKLTDTNFFAMEEPALECA---------------PSKFLDDQCSQPNPEFLTWE-----------------

Query:  ----------------------------RIMGLKTQLQRIKKDGLSVSQYLSQIKEITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPA
                                    RIMGLKT+LQ ++KDG SVSQYL++IKEI DKF+A+GEP+SYRDHLAH+LDGLGSEYNAFVTSI NR+D+P+
Subjt:  ----------------------------RIMGLKTQLQRIKKDGLSVSQYLSQIKEITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPA

Query:  LEEVRSLLLAYEARLEKQSSVDQLSLAQANFANLNVHSNSRRSVSRSPTNFSPAGFNHNHYKND------------------------PP-------QCQ
        LE+VRSLLLAYEARL+KQ++VDQL++AQAN  NL++  NS+    R P  FS      NHYK+                         PP       QCQ
Subjt:  LEEVRSLLLAYEARLEKQSSVDQLSLAQANFANLNVHSNSRRSVSRSPTNFSPAGFNHNHYKND------------------------PP-------QCQ

Query:  ICGKFGHTALICHHRTNLAYQTPPPQAMLTTAAHSFSSPDSVSTTSMSSYHPDENW
        ICGK GH+A +C+HRTN+AY    PQA+      S + P S         HPDE+W
Subjt:  ICGKFGHTALICHHRTNLAYQTPPPQAMLTTAAHSFSSPDSVSTTSMSSYHPDENW

XP_038891713.1 uncharacterized protein LOC120081111 [Benincasa hispida]4.1e-3951.27Show/hide
Query:  MGLKTQLQRIKKDGLSVSQYLSQIKEITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPALEEVRSLLLAYEARLEKQSSVDQLSLAQAN
        M LK +LQ+I+KD LS+SQYLSQIK++ DKFS +GE ISYRDHL HILDGLGSEYNAFVTSIQN  DN ++E+V SLLL+YEA+LEKQ+++D L++AQA 
Subjt:  MGLKTQLQRIKKDGLSVSQYLSQIKEITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPALEEVRSLLLAYEARLEKQSSVDQLSLAQAN

Query:  FANLNVHSNSRRSVSR-----------SPTNFSP------------AGFNHNHYKNDP----PQCQICGKFGHTALICHHRTNLAYQTPPPQAMLTT
         + L+   NS+R+  R           SP NFSP              FN     + P    PQCQI  KFGH    CH   + AYQ   PQA +++
Subjt:  FANLNVHSNSRRSVSR-----------SPTNFSP------------AGFNHNHYKNDP----PQCQICGKFGHTALICHHRTNLAYQTPPPQAMLTT

TrEMBL top hitse value%identityAlignment
A0A438GC62 Retrovirus-related Pol polyprotein from transposon RE11.3e-3535.19Show/hide
Query:  PYPTLPQPLSVKLTDTNFFAMEEPAL---------------ECAPSKFLDDQCSQPNPEFLTWER-----------------------------------
        P P+L Q LS+KL +TN    +   L               + +P K+LD  C Q NPEF+ W+R                                   
Subjt:  PYPTLPQPLSVKLTDTNFFAMEEPAL---------------ECAPSKFLDDQCSQPNPEFLTWER-----------------------------------

Query:  ----------IMGLKTQLQRIKKDGLSVSQYLSQIKEITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPALEEVRSLLLAYEARLEKQS
                  +M L +QLQRIKK  + +S+YLS++K + D+F+ IGEP+SYRD L  IL+GL  EY+ FVTSI NRSD P+L+EV SLL  YE RL ++S
Subjt:  ----------IMGLKTQLQRIKKDGLSVSQYLSQIKEITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPALEEVRSLLLAYEARLEKQS

Query:  SVDQLSLAQANFANLNVHSNSRRSVSRSPTNFSPAGFNHNHYKNDPPQCQICGKFGHTALICHHRTNLAYQTP------------------PPQAMLTTA
            L+  QAN               R P            Y N  PQCQICGK GH AL  +HRTNL Y  P                  P  AMLTT+
Subjt:  SVDQLSLAQANFANLNVHSNSRRSVSRSPTNFSPAGFNHNHYKNDPPQCQICGKFGHTALICHHRTNLAYQTP------------------PPQAMLTTA

Query:  AHSFSSPDSVSTTSMSSYHPDENW
        A         + T +SS   D +W
Subjt:  AHSFSSPDSVSTTSMSSYHPDENW

A0A6J1D6N7 uncharacterized protein LOC1110174381.0e-3852.81Show/hide
Query:  QLQRIKKDGLSVSQYLSQIKEITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPALEEVRSLLLAYEARLEKQSSVDQLSLAQANFANLN
        ++Q++KKDGLSVSQYL++IKEIT K S+IGEPIS +DH+++I++GLG EYNAFVTSIQNRSD   LE+VR+LLLAY+ RLEKQ+SVDQL++ QAN ANL 
Subjt:  QLQRIKKDGLSVSQYLSQIKEITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPALEEVRSLLLAYEARLEKQSSVDQLSLAQANFANLN

Query:  VHSNSRRSVS-RSPTNFSPAGFNHN---------HYKNDPP-------------QCQICGKFGHTALICHHRTNLAYQ
        ++  S  + + R P+  SP  FN           +  + PP             QCQIC K GHT   C+HR NL Y+
Subjt:  VHSNSRRSVS-RSPTNFSPAGFNHN---------HYKNDPP-------------QCQICGKFGHTALICHHRTNLAYQ

A0A6J1DQX7 uncharacterized protein LOC1110223151.7e-6243.82Show/hide
Query:  FFPPQVN-PSQPSTPFAPNPYPTLPQPLSVKLTDTNFFAMEEPALECA---------------PSKFLDDQCSQPNPEFLTWE-----------------
        F PP  N  +QP  PF+ NP+PTLPQPL+VKL D NF   +   L                  P +FLD    QPNP +  WE                 
Subjt:  FFPPQVN-PSQPSTPFAPNPYPTLPQPLSVKLTDTNFFAMEEPALECA---------------PSKFLDDQCSQPNPEFLTWE-----------------

Query:  ----------------------------RIMGLKTQLQRIKKDGLSVSQYLSQIKEITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPA
                                    RIMGLKT+LQ ++KDG SVSQYL++IKEI DKF+A+GEP+SYRDHLAH+LDGLGSEYNAFVTSI NR+D+P+
Subjt:  ----------------------------RIMGLKTQLQRIKKDGLSVSQYLSQIKEITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPA

Query:  LEEVRSLLLAYEARLEKQSSVDQLSLAQANFANLNVHSNSRRSVSRSPTNFSPAGFNHNHYKND------------------------PP-------QCQ
        LE+VRSLLLAYEARL+KQ++VDQL++AQAN  NL++  NS+    R P  FS      NHYK+                         PP       QCQ
Subjt:  LEEVRSLLLAYEARLEKQSSVDQLSLAQANFANLNVHSNSRRSVSRSPTNFSPAGFNHNHYKND------------------------PP-------QCQ

Query:  ICGKFGHTALICHHRTNLAYQTPPPQAMLTTAAHSFSSPDSVSTTSMSSYHPDENW
        ICGK GH+A +C+HRTN+AY    PQA+      S + P S         HPDE+W
Subjt:  ICGKFGHTALICHHRTNLAYQTPPPQAMLTTAAHSFSSPDSVSTTSMSSYHPDENW

A0A7J0DER3 Uncharacterized protein1.6e-3345.41Show/hide
Query:  LKTQLQRIKKDGLSVSQYLSQIKEITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPALEEVRSLLLAYEARLEKQSSVDQLSLAQANFA
        L+T LQ IKKDGL+   Y+ + + + +  ++IGEP++Y DHL + L GLG +YN FVTSIQ+++  P++EEV SLLL+Y+ARLE+QS+ D LS  QAN A
Subjt:  LKTQLQRIKKDGLSVSQYLSQIKEITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPALEEVRSLLLAYEARLEKQSSVDQLSLAQANFA

Query:  NLNVH-----SNSRRSVSRSPTNFSPAGFNHN-HYKNDP------PQCQICGKFGHTALICHHRTNLAYQTPPP-----QAMLT-TAAHSFSSPDSVSTT
        NL        + S  S   S +   P G N N  Y  +P      P+CQIC K GHTA  C+HRTNL YQ PPP      A +T   + S S P S    
Subjt:  NLNVH-----SNSRRSVSRSPTNFSPAGFNHN-HYKNDP------PQCQICGKFGHTALICHHRTNLAYQTPPP-----QAMLT-TAAHSFSSPDSVSTT

Query:  SMSSYHP
         +S  HP
Subjt:  SMSSYHP

A0A7J0E8R3 Uncharacterized protein1.9e-3444.65Show/hide
Query:  LKTQLQRIKKDGLSVSQYLSQIKEITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPALEEVRSLLLAYEARLEKQSSVDQLSLAQANFA
        L+T LQ IKKDGL+   Y+ + + + +  ++IGEP++Y DHL + L GLG +YN FVTSIQ+++  P++EEV SLLL+Y+ARLE+QS+ D LS  QAN A
Subjt:  LKTQLQRIKKDGLSVSQYLSQIKEITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPALEEVRSLLLAYEARLEKQSSVDQLSLAQANFA

Query:  NLNVH-----SNSRRSVSRSPTNFSPAGFNHN-HYKNDP------PQCQICGKFGHTALICHHRTNLAYQTPPP-----QAMLT---TAAHSFSSPDSVS
        NL        + S  S   S +   P G N N  Y  +P      P+CQIC K GHTA  C+HRTNL YQ PPP      A +T   + + S + P S S
Subjt:  NLNVH-----SNSRRSVSRSPTNFSPAGFNHN-HYKNDP------PQCQICGKFGHTALICHHRTNLAYQTPPP-----QAMLT---TAAHSFSSPDSVS

Query:  TTSMSSYHPDENWSH
        +   SS++ D   SH
Subjt:  TTSMSSYHPDENWSH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)1.4e-0830.88Show/hide
Query:  RIMGLKTQLQRIKKDGLSVSQYLSQIKEITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPALEEVRSLLLAYEARLEKQSSVDQLSLAQ
        R +  + +L+    D LSV +Y  ++K ++D  + +  PIS R  + H+L+GL  +Y+  +  I+++S  P+  E RS+LL  E+RL  +S   + SL+ 
Subjt:  RIMGLKTQLQRIKKDGLSVSQYLSQIKEITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPALEEVRSLLLAYEARLEKQSSVDQLSLAQ

Query:  ANFANL-NVHSNSRRSVSRSPTNFSPAGFNHNHYKN
         N  +L NV     R   R P  +      HN+  N
Subjt:  ANFANL-NVHSNSRRSVSRSPTNFSPAGFNHNHYKN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAACAGAAGCTTCATCCTCCTCTTCGCTTTCTTCTTCTACTGAAATCACCCCACCGATTATCTTTCCATCTACACCAATCACCACTCCGATTGTCTCTCCGATTGC
CCAGACCCCCAAACAACCAACATCTCAACCGCGCCCCTTTCTCCCCCAAAATCGCCCTAATGTTGCTCCAACTCAACCCACGTTTAATCCATATCAACCACAACCGTTTT
ATCCAACTTCAGGCGTCTATCAACCTTTTTACCCCTCTTCTTTTCCTCGCCCTCAAGCTCCCCTGTTTTTTCCACCTCAAGTTAACCCATCCCAACCTTCGACTCCCTTT
GCCCCGAATCCCTATCCCACTTTACCACAGCCATTATCTGTCAAACTCACGGATACAAATTTTTTTGCTATGGAAGAACCAGCTCTTGAATGCGCTCCCTCCAAATTTCT
TGACGATCAATGCTCTCAGCCGAATCCTGAATTTCTGACTTGGGAAAGGATAATGGGTCTCAAAACTCAGCTTCAACGGATTAAGAAAGACGGTCTCTCTGTTAGTCAGT
ATTTGTCCCAAATTAAAGAGATTACTGATAAATTCTCAGCTATAGGAGAGCCCATTTCTTATCGTGACCACTTAGCTCATATTTTAGACGGTCTTGGAAGCGAATATAAT
GCGTTTGTCACTTCAATCCAGAATCGTTCTGATAACCCTGCTCTAGAGGAGGTTCGAAGTCTTCTCTTGGCTTATGAGGCGAGATTAGAAAAACAATCTAGTGTTGATCA
ACTTAGCTTAGCTCAAGCAAATTTTGCTAACCTTAATGTTCACAGTAACAGTCGCCGCTCTGTCTCTCGTAGTCCTACAAATTTTTCTCCGGCTGGTTTTAATCACAACC
ATTACAAAAATGACCCTCCACAGTGTCAAATTTGTGGAAAGTTTGGTCACACCGCCCTGATTTGTCATCACCGGACTAACTTAGCTTACCAAACACCCCCTCCCCAAGCT
ATGTTAACCACTGCTGCACATTCTTTCTCTTCACCTGATTCGGTCTCCACAACTTCCATGAGTTCTTATCACCCCGATGAGAATTGGTCTCATCGAAAAGCCACTTTAGT
TGATGTTTTCCTCAATGTGCTTGTCGACCGGTACTGTCCTGCCCTCCATATTAATCTAACCATCACAATTGAAGGCTCAGTAACCTTCGAGCGAGAGGTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGACAACAGAAGCTTCATCCTCCTCTTCGCTTTCTTCTTCTACTGAAATCACCCCACCGATTATCTTTCCATCTACACCAATCACCACTCCGATTGTCTCTCCGATTGC
CCAGACCCCCAAACAACCAACATCTCAACCGCGCCCCTTTCTCCCCCAAAATCGCCCTAATGTTGCTCCAACTCAACCCACGTTTAATCCATATCAACCACAACCGTTTT
ATCCAACTTCAGGCGTCTATCAACCTTTTTACCCCTCTTCTTTTCCTCGCCCTCAAGCTCCCCTGTTTTTTCCACCTCAAGTTAACCCATCCCAACCTTCGACTCCCTTT
GCCCCGAATCCCTATCCCACTTTACCACAGCCATTATCTGTCAAACTCACGGATACAAATTTTTTTGCTATGGAAGAACCAGCTCTTGAATGCGCTCCCTCCAAATTTCT
TGACGATCAATGCTCTCAGCCGAATCCTGAATTTCTGACTTGGGAAAGGATAATGGGTCTCAAAACTCAGCTTCAACGGATTAAGAAAGACGGTCTCTCTGTTAGTCAGT
ATTTGTCCCAAATTAAAGAGATTACTGATAAATTCTCAGCTATAGGAGAGCCCATTTCTTATCGTGACCACTTAGCTCATATTTTAGACGGTCTTGGAAGCGAATATAAT
GCGTTTGTCACTTCAATCCAGAATCGTTCTGATAACCCTGCTCTAGAGGAGGTTCGAAGTCTTCTCTTGGCTTATGAGGCGAGATTAGAAAAACAATCTAGTGTTGATCA
ACTTAGCTTAGCTCAAGCAAATTTTGCTAACCTTAATGTTCACAGTAACAGTCGCCGCTCTGTCTCTCGTAGTCCTACAAATTTTTCTCCGGCTGGTTTTAATCACAACC
ATTACAAAAATGACCCTCCACAGTGTCAAATTTGTGGAAAGTTTGGTCACACCGCCCTGATTTGTCATCACCGGACTAACTTAGCTTACCAAACACCCCCTCCCCAAGCT
ATGTTAACCACTGCTGCACATTCTTTCTCTTCACCTGATTCGGTCTCCACAACTTCCATGAGTTCTTATCACCCCGATGAGAATTGGTCTCATCGAAAAGCCACTTTAGT
TGATGTTTTCCTCAATGTGCTTGTCGACCGGTACTGTCCTGCCCTCCATATTAATCTAACCATCACAATTGAAGGCTCAGTAACCTTCGAGCGAGAGGTGTAA
Protein sequenceShow/hide protein sequence
MTTEASSSSSLSSSTEITPPIIFPSTPITTPIVSPIAQTPKQPTSQPRPFLPQNRPNVAPTQPTFNPYQPQPFYPTSGVYQPFYPSSFPRPQAPLFFPPQVNPSQPSTPF
APNPYPTLPQPLSVKLTDTNFFAMEEPALECAPSKFLDDQCSQPNPEFLTWERIMGLKTQLQRIKKDGLSVSQYLSQIKEITDKFSAIGEPISYRDHLAHILDGLGSEYN
AFVTSIQNRSDNPALEEVRSLLLAYEARLEKQSSVDQLSLAQANFANLNVHSNSRRSVSRSPTNFSPAGFNHNHYKNDPPQCQICGKFGHTALICHHRTNLAYQTPPPQA
MLTTAAHSFSSPDSVSTTSMSSYHPDENWSHRKATLVDVFLNVLVDRYCPALHINLTITIEGSVTFEREV