CuGenDBv2

Lag0027001 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0027001
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: chr10:44153934..44154889
RNA-Seq Expression: Lag0027001
Synteny: Lag0027001
Gene Ontology terms: NA
InterPro domains: IPR029472 - Retrotransposon Copia-like, N-terminal
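
The genome location above uses a `seqid:start..end` notation. As a minimal sketch of how such a string could be split into its parts, the helper below (the names `parse_location` and `span_length` are hypothetical, not part of CuGenDB) assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+?):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return seqid, start, end


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("chr10:44153934..44154889")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 956 bp on chr10.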


Homology
GenBank top hits (columns: e-value, % identity, alignment)
GFY85402.1 hypothetical protein Acr_04g0001400 [Actinidia rufa]; e-value 6.7e-33; identity 38.6%
Query:  PSSFPRPQAPLFFPPQVNPSQPSTPFAPNPYPTLPQPLSVKLTDTNFLLWKNQLLNAVLANGLHGFLDGSIPAPSKFLDDQCSQPNPEFLTWERYNRFIM
        P+S P P + +   P  NP   +T   PN  P++ QPL+VKL D N+++WK QLLN V+ANGL  FLDGS   P +FLD Q  Q NPEF +W+RYNR +M
Subjt:  PSSFPRPQAPLFFPPQVNPSQPSTPFAPNPYPTLPQPLSVKLTDTNFLLWKNQLLNAVLANGLHGFLDGSIPAPSKFLDDQCSQPNPEFLTWERYNRFIM

Query:  CWMYSSLSEEKMGEIVNLDTASEIWNSLKRSYDSKTTARIMGLKTQLQRIKKDGLS--------------------------------------------
         W+Y+S++E  +G+IV   +AS+IW +L+R Y + + A +  L+T LQ IKK+GL+                                            
Subjt:  CWMYSSLSEEKMGEIVNLDTASEIWNSLKRSYDSKTTARIMGLKTQLQRIKKDGLS--------------------------------------------

Query:  ---NRSDNPALEDVRSLLLAYEARLEKQ
           +++  P++E+V SLLL+Y+ARLE+Q
Subjt:  ---NRSDNPALEDVRSLLLAYEARLEKQ

GFZ12741.1 UBX domain-containing protein [Actinidia rufa]; e-value 8.2e-31; identity 47.5%
Query:  PQAPLFFPPQVNPSQPSTPFAPNP----------YPTLPQPLSVKLTDTNFLLWKNQLLNAVLANGLHGFLDGSIPAPSKFLDDQCSQPNPEFLTWERYN
        P AP   PP  NP   S+   PNP           P++ QPL+VKL D N+++WK QLLN V+ANGL  FLDGS   P +FLD Q  Q NPEF +W+RYN
Subjt:  PQAPLFFPPQVNPSQPSTPFAPNP----------YPTLPQPLSVKLTDTNFLLWKNQLLNAVLANGLHGFLDGSIPAPSKFLDDQCSQPNPEFLTWERYN

Query:  RFIMCWMYSSLSEEKMGEIVNLDTASEIWNSLKRSYDSKTTARIMGLKTQLQRIKKDGLS
        R +M W+Y+S++E  +G+IV   +AS+IW +L+R Y + + A +  L+T LQ IKK+GL+
Subjt:  RFIMCWMYSSLSEEKMGEIVNLDTASEIWNSLKRSYDSKTTARIMGLKTQLQRIKKDGLS

RVW56403.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]; e-value 3.3e-32; identity 43.89%
Query:  VNPS--QPSTPFAPN----PYPTLPQPLSVKLTDTNFLLWKNQLLNAVLANGLHGFLDGSIPAPSKFLDDQCSQPNPEFLTWERYNRFIMCWMYSSLSEE
        +NP    P+  F+PN     YP+L QPL++KL +TN  LWKNQLLN ++ANGL  F++G  P P+KFLDD   Q NP F+ WER N  +M W+Y SL+  
Subjt:  VNPS--QPSTPFAPN----PYPTLPQPLSVKLTDTNFLLWKNQLLNAVLANGLHGFLDGSIPAPSKFLDDQCSQPNPEFLTWERYNRFIMCWMYSSLSEE

Query:  KMGEIVNLDTASEIWNSLKRSYDSKTTARIMGLKTQLQRIKKDGLSNRS---------DNPALEDVRSLLLAYEARLEKQ
         +G IV   TA EIW +L R Y S +   ++ L +QLQ+IKK+G++            D  +  D+ SLL  YE RL+++
Subjt:  KMGEIVNLDTASEIWNSLKRSYDSKTTARIMGLKTQLQRIKKDGLSNRS---------DNPALEDVRSLLLAYEARLEKQ

RVW64278.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]; e-value 4.4e-32; identity 44.38%
Query:  FAPNPYPTLPQPLSVKLTDTNFLLWKNQLLNAVLANGLHGFLDGSIPAPSKFLDDQCSQPNPEFLTWERYNRFIMCWMYSSLSEEKMGEIVNLDTASEIW
        FA +  P+L Q  +V+L  +N+LLW+ Q+LN ++ANGL   + G IPAPS+FL D     NPE+  W+R NR +MCW+YSSL+E  M +I+ LDTASEIW
Subjt:  FAPNPYPTLPQPLSVKLTDTNFLLWKNQLLNAVLANGLHGFLDGSIPAPSKFLDDQCSQPNPEFLTWERYNRFIMCWMYSSLSEEKMGEIVNLDTASEIW

Query:  NSLKRSYDSKTTARIMGLKTQLQRIKKDGL-------------SNRSDNPALEDVRSLLLAYEARLEKQ
         +L++ + + + ARIM L+ QLQ  KK GL             ++  ++ +LE++ S+LL +E RLE+Q
Subjt:  NSLKRSYDSKTTARIMGLKTQLQRIKKDGL-------------SNRSDNPALEDVRSLLLAYEARLEKQ

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia]; e-value 3.7e-55; identity 55.05%
Query:  FFPPQVN-PSQPSTPFAPNPYPTLPQPLSVKLTDTNFLLWKNQLLNAVLANGLHGFLDGSIPAPSKFLDDQCSQPNPEFLTWERYNRFIMCWMYSSLSEE
        F PP  N  +QP  PF+ NP+PTLPQPL+VKL D NFLLWKNQLLNAV+ANGL G+LDG+I  P +FLD    QPNP +  WERYNR +MCW+YSSLSEE
Subjt:  FFPPQVN-PSQPSTPFAPNPYPTLPQPLSVKLTDTNFLLWKNQLLNAVLANGLHGFLDGSIPAPSKFLDDQCSQPNPEFLTWERYNRFIMCWMYSSLSEE

Query:  KMGEIVNLDTASEIWNSLKRSYDSKTTARIMGLKTQLQRIKKDGLS-----------------------------------------------NRSDNPA
        KMGE+V+L+T  +IW+SL R YDSKTTARIMGLKT+LQ ++KDG S                                               NR+D+P+
Subjt:  KMGEIVNLDTASEIWNSLKRSYDSKTTARIMGLKTQLQRIKKDGLS-----------------------------------------------NRSDNPA

Query:  LEDVRSLLLAYEARLEKQ
        LEDVRSLLLAYEARL+KQ
Subjt:  LEDVRSLLLAYEARLEKQ

TrEMBL top hits (columns: e-value, % identity, alignment)
A0A438F8X3 Retrovirus-related Pol polyprotein from transposon RE1; e-value 1.6e-32; identity 43.89%
Query:  VNPS--QPSTPFAPN----PYPTLPQPLSVKLTDTNFLLWKNQLLNAVLANGLHGFLDGSIPAPSKFLDDQCSQPNPEFLTWERYNRFIMCWMYSSLSEE
        +NP    P+  F+PN     YP+L QPL++KL +TN  LWKNQLLN ++ANGL  F++G  P P+KFLDD   Q NP F+ WER N  +M W+Y SL+  
Subjt:  VNPS--QPSTPFAPN----PYPTLPQPLSVKLTDTNFLLWKNQLLNAVLANGLHGFLDGSIPAPSKFLDDQCSQPNPEFLTWERYNRFIMCWMYSSLSEE

Query:  KMGEIVNLDTASEIWNSLKRSYDSKTTARIMGLKTQLQRIKKDGLSNRS---------DNPALEDVRSLLLAYEARLEKQ
         +G IV   TA EIW +L R Y S +   ++ L +QLQ+IKK+G++            D  +  D+ SLL  YE RL+++
Subjt:  KMGEIVNLDTASEIWNSLKRSYDSKTTARIMGLKTQLQRIKKDGLSNRS---------DNPALEDVRSLLLAYEARLEKQ

A0A438FWG3 Retrovirus-related Pol polyprotein from transposon RE1; e-value 2.1e-32; identity 44.38%
Query:  FAPNPYPTLPQPLSVKLTDTNFLLWKNQLLNAVLANGLHGFLDGSIPAPSKFLDDQCSQPNPEFLTWERYNRFIMCWMYSSLSEEKMGEIVNLDTASEIW
        FA +  P+L Q  +V+L  +N+LLW+ Q+LN ++ANGL   + G IPAPS+FL D     NPE+  W+R NR +MCW+YSSL+E  M +I+ LDTASEIW
Subjt:  FAPNPYPTLPQPLSVKLTDTNFLLWKNQLLNAVLANGLHGFLDGSIPAPSKFLDDQCSQPNPEFLTWERYNRFIMCWMYSSLSEEKMGEIVNLDTASEIW

Query:  NSLKRSYDSKTTARIMGLKTQLQRIKKDGL-------------SNRSDNPALEDVRSLLLAYEARLEKQ
         +L++ + + + ARIM L+ QLQ  KK GL             ++  ++ +LE++ S+LL +E RLE+Q
Subjt:  NSLKRSYDSKTTARIMGLKTQLQRIKKDGL-------------SNRSDNPALEDVRSLLLAYEARLEKQ

A0A6J1DQX7 uncharacterized protein LOC111022315; e-value 1.8e-55; identity 55.05%
Query:  FFPPQVN-PSQPSTPFAPNPYPTLPQPLSVKLTDTNFLLWKNQLLNAVLANGLHGFLDGSIPAPSKFLDDQCSQPNPEFLTWERYNRFIMCWMYSSLSEE
        F PP  N  +QP  PF+ NP+PTLPQPL+VKL D NFLLWKNQLLNAV+ANGL G+LDG+I  P +FLD    QPNP +  WERYNR +MCW+YSSLSEE
Subjt:  FFPPQVN-PSQPSTPFAPNPYPTLPQPLSVKLTDTNFLLWKNQLLNAVLANGLHGFLDGSIPAPSKFLDDQCSQPNPEFLTWERYNRFIMCWMYSSLSEE

Query:  KMGEIVNLDTASEIWNSLKRSYDSKTTARIMGLKTQLQRIKKDGLS-----------------------------------------------NRSDNPA
        KMGE+V+L+T  +IW+SL R YDSKTTARIMGLKT+LQ ++KDG S                                               NR+D+P+
Subjt:  KMGEIVNLDTASEIWNSLKRSYDSKTTARIMGLKTQLQRIKKDGLS-----------------------------------------------NRSDNPA

Query:  LEDVRSLLLAYEARLEKQ
        LEDVRSLLLAYEARL+KQ
Subjt:  LEDVRSLLLAYEARLEKQ

A0A7J0EGI5 Uncharacterized protein; e-value 3.3e-33; identity 38.6%
Query:  PSSFPRPQAPLFFPPQVNPSQPSTPFAPNPYPTLPQPLSVKLTDTNFLLWKNQLLNAVLANGLHGFLDGSIPAPSKFLDDQCSQPNPEFLTWERYNRFIM
        P+S P P + +   P  NP   +T   PN  P++ QPL+VKL D N+++WK QLLN V+ANGL  FLDGS   P +FLD Q  Q NPEF +W+RYNR +M
Subjt:  PSSFPRPQAPLFFPPQVNPSQPSTPFAPNPYPTLPQPLSVKLTDTNFLLWKNQLLNAVLANGLHGFLDGSIPAPSKFLDDQCSQPNPEFLTWERYNRFIM

Query:  CWMYSSLSEEKMGEIVNLDTASEIWNSLKRSYDSKTTARIMGLKTQLQRIKKDGLS--------------------------------------------
         W+Y+S++E  +G+IV   +AS+IW +L+R Y + + A +  L+T LQ IKK+GL+                                            
Subjt:  CWMYSSLSEEKMGEIVNLDTASEIWNSLKRSYDSKTTARIMGLKTQLQRIKKDGLS--------------------------------------------

Query:  ---NRSDNPALEDVRSLLLAYEARLEKQ
           +++  P++E+V SLLL+Y+ARLE+Q
Subjt:  ---NRSDNPALEDVRSLLLAYEARLEKQ

A0A7J0GPN0 UBX domain-containing protein; e-value 4.0e-31; identity 47.5%
Query:  PQAPLFFPPQVNPSQPSTPFAPNP----------YPTLPQPLSVKLTDTNFLLWKNQLLNAVLANGLHGFLDGSIPAPSKFLDDQCSQPNPEFLTWERYN
        P AP   PP  NP   S+   PNP           P++ QPL+VKL D N+++WK QLLN V+ANGL  FLDGS   P +FLD Q  Q NPEF +W+RYN
Subjt:  PQAPLFFPPQVNPSQPSTPFAPNP----------YPTLPQPLSVKLTDTNFLLWKNQLLNAVLANGLHGFLDGSIPAPSKFLDDQCSQPNPEFLTWERYN

Query:  RFIMCWMYSSLSEEKMGEIVNLDTASEIWNSLKRSYDSKTTARIMGLKTQLQRIKKDGLS
        R +M W+Y+S++E  +G+IV   +AS+IW +L+R Y + + A +  L+T LQ IKK+GL+
Subjt:  RFIMCWMYSSLSEEKMGEIVNLDTASEIWNSLKRSYDSKTTARIMGLKTQLQRIKKDGLS

SwissProt top hits (columns: e-value, % identity, alignment)
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1; e-value 3.7e-10; identity 29.85%
Query:  KLTDTNFLLWKNQLLNAVLANGLHGFLDGSIPAPSKFL-DDQCSQPNPEFLTWERYNRFIMCWMYSSLSEEKMGEIVNLDTASEIWNSLKRSYDSKTTAR
        KLT TN+L+W  Q+        L GFLDGS   P   +  D   + NP++  W+R ++ I   +  ++S      +    TA++IW +L++ Y + +   
Subjt:  KLTDTNFLLWKNQLLNAVLANGLHGFLDGSIPAPSKFL-DDQCSQPNPEFLTWERYNRFIMCWMYSSLSEEKMGEIVNLDTASEIWNSLKRSYDSKTTAR

Query:  IMGLKTQLQRIKK---------DGLSNRSDNPAL
        +  L+TQL++  K          GL  R D  AL
Subjt:  IMGLKTQLQRIKK---------DGLSNRSDNPAL

Arabidopsis top hits (columns: e-value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGACAACAGAAGCTTCATCCTCCTCTTCTCTTTCTTCTTCTGCTGAAATCACCCCACCGATCATCTTTCCATCTACACCAATCACCACTCCGATTGTCTCTCCCATTGC
CCAGACCCCCAAACAACCAACATCTCAATCGCGCCCCTTTCTCCCCCAAAATCGCCCTAATATTGCTCCAACTCAACCCACGTTTAATCCATATCAACCACAACCGTTTT
ATCCAACTTCAGGCGTCTATCAACCTTTTTACCCCTCTTCTTTTCCTCGCCCTCAAGCTCCCCTGTTTTTTCCACCTCAAGTTAACCCATCCCAACCTTCGACTCCCTTT
GCCCCGAATCCCTATCCCACTTTACCACAACCATTATCTGTCAAACTCACAGATACAAATTTTTTGCTATGGAAGAACCAACTCTTGAATGCGGTATTGGCTAACGGACT
TCATGGTTTCCTTGATGGATCTATCCCAGCTCCCTCCAAATTTCTTGACGATCAATGCTCTCAGCCGAATCCTGAATTTCTGACTTGGGAAAGGTACAACCGTTTTATTA
TGTGTTGGATGTATTCCTCGCTTTCTGAAGAAAAAATGGGTGAAATAGTGAACTTAGACACTGCCTCTGAAATATGGAACTCGTTGAAACGTTCTTATGATTCTAAGACT
ACGGCTAGGATAATGGGTCTCAAAACTCAGCTTCAACGGATTAAGAAAGACGGTCTCTCTAATCGTTCTGATAACCCTGCTCTAGAGGATGTTCGAAGTCTTCTCTTGGC
TTATGAGGCGAGATTAGAAAAACAACTAGTGTTGATCAACTTAGCTTAG
mRNA sequence
ATGACAACAGAAGCTTCATCCTCCTCTTCTCTTTCTTCTTCTGCTGAAATCACCCCACCGATCATCTTTCCATCTACACCAATCACCACTCCGATTGTCTCTCCCATTGC
CCAGACCCCCAAACAACCAACATCTCAATCGCGCCCCTTTCTCCCCCAAAATCGCCCTAATATTGCTCCAACTCAACCCACGTTTAATCCATATCAACCACAACCGTTTT
ATCCAACTTCAGGCGTCTATCAACCTTTTTACCCCTCTTCTTTTCCTCGCCCTCAAGCTCCCCTGTTTTTTCCACCTCAAGTTAACCCATCCCAACCTTCGACTCCCTTT
GCCCCGAATCCCTATCCCACTTTACCACAACCATTATCTGTCAAACTCACAGATACAAATTTTTTGCTATGGAAGAACCAACTCTTGAATGCGGTATTGGCTAACGGACT
TCATGGTTTCCTTGATGGATCTATCCCAGCTCCCTCCAAATTTCTTGACGATCAATGCTCTCAGCCGAATCCTGAATTTCTGACTTGGGAAAGGTACAACCGTTTTATTA
TGTGTTGGATGTATTCCTCGCTTTCTGAAGAAAAAATGGGTGAAATAGTGAACTTAGACACTGCCTCTGAAATATGGAACTCGTTGAAACGTTCTTATGATTCTAAGACT
ACGGCTAGGATAATGGGTCTCAAAACTCAGCTTCAACGGATTAAGAAAGACGGTCTCTCTAATCGTTCTGATAACCCTGCTCTAGAGGATGTTCGAAGTCTTCTCTTGGC
TTATGAGGCGAGATTAGAAAAACAACTAGTGTTGATCAACTTAGCTTAG
Protein sequence
MTTEASSSSSLSSSAEITPPIIFPSTPITTPIVSPIAQTPKQPTSQSRPFLPQNRPNIAPTQPTFNPYQPQPFYPTSGVYQPFYPSSFPRPQAPLFFPPQVNPSQPSTPF
APNPYPTLPQPLSVKLTDTNFLLWKNQLLNAVLANGLHGFLDGSIPAPSKFLDDQCSQPNPEFLTWERYNRFIMCWMYSSLSEEKMGEIVNLDTASEIWNSLKRSYDSKT
TARIMGLKTQLQRIKKDGLSNRSDNPALEDVRSLLLAYEARLEKQLVLINLA
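
The relationship between the CDS and the protein sequence listed in this record can be checked by translating codons. The sketch below is illustrative only: the function name `translate` is hypothetical, and it assumes the standard nuclear genetic code (NCBI translation table 1), which the record does not state explicitly.

```python
from itertools import product

# Standard genetic code (ASSUMED: NCBI translation table 1), with codons
# enumerated in T, C, A, G order at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a CDS (whitespace and line breaks allowed) into protein.

    Unknown codons (for example those containing N) become 'X'.
    With to_stop=True, translation ends before the first stop codon.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First six and last three codons of the CDS shown above.
print(translate("ATGACAACAGAAGCTTCA"))  # MTTEAS
print(translate("TTAGCTTAG"))           # LA
```

The two fragments reproduce the start (`MTTEAS`) and end (`LA`) of the listed protein sequence; pasting the full CDS into `translate` would be the way to check the remainder.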