; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0007913 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0007913
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr9:7928627..7933043
RNA-Seq ExpressionLag0007913
SyntenyLag0007913
Gene Ontology termsNA
InterPro domainsIPR025314 - Domain of unknown function DUF4219


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573415.1 hypothetical protein SDJN03_27302, partial [Cucurbita argyrosperma subsp. sororia]1.3e-5651.76Show/hide
Query:  MHSQIEEKLSLLHALNIPTEKPLE---------------------KRKTSCRKKTRVQHLLDANEMKLNKVNTEVCTFQSIDRQPSQPVNKHTDSSQPTS
        +HSQIEEKLSLLHALN PTEKPL+                     KRKT C+KKTR Q LLD NEM+L+KVN EVC  +S  RQPSQPV+K TD+ QP  
Subjt:  MHSQIEEKLSLLHALNIPTEKPLE---------------------KRKTSCRKKTRVQHLLDANEMKLNKVNTEVCTFQSIDRQPSQPVNKHTDSSQPTS

Query:  KPGNFWEYSRWGLYEIASN-----WIVLLMRNATGEQWKCRCPLHFQIFISL----------------ESLVDELHKELPNEEGKPQLHGHDVMDVEIKS
        +  N    S         N     ++ LL  ++  ++   R  +   +  SL                E L+DE H     EEG+PQLH +DVMDVEIKS
Subjt:  KPGNFWEYSRWGLYEIASN-----WIVLLMRNATGEQWKCRCPLHFQIFISL----------------ESLVDELHKELPNEEGKPQLHGHDVMDVEIKS

Query:  NCTQSCNSGLLGDAHGSKRQLG--LWQGSRGSDLCDIVQAEKNCLDQIGVIVGMPGTNVPLSGREGVGTSEIKSGTPDISIPDF
        N TQ CNSGLLGD H SK  L     QGS GSDLCDIVQA++N L+QIGV V +PGTNVPLSG EGVG SEIKSGT D SIPDF
Subjt:  NCTQSCNSGLLGDAHGSKRQLG--LWQGSRGSDLCDIVQAEKNCLDQIGVIVGMPGTNVPLSGREGVGTSEIKSGTPDISIPDF

XP_022954739.1 restin homolog isoform X1 [Cucurbita moschata]5.9e-5752.11Show/hide
Query:  MHSQIEEKLSLLHALNIPTEKPLE---------------------KRKTSCRKKTRVQHLLDANEMKLNKVNTEVCTFQSIDRQPSQPVNKHTDSSQPTS
        +HSQIEEKLSLLHALN PTEKPL+                     KRKT C+KKTR Q LLD NEM+L+KVN EVC  +S  RQPSQPV+K TD+ QP  
Subjt:  MHSQIEEKLSLLHALNIPTEKPLE---------------------KRKTSCRKKTRVQHLLDANEMKLNKVNTEVCTFQSIDRQPSQPVNKHTDSSQPTS

Query:  KPGNFWEYSRWGLYEIASN-----WIVLLMRNATGEQWKCRCPLHFQIFISL----------------ESLVDELHKELPNEEGKPQLHGHDVMDVEIKS
        +  N    S         N     ++ LL  ++  ++   R  +   +  SL                E L+DE H     EEG+PQLH +DVMDVEIKS
Subjt:  KPGNFWEYSRWGLYEIASN-----WIVLLMRNATGEQWKCRCPLHFQIFISL----------------ESLVDELHKELPNEEGKPQLHGHDVMDVEIKS

Query:  NCTQSCNSGLLGDAHGSKRQLG--LWQGSRGSDLCDIVQAEKNCLDQIGVIVGMPGTNVPLSGREGVGTSEIKSGTPDISIPDF
        N TQ CNSGLLGD H SK  L     QGS GSDLCDIVQA++N L+QIGV V MPGTNVPLSG EGVG SEIKSGT D SIPDF
Subjt:  NCTQSCNSGLLGDAHGSKRQLG--LWQGSRGSDLCDIVQAEKNCLDQIGVIVGMPGTNVPLSGREGVGTSEIKSGTPDISIPDF

XP_022954745.1 restin homolog isoform X2 [Cucurbita moschata]5.9e-5752.11Show/hide
Query:  MHSQIEEKLSLLHALNIPTEKPLE---------------------KRKTSCRKKTRVQHLLDANEMKLNKVNTEVCTFQSIDRQPSQPVNKHTDSSQPTS
        +HSQIEEKLSLLHALN PTEKPL+                     KRKT C+KKTR Q LLD NEM+L+KVN EVC  +S  RQPSQPV+K TD+ QP  
Subjt:  MHSQIEEKLSLLHALNIPTEKPLE---------------------KRKTSCRKKTRVQHLLDANEMKLNKVNTEVCTFQSIDRQPSQPVNKHTDSSQPTS

Query:  KPGNFWEYSRWGLYEIASN-----WIVLLMRNATGEQWKCRCPLHFQIFISL----------------ESLVDELHKELPNEEGKPQLHGHDVMDVEIKS
        +  N    S         N     ++ LL  ++  ++   R  +   +  SL                E L+DE H     EEG+PQLH +DVMDVEIKS
Subjt:  KPGNFWEYSRWGLYEIASN-----WIVLLMRNATGEQWKCRCPLHFQIFISL----------------ESLVDELHKELPNEEGKPQLHGHDVMDVEIKS

Query:  NCTQSCNSGLLGDAHGSKRQLG--LWQGSRGSDLCDIVQAEKNCLDQIGVIVGMPGTNVPLSGREGVGTSEIKSGTPDISIPDF
        N TQ CNSGLLGD H SK  L     QGS GSDLCDIVQA++N L+QIGV V MPGTNVPLSG EGVG SEIKSGT D SIPDF
Subjt:  NCTQSCNSGLLGDAHGSKRQLG--LWQGSRGSDLCDIVQAEKNCLDQIGVIVGMPGTNVPLSGREGVGTSEIKSGTPDISIPDF

XP_023000919.1 uncharacterized protein LOC111495215 [Cucurbita maxima]3.4e-5751.76Show/hide
Query:  MHSQIEEKLSLLHALNIPTEKPLE---------------------KRKTSCRKKTRVQHLLDANEMKLNKVNTEVCTFQSIDRQPSQPVNKHTDSSQPTS
        MHSQIEEKLSLLHALN PTEKPL+                     KRK  C+KK +VQHLLD +EMKLNKV+TEVC  +SI  +PSQPV+K  D+ QP  
Subjt:  MHSQIEEKLSLLHALNIPTEKPLE---------------------KRKTSCRKKTRVQHLLDANEMKLNKVNTEVCTFQSIDRQPSQPVNKHTDSSQPTS

Query:  KPGNFWEYSRWGLYEIASN-----WIVLLMRNATGEQWKCRCPLHF------QIFI---------SLESLVDELHKELPNE-EGKPQLHGHDVMDVEIKS
        +  N +  S     E   N     ++ LL  ++  ++   R  +         I+I           E LVDELHKELP+E EG+P+ H + V+DVEIKS
Subjt:  KPGNFWEYSRWGLYEIASN-----WIVLLMRNATGEQWKCRCPLHF------QIFI---------SLESLVDELHKELPNE-EGKPQLHGHDVMDVEIKS

Query:  NCTQSCNSGLLGDAHGSKRQLG--LWQGSRGSDLCDIVQAEKNCLDQIGVIVGMPGTNVPLSGREGVGTSEIKSGTPDISIPDF
        N TQSC+  LLGD H SKRQL   L QG + +DL DIVQA  NCLDQ+GVIVGMPGTNV LSG EGVG SEIKSGT   S PDF
Subjt:  NCTQSCNSGLLGDAHGSKRQLG--LWQGSRGSDLCDIVQAEKNCLDQIGVIVGMPGTNVPLSGREGVGTSEIKSGTPDISIPDF

XP_023541502.1 uncharacterized protein LOC111801664 isoform X1 [Cucurbita pepo subsp. pepo]1.3e-5651.76Show/hide
Query:  MHSQIEEKLSLLHALNIPTEKPLE---------------------KRKTSCRKKTRVQHLLDANEMKLNKVNTEVCTFQSIDRQPSQPVNKHTDSSQPTS
        +HSQIEEKLSLLHALN PTEKPLE                     KRKT CRKKTR Q LLD NEM+L+KVN EV   ++  RQPSQPV+K TD+ QP S
Subjt:  MHSQIEEKLSLLHALNIPTEKPLE---------------------KRKTSCRKKTRVQHLLDANEMKLNKVNTEVCTFQSIDRQPSQPVNKHTDSSQPTS

Query:  KPGNFWEYSRWGLYEIASN-----WIVLLMRNATGEQWKCRCPLHFQIFISL----------------ESLVDELHKELPNEEGKPQLHGHDVMDVEIKS
        +  N    S         N     ++ LL  ++  ++   R  +   +  SL                E L+DE H     EEG+PQLH +DVMDVEIKS
Subjt:  KPGNFWEYSRWGLYEIASN-----WIVLLMRNATGEQWKCRCPLHFQIFISL----------------ESLVDELHKELPNEEGKPQLHGHDVMDVEIKS

Query:  NCTQSCNSGLLGDAHGSKRQLG--LWQGSRGSDLCDIVQAEKNCLDQIGVIVGMPGTNVPLSGREGVGTSEIKSGTPDISIPDF
        N  + CNSGLLGD H SKR L     QGS GSDLCDIV AE+NCLDQIG+ V  PGTNVPLSG EGVG SE+KSGT D SIPDF
Subjt:  NCTQSCNSGLLGDAHGSKRQLG--LWQGSRGSDLCDIVQAEKNCLDQIGVIVGMPGTNVPLSGREGVGTSEIKSGTPDISIPDF

TrEMBL top hitse value%identityAlignment
A0A6J1CE16 uncharacterized protein LOC1110108067.0e-5649.65Show/hide
Query:  MHSQIEEKLSLLHALNIPTEKPLEK-------------------RKTSC--RKKTRVQHLLDANEMKLNKVNTEVCTFQSIDRQPSQPVNKHTDSSQPTS
        MHSQIEEKLSLLH LN PTEKP  K                   +KT+   +KK ++Q LLD +E+KLNKV+TEVC  +S+ RQPSQPV+K TDS QP  
Subjt:  MHSQIEEKLSLLHALNIPTEKPLEK-------------------RKTSC--RKKTRVQHLLDANEMKLNKVNTEVCTFQSIDRQPSQPVNKHTDSSQPTS

Query:  KPGNFWEYSRWGLYEIASN-----WIVLLMRNATGEQWKCRCPLHFQIFISL----------------ESLVDELHKELPNEEGKPQLHGHDVMDVEIKS
        +  +    S   + E   N     ++ LL  ++  ++   R  +   +  SL                E LVDE HK+LP E+G+P  H +DV+DVEIKS
Subjt:  KPGNFWEYSRWGLYEIASN-----WIVLLMRNATGEQWKCRCPLHFQIFISL----------------ESLVDELHKELPNEEGKPQLHGHDVMDVEIKS

Query:  NCTQSCNSGLLGDAHGSKRQLG--LWQGSRGSDLCDIVQAEKNCLDQIGVIVGMPGTNVPLSGREGVGTSEIKSGTPDISIPDF
        N TQSCNSGLL D H SKRQ+   L  GS GSDLCDIVQAEK CLD++GVIV MPGT   LSG EGV TSEIKSG+ D  +PDF
Subjt:  NCTQSCNSGLLGDAHGSKRQLG--LWQGSRGSDLCDIVQAEKNCLDQIGVIVGMPGTNVPLSGREGVGTSEIKSGTPDISIPDF

A0A6J1GRR8 restin homolog isoform X12.8e-5752.11Show/hide
Query:  MHSQIEEKLSLLHALNIPTEKPLE---------------------KRKTSCRKKTRVQHLLDANEMKLNKVNTEVCTFQSIDRQPSQPVNKHTDSSQPTS
        +HSQIEEKLSLLHALN PTEKPL+                     KRKT C+KKTR Q LLD NEM+L+KVN EVC  +S  RQPSQPV+K TD+ QP  
Subjt:  MHSQIEEKLSLLHALNIPTEKPLE---------------------KRKTSCRKKTRVQHLLDANEMKLNKVNTEVCTFQSIDRQPSQPVNKHTDSSQPTS

Query:  KPGNFWEYSRWGLYEIASN-----WIVLLMRNATGEQWKCRCPLHFQIFISL----------------ESLVDELHKELPNEEGKPQLHGHDVMDVEIKS
        +  N    S         N     ++ LL  ++  ++   R  +   +  SL                E L+DE H     EEG+PQLH +DVMDVEIKS
Subjt:  KPGNFWEYSRWGLYEIASN-----WIVLLMRNATGEQWKCRCPLHFQIFISL----------------ESLVDELHKELPNEEGKPQLHGHDVMDVEIKS

Query:  NCTQSCNSGLLGDAHGSKRQLG--LWQGSRGSDLCDIVQAEKNCLDQIGVIVGMPGTNVPLSGREGVGTSEIKSGTPDISIPDF
        N TQ CNSGLLGD H SK  L     QGS GSDLCDIVQA++N L+QIGV V MPGTNVPLSG EGVG SEIKSGT D SIPDF
Subjt:  NCTQSCNSGLLGDAHGSKRQLG--LWQGSRGSDLCDIVQAEKNCLDQIGVIVGMPGTNVPLSGREGVGTSEIKSGTPDISIPDF

A0A6J1GT99 restin homolog isoform X22.8e-5752.11Show/hide
Query:  MHSQIEEKLSLLHALNIPTEKPLE---------------------KRKTSCRKKTRVQHLLDANEMKLNKVNTEVCTFQSIDRQPSQPVNKHTDSSQPTS
        +HSQIEEKLSLLHALN PTEKPL+                     KRKT C+KKTR Q LLD NEM+L+KVN EVC  +S  RQPSQPV+K TD+ QP  
Subjt:  MHSQIEEKLSLLHALNIPTEKPLE---------------------KRKTSCRKKTRVQHLLDANEMKLNKVNTEVCTFQSIDRQPSQPVNKHTDSSQPTS

Query:  KPGNFWEYSRWGLYEIASN-----WIVLLMRNATGEQWKCRCPLHFQIFISL----------------ESLVDELHKELPNEEGKPQLHGHDVMDVEIKS
        +  N    S         N     ++ LL  ++  ++   R  +   +  SL                E L+DE H     EEG+PQLH +DVMDVEIKS
Subjt:  KPGNFWEYSRWGLYEIASN-----WIVLLMRNATGEQWKCRCPLHFQIFISL----------------ESLVDELHKELPNEEGKPQLHGHDVMDVEIKS

Query:  NCTQSCNSGLLGDAHGSKRQLG--LWQGSRGSDLCDIVQAEKNCLDQIGVIVGMPGTNVPLSGREGVGTSEIKSGTPDISIPDF
        N TQ CNSGLLGD H SK  L     QGS GSDLCDIVQA++N L+QIGV V MPGTNVPLSG EGVG SEIKSGT D SIPDF
Subjt:  NCTQSCNSGLLGDAHGSKRQLG--LWQGSRGSDLCDIVQAEKNCLDQIGVIVGMPGTNVPLSGREGVGTSEIKSGTPDISIPDF

A0A6J1JWM7 protein MLP1-like1.3e-5451.41Show/hide
Query:  MHSQIEEKLSLLHALNIPTEKPLE---------------------KRKTSCRKKTRVQHLLDANEMKLNKVNTEVCTFQSIDRQPSQPVNKHTDSSQPTS
        +HSQIEEKLSLLHALN PTEKPLE                     KRKT C KKTR Q LLD NEM+  +VN EV   +S  RQPSQPV+K TD+ QP S
Subjt:  MHSQIEEKLSLLHALNIPTEKPLE---------------------KRKTSCRKKTRVQHLLDANEMKLNKVNTEVCTFQSIDRQPSQPVNKHTDSSQPTS

Query:  KPGNFWEYSRWGLYEIASN-----WIVLLMRNATGEQWKCRCPLHFQIFISL----------------ESLVDELHKELPNEEGKPQLHGHDVMDVEIKS
        +  N    S         N     ++ LL  ++  ++   R  +   +  SL                E L+DE H     EEG+PQLH +DVMDVEIKS
Subjt:  KPGNFWEYSRWGLYEIASN-----WIVLLMRNATGEQWKCRCPLHFQIFISL----------------ESLVDELHKELPNEEGKPQLHGHDVMDVEIKS

Query:  NCTQSCNSGLLGDAHGSKRQLG--LWQGSRGSDLCDIVQAEKNCLDQIGVIVGMPGTNVPLSGREGVGTSEIKSGTPDISIPDF
        N TQ CNSGLLGD H SKR L     QG  GSDLCDIV AE+NCLDQIG+ V  PGTNVPLSG EGVG S IKSGT D SIPDF
Subjt:  NCTQSCNSGLLGDAHGSKRQLG--LWQGSRGSDLCDIVQAEKNCLDQIGVIVGMPGTNVPLSGREGVGTSEIKSGTPDISIPDF

A0A6J1KH58 uncharacterized protein LOC1114952151.7e-5751.76Show/hide
Query:  MHSQIEEKLSLLHALNIPTEKPLE---------------------KRKTSCRKKTRVQHLLDANEMKLNKVNTEVCTFQSIDRQPSQPVNKHTDSSQPTS
        MHSQIEEKLSLLHALN PTEKPL+                     KRK  C+KK +VQHLLD +EMKLNKV+TEVC  +SI  +PSQPV+K  D+ QP  
Subjt:  MHSQIEEKLSLLHALNIPTEKPLE---------------------KRKTSCRKKTRVQHLLDANEMKLNKVNTEVCTFQSIDRQPSQPVNKHTDSSQPTS

Query:  KPGNFWEYSRWGLYEIASN-----WIVLLMRNATGEQWKCRCPLHF------QIFI---------SLESLVDELHKELPNE-EGKPQLHGHDVMDVEIKS
        +  N +  S     E   N     ++ LL  ++  ++   R  +         I+I           E LVDELHKELP+E EG+P+ H + V+DVEIKS
Subjt:  KPGNFWEYSRWGLYEIASN-----WIVLLMRNATGEQWKCRCPLHF------QIFI---------SLESLVDELHKELPNE-EGKPQLHGHDVMDVEIKS

Query:  NCTQSCNSGLLGDAHGSKRQLG--LWQGSRGSDLCDIVQAEKNCLDQIGVIVGMPGTNVPLSGREGVGTSEIKSGTPDISIPDF
        N TQSC+  LLGD H SKRQL   L QG + +DL DIVQA  NCLDQ+GVIVGMPGTNV LSG EGVG SEIKSGT   S PDF
Subjt:  NCTQSCNSGLLGDAHGSKRQLG--LWQGSRGSDLCDIVQAEKNCLDQIGVIVGMPGTNVPLSGREGVGTSEIKSGTPDISIPDF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G48720.1 unknown protein3.3e-1356.9Show/hide
Query:  MANNNLVPFQVPRLTKENYSSWCIRMKALLGSQDAWEIVDRGYEEQENDVALNQADSE
        MA+NN VPFQVP LTK NY +W +RMKA+LG+ D WEIV++G+ E EN+ +L+Q   +
Subjt:  MANNNLVPFQVPRLTKENYSSWCIRMKALLGSQDAWEIVDRGYEEQENDVALNQADSE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATTCTCAGATTGAAGAGAAGTTGTCTCTTTTGCATGCTTTAAACATTCCTACAGAGAAGCCCTTAGAAAAGAGAAAGACTTCGTGCCGGAAGAAAACAAGGGTGCA
GCATTTACTTGATGCTAATGAGATGAAGTTGAATAAAGTTAACACTGAAGTTTGTACGTTTCAAAGTATTGATAGGCAACCTTCTCAACCTGTCAACAAACATACAGACA
GTAGTCAGCCAACTTCAAAGCCTGGGAACTTTTGGGAATATAGCAGATGGGGACTATATGAAATTGCTAGTAATTGGATAGTGCTGCTGATGAGGAATGCTACAGGAGAG
CAATGGAAATGCCGCTGTCCCCTTCACTTCCAGATATTTATATCCCTGGAGTCTTTAGTAGATGAACTCCATAAAGAATTGCCAAATGAAGAAGGGAAGCCACAATTGCA
TGGCCATGATGTCATGGATGTTGAGATTAAGTCCAATTGTACCCAATCCTGCAACTCTGGCTTGTTAGGAGATGCTCATGGCAGTAAACGCCAACTTGGTCTGTGGCAAG
GGAGCCGTGGGAGTGATCTTTGTGACATTGTACAGGCAGAAAAAAACTGTCTTGATCAGATTGGGGTCATTGTAGGGATGCCTGGGACAAATGTTCCTCTTTCTGGTCGT
GAAGGGGTGGGAACGTCAGAAATTAAATCTGGAACCCCGGACATCTCCATCCCTGATTTTTGGAGCTTGAGTTTGAAAAAAGGAAAATTGTCTTTGCAAATGGCAAACAA
CAATTTGGTTCCTTTCCAAGTGCCTCGACTTACCAAAGAAAATTATAGCAGTTGGTGTATTCGTATGAAAGCTCTACTTGGTTCACAAGATGCATGGGAAATTGTTGATA
GAGGCTATGAAGAACAAGAAAATGATGTGGCTTTAAATCAAGCTGATAGCGAGCATAGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGCATTCTCAGATTGAAGAGAAGTTGTCTCTTTTGCATGCTTTAAACATTCCTACAGAGAAGCCCTTAGAAAAGAGAAAGACTTCGTGCCGGAAGAAAACAAGGGTGCA
GCATTTACTTGATGCTAATGAGATGAAGTTGAATAAAGTTAACACTGAAGTTTGTACGTTTCAAAGTATTGATAGGCAACCTTCTCAACCTGTCAACAAACATACAGACA
GTAGTCAGCCAACTTCAAAGCCTGGGAACTTTTGGGAATATAGCAGATGGGGACTATATGAAATTGCTAGTAATTGGATAGTGCTGCTGATGAGGAATGCTACAGGAGAG
CAATGGAAATGCCGCTGTCCCCTTCACTTCCAGATATTTATATCCCTGGAGTCTTTAGTAGATGAACTCCATAAAGAATTGCCAAATGAAGAAGGGAAGCCACAATTGCA
TGGCCATGATGTCATGGATGTTGAGATTAAGTCCAATTGTACCCAATCCTGCAACTCTGGCTTGTTAGGAGATGCTCATGGCAGTAAACGCCAACTTGGTCTGTGGCAAG
GGAGCCGTGGGAGTGATCTTTGTGACATTGTACAGGCAGAAAAAAACTGTCTTGATCAGATTGGGGTCATTGTAGGGATGCCTGGGACAAATGTTCCTCTTTCTGGTCGT
GAAGGGGTGGGAACGTCAGAAATTAAATCTGGAACCCCGGACATCTCCATCCCTGATTTTTGGAGCTTGAGTTTGAAAAAAGGAAAATTGTCTTTGCAAATGGCAAACAA
CAATTTGGTTCCTTTCCAAGTGCCTCGACTTACCAAAGAAAATTATAGCAGTTGGTGTATTCGTATGAAAGCTCTACTTGGTTCACAAGATGCATGGGAAATTGTTGATA
GAGGCTATGAAGAACAAGAAAATGATGTGGCTTTAAATCAAGCTGATAGCGAGCATAGCTAG
Protein sequenceShow/hide protein sequence
MHSQIEEKLSLLHALNIPTEKPLEKRKTSCRKKTRVQHLLDANEMKLNKVNTEVCTFQSIDRQPSQPVNKHTDSSQPTSKPGNFWEYSRWGLYEIASNWIVLLMRNATGE
QWKCRCPLHFQIFISLESLVDELHKELPNEEGKPQLHGHDVMDVEIKSNCTQSCNSGLLGDAHGSKRQLGLWQGSRGSDLCDIVQAEKNCLDQIGVIVGMPGTNVPLSGR
EGVGTSEIKSGTPDISIPDFWSLSLKKGKLSLQMANNNLVPFQVPRLTKENYSSWCIRMKALLGSQDAWEIVDRGYEEQENDVALNQADSEHS