CuGenDBv2

Lag0018626 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0018626
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr5:31136864..31137361
RNA-Seq Expression: Lag0018626
Synteny: Lag0018626
Gene Ontology terms: GO:0005488 - binding (molecular function)
InterPro domains: IPR005162 - Retrotransposon gag domain
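
As a cross-check on the genome location listed above, the following sketch parses the location string and computes the length of the span. It assumes the coordinates are 1-based and inclusive (the page does not state the convention), and the function names are illustrative only; they are not part of any CuGenDB interface.

```python
import re


def parse_location(loc):
    """Split 'chrom:start..end' into (chrom, start, end).

    Assumption: coordinates are 1-based and inclusive.
    """
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


def span_length(start, end):
    """Length in bp of an inclusive [start, end] interval."""
    return end - start + 1


chrom, start, end = parse_location("chr5:31136864..31137361")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the span is 498 bp, that is 166 codons, which is consistent with the 165-residue protein sequence plus a stop codon.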


Homology
GenBank top hits (e value, % identity, alignment)
KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]; e value 7.5e-27; identity 42.36%
Query:  SKVINPWNKIAIVKFDEENFLLWKIQVLITLRGHGLQQHIDTDVEVPVKYLNTGDTSSSNR--AVNPEYDHWIRQDNLITAWFLGSKSNSLLSEVVDCEI
        +++    NKI++VK +++ FLLWK Q+L  L  + L+  ++++ E P KYL + ++SS++     NP Y  W RQD LI++W LGS S  +L++++ C+ 
Subjt:  SKVINPWNKIAIVKFDEENFLLWKIQVLITLRGHGLQQHIDTDVEVPVKYLNTGDTSSSNR--AVNPEYDHWIRQDNLITAWFLGSKSNSLLSEVVDCEI

Query:  AREVWKTLNDKFFSRNLARVMDLKTKLETMKKGNLKLEEYFLKI
        A+E+W+TL   F SR LA+ M  K KL  +KKG++ L+EYFLKI
Subjt:  AREVWKTLNDKFFSRNLARVMDLKTKLETMKKGNLKLEEYFLKI

KAA0053143.1 keratin, type II cytoskeletal 1-like [Cucumis melo var. makuwa]; e value 1.1e-27; identity 47.1%
Query:  NKIAIVKFDEENFLLWKIQVLITLRGHGLQQHIDTDVEVPVKYL-NTGDTS-SSNRAVNPEYDHWIRQDNLITAWFLGSKSNSLLSEVVDCEIAREVWKT
        NKI++VK  ++NFLLWK Q+L  L  + L+   ++++E P KYL +TG +S S+ R  NPEY  W R + LI+ W LGS S  +L+++V C+ A+E+W T
Subjt:  NKIAIVKFDEENFLLWKIQVLITLRGHGLQQHIDTDVEVPVKYL-NTGDTS-SSNRAVNPEYDHWIRQDNLITAWFLGSKSNSLLSEVVDCEIAREVWKT

Query:  LNDKFFSRNLARVMDLKTKLETMKKGNLKLEEYFLKIK
        L   F SR LA+ M  K KL  +KKG++ L+EYFLKI+
Subjt:  LNDKFFSRNLARVMDLKTKLETMKKGNLKLEEYFLKIK

KAA0067213.1 keratin, type II cytoskeletal 1-like [Cucumis melo var. makuwa]; e value 7.5e-27; identity 44.14%
Query:  SKVINPWNKIAIVKFDEENFLLWKIQVLITLRGHGLQQHIDTDVEVPVKYL-NTGDTS-SSNRAVNPEYDHWIRQDNLITAWFLGSKSNSLLSEVVDCEI
        +++    NKI++VK  ++NFLLWK Q+L  L  + L+  ++++ E P KYL +TG +S S+ R  NP Y  W RQD LI++W LGS S  +L++++ C+ 
Subjt:  SKVINPWNKIAIVKFDEENFLLWKIQVLITLRGHGLQQHIDTDVEVPVKYL-NTGDTS-SSNRAVNPEYDHWIRQDNLITAWFLGSKSNSLLSEVVDCEI

Query:  AREVWKTLNDKFFSRNLARVMDLKTKLETMKKGNLKLEEYFLKIK
        A+E+W TL   F SR LA+ M  K KL  +KK ++ L+EYFLKI+
Subjt:  AREVWKTLNDKFFSRNLARVMDLKTKLETMKKGNLKLEEYFLKIK

XP_022154487.1 uncharacterized protein LOC111021757 [Momordica charantia]; e value 2.3e-36; identity 49.04%
Query:  SDVQTVLQPSKVINPWNKIAIVKFDEENFLLWKIQVLITLRGHGLQQHIDTDVEVPVKYLNT--GDTSSSNRAVNPEYDHWIRQDNLITAWFLGSKSNSL
        SD   ++Q SK INP +K++IV+ +++N LLWK Q+   L+G+GL+ +ID++ + P +++ T   ++SSS+   NP Y  WI+QD LI+AW LGS +  +
Subjt:  SDVQTVLQPSKVINPWNKIAIVKFDEENFLLWKIQVLITLRGHGLQQHIDTDVEVPVKYLNT--GDTSSSNRAVNPEYDHWIRQDNLITAWFLGSKSNSL

Query:  LSEVVDCEIAREVWKTLNDKFFSRNLARVMDLKTKLETMKKGNLKLEEYFLKIKILL
        LS+++DC+ ARE+W  L   F SR LARVM LK KLE  KKGNL L++YFLKIK L+
Subjt:  LSEVVDCEIAREVWKTLNDKFFSRNLARVMDLKTKLETMKKGNLKLEEYFLKIKILL

XP_022156747.1 uncharacterized protein LOC111023586 [Momordica charantia]; e value 2.9e-31; identity 54.4%
Query:  KIQVLITLRGHGLQQHIDTDVEVPVKYLNTGD--TSSSNRAVNPEYDHWIRQDNLITAWFLGSKSNSLLSEVVDCEIAREVWKTLNDKFFSRNLARVMDL
        K QVL  ++GHGL+Q+ID+D+E P +++  GD  TSS+ +  NPEY HWI+QD LI+ W LGS S  +LS+++DC + +E+W  L   F SRNLARVM L
Subjt:  KIQVLITLRGHGLQQHIDTDVEVPVKYLNTGD--TSSSNRAVNPEYDHWIRQDNLITAWFLGSKSNSLLSEVVDCEIAREVWKTLNDKFFSRNLARVMDL

Query:  KTKLETMKKGNLKLEEYFLKIKILL
        K+KLE MKKG++ L+ YFLKIK L+
Subjt:  KTKLETMKKGNLKLEEYFLKIKILL

TrEMBL top hits (e value, % identity, alignment)
A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value 3.6e-27; identity 42.36%
Query:  SKVINPWNKIAIVKFDEENFLLWKIQVLITLRGHGLQQHIDTDVEVPVKYLNTGDTSSSNR--AVNPEYDHWIRQDNLITAWFLGSKSNSLLSEVVDCEI
        +++    NKI++VK +++ FLLWK Q+L  L  + L+  ++++ E P KYL + ++SS++     NP Y  W RQD LI++W LGS S  +L++++ C+ 
Subjt:  SKVINPWNKIAIVKFDEENFLLWKIQVLITLRGHGLQQHIDTDVEVPVKYLNTGDTSSSNR--AVNPEYDHWIRQDNLITAWFLGSKSNSLLSEVVDCEI

Query:  AREVWKTLNDKFFSRNLARVMDLKTKLETMKKGNLKLEEYFLKI
        A+E+W+TL   F SR LA+ M  K KL  +KKG++ L+EYFLKI
Subjt:  AREVWKTLNDKFFSRNLARVMDLKTKLETMKKGNLKLEEYFLKI

A0A5A7UB21 Keratin, type II cytoskeletal 1-like; e value 5.6e-28; identity 47.1%
Query:  NKIAIVKFDEENFLLWKIQVLITLRGHGLQQHIDTDVEVPVKYL-NTGDTS-SSNRAVNPEYDHWIRQDNLITAWFLGSKSNSLLSEVVDCEIAREVWKT
        NKI++VK  ++NFLLWK Q+L  L  + L+   ++++E P KYL +TG +S S+ R  NPEY  W R + LI+ W LGS S  +L+++V C+ A+E+W T
Subjt:  NKIAIVKFDEENFLLWKIQVLITLRGHGLQQHIDTDVEVPVKYL-NTGDTS-SSNRAVNPEYDHWIRQDNLITAWFLGSKSNSLLSEVVDCEIAREVWKT

Query:  LNDKFFSRNLARVMDLKTKLETMKKGNLKLEEYFLKIK
        L   F SR LA+ M  K KL  +KKG++ L+EYFLKI+
Subjt:  LNDKFFSRNLARVMDLKTKLETMKKGNLKLEEYFLKIK

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value 3.6e-27; identity 42.36%
Query:  SKVINPWNKIAIVKFDEENFLLWKIQVLITLRGHGLQQHIDTDVEVPVKYLNTGDTSSSNR--AVNPEYDHWIRQDNLITAWFLGSKSNSLLSEVVDCEI
        +++    NKI++VK +++ FLLWK Q+L  L  + L+  ++++ E P KYL + ++SS++     NP Y  W RQD LI++W LGS S  +L++++ C+ 
Subjt:  SKVINPWNKIAIVKFDEENFLLWKIQVLITLRGHGLQQHIDTDVEVPVKYLNTGDTSSSNR--AVNPEYDHWIRQDNLITAWFLGSKSNSLLSEVVDCEI

Query:  AREVWKTLNDKFFSRNLARVMDLKTKLETMKKGNLKLEEYFLKI
        A+E+W+TL   F SR LA+ M  K KL  +KKG++ L+EYFLKI
Subjt:  AREVWKTLNDKFFSRNLARVMDLKTKLETMKKGNLKLEEYFLKI

A0A6J1DLT9 uncharacterized protein LOC111021757; e value 1.1e-36; identity 49.04%
Query:  SDVQTVLQPSKVINPWNKIAIVKFDEENFLLWKIQVLITLRGHGLQQHIDTDVEVPVKYLNT--GDTSSSNRAVNPEYDHWIRQDNLITAWFLGSKSNSL
        SD   ++Q SK INP +K++IV+ +++N LLWK Q+   L+G+GL+ +ID++ + P +++ T   ++SSS+   NP Y  WI+QD LI+AW LGS +  +
Subjt:  SDVQTVLQPSKVINPWNKIAIVKFDEENFLLWKIQVLITLRGHGLQQHIDTDVEVPVKYLNT--GDTSSSNRAVNPEYDHWIRQDNLITAWFLGSKSNSL

Query:  LSEVVDCEIAREVWKTLNDKFFSRNLARVMDLKTKLETMKKGNLKLEEYFLKIKILL
        LS+++DC+ ARE+W  L   F SR LARVM LK KLE  KKGNL L++YFLKIK L+
Subjt:  LSEVVDCEIAREVWKTLNDKFFSRNLARVMDLKTKLETMKKGNLKLEEYFLKIKILL

A0A6J1DSS1 uncharacterized protein LOC111023586; e value 1.4e-31; identity 54.4%
Query:  KIQVLITLRGHGLQQHIDTDVEVPVKYLNTGD--TSSSNRAVNPEYDHWIRQDNLITAWFLGSKSNSLLSEVVDCEIAREVWKTLNDKFFSRNLARVMDL
        K QVL  ++GHGL+Q+ID+D+E P +++  GD  TSS+ +  NPEY HWI+QD LI+ W LGS S  +LS+++DC + +E+W  L   F SRNLARVM L
Subjt:  KIQVLITLRGHGLQQHIDTDVEVPVKYLNTGD--TSSSNRAVNPEYDHWIRQDNLITAWFLGSKSNSLLSEVVDCEIAREVWKTLNDKFFSRNLARVMDL

Query:  KTKLETMKKGNLKLEEYFLKIKILL
        K+KLE MKKG++ L+ YFLKIK L+
Subjt:  KTKLETMKKGNLKLEEYFLKIKILL

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); e value 1.9e-04; identity 32.05%
Query:  WIRQDNLITAWFLGSKSNSLLSEV--VDCEIAREVWKTLNDKFFSRNLARVMDLKTKLETMKKGNLKLEEYFLKIKIL
        W  +D L+  W  G+ ++SLL  +  V C  AR++W +L + F     AR +  + +L T    +L + EY  K+K L
Subjt:  WIRQDNLITAWFLGSKSNSLLSEV--VDCEIAREVWKTLNDKFFSRNLARVMDLKTKLETMKKGNLKLEEYFLKIKIL


Sequences
CDS sequence
ATGGAGTCTCTTGAGAGGAGTACATCTGATGTTCAAACTGTATTACAACCATCAAAAGTCATAAACCCATGGAACAAGATCGCAATTGTGAAGTTCGATGAAGAGAATTT
TCTTTTATGGAAAATACAAGTTTTAATCACACTGAGGGGTCATGGTTTGCAACAGCACATTGATACAGATGTAGAAGTACCAGTGAAATATCTGAATACAGGTGATACCT
CTTCTTCAAATCGAGCAGTTAACCCTGAATATGATCACTGGATTAGGCAAGATAACCTGATTACTGCGTGGTTTTTAGGTTCGAAGTCTAATTCCTTACTGTCAGAAGTT
GTAGATTGTGAAATTGCAAGAGAAGTCTGGAAAACACTCAATGATAAGTTTTTTTCTCGAAATCTTGCCAGAGTAATGGATCTGAAAACAAAATTGGAGACAATGAAGAA
AGGAAATTTAAAGCTTGAAGAATATTTCTTGAAGATTAAAATCTTGTTGATTCTTTAA
mRNA sequence
ATGGAGTCTCTTGAGAGGAGTACATCTGATGTTCAAACTGTATTACAACCATCAAAAGTCATAAACCCATGGAACAAGATCGCAATTGTGAAGTTCGATGAAGAGAATTT
TCTTTTATGGAAAATACAAGTTTTAATCACACTGAGGGGTCATGGTTTGCAACAGCACATTGATACAGATGTAGAAGTACCAGTGAAATATCTGAATACAGGTGATACCT
CTTCTTCAAATCGAGCAGTTAACCCTGAATATGATCACTGGATTAGGCAAGATAACCTGATTACTGCGTGGTTTTTAGGTTCGAAGTCTAATTCCTTACTGTCAGAAGTT
GTAGATTGTGAAATTGCAAGAGAAGTCTGGAAAACACTCAATGATAAGTTTTTTTCTCGAAATCTTGCCAGAGTAATGGATCTGAAAACAAAATTGGAGACAATGAAGAA
AGGAAATTTAAAGCTTGAAGAATATTTCTTGAAGATTAAAATCTTGTTGATTCTTTAA
Protein sequence
MESLERSTSDVQTVLQPSKVINPWNKIAIVKFDEENFLLWKIQVLITLRGHGLQQHIDTDVEVPVKYLNTGDTSSSNRAVNPEYDHWIRQDNLITAWFLGSKSNSLLSEV
VDCEIAREVWKTLNDKFFSRNLARVMDLKTKLETMKKGNLKLEEYFLKIKILLIL
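
The relationship between the CDS and the protein sequence above can be checked by translating the CDS with the standard genetic code. The sketch below is a minimal, dependency-free illustration. The function name `translate` and the use of the standard codon table are assumptions, since the page does not say how the protein sequence was derived. Only the first ten and last six codons of the CDS are used as input.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, to_stop=True):
    """Translate a DNA coding sequence in frame 1.

    Unknown codons become 'X'; a trailing partial codon is ignored.
    With to_stop=True, translation ends before the first stop codon.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGAGTCTCTTGAGAGGAGTACATCTGAT"))  # first 10 codons of the CDS
print(translate("ATCTTGTTGATTCTTTAA"))              # last 6 codons, including the stop
```

The two fragments translate to MESLERSTSD and ILLIL, matching the start and end of the protein sequence listed above.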