CuGenDBv2

Lag0005223 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0005223
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr6:12216195..12219638
RNA-Seq Expression: Lag0005223
Synteny: Lag0005223
Gene Ontology terms: NA
InterPro domains: NA
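
The genome location above is written as `chromosome:start..end`. A minimal sketch of parsing that string follows. It assumes 1-based, inclusive coordinates, which this record does not state, and `Location` and `parse_location` are hypothetical names introduced only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    chrom: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive (not stated in the record).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'chrom:start..end' string (hypothetical helper)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end = match.groups()
    return Location(chrom, int(start), int(end))


loc = parse_location("chr6:12216195..12219638")
print(loc.chrom, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption the locus spans 3444 bp.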


Homology
GenBank top hits (e value, % identity, alignment)
AAR96002.1 retrotransposon-like protein [Musa acuminata] (e value: 1.6e-08; % identity: 31.9)
Query:  EITAVKLMEELTNSWETMKITVSNSTGNNTLKFSEVCDLAIVEEIRRQGSNKESTIGSASVMIKGKDKVDEDNEPSSSRKKWKSRNEVECYYCHKK----
        EI  + L+  L+ SWET  +T+ NST   TL    V D  + E+ RR+    ES+ G+     + +        P S R + KSR +++C++C+K     
Subjt:  EITAVKLMEELTNSWETMKITVSNSTGNNTLKFSEVCDLAIVEEIRRQGSNKESTIGSASVMIKGKDKVDEDNEPSSSRKKWKSRNEVECYYCHKK----

Query:  ----------DSATSVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGIRQCKLKFGSHVV
                  DS    H+ S    F S+T G  G VRMGN  T K  GIG    +   GS ++
Subjt:  ----------DSATSVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGIRQCKLKFGSHVV

KAG7593230.1 Pentatricopeptide repeat [Arabidopsis thaliana x Arabidopsis arenosa] (e value: 2.1e-08; % identity: 32)
Query:  EITAVKLMEELTNSWETMKITVSNSTGNNTLKFSEVCDLAIVEEIRRQGSNKESTIGSASVMIKGKD--KVDEDNEPSSSRK---KWKSRNEVECYYCHK
        E+ A+ L+  L NSWE M+  VSNS GN  LKF +V D  + EE+RR  + + ST  + +V  +G+D  + +++   S SR    + KSR  VEC+ C K
Subjt:  EITAVKLMEELTNSWETMKITVSNSTGNNTLKFSEVCDLAIVEEIRRQGSNKESTIGSASVMIKGKD--KVDEDNEPSSSRK---KWKSRNEVECYYCHK

Query:  K-------------------------DSATSVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGIRQCKLKFG
                                  DS  S H   DR++  ++  G++G V + NG      GIG    K+  G
Subjt:  K-------------------------DSATSVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGIRQCKLKFG

RVW24225.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] (e value: 1.2e-08; % identity: 31.71)
Query:  EITAVKLMEELTNSWETMKITVSNSTGNNTLKFSEVCDLAIVEEIRRQGSNKESTIGSA-SVMIKGKDKVDEDNEPSS-------SRKKWKSRNEVECYY
        EI A+ ++  L NSWE M++ VSNSTG   LK++++ DL + EEIRR+ + + S  GSA ++  +G+      N+  S       +R K +S  +V+C+ 
Subjt:  EITAVKLMEELTNSWETMKITVSNSTGNNTLKFSEVCDLAIVEEIRRQGSNKESTIGSA-SVMIKGKDKVDEDNEPSS-------SRKKWKSRNEVECYY

Query:  CHKK----------DSATSVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGIRQCKLKFGS
        C K           DS  S H    R +  ++  G  G V + +G      G+G  +  L  GS
Subjt:  CHKK----------DSATSVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGIRQCKLKFGS

RVX09487.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] (e value: 1.6e-08; % identity: 31.71)
Query:  EITAVKLMEELTNSWETMKITVSNSTGNNTLKFSEVCDLAIVEEIRRQGSNKESTIGSA-SVMIKGKDKVDEDNEPSS-------SRKKWKSRNEVECYY
        EI A+ ++  L NSWE M++ VSNSTG   LK++++ DL + EEIRR+ + + S  GSA ++  +G+      N+  S       +R K +S  +V+C+ 
Subjt:  EITAVKLMEELTNSWETMKITVSNSTGNNTLKFSEVCDLAIVEEIRRQGSNKESTIGSA-SVMIKGKDKVDEDNEPSS-------SRKKWKSRNEVECYY

Query:  CHKK----------DSATSVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGIRQCKLKFGS
        C K           DS  S H    R +  ++  G  G V + +G      G+G  +  L  GS
Subjt:  CHKK----------DSATSVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGIRQCKLKFGS

TKS15539.1 hypothetical protein D5086_0000032340 [Populus alba] (e value: 4.6e-08; % identity: 29.38)
Query:  EITAVKLMEELTNSWETMKITVSNSTGNNTLKFSEVCDLAIVEEIRRQGSNKESTIGSA-SVMIKGK--DKVDEDNEPSS---SRKKWKSRNEVECYYCH
        EI A+ L+  L +SWE M+  VSNS G + LK+ ++ DL + EE+RR+ S + S+ GSA ++  +G+  D+        S   S+ K+ SR +VEC+ C 
Subjt:  EITAVKLMEELTNSWETMKITVSNSTGNNTLKFSEVCDLAIVEEIRRQGSNKESTIGSA-SVMIKGK--DKVDEDNEPSS---SRKKWKSRNEVECYYCH

Query:  KK------------------------------------------DSATSVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGIRQCKLKFGS
        K                                           DS  S H      +  ++ GG HG+V + +G   K  GIG  Q K   GS
Subjt:  KK------------------------------------------DSATSVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGIRQCKLKFGS

TrEMBL top hits (e value, % identity, alignment)
A0A438CLZ5 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 5.8e-09; % identity: 31.71)
Query:  EITAVKLMEELTNSWETMKITVSNSTGNNTLKFSEVCDLAIVEEIRRQGSNKESTIGSA-SVMIKGKDKVDEDNEPSS-------SRKKWKSRNEVECYY
        EI A+ ++  L NSWE M++ VSNSTG   LK++++ DL + EEIRR+ + + S  GSA ++  +G+      N+  S       +R K +S  +V+C+ 
Subjt:  EITAVKLMEELTNSWETMKITVSNSTGNNTLKFSEVCDLAIVEEIRRQGSNKESTIGSA-SVMIKGKDKVDEDNEPSS-------SRKKWKSRNEVECYY

Query:  CHKK----------DSATSVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGIRQCKLKFGS
        C K           DS  S H    R +  ++  G  G V + +G      G+G  +  L  GS
Subjt:  CHKK----------DSATSVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGIRQCKLKFGS

A0A438JKL2 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 7.6e-09; % identity: 31.71)
Query:  EITAVKLMEELTNSWETMKITVSNSTGNNTLKFSEVCDLAIVEEIRRQGSNKESTIGSA-SVMIKGKDKVDEDNEPSS-------SRKKWKSRNEVECYY
        EI A+ ++  L NSWE M++ VSNSTG   LK++++ DL + EEIRR+ + + S  GSA ++  +G+      N+  S       +R K +S  +V+C+ 
Subjt:  EITAVKLMEELTNSWETMKITVSNSTGNNTLKFSEVCDLAIVEEIRRQGSNKESTIGSA-SVMIKGKDKVDEDNEPSS-------SRKKWKSRNEVECYY

Query:  CHKK----------DSATSVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGIRQCKLKFGS
        C K           DS  S H    R +  ++  G  G V + +G      G+G  +  L  GS
Subjt:  CHKK----------DSATSVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGIRQCKLKFGS

A0A4U5PY83 CCHC-type domain-containing protein (e value: 2.2e-08; % identity: 29.38)
Query:  EITAVKLMEELTNSWETMKITVSNSTGNNTLKFSEVCDLAIVEEIRRQGSNKESTIGSA-SVMIKGK--DKVDEDNEPSS---SRKKWKSRNEVECYYCH
        EI A+ L+  L +SWE M+  VSNS G + LK+ ++ DL + EE+RR+ S + S+ GSA ++  +G+  D+        S   S+ K+ SR +VEC+ C 
Subjt:  EITAVKLMEELTNSWETMKITVSNSTGNNTLKFSEVCDLAIVEEIRRQGSNKESTIGSA-SVMIKGK--DKVDEDNEPSS---SRKKWKSRNEVECYYCH

Query:  KK------------------------------------------DSATSVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGIRQCKLKFGS
        K                                           DS  S H      +  ++ GG HG+V + +G   K  GIG  Q K   GS
Subjt:  KK------------------------------------------DSATSVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGIRQCKLKFGS

A0A4U5QGR0 Uncharacterized protein (e value: 2.2e-08; % identity: 29.38)
Query:  EITAVKLMEELTNSWETMKITVSNSTGNNTLKFSEVCDLAIVEEIRRQGSNKESTIGSA-SVMIKGK--DKVDEDNEPSS---SRKKWKSRNEVECYYCH
        EI A+ L+  L +SWE M+  VSNS G + LK+ ++ DL + EE+RR+ S + S+ GSA ++  +G+  D+        S   S+ K+ SR +VEC+ C 
Subjt:  EITAVKLMEELTNSWETMKITVSNSTGNNTLKFSEVCDLAIVEEIRRQGSNKESTIGSA-SVMIKGK--DKVDEDNEPSS---SRKKWKSRNEVECYYCH

Query:  KK------------------------------------------DSATSVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGIRQCKLKFGS
        K                                           DS  S H      +  ++ GG HG+V + +G   K  GIG  Q K   GS
Subjt:  KK------------------------------------------DSATSVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGIRQCKLKFGS

A0A7N2R0F3 Integrase catalytic domain-containing protein (e value: 2.6e-09; % identity: 32.32)
Query:  EITAVKLMEELTNSWETMKITVSNSTGNNTLKFSEVCDLAIVEEIRRQGSNKESTIGSA-SVMIKGKDKVDEDNEPSS-------SRKKWKSRNEVECYY
        EI A+ ++  L NSWE M++TVSNSTG   LK++++ DL + EEIRR+ + + S  GSA ++  +G+      N   S       +R K +S  +V+C+ 
Subjt:  EITAVKLMEELTNSWETMKITVSNSTGNNTLKFSEVCDLAIVEEIRRQGSNKESTIGSA-SVMIKGKDKVDEDNEPSS-------SRKKWKSRNEVECYY

Query:  CHKK----------DSATSVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGIRQCKLKFGS
        C K           DS  S H    R +  ++  G  G V + +G      G+G  +  L  GS
Subjt:  CHKK----------DSATSVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGIRQCKLKFGS

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found
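
The alignments above carry a middle line between `Query:` and `Subjt:`. The sketch below counts identities and positives from such a line. It assumes the usual BLAST-style convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap); `midline_identity` is a hypothetical helper, not part of any tool used to produce this page. The percentage matches a reported % identity only when the full alignment is supplied.

```python
from typing import Tuple


def midline_identity(midline: str) -> Tuple[int, int, float]:
    """Return (identities, positives, percent identity) for one midline.

    Assumption: letters are identities, '+' are conservative substitutions,
    spaces are mismatches or gaps; the denominator is the midline length.
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    percent = 100.0 * identities / len(midline) if midline else 0.0
    return identities, positives, percent


# First eight columns of the AAR96002.1 alignment
# (query 'EITAVKLM', midline 'EI  + L+').
result = midline_identity("EI  + L+")
print(result)
```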

Sequences
CDS sequence
ATGTGTTTGTCTATGGATGTGGCAAGTCTAGTAGCCCATGAGATAACTGCAGTCAAGTTGATGGAAGAGCTTACAAACAGTTGGGAAACGATGAAGATAACAGTGTCTAA
TTCGACTGGAAATAACACTTTAAAATTTTCAGAAGTTTGTGATTTAGCCATAGTTGAGGAAATTCGTAGGCAGGGTAGTAATAAAGAGTCTACGATAGGGTCAGCTTCGG
TTATGATTAAGGGTAAAGATAAGGTTGATGAAGATAATGAACCGAGTAGCAGTAGGAAAAAGTGGAAAAGTAGGAATGAGGTAGAATGTTATTACTGCCATAAGAAAGAT
AGTGCAACTTCTGTTCACATAGCTTCAGATAGGAGTTTATTCACATCATTCACAGGAGGGCATCATGGCCTAGTGAGGATGGGGAATGGTAGAACCTCCAAGACTAGAGG
GATTGGAATTCGCCAGTGTAAACTCAAGTTCGGATCCCATGTAGTGGCAGTTGGTCACAGGAAATCTACACTGTACAGATGTCAGTTGAATGTTGCCAAAGGTTCAAAGA
GACGGTGGATGCCAGTTAAAGCTGCAGATGGTAGTTGTAGAGGTACAGTTGAGCCAGCAGCAAGGATAGCCAATTTCGATCATTCCGATCAAGATCCTTCAAAGTCGCTC
AAGCGAGTTGAGGCATCAAAGTGGAAGGCCAGAACAGTTGCTAAGGTCAAAGGTCAGGTCTCTAGCTTGGTAATAGGTTTGAATAGAGGATTCAAGTCATTATCAGAGTG
TATCTTCTTCAGGAACAGTTGTTCGGGTTGGAAGAAGATGATAGTGGGAGACGAGATCATTGTTTTGCCTCCAAGTGGGAAATTGTCAAGATTCTTGATAAACCTGTCAT
GGCTGGCCGAAGCAACTAGACCCGACACTGTGGACACTACTAAGGAGGTATTAAACCTTAATGAGCTGATAGAGTCATCGTCTTGTTCTCCACAATGTTCCAAATCTCTA
ACGACTGCAGCAGTCAAGACAAAAATGAAGGGGACGGGTGGGGAGTGTTCGGATAGGAGGACTTGCCTGGTAACATCCAATGACGAGCAAAGTAGAATAGGTGGGATGAA
AAACGACAGAGTTGAACTTATTGCCGTTACAGCTCAAATCGTGAATGGTTTCACCCTCACTCCCAGAAGCAAGCGTCCATACTCAAATAGAATCCTCACTTACGGACGCC
AGGAACTCTCCAGAAGGATCCCAGCAAAGAGAATGAACCATTTTGGTATGTCCCTATACTTGGAAGCACACATTGCAACCAAAATGACGAGCTTGTGTCTCGACATAAAA
AATGGATACGACATTATCTATAGCAGGCACCGAAGTACCTTCCATAACGAGGTTGAAATCTCAGCTGACCTGTACCACCCTAATTTACACAAGCCATTTAGAACCACCAT
TTTTCAATACTCGACCAAAAAGTAG
mRNA sequence
ATGTGTTTGTCTATGGATGTGGCAAGTCTAGTAGCCCATGAGATAACTGCAGTCAAGTTGATGGAAGAGCTTACAAACAGTTGGGAAACGATGAAGATAACAGTGTCTAA
TTCGACTGGAAATAACACTTTAAAATTTTCAGAAGTTTGTGATTTAGCCATAGTTGAGGAAATTCGTAGGCAGGGTAGTAATAAAGAGTCTACGATAGGGTCAGCTTCGG
TTATGATTAAGGGTAAAGATAAGGTTGATGAAGATAATGAACCGAGTAGCAGTAGGAAAAAGTGGAAAAGTAGGAATGAGGTAGAATGTTATTACTGCCATAAGAAAGAT
AGTGCAACTTCTGTTCACATAGCTTCAGATAGGAGTTTATTCACATCATTCACAGGAGGGCATCATGGCCTAGTGAGGATGGGGAATGGTAGAACCTCCAAGACTAGAGG
GATTGGAATTCGCCAGTGTAAACTCAAGTTCGGATCCCATGTAGTGGCAGTTGGTCACAGGAAATCTACACTGTACAGATGTCAGTTGAATGTTGCCAAAGGTTCAAAGA
GACGGTGGATGCCAGTTAAAGCTGCAGATGGTAGTTGTAGAGGTACAGTTGAGCCAGCAGCAAGGATAGCCAATTTCGATCATTCCGATCAAGATCCTTCAAAGTCGCTC
AAGCGAGTTGAGGCATCAAAGTGGAAGGCCAGAACAGTTGCTAAGGTCAAAGGTCAGGTCTCTAGCTTGGTAATAGGTTTGAATAGAGGATTCAAGTCATTATCAGAGTG
TATCTTCTTCAGGAACAGTTGTTCGGGTTGGAAGAAGATGATAGTGGGAGACGAGATCATTGTTTTGCCTCCAAGTGGGAAATTGTCAAGATTCTTGATAAACCTGTCAT
GGCTGGCCGAAGCAACTAGACCCGACACTGTGGACACTACTAAGGAGGTATTAAACCTTAATGAGCTGATAGAGTCATCGTCTTGTTCTCCACAATGTTCCAAATCTCTA
ACGACTGCAGCAGTCAAGACAAAAATGAAGGGGACGGGTGGGGAGTGTTCGGATAGGAGGACTTGCCTGGTAACATCCAATGACGAGCAAAGTAGAATAGGTGGGATGAA
AAACGACAGAGTTGAACTTATTGCCGTTACAGCTCAAATCGTGAATGGTTTCACCCTCACTCCCAGAAGCAAGCGTCCATACTCAAATAGAATCCTCACTTACGGACGCC
AGGAACTCTCCAGAAGGATCCCAGCAAAGAGAATGAACCATTTTGGTATGTCCCTATACTTGGAAGCACACATTGCAACCAAAATGACGAGCTTGTGTCTCGACATAAAA
AATGGATACGACATTATCTATAGCAGGCACCGAAGTACCTTCCATAACGAGGTTGAAATCTCAGCTGACCTGTACCACCCTAATTTACACAAGCCATTTAGAACCACCAT
TTTTCAATACTCGACCAAAAAGTAG
Protein sequence
MCLSMDVASLVAHEITAVKLMEELTNSWETMKITVSNSTGNNTLKFSEVCDLAIVEEIRRQGSNKESTIGSASVMIKGKDKVDEDNEPSSSRKKWKSRNEVECYYCHKKD
SATSVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGIRQCKLKFGSHVVAVGHRKSTLYRCQLNVAKGSKRRWMPVKAADGSCRGTVEPAARIANFDHSDQDPSKSL
KRVEASKWKARTVAKVKGQVSSLVIGLNRGFKSLSECIFFRNSCSGWKKMIVGDEIIVLPPSGKLSRFLINLSWLAEATRPDTVDTTKEVLNLNELIESSSCSPQCSKSL
TTAAVKTKMKGTGGECSDRRTCLVTSNDEQSRIGGMKNDRVELIAVTAQIVNGFTLTPRSKRPYSNRILTYGRQELSRRIPAKRMNHFGMSLYLEAHIATKMTSLCLDIK
NGYDIIYSRHRSTFHNEVEISADLYHPNLHKPFRTTIFQYSTKK
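
The CDS and protein sequences above can be checked against each other by translating the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1), which the record does not state explicitly; `translate` is a hypothetical helper. Only the first 42 and last 24 nucleotides of the CDS are used here, and they yield the start and end of the listed protein.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), built in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 42 nt and last 24 nt of the CDS listed above.
head = translate("ATGTGTTTGTCTATGGATGTGGCAAGTCTAGTAGCCCATGAG")
tail = translate("TTTCAATACTCGACCAAAAAGTAG")
print(head, tail)
```

Passing the full CDS (with line breaks removed) to `translate` allows the whole protein to be compared the same way.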