CuGenDBv2

Lag0041225 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0041225
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr13:13983355..13984406
RNA-Seq Expression: Lag0041225
Synteny: Lag0041225
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits: e value, % identity, alignment
TXG55646.1 hypothetical protein EZV62_020902 [Acer yangbiense] (e value: 3.8e-25; % identity: 39.33)
Query:  HFLDGLDSEYLPIMCQINANDKMTWQEVHATLITFENTQLRLNLIQHDSD-STNPTANYVANRSNISFRPSQQSYNRSFERGVTTLRNQ-------GRGR
        + L GLDSEY+PI+  I A +  TWQE++ TL+++++    +N +    +  ++P+A+   N+ N +   ++ S  ++  +G     N+       GR R
Subjt:  HFLDGLDSEYLPIMCQINANDKMTWQEVHATLITFENTQLRLNLIQHDSD-STNPTANYVANRSNISFRPSQQSYNRSFERGVTTLRNQ-------GRGR

Query:  GRGGRSNHWKTANARATCQVCGRIGHSAAICYYRYSNNYTEG-PTPIETNNKFAANIATPAIVADQAWYADSGATNHL
        GRGGR+N     N+R TCQVCG+ GHSA++CY+RY +NY    PT     N  +  +ATP  V D  WYADSGATNH+
Subjt:  GRGGRSNHWKTANARATCQVCGRIGHSAAICYYRYSNNYTEG-PTPIETNNKFAANIATPAIVADQAWYADSGATNHL

TXG69253.1 hypothetical protein EZV62_004188 [Acer yangbiense] (e value: 5.0e-25; % identity: 39.77)
Query:  LDGLDSEYLPIMCQINANDKMTWQEVHATLITFENTQLRLNLIQHDSD-STNPTANYVANRSNISFRPSQQSYNRSFERGVTTLRNQ-------GRGRGR
        L GLDSEY+PI+  I A +  TWQE++ TL+++++    +N +    +  ++P+A+   N+ N +   ++ S  ++  +G     N+       GR RGR
Subjt:  LDGLDSEYLPIMCQINANDKMTWQEVHATLITFENTQLRLNLIQHDSD-STNPTANYVANRSNISFRPSQQSYNRSFERGVTTLRNQ-------GRGRGR

Query:  GGRSNHWKTANARATCQVCGRIGHSAAICYYRYSNNYTEG-PTPIETNNKFAANIATPAIVADQAWYADSGATNHL
        GGR+N     N+R TCQVCG+ GHSA++CY+RY +NY    PT     N  +  +ATP  V D  WYADSGATNH+
Subjt:  GGRSNHWKTANARATCQVCGRIGHSAAICYYRYSNNYTEG-PTPIETNNKFAANIATPAIVADQAWYADSGATNHL

XP_022142770.1 uncharacterized protein LOC111012809 [Momordica charantia] (e value: 1.1e-27; % identity: 46.11)
Query:  LDGLDSEYLPIMCQINANDKMTWQEVHATLITFENTQLRLNLIQHDSDSTNPTANYVANRS-NISFRPSQQSYNRSFERGVTTLRNQGRGRGRGGRSNHW
        L GL++EYL I+CQINA + ++WQEVHATLITFEN  + LN +   +D + P+ANY  N+S + ++ P QQ   R   R         RGR RGGR    
Subjt:  LDGLDSEYLPIMCQINANDKMTWQEVHATLITFENTQLRLNLIQHDSDSTNPTANYVANRS-NISFRPSQQSYNRSFERGVTTLRNQGRGRGRGGRSNHW

Query:  KTANARATCQVCGRIGHSAAICYYRYSNNYTEGPTPIETNNKFAANIATPAIVADQAWYADSGATNH
        ++ ++R TCQVCG+IGH A +CY+R +  Y  G TP   N    A I  P ++ D  W  DSGATNH
Subjt:  KTANARATCQVCGRIGHSAAICYYRYSNNYTEGPTPIETNNKFAANIATPAIVADQAWYADSGATNH

XP_022157748.1 uncharacterized protein LOC111024384 isoform X1 [Momordica charantia] (e value: 1.1e-24; % identity: 41.38)
Query:  LDGLDSEYLPIMCQINANDKMTWQEVHATLITFENTQLRLNLIQHDSDS--TNPTANYVANRSN-ISFRPSQQSYNRSFE-RG---VTTLRNQGRGRGRG
        L GL++EYLPI+CQI   D  +WQE+ ATL+TFENT +RLN++   +    ++ + NYV ++ N +  R   QS +   + RG       +N  RGRGR 
Subjt:  LDGLDSEYLPIMCQINANDKMTWQEVHATLITFENTQLRLNLIQHDSDS--TNPTANYVANRSN-ISFRPSQQSYNRSFE-RG---VTTLRNQGRGRGRG

Query:  GRSNHWKTANARATCQVCGRIGHSAAICYYRYSNNYTEGPTPIETNNKFAANIATPAIVADQAWYADSGATNHL
        GR + ++  N++ +CQ+CG+ GH AA+CY R+  N+         NN+ +A +A P IVA+ +W ADSGAT+H+
Subjt:  GRSNHWKTANARATCQVCGRIGHSAAICYYRYSNNYTEGPTPIETNNKFAANIATPAIVADQAWYADSGATNHL

XP_022157750.1 uncharacterized protein LOC111024384 isoform X2 [Momordica charantia] (e value: 1.1e-24; % identity: 41.38)
Query:  LDGLDSEYLPIMCQINANDKMTWQEVHATLITFENTQLRLNLIQHDSDS--TNPTANYVANRSN-ISFRPSQQSYNRSFE-RG---VTTLRNQGRGRGRG
        L GL++EYLPI+CQI   D  +WQE+ ATL+TFENT +RLN++   +    ++ + NYV ++ N +  R   QS +   + RG       +N  RGRGR 
Subjt:  LDGLDSEYLPIMCQINANDKMTWQEVHATLITFENTQLRLNLIQHDSDS--TNPTANYVANRSN-ISFRPSQQSYNRSFE-RG---VTTLRNQGRGRGRG

Query:  GRSNHWKTANARATCQVCGRIGHSAAICYYRYSNNYTEGPTPIETNNKFAANIATPAIVADQAWYADSGATNHL
        GR + ++  N++ +CQ+CG+ GH AA+CY R+  N+         NN+ +A +A P IVA+ +W ADSGAT+H+
Subjt:  GRSNHWKTANARATCQVCGRIGHSAAICYYRYSNNYTEGPTPIETNNKFAANIATPAIVADQAWYADSGATNHL

TrEMBL top hits: e value, % identity, alignment
A0A5C7HHE9 Uncharacterized protein (e value: 1.8e-25; % identity: 39.33)
Query:  HFLDGLDSEYLPIMCQINANDKMTWQEVHATLITFENTQLRLNLIQHDSD-STNPTANYVANRSNISFRPSQQSYNRSFERGVTTLRNQ-------GRGR
        + L GLDSEY+PI+  I A +  TWQE++ TL+++++    +N +    +  ++P+A+   N+ N +   ++ S  ++  +G     N+       GR R
Subjt:  HFLDGLDSEYLPIMCQINANDKMTWQEVHATLITFENTQLRLNLIQHDSD-STNPTANYVANRSNISFRPSQQSYNRSFERGVTTLRNQ-------GRGR

Query:  GRGGRSNHWKTANARATCQVCGRIGHSAAICYYRYSNNYTEG-PTPIETNNKFAANIATPAIVADQAWYADSGATNHL
        GRGGR+N     N+R TCQVCG+ GHSA++CY+RY +NY    PT     N  +  +ATP  V D  WYADSGATNH+
Subjt:  GRGGRSNHWKTANARATCQVCGRIGHSAAICYYRYSNNYTEG-PTPIETNNKFAANIATPAIVADQAWYADSGATNHL

A0A5C7IJ06 Uncharacterized protein (e value: 2.4e-25; % identity: 39.77)
Query:  LDGLDSEYLPIMCQINANDKMTWQEVHATLITFENTQLRLNLIQHDSD-STNPTANYVANRSNISFRPSQQSYNRSFERGVTTLRNQ-------GRGRGR
        L GLDSEY+PI+  I A +  TWQE++ TL+++++    +N +    +  ++P+A+   N+ N +   ++ S  ++  +G     N+       GR RGR
Subjt:  LDGLDSEYLPIMCQINANDKMTWQEVHATLITFENTQLRLNLIQHDSD-STNPTANYVANRSNISFRPSQQSYNRSFERGVTTLRNQ-------GRGRGR

Query:  GGRSNHWKTANARATCQVCGRIGHSAAICYYRYSNNYTEG-PTPIETNNKFAANIATPAIVADQAWYADSGATNHL
        GGR+N     N+R TCQVCG+ GHSA++CY+RY +NY    PT     N  +  +ATP  V D  WYADSGATNH+
Subjt:  GGRSNHWKTANARATCQVCGRIGHSAAICYYRYSNNYTEG-PTPIETNNKFAANIATPAIVADQAWYADSGATNHL

A0A6J1CLV9 uncharacterized protein LOC111012809 (e value: 5.2e-28; % identity: 46.11)
Query:  LDGLDSEYLPIMCQINANDKMTWQEVHATLITFENTQLRLNLIQHDSDSTNPTANYVANRS-NISFRPSQQSYNRSFERGVTTLRNQGRGRGRGGRSNHW
        L GL++EYL I+CQINA + ++WQEVHATLITFEN  + LN +   +D + P+ANY  N+S + ++ P QQ   R   R         RGR RGGR    
Subjt:  LDGLDSEYLPIMCQINANDKMTWQEVHATLITFENTQLRLNLIQHDSDSTNPTANYVANRS-NISFRPSQQSYNRSFERGVTTLRNQGRGRGRGGRSNHW

Query:  KTANARATCQVCGRIGHSAAICYYRYSNNYTEGPTPIETNNKFAANIATPAIVADQAWYADSGATNH
        ++ ++R TCQVCG+IGH A +CY+R +  Y  G TP   N    A I  P ++ D  W  DSGATNH
Subjt:  KTANARATCQVCGRIGHSAAICYYRYSNNYTEGPTPIETNNKFAANIATPAIVADQAWYADSGATNH

A0A6J1DTZ7 uncharacterized protein LOC111024384 isoform X2 (e value: 5.4e-25; % identity: 41.38)
Query:  LDGLDSEYLPIMCQINANDKMTWQEVHATLITFENTQLRLNLIQHDSDS--TNPTANYVANRSN-ISFRPSQQSYNRSFE-RG---VTTLRNQGRGRGRG
        L GL++EYLPI+CQI   D  +WQE+ ATL+TFENT +RLN++   +    ++ + NYV ++ N +  R   QS +   + RG       +N  RGRGR 
Subjt:  LDGLDSEYLPIMCQINANDKMTWQEVHATLITFENTQLRLNLIQHDSDS--TNPTANYVANRSN-ISFRPSQQSYNRSFE-RG---VTTLRNQGRGRGRG

Query:  GRSNHWKTANARATCQVCGRIGHSAAICYYRYSNNYTEGPTPIETNNKFAANIATPAIVADQAWYADSGATNHL
        GR + ++  N++ +CQ+CG+ GH AA+CY R+  N+         NN+ +A +A P IVA+ +W ADSGAT+H+
Subjt:  GRSNHWKTANARATCQVCGRIGHSAAICYYRYSNNYTEGPTPIETNNKFAANIATPAIVADQAWYADSGATNHL

A0A6J1DU77 uncharacterized protein LOC111024384 isoform X1 (e value: 5.4e-25; % identity: 41.38)
Query:  LDGLDSEYLPIMCQINANDKMTWQEVHATLITFENTQLRLNLIQHDSDS--TNPTANYVANRSN-ISFRPSQQSYNRSFE-RG---VTTLRNQGRGRGRG
        L GL++EYLPI+CQI   D  +WQE+ ATL+TFENT +RLN++   +    ++ + NYV ++ N +  R   QS +   + RG       +N  RGRGR 
Subjt:  LDGLDSEYLPIMCQINANDKMTWQEVHATLITFENTQLRLNLIQHDSDS--TNPTANYVANRSN-ISFRPSQQSYNRSFE-RG---VTTLRNQGRGRGRG

Query:  GRSNHWKTANARATCQVCGRIGHSAAICYYRYSNNYTEGPTPIETNNKFAANIATPAIVADQAWYADSGATNHL
        GR + ++  N++ +CQ+CG+ GH AA+CY R+  N+         NN+ +A +A P IVA+ +W ADSGAT+H+
Subjt:  GRSNHWKTANARATCQVCGRIGHSAAICYYRYSNNYTEGPTPIETNNKFAANIATPAIVADQAWYADSGATNHL

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGGCGTCTGCAAGTTCATCAAGTGTTCATCAAGTTGGGGGCAATATTGCTCAACCATTTGGCAATCCTTTCAACTCAATACTGACAGTAAAGTTGGATGAGAAATTTTT
ACATTGTGGAAATCAATTATTTTGGCTATCCTACGTGGTCAAAAACTTGAAAATTTTGTCCTTTGAACTTCAAACCCACCTGCTGAATTTCTACAAACTGAAGAAAATTC
AGCACCCATTTCCAATCCAGAATATCAAAAATGGATTGCCACAGATCAAACACTTCTTGGATGGTCTTGATAGTGAGTATTTGCCGATTATGTGCCAAATTAATGCAAAT
GACAAAATGACTTGGCAAGAAGTTCACGCAACACTCATCACATTTGAAAACACACAGTTGCGGTTGAATTTAATTCAACATGATTCAGACTCAACAAATCCAACAGCCAA
TTATGTTGCAAACAGAAGCAATATATCCTTCAGACCTTCACAACAGTCATACAACAGGTCTTTTGAAAGAGGAGTGACCACACTGCGAAACCAAGGCAGAGGACGAGGTC
GTGGAGGTAGATCTAATCACTGGAAAACAGCAAATGCTCGTGCAACTTGCCAAGTTTGTGGGCGAATTGGCCATTCTGCTGCCATCTGCTATTACAGATATAGTAATAAC
TACACTGAAGGTCCTACACCAATTGAAACCAACAACAAATTTGCAGCCAATATTGCAACTCCAGCCATAGTAGCAGATCAAGCATGGTATGCAGACAGTGGAGCCACAAA
TCACCTTAAACGAGTATAA
mRNA sequence
ATGGCGTCTGCAAGTTCATCAAGTGTTCATCAAGTTGGGGGCAATATTGCTCAACCATTTGGCAATCCTTTCAACTCAATACTGACAGTAAAGTTGGATGAGAAATTTTT
ACATTGTGGAAATCAATTATTTTGGCTATCCTACGTGGTCAAAAACTTGAAAATTTTGTCCTTTGAACTTCAAACCCACCTGCTGAATTTCTACAAACTGAAGAAAATTC
AGCACCCATTTCCAATCCAGAATATCAAAAATGGATTGCCACAGATCAAACACTTCTTGGATGGTCTTGATAGTGAGTATTTGCCGATTATGTGCCAAATTAATGCAAAT
GACAAAATGACTTGGCAAGAAGTTCACGCAACACTCATCACATTTGAAAACACACAGTTGCGGTTGAATTTAATTCAACATGATTCAGACTCAACAAATCCAACAGCCAA
TTATGTTGCAAACAGAAGCAATATATCCTTCAGACCTTCACAACAGTCATACAACAGGTCTTTTGAAAGAGGAGTGACCACACTGCGAAACCAAGGCAGAGGACGAGGTC
GTGGAGGTAGATCTAATCACTGGAAAACAGCAAATGCTCGTGCAACTTGCCAAGTTTGTGGGCGAATTGGCCATTCTGCTGCCATCTGCTATTACAGATATAGTAATAAC
TACACTGAAGGTCCTACACCAATTGAAACCAACAACAAATTTGCAGCCAATATTGCAACTCCAGCCATAGTAGCAGATCAAGCATGGTATGCAGACAGTGGAGCCACAAA
TCACCTTAAACGAGTATAA
Protein sequence
MASASSSSVHQVGGNIAQPFGNPFNSILTVKLDEKFLHCGNQLFWLSYVVKNLKILSFELQTHLLNFYKLKKIQHPFPIQNIKNGLPQIKHFLDGLDSEYLPIMCQINAN
DKMTWQEVHATLITFENTQLRLNLIQHDSDSTNPTANYVANRSNISFRPSQQSYNRSFERGVTTLRNQGRGRGRGGRSNHWKTANARATCQVCGRIGHSAAICYYRYSNN
YTEGPTPIETNNKFAANIATPAIVADQAWYADSGATNHLKRV