; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0001933 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0001933
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationchr4:37219610..37220239
RNA-Seq ExpressionLag0001933
SyntenyLag0001933
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0056839.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.3e-4456.99Show/hide
Query:  FFKKSIVNRNVNETYIALISKKVNSSKVSDYRPISLTTT-------SLTERLKPTLIETIAENQSAFIKERQIINVILIANEAVDMWKCAKKKGIIIKLD
        F K  IVN NVN T+IALISKK   SK SDYRPISLTT+       +L  RLK  L +TIAENQ AFIK RQI + ILIANEA+D WK  K KG ++KLD
Subjt:  FFKKSIVNRNVNETYIALISKKVNSSKVSDYRPISLTTT-------SLTERLKPTLIETIAENQSAFIKERQIINVILIANEAVDMWKCAKKKGIIIKLD

Query:  VEKAFHKISWDFINTILFYKGYPPMWRKWIRGCISSVSYSIILNGKLRGNILVKRGIRQGDP-SP-LSSLSLPWITLVDSLMKLKG
        +EKAF KISW FI+ +L  K +P  WRKWI+ CIS+V YSI+LNG  +G I  +RGIRQGDP SP +  L++ +++ + S ++ KG
Subjt:  VEKAFHKISWDFINTILFYKGYPPMWRKWIRGCISSVSYSIILNGKLRGNILVKRGIRQGDP-SP-LSSLSLPWITLVDSLMKLKG

KAA0057507.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]7.7e-4556.99Show/hide
Query:  FFKKSIVNRNVNETYIALISKKVNSSKVSDYRPISLTTT-------SLTERLKPTLIETIAENQSAFIKERQIINVILIANEAVDMWKCAKKKGIIIKLD
        F K  IVN NVN T+IALISKK   SK SDYRPISLTT+       +L  RLK  L +TIAENQ AFIK RQI + ILIANEA+D WK  K KG ++KLD
Subjt:  FFKKSIVNRNVNETYIALISKKVNSSKVSDYRPISLTTT-------SLTERLKPTLIETIAENQSAFIKERQIINVILIANEAVDMWKCAKKKGIIIKLD

Query:  VEKAFHKISWDFINTILFYKGYPPMWRKWIRGCISSVSYSIILNGKLRGNILVKRGIRQGDP-SP-LSSLSLPWITLVDSLMKLKG
        +EKAF KISW FI+ +L  K +P  WRKWI+ CIS+V YSI+LNG  +G I  +RGIRQGDP SP +  L++ +++ + S ++ KG
Subjt:  VEKAFHKISWDFINTILFYKGYPPMWRKWIRGCISSVSYSIILNGKLRGNILVKRGIRQGDP-SP-LSSLSLPWITLVDSLMKLKG

KAA0058554.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]4.5e-4558.18Show/hide
Query:  SMFFFKKSIVNRNVNETYIALISKKVNSSKVSDYRPISLTTT-------SLTERLKPTLIETIAENQSAFIKERQIINVILIANEAVDMWKCAKKKGIII
        S+ FF+K ++N+N+N TYIALI+KK + S   D+RPISLTT+       +L+ RLK TL +TI+ NQ AFIK RQI + IL+ANEA+D WK  K KG I+
Subjt:  SMFFFKKSIVNRNVNETYIALISKKVNSSKVSDYRPISLTTT-------SLTERLKPTLIETIAENQSAFIKERQIINVILIANEAVDMWKCAKKKGIII

Query:  KLDVEKAFHKISWDFINTILFYKGYPPMWRKWIRGCISSVSYSIILNGKLRGNILVKRGIRQGDP
        KLD+EKAF+ ++WDFI+ +L  K YP  WRKWIRGCIS+V+YSII+NGK +G I   RG+RQGDP
Subjt:  KLDVEKAFHKISWDFINTILFYKGYPPMWRKWIRGCISSVSYSIILNGKLRGNILVKRGIRQGDP

KAA0063661.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]3.4e-4559.26Show/hide
Query:  FFKKSIVNRNVNETYIALISKKVNSSKVSDYRPISLTTT-------SLTERLKPTLIETIAENQSAFIKERQIINVILIANEAVDMWKCAKKKGIIIKLD
        F +K ++N+N+N T+IALI+KK N S   D+RPISLTT+       +L+ RLK TL +TI+ NQ AFIK RQI + IL+ANEA+D WK  K KG I+KLD
Subjt:  FFKKSIVNRNVNETYIALISKKVNSSKVSDYRPISLTTT-------SLTERLKPTLIETIAENQSAFIKERQIINVILIANEAVDMWKCAKKKGIIIKLD

Query:  VEKAFHKISWDFINTILFYKGYPPMWRKWIRGCISSVSYSIILNGKLRGNILVKRGIRQGDP
        +EKAF  +SWDFI+ +L  K YPP WRKWIRGCIS+V+YSII+NGK +G I   RG+RQGDP
Subjt:  VEKAFHKISWDFINTILFYKGYPPMWRKWIRGCISSVSYSIILNGKLRGNILVKRGIRQGDP

TYK08190.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]2.2e-4456.45Show/hide
Query:  FFKKSIVNRNVNETYIALISKKVNSSKVSDYRPISLTTT-------SLTERLKPTLIETIAENQSAFIKERQIINVILIANEAVDMWKCAKKKGIIIKLD
        F K  IVN NVN T+IALISKK   SK SDYRPISLTT+       +L  RLK  L +TIAENQ AFIK RQI + ILIANE +D WK  K KG ++KLD
Subjt:  FFKKSIVNRNVNETYIALISKKVNSSKVSDYRPISLTTT-------SLTERLKPTLIETIAENQSAFIKERQIINVILIANEAVDMWKCAKKKGIIIKLD

Query:  VEKAFHKISWDFINTILFYKGYPPMWRKWIRGCISSVSYSIILNGKLRGNILVKRGIRQGDP-SP-LSSLSLPWITLVDSLMKLKG
        +EKAF KISW FI+ +L  K +P  WRKWI+ CIS+V YSI+LNG  +G I  +RGIRQGDP SP +  L++ +++ + S ++ KG
Subjt:  VEKAFHKISWDFINTILFYKGYPPMWRKWIRGCISSVSYSIILNGKLRGNILVKRGIRQGDP-SP-LSSLSLPWITLVDSLMKLKG

TrEMBL top hitse value%identityAlignment
A0A5A7US62 LINE-1 retrotransposable element ORF2 protein3.7e-4556.99Show/hide
Query:  FFKKSIVNRNVNETYIALISKKVNSSKVSDYRPISLTTT-------SLTERLKPTLIETIAENQSAFIKERQIINVILIANEAVDMWKCAKKKGIIIKLD
        F K  IVN NVN T+IALISKK   SK SDYRPISLTT+       +L  RLK  L +TIAENQ AFIK RQI + ILIANEA+D WK  K KG ++KLD
Subjt:  FFKKSIVNRNVNETYIALISKKVNSSKVSDYRPISLTTT-------SLTERLKPTLIETIAENQSAFIKERQIINVILIANEAVDMWKCAKKKGIIIKLD

Query:  VEKAFHKISWDFINTILFYKGYPPMWRKWIRGCISSVSYSIILNGKLRGNILVKRGIRQGDP-SP-LSSLSLPWITLVDSLMKLKG
        +EKAF KISW FI+ +L  K +P  WRKWI+ CIS+V YSI+LNG  +G I  +RGIRQGDP SP +  L++ +++ + S ++ KG
Subjt:  VEKAFHKISWDFINTILFYKGYPPMWRKWIRGCISSVSYSIILNGKLRGNILVKRGIRQGDP-SP-LSSLSLPWITLVDSLMKLKG

A0A5A7UTI6 LINE-1 retrotransposable element ORF2 protein6.3e-4556.99Show/hide
Query:  FFKKSIVNRNVNETYIALISKKVNSSKVSDYRPISLTTT-------SLTERLKPTLIETIAENQSAFIKERQIINVILIANEAVDMWKCAKKKGIIIKLD
        F K  IVN NVN T+IALISKK   SK SDYRPISLTT+       +L  RLK  L +TIAENQ AFIK RQI + ILIANEA+D WK  K KG ++KLD
Subjt:  FFKKSIVNRNVNETYIALISKKVNSSKVSDYRPISLTTT-------SLTERLKPTLIETIAENQSAFIKERQIINVILIANEAVDMWKCAKKKGIIIKLD

Query:  VEKAFHKISWDFINTILFYKGYPPMWRKWIRGCISSVSYSIILNGKLRGNILVKRGIRQGDP-SP-LSSLSLPWITLVDSLMKLKG
        +EKAF KISW FI+ +L  K +P  WRKWI+ CIS+V YSI+LNG  +G I  +RGIRQGDP SP +  L++ +++ + S ++ KG
Subjt:  VEKAFHKISWDFINTILFYKGYPPMWRKWIRGCISSVSYSIILNGKLRGNILVKRGIRQGDP-SP-LSSLSLPWITLVDSLMKLKG

A0A5A7UTS4 LINE-1 retrotransposable element ORF2 protein2.2e-4558.18Show/hide
Query:  SMFFFKKSIVNRNVNETYIALISKKVNSSKVSDYRPISLTTT-------SLTERLKPTLIETIAENQSAFIKERQIINVILIANEAVDMWKCAKKKGIII
        S+ FF+K ++N+N+N TYIALI+KK + S   D+RPISLTT+       +L+ RLK TL +TI+ NQ AFIK RQI + IL+ANEA+D WK  K KG I+
Subjt:  SMFFFKKSIVNRNVNETYIALISKKVNSSKVSDYRPISLTTT-------SLTERLKPTLIETIAENQSAFIKERQIINVILIANEAVDMWKCAKKKGIII

Query:  KLDVEKAFHKISWDFINTILFYKGYPPMWRKWIRGCISSVSYSIILNGKLRGNILVKRGIRQGDP
        KLD+EKAF+ ++WDFI+ +L  K YP  WRKWIRGCIS+V+YSII+NGK +G I   RG+RQGDP
Subjt:  KLDVEKAFHKISWDFINTILFYKGYPPMWRKWIRGCISSVSYSIILNGKLRGNILVKRGIRQGDP

A0A5A7VD46 LINE-1 retrotransposable element ORF2 protein1.7e-4559.26Show/hide
Query:  FFKKSIVNRNVNETYIALISKKVNSSKVSDYRPISLTTT-------SLTERLKPTLIETIAENQSAFIKERQIINVILIANEAVDMWKCAKKKGIIIKLD
        F +K ++N+N+N T+IALI+KK N S   D+RPISLTT+       +L+ RLK TL +TI+ NQ AFIK RQI + IL+ANEA+D WK  K KG I+KLD
Subjt:  FFKKSIVNRNVNETYIALISKKVNSSKVSDYRPISLTTT-------SLTERLKPTLIETIAENQSAFIKERQIINVILIANEAVDMWKCAKKKGIIIKLD

Query:  VEKAFHKISWDFINTILFYKGYPPMWRKWIRGCISSVSYSIILNGKLRGNILVKRGIRQGDP
        +EKAF  +SWDFI+ +L  K YPP WRKWIRGCIS+V+YSII+NGK +G I   RG+RQGDP
Subjt:  VEKAFHKISWDFINTILFYKGYPPMWRKWIRGCISSVSYSIILNGKLRGNILVKRGIRQGDP

A0A5D3CA17 LINE-1 retrotransposable element ORF2 protein1.1e-4456.45Show/hide
Query:  FFKKSIVNRNVNETYIALISKKVNSSKVSDYRPISLTTT-------SLTERLKPTLIETIAENQSAFIKERQIINVILIANEAVDMWKCAKKKGIIIKLD
        F K  IVN NVN T+IALISKK   SK SDYRPISLTT+       +L  RLK  L +TIAENQ AFIK RQI + ILIANE +D WK  K KG ++KLD
Subjt:  FFKKSIVNRNVNETYIALISKKVNSSKVSDYRPISLTTT-------SLTERLKPTLIETIAENQSAFIKERQIINVILIANEAVDMWKCAKKKGIIIKLD

Query:  VEKAFHKISWDFINTILFYKGYPPMWRKWIRGCISSVSYSIILNGKLRGNILVKRGIRQGDP-SP-LSSLSLPWITLVDSLMKLKG
        +EKAF KISW FI+ +L  K +P  WRKWI+ CIS+V YSI+LNG  +G I  +RGIRQGDP SP +  L++ +++ + S ++ KG
Subjt:  VEKAFHKISWDFINTILFYKGYPPMWRKWIRGCISSVSYSIILNGKLRGNILVKRGIRQGDP-SP-LSSLSLPWITLVDSLMKLKG

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein7.8e-0833.72Show/hide
Query:  KKSIVNRNVNETYIALISKK-VNSSKVSDYRPISLTTTS-------LTERLKPTLIETIAENQSAFI-------KERQIINVILIANEAVDMWKCAKKKG
        K+ I+  +  E  I LI K   +++K  ++RPISL           L  R++  + + I  +Q  FI         R+ INVI   N A D      K  
Subjt:  KKSIVNRNVNETYIALISKK-VNSSKVSDYRPISLTTTS-------LTERLKPTLIETIAENQSAFI-------KERQIINVILIANEAVDMWKCAKKKG

Query:  IIIKLDVEKAFHKISWDFINTILFYKGYPPMWRKWIRGCISSVSYSIILNGKLRGNILVKRGIRQGDP-SPL
        +II +D EKAF KI   F+   L   G   M+ K IR      + +IILNG+      +K G RQG P SPL
Subjt:  IIIKLDVEKAFHKISWDFINTILFYKGYPPMWRKWIRGCISSVSYSIILNGKLRGNILVKRGIRQGDP-SPL

Arabidopsis top hitse value%identityAlignment
AT4G20520.1 RNA binding;RNA-directed DNA polymerases2.8e-0836.14Show/hide
Query:  LTERLKPTLIETIAENQSAFIKERQIINVILIANEAVDMWKCAKKKGI----IIKLDVEKAFHKISWDFINTILFYKGYPPMW
        + ERLKP +   I   Q++FI  R   + I+   EAV   +  +KKG+    ++KLD+EKA+ +I WD++   L   G+P +W
Subjt:  LTERLKPTLIETIAENQSAFIKERQIINVILIANEAVDMWKCAKKKGI----IIKLDVEKAFHKISWDFINTILFYKGYPPMW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAACACTCTACACTCTCAAGCCATCCATCATGTCTGTGTTCCATGTTTTTTTTCAAAAAGAGTATCGTGAACCGAAACGTCAATGAGACATACATTGCCCTCATTTC
AAAGAAGGTCAATTCTTCGAAGGTGTCTGACTATAGACCAATCAGCTTAACCACGACTAGTCTCACAGAAAGGCTTAAACCAACCCTCATTGAAACAATTGCTGAAAATC
AGTCAGCCTTTATTAAAGAGAGGCAAATCATAAACGTCATCTTAATAGCAAATGAGGCAGTAGATATGTGGAAATGCGCCAAGAAAAAGGGGATCATCATAAAGCTAGAT
GTCGAAAAAGCTTTTCATAAGATCAGCTGGGATTTCATCAACACAATCCTGTTCTACAAAGGTTATCCTCCCATGTGGCGAAAATGGATAAGAGGTTGTATCTCCTCAGT
TAGCTACTCCATCATCTTAAACGGCAAACTAAGAGGAAACATTTTAGTTAAAAGGGGCATTAGACAAGGTGATCCCTCTCCCCTTTCATCTTTGTCCTTGCCATGGATTA
CCTTAGTAGACTCATTAATGAAGCTGAAGGGAAAGGCCTCCTTGCTGGTGTTTCTATGGGTTCGGGAGAGCCCTCAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAACACTCTACACTCTCAAGCCATCCATCATGTCTGTGTTCCATGTTTTTTTTCAAAAAGAGTATCGTGAACCGAAACGTCAATGAGACATACATTGCCCTCATTTC
AAAGAAGGTCAATTCTTCGAAGGTGTCTGACTATAGACCAATCAGCTTAACCACGACTAGTCTCACAGAAAGGCTTAAACCAACCCTCATTGAAACAATTGCTGAAAATC
AGTCAGCCTTTATTAAAGAGAGGCAAATCATAAACGTCATCTTAATAGCAAATGAGGCAGTAGATATGTGGAAATGCGCCAAGAAAAAGGGGATCATCATAAAGCTAGAT
GTCGAAAAAGCTTTTCATAAGATCAGCTGGGATTTCATCAACACAATCCTGTTCTACAAAGGTTATCCTCCCATGTGGCGAAAATGGATAAGAGGTTGTATCTCCTCAGT
TAGCTACTCCATCATCTTAAACGGCAAACTAAGAGGAAACATTTTAGTTAAAAGGGGCATTAGACAAGGTGATCCCTCTCCCCTTTCATCTTTGTCCTTGCCATGGATTA
CCTTAGTAGACTCATTAATGAAGCTGAAGGGAAAGGCCTCCTTGCTGGTGTTTCTATGGGTTCGGGAGAGCCCTCAATGA
Protein sequenceShow/hide protein sequence
MEHSTLSSHPSCLCSMFFFKKSIVNRNVNETYIALISKKVNSSKVSDYRPISLTTTSLTERLKPTLIETIAENQSAFIKERQIINVILIANEAVDMWKCAKKKGIIIKLD
VEKAFHKISWDFINTILFYKGYPPMWRKWIRGCISSVSYSIILNGKLRGNILVKRGIRQGDPSPLSSLSLPWITLVDSLMKLKGKASLLVFLWVRESPQ