; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0007975 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0007975
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTransposon Tf2-1 polyprotein isoform X1
Genome locationchr9:9209377..9209922
RNA-Seq ExpressionLag0007975
SyntenyLag0007975
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036018.1 transposon Tf2-1 polyprotein isoform X1 [Cucumis melo var. makuwa]2.3e-1345.04Show/hide
Query:  ATLRRSEPSVITEGSAHKSKETIDAGENSGTNSEED-------RKVQTEAGEDKDNQSDRHKFKVERYFQIHKLSEIEKMTMVVISFEGVALNWYRAKEE
        +T R SE S  T  +  K  E ID G  +  N EE        +KV+      +D   D   F+ +RYFQIH+L++ EKMT+  ISFEG ALNWYRA+EE
Subjt:  ATLRRSEPSVITEGSAHKSKETIDAGENSGTNSEED-------RKVQTEAGEDKDNQSDRHKFKVERYFQIHKLSEIEKMTMVVISFEGVALNWYRAKEE

Query:  REPFKDWKDLKLRQLKRFRKKKTGTGLRPFL
        R+ FKDW +LK R L RFR  + G+    FL
Subjt:  REPFKDWKDLKLRQLKRFRKKKTGTGLRPFL

KAA0042012.1 transposon Tf2-1 polyprotein isoform X1 [Cucumis melo var. makuwa]8.8e-1336.92Show/hide
Query:  MTQKKIEERLEANEREMSEMKTS------IDEGMKDVRKALATLAAE--------------IATLRRSEPSVITEGSAH-------KSKE-TIDAGENSG
        M Q +IEERLE  ++E++ MK        I+  + D+ K + T+ ++              IA  R +      E +AH       K KE T      SG
Subjt:  MTQKKIEERLEANEREMSEMKTS------IDEGMKDVRKALATLAAE--------------IATLRRSEPSVITEGSAH-------KSKE-TIDAGENSG

Query:  TNSEEDR---KVQTE-----------------AGEDKDNQSDRHKFKVERYFQIHKLSEIEKMTMVVISFEGVALNWYRAKEEREPFKDWKDLKLRQLKR
         N+ +DR   K +TE                 AGED D+      F+ ERYFQIHKLS+ EKM +  ISF+G A NWYR++EERE F  W +LK R L R
Subjt:  TNSEEDR---KVQTE-----------------AGEDKDNQSDRHKFKVERYFQIHKLSEIEKMTMVVISFEGVALNWYRAKEEREPFKDWKDLKLRQLKR

Query:  FRKKKTGTGLRPFL
        FR  + GT L  FL
Subjt:  FRKKKTGTGLRPFL

KAA0050168.1 transposon Tf2-1 polyprotein isoform X1 [Cucumis melo var. makuwa]1.1e-1248.39Show/hide
Query:  NSEEDRKVQTEAGEDKDNQSDRHKFKVERYFQIHKLSEIEKMTMVVISFEGVALNWYRAKEEREPFKDWKDLKLRQLKRFRKKKTGTGLRPFL
        N+++ +  + E         D   F+V+RYFQIHKL+++EKMTM  ISF+G+ LNWYRA+EER+ FK+W DLK R L RFR  + G+    FL
Subjt:  NSEEDRKVQTEAGEDKDNQSDRHKFKVERYFQIHKLSEIEKMTMVVISFEGVALNWYRAKEEREPFKDWKDLKLRQLKRFRKKKTGTGLRPFL

KAA0066460.1 uncharacterized protein E6C27_scaffold21G005610 [Cucumis melo var. makuwa]3.9e-1336.13Show/hide
Query:  MTQKKIEERLEANEREMSEMKTSIDEGMKDVRKALATLAAEIATLRRSEPSVITEGSAHKS-------KETIDAGENSGTNSEEDRKVQTEAGEDKDNQS
        M   +IEERL+  ++E+S MK  I + +  +  +L  ++  +  L  +     TE + HKS       KE   +     T + E+ +V  +A  D +  +
Subjt:  MTQKKIEERLEANEREMSEMKTSIDEGMKDVRKALATLAAEIATLRRSEPSVITEGSAHKS-------KETIDAGENSGTNSEEDRKVQTEAGEDKDNQS

Query:  DRHKFK------------------VERYFQIHKLSEIEKMTMVVISFEGVALNWYRAKEEREPFKDWKDLKLRQLKRFRKKKTGTGLRPFL
        DR KFK                   ERYFQIHKL+E EKM +  ISF+G  LNWYR++EER+ F  W +LK R L RFR    GT L  FL
Subjt:  DRHKFK------------------VERYFQIHKLSEIEKMTMVVISFEGVALNWYRAKEEREPFKDWKDLKLRQLKRFRKKKTGTGLRPFL

TYJ98817.1 transposon Tf2-1 polyprotein isoform X1 [Cucumis melo var. makuwa]3.0e-1345.04Show/hide
Query:  ATLRRSEPSVITEGSAHKSKETIDAGENSGTNSEED-------RKVQTEAGEDKDNQSDRHKFKVERYFQIHKLSEIEKMTMVVISFEGVALNWYRAKEE
        +T R SE S  T  +  K  E ID G  +  N EE        +KV+      +D   D   F+ +RYFQIH+L++ EKMT+  ISFEG ALNWYRA+EE
Subjt:  ATLRRSEPSVITEGSAHKSKETIDAGENSGTNSEED-------RKVQTEAGEDKDNQSDRHKFKVERYFQIHKLSEIEKMTMVVISFEGVALNWYRAKEE

Query:  REPFKDWKDLKLRQLKRFRKKKTGTGLRPFL
        R+ FKDW +LK R L RFR  + G+    FL
Subjt:  REPFKDWKDLKLRQLKRFRKKKTGTGLRPFL

TrEMBL top hitse value%identityAlignment
A0A5A7SZK8 Transposon Tf2-1 polyprotein isoform X11.1e-1345.04Show/hide
Query:  ATLRRSEPSVITEGSAHKSKETIDAGENSGTNSEED-------RKVQTEAGEDKDNQSDRHKFKVERYFQIHKLSEIEKMTMVVISFEGVALNWYRAKEE
        +T R SE S  T  +  K  E ID G  +  N EE        +KV+      +D   D   F+ +RYFQIH+L++ EKMT+  ISFEG ALNWYRA+EE
Subjt:  ATLRRSEPSVITEGSAHKSKETIDAGENSGTNSEED-------RKVQTEAGEDKDNQSDRHKFKVERYFQIHKLSEIEKMTMVVISFEGVALNWYRAKEE

Query:  REPFKDWKDLKLRQLKRFRKKKTGTGLRPFL
        R+ FKDW +LK R L RFR  + G+    FL
Subjt:  REPFKDWKDLKLRQLKRFRKKKTGTGLRPFL

A0A5A7TF03 Transposon Tf2-1 polyprotein isoform X14.2e-1336.92Show/hide
Query:  MTQKKIEERLEANEREMSEMKTS------IDEGMKDVRKALATLAAE--------------IATLRRSEPSVITEGSAH-------KSKE-TIDAGENSG
        M Q +IEERLE  ++E++ MK        I+  + D+ K + T+ ++              IA  R +      E +AH       K KE T      SG
Subjt:  MTQKKIEERLEANEREMSEMKTS------IDEGMKDVRKALATLAAE--------------IATLRRSEPSVITEGSAH-------KSKE-TIDAGENSG

Query:  TNSEEDR---KVQTE-----------------AGEDKDNQSDRHKFKVERYFQIHKLSEIEKMTMVVISFEGVALNWYRAKEEREPFKDWKDLKLRQLKR
         N+ +DR   K +TE                 AGED D+      F+ ERYFQIHKLS+ EKM +  ISF+G A NWYR++EERE F  W +LK R L R
Subjt:  TNSEEDR---KVQTE-----------------AGEDKDNQSDRHKFKVERYFQIHKLSEIEKMTMVVISFEGVALNWYRAKEEREPFKDWKDLKLRQLKR

Query:  FRKKKTGTGLRPFL
        FR  + GT L  FL
Subjt:  FRKKKTGTGLRPFL

A0A5A7VH17 Uncharacterized protein1.9e-1336.13Show/hide
Query:  MTQKKIEERLEANEREMSEMKTSIDEGMKDVRKALATLAAEIATLRRSEPSVITEGSAHKS-------KETIDAGENSGTNSEEDRKVQTEAGEDKDNQS
        M   +IEERL+  ++E+S MK  I + +  +  +L  ++  +  L  +     TE + HKS       KE   +     T + E+ +V  +A  D +  +
Subjt:  MTQKKIEERLEANEREMSEMKTSIDEGMKDVRKALATLAAEIATLRRSEPSVITEGSAHKS-------KETIDAGENSGTNSEEDRKVQTEAGEDKDNQS

Query:  DRHKFK------------------VERYFQIHKLSEIEKMTMVVISFEGVALNWYRAKEEREPFKDWKDLKLRQLKRFRKKKTGTGLRPFL
        DR KFK                   ERYFQIHKL+E EKM +  ISF+G  LNWYR++EER+ F  W +LK R L RFR    GT L  FL
Subjt:  DRHKFK------------------VERYFQIHKLSEIEKMTMVVISFEGVALNWYRAKEEREPFKDWKDLKLRQLKRFRKKKTGTGLRPFL

A0A5D3BI70 Transposon Tf2-1 polyprotein isoform X11.5e-1345.04Show/hide
Query:  ATLRRSEPSVITEGSAHKSKETIDAGENSGTNSEED-------RKVQTEAGEDKDNQSDRHKFKVERYFQIHKLSEIEKMTMVVISFEGVALNWYRAKEE
        +T R SE S  T  +  K  E ID G  +  N EE        +KV+      +D   D   F+ +RYFQIH+L++ EKMT+  ISFEG ALNWYRA+EE
Subjt:  ATLRRSEPSVITEGSAHKSKETIDAGENSGTNSEED-------RKVQTEAGEDKDNQSDRHKFKVERYFQIHKLSEIEKMTMVVISFEGVALNWYRAKEE

Query:  REPFKDWKDLKLRQLKRFRKKKTGTGLRPFL
        R+ FKDW +LK R L RFR  + G+    FL
Subjt:  REPFKDWKDLKLRQLKRFRKKKTGTGLRPFL

A0A5D3CC95 Ty3/gypsy retrotransposon protein7.2e-1333.82Show/hide
Query:  MTQKKIEERLEANEREMSEMK------TSIDEGMKDVRKALATLAAEIATLRRSEPSVITEGSAHKSKETIDAGENSGTN----------------SEED
        M Q +IEERLE  ++E++ MK       +I+ G+ ++ K++  +  +    ++   ++I   S  +S  +  A E +G                  +E D
Subjt:  MTQKKIEERLEANEREMSEMK------TSIDEGMKDVRKALATLAAEIATLRRSEPSVITEGSAHKSKETIDAGENSGTN----------------SEED

Query:  RKVQTEAGEDK----DNQSDRHKFK------------------VERYFQIHKLSEIEKMTMVVISFEGVALNWYRAKEEREPFKDWKDLKLRQLKRFRKK
        R  +T+ GE +    D  SDR+KFK                   ERYFQIH+L+E EKM +  ISF+G ALNWYR++EER  F  W ++K R L RFR  
Subjt:  RKVQTEAGEDK----DNQSDRHKFK------------------VERYFQIHKLSEIEKMTMVVISFEGVALNWYRAKEEREPFKDWKDLKLRQLKRFRKK

Query:  KTGT
        K GT
Subjt:  KTGT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G67020.1 unknown protein6.5e-0638.18Show/hide
Query:  KVERYFQIHKLSEIEKMTMVVISFEGVALNWYRAKEEREPFKDWKDLKLRQLKRF
        KVER+F++ +  + +K+ +V +S EGVAL W+  +     F+DW   + R L RF
Subjt:  KVERYFQIHKLSEIEKMTMVVISFEGVALNWYRAKEEREPFKDWKDLKLRQLKRF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTCAGAAGAAGATAGAGGAACGGCTAGAGGCCAATGAGCGAGAGATGTCGGAAATGAAGACTAGCATTGACGAGGGAATGAAGGACGTGAGGAAGGCTCTCGCAAC
CTTAGCGGCTGAAATAGCAACCCTCCGGCGCTCGGAACCTTCGGTGATCACTGAGGGGTCAGCGCACAAGTCGAAGGAAACTATCGACGCGGGCGAAAACTCAGGAACGA
ACAGTGAGGAAGATAGAAAGGTCCAAACCGAAGCAGGGGAGGACAAGGACAACCAGAGCGACCGCCACAAGTTCAAGGTGGAGAGGTATTTTCAAATCCACAAGTTATCT
GAAATTGAAAAAATGACTATGGTTGTGATTAGTTTCGAAGGGGTAGCCCTCAACTGGTATAGAGCGAAAGAAGAGAGAGAGCCCTTTAAGGATTGGAAGGATTTGAAGCT
GCGGCAACTGAAACGGTTCCGGAAGAAGAAAACAGGGACAGGTCTGCGGCCGTTTCTTGGCAGTGAAGCAAACCACGACAGTGCAGAAGTATTGCGAAGAGTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGACTCAGAAGAAGATAGAGGAACGGCTAGAGGCCAATGAGCGAGAGATGTCGGAAATGAAGACTAGCATTGACGAGGGAATGAAGGACGTGAGGAAGGCTCTCGCAAC
CTTAGCGGCTGAAATAGCAACCCTCCGGCGCTCGGAACCTTCGGTGATCACTGAGGGGTCAGCGCACAAGTCGAAGGAAACTATCGACGCGGGCGAAAACTCAGGAACGA
ACAGTGAGGAAGATAGAAAGGTCCAAACCGAAGCAGGGGAGGACAAGGACAACCAGAGCGACCGCCACAAGTTCAAGGTGGAGAGGTATTTTCAAATCCACAAGTTATCT
GAAATTGAAAAAATGACTATGGTTGTGATTAGTTTCGAAGGGGTAGCCCTCAACTGGTATAGAGCGAAAGAAGAGAGAGAGCCCTTTAAGGATTGGAAGGATTTGAAGCT
GCGGCAACTGAAACGGTTCCGGAAGAAGAAAACAGGGACAGGTCTGCGGCCGTTTCTTGGCAGTGAAGCAAACCACGACAGTGCAGAAGTATTGCGAAGAGTTTGA
Protein sequenceShow/hide protein sequence
MTQKKIEERLEANEREMSEMKTSIDEGMKDVRKALATLAAEIATLRRSEPSVITEGSAHKSKETIDAGENSGTNSEEDRKVQTEAGEDKDNQSDRHKFKVERYFQIHKLS
EIEKMTMVVISFEGVALNWYRAKEEREPFKDWKDLKLRQLKRFRKKKTGTGLRPFLGSEANHDSAEVLRRV