; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0038500 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0038500
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr2:18771399..18771896
RNA-Seq ExpressionLag0038500
SyntenyLag0038500
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF7121272.1 hypothetical protein RHSIM_Rhsim13G0210000 [Rhododendron simsii]8.2e-2637.89Show/hide
Query:  CMQAINDAWQGRIEGCPAFKLMEKIKSTRMAMLRWRKRTDVNTDHQIKRLDGSIRVEQDRSQPDFDAIRAYEIQLKSAIVEEESFWRAKSRIQWLKEGDR
        C   +   W+ R  G   F++ +KI+  R  + RWR   ++N+  ++  L   +  E +++  + +A +  E  ++ A  EEE +WRAKSR+ WLK GDR
Subjt:  CMQAINDAWQGRIEGCPAFKLMEKIKSTRMAMLRWRKRTDVNTDHQIKRLDGSIRVEQDRSQPDFDAIRAYEIQLKSAIVEEESFWRAKSRIQWLKEGDR

Query:  NTRFFHAKVSERRRANSLSEIEDEHDIWREEDESVTQVGLLYFQMLFKTSSPLLIESLIAG
        NT FFHAK  +RR  N +  +ED++ IWRE+D  V ++   YFQ +F +S+P L+E +IAG
Subjt:  NTRFFHAKVSERRRANSLSEIEDEHDIWREEDESVTQVGLLYFQMLFKTSSPLLIESLIAG

KAF7133372.1 hypothetical protein RHSIM_Rhsim09G0106200 [Rhododendron simsii]8.2e-2637.89Show/hide
Query:  CMQAINDAWQGRIEGCPAFKLMEKIKSTRMAMLRWRKRTDVNTDHQIKRLDGSIRVEQDRSQPDFDAIRAYEIQLKSAIVEEESFWRAKSRIQWLKEGDR
        C   +   W+ R  G   F++ +KI+  R  + RWR   ++N+  ++  L   +  E +++  + +A +  E  ++ A  EEE +WRAKSR+ WLK GDR
Subjt:  CMQAINDAWQGRIEGCPAFKLMEKIKSTRMAMLRWRKRTDVNTDHQIKRLDGSIRVEQDRSQPDFDAIRAYEIQLKSAIVEEESFWRAKSRIQWLKEGDR

Query:  NTRFFHAKVSERRRANSLSEIEDEHDIWREEDESVTQVGLLYFQMLFKTSSPLLIESLIAG
        NT FFHAK  +RR  N +  +ED++ IWRE+D  V ++   YFQ +F +S+P L+E +IAG
Subjt:  NTRFFHAKVSERRRANSLSEIEDEHDIWREEDESVTQVGLLYFQMLFKTSSPLLIESLIAG

KAF7152590.1 hypothetical protein RHSIM_Rhsim01G0112200 [Rhododendron simsii]4.2e-2234.59Show/hide
Query:  CMQAINDAWQGRIEGCPAFKLMEKIKSTRMAMLRWRKRTDVNTDHQIKRLDGSIRVEQDRSQPDFDAIRAYEIQLKSAIVEEESFWRAKSRIQWLKEGDR
        C   +   W     G   F++ +KIK  R  + +WR    +N++ ++K L   ++VE +R   + +  R  E  +K A +EEE +WRAKS++ WL+ GD+
Subjt:  CMQAINDAWQGRIEGCPAFKLMEKIKSTRMAMLRWRKRTDVNTDHQIKRLDGSIRVEQDRSQPDFDAIRAYEIQLKSAIVEEESFWRAKSRIQWLKEGDR

Query:  NTRFFHAKVSERRRANSLSEIEDEHDIWREEDESVTQVGLLYFQMLFKTSSPLLIESLI
        NT FFHAK  +RR  N +  +ED   IW+++   V  +   YFQ +F +S+P  I+ +I
Subjt:  NTRFFHAKVSERRRANSLSEIEDEHDIWREEDESVTQVGLLYFQMLFKTSSPLLIESLI

XP_028954452.1 uncharacterized protein LOC114823201, partial [Malus domestica]9.4e-2237.76Show/hide
Query:  GRIEGCPAFKLMEKIKSTRMAMLRWRKRTDVNTDHQIKRLDGSIRVEQDRSQPDFDAIRAYEIQLKSAIVEEESFWRAKSRIQWLKEGDRNTRFFHAKVS
        G   G  AF+  EKIKS R  +  W +    N+   I+ L   IR   + +    + ++ +E +L+ A   EES+W+AKSRIQWLKEGD+NT+FFHA+  
Subjt:  GRIEGCPAFKLMEKIKSTRMAMLRWRKRTDVNTDHQIKRLDGSIRVEQDRSQPDFDAIRAYEIQLKSAIVEEESFWRAKSRIQWLKEGDRNTRFFHAKVS

Query:  ERRRANSLSEIEDEHDIWREEDESVTQVGLLYFQMLFKTSSPL
        +R+++N +  +ED++ +WRE++  +  + + YF  LF  S P+
Subjt:  ERRRANSLSEIEDEHDIWREEDESVTQVGLLYFQMLFKTSSPL

XP_028962235.1 uncharacterized protein LOC114826307 [Malus domestica]3.8e-2335.85Show/hide
Query:  CMQAINDAWQGRIEGCPAFKLMEKIKSTRMAMLRWRKRTDVNTDHQIKRLDGSIRVEQDRSQPDFDAIRAYEIQLKSAIVEEESFWRAKSRIQWLKEGDR
        C + +   W  +  G  A++  EKIK+ R ++  W K T  N+  +++ L   IRV         +A++  E  L++A  +EE +W+ KSR QWL EGD+
Subjt:  CMQAINDAWQGRIEGCPAFKLMEKIKSTRMAMLRWRKRTDVNTDHQIKRLDGSIRVEQDRSQPDFDAIRAYEIQLKSAIVEEESFWRAKSRIQWLKEGDR

Query:  NTRFFHAKVSERRRANSLSEIEDEHDIWREEDESVTQVGLLYFQMLFKTSSPLLIESLI
        NT+FFHA+  +RRR N +  IED   +W EED  +    + YF  LF++S P  I+ ++
Subjt:  NTRFFHAKVSERRRANSLSEIEDEHDIWREEDESVTQVGLLYFQMLFKTSSPLLIESLI

TrEMBL top hitse value%identityAlignment
A0A2N9FMJ0 Reverse transcriptase domain-containing protein2.5e-2033.95Show/hide
Query:  CMQAINDAWQGRIEGCPAFKLMEKIKSTRMAMLRWRKRTDVNTDHQI---KRLDGSIRVEQDRSQPDFDAIRAYEIQLKSAIVEEESFWRAKSRIQWLKE
        C   I D+W+ R++G   FK+  K+   +  +  W +R+  N   Q+   KRL  S  +E  RS  +  A++  ++++ S + +EE  WR +SR  WL+E
Subjt:  CMQAINDAWQGRIEGCPAFKLMEKIKSTRMAMLRWRKRTDVNTDHQI---KRLDGSIRVEQDRSQPDFDAIRAYEIQLKSAIVEEESFWRAKSRIQWLKE

Query:  GDRNTRFFHAKVSERRRANSLSEIEDEHDIWREEDESVTQVGLLYFQMLFKTSSPLLIESLI
        GD+NTRFFH + S+RRR N +  + DE   WR+  E +  +   Y++ +F+TS P  I++ +
Subjt:  GDRNTRFFHAKVSERRRANSLSEIEDEHDIWREEDESVTQVGLLYFQMLFKTSSPLLIESLI

A0A2N9H680 Reverse transcriptase domain-containing protein1.9e-2033.73Show/hide
Query:  CMQAINDAWQGRIEGCPAFKLMEKIKSTRMAMLRWRK-------RTDVNTDHQIKRLDGSIRVEQDRSQPDFDAIRAYEIQLKSAIVEEESFWRAKSRIQ
        C   I  AW   + G P F++ +KIK+ RM +L+W +       R      +++ +L+ S   E   S+     +     ++   I +EE FWR +SR+ 
Subjt:  CMQAINDAWQGRIEGCPAFKLMEKIKSTRMAMLRWRK-------RTDVNTDHQIKRLDGSIRVEQDRSQPDFDAIRAYEIQLKSAIVEEESFWRAKSRIQ

Query:  WLKEGDRNTRFFHAKVSERRRANSLSEIEDEHDIWREEDESVTQVGLLYFQMLFKTSSPLLIESLI
        WLKEGDRNT+FFHA  S+R++AN ++ + D+H +W+ E  +++ + + YFQ LF +S P  I  ++
Subjt:  WLKEGDRNTRFFHAKVSERRRANSLSEIEDEHDIWREEDESVTQVGLLYFQMLFKTSSPLLIESLI

A0A5E4GIE6 PREDICTED: reverse mRNAase7.3e-2032.08Show/hide
Query:  CMQAINDAWQGRIEGCPAFKLMEKIKSTRMAMLRWRKRTDVNTDHQIKRLDGSIRVEQDRSQPDFDAIRAYEIQLKSAIVEEESFWRAKSRIQWLKEGDR
        C  A+ D+W+ +  G   ++L+EK+K+ R  M + R+    NT  +I+ ++  +++       D   +   +  LK  I EEE +WR KS +QWL EGD+
Subjt:  CMQAINDAWQGRIEGCPAFKLMEKIKSTRMAMLRWRKRTDVNTDHQIKRLDGSIRVEQDRSQPDFDAIRAYEIQLKSAIVEEESFWRAKSRIQWLKEGDR

Query:  NTRFFHAKVSERRRANSLSEIEDEHDIWREEDESVTQVGLLYFQMLFKTSSPLLIESLI
        NT+FFHA+   RRR N +  +E+E   W ++++ +  + + YF  LF +++ L  + +I
Subjt:  NTRFFHAKVSERRRANSLSEIEDEHDIWREEDESVTQVGLLYFQMLFKTSSPLLIESLI

A0A6A4LPS9 Uncharacterized protein (Fragment)1.5e-2037.06Show/hide
Query:  PAFKLMEKIKSTRMAMLRWRKRTDVNTDHQIKRLDGSIRVEQDRSQPDFDAIRAYEIQLKSAIVEEESFWRAKSRIQWLKEGDRNTRFFHAKVSERRRAN
        P F++ +K+K  R  + +WR    +N+  ++K L   + VE +R   + +  R  E  +K A +EEE +WRAKS++ WLK GD+NT FFHAK  +RR  N
Subjt:  PAFKLMEKIKSTRMAMLRWRKRTDVNTDHQIKRLDGSIRVEQDRSQPDFDAIRAYEIQLKSAIVEEESFWRAKSRIQWLKEGDRNTRFFHAKVSERRRAN

Query:  SLSEIEDEHDIWREEDESVTQVGLLYFQMLFKTSSPLLIESLI
         +  +ED   IW+++   V  +   YFQ +F +S+P  I+ +I
Subjt:  SLSEIEDEHDIWREEDESVTQVGLLYFQMLFKTSSPLLIESLI

A0A6P6V9A8 uncharacterized protein LOC1137179071.5e-2036.25Show/hide
Query:  QAINDAWQGRIEGCPAFKLMEKIKSTRMAMLRWRKRTDVNTDHQIKRLDGSI-RVEQDRSQPDFDAIRAYEIQLKSAIVEEESFWRAKSRIQWLKEGDRN
        + I+  W+ +  G    KL +KIKS R+A+L W ++ D N   +I  L   +  V+ DR       +R  + +L+ A   EE FW  K+RI+WLKEGD+N
Subjt:  QAINDAWQGRIEGCPAFKLMEKIKSTRMAMLRWRKRTDVNTDHQIKRLDGSI-RVEQDRSQPDFDAIRAYEIQLKSAIVEEESFWRAKSRIQWLKEGDRN

Query:  TRFFHAKVSERRRANSLSEIEDEHDIWREEDESVTQVGLLYFQMLFKTSSPLLIESLIAG
        T++FHA V+ERRR N++S ++     W E ++ +      YFQ L  +S+P  ++S++ G
Subjt:  TRFFHAKVSERRRANSLSEIEDEHDIWREEDESVTQVGLLYFQMLFKTSSPLLIESLIAG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein8.3e-0827.45Show/hide
Query:  AWQGRIE-GCPAFKLMEKIKSTRMAMLRWRKRTDVNTDHQIKRLDGSIRVEQDR--SQPDFDAIRAYEIQLKS---AIVEEESFWRAKSRIQWLKEGDRN
        AW+ +I  G   F L E +K+ +       ++   N  H+ K    S+   Q +  + P     R   +  K         ESF+R KSRI+WL++GD N
Subjt:  AWQGRIE-GCPAFKLMEKIKSTRMAMLRWRKRTDVNTDHQIKRLDGSIRVEQDR--SQPDFDAIRAYEIQLKS---AIVEEESFWRAKSRIQWLKEGDRN

Query:  TRFFHAKVSERRRANSLSEIEDEHDIWREEDESVTQVGLLYFQMLFKTSSPLL
        TRFFH  +   +  N +  +  + D+  E    V ++ + Y+  L  + S +L
Subjt:  TRFFHAKVSERRRANSLSEIEDEHDIWREEDESVTQVGLLYFQMLFKTSSPLL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGCATGCAAGCAATTAATGATGCATGGCAAGGAAGAATAGAGGGCTGCCCTGCTTTCAAACTCATGGAAAAAATCAAGAGTACTCGTATGGCCATGCTGCGA
TGGCGGAAAAGAACAGATGTGAATACTGATCACCAGATAAAAAGACTAGATGGAAGCATAAGGGTGGAACAGGATCGATCCCAGCCAGATTTTGATGCCATAAGG
GCTTACGAAATTCAACTGAAAAGTGCGATTGTTGAAGAAGAATCTTTCTGGAGAGCGAAGTCCCGTATACAATGGCTTAAGGAGGGAGACAGAAATACGAGATTC
TTCCACGCCAAAGTATCTGAGAGGAGACGAGCCAATAGCCTGTCCGAAATTGAGGACGAGCACGACATCTGGCGAGAAGAGGATGAGAGTGTTACCCAAGTTGGG
TTGTTATACTTCCAGATGCTCTTTAAAACAAGCAGTCCACTTTTGATAGAGAGTTTGATAGCTGGAGCATTCGCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGTGCATGCAAGCAATTAATGATGCATGGCAAGGAAGAATAGAGGGCTGCCCTGCTTTCAAACTCATGGAAAAAATCAAGAGTACTCGTATGGCCATGCTGCGA
TGGCGGAAAAGAACAGATGTGAATACTGATCACCAGATAAAAAGACTAGATGGAAGCATAAGGGTGGAACAGGATCGATCCCAGCCAGATTTTGATGCCATAAGG
GCTTACGAAATTCAACTGAAAAGTGCGATTGTTGAAGAAGAATCTTTCTGGAGAGCGAAGTCCCGTATACAATGGCTTAAGGAGGGAGACAGAAATACGAGATTC
TTCCACGCCAAAGTATCTGAGAGGAGACGAGCCAATAGCCTGTCCGAAATTGAGGACGAGCACGACATCTGGCGAGAAGAGGATGAGAGTGTTACCCAAGTTGGG
TTGTTATACTTCCAGATGCTCTTTAAAACAAGCAGTCCACTTTTGATAGAGAGTTTGATAGCTGGAGCATTCGCCTAA
Protein sequenceShow/hide protein sequence
MCMQAINDAWQGRIEGCPAFKLMEKIKSTRMAMLRWRKRTDVNTDHQIKRLDGSIRVEQDRSQPDFDAIRAYEIQLKSAIVEEESFWRAKSRIQWLKEGDRNTRF
FHAKVSERRRANSLSEIEDEHDIWREEDESVTQVGLLYFQMLFKTSSPLLIESLIAGAFA