; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0000266 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0000266
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr4:2550987..2553601
RNA-Seq ExpressionLag0000266
SyntenyLag0000266
Gene Ontology termsGO:0005488 - binding (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022151688.1 uncharacterized protein LOC111019603 [Momordica charantia]6.2e-4551.48Show/hide
Query:  APMLITSEALQTMFDNIAQRNARPPWNPNWVLENAEESQFIRDFKRYGPPSLDGQFENPLAVERWIANLEALFDLMNCNDSLKIRGAVFMLKDDVRTWWQ
        A + +   ALQ + DN     A     P   L    E+QFIRDF+RYGPP+ +G+ E    VE WI  LEAL+  + C+D LK++GAVFML+ +   WW 
Subjt:  APMLITSEALQTMFDNIAQRNARPPWNPNWVLENAEESQFIRDFKRYGPPSLDGQFENPLAVERWIANLEALFDLMNCNDSLKIRGAVFMLKDDVRTWWQ

Query:  SVAAAEDHANRPISWENFKDLLYDCYFPETIKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVST
         VA  EDH N PI+W   KDLLYD YFP+TIKD+KE EFLHL Q ++ V QYE+KFT  SRFA DL+ T
Subjt:  SVAAAEDHANRPISWENFKDLLYDCYFPETIKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVST

XP_022155341.1 uncharacterized protein LOC111022474 [Momordica charantia]3.1e-4449.48Show/hide
Query:  IPPDQCR-VDPPPP--LPPSAPPVAPML-ITSEALQTMFDNIAQRNARPPWNPNWVLENAEESQFIRDFKRYGPPSLDGQFENPLAVERWIANLEALFDL
        +PP   R V PP P   P   P V P + + +EALQ + DN           P       EE QFIRDFKR+GPP  +G  E P A E W+  LEAL+  
Subjt:  IPPDQCR-VDPPPP--LPPSAPPVAPML-ITSEALQTMFDNIAQRNARPPWNPNWVLENAEESQFIRDFKRYGPPSLDGQFENPLAVERWIANLEALFDL

Query:  MNCNDSLKIRGAVFMLKDDVRTWWQSVAAAEDHANRPISWENFKDLLYDCYFPETIKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVST
        + C+D  K+RGA+FML+ +   WW+SVAAAEDHAN P++W  FKDLLY+ YFP T++++K AEFL L QGS++V QYERKFT LSRF    + T
Subjt:  MNCNDSLKIRGAVFMLKDDVRTWWQSVAAAEDHANRPISWENFKDLLYDCYFPETIKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVST

XP_022155925.1 uncharacterized protein LOC111022925 [Momordica charantia]4.0e-4450.57Show/hide
Query:  SAPPVAP--MLITSEALQTMFDNIAQRNARPPWNPNWVLENAEESQFIRDFKRYGPPSLDGQFENPLAVERWIANLEALFDLMNCNDSLKIRGAVFMLKD
        + PPV P  +++ +EALQ + DN           P+    + EE QFIRDFKR+GPP  +G  E P A E W+  LEAL+  + C+D  K+RGAVFML+ 
Subjt:  SAPPVAP--MLITSEALQTMFDNIAQRNARPPWNPNWVLENAEESQFIRDFKRYGPPSLDGQFENPLAVERWIANLEALFDLMNCNDSLKIRGAVFMLKD

Query:  DVRTWWQSVAAAEDHANRPISWENFKDLLYDCYFPETIKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVST
        +   WW+SVAAAEDHAN P++W  FKDLLY+ YFP T++++K AEFL L Q S+ V QYERKFT LSRF    + T
Subjt:  DVRTWWQSVAAAEDHANRPISWENFKDLLYDCYFPETIKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVST

XP_022156326.1 uncharacterized protein LOC111023247 [Momordica charantia]4.0e-4463.16Show/hide
Query:  ESQFIRDFKRYGPPSLDGQFENPLAVERWIANLEALFDLMNCNDSLKIRGAVFMLKDDVRTWWQSVAAAEDHANRPISWENFKDLLYDCYFPETIKDDKE
        E++FI+DFKRYGPP+ DG+ E   AVE WI  LEAL+  + C D  K++GAVFML+ +   WW SVAAAED+AN PI W  FK+LLYD Y+PET+KD KE
Subjt:  ESQFIRDFKRYGPPSLDGQFENPLAVERWIANLEALFDLMNCNDSLKIRGAVFMLKDDVRTWWQSVAAAEDHANRPISWENFKDLLYDCYFPETIKDDKE

Query:  AEFLHLAQGSMSVVQYERKFTALSRFAPDLVST
        AEFLHL QG++SV QYERKFT LSRFA +L+ T
Subjt:  AEFLHLAQGSMSVVQYERKFTALSRFAPDLVST

XP_022158637.1 uncharacterized protein LOC111025088 [Momordica charantia]1.6e-4552.02Show/hide
Query:  PPVAPML-ITSEALQTMFDNIAQRNARPPWNPNWVLENAEESQFIRDFKRYGPPSLDGQFENPLAVERWIANLEALFDLMNCNDSLKIRGAVFMLKDDVR
        P V P + + +EALQ + +N A         P        E+QFI+DFKRYGPP+ DG  E   A E W+  LEAL+  + C D  K++G VFML+ +  
Subjt:  PPVAPML-ITSEALQTMFDNIAQRNARPPWNPNWVLENAEESQFIRDFKRYGPPSLDGQFENPLAVERWIANLEALFDLMNCNDSLKIRGAVFMLKDDVR

Query:  TWWQSVAAAEDHANRPISWENFKDLLYDCYFPETIKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVST
         WW S+A AEDHAN P+ W  FKDLLYD Y+PET+KD KEAEFLHLAQG+++V QYERKFT LSRFA + + T
Subjt:  TWWQSVAAAEDHANRPISWENFKDLLYDCYFPETIKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVST

TrEMBL top hitse value%identityAlignment
A0A6J1DCW8 uncharacterized protein LOC1110196033.0e-4551.48Show/hide
Query:  APMLITSEALQTMFDNIAQRNARPPWNPNWVLENAEESQFIRDFKRYGPPSLDGQFENPLAVERWIANLEALFDLMNCNDSLKIRGAVFMLKDDVRTWWQ
        A + +   ALQ + DN     A     P   L    E+QFIRDF+RYGPP+ +G+ E    VE WI  LEAL+  + C+D LK++GAVFML+ +   WW 
Subjt:  APMLITSEALQTMFDNIAQRNARPPWNPNWVLENAEESQFIRDFKRYGPPSLDGQFENPLAVERWIANLEALFDLMNCNDSLKIRGAVFMLKDDVRTWWQ

Query:  SVAAAEDHANRPISWENFKDLLYDCYFPETIKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVST
         VA  EDH N PI+W   KDLLYD YFP+TIKD+KE EFLHL Q ++ V QYE+KFT  SRFA DL+ T
Subjt:  SVAAAEDHANRPISWENFKDLLYDCYFPETIKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVST

A0A6J1DNV8 uncharacterized protein LOC1110229251.9e-4450.57Show/hide
Query:  SAPPVAP--MLITSEALQTMFDNIAQRNARPPWNPNWVLENAEESQFIRDFKRYGPPSLDGQFENPLAVERWIANLEALFDLMNCNDSLKIRGAVFMLKD
        + PPV P  +++ +EALQ + DN           P+    + EE QFIRDFKR+GPP  +G  E P A E W+  LEAL+  + C+D  K+RGAVFML+ 
Subjt:  SAPPVAP--MLITSEALQTMFDNIAQRNARPPWNPNWVLENAEESQFIRDFKRYGPPSLDGQFENPLAVERWIANLEALFDLMNCNDSLKIRGAVFMLKD

Query:  DVRTWWQSVAAAEDHANRPISWENFKDLLYDCYFPETIKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVST
        +   WW+SVAAAEDHAN P++W  FKDLLY+ YFP T++++K AEFL L Q S+ V QYERKFT LSRF    + T
Subjt:  DVRTWWQSVAAAEDHANRPISWENFKDLLYDCYFPETIKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVST

A0A6J1DRF5 uncharacterized protein LOC1110224741.5e-4449.48Show/hide
Query:  IPPDQCR-VDPPPP--LPPSAPPVAPML-ITSEALQTMFDNIAQRNARPPWNPNWVLENAEESQFIRDFKRYGPPSLDGQFENPLAVERWIANLEALFDL
        +PP   R V PP P   P   P V P + + +EALQ + DN           P       EE QFIRDFKR+GPP  +G  E P A E W+  LEAL+  
Subjt:  IPPDQCR-VDPPPP--LPPSAPPVAPML-ITSEALQTMFDNIAQRNARPPWNPNWVLENAEESQFIRDFKRYGPPSLDGQFENPLAVERWIANLEALFDL

Query:  MNCNDSLKIRGAVFMLKDDVRTWWQSVAAAEDHANRPISWENFKDLLYDCYFPETIKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVST
        + C+D  K+RGA+FML+ +   WW+SVAAAEDHAN P++W  FKDLLY+ YFP T++++K AEFL L QGS++V QYERKFT LSRF    + T
Subjt:  MNCNDSLKIRGAVFMLKDDVRTWWQSVAAAEDHANRPISWENFKDLLYDCYFPETIKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVST

A0A6J1DUM2 uncharacterized protein LOC1110232471.9e-4463.16Show/hide
Query:  ESQFIRDFKRYGPPSLDGQFENPLAVERWIANLEALFDLMNCNDSLKIRGAVFMLKDDVRTWWQSVAAAEDHANRPISWENFKDLLYDCYFPETIKDDKE
        E++FI+DFKRYGPP+ DG+ E   AVE WI  LEAL+  + C D  K++GAVFML+ +   WW SVAAAED+AN PI W  FK+LLYD Y+PET+KD KE
Subjt:  ESQFIRDFKRYGPPSLDGQFENPLAVERWIANLEALFDLMNCNDSLKIRGAVFMLKDDVRTWWQSVAAAEDHANRPISWENFKDLLYDCYFPETIKDDKE

Query:  AEFLHLAQGSMSVVQYERKFTALSRFAPDLVST
        AEFLHL QG++SV QYERKFT LSRFA +L+ T
Subjt:  AEFLHLAQGSMSVVQYERKFTALSRFAPDLVST

A0A6J1DXQ7 uncharacterized protein LOC1110250887.9e-4652.02Show/hide
Query:  PPVAPML-ITSEALQTMFDNIAQRNARPPWNPNWVLENAEESQFIRDFKRYGPPSLDGQFENPLAVERWIANLEALFDLMNCNDSLKIRGAVFMLKDDVR
        P V P + + +EALQ + +N A         P        E+QFI+DFKRYGPP+ DG  E   A E W+  LEAL+  + C D  K++G VFML+ +  
Subjt:  PPVAPML-ITSEALQTMFDNIAQRNARPPWNPNWVLENAEESQFIRDFKRYGPPSLDGQFENPLAVERWIANLEALFDLMNCNDSLKIRGAVFMLKDDVR

Query:  TWWQSVAAAEDHANRPISWENFKDLLYDCYFPETIKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVST
         WW S+A AEDHAN P+ W  FKDLLYD Y+PET+KD KEAEFLHLAQG+++V QYERKFT LSRFA + + T
Subjt:  TWWQSVAAAEDHANRPISWENFKDLLYDCYFPETIKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVST

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCAAAGGTCGAATGCCGAGCTCTGTAGAGAAGTGTCGAGGCCCTGGTATAAATGGTCAGAGGTCGGTGCAACTCGAAGGGTCAGTCTTGGAGAGTTTTGCAGAGCG
TGACGTGGAAATCACTTGTGGAGAGCCATATTTTGTAGGATATGAGGACTACGTGGACCATGGTGATGCAGAGGAGAGTAGTAATGTCGCTAGGATAGCTATTAAAATCT
TGGGGCTTTACAGTTGGTATCAAAGCGGAGTTGTTCCTATAGACTGGCCTAGGAAATCTAGGTTGTTTGGATGTTTAGGGTTATGGTCTTCCTCGTCTTCTCTCCATCAC
CAGGCAATGTCCTGTGGTCATAATCCTGAAGTTCCAAATGTCAGGCAAGATGACCAAGTAGAGGAAGTTACTACTCAGCAGGGGATCGATCCTCTGGCTCCCTCTCTGCA
GGAGGTTAATCCCCTAATTCCTCCCGATCAATGCAGGGTTGATCCTCCTCCTCCTCTGCCTCCTTCGGCCCCTCCTGTAGCTCCTATGTTGATCACTTCGGAAGCCCTCC
AGACCATGTTCGATAACATAGCCCAGAGAAATGCTAGGCCACCGTGGAACCCTAATTGGGTACTTGAGAATGCAGAGGAATCCCAGTTCATTAGGGACTTCAAGCGCTAC
GGGCCTCCCTCCCTTGATGGGCAATTCGAAAATCCGTTGGCAGTAGAACGATGGATCGCTAATTTGGAGGCACTGTTTGACCTCATGAACTGTAATGATTCCTTGAAAAT
CAGAGGAGCAGTCTTCATGCTCAAGGATGACGTTCGCACGTGGTGGCAATCGGTGGCAGCAGCCGAAGACCATGCTAATCGGCCGATCTCGTGGGAAAATTTCAAGGATC
TGTTGTACGATTGTTACTTCCCAGAGACAATCAAGGACGACAAAGAAGCAGAATTCCTTCATTTGGCCCAGGGGAGTATGTCTGTAGTGCAGTATGAGAGGAAGTTCACT
GCACTATCACGCTTTGCTCCTGACCTGGTCAGCACGCCTGTGACGCCCGAGGCCGCAGAGGATGTAGGGGAATGCCGGGGCCGTGAGGTGCCGAATCCGGATTCCGAATC
TTGGGCTTGGGGCGTTACAGGATAG
mRNA sequenceShow/hide mRNA sequence
ATGGTCAAAGGTCGAATGCCGAGCTCTGTAGAGAAGTGTCGAGGCCCTGGTATAAATGGTCAGAGGTCGGTGCAACTCGAAGGGTCAGTCTTGGAGAGTTTTGCAGAGCG
TGACGTGGAAATCACTTGTGGAGAGCCATATTTTGTAGGATATGAGGACTACGTGGACCATGGTGATGCAGAGGAGAGTAGTAATGTCGCTAGGATAGCTATTAAAATCT
TGGGGCTTTACAGTTGGTATCAAAGCGGAGTTGTTCCTATAGACTGGCCTAGGAAATCTAGGTTGTTTGGATGTTTAGGGTTATGGTCTTCCTCGTCTTCTCTCCATCAC
CAGGCAATGTCCTGTGGTCATAATCCTGAAGTTCCAAATGTCAGGCAAGATGACCAAGTAGAGGAAGTTACTACTCAGCAGGGGATCGATCCTCTGGCTCCCTCTCTGCA
GGAGGTTAATCCCCTAATTCCTCCCGATCAATGCAGGGTTGATCCTCCTCCTCCTCTGCCTCCTTCGGCCCCTCCTGTAGCTCCTATGTTGATCACTTCGGAAGCCCTCC
AGACCATGTTCGATAACATAGCCCAGAGAAATGCTAGGCCACCGTGGAACCCTAATTGGGTACTTGAGAATGCAGAGGAATCCCAGTTCATTAGGGACTTCAAGCGCTAC
GGGCCTCCCTCCCTTGATGGGCAATTCGAAAATCCGTTGGCAGTAGAACGATGGATCGCTAATTTGGAGGCACTGTTTGACCTCATGAACTGTAATGATTCCTTGAAAAT
CAGAGGAGCAGTCTTCATGCTCAAGGATGACGTTCGCACGTGGTGGCAATCGGTGGCAGCAGCCGAAGACCATGCTAATCGGCCGATCTCGTGGGAAAATTTCAAGGATC
TGTTGTACGATTGTTACTTCCCAGAGACAATCAAGGACGACAAAGAAGCAGAATTCCTTCATTTGGCCCAGGGGAGTATGTCTGTAGTGCAGTATGAGAGGAAGTTCACT
GCACTATCACGCTTTGCTCCTGACCTGGTCAGCACGCCTGTGACGCCCGAGGCCGCAGAGGATGTAGGGGAATGCCGGGGCCGTGAGGTGCCGAATCCGGATTCCGAATC
TTGGGCTTGGGGCGTTACAGGATAG
Protein sequenceShow/hide protein sequence
MVKGRMPSSVEKCRGPGINGQRSVQLEGSVLESFAERDVEITCGEPYFVGYEDYVDHGDAEESSNVARIAIKILGLYSWYQSGVVPIDWPRKSRLFGCLGLWSSSSSLHH
QAMSCGHNPEVPNVRQDDQVEEVTTQQGIDPLAPSLQEVNPLIPPDQCRVDPPPPLPPSAPPVAPMLITSEALQTMFDNIAQRNARPPWNPNWVLENAEESQFIRDFKRY
GPPSLDGQFENPLAVERWIANLEALFDLMNCNDSLKIRGAVFMLKDDVRTWWQSVAAAEDHANRPISWENFKDLLYDCYFPETIKDDKEAEFLHLAQGSMSVVQYERKFT
ALSRFAPDLVSTPVTPEAAEDVGECRGREVPNPDSESWAWGVTG