CuGenDBv2

Lag0022109 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0022109
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrotrans_gag domain-containing protein
Genome location: chr7:18547239..18548024
RNA-Seq Expression: Lag0022109
Synteny: Lag0022109
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
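The genome location field can be turned into a span length with a few lines of Python. This is a minimal sketch, not part of the database record: the function names `parse_location` and `span_length` are hypothetical, and the code assumes the coordinates are 1-based and inclusive at both ends, which the record itself does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start, end):
    """Length of the interval in base pairs.

    Assumption: coordinates are 1-based and inclusive at both ends.
    """
    return end - start + 1


chrom, start, end = parse_location("chr7:18547239..18548024")
print(chrom, start, end, span_length(start, end))
# prints: chr7 18547239 18548024 786
```

Under a 0-based, half-open convention the length would instead be `end - start`, so the convention should be confirmed before relying on the number.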


Homology
GenBank top hits (e value, % identity, alignment)
GFY85663.1 hypothetical protein Acr_04g0004010 [Actinidia rufa] (e value: 7.7e-09; % identity: 41)
Query:  MEELIDQVDPPFTEEVMKAEVPQKFK------------------------MDFQDVSDAIRCRAFFFTLTGSARHWFERLKRRSISCFKDLARTFVAQFM
        ME LI Q+DPPF E VMK  VP +FK                        M  Q   D + C+AF  TL G+ R WF++L  R+I  F DL+R FVA FM
Subjt:  MEELIDQVDPPFTEEVMKAEVPQKFK------------------------MDFQDVSDAIRCRAFFFTLTGSARHWFERLKRRSISCFKDLARTFVAQFM

XP_022156542.1 uncharacterized protein LOC111023421 [Momordica charantia] (e value: 7.7e-09; % identity: 44.09)
Query:  DQVDPPFTEEVMKAEVPQKFK-----------------------MDFQDVSDAIRCRAFFFTLTGSARHWFERLKRRSISCFKDLARTFVAQF
        D  + PFT +V++A +P KFK                       MDFQ  SDAI+CRAF   LTGSAR W+ RL  RSIS +  L R F+AQF
Subjt:  DQVDPPFTEEVMKAEVPQKFK-----------------------MDFQDVSDAIRCRAFFFTLTGSARHWFERLKRRSISCFKDLARTFVAQF

XP_022158303.1 uncharacterized protein LOC111024817 [Momordica charantia] (e value: 4.5e-09; % identity: 47.37)
Query:  DQVDPPFTEEVMKAEVPQKFK------MDFQDVSDAIRCRAFFFTLTGSARHWFERLKRRSISCFKDLARTFVAQF
        D  + PFT ++M+A +P KFK       DFQ  +DAI+CRAF    TGSAR W+ RL  RSIS +  L + F++QF
Subjt:  DQVDPPFTEEVMKAEVPQKFK------MDFQDVSDAIRCRAFFFTLTGSARHWFERLKRRSISCFKDLARTFVAQF

XP_022158830.1 uncharacterized protein LOC111025293 [Momordica charantia] (e value: 6.5e-16; % identity: 37.05)
Query:  LVRDPKKGNEPIEYVDESETESKGKKTNSATSKVRGLKHAERMVLRSPEPSTSRRTDLRNVIEKKRRLPKTVESEARAAEAEPKVAKAEAKKDHLPWKTE
        LVRDPKKG  P     ES+TE   + TNS  SK+R +    R   R  +P  +++       + K   P   +S+     +EP ++  + K    P  +E
Subjt:  LVRDPKKGNEPIEYVDESETESKGKKTNSATSKVRGLKHAERMVLRSPEPSTSRRTDLRNVIEKKRRLPKTVESEARAAEAEPKVAKAEAKKDHLPWKTE

Query:  LLNTLKELGNPQGDLQKLKDSGGQDMEELIDQVDPPFTEEVMKAEVPQKFK-----------------------MDFQDVSDAIRCRAFFFTLTGSARHW
          ++ KE               G D+EEL+DQ D PFTEE+M+ +VP KFK                       MD   VS+A+RCR F  TL GSAR W
Subjt:  LLNTLKELGNPQGDLQKLKDSGGQDMEELIDQVDPPFTEEVMKAEVPQKFK-----------------------MDFQDVSDAIRCRAFFFTLTGSARHW

Query:  FERLKRRSISCFKDLARTFVAQFM
        F +LKR SIS FK LAR FV QF+
Subjt:  FERLKRRSISCFKDLARTFVAQFM

XP_022159109.1 uncharacterized protein LOC111025548 [Momordica charantia] (e value: 1.8e-13; % identity: 38.65)
Query:  KRRLPKTVESEARAAEAEPKVAKAEAKKDHLPWKTE--LLNTLKELGNPQ-GDLQKLKDSGGQDMEELIDQVDPPFTEEVMKAEVPQKFK----------
        +RR  +  +S+ +  +    +A   +K DH    +E   LN  K +  P+  + +  +   G D+EEL+ Q D PFTEE+M+ +VP KFK          
Subjt:  KRRLPKTVESEARAAEAEPKVAKAEAKKDHLPWKTE--LLNTLKELGNPQ-GDLQKLKDSGGQDMEELIDQVDPPFTEEVMKAEVPQKFK----------

Query:  -------------MDFQDVSDAIRCRAFFFTLTGSARHWFERLKRRSISCFKDLARTFVAQFM
                     MD   VSDAIRCR F  TL GSAR WF +LKR SIS FK LAR F+ QF+
Subjt:  -------------MDFQDVSDAIRCRAFFFTLTGSARHWFERLKRRSISCFKDLARTFVAQFM

TrEMBL top hits (e value, % identity, alignment)
A0A6A1UK02 Uncharacterized protein (e value: 1.9e-08; % identity: 33.98)
Query:  LIDQVDPPFTEEVMKAEVPQKFK-----------------------MDFQDVSDAIRCRAFFFTLTGSARHWFERLKRRSISCFKDLARTFVAQFMVPEN
        ++ QV+ PFT  +++  +P KF+                       MD  D  +AI+C+AF  T+ GSAR WF RL+R  IS FK+L+R F++ F+  ++
Subjt:  LIDQVDPPFTEEVMKAEVPQKFK-----------------------MDFQDVSDAIRCRAFFFTLTGSARHWFERLKRRSISCFKDLARTFVAQFMVPEN

Query:  SES
          S
Subjt:  SES

A0A6J1DWY0 uncharacterized protein LOC111025293 (e value: 3.2e-16; % identity: 37.05)
Query:  LVRDPKKGNEPIEYVDESETESKGKKTNSATSKVRGLKHAERMVLRSPEPSTSRRTDLRNVIEKKRRLPKTVESEARAAEAEPKVAKAEAKKDHLPWKTE
        LVRDPKKG  P     ES+TE   + TNS  SK+R +    R   R  +P  +++       + K   P   +S+     +EP ++  + K    P  +E
Subjt:  LVRDPKKGNEPIEYVDESETESKGKKTNSATSKVRGLKHAERMVLRSPEPSTSRRTDLRNVIEKKRRLPKTVESEARAAEAEPKVAKAEAKKDHLPWKTE

Query:  LLNTLKELGNPQGDLQKLKDSGGQDMEELIDQVDPPFTEEVMKAEVPQKFK-----------------------MDFQDVSDAIRCRAFFFTLTGSARHW
          ++ KE               G D+EEL+DQ D PFTEE+M+ +VP KFK                       MD   VS+A+RCR F  TL GSAR W
Subjt:  LLNTLKELGNPQGDLQKLKDSGGQDMEELIDQVDPPFTEEVMKAEVPQKFK-----------------------MDFQDVSDAIRCRAFFFTLTGSARHW

Query:  FERLKRRSISCFKDLARTFVAQFM
        F +LKR SIS FK LAR FV QF+
Subjt:  FERLKRRSISCFKDLARTFVAQFM

A0A6J1E1E7 uncharacterized protein LOC111025548 (e value: 8.6e-14; % identity: 38.65)
Query:  KRRLPKTVESEARAAEAEPKVAKAEAKKDHLPWKTE--LLNTLKELGNPQ-GDLQKLKDSGGQDMEELIDQVDPPFTEEVMKAEVPQKFK----------
        +RR  +  +S+ +  +    +A   +K DH    +E   LN  K +  P+  + +  +   G D+EEL+ Q D PFTEE+M+ +VP KFK          
Subjt:  KRRLPKTVESEARAAEAEPKVAKAEAKKDHLPWKTE--LLNTLKELGNPQ-GDLQKLKDSGGQDMEELIDQVDPPFTEEVMKAEVPQKFK----------

Query:  -------------MDFQDVSDAIRCRAFFFTLTGSARHWFERLKRRSISCFKDLARTFVAQFM
                     MD   VSDAIRCR F  TL GSAR WF +LKR SIS FK LAR F+ QF+
Subjt:  -------------MDFQDVSDAIRCRAFFFTLTGSARHWFERLKRRSISCFKDLARTFVAQFM

A0A7J0DBQ5 Retrotrans_gag domain-containing protein (e value: 1.1e-08; % identity: 32.91)
Query:  PKTVESEARAAEAEPK--VAKAEAKKDHLPWKTELLNTLKELGNPQGDLQKLKDSGGQDMEELIDQVDPPFTEEVMKAEVPQKFK---------------
        P   E+++ A +++ K  VA     +    W  ++     +L    G +  +K+ G   +EELI++ D PFT  VM+  +P KFK               
Subjt:  PKTVESEARAAEAEPK--VAKAEAKKDHLPWKTELLNTLKELGNPQGDLQKLKDSGGQDMEELIDQVDPPFTEEVMKAEVPQKFK---------------

Query:  --------MDFQDVSDAIRCRAFFFTLTGSARHWFERLKRRSISCFKDLARTFVAQFM
                MD Q V D I CRAF  TL GS R WF RL   +IS F DL+R FV  ++
Subjt:  --------MDFQDVSDAIRCRAFFFTLTGSARHWFERLKRRSISCFKDLARTFVAQFM

A0A7J0GCV3 Ribonuclease P protein subunit P38-like protein (e value: 2.4e-08; % identity: 25.57)
Query:  MNRSLSRILQILDKPDPSTKLHEGGLVRDPKKGNEPIEYVDES--ETESKGKKTNSATSKVRGLKHAERMVLRSPEPSTSRRTDLRNVIEKKRRLPKTVE
        MN + +R++Q+L                +P+    P+  ++ S   + S+G+  N +  + R  K  E+        S+     L    E +R   +   
Subjt:  MNRSLSRILQILDKPDPSTKLHEGGLVRDPKKGNEPIEYVDES--ETESKGKKTNSATSKVRGLKHAERMVLRSPEPSTSRRTDLRNVIEKKRRLPKTVE

Query:  SEARAAEAEPKVAKAEAKKDHLPWKTELLNTLKELGNPQGDLQKLKDSGGQDMEELIDQVDPPFTEEVMKAEVPQKFK----------------------
          AR      K+   +A+ D +   T                     S    M+ LI Q++PPFTE +++A +  KFK                      
Subjt:  SEARAAEAEPKVAKAEAKKDHLPWKTELLNTLKELGNPQGDLQKLKDSGGQDMEELIDQVDPPFTEEVMKAEVPQKFK----------------------

Query:  --MDFQDVSDAIRCRAFFFTLTGSARHWFERLKRRSISCFKDLARTFVAQFMVPENSESLTS
          M  Q  SD + CRAF  TL GSAR WF +L   +I  F DL+R FVA FM   N +   S
Subjt:  --MDFQDVSDAIRCRAFFFTLTGSARHWFERLKRRSISCFKDLARTFVAQFMVPENSESLTS

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGAATCGGAGTTTGTCCAGAATACTTCAAATCCTGGATAAACCCGACCCTAGCACCAAACTCCATGAGGGGGGCTTGGTTAGAGACCCGAAGAAGGGGAATGAGCCAAT
CGAATACGTGGATGAATCAGAGACAGAATCAAAAGGAAAGAAGACCAACAGCGCAACCAGCAAGGTCAGGGGGCTGAAGCATGCAGAACGCATGGTACTGAGGAGCCCAG
AGCCAAGTACCAGCCGTAGAACAGACCTGAGAAATGTGATCGAGAAAAAGCGCAGATTGCCCAAAACTGTCGAGTCTGAGGCTAGAGCTGCCGAGGCCGAGCCCAAAGTT
GCCAAGGCCGAGGCTAAGAAAGACCATCTCCCTTGGAAGACTGAGCTTCTAAACACACTAAAGGAGCTTGGAAATCCTCAGGGAGACCTGCAGAAGTTGAAGGACTCAGG
AGGGCAAGACATGGAAGAACTAATCGACCAAGTCGACCCACCCTTCACTGAAGAAGTCATGAAAGCTGAGGTGCCCCAGAAGTTCAAGATGGACTTCCAAGACGTCTCAG
ATGCAATCAGGTGCCGTGCATTCTTTTTCACCCTAACAGGATCAGCCAGACATTGGTTTGAGAGGCTGAAAAGGAGATCCATCAGCTGTTTCAAGGATTTAGCCCGAACA
TTCGTTGCACAGTTCATGGTTCCAGAGAACAGCGAAAGCCTCACATCAACCTCTTAA
mRNA sequence
ATGAATCGGAGTTTGTCCAGAATACTTCAAATCCTGGATAAACCCGACCCTAGCACCAAACTCCATGAGGGGGGCTTGGTTAGAGACCCGAAGAAGGGGAATGAGCCAAT
CGAATACGTGGATGAATCAGAGACAGAATCAAAAGGAAAGAAGACCAACAGCGCAACCAGCAAGGTCAGGGGGCTGAAGCATGCAGAACGCATGGTACTGAGGAGCCCAG
AGCCAAGTACCAGCCGTAGAACAGACCTGAGAAATGTGATCGAGAAAAAGCGCAGATTGCCCAAAACTGTCGAGTCTGAGGCTAGAGCTGCCGAGGCCGAGCCCAAAGTT
GCCAAGGCCGAGGCTAAGAAAGACCATCTCCCTTGGAAGACTGAGCTTCTAAACACACTAAAGGAGCTTGGAAATCCTCAGGGAGACCTGCAGAAGTTGAAGGACTCAGG
AGGGCAAGACATGGAAGAACTAATCGACCAAGTCGACCCACCCTTCACTGAAGAAGTCATGAAAGCTGAGGTGCCCCAGAAGTTCAAGATGGACTTCCAAGACGTCTCAG
ATGCAATCAGGTGCCGTGCATTCTTTTTCACCCTAACAGGATCAGCCAGACATTGGTTTGAGAGGCTGAAAAGGAGATCCATCAGCTGTTTCAAGGATTTAGCCCGAACA
TTCGTTGCACAGTTCATGGTTCCAGAGAACAGCGAAAGCCTCACATCAACCTCTTAA
Protein sequence
MNRSLSRILQILDKPDPSTKLHEGGLVRDPKKGNEPIEYVDESETESKGKKTNSATSKVRGLKHAERMVLRSPEPSTSRRTDLRNVIEKKRRLPKTVESEARAAEAEPKV
AKAEAKKDHLPWKTELLNTLKELGNPQGDLQKLKDSGGQDMEELIDQVDPPFTEEVMKAEVPQKFKMDFQDVSDAIRCRAFFFTLTGSARHWFERLKRRSISCFKDLART
FVAQFMVPENSESLTSTS
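
The relationship between the CDS and protein sequences above can be checked by translating the CDS with the standard genetic code. The sketch below is illustrative only: the `translate` helper is hypothetical, it assumes the standard nuclear code (NCBI translation table 1) applies to this gene, and for brevity it translates only the first 30 and the last 57 nucleotides of the CDS listed above rather than the full sequence.

```python
# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds):
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nt and final 57 nt of the CDS shown in this record.
cds_start = "ATGAATCGGAGTTTGTCCAGAATACTTCAA"
cds_end = "TTCGTTGCACAGTTCATGGTTCCAGAGAACAGCGAAAGCCTCACATCAACCTCTTAA"

print(translate(cds_start))  # prints: MNRSLSRILQ
print(translate(cds_end))    # prints: FVAQFMVPENSESLTSTS
```

Both fragments reproduce the corresponding ends of the protein sequence listed above. To check the whole gene, join the CDS lines into one string and pass it to `translate`.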