; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0026609 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0026609
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr10:39634067..39635002
RNA-Seq ExpressionLag0026609
SyntenyLag0026609
Gene Ontology termsNA
InterPro domainsIPR025724 - GAG-pre-integrase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KYP54535.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]2.2e-1139.82Show/hide
Query:  FRFNLISVSTISAKLPILISFVDDYCLLQDKSSLKMIGRAKLWQGLYTLQPDSLMSENFH--TSTQHTLNSVACNSVSQCN---ISLWHDRLGHPSIRHL
        F+FNLIS+S + + LP  + F  + CL+QD    KMIG  ++ +GLY L    L S   H  +  Q+ ++SV   S+  CN   I +WH R GHPS   L
Subjt:  FRFNLISVSTISAKLPILISFVDDYCLLQDKSSLKMIGRAKLWQGLYTLQPDSLMSENFH--TSTQHTLNSVACNSVSQCN---ISLWHDRLGHPSIRHL

Query:  TALKSILPCQNFN
         ALK   P  +F+
Subjt:  TALKSILPCQNFN

KYP54541.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]2.2e-1139.82Show/hide
Query:  FRFNLISVSTISAKLPILISFVDDYCLLQDKSSLKMIGRAKLWQGLYTLQPDSLMSENFH--TSTQHTLNSVACNSVSQCN---ISLWHDRLGHPSIRHL
        F+FNLIS+S + + LP  + F  + CL+QD    KMIG  ++ +GLY L    L S   H  +  Q+ ++SV   S+  CN   I +WH R GHPS   L
Subjt:  FRFNLISVSTISAKLPILISFVDDYCLLQDKSSLKMIGRAKLWQGLYTLQPDSLMSENFH--TSTQHTLNSVACNSVSQCN---ISLWHDRLGHPSIRHL

Query:  TALKSILPCQNFN
         ALK   P  +F+
Subjt:  TALKSILPCQNFN

KZV21171.1 hypothetical protein F511_24735 [Dorcoceras hygrometricum]5.7e-1237.4Show/hide
Query:  HVEEFRFNLISVSTISAKLPILISFVDDYCLLQDKSSLKMIGRAKLWQGLYTL--QPDSLMSENFHTSTQHTLNSVACNSVSQCNISLWHDRLGHPSIRH
        HV +F+FNL+S+S  +      + F  D+C  QD S+ K++G  KL+ GLY L   P ++ ++   +S + T  S AC  +   +I++WH R GH SI  
Subjt:  HVEEFRFNLISVSTISAKLPILISFVDDYCLLQDKSSLKMIGRAKLWQGLYTL--QPDSLMSENFHTSTQHTLNSVACNSVSQCNISLWHDRLGHPSIRH

Query:  LTALKSILPCQNFNFQPCLICPL
        L  L  I   QN    PC ICP+
Subjt:  LTALKSILPCQNFNFQPCLICPL

XP_022143573.1 uncharacterized protein LOC111013441 [Momordica charantia]3.0e-1334.64Show/hide
Query:  MPNPPNMLQSHLTTAKVDSNSPSSSGTAHVEEFRFNLISVSTISAKLPIL-ISFVDDYCLLQDKSSLKMIGRAKLWQGLYTLQPDSLMSENFHTSTQHTL
        +PN P  +  +    ++ S+  S  G  ++ EF FNLISV+ +   +P L + F +D C++QDKS  K I + +L  GLY L   S  S         T+
Subjt:  MPNPPNMLQSHLTTAKVDSNSPSSSGTAHVEEFRFNLISVSTISAKLPIL-ISFVDDYCLLQDKSSLKMIGRAKLWQGLYTLQPDSLMSENFHTSTQHTL

Query:  NSVACNSVSQCNISLWHDRLGHPSIRHLTALKSILPCQNFNFQPCLI--CPLP
         S+  ++ +  +  +WH+RLGHPS   L ALKS+LP  N + +  L   C  P
Subjt:  NSVACNSVSQCNISLWHDRLGHPSIRHLTALKSILPCQNFNFQPCLI--CPLP

XP_022152756.1 uncharacterized protein LOC111020399 [Momordica charantia]1.2e-1441.01Show/hide
Query:  AKVDSNSPSSSGTAHVEEFRFNLISVSTISAKLPILISFVDDYCLLQDKSSLKMIGRAKLWQGLYTLQPDSLMSENFHTSTQHTLNSVACNSVSQCNISL
        +K  SNS S S  A +   R+NL+ VS ++A   + + F D+ C+LQDKSS KMIG+A+ W GLY      L+S   + +    L  V CNS        
Subjt:  AKVDSNSPSSSGTAHVEEFRFNLISVSTISAKLPILISFVDDYCLLQDKSSLKMIGRAKLWQGLYTLQPDSLMSENFHTSTQHTLNSVACNSVSQCNISL

Query:  WHDRLGHPSIRHLTALKSILPCQNFNFQ----PCLICPL
        WH+RLGHPS +HL ALK++L   + +      PC I PL
Subjt:  WHDRLGHPSIRHLTALKSILPCQNFNFQ----PCLICPL

TrEMBL top hitse value%identityAlignment
A0A2N9EDE7 Integrase catalytic domain-containing protein1.0e-1136.71Show/hide
Query:  MPNPPNMLQSHLTTAKVDSNSPSSSGTAHVEEFRFNLISVSTISAKLPILISFVDDYCLLQDKSSLKMIGRAKLWQGLYTLQ-PDSLMSENFHTSTQHTL
        +PN    L +H+ T K+ S S        V  F FNLISVS +++ L   I F+ + C +QD    K+IGR K  +GLY L+  DS++  +F + +    
Subjt:  MPNPPNMLQSHLTTAKVDSNSPSSSGTAHVEEFRFNLISVSTISAKLPILISFVDDYCLLQDKSSLKMIGRAKLWQGLYTLQ-PDSLMSENFHTSTQHTL

Query:  NSV--ACNSVSQCNISLWHDRLGHPSIRHLTALKSIL-----PCQ-NFNFQPCLICPL
        +SV  + N+ S  ++ LWH+RLGH S  +L  LK+ +     PC  N +   CL+CPL
Subjt:  NSV--ACNSVSQCNISLWHDRLGHPSIRHLTALKSIL-----PCQ-NFNFQPCLICPL

A0A2Z7AHC5 Integrase catalytic domain-containing protein2.8e-1237.4Show/hide
Query:  HVEEFRFNLISVSTISAKLPILISFVDDYCLLQDKSSLKMIGRAKLWQGLYTL--QPDSLMSENFHTSTQHTLNSVACNSVSQCNISLWHDRLGHPSIRH
        HV +F+FNL+S+S  +      + F  D+C  QD S+ K++G  KL+ GLY L   P ++ ++   +S + T  S AC  +   +I++WH R GH SI  
Subjt:  HVEEFRFNLISVSTISAKLPILISFVDDYCLLQDKSSLKMIGRAKLWQGLYTL--QPDSLMSENFHTSTQHTLNSVACNSVSQCNISLWHDRLGHPSIRH

Query:  LTALKSILPCQNFNFQPCLICPL
        L  L  I   QN    PC ICP+
Subjt:  LTALKSILPCQNFNFQPCLICPL

A0A438HH41 Retrovirus-related Pol polyprotein from transposon RE21.8e-1139.34Show/hide
Query:  VEEFRFNLISVSTISAKLPILISFVDDYCLLQDKSSLKMIGRAKLWQGLYTLQPDSLMSENFHTSTQHTLNSVACNSVSQCNI-SLWHDRLGHPSIRHLT
        V  FR+NL+SVS  +  L + + F  D C++Q+ S  KMIG+      LY L  DS +++            VA + +   NI SLWH RLGHPS   L 
Subjt:  VEEFRFNLISVSTISAKLPILISFVDDYCLLQDKSSLKMIGRAKLWQGLYTLQPDSLMSENFHTSTQHTLNSVACNSVSQCNI-SLWHDRLGHPSIRHLT

Query:  ALKSILPC-QNFNFQPCLICPL
         L+SIL    +F+  PC +CPL
Subjt:  ALKSILPC-QNFNFQPCLICPL

A0A6J1CR17 uncharacterized protein LOC1110134411.5e-1334.64Show/hide
Query:  MPNPPNMLQSHLTTAKVDSNSPSSSGTAHVEEFRFNLISVSTISAKLPIL-ISFVDDYCLLQDKSSLKMIGRAKLWQGLYTLQPDSLMSENFHTSTQHTL
        +PN P  +  +    ++ S+  S  G  ++ EF FNLISV+ +   +P L + F +D C++QDKS  K I + +L  GLY L   S  S         T+
Subjt:  MPNPPNMLQSHLTTAKVDSNSPSSSGTAHVEEFRFNLISVSTISAKLPIL-ISFVDDYCLLQDKSSLKMIGRAKLWQGLYTLQPDSLMSENFHTSTQHTL

Query:  NSVACNSVSQCNISLWHDRLGHPSIRHLTALKSILPCQNFNFQPCLI--CPLP
         S+  ++ +  +  +WH+RLGHPS   L ALKS+LP  N + +  L   C  P
Subjt:  NSVACNSVSQCNISLWHDRLGHPSIRHLTALKSILPCQNFNFQPCLI--CPLP

A0A6J1DIP8 uncharacterized protein LOC1110203995.9e-1541.01Show/hide
Query:  AKVDSNSPSSSGTAHVEEFRFNLISVSTISAKLPILISFVDDYCLLQDKSSLKMIGRAKLWQGLYTLQPDSLMSENFHTSTQHTLNSVACNSVSQCNISL
        +K  SNS S S  A +   R+NL+ VS ++A   + + F D+ C+LQDKSS KMIG+A+ W GLY      L+S   + +    L  V CNS        
Subjt:  AKVDSNSPSSSGTAHVEEFRFNLISVSTISAKLPILISFVDDYCLLQDKSSLKMIGRAKLWQGLYTLQPDSLMSENFHTSTQHTLNSVACNSVSQCNISL

Query:  WHDRLGHPSIRHLTALKSILPCQNFNFQ----PCLICPL
        WH+RLGHPS +HL ALK++L   + +      PC I PL
Subjt:  WHDRLGHPSIRHLTALKSILPCQNFNFQ----PCLICPL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCACTGCCTTTTTGGCCAAGAACTCTCAATCGAATGTGGCCAACTCCAGTCGTTCTAATCCTTAGAATACCAGGAAGAAGGAGCGTCCTCAATGCACTCACTGCAAT
GTTTTGGGGCACACTGTTGATCATTGTTATAAACTCCTTCGACTGCTTCAACCAAGATAGAAATCTCCCTTGTTCCAGGCTTCACTATAGAGCAATGCCAAATCCTCCTA
ATATGTTACAGTCACATTTGACCACTGCCAAGGTTGATTCTAATTCTCCATCTTCTTCTGGTACCGCACATGTCGAAGAATTCAGATTCAACCTCATCTCTGTCAGTACA
ATATCTGCAAAATTGCCCATTTTGATATCCTTTGTTGATGATTATTGTTTGCTTCAGGACAAGTCCTCTTTGAAGATGATTGGTAGGGCTAAACTTTGGCAAGGACTCTA
TACTTTACAGCCTGATTCCTTAATGTCTGAAAATTTTCATACTTCTACACAACATACTCTTAATTCAGTAGCTTGTAACTCTGTGTCTCAATGTAATATTTCTTTGTGGC
ATGACCGTCTTGGTCATCCTTCTATAAGACATTTGACTGCTTTGAAGAGTATTTTGCCTTGTCAAAACTTCAATTTTCAGCCTTGTTTGATATGCCCCTTACCAAACAGT
GGAGACTGTCTTTATCTTCTAGTAATAGCTCAACAACCAGCCCCTTTGAATTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCACTGCCTTTTTGGCCAAGAACTCTCAATCGAATGTGGCCAACTCCAGTCGTTCTAATCCTTAGAATACCAGGAAGAAGGAGCGTCCTCAATGCACTCACTGCAAT
GTTTTGGGGCACACTGTTGATCATTGTTATAAACTCCTTCGACTGCTTCAACCAAGATAGAAATCTCCCTTGTTCCAGGCTTCACTATAGAGCAATGCCAAATCCTCCTA
ATATGTTACAGTCACATTTGACCACTGCCAAGGTTGATTCTAATTCTCCATCTTCTTCTGGTACCGCACATGTCGAAGAATTCAGATTCAACCTCATCTCTGTCAGTACA
ATATCTGCAAAATTGCCCATTTTGATATCCTTTGTTGATGATTATTGTTTGCTTCAGGACAAGTCCTCTTTGAAGATGATTGGTAGGGCTAAACTTTGGCAAGGACTCTA
TACTTTACAGCCTGATTCCTTAATGTCTGAAAATTTTCATACTTCTACACAACATACTCTTAATTCAGTAGCTTGTAACTCTGTGTCTCAATGTAATATTTCTTTGTGGC
ATGACCGTCTTGGTCATCCTTCTATAAGACATTTGACTGCTTTGAAGAGTATTTTGCCTTGTCAAAACTTCAATTTTCAGCCTTGTTTGATATGCCCCTTACCAAACAGT
GGAGACTGTCTTTATCTTCTAGTAATAGCTCAACAACCAGCCCCTTTGAATTGA
Protein sequenceShow/hide protein sequence
MPLPFWPRTLNRMWPTPVVLILRIPGRRSVLNALTAMFWGTLLIIVINSFDCFNQDRNLPCSRLHYRAMPNPPNMLQSHLTTAKVDSNSPSSSGTAHVEEFRFNLISVST
ISAKLPILISFVDDYCLLQDKSSLKMIGRAKLWQGLYTLQPDSLMSENFHTSTQHTLNSVACNSVSQCNISLWHDRLGHPSIRHLTALKSILPCQNFNFQPCLICPLPNS
GDCLYLLVIAQQPAPLN