; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0008556 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0008556
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr9:25312094..25317091
RNA-Seq ExpressionLag0008556
SyntenyLag0008556
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EXB53755.1 hypothetical protein L484_022412 [Morus notabilis]5.2e-1936.49Show/hide
Query:  PDFMTKVITQYKWQEFCAHPQEGVVPLVREFYVGLREENMSLMIVRGKMVSFSLVDINRVYRIKAPLHLRGNDAIKNPSAKQLKEALKIVARKGVQWKES
        P F+T+VI Q+ W++FC HP   +VPLVREFY  L + N   + V+   V F+   IN ++ ++  +     D     + +QL+  L  VA +G  W+ S
Subjt:  PDFMTKVITQYKWQEFCAHPQEGVVPLVREFYVGLREENMSLMIVRGKMVSFSLVDINRVYRIKAPLHLRGNDAIKNPSAKQLKEALKIVARKGVQWKES

Query:  QTKVKTLVLSDLKSGPAVWFHFIKNRLMPTTHDSTISVERVVLLYSIM
             T +  +LK    +W+HF+  R MP+TH  T++ +RV+LLYSI+
Subjt:  QTKVKTLVLSDLKSGPAVWFHFIKNRLMPTTHDSTISVERVVLLYSIM

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]4.1e-2435.62Show/hide
Query:  KDVNFQ-EKMEITWKRDFLN-----EKGF----ASRVGALPDFMTKVITQYKWQEFCAHPQEGVVPLVREFYVGLREENMSLMIVRGKMVSFSLVDINRV
        K V F+ E  E  ++ +  N     EKGF    +  +G LP F+ +VITQ+ W++FCAHP++ +VPLVREFY  L +   + + VRG  VS+S   IN V
Subjt:  KDVNFQ-EKMEITWKRDFLN-----EKGF----ASRVGALPDFMTKVITQYKWQEFCAHPQEGVVPLVREFYVGLREENMSLMIVRGKMVSFSLVDINRV

Query:  YRIKAPLHLRGNDAIKNPSAKQLKEALKIVARKGVQWKESQTKVKTLVLSDLKSGPAVWFHFIKNRLMPTTHDSTISVERVVLLYSIMKRADTCFTSTTH
        + +  P+    ++ I+N +   L   L+ VA  G +W  S     T + S L     VWFHF+K+ L+PTTH  T+S +R++LL+S++           H
Subjt:  YRIKAPLHLRGNDAIKNPSAKQLKEALKIVARKGVQWKESQTKVKTLVLSDLKSGPAVWFHFIKNRLMPTTHDSTISVERVVLLYSIMKRADTCFTSTTH

Query:  SYFRATPSSEAFAFAYRQL
        S  RA  + +  A  +  L
Subjt:  SYFRATPSSEAFAFAYRQL

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]4.8e-2537.11Show/hide
Query:  EKGF----ASRVGALPDFMTKVITQYKWQEFCAHPQEGVVPLVREFYVGLREENMSLMIVRGKMVSFSLVDINRVYRIKAPLHLRGNDAIKNPSAKQLKE
        EKGF    +  +G LP F+ +VITQ+ W++FCAHP++ +VPLVREFY  L +   + + VRG  VS+S   IN V+ +  P+    ++ I+N + + L  
Subjt:  EKGF----ASRVGALPDFMTKVITQYKWQEFCAHPQEGVVPLVREFYVGLREENMSLMIVRGKMVSFSLVDINRVYRIKAPLHLRGNDAIKNPSAKQLKE

Query:  ALKIVARKGVQWKESQTKVKTLVLSDLKSGPAVWFHFIKNRLMPTTHDSTISVERVVLLYSIMKRADTCFTSTTHSYFRATPSSEAFAFAYRQL
         L+ VA  G +W  S     T + S L     VW+HF+K+RL+PTTH  T+S +R++LL+S++           HS  RA  + +  A  +  L
Subjt:  ALKIVARKGVQWKESQTKVKTLVLSDLKSGPAVWFHFIKNRLMPTTHDSTISVERVVLLYSIMKRADTCFTSTTHSYFRATPSSEAFAFAYRQL

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]6.5e-2235.83Show/hide
Query:  KDVNFQEK-MEITWKRDFLN-----EKGFA---SRVGALPDFMTKVITQYKWQEFCAHPQEGVVPLVREFYVGLREENMSLMIVRGKMVSFSLVDINRVY
        K V F+ K  EI ++ +  N     EK F    S+    P F+  VI Q+ WQ FCAHP++ +VPLVREFY  +   +   + +RG  V  S+  IN ++
Subjt:  KDVNFQEK-MEITWKRDFLN-----EKGFA---SRVGALPDFMTKVITQYKWQEFCAHPQEGVVPLVREFYVGLREENMSLMIVRGKMVSFSLVDINRVY

Query:  RIKAPLHLRGNDAIKNPSAKQLKEALKIVARKGVQWKESQTKVKTLVLSDLKSGPAVWFHFIKNRLMPTTHDSTISVERVVLLYSIM
         +  P+    ++ +++ +  +L   L+ VA  G +W  S     T + S L     VW+HF+K+RL+PTTH  T+S E V LLYS++
Subjt:  RIKAPLHLRGNDAIKNPSAKQLKEALKIVARKGVQWKESQTKVKTLVLSDLKSGPAVWFHFIKNRLMPTTHDSTISVERVVLLYSIM

XP_038904385.1 uncharacterized protein LOC120090747 [Benincasa hispida]1.3e-1726.96Show/hide
Query:  SLKQHSEEGQVKVVDEQQADEPLDLLFEKKAKTSSDSETESDYEIKELDDDQIPISSALRKKRIRE---LRVERRTKNKNDPIFTKRPRTKSMDASHVPP
        ++++ S EG ++VV E    + L    + K K  S  +    YE      ++    S+ +++R RE   L+ E + K    P       T S++    P 
Subjt:  SLKQHSEEGQVKVVDEQQADEPLDLLFEKKAKTSSDSETESDYEIKELDDDQIPISSALRKKRIRE---LRVERRTKNKNDPIFTKRPRTKSMDASHVPP

Query:  TTTPIKPK---------VKTPKVASPNNPFPEVFKDVNFQEKMEITWKR--------DFLNEKGFASRVGALPDFMTKVITQYKWQEFCAHPQEGVVPLV
            ++P+         V   +  S   P     K+   Q +  +T  R        D + E GF      LPDF T V+ ++ W+ F       +  +V
Subjt:  TTTPIKPK---------VKTPKVASPNNPFPEVFKDVNFQEKMEITWKR--------DFLNEKGFASRVGALPDFMTKVITQYKWQEFCAHPQEGVVPLV

Query:  REFYVGLREENMSLMIVRGKMVSFSLVDINRVYRIKAPLHLRGNDAIKNPSAKQLKEALKIVARKGVQWKESQTKVKTLVLSDLKSGPAVWFHFIKNRLM
        R FY G        +I++G +V FS  DIN +Y++K      GN  I +P  +++++AL+ + + G QW  S   +KTL  S L     +W + +K R++
Subjt:  REFYVGLREENMSLMIVRGKMVSFSLVDINRVYRIKAPLHLRGNDAIKNPSAKQLKEALKIVARKGVQWKESQTKVKTLVLSDLKSGPAVWFHFIKNRLM

Query:  PTTHDSTISVERVVLLYSI
        PT+HD T+S +RV+  Y I
Subjt:  PTTHDSTISVERVVLLYSI

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)2.0e-2435.62Show/hide
Query:  KDVNFQ-EKMEITWKRDFLN-----EKGF----ASRVGALPDFMTKVITQYKWQEFCAHPQEGVVPLVREFYVGLREENMSLMIVRGKMVSFSLVDINRV
        K V F+ E  E  ++ +  N     EKGF    +  +G LP F+ +VITQ+ W++FCAHP++ +VPLVREFY  L +   + + VRG  VS+S   IN V
Subjt:  KDVNFQ-EKMEITWKRDFLN-----EKGF----ASRVGALPDFMTKVITQYKWQEFCAHPQEGVVPLVREFYVGLREENMSLMIVRGKMVSFSLVDINRV

Query:  YRIKAPLHLRGNDAIKNPSAKQLKEALKIVARKGVQWKESQTKVKTLVLSDLKSGPAVWFHFIKNRLMPTTHDSTISVERVVLLYSIMKRADTCFTSTTH
        + +  P+    ++ I+N +   L   L+ VA  G +W  S     T + S L     VWFHF+K+ L+PTTH  T+S +R++LL+S++           H
Subjt:  YRIKAPLHLRGNDAIKNPSAKQLKEALKIVARKGVQWKESQTKVKTLVLSDLKSGPAVWFHFIKNRLMPTTHDSTISVERVVLLYSIMKRADTCFTSTTH

Query:  SYFRATPSSEAFAFAYRQL
        S  RA  + +  A  +  L
Subjt:  SYFRATPSSEAFAFAYRQL

A0A2P5BCG4 Uncharacterized protein (Fragment)2.3e-2537.11Show/hide
Query:  EKGF----ASRVGALPDFMTKVITQYKWQEFCAHPQEGVVPLVREFYVGLREENMSLMIVRGKMVSFSLVDINRVYRIKAPLHLRGNDAIKNPSAKQLKE
        EKGF    +  +G LP F+ +VITQ+ W++FCAHP++ +VPLVREFY  L +   + + VRG  VS+S   IN V+ +  P+    ++ I+N + + L  
Subjt:  EKGF----ASRVGALPDFMTKVITQYKWQEFCAHPQEGVVPLVREFYVGLREENMSLMIVRGKMVSFSLVDINRVYRIKAPLHLRGNDAIKNPSAKQLKE

Query:  ALKIVARKGVQWKESQTKVKTLVLSDLKSGPAVWFHFIKNRLMPTTHDSTISVERVVLLYSIMKRADTCFTSTTHSYFRATPSSEAFAFAYRQL
         L+ VA  G +W  S     T + S L     VW+HF+K+RL+PTTH  T+S +R++LL+S++           HS  RA  + +  A  +  L
Subjt:  ALKIVARKGVQWKESQTKVKTLVLSDLKSGPAVWFHFIKNRLMPTTHDSTISVERVVLLYSIMKRADTCFTSTTHSYFRATPSSEAFAFAYRQL

A0A2P5DAQ2 Uncharacterized protein3.2e-2235.83Show/hide
Query:  KDVNFQEK-MEITWKRDFLN-----EKGFA---SRVGALPDFMTKVITQYKWQEFCAHPQEGVVPLVREFYVGLREENMSLMIVRGKMVSFSLVDINRVY
        K V F+ K  EI ++ +  N     EK F    S+    P F+  VI Q+ WQ FCAHP++ +VPLVREFY  +   +   + +RG  V  S+  IN ++
Subjt:  KDVNFQEK-MEITWKRDFLN-----EKGFA---SRVGALPDFMTKVITQYKWQEFCAHPQEGVVPLVREFYVGLREENMSLMIVRGKMVSFSLVDINRVY

Query:  RIKAPLHLRGNDAIKNPSAKQLKEALKIVARKGVQWKESQTKVKTLVLSDLKSGPAVWFHFIKNRLMPTTHDSTISVERVVLLYSIM
         +  P+    ++ +++ +  +L   L+ VA  G +W  S     T + S L     VW+HF+K+RL+PTTH  T+S E V LLYS++
Subjt:  RIKAPLHLRGNDAIKNPSAKQLKEALKIVARKGVQWKESQTKVKTLVLSDLKSGPAVWFHFIKNRLMPTTHDSTISVERVVLLYSIM

W9QTD9 Uncharacterized protein2.5e-1936.49Show/hide
Query:  PDFMTKVITQYKWQEFCAHPQEGVVPLVREFYVGLREENMSLMIVRGKMVSFSLVDINRVYRIKAPLHLRGNDAIKNPSAKQLKEALKIVARKGVQWKES
        P F+T+VI Q+ W++FC HP   +VPLVREFY  L + N   + V+   V F+   IN ++ ++  +     D     + +QL+  L  VA +G  W+ S
Subjt:  PDFMTKVITQYKWQEFCAHPQEGVVPLVREFYVGLREENMSLMIVRGKMVSFSLVDINRVYRIKAPLHLRGNDAIKNPSAKQLKEALKIVARKGVQWKES

Query:  QTKVKTLVLSDLKSGPAVWFHFIKNRLMPTTHDSTISVERVVLLYSIM
             T +  +LK    +W+HF+  R MP+TH  T++ +RV+LLYSI+
Subjt:  QTKVKTLVLSDLKSGPAVWFHFIKNRLMPTTHDSTISVERVVLLYSIM

W9RBS1 Uncharacterized protein1.8e-1733.5Show/hide
Query:  PKVKTPKVASPNNPFPEVFKDVNFQEKMEITWKRDFLNEKGFA---SRVGALPDFMTKVITQYKWQEFCAHPQEGVVPLVREFYVGLREENMSLMIVRGK
        P   T +V S N  F +   +  ++E +   W R+ + EKGF    S     P F++ VI    WQ FC HP + +VPLV+EFY  L+ +  + + V   
Subjt:  PKVKTPKVASPNNPFPEVFKDVNFQEKMEITWKRDFLNEKGFA---SRVGALPDFMTKVITQYKWQEFCAHPQEGVVPLVREFYVGLREENMSLMIVRGK

Query:  MVSFSLVDINRVYRIKAPLHLRGNDAIKNPSAKQLKEALKIVARKGVQWKESQTKVKTLVLSDLKSGPAVWFHFIKNRLMPTTHDSTISVERVVLLYSIM
         ++F+   IN V  I         + I +   +QLKE LK +A  G QW  S     T    +L+    VW+HF+ +RL+ +TH  TIS  R +LLY+++
Subjt:  MVSFSLVDINRVYRIKAPLHLRGNDAIKNPSAKQLKEALKIVARKGVQWKESQTKVKTLVLSDLKSGPAVWFHFIKNRLMPTTHDSTISVERVVLLYSIM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCTTCTTGAGGTCTCCAATCGTTCCTCCTCTATTCGGCTTCCAAAGATGGAAAGTAAGCCTCCACGAATGGAATAAGACTTTCCTTTCTCGAAAATTGGAATGTTT
TGGGCTTGAGACGGCTACTTCCTTTTATAGAGTGGAGTCAACCCTAATTAGGTCAAATTTTACACGTGGAAGGCTCTACAAGCTTTCTAGATCTGAGATTAATTTAGCTA
ACGCCACCTCATTTAATGAATCATGTGCGGCTCACGTGGCAGTTTTGCTCTCTGTTTTGCTGGATCACAGATTTTGCTCCCTCAAGCAGCATTCTGAGGAAGGCCAAGTT
AAGGTTGTGGATGAGCAGCAAGCTGATGAGCCCCTCGATCTTTTGTTCGAAAAAAAGGCCAAGACCTCTAGTGACAGTGAGACCGAGTCAGATTATGAAATAAAGGAATT
GGATGATGATCAAATTCCAATTTCTTCGGCATTAAGGAAGAAGAGAATAAGAGAACTTAGGGTTGAGAGGAGAACAAAGAACAAAAATGACCCCATATTCACAAAGAGGC
CGAGGACAAAGTCCATGGATGCCTCTCATGTACCTCCTACCACCACACCAATCAAGCCAAAGGTTAAAACACCTAAGGTTGCATCGCCAAATAATCCATTCCCTGAGGTC
TTTAAAGATGTTAATTTCCAAGAGAAGATGGAGATTACGTGGAAGAGGGACTTTTTGAACGAGAAAGGATTTGCAAGTCGAGTAGGGGCATTGCCTGACTTCATGACTAA
AGTCATCACCCAATACAAATGGCAAGAGTTCTGTGCTCACCCTCAAGAGGGCGTGGTGCCTCTAGTCCGAGAATTTTATGTAGGATTGAGGGAAGAAAACATGAGTTTGA
TGATTGTAAGAGGGAAGATGGTGAGTTTCTCTTTGGTTGACATTAACAGGGTATATCGGATCAAAGCTCCTTTGCATCTGAGAGGGAATGATGCCATTAAGAACCCTTCG
GCCAAGCAGTTAAAAGAAGCATTGAAGATAGTGGCCAGGAAGGGAGTCCAGTGGAAGGAGTCTCAGACCAAAGTGAAGACACTAGTGCTAAGTGATTTAAAGTCAGGACC
AGCAGTGTGGTTTCACTTCATCAAAAATAGATTGATGCCCACCACCCATGATAGTACCATTTCAGTAGAGAGGGTTGTCCTTCTCTACAGCATAATGAAAAGGGCTGACA
CCTGCTTCACCTCCACAACACACTCCTATTTCAGGGCTACACCATCATCGGAGGCCTTTGCATTTGCCTACCGCCAACTAGACCGCATTGAAGATAAACTGAAAACATAT
TGGGTCTATGTGAACGAAAGAGATGAAGCAATTAGAGAGTTCTATCTCTCTATCGCCCCAAGCATCGCTTATGTTTTCCCTGACTTCCTCCAACCTTGTTGCCTCAAGAA
GAAGAAGAGTCTGAAGATGAAGATGAAGAAGTTGAAGAGAGTTCCTCAGAGGAGGAATAGGGAAAAACAGAGGAATCTGGAACTTCCCAGAAATGCGAAGGCAAAAATCA
ATTGCGACCGCATTTCTGGGCAGATGAAGGCAGTTCCGAGTCCGTCGCGGGTCGTTTTGAACGAGTGTTTTACGCACCTAACCGGCTGTTTTGACTCTGAAACTGATGGG
CCACCTGCTAGCACCATTTTCACGACTCCTCAACCACTCCCCACACTATAA
mRNA sequenceShow/hide mRNA sequence
ATGGTCTTCTTGAGGTCTCCAATCGTTCCTCCTCTATTCGGCTTCCAAAGATGGAAAGTAAGCCTCCACGAATGGAATAAGACTTTCCTTTCTCGAAAATTGGAATGTTT
TGGGCTTGAGACGGCTACTTCCTTTTATAGAGTGGAGTCAACCCTAATTAGGTCAAATTTTACACGTGGAAGGCTCTACAAGCTTTCTAGATCTGAGATTAATTTAGCTA
ACGCCACCTCATTTAATGAATCATGTGCGGCTCACGTGGCAGTTTTGCTCTCTGTTTTGCTGGATCACAGATTTTGCTCCCTCAAGCAGCATTCTGAGGAAGGCCAAGTT
AAGGTTGTGGATGAGCAGCAAGCTGATGAGCCCCTCGATCTTTTGTTCGAAAAAAAGGCCAAGACCTCTAGTGACAGTGAGACCGAGTCAGATTATGAAATAAAGGAATT
GGATGATGATCAAATTCCAATTTCTTCGGCATTAAGGAAGAAGAGAATAAGAGAACTTAGGGTTGAGAGGAGAACAAAGAACAAAAATGACCCCATATTCACAAAGAGGC
CGAGGACAAAGTCCATGGATGCCTCTCATGTACCTCCTACCACCACACCAATCAAGCCAAAGGTTAAAACACCTAAGGTTGCATCGCCAAATAATCCATTCCCTGAGGTC
TTTAAAGATGTTAATTTCCAAGAGAAGATGGAGATTACGTGGAAGAGGGACTTTTTGAACGAGAAAGGATTTGCAAGTCGAGTAGGGGCATTGCCTGACTTCATGACTAA
AGTCATCACCCAATACAAATGGCAAGAGTTCTGTGCTCACCCTCAAGAGGGCGTGGTGCCTCTAGTCCGAGAATTTTATGTAGGATTGAGGGAAGAAAACATGAGTTTGA
TGATTGTAAGAGGGAAGATGGTGAGTTTCTCTTTGGTTGACATTAACAGGGTATATCGGATCAAAGCTCCTTTGCATCTGAGAGGGAATGATGCCATTAAGAACCCTTCG
GCCAAGCAGTTAAAAGAAGCATTGAAGATAGTGGCCAGGAAGGGAGTCCAGTGGAAGGAGTCTCAGACCAAAGTGAAGACACTAGTGCTAAGTGATTTAAAGTCAGGACC
AGCAGTGTGGTTTCACTTCATCAAAAATAGATTGATGCCCACCACCCATGATAGTACCATTTCAGTAGAGAGGGTTGTCCTTCTCTACAGCATAATGAAAAGGGCTGACA
CCTGCTTCACCTCCACAACACACTCCTATTTCAGGGCTACACCATCATCGGAGGCCTTTGCATTTGCCTACCGCCAACTAGACCGCATTGAAGATAAACTGAAAACATAT
TGGGTCTATGTGAACGAAAGAGATGAAGCAATTAGAGAGTTCTATCTCTCTATCGCCCCAAGCATCGCTTATGTTTTCCCTGACTTCCTCCAACCTTGTTGCCTCAAGAA
GAAGAAGAGTCTGAAGATGAAGATGAAGAAGTTGAAGAGAGTTCCTCAGAGGAGGAATAGGGAAAAACAGAGGAATCTGGAACTTCCCAGAAATGCGAAGGCAAAAATCA
ATTGCGACCGCATTTCTGGGCAGATGAAGGCAGTTCCGAGTCCGTCGCGGGTCGTTTTGAACGAGTGTTTTACGCACCTAACCGGCTGTTTTGACTCTGAAACTGATGGG
CCACCTGCTAGCACCATTTTCACGACTCCTCAACCACTCCCCACACTATAA
Protein sequenceShow/hide protein sequence
MVFLRSPIVPPLFGFQRWKVSLHEWNKTFLSRKLECFGLETATSFYRVESTLIRSNFTRGRLYKLSRSEINLANATSFNESCAAHVAVLLSVLLDHRFCSLKQHSEEGQV
KVVDEQQADEPLDLLFEKKAKTSSDSETESDYEIKELDDDQIPISSALRKKRIRELRVERRTKNKNDPIFTKRPRTKSMDASHVPPTTTPIKPKVKTPKVASPNNPFPEV
FKDVNFQEKMEITWKRDFLNEKGFASRVGALPDFMTKVITQYKWQEFCAHPQEGVVPLVREFYVGLREENMSLMIVRGKMVSFSLVDINRVYRIKAPLHLRGNDAIKNPS
AKQLKEALKIVARKGVQWKESQTKVKTLVLSDLKSGPAVWFHFIKNRLMPTTHDSTISVERVVLLYSIMKRADTCFTSTTHSYFRATPSSEAFAFAYRQLDRIEDKLKTY
WVYVNERDEAIREFYLSIAPSIAYVFPDFLQPCCLKKKKSLKMKMKKLKRVPQRRNREKQRNLELPRNAKAKINCDRISGQMKAVPSPSRVVLNECFTHLTGCFDSETDG
PPASTIFTTPQPLPTL