; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg002811 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg002811
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationscaffold6:382307..390047
RNA-Seq ExpressionSpg002811
SyntenySpg002811
Gene Ontology termsGO:0006298 - mismatch repair (biological process)
GO:0045910 - negative regulation of DNA recombination (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0004519 - endonuclease activity (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0016887 - ATPase activity (molecular function)
GO:0030983 - mismatched DNA binding (molecular function)
InterPro domainsIPR026960 - Reverse transcriptase zinc-binding domain
IPR045076 - DNA mismatch repair MutS family


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022141294.1 uncharacterized protein LOC111011726 isoform X4 [Momordica charantia]2.7e-2878.49Show/hide
Query:  MLSAALFRHPLTSIISATLPVKNVSSFRFQNQATS--VNFSLSANNSVSNGIRDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGREAVKV
        +LSAA+F  PLTSIISA LPVKN++S RFQN+  S  + FSLSA NSVSN I DDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGR+A+KV
Subjt:  MLSAALFRHPLTSIISATLPVKNVSSFRFQNQATS--VNFSLSANNSVSNGIRDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGREAVKV

XP_022922841.1 uncharacterized protein LOC111430703 isoform X2 [Cucurbita moschata]4.5e-2840.16Show/hide
Query:  MLSAALFRHPLTSIISATLPVKNVSSFRFQNQATSVNFSLSANNSVSNGIRDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGREAVKVYIAPKFSCS
        MLSAA+F H LT I SATLPV +VSSFRFQN+A  V+FSLSAN SV N IR DRN+HSIHLDSLRALEWDKLCDSVASFARTSLGR+A+K          
Subjt:  MLSAALFRHPLTSIISATLPVKNVSSFRFQNQATSVNFSLSANNSVSNGIRDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGREAVKVYIAPKFSCS

Query:  YQPHSSFTSLDLNRIENSLDFIRMKEVWEGEQSSILPSLLSSDEEEISDQDELNAQRFFPSSEQLAVNEAINACIEVDTVRPKRISTAKDPSKRAVPMQN
                                 ++W         SL  + EE +   DE NA         + +++     +++  V  + + +A + ++R++PM  
Subjt:  YQPHSSFTSLDLNRIENSLDFIRMKEVWEGEQSSILPSLLSSDEEEISDQDELNAQRFFPSSEQLAVNEAINACIEVDTVRPKRISTAKDPSKRAVPMQN

Query:  SFDQRCLEGGKQFLDLPIRRNLKD--KEVAEWTDLSLDLTPIVL
        + +   +    QF D+ ++ NLK   KE A+W+   + LT +++
Subjt:  SFDQRCLEGGKQFLDLPIRRNLKD--KEVAEWTDLSLDLTPIVL

XP_022922843.1 uncharacterized protein LOC111430703 isoform X3 [Cucurbita moschata]4.5e-2840.16Show/hide
Query:  MLSAALFRHPLTSIISATLPVKNVSSFRFQNQATSVNFSLSANNSVSNGIRDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGREAVKVYIAPKFSCS
        MLSAA+F H LT I SATLPV +VSSFRFQN+A  V+FSLSAN SV N IR DRN+HSIHLDSLRALEWDKLCDSVASFARTSLGR+A+K          
Subjt:  MLSAALFRHPLTSIISATLPVKNVSSFRFQNQATSVNFSLSANNSVSNGIRDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGREAVKVYIAPKFSCS

Query:  YQPHSSFTSLDLNRIENSLDFIRMKEVWEGEQSSILPSLLSSDEEEISDQDELNAQRFFPSSEQLAVNEAINACIEVDTVRPKRISTAKDPSKRAVPMQN
                                 ++W         SL  + EE +   DE NA         + +++     +++  V  + + +A + ++R++PM  
Subjt:  YQPHSSFTSLDLNRIENSLDFIRMKEVWEGEQSSILPSLLSSDEEEISDQDELNAQRFFPSSEQLAVNEAINACIEVDTVRPKRISTAKDPSKRAVPMQN

Query:  SFDQRCLEGGKQFLDLPIRRNLKD--KEVAEWTDLSLDLTPIVL
        + +   +    QF D+ ++ NLK   KE A+W+   + LT +++
Subjt:  SFDQRCLEGGKQFLDLPIRRNLKD--KEVAEWTDLSLDLTPIVL

XP_022922844.1 uncharacterized protein LOC111430703 isoform X4 [Cucurbita moschata]4.5e-2840.16Show/hide
Query:  MLSAALFRHPLTSIISATLPVKNVSSFRFQNQATSVNFSLSANNSVSNGIRDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGREAVKVYIAPKFSCS
        MLSAA+F H LT I SATLPV +VSSFRFQN+A  V+FSLSAN SV N IR DRN+HSIHLDSLRALEWDKLCDSVASFARTSLGR+A+K          
Subjt:  MLSAALFRHPLTSIISATLPVKNVSSFRFQNQATSVNFSLSANNSVSNGIRDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGREAVKVYIAPKFSCS

Query:  YQPHSSFTSLDLNRIENSLDFIRMKEVWEGEQSSILPSLLSSDEEEISDQDELNAQRFFPSSEQLAVNEAINACIEVDTVRPKRISTAKDPSKRAVPMQN
                                 ++W         SL  + EE +   DE NA         + +++     +++  V  + + +A + ++R++PM  
Subjt:  YQPHSSFTSLDLNRIENSLDFIRMKEVWEGEQSSILPSLLSSDEEEISDQDELNAQRFFPSSEQLAVNEAINACIEVDTVRPKRISTAKDPSKRAVPMQN

Query:  SFDQRCLEGGKQFLDLPIRRNLKD--KEVAEWTDLSLDLTPIVL
        + +   +    QF D+ ++ NLK   KE A+W+   + LT +++
Subjt:  SFDQRCLEGGKQFLDLPIRRNLKD--KEVAEWTDLSLDLTPIVL

XP_023552491.1 uncharacterized protein LOC111810138 [Cucurbita pepo subsp. pepo]1.2e-2840.57Show/hide
Query:  MLSAALFRHPLTSIISATLPVKNVSSFRFQNQATSVNFSLSANNSVSNGIRDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGREAVKVYIAPKFSCS
        MLSAA+F H LT I SATLPV NVSSFRFQN+A  V+FSLSAN SV N IR DRN+HSIHLDSLRALEWDKLCDSVASFARTSLGR+A+K          
Subjt:  MLSAALFRHPLTSIISATLPVKNVSSFRFQNQATSVNFSLSANNSVSNGIRDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGREAVKVYIAPKFSCS

Query:  YQPHSSFTSLDLNRIENSLDFIRMKEVWEGEQSSILPSLLSSDEEEISDQDELNAQRFFPSSEQLAVNEAINACIEVDTVRPKRISTAKDPSKRAVPMQN
                                 ++W         SL  + EE +   DE NA         + +++     +++  V  + + +A + ++R++PM  
Subjt:  YQPHSSFTSLDLNRIENSLDFIRMKEVWEGEQSSILPSLLSSDEEEISDQDELNAQRFFPSSEQLAVNEAINACIEVDTVRPKRISTAKDPSKRAVPMQN

Query:  SFDQRCLEGGKQFLDLPIRRNLKD--KEVAEWTDLSLDLTPIVL
        + +   +    QF D+ ++ NLK   KE A+W+   + LT +++
Subjt:  SFDQRCLEGGKQFLDLPIRRNLKD--KEVAEWTDLSLDLTPIVL

TrEMBL top hitse value%identityAlignment
A0A6J1CI69 uncharacterized protein LOC111011726 isoform X41.3e-2878.49Show/hide
Query:  MLSAALFRHPLTSIISATLPVKNVSSFRFQNQATS--VNFSLSANNSVSNGIRDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGREAVKV
        +LSAA+F  PLTSIISA LPVKN++S RFQN+  S  + FSLSA NSVSN I DDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGR+A+KV
Subjt:  MLSAALFRHPLTSIISATLPVKNVSSFRFQNQATS--VNFSLSANNSVSNGIRDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGREAVKV

A0A6J1E4M8 uncharacterized protein LOC111430703 isoform X42.2e-2840.16Show/hide
Query:  MLSAALFRHPLTSIISATLPVKNVSSFRFQNQATSVNFSLSANNSVSNGIRDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGREAVKVYIAPKFSCS
        MLSAA+F H LT I SATLPV +VSSFRFQN+A  V+FSLSAN SV N IR DRN+HSIHLDSLRALEWDKLCDSVASFARTSLGR+A+K          
Subjt:  MLSAALFRHPLTSIISATLPVKNVSSFRFQNQATSVNFSLSANNSVSNGIRDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGREAVKVYIAPKFSCS

Query:  YQPHSSFTSLDLNRIENSLDFIRMKEVWEGEQSSILPSLLSSDEEEISDQDELNAQRFFPSSEQLAVNEAINACIEVDTVRPKRISTAKDPSKRAVPMQN
                                 ++W         SL  + EE +   DE NA         + +++     +++  V  + + +A + ++R++PM  
Subjt:  YQPHSSFTSLDLNRIENSLDFIRMKEVWEGEQSSILPSLLSSDEEEISDQDELNAQRFFPSSEQLAVNEAINACIEVDTVRPKRISTAKDPSKRAVPMQN

Query:  SFDQRCLEGGKQFLDLPIRRNLKD--KEVAEWTDLSLDLTPIVL
        + +   +    QF D+ ++ NLK   KE A+W+   + LT +++
Subjt:  SFDQRCLEGGKQFLDLPIRRNLKD--KEVAEWTDLSLDLTPIVL

A0A6J1E586 uncharacterized protein LOC111430703 isoform X22.2e-2840.16Show/hide
Query:  MLSAALFRHPLTSIISATLPVKNVSSFRFQNQATSVNFSLSANNSVSNGIRDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGREAVKVYIAPKFSCS
        MLSAA+F H LT I SATLPV +VSSFRFQN+A  V+FSLSAN SV N IR DRN+HSIHLDSLRALEWDKLCDSVASFARTSLGR+A+K          
Subjt:  MLSAALFRHPLTSIISATLPVKNVSSFRFQNQATSVNFSLSANNSVSNGIRDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGREAVKVYIAPKFSCS

Query:  YQPHSSFTSLDLNRIENSLDFIRMKEVWEGEQSSILPSLLSSDEEEISDQDELNAQRFFPSSEQLAVNEAINACIEVDTVRPKRISTAKDPSKRAVPMQN
                                 ++W         SL  + EE +   DE NA         + +++     +++  V  + + +A + ++R++PM  
Subjt:  YQPHSSFTSLDLNRIENSLDFIRMKEVWEGEQSSILPSLLSSDEEEISDQDELNAQRFFPSSEQLAVNEAINACIEVDTVRPKRISTAKDPSKRAVPMQN

Query:  SFDQRCLEGGKQFLDLPIRRNLKD--KEVAEWTDLSLDLTPIVL
        + +   +    QF D+ ++ NLK   KE A+W+   + LT +++
Subjt:  SFDQRCLEGGKQFLDLPIRRNLKD--KEVAEWTDLSLDLTPIVL

A0A6J1E7X9 uncharacterized protein LOC111430703 isoform X32.2e-2840.16Show/hide
Query:  MLSAALFRHPLTSIISATLPVKNVSSFRFQNQATSVNFSLSANNSVSNGIRDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGREAVKVYIAPKFSCS
        MLSAA+F H LT I SATLPV +VSSFRFQN+A  V+FSLSAN SV N IR DRN+HSIHLDSLRALEWDKLCDSVASFARTSLGR+A+K          
Subjt:  MLSAALFRHPLTSIISATLPVKNVSSFRFQNQATSVNFSLSANNSVSNGIRDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGREAVKVYIAPKFSCS

Query:  YQPHSSFTSLDLNRIENSLDFIRMKEVWEGEQSSILPSLLSSDEEEISDQDELNAQRFFPSSEQLAVNEAINACIEVDTVRPKRISTAKDPSKRAVPMQN
                                 ++W         SL  + EE +   DE NA         + +++     +++  V  + + +A + ++R++PM  
Subjt:  YQPHSSFTSLDLNRIENSLDFIRMKEVWEGEQSSILPSLLSSDEEEISDQDELNAQRFFPSSEQLAVNEAINACIEVDTVRPKRISTAKDPSKRAVPMQN

Query:  SFDQRCLEGGKQFLDLPIRRNLKD--KEVAEWTDLSLDLTPIVL
        + +   +    QF D+ ++ NLK   KE A+W+   + LT +++
Subjt:  SFDQRCLEGGKQFLDLPIRRNLKD--KEVAEWTDLSLDLTPIVL

A0A6J1E9Y4 uncharacterized protein LOC111430703 isoform X12.2e-2840.16Show/hide
Query:  MLSAALFRHPLTSIISATLPVKNVSSFRFQNQATSVNFSLSANNSVSNGIRDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGREAVKVYIAPKFSCS
        MLSAA+F H LT I SATLPV +VSSFRFQN+A  V+FSLSAN SV N IR DRN+HSIHLDSLRALEWDKLCDSVASFARTSLGR+A+K          
Subjt:  MLSAALFRHPLTSIISATLPVKNVSSFRFQNQATSVNFSLSANNSVSNGIRDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGREAVKVYIAPKFSCS

Query:  YQPHSSFTSLDLNRIENSLDFIRMKEVWEGEQSSILPSLLSSDEEEISDQDELNAQRFFPSSEQLAVNEAINACIEVDTVRPKRISTAKDPSKRAVPMQN
                                 ++W         SL  + EE +   DE NA         + +++     +++  V  + + +A + ++R++PM  
Subjt:  YQPHSSFTSLDLNRIENSLDFIRMKEVWEGEQSSILPSLLSSDEEEISDQDELNAQRFFPSSEQLAVNEAINACIEVDTVRPKRISTAKDPSKRAVPMQN

Query:  SFDQRCLEGGKQFLDLPIRRNLKD--KEVAEWTDLSLDLTPIVL
        + +   +    QF D+ ++ NLK   KE A+W+   + LT +++
Subjt:  SFDQRCLEGGKQFLDLPIRRNLKD--KEVAEWTDLSLDLTPIVL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G54090.1 DNA mismatch repair protein MutS, type 23.0e-0929.17Show/hide
Query:  NKHSIHLDSLRALEWDKLCDSVASFARTSLGREAVKV---YIAPKFSCSYQ------------PHSSF----TSLDLNRIENSLDFIRMKEVWEGEQSSI
        +K     DSLR LEWDKLCD VASFARTSLGREA K     +   FS S +             H SF    +S+ ++ +E+ +   + +     +Q+  
Subjt:  NKHSIHLDSLRALEWDKLCDSVASFARTSLGREAVKV---YIAPKFSCSYQ------------PHSSF----TSLDLNRIENSLDFIRMKEVWEGEQSSI

Query:  LPSL------LSSDEEEISDQDELNAQRFFPSSEQLAVNEAINAC---IEVDTVRPKRISTAKDPSKRAVPMQNSFDQRCLEGGKQFLDLPIRRNLKDKE
        + SL      L  D +    QD    +RF P SE L V+  IN     +    + P    T KD +  A+       Q      +Q LD  IR    D+ 
Subjt:  LPSL------LSSDEEEISDQDELNAQRFFPSSEQLAVNEAINAC---IEVDTVRPKRISTAKDPSKRAVPMQNSFDQRCLEGGKQFLDLPIRRNLKDKE

Query:  VAEWTDLSLDLTPIVLSSKEGSWKWLPSPNNFFSTKSLMV
        V            ++ +  +G W    S N   S   L++
Subjt:  VAEWTDLSLDLTPIVLSSKEGSWKWLPSPNNFFSTKSLMV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTTCTGCAGCTCTTTTTCGCCATCCCCTCACCTCCATTATCTCTGCTACACTGCCGGTTAAAAACGTCAGTTCGTTCAGATTCCAGAATCAAGCTACATCCGTAAA
CTTCTCCCTCTCTGCAAACAACTCCGTCAGCAATGGCATTAGAGATGACAGAAACAAGCATTCAATCCACCTCGATAGTCTCAGAGCGCTGGAATGGGATAAACTTTGCG
ATTCCGTAGCTTCCTTCGCTCGCACTTCTCTGGGCCGTGAAGCTGTCAAGGTCTATATCGCACCTAAATTCTCATGTTCTTATCAGCCTCATTCCAGTTTCACTAGTCTC
GATTTGAATAGGATTGAGAATTCTCTTGATTTTATTCGCATGAAAGAAGTGTGGGAAGGTGAACAGTCATCAATCCTCCCTTCTCTTTTGTCCTCGGATGAAGAAGAAAT
TTCTGATCAAGATGAATTAAATGCACAGAGGTTTTTTCCTTCCAGTGAGCAGTTGGCAGTGAATGAGGCCATTAATGCCTGTATAGAAGTGGATACTGTGAGGCCAAAGA
GGATTTCGACCGCCAAGGATCCATCCAAAAGGGCTGTCCCAATGCAAAACAGCTTTGATCAAAGATGTTTGGAAGGAGGAAAACAGTTTTTGGACCTACCGATTAGAAGA
AACTTAAAAGACAAGGAAGTTGCTGAATGGACTGACTTAAGTCTCGATCTCACTCCTATTGTTCTTTCCTCGAAGGAAGGTTCTTGGAAATGGCTCCCTAGCCCAAATAA
TTTTTTCTCTACAAAATCCTTGATGGTGGACATGACAAACAAATCAATCATCCTAGACCCTTCATTAGCCAAAGGAATCTGGAAAGACAAGTACCCAAAAAAGGTAAAAT
TCTTCCTCTGGGAGGTGGTTCACAAAGCCATCAGTACAAAAGAAAATCTTCAAAGAAGAATGCCTTATTTGGCTTTGTCTCCAAGCTGGTGCACTTTGTGTAAAGCTAAC
TATGAATCTCAAAACCACCTTTTCATCCACTGGCCCTACAATATTACTTCTGGAACAAAATTTTGCAAGCTTTTAGATGTGGTGACCACCTCGATAACTCTGTACAAGTG
GGGAGAGGCCCAACTTTGGTCTTTGAACCGGACATATGAAGAAAGCTTGAGACTTTTGGATGAGACTAATGCTGCAGTAGAAATGCACAAGCATGGTGGCTGCAGCTTGG
ATTTAAGTGGCGTTGACCTTCATCTGGAGAAATGGTGTAGCCGATTACTAGGTGATAATTAG
mRNA sequenceShow/hide mRNA sequence
ATGCTTTCTGCAGCTCTTTTTCGCCATCCCCTCACCTCCATTATCTCTGCTACACTGCCGGTTAAAAACGTCAGTTCGTTCAGATTCCAGAATCAAGCTACATCCGTAAA
CTTCTCCCTCTCTGCAAACAACTCCGTCAGCAATGGCATTAGAGATGACAGAAACAAGCATTCAATCCACCTCGATAGTCTCAGAGCGCTGGAATGGGATAAACTTTGCG
ATTCCGTAGCTTCCTTCGCTCGCACTTCTCTGGGCCGTGAAGCTGTCAAGGTCTATATCGCACCTAAATTCTCATGTTCTTATCAGCCTCATTCCAGTTTCACTAGTCTC
GATTTGAATAGGATTGAGAATTCTCTTGATTTTATTCGCATGAAAGAAGTGTGGGAAGGTGAACAGTCATCAATCCTCCCTTCTCTTTTGTCCTCGGATGAAGAAGAAAT
TTCTGATCAAGATGAATTAAATGCACAGAGGTTTTTTCCTTCCAGTGAGCAGTTGGCAGTGAATGAGGCCATTAATGCCTGTATAGAAGTGGATACTGTGAGGCCAAAGA
GGATTTCGACCGCCAAGGATCCATCCAAAAGGGCTGTCCCAATGCAAAACAGCTTTGATCAAAGATGTTTGGAAGGAGGAAAACAGTTTTTGGACCTACCGATTAGAAGA
AACTTAAAAGACAAGGAAGTTGCTGAATGGACTGACTTAAGTCTCGATCTCACTCCTATTGTTCTTTCCTCGAAGGAAGGTTCTTGGAAATGGCTCCCTAGCCCAAATAA
TTTTTTCTCTACAAAATCCTTGATGGTGGACATGACAAACAAATCAATCATCCTAGACCCTTCATTAGCCAAAGGAATCTGGAAAGACAAGTACCCAAAAAAGGTAAAAT
TCTTCCTCTGGGAGGTGGTTCACAAAGCCATCAGTACAAAAGAAAATCTTCAAAGAAGAATGCCTTATTTGGCTTTGTCTCCAAGCTGGTGCACTTTGTGTAAAGCTAAC
TATGAATCTCAAAACCACCTTTTCATCCACTGGCCCTACAATATTACTTCTGGAACAAAATTTTGCAAGCTTTTAGATGTGGTGACCACCTCGATAACTCTGTACAAGTG
GGGAGAGGCCCAACTTTGGTCTTTGAACCGGACATATGAAGAAAGCTTGAGACTTTTGGATGAGACTAATGCTGCAGTAGAAATGCACAAGCATGGTGGCTGCAGCTTGG
ATTTAAGTGGCGTTGACCTTCATCTGGAGAAATGGTGTAGCCGATTACTAGGTGATAATTAG
Protein sequenceShow/hide protein sequence
MLSAALFRHPLTSIISATLPVKNVSSFRFQNQATSVNFSLSANNSVSNGIRDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGREAVKVYIAPKFSCSYQPHSSFTSL
DLNRIENSLDFIRMKEVWEGEQSSILPSLLSSDEEEISDQDELNAQRFFPSSEQLAVNEAINACIEVDTVRPKRISTAKDPSKRAVPMQNSFDQRCLEGGKQFLDLPIRR
NLKDKEVAEWTDLSLDLTPIVLSSKEGSWKWLPSPNNFFSTKSLMVDMTNKSIILDPSLAKGIWKDKYPKKVKFFLWEVVHKAISTKENLQRRMPYLALSPSWCTLCKAN
YESQNHLFIHWPYNITSGTKFCKLLDVVTTSITLYKWGEAQLWSLNRTYEESLRLLDETNAAVEMHKHGGCSLDLSGVDLHLEKWCSRLLGDN