; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg006507 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg006507
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold2:48451691..48452992
RNA-Seq ExpressionSpg006507
SyntenySpg006507
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EEC68887.1 hypothetical protein OsI_37529 [Oryza sativa Indica Group]1.1e-0726.32Show/hide
Query:  WDERRIREAVSPCDREDILNIPRCSPSSNDEILWDEDSKGVFSVKSAYRLALNLESCKGASCSDELKDKSLWKSVNKEESTGHLDRGELINLWEAMISKL
        WD  +I +     D E ILNI   S S  D I W  D  G+FSV+SAYRLA  L + + +S S        W+ + K +    + +   I  W    + L
Subjt:  WDERRIREAVSPCDREDILNIPRCSPSSNDEILWDEDSKGVFSVKSAYRLALNLESCKGASCSDELKDKSLWKSVNKEESTGHLDRGELINLWEAMISKL

Query:  DDRNIAQATMILWNLWNYRNKYKHNGNPPNLQENQ-------------ASHPNAAVRENQPRLSNI------------------SSPSPNFWRLSTDASW
            +A   M LW  W  RN+  H  + P  + +Q                P A + + +  +  +                    P   + +L+ D S+
Subjt:  DDRNIAQATMILWNLWNYRNKYKHNGNPPNLQENQ-------------ASHPNAAVRENQPRLSNI------------------SSPSPNFWRLSTDASW

Query:  TDAASRGGLGWSLHDSNGSLYAIGCKQVIRKWSIKCLEAKAIMEGLK
          ++ +GGLG  L +S G +    CK + R  +    E +A +EGLK
Subjt:  TDAASRGGLGWSLHDSNGSLYAIGCKQVIRKWSIKCLEAKAIMEGLK

XP_023897447.1 uncharacterized protein LOC112009345 [Quercus suber]2.2e-0826.5Show/hide
Query:  WDERRIREAVSPCDREDILNIPRCSPSSNDEILWDEDSKGVFSVKSAYRLALNLESCKGA-SCSDELKDKSLWKSVNKEESTGHLDRGELINLWEAMI--
        W E  IR+   P D E IL IP  +    D ++W E S G FSV+SAYR+A+ L   + A S S   + +S WK                  LW   +  
Subjt:  WDERRIREAVSPCDREDILNIPRCSPSSNDEILWDEDSKGVFSVKSAYRLALNLESCKGA-SCSDELKDKSLWKSVNKEESTGHLDRGELINLWEAMI--

Query:  SKLDDRNIAQATMILWNLWNYRNKYKHNGNPPN------------LQENQASHPNAAVRENQPRLSNISSPSPNFWRLSTDASWTDAASRGGLGWSLHDS
           D+  I     + W  W  RN+ +H     +            L+ + A+    AVRE      N   P P+  +++ D + T   +  G+G  + D 
Subjt:  SKLDDRNIAQATMILWNLWNYRNKYKHNGNPPN------------LQENQASHPNAAVRENQPRLSNISSPSPNFWRLSTDASWTDAASRGGLGWSLHDS

Query:  NGSLYAIGCKQVIRKWSIKCLEAKAIMEGLKSEK
         G + A   +++        +E KA   GL+  K
Subjt:  NGSLYAIGCKQVIRKWSIKCLEAKAIMEGLKSEK

XP_024034843.1 uncharacterized protein LOC112096145 [Citrus clementina]1.1e-1224.48Show/hide
Query:  MDSERGWDERRIREAVSPCDREDILNIPRCSPSSNDEILWDEDSKGVFSVKSAYRLALNLESCKGASCSDELKD-----KSLWKSVNKEESTGHLDRGEL
        +D +  W E +I +  +    + IL IP       D ++W  D  G +SV+S Y++AL ++      CSD  K      + +WK  +  E    + +  +
Subjt:  MDSERGWDERRIREAVSPCDREDILNIPRCSPSSNDEILWDEDSKGVFSVKSAYRLALNLESCKGASCSDELKD-----KSLWKSVNKEESTGHLDRGEL

Query:  INLWEAMISKLDDRNIAQATMILWNLWNYRNKYKHNG--NPPNLQENQASHPNAAVR----ENQPRLSNISSPSPNFW--------RLSTDASWTDAASR
        +++ + + S+   + + Q   + W +W  RN++ + G    P L   +AS      R    +N P  S     S   W        ++  DA+   +  +
Subjt:  INLWEAMISKLDDRNIAQATMILWNLWNYRNKYKHNG--NPPNLQENQASHPNAAVR----ENQPRLSNISSPSPNFW--------RLSTDASWTDAASR

Query:  GGLGWSLHDSNGSLYAIGCKQVIRKWSIKCLEAKAIMEGLK
         GLG  + +SN  + A   K+V  K ++ C+EA+AI+ G++
Subjt:  GGLGWSLHDSNGSLYAIGCKQVIRKWSIKCLEAKAIMEGLK

XP_031116510.1 uncharacterized protein LOC116020169 [Ipomoea triloba]7.4e-0926.32Show/hide
Query:  WDERRIREAVSPCDREDILNIPRCSPSSNDEILWDEDSKGVFSVKSAYRLALNLESCKGASCSDELKDKSLWKSVNKEESTGHLDRGELINLWEAM----
        WD   +R+  +  D E ILN+P       D  +W+ DSKG ++VKS YR  L           D+     LW    +  S   +D  E    W A+    
Subjt:  WDERRIREAVSPCDREDILNIPRCSPSSNDEILWDEDSKGVFSVKSAYRLALNLESCKGASCSDELKDKSLWKSVNKEESTGHLDRGELINLWEAM----

Query:  ---------------ISKLDDRNIAQATMILWNLWNYRNKYKHNGNPPNLQENQASHPN-----AAVRENQP-------RLSNISSPSPNFWRLSTDASW
                       I KL+  ++ +  M  W LW  RN Y  NG P  + +   S  +       V E Q        R    S PS    +L+TD + 
Subjt:  ---------------ISKLDDRNIAQATMILWNLWNYRNKYKHNGNPPNLQENQASHPN-----AAVRENQP-------RLSNISSPSPNFWRLSTDASW

Query:  TDAASRGGLGWSLHDSNGSLYAIGCKQVIRKWSIKCLEAKAIMEGLKSEKADNVSFVKCPREKVKEAHNLARASISHGDFVGFFG
        +   S  G+GW + D+ G       K+    +S+K  EA +I E L   K  ++  V      +K    L   ++S   F   FG
Subjt:  TDAASRGGLGWSLHDSNGSLYAIGCKQVIRKWSIKCLEAKAIMEGLKSEKADNVSFVKCPREKVKEAHNLARASISHGDFVGFFG

XP_031120971.1 uncharacterized protein LOC116024211 [Ipomoea triloba]2.2e-0827.71Show/hide
Query:  WDERRIREAVSPCDREDILNIPRCSPSSNDEILWDEDSKGVFSVKSAYRLALNLESCKGASCSDELKDKSLW-KSVNKEESTGHL--DRGELINLWEAM-
        WD   +R+  +  D E ILN+P       D  +W+ DSKG ++VKS YR  L            E  D+  W K  N + +  HL  D  E    W A+ 
Subjt:  WDERRIREAVSPCDREDILNIPRCSPSSNDEILWDEDSKGVFSVKSAYRLALNLESCKGASCSDELKDKSLW-KSVNKEESTGHL--DRGELINLWEAM-

Query:  ------------------ISKLDDRNIAQATMILWNLWNYRNKYKHNGNPPNLQENQASHPN-----AAVRENQP-------RLSNISSPSPNFWRLSTD
                          I KL+  ++ +  M  W LW  RN Y  NG P  + +   S  +       V E Q        R    S PS    +L+ D
Subjt:  ------------------ISKLDDRNIAQATMILWNLWNYRNKYKHNGNPPNLQENQASHPN-----AAVRENQP-------RLSNISSPSPNFWRLSTD

Query:  ASWTDAASRGGLGWSLHDSNGSLYAIGCKQVIRKWSIKCLEAKAIMEGL
         + +   S  G+GW + D+ G   A   K+    +S+K  E  +I E L
Subjt:  ASWTDAASRGGLGWSLHDSNGSLYAIGCKQVIRKWSIKCLEAKAIMEGL

TrEMBL top hitse value%identityAlignment
A0A2K2DNF3 RNase H domain-containing protein1.5e-0726.07Show/hide
Query:  WDERRIREAVSPCDREDILNIPRCSPSSNDEILWDEDSKGVFSVKSAYRLAL------NLESCKG-------ASCSDELKD-----------KSLWKSVN
        WDE  +R  +   D E +L IP     S+D I W  DSKG FSVKSAY++ L          C G       A C    ++           K+ W+++ 
Subjt:  WDERRIREAVSPCDREDILNIPRCSPSSNDEILWDEDSKGVFSVKSAYRLAL------NLESCKG-------ASCSDELKD-----------KSLWKSVN

Query:  KEESTGHLDRGELINLWEAMISKLDDRNIAQATMILWNLWNYRNKYKHNGNPPNLQENQASHPN-------AAVRENQPRLSNISS---PSPNFWRLSTD
         EE+   L           +I++L           LW  WN RNK        ++ E Q    +       A  +E+   + ++SS   P  +F +++ D
Subjt:  KEESTGHLDRGELINLWEAMISKLDDRNIAQATMILWNLWNYRNKYKHNGNPPNLQENQASHPN-------AAVRENQPRLSNISS---PSPNFWRLSTD

Query:  ASWTDAASRGGLGWSLHDSNGSLYAIGCKQVIRKWSIKCLEAKAIMEGLKSEKADNV
         ++   + RGG GW   D  G +       ++R       EA+A++ G+ + K  NV
Subjt:  ASWTDAASRGGLGWSLHDSNGSLYAIGCKQVIRKWSIKCLEAKAIMEGLKSEKADNV

A0A2N9ENX3 Reverse transcriptase domain-containing protein2.1e-0925.77Show/hide
Query:  WDERRIREAVSPCDREDILNIPRCSPSSNDEILWDEDSKGVFSVKSAYRLALNLESCKGASCSDELKDKSLWKSVNKEESTGHLDRGELINLWEAMISKL
        W++  I    +P D + I  +P  S  S D+++W  +  G +SV+SAYR+   +++  G SCSD     +L + +++   +  LD   L +L++    +L
Subjt:  WDERRIREAVSPCDREDILNIPRCSPSSNDEILWDEDSKGVFSVKSAYRLALNLESCKGASCSDELKDKSLWKSVNKEESTGHLDRGELINLWEAMISKL

Query:  DDRNIAQATMILWNLWNYRNK--YKHNGNP----PNL-----QENQASHPNAAVRENQPRLSNISSPSPNFWRLSTDASWTDAASRGGLGWSLHDSNGSL
            IA+   +LW +WN RNK  Y++  +P    P L      E   +H         P L     PS   ++++ DA+ + A +R G+G  + D  G  
Subjt:  DDRNIAQATMILWNLWNYRNK--YKHNGNP----PNL-----QENQASHPNAAVRENQPRLSNISSPSPNFWRLSTDASWTDAASRGGLGWSLHDSNGSL

Query:  YAIGCKQVIRKWSIKCLEAKAIMEGLKSEKADNVSFVKCPREKVKEAHNLARASISHGDF
         A  CK+    ++I   +A A  E L+      +S  +   + +       R  +S+  F
Subjt:  YAIGCKQVIRKWSIKCLEAKAIMEGLKSEKADNVSFVKCPREKVKEAHNLARASISHGDF

A0A2N9FBK3 Reverse transcriptase domain-containing protein2.1e-0925.77Show/hide
Query:  WDERRIREAVSPCDREDILNIPRCSPSSNDEILWDEDSKGVFSVKSAYRLALNLESCKGASCSDELKDKSLWKSVNKEESTGHLDRGELINLWEAMISKL
        W++  I    +P D + I  +P  S  S D+++W  +  G +SV+SAYR+   +++  G SCSD     +L + +++   +  LD   L +L++    +L
Subjt:  WDERRIREAVSPCDREDILNIPRCSPSSNDEILWDEDSKGVFSVKSAYRLALNLESCKGASCSDELKDKSLWKSVNKEESTGHLDRGELINLWEAMISKL

Query:  DDRNIAQATMILWNLWNYRNK--YKHNGNP----PNL-----QENQASHPNAAVRENQPRLSNISSPSPNFWRLSTDASWTDAASRGGLGWSLHDSNGSL
            IA+   +LW +WN RNK  Y++  +P    P L      E   +H         P L     PS   ++++ DA+ + A +R G+G  + D  G  
Subjt:  DDRNIAQATMILWNLWNYRNK--YKHNGNP----PNL-----QENQASHPNAAVRENQPRLSNISSPSPNFWRLSTDASWTDAASRGGLGWSLHDSNGSL

Query:  YAIGCKQVIRKWSIKCLEAKAIMEGLKSEKADNVSFVKCPREKVKEAHNLARASISHGDF
         A  CK+    ++I   +A A  E L+      +S  +   + +       R  +S+  F
Subjt:  YAIGCKQVIRKWSIKCLEAKAIMEGLKSEKADNVSFVKCPREKVKEAHNLARASISHGDF

A0A7N2L5N1 Uncharacterized protein3.0e-0826.14Show/hide
Query:  DSERGWDERRIREAVSPCDREDILNIPRCSPSSNDEILWDEDSKGVFSVKSAYRLALNLESCKGASCSDELKDKSLWKSV-------NKEESTGHL----
        +  R WD  +I    +   R +IL  P  +  + D ++W E+    FSVK+AY +AL L+  +    S    DKS WK +        K E+TGH+    
Subjt:  DSERGWDERRIREAVSPCDREDILNIPRCSPSSNDEILWDEDSKGVFSVKSAYRLALNLESCKGASCSDELKDKSLWKSV-------NKEESTGHL----

Query:  -------------------DRGELINLWEAMISKLDDRNIAQATMILWNLWNYRNKYKHNGNPPNLQENQASHPNA
                           +  +   L+  M+  LD R++ +   + W++WN RNK+          E++ +HP A
Subjt:  -------------------DRGELINLWEAMISKLDDRNIAQATMILWNLWNYRNKYKHNGNPPNLQENQASHPNA

B8BN96 Reverse transcriptase domain-containing protein5.2e-0826.32Show/hide
Query:  WDERRIREAVSPCDREDILNIPRCSPSSNDEILWDEDSKGVFSVKSAYRLALNLESCKGASCSDELKDKSLWKSVNKEESTGHLDRGELINLWEAMISKL
        WD  +I +     D E ILNI   S S  D I W  D  G+FSV+SAYRLA  L + + +S S        W+ + K +    + +   I  W    + L
Subjt:  WDERRIREAVSPCDREDILNIPRCSPSSNDEILWDEDSKGVFSVKSAYRLALNLESCKGASCSDELKDKSLWKSVNKEESTGHLDRGELINLWEAMISKL

Query:  DDRNIAQATMILWNLWNYRNKYKHNGNPPNLQENQ-------------ASHPNAAVRENQPRLSNI------------------SSPSPNFWRLSTDASW
            +A   M LW  W  RN+  H  + P  + +Q                P A + + +  +  +                    P   + +L+ D S+
Subjt:  DDRNIAQATMILWNLWNYRNKYKHNGNPPNLQENQ-------------ASHPNAAVRENQPRLSNI------------------SSPSPNFWRLSTDASW

Query:  TDAASRGGLGWSLHDSNGSLYAIGCKQVIRKWSIKCLEAKAIMEGLK
          ++ +GGLG  L +S G +    CK + R  +    E +A +EGLK
Subjt:  TDAASRGGLGWSLHDSNGSLYAIGCKQVIRKWSIKCLEAKAIMEGLK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACAGTGAAAGAGGGTGGGATGAGAGGAGAATAAGAGAAGCAGTTTCCCCATGTGACAGAGAAGACATCCTTAACATACCAAGATGTTCACCAAGCTCTAACGATGA
GATTCTATGGGATGAAGACTCCAAAGGCGTCTTCTCGGTGAAGAGCGCATACAGATTAGCCTTAAATTTGGAGAGCTGCAAGGGAGCTTCATGCTCAGATGAGTTAAAAG
ATAAATCCCTGTGGAAGTCAGTGAACAAAGAGGAGTCAACAGGGCACTTGGACAGGGGGGAGCTGATCAACCTCTGGGAAGCTATGATTAGCAAATTGGATGATAGAAAC
ATAGCTCAGGCTACCATGATTTTGTGGAATCTATGGAACTACAGGAACAAATACAAGCACAATGGGAACCCTCCAAATTTACAAGAAAACCAAGCGAGTCATCCAAATGC
AGCCGTTCGGGAGAATCAGCCAAGACTCTCCAACATTTCTTCGCCCAGCCCCAATTTCTGGAGACTTTCTACAGATGCGTCGTGGACCGATGCAGCAAGCAGGGGTGGGT
TAGGCTGGTCTTTGCATGACTCGAATGGGTCTCTGTATGCAATCGGTTGTAAGCAGGTGATCAGGAAATGGTCGATAAAGTGCCTTGAAGCTAAGGCGATCATGGAAGGG
TTGAAGTCGGAGAAAGCAGATAATGTTTCTTTTGTCAAATGCCCGAGGGAGAAAGTCAAGGAGGCTCACAATCTGGCGAGAGCTTCAATTTCTCATGGGGATTTTGTTGG
TTTTTTTGGGCGAGATCCTTTCCCCAATTTGGGGGATCTGACTCGCATGAGGGATGTTGTTTTCCCTCCGTGGATTTTAGATTGCTTAGTTAATGACGTTGGTGTAACTA
CCTGCCAGTTCTTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGACAGTGAAAGAGGGTGGGATGAGAGGAGAATAAGAGAAGCAGTTTCCCCATGTGACAGAGAAGACATCCTTAACATACCAAGATGTTCACCAAGCTCTAACGATGA
GATTCTATGGGATGAAGACTCCAAAGGCGTCTTCTCGGTGAAGAGCGCATACAGATTAGCCTTAAATTTGGAGAGCTGCAAGGGAGCTTCATGCTCAGATGAGTTAAAAG
ATAAATCCCTGTGGAAGTCAGTGAACAAAGAGGAGTCAACAGGGCACTTGGACAGGGGGGAGCTGATCAACCTCTGGGAAGCTATGATTAGCAAATTGGATGATAGAAAC
ATAGCTCAGGCTACCATGATTTTGTGGAATCTATGGAACTACAGGAACAAATACAAGCACAATGGGAACCCTCCAAATTTACAAGAAAACCAAGCGAGTCATCCAAATGC
AGCCGTTCGGGAGAATCAGCCAAGACTCTCCAACATTTCTTCGCCCAGCCCCAATTTCTGGAGACTTTCTACAGATGCGTCGTGGACCGATGCAGCAAGCAGGGGTGGGT
TAGGCTGGTCTTTGCATGACTCGAATGGGTCTCTGTATGCAATCGGTTGTAAGCAGGTGATCAGGAAATGGTCGATAAAGTGCCTTGAAGCTAAGGCGATCATGGAAGGG
TTGAAGTCGGAGAAAGCAGATAATGTTTCTTTTGTCAAATGCCCGAGGGAGAAAGTCAAGGAGGCTCACAATCTGGCGAGAGCTTCAATTTCTCATGGGGATTTTGTTGG
TTTTTTTGGGCGAGATCCTTTCCCCAATTTGGGGGATCTGACTCGCATGAGGGATGTTGTTTTCCCTCCGTGGATTTTAGATTGCTTAGTTAATGACGTTGGTGTAACTA
CCTGCCAGTTCTTTTAA
Protein sequenceShow/hide protein sequence
MDSERGWDERRIREAVSPCDREDILNIPRCSPSSNDEILWDEDSKGVFSVKSAYRLALNLESCKGASCSDELKDKSLWKSVNKEESTGHLDRGELINLWEAMISKLDDRN
IAQATMILWNLWNYRNKYKHNGNPPNLQENQASHPNAAVRENQPRLSNISSPSPNFWRLSTDASWTDAASRGGLGWSLHDSNGSLYAIGCKQVIRKWSIKCLEAKAIMEG
LKSEKADNVSFVKCPREKVKEAHNLARASISHGDFVGFFGRDPFPNLGDLTRMRDVVFPPWILDCLVNDVGVTTCQFF