CuGenDBv2

Lag0028847 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0028847
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Ribonuclease H
Genome location: chr8:31760358..31761697
RNA-Seq Expression: Lag0028847
Synteny: Lag0028847
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
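
The genome location field above uses a `chromosome:start..end` notation. As a minimal sketch of how such a string could be split into its parts, the hypothetical helper `parse_location` below assumes the coordinates are 1-based and inclusive (an assumption; the record does not state the coordinate convention), so the span length is `end - start + 1`.

```python
import re


def parse_location(text):
    """Split a 'chrom:start..end' string into (chrom, start, end, length).

    Assumes 1-based, inclusive coordinates; this convention is an
    assumption and is not stated in the record itself.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    if end < start:
        raise ValueError("end coordinate precedes start coordinate")
    return chrom, start, end, end - start + 1


if __name__ == "__main__":
    print(parse_location("chr8:31760358..31761697"))
```

Under that assumption, the location listed for Lag0028847 spans 1340 positions on chr8.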


Homology
GenBank top hits
XP_022147761.1 uncharacterized protein LOC111016619 [Momordica charantia]; e value: 4.6e-30; % identity: 38.53
Query:  MKAEMPQKFKVPTFKSYDGKKDPVQHLNAYKSWMDFHGVSDAIRCHAFFFTLTGSARHWFERLKRRSISCFKDLARAFLTQFMGARELRQPHINLLTVKQ
        MK ++P KFK+P  K YDG  DP+ HL+ Y  W D +G+++AIRC  F FTLTGS R WF++LKR+SIS FK+LARAF+TQF G     +P   LLT+KQ
Subjt:  MKAEMPQKFKVPTFKSYDGKKDPVQHLNAYKSWMDFHGVSDAIRCHAFFFTLTGSARHWFERLKRRSISCFKDLARAFLTQFMGARELRQPHINLLTVKQ

Query:  QP-------------------------------------------GKSQPRTYAEFVSRAQKYMSAEELLKSKKTEREHKMSSSSNYDNKKDK-------
        +                                            GK    T++E  SRAQ YMS  EL+ SK+  +       ++Y+ K+ +       
Subjt:  QP-------------------------------------------GKSQPRTYAEFVSRAQKYMSAEELLKSKKTEREHKMSSSSNYDNKKDK-------

Query:  -RQRTDEGGRGRPDQGDYLKKFEKYTPTSVP
         R    + G+GR  Q D  +KFEKYTPT+VP
Subjt:  -RQRTDEGGRGRPDQGDYLKKFEKYTPTSVP

XP_022150035.1 uncharacterized protein LOC111018307 [Momordica charantia]; e value: 8.2e-27; % identity: 42.42
Query:  EEVMKAEMPQKFKVPTFKSYDGKKDPVQHLNAYKSWMDFHGVSDAIRCHAFFFTLTGSARHWFERLKRRSISCFKDLARAFLTQFMGARELRQPHINLLT
        EE+MK ++P KFK+PT K +DG  D V HL+AY+ WMD +GVS+A++C  F  TL+GSAR WF +LKR SIS FK LA+AF+TQF+G R   +P   LLT
Subjt:  EEVMKAEMPQKFKVPTFKSYDGKKDPVQHLNAYKSWMDFHGVSDAIRCHAFFFTLTGSARHWFERLKRRSISCFKDLARAFLTQFMGARELRQPHINLLT

Query:  VKQQP-------------------------------------------GKSQPRTYAEFVSRAQKYMSAEELLKSKKTEREHKMSSSSNYDNKKDKRQ
        +KQ+                                            GK  P T++E +SRAQKYMSA E   SK+ E E K  S  N +   DK Q
Subjt:  VKQQP-------------------------------------------GKSQPRTYAEFVSRAQKYMSAEELLKSKKTEREHKMSSSSNYDNKKDKRQ

XP_022158344.1 uncharacterized protein LOC111024851 [Momordica charantia]; e value: 8.2e-27; % identity: 38.18
Query:  MKAEMPQKFKVPTFKSYDGKKDPVQHLNAYKSWMDFHGVSDAIRCHAFFFTLTGSARHWFERLKRRSISCFKDLARAFLTQFMGARELRQPHINLLTVKQ
        MK +   KFK+P    YDG  DP+ HL+AY+ W D + + +AIRC  F FTLTGSAR+WF +LKR SIS FK+LA AF+TQF+G R   +P   LLT+KQ
Subjt:  MKAEMPQKFKVPTFKSYDGKKDPVQHLNAYKSWMDFHGVSDAIRCHAFFFTLTGSARHWFERLKRRSISCFKDLARAFLTQFMGARELRQPHINLLTVKQ

Query:  QP-------------------------------------------GKSQPRTYAEFVSRAQKYMSAEELLKSKKTEREHKMSSSSNYDNKKDKRQRT---
        +                                            GK  P T+ E +SRAQKYMSA EL+   +     + + S+  + + +KR R+   
Subjt:  QP-------------------------------------------GKSQPRTYAEFVSRAQKYMSAEELLKSKKTEREHKMSSSSNYDNKKDKRQRT---

Query:  -DEGGRGRPDQGDYLKKFEK
          + G GR  Q D   KFEK
Subjt:  -DEGGRGRPDQGDYLKKFEK

XP_022158830.1 uncharacterized protein LOC111025293 [Momordica charantia]; e value: 5.6e-36; % identity: 36.29
Query:  LIRDPRKGKNPVEYVDELETESKGKKTNNTTSKVR-GLKHTERTVLRSPESSTSRRTDLRNLIEEKHRVAKTAESKARAAEAEAKAAEAEAKAAEAEAKK
        L+RDP+KGK P     E +TE   + TN+  SK+R G    +RT +  P     R+T      +++H+       K+   +  ++    +      + K 
Subjt:  LIRDPRKGKNPVEYVDELETESKGKKTNNTTSKVR-GLKHTERTVLRSPESSTSRRTDLRNLIEEKHRVAKTAESKARAAEAEAKAAEAEAKAAEAEAKK

Query:  DHLPWKTELLNTLKEHGNPQGDLPKLKDSGGQDMEELIDQVDPPFTEEVMKAEMPQKFKVPTFKSYDGKKDPVQHLNAYKSWMDFHGVSDAIRCHAFFFT
           P  +E  ++ KE               G D+EEL+DQ D PFTEE+M+ ++P KFK+PT K +D   DPV HL+AY+ WMD +GVS+A+RC  F  T
Subjt:  DHLPWKTELLNTLKEHGNPQGDLPKLKDSGGQDMEELIDQVDPPFTEEVMKAEMPQKFKVPTFKSYDGKKDPVQHLNAYKSWMDFHGVSDAIRCHAFFFT

Query:  LTGSARHWFERLKRRSISCFKDLARAFLTQFMGARELRQPHINLLTVKQQP-------------------------------------------GKSQPR
        L GSAR WF +LKR SIS FK LARAF+TQF+G R   +P   LLT+KQ+                                            GK  P 
Subjt:  LTGSARHWFERLKRRSISCFKDLARAFLTQFMGARELRQPHINLLTVKQQP-------------------------------------------GKSQPR

Query:  TYAEFVSRAQKYMSAEELLKSKKTEREHKMSSSSNYDNKKDKRQRTDEGGRGRPDQGDYLKKFEKYTPTSVP
        T++E +SRAQ+YMSA E   SK+ E + K +     +   DK Q +    R R  Q D  +KFEKYTPT+VP
Subjt:  TYAEFVSRAQKYMSAEELLKSKKTEREHKMSSSSNYDNKKDKRQRTDEGGRGRPDQGDYLKKFEKYTPTSVP

XP_022159109.1 uncharacterized protein LOC111025548 [Momordica charantia]; e value: 1.7e-32; % identity: 42.06
Query:  GQDMEELIDQVDPPFTEEVMKAEMPQKFKVPTFKSYDGKKDPVQHLNAYKSWMDFHGVSDAIRCHAFFFTLTGSARHWFERLKRRSISCFKDLARAFLTQ
        G D+EEL+ Q D PFTEE+M+ ++P KFK+PT K +DG  +PV HL+AY+ WMD +GVSDAIRC  F  TL GSAR WF +LKR SIS FK LARAF+TQ
Subjt:  GQDMEELIDQVDPPFTEEVMKAEMPQKFKVPTFKSYDGKKDPVQHLNAYKSWMDFHGVSDAIRCHAFFFTLTGSARHWFERLKRRSISCFKDLARAFLTQ

Query:  FMGARELRQPHINLLTVKQQPGKS-------------------------------------------QPRTYAEFVSRAQKYMSAEELLKSKKTEREHKM
        F+G R   +P   LLT+KQ+  +S                                            P T++E +SRAQ+YMSA E   SK+ E + K 
Subjt:  FMGARELRQPHINLLTVKQQPGKS-------------------------------------------QPRTYAEFVSRAQKYMSAEELLKSKKTEREHKM

Query:  SSSSNYDNKKDKRQRTDEGGRGRPDQGDYLKKF
        +     +   DK Q +    R R  Q D  +KF
Subjt:  SSSSNYDNKKDKRQRTDEGGRGRPDQGDYLKKF

TrEMBL top hits
A0A6J1D3B7 uncharacterized protein LOC111016619; e value: 2.2e-30; % identity: 38.53
Query:  MKAEMPQKFKVPTFKSYDGKKDPVQHLNAYKSWMDFHGVSDAIRCHAFFFTLTGSARHWFERLKRRSISCFKDLARAFLTQFMGARELRQPHINLLTVKQ
        MK ++P KFK+P  K YDG  DP+ HL+ Y  W D +G+++AIRC  F FTLTGS R WF++LKR+SIS FK+LARAF+TQF G     +P   LLT+KQ
Subjt:  MKAEMPQKFKVPTFKSYDGKKDPVQHLNAYKSWMDFHGVSDAIRCHAFFFTLTGSARHWFERLKRRSISCFKDLARAFLTQFMGARELRQPHINLLTVKQ

Query:  QP-------------------------------------------GKSQPRTYAEFVSRAQKYMSAEELLKSKKTEREHKMSSSSNYDNKKDK-------
        +                                            GK    T++E  SRAQ YMS  EL+ SK+  +       ++Y+ K+ +       
Subjt:  QP-------------------------------------------GKSQPRTYAEFVSRAQKYMSAEELLKSKKTEREHKMSSSSNYDNKKDK-------

Query:  -RQRTDEGGRGRPDQGDYLKKFEKYTPTSVP
         R    + G+GR  Q D  +KFEKYTPT+VP
Subjt:  -RQRTDEGGRGRPDQGDYLKKFEKYTPTSVP

A0A6J1D7D2 uncharacterized protein LOC111018307; e value: 4.0e-27; % identity: 42.42
Query:  EEVMKAEMPQKFKVPTFKSYDGKKDPVQHLNAYKSWMDFHGVSDAIRCHAFFFTLTGSARHWFERLKRRSISCFKDLARAFLTQFMGARELRQPHINLLT
        EE+MK ++P KFK+PT K +DG  D V HL+AY+ WMD +GVS+A++C  F  TL+GSAR WF +LKR SIS FK LA+AF+TQF+G R   +P   LLT
Subjt:  EEVMKAEMPQKFKVPTFKSYDGKKDPVQHLNAYKSWMDFHGVSDAIRCHAFFFTLTGSARHWFERLKRRSISCFKDLARAFLTQFMGARELRQPHINLLT

Query:  VKQQP-------------------------------------------GKSQPRTYAEFVSRAQKYMSAEELLKSKKTEREHKMSSSSNYDNKKDKRQ
        +KQ+                                            GK  P T++E +SRAQKYMSA E   SK+ E E K  S  N +   DK Q
Subjt:  VKQQP-------------------------------------------GKSQPRTYAEFVSRAQKYMSAEELLKSKKTEREHKMSSSSNYDNKKDKRQ

A0A6J1DWY0 uncharacterized protein LOC111025293; e value: 2.7e-36; % identity: 36.29
Query:  LIRDPRKGKNPVEYVDELETESKGKKTNNTTSKVR-GLKHTERTVLRSPESSTSRRTDLRNLIEEKHRVAKTAESKARAAEAEAKAAEAEAKAAEAEAKK
        L+RDP+KGK P     E +TE   + TN+  SK+R G    +RT +  P     R+T      +++H+       K+   +  ++    +      + K 
Subjt:  LIRDPRKGKNPVEYVDELETESKGKKTNNTTSKVR-GLKHTERTVLRSPESSTSRRTDLRNLIEEKHRVAKTAESKARAAEAEAKAAEAEAKAAEAEAKK

Query:  DHLPWKTELLNTLKEHGNPQGDLPKLKDSGGQDMEELIDQVDPPFTEEVMKAEMPQKFKVPTFKSYDGKKDPVQHLNAYKSWMDFHGVSDAIRCHAFFFT
           P  +E  ++ KE               G D+EEL+DQ D PFTEE+M+ ++P KFK+PT K +D   DPV HL+AY+ WMD +GVS+A+RC  F  T
Subjt:  DHLPWKTELLNTLKEHGNPQGDLPKLKDSGGQDMEELIDQVDPPFTEEVMKAEMPQKFKVPTFKSYDGKKDPVQHLNAYKSWMDFHGVSDAIRCHAFFFT

Query:  LTGSARHWFERLKRRSISCFKDLARAFLTQFMGARELRQPHINLLTVKQQP-------------------------------------------GKSQPR
        L GSAR WF +LKR SIS FK LARAF+TQF+G R   +P   LLT+KQ+                                            GK  P 
Subjt:  LTGSARHWFERLKRRSISCFKDLARAFLTQFMGARELRQPHINLLTVKQQP-------------------------------------------GKSQPR

Query:  TYAEFVSRAQKYMSAEELLKSKKTEREHKMSSSSNYDNKKDKRQRTDEGGRGRPDQGDYLKKFEKYTPTSVP
        T++E +SRAQ+YMSA E   SK+ E + K +     +   DK Q +    R R  Q D  +KFEKYTPT+VP
Subjt:  TYAEFVSRAQKYMSAEELLKSKKTEREHKMSSSSNYDNKKDKRQRTDEGGRGRPDQGDYLKKFEKYTPTSVP

A0A6J1DZ49 uncharacterized protein LOC111024851; e value: 4.0e-27; % identity: 38.18
Query:  MKAEMPQKFKVPTFKSYDGKKDPVQHLNAYKSWMDFHGVSDAIRCHAFFFTLTGSARHWFERLKRRSISCFKDLARAFLTQFMGARELRQPHINLLTVKQ
        MK +   KFK+P    YDG  DP+ HL+AY+ W D + + +AIRC  F FTLTGSAR+WF +LKR SIS FK+LA AF+TQF+G R   +P   LLT+KQ
Subjt:  MKAEMPQKFKVPTFKSYDGKKDPVQHLNAYKSWMDFHGVSDAIRCHAFFFTLTGSARHWFERLKRRSISCFKDLARAFLTQFMGARELRQPHINLLTVKQ

Query:  QP-------------------------------------------GKSQPRTYAEFVSRAQKYMSAEELLKSKKTEREHKMSSSSNYDNKKDKRQRT---
        +                                            GK  P T+ E +SRAQKYMSA EL+   +     + + S+  + + +KR R+   
Subjt:  QP-------------------------------------------GKSQPRTYAEFVSRAQKYMSAEELLKSKKTEREHKMSSSSNYDNKKDKRQRT---

Query:  -DEGGRGRPDQGDYLKKFEK
          + G GR  Q D   KFEK
Subjt:  -DEGGRGRPDQGDYLKKFEK

A0A6J1E1E7 uncharacterized protein LOC111025548; e value: 8.2e-33; % identity: 42.06
Query:  GQDMEELIDQVDPPFTEEVMKAEMPQKFKVPTFKSYDGKKDPVQHLNAYKSWMDFHGVSDAIRCHAFFFTLTGSARHWFERLKRRSISCFKDLARAFLTQ
        G D+EEL+ Q D PFTEE+M+ ++P KFK+PT K +DG  +PV HL+AY+ WMD +GVSDAIRC  F  TL GSAR WF +LKR SIS FK LARAF+TQ
Subjt:  GQDMEELIDQVDPPFTEEVMKAEMPQKFKVPTFKSYDGKKDPVQHLNAYKSWMDFHGVSDAIRCHAFFFTLTGSARHWFERLKRRSISCFKDLARAFLTQ

Query:  FMGARELRQPHINLLTVKQQPGKS-------------------------------------------QPRTYAEFVSRAQKYMSAEELLKSKKTEREHKM
        F+G R   +P   LLT+KQ+  +S                                            P T++E +SRAQ+YMSA E   SK+ E + K 
Subjt:  FMGARELRQPHINLLTVKQQPGKS-------------------------------------------QPRTYAEFVSRAQKYMSAEELLKSKKTEREHKM

Query:  SSSSNYDNKKDKRQRTDEGGRGRPDQGDYLKKF
        +     +   DK Q +    R R  Q D  +KF
Subjt:  SSSSNYDNKKDKRQRTDEGGRGRPDQGDYLKKF

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGGTGACCGCCACCAGCGAAGGTCACGAGACGATGATAACATCCGGGGGTCACCGAGACGAACAGGCTGAGGAGCATAGGCCGAGGGGCCGAGGCCAAGCAGAGGATGC
TGATGCCAAAATTGCCGCCCTTGAGGATGAGGTGAAGGGAATGAATCAGAGTTTGTCTAGAATACTCCAGATCCTGGATAAACCCGGTCCTAGCACCAAACTCCATGAGG
GGGGCTTGATTAGAGACCCGAGGAAGGGGAAGAATCCAGTCGAATACGTGGATGAATTAGAGACAGAATCCAAAGGAAAGAAGACCAACAACACAACCAGCAAGGTCAGG
GGGCTGAAGCACACAGAGCGCACAGTACTGAGGAGCCCTGAATCAAGTACCAGCCGTAGAACAGACCTGAGAAATCTGATCGAGGAAAAGCACAGAGTGGCCAAAACTGC
TGAGTCTAAGGCCAGAGCTGCTGAGGCCGAGGCCAAAGCTGCCGAGGCTGAGGCCAAGGCAGCCGAGGCCGAGGCTAAGAAAGACCATCTCCCTTGGAAGACTGAGCTTC
TAAACACACTAAAGGAGCATGGAAATCCTCAGGGAGACCTGCCTAAGTTGAAGGATTCGGGAGGGCAAGACATGGAAGAGCTAATCGACCAAGTCGACCCACCCTTCACA
GAAGAAGTCATGAAAGCTGAGATGCCCCAGAAGTTCAAGGTACCTACATTCAAGTCGTATGATGGCAAGAAAGACCCTGTCCAGCATCTAAATGCCTACAAAAGCTGGAT
GGACTTCCACGGCGTCTCAGATGCAATCAGGTGCCATGCATTCTTTTTCACCCTAACAGGATCAGCCAGGCATTGGTTTGAAAGGCTGAAAAGGAGATCCATCAGCTGTT
TCAAGGATTTAGCCCGAGCATTCCTTACACAGTTCATGGGGGCCAGAGAACTGCGCCAGCCTCACATCAACCTCTTAACAGTCAAACAGCAGCCAGGTAAGAGCCAACCT
CGAACCTATGCGGAGTTTGTCTCCCGGGCACAGAAATACATGAGCGCAGAGGAATTGCTCAAGTCAAAGAAGACGGAACGAGAGCACAAAATGTCTTCTTCATCTAACTA
CGACAATAAGAAGGACAAAAGGCAGCGGACCGACGAGGGAGGCCGAGGCCGACCAGACCAAGGAGACTACTTGAAGAAGTTCGAAAAGTACACCCCTACTTCAGTCCCAT
AG
mRNA sequence
ATGGTGACCGCCACCAGCGAAGGTCACGAGACGATGATAACATCCGGGGGTCACCGAGACGAACAGGCTGAGGAGCATAGGCCGAGGGGCCGAGGCCAAGCAGAGGATGC
TGATGCCAAAATTGCCGCCCTTGAGGATGAGGTGAAGGGAATGAATCAGAGTTTGTCTAGAATACTCCAGATCCTGGATAAACCCGGTCCTAGCACCAAACTCCATGAGG
GGGGCTTGATTAGAGACCCGAGGAAGGGGAAGAATCCAGTCGAATACGTGGATGAATTAGAGACAGAATCCAAAGGAAAGAAGACCAACAACACAACCAGCAAGGTCAGG
GGGCTGAAGCACACAGAGCGCACAGTACTGAGGAGCCCTGAATCAAGTACCAGCCGTAGAACAGACCTGAGAAATCTGATCGAGGAAAAGCACAGAGTGGCCAAAACTGC
TGAGTCTAAGGCCAGAGCTGCTGAGGCCGAGGCCAAAGCTGCCGAGGCTGAGGCCAAGGCAGCCGAGGCCGAGGCTAAGAAAGACCATCTCCCTTGGAAGACTGAGCTTC
TAAACACACTAAAGGAGCATGGAAATCCTCAGGGAGACCTGCCTAAGTTGAAGGATTCGGGAGGGCAAGACATGGAAGAGCTAATCGACCAAGTCGACCCACCCTTCACA
GAAGAAGTCATGAAAGCTGAGATGCCCCAGAAGTTCAAGGTACCTACATTCAAGTCGTATGATGGCAAGAAAGACCCTGTCCAGCATCTAAATGCCTACAAAAGCTGGAT
GGACTTCCACGGCGTCTCAGATGCAATCAGGTGCCATGCATTCTTTTTCACCCTAACAGGATCAGCCAGGCATTGGTTTGAAAGGCTGAAAAGGAGATCCATCAGCTGTT
TCAAGGATTTAGCCCGAGCATTCCTTACACAGTTCATGGGGGCCAGAGAACTGCGCCAGCCTCACATCAACCTCTTAACAGTCAAACAGCAGCCAGGTAAGAGCCAACCT
CGAACCTATGCGGAGTTTGTCTCCCGGGCACAGAAATACATGAGCGCAGAGGAATTGCTCAAGTCAAAGAAGACGGAACGAGAGCACAAAATGTCTTCTTCATCTAACTA
CGACAATAAGAAGGACAAAAGGCAGCGGACCGACGAGGGAGGCCGAGGCCGACCAGACCAAGGAGACTACTTGAAGAAGTTCGAAAAGTACACCCCTACTTCAGTCCCAT
AG
Protein sequence
MVTATSEGHETMITSGGHRDEQAEEHRPRGRGQAEDADAKIAALEDEVKGMNQSLSRILQILDKPGPSTKLHEGGLIRDPRKGKNPVEYVDELETESKGKKTNNTTSKVR
GLKHTERTVLRSPESSTSRRTDLRNLIEEKHRVAKTAESKARAAEAEAKAAEAEAKAAEAEAKKDHLPWKTELLNTLKEHGNPQGDLPKLKDSGGQDMEELIDQVDPPFT
EEVMKAEMPQKFKVPTFKSYDGKKDPVQHLNAYKSWMDFHGVSDAIRCHAFFFTLTGSARHWFERLKRRSISCFKDLARAFLTQFMGARELRQPHINLLTVKQQPGKSQP
RTYAEFVSRAQKYMSAEELLKSKKTEREHKMSSSSNYDNKKDKRQRTDEGGRGRPDQGDYLKKFEKYTPTSVP
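
The CDS and protein sequences above are related by codon-wise translation. The sketch below shows one way to check that relationship without third-party libraries. It assumes the standard genetic code, and the helper name `translate` and the compact codon-table construction are illustrative choices, not part of the database record.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# at each of the three positions. '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino
    for codon, amino in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS string codon by codon, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so a sequence pasted across
    several lines can be passed in directly. Codons containing characters
    other than A, C, G, T are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino = CODON_TABLE.get(seq[i:i + 3], "X")
        if amino == "*":
            break
        protein.append(amino)
    return "".join(protein)


if __name__ == "__main__":
    # First 21 bases and last 9 bases of the CDS listed above.
    print(translate("ATGGTGACCGCCACCAGCGAA"))
    print(translate("GTCCCATAG"))
```

Translating the first seven codons of the listed CDS gives MVTATSE, which matches the start of the listed protein sequence, and the final codons give VP followed by a stop, which matches its end.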