CuGenDBv2

Moc03g00600 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g00600
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr3:443487..444566
RNA-Seq Expression: Moc03g00600
Synteny: Moc03g00600
Gene Ontology terms:
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0008270 - zinc ion binding (molecular function)
    GO:0016740 - transferase activity (molecular function)
InterPro domains:
    IPR001878 - Zinc finger, CCHC-type
    IPR036875 - Zinc finger, CCHC-type superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAE8728571.1 hypothetical protein F3Y22_tig00004205pilonHSYRG00041 [Hibiscus syriacus] (e value: 4.5e-26; % identity: 41.67)
Query:  NICSLVAKETTTKELLKALQDMYEKPSANTKILLWTEYFNIHMDKGTSVNSHINKLIDILNKLEGMSVKIEEEVKAMRLLTSLPDSWEMMKTAMSNSLGE
        N+   +AKE TT  L+ AL  MYEKPSA+ K+ L    FN+ M +G SV  H+N+L  I  +L  + ++ ++EV+A+ LL+SLPDSW    TA+S+S G 
Subjt:  NICSLVAKETTTKELLKALQDMYEKPSANTKILLWTEYFNIHMDKGTSVNSHINKLIDILNKLEGMSVKIEEEVKAMRLLTSLPDSWEMMKTAMSNSLGE

Query:  NSLKFSVICDAALFEEARRKLGKMYASTSEVKNGVESALVVQNKGKT-KMNYNGKRQQRYNRSSESSNGEVKCYYCHKKGHVKRFCMKFKED
        + LKF  + D  L EE RR+     ASTS       SAL  +++G+T + N N  R +     S + N +  CY C KKGH KR C   K+D
Subjt:  NSLKFSVICDAALFEEARRKLGKMYASTSEVKNGVESALVVQNKGKT-KMNYNGKRQQRYNRSSESSNGEVKCYYCHKKGHVKRFCMKFKED

KAG7561662.1 Zinc finger CCHC-type superfamily [Arabidopsis thaliana x Arabidopsis arenosa] (e value: 3.4e-26; % identity: 35.25)
Query:  NICSLVAKETTTKELLKALQDMYEKPSANTKILLWTEYFNIHMDKGTSVNSHINKLIDILNKLEGMSVKIEEEVKAMRLLTSLPDSWEMMKTAMSNSLGE
        N+   VAKE TT+ L+K L DMYEKPSAN K+ L  + F++ M++G  V +H+N+   I+N+L  + ++ ++EV+A+ L+ SLP+SWE M+ A+SNS+G 
Subjt:  NICSLVAKETTTKELLKALQDMYEKPSANTKILLWTEYFNIHMDKGTSVNSHINKLIDILNKLEGMSVKIEEEVKAMRLLTSLPDSWEMMKTAMSNSLGE

Query:  NSLKFSVICDAALFEEARRKLGKMYASTSEVKNGVESALVVQNKGKTKMNYN---GKRQQRYNRSSESSNGEVKCYYCHKKGHVKRFC-MKFKEDFEKGN
          LKF  + D  L EE RR        T E      SA  V+N+G+ +   N   G+ + R  +    S   V+C+ C K GH K  C    K++  KG 
Subjt:  NSLKFSVICDAALFEEARRKLGKMYASTSEVKNGVESALVVQNKGKTKMNYN---GKRQQRYNRSSESSNGEVKCYYCHKKGHVKRFC-MKFKEDFEKGN

Query:  NIANVATGEERIEEFLACKLNRESKVVATGHKRTSVYVSEFEVP
          AN  T E      ++    R+ +   T    T   +S  + P
Subjt:  NIANVATGEERIEEFLACKLNRESKVVATGHKRTSVYVSEFEVP

KAG7584790.1 Zinc finger CCHC-type superfamily [Arabidopsis thaliana x Arabidopsis arenosa] (e value: 2.6e-26; % identity: 39.23)
Query:  NICSLVAKETTTKELLKALQDMYEKPSANTKILLWTEYFNIHMDKGTSVNSHINKLIDILNKLEGMSVKIEEEVKAMRLLTSLPDSWEMMKTAMSNSLGE
        N+   VAKE TT+ L+K L DMYEKPSAN K+ L  + F++ M++G  V +H+N+   I+N+L  + ++ ++EV+A+ LL SLP+SWE M+ A+SNS+G 
Subjt:  NICSLVAKETTTKELLKALQDMYEKPSANTKILLWTEYFNIHMDKGTSVNSHINKLIDILNKLEGMSVKIEEEVKAMRLLTSLPDSWEMMKTAMSNSLGE

Query:  NSLKFSVICDAALFEEARRKLGKMYASTSEVKNGVESALVVQNKGKTKMNYN---GKRQQRYNRSSESSNGEVKCYYCHKKGHVKRFC-MKFKEDFEKGN
          LKF  + D  L EE RR        T E    + SA  V+N+G+ +   N   G+ + R  +    S   V+C+ C K GH K  C    K++  KG 
Subjt:  NSLKFSVICDAALFEEARRKLGKMYASTSEVKNGVESALVVQNKGKTKMNYN---GKRQQRYNRSSESSNGEVKCYYCHKKGHVKRFC-MKFKEDFEKGN

Query:  NIANVATGE
          AN  T E
Subjt:  NIANVATGE

RVW44710.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] (e value: 2.0e-26; % identity: 36.8)
Query:  NICSLVAKETTTKELLKALQDMYEKPSANTKILLWTEYFNIHMDKGTSVNSHINKLIDILNKLEGMSVKIEEEVKAMRLLTSLPDSWEMMKTAMSNSLGE
        +I   V KE TT +L+KAL DMYEKPSAN K+ L  + FN+ M K  SV  H+N+   I N+L  + +  ++E++A+ +L SLP+SWE M+ A+SNS G+
Subjt:  NICSLVAKETTTKELLKALQDMYEKPSANTKILLWTEYFNIHMDKGTSVNSHINKLIDILNKLEGMSVKIEEEVKAMRLLTSLPDSWEMMKTAMSNSLGE

Query:  NSLKFSVICDAALFEEARRKLGKMYASTSEVKNGVESALVVQNKGK-----TKMNYNGKRQQRYNRSSESSNGEVKCYYCHKKGHVKRFCMKFKEDFEKG
          LK++ I D  L EE RR+        +   +G  SAL ++ +G+     +    +  R    NRS   S  +V+C+ C K GH KR C   K+  E  
Subjt:  NSLKFSVICDAALFEEARRKLGKMYASTSEVKNGVESALVVQNKGK-----TKMNYNGKRQQRYNRSSESSNGEVKCYYCHKKGHVKRFCMKFKEDFEKG

Query:  NNIANVATGEERIEEFLACKLNRESKVVATG
        ++ AN  T E +    LA     +  V+ +G
Subjt:  NNIANVATGEERIEEFLACKLNRESKVVATG

RZC29599.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Glycine soja] (e value: 2.6e-26; % identity: 39.56)
Query:  NICSLVAKETTTKELLKALQDMYEKPSANTKILLWTEYFNIHMDKGTSVNSHINKLIDILNKLEGMSVKIEEEVKAMRLLTSLPDSWEMMKTAMSNSLGE
        N+   +  E TT  L+KAL DMYEKPSA  K+ L    FN+ M +G SV  HIN+   IL +LE + +K E+EVKA+ LL+SLPDSW    TA+S+S  E
Subjt:  NICSLVAKETTTKELLKALQDMYEKPSANTKILLWTEYFNIHMDKGTSVNSHINKLIDILNKLEGMSVKIEEEVKAMRLLTSLPDSWEMMKTAMSNSLGE

Query:  NSLKFSVICDAALFEEARRKLGKMYASTSEVKNGVESALVVQNKGK-TKMNYNGK-RQQRYNRSSESSNGEVKCYYCHKKGHVKRFCMKFKEDFEKGN--
        N+LK S I D  L E+ R++      S+S V N   SAL  + +G+ T+   NG+ R +   +       +V C+ C K+GH    C   K++    N  
Subjt:  NSLKFSVICDAALFEEARRKLGKMYASTSEVKNGVESALVVQNKGK-TKMNYNGK-RQQRYNRSSESSNGEVKCYYCHKKGHVKRFCMKFKEDFEKGN--

Query:  ----NIANVATGEERIEEFLACKLN
              ANVAT E  +++ L C L+
Subjt:  ----NIANVATGEERIEEFLACKLN

TrEMBL top hits (e value, % identity, alignment)
A0A0D3AEM1 CCHC-type domain-containing protein (e value: 1.8e-28; % identity: 38.86)
Query:  NICSLVAKETTTKELLKALQDMYEKPSANTKILLWTEYFNIHMDKGTSVNSHINKLIDILNKLEGMSVKIEEEVKAMRLLTSLPDSWEMMKTAMSNSLGE
        N+   V KE TT+ L+K L DMYEKPSAN+K+ L  + F++ M++G  V +HIN+   I+N+L  + ++ E+EV+A+ LL SLP+SWE M+ A+SNS+G 
Subjt:  NICSLVAKETTTKELLKALQDMYEKPSANTKILLWTEYFNIHMDKGTSVNSHINKLIDILNKLEGMSVKIEEEVKAMRLLTSLPDSWEMMKTAMSNSLGE

Query:  NSLKFSVICDAALFEEARRKLGKMYASTSEVKNGVESALVVQNKGKT---KMNYNGKRQQRYNRSSESSNGEVKCYYCHKKGHVKRFCM--KFKEDFEKG
          LKF+ + D  L EE RR +    ASTS       SA  V+N+G+        NG+ + R  R         +C+ C K GH+K+ C     KED  +G
Subjt:  NSLKFSVICDAALFEEARRKLGKMYASTSEVKNGVESALVVQNKGKT---KMNYNGKRQQRYNRSSESSNGEVKCYYCHKKGHVKRFCM--KFKEDFEKG

Query:  NNIANVATGEERIEEFLACKLNRESKVVA
           AN  T E  I++ L   ++   K+ A
Subjt:  NNIANVATGEERIEEFLACKLNRESKVVA

A0A2N9GHK9 Uncharacterized protein (e value: 6.7e-28; % identity: 38.29)
Query:  VAKETTTKELLKALQDMYEKPSANTKILLWTEYFNIHMDKGTSVNSHINKLIDILNKLEGMSVKIEEEVKAMRLLTSLPDSWEMMKTAMSNSLGENSLKF
        V KE TT EL+ AL  MYEKPSAN K+ L  + FN+ M +GT+V  H+N+   I N+L  + ++ ++E++A+ +L SLP+SWE M+ A+SNS G+  LK+
Subjt:  VAKETTTKELLKALQDMYEKPSANTKILLWTEYFNIHMDKGTSVNSHINKLIDILNKLEGMSVKIEEEVKAMRLLTSLPDSWEMMKTAMSNSLGENSLKF

Query:  SVICDAALFEEARRKLGKMYASTSEVKNGVESALVVQNKGKTKMNYN-GKRQQRYNRSSESSNGEVKCYYCHKKGHVKRFCMKFKEDFEKGNNIANVATG
        + I D  L EE RR+     +S+    N     L  + +GK + NYN G+ + R  RS      +++C+ C K GH+++ C + K+  E  N+ ANV T 
Subjt:  SVICDAALFEEARRKLGKMYASTSEVKNGVESALVVQNKGKTKMNYN-GKRQQRYNRSSESSNGEVKCYYCHKKGHVKRFCMKFKEDFEKGNNIANVATG

Query:  EERIEEFLACKLNRESKVVATG
        E      L+     ES V+ +G
Subjt:  EERIEEFLACKLNRESKVVATG

A0A2N9IKI1 Uncharacterized protein (e value: 6.7e-28; % identity: 38.29)
Query:  VAKETTTKELLKALQDMYEKPSANTKILLWTEYFNIHMDKGTSVNSHINKLIDILNKLEGMSVKIEEEVKAMRLLTSLPDSWEMMKTAMSNSLGENSLKF
        V KE TT EL+ AL  MYEKPSAN K+ L  + FN+ M +GT+V  H+N+   I N+L  + ++ ++E++A+ +L SLP+SWE M+ A+SNS G+  LK+
Subjt:  VAKETTTKELLKALQDMYEKPSANTKILLWTEYFNIHMDKGTSVNSHINKLIDILNKLEGMSVKIEEEVKAMRLLTSLPDSWEMMKTAMSNSLGENSLKF

Query:  SVICDAALFEEARRKLGKMYASTSEVKNGVESALVVQNKGKTKMNYN-GKRQQRYNRSSESSNGEVKCYYCHKKGHVKRFCMKFKEDFEKGNNIANVATG
        + I D  L EE RR+     +S+    N     L  + +GK + NYN G+ + R  RS      +++C+ C K GH+++ C + K+  E  N+ ANV T 
Subjt:  SVICDAALFEEARRKLGKMYASTSEVKNGVESALVVQNKGKTKMNYN-GKRQQRYNRSSESSNGEVKCYYCHKKGHVKRFCMKFKEDFEKGNNIANVATG

Query:  EERIEEFLACKLNRESKVVATG
        E      L+     ES V+ +G
Subjt:  EERIEEFLACKLNRESKVVATG

A0A2N9IPG8 Uncharacterized protein (e value: 6.7e-28; % identity: 38.29)
Query:  VAKETTTKELLKALQDMYEKPSANTKILLWTEYFNIHMDKGTSVNSHINKLIDILNKLEGMSVKIEEEVKAMRLLTSLPDSWEMMKTAMSNSLGENSLKF
        V KE TT EL+ AL  MYEKPSAN K+ L  + FN+ M +GT+V  H+N+   I N+L  + ++ ++E++A+ +L SLP+SWE M+ A+SNS G+  LK+
Subjt:  VAKETTTKELLKALQDMYEKPSANTKILLWTEYFNIHMDKGTSVNSHINKLIDILNKLEGMSVKIEEEVKAMRLLTSLPDSWEMMKTAMSNSLGENSLKF

Query:  SVICDAALFEEARRKLGKMYASTSEVKNGVESALVVQNKGKTKMNYN-GKRQQRYNRSSESSNGEVKCYYCHKKGHVKRFCMKFKEDFEKGNNIANVATG
        + I D  L EE RR+     +S+    N     L  + +GK + NYN G+ + R  RS      +++C+ C K GH+++ C + K+  E  N+ ANV T 
Subjt:  SVICDAALFEEARRKLGKMYASTSEVKNGVESALVVQNKGKTKMNYN-GKRQQRYNRSSESSNGEVKCYYCHKKGHVKRFCMKFKEDFEKGNNIANVATG

Query:  EERIEEFLACKLNRESKVVATG
        E      L+     ES V+ +G
Subjt:  EERIEEFLACKLNRESKVVATG

A0A2N9J3Y8 Uncharacterized protein (e value: 6.7e-28; % identity: 38.29)
Query:  VAKETTTKELLKALQDMYEKPSANTKILLWTEYFNIHMDKGTSVNSHINKLIDILNKLEGMSVKIEEEVKAMRLLTSLPDSWEMMKTAMSNSLGENSLKF
        V KE TT EL+ AL  MYEKPSAN K+ L  + FN+ M +GT+V  H+N+   I N+L  + ++ ++E++A+ +L SLP+SWE M+ A+SNS G+  LK+
Subjt:  VAKETTTKELLKALQDMYEKPSANTKILLWTEYFNIHMDKGTSVNSHINKLIDILNKLEGMSVKIEEEVKAMRLLTSLPDSWEMMKTAMSNSLGENSLKF

Query:  SVICDAALFEEARRKLGKMYASTSEVKNGVESALVVQNKGKTKMNYN-GKRQQRYNRSSESSNGEVKCYYCHKKGHVKRFCMKFKEDFEKGNNIANVATG
        + I D  L EE RR+     +S+    N     L  + +GK + NYN G+ + R  RS      +++C+ C K GH+++ C + K+  E  N+ ANV T 
Subjt:  SVICDAALFEEARRKLGKMYASTSEVKNGVESALVVQNKGKTKMNYN-GKRQQRYNRSSESSNGEVKCYYCHKKGHVKRFCMKFKEDFEKGNNIANVATG

Query:  EERIEEFLACKLNRESKVVATG
        E      L+     ES V+ +G
Subjt:  EERIEEFLACKLNRESKVVATG

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein (e value: 3.3e-08; % identity: 23.64)
Query:  SLVAKETTTKELLKALQDMYEKPSANTKILLWTEYFNIHMDKGTSVNSHINKLIDILNKLEGMSVKIEEEVKAMRLLTSLPDSWEMMKTAMSNSLGENSL
        +    + T +++L+ L  +YE+ S  +++ L     ++ +    S+ SH +   +++++L     KIEE  K   LL +LP  ++ + TA+  +L E +L
Subjt:  SLVAKETTTKELLKALQDMYEKPSANTKILLWTEYFNIHMDKGTSVNSHINKLIDILNKLEGMSVKIEEEVKAMRLLTSLPDSWEMMKTAMSNSLGENSL

Query:  KFSVICDAALFEEARRKLGKMYASTSEVKNGVESALVVQNKGKTKMNYNGKRQQRYNRSSE-SSNGEVKCYYCHKKGHVKRFCMKFKEDFEKGN--NIAN
          + + +  L +E      K+    ++    V +A+V  N    K N    R  +  +  + +S  +VKC++C ++GH+K+ C  +K      N  N   
Subjt:  KFSVICDAALFEEARRKLGKMYASTSEVKNGVESALVVQNKGKTKMNYNGKRQQRYNRSSE-SSNGEVKCYYCHKKGHVKRFCMKFKEDFEKGN--NIAN

Query:  VATGEERIEEFLACKLNRES
        V T       F+  ++N  S
Subjt:  VATGEERIEEFLACKLNRES

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 3.8e-12; % identity: 26.6)
Query:  NICSLVAKETTTKELLKALQDMYEKPSANTKILLWTEYFNIHMDKGTSVNSHINKLIDILNKLEGMSVKIEEEVKAMRLLTSLPDSWEMMKTAMSNSLGE
        ++ + +  E T + +   L+ +Y   +   K+ L  + + +HM +GT+  SH+N    ++ +L  + VKIEEE KA+ LL SLP S++ + T + +  G+
Subjt:  NICSLVAKETTTKELLKALQDMYEKPSANTKILLWTEYFNIHMDKGTSVNSHINKLIDILNKLEGMSVKIEEEVKAMRLLTSLPDSWEMMKTAMSNSLGE

Query:  NSLKFSVICDAALFEEARRKLGKMYASTSEVKNGVESALVVQNKGKT---KMNYNGKRQQRYNRSSESSNGEVKCYYCHKKGHVKRFC
         +++   +  A L  E  RK  +              AL+ + +G++     N  G+   R    + S +    CY C++ GH KR C
Subjt:  NSLKFSVICDAALFEEARRKLGKMYASTSEVKNGVESALVVQNKGKT---KMNYNGKRQQRYNRSSESSNGEVKCYYCHKKGHVKRFC

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGAATATTTGTAGTTTGGTGGCGAAAGAGACTACAACGAAGGAATTGTTGAAGGCCTTGCAAGACATGTATGAAAAACCTTCTGCCAATACAAAAATACTTTTA
TGGACGGAGTATTTTAATATCCACATGGATAAGGGAACCTCAGTAAATTCCCACATTAATAAGCTCATCGATATCTTGAACAAATTAGAAGGGATGAGTGTCAAG
ATTGAGGAAGAAGTGAAGGCTATGAGACTGTTGACATCTTTGCCTGACAGTTGGGAGATGATGAAGACCGCGATGTCAAATTCGCTAGGGGAAAACAGCTTGAAA
TTTTCAGTTATTTGTGATGCCGCCTTATTTGAGGAAGCTCGAAGAAAATTAGGGAAAATGTATGCATCTACTTCGGAGGTAAAGAACGGGGTTGAATCAGCTTTG
GTAGTTCAGAACAAAGGAAAAACAAAGATGAATTACAATGGGAAGCGGCAGCAGAGATATAACAGGAGTAGTGAAAGTTCCAATGGAGAAGTGAAGTGTTATTAC
TGCCACAAGAAGGGACACGTAAAACGCTTTTGCATGAAGTTCAAAGAAGATTTTGAGAAGGGGAATAATATTGCAAATGTTGCAACAGGAGAAGAACGGATTGAA
GAGTTTCTGGCTTGTAAGCTCAATAGGGAATCCAAGGTGGTGGCAACAGGCCACAAGAGAACTTCTGTTTATGTGTCTGAATTTGAGGTTCCTAGAGGATCTGAA
AGACATAGAATGCACAGAGTAGCTACAGATGGTTCAGACGAGACTTGA
mRNA sequence
ATGAATATTTGTAGTTTGGTGGCGAAAGAGACTACAACGAAGGAATTGTTGAAGGCCTTGCAAGACATGTATGAAAAACCTTCTGCCAATACAAAAATACTTTTA
TGGACGGAGTATTTTAATATCCACATGGATAAGGGAACCTCAGTAAATTCCCACATTAATAAGCTCATCGATATCTTGAACAAATTAGAAGGGATGAGTGTCAAG
ATTGAGGAAGAAGTGAAGGCTATGAGACTGTTGACATCTTTGCCTGACAGTTGGGAGATGATGAAGACCGCGATGTCAAATTCGCTAGGGGAAAACAGCTTGAAA
TTTTCAGTTATTTGTGATGCCGCCTTATTTGAGGAAGCTCGAAGAAAATTAGGGAAAATGTATGCATCTACTTCGGAGGTAAAGAACGGGGTTGAATCAGCTTTG
GTAGTTCAGAACAAAGGAAAAACAAAGATGAATTACAATGGGAAGCGGCAGCAGAGATATAACAGGAGTAGTGAAAGTTCCAATGGAGAAGTGAAGTGTTATTAC
TGCCACAAGAAGGGACACGTAAAACGCTTTTGCATGAAGTTCAAAGAAGATTTTGAGAAGGGGAATAATATTGCAAATGTTGCAACAGGAGAAGAACGGATTGAA
GAGTTTCTGGCTTGTAAGCTCAATAGGGAATCCAAGGTGGTGGCAACAGGCCACAAGAGAACTTCTGTTTATGTGTCTGAATTTGAGGTTCCTAGAGGATCTGAA
AGACATAGAATGCACAGAGTAGCTACAGATGGTTCAGACGAGACTTGA
Protein sequence
MNICSLVAKETTTKELLKALQDMYEKPSANTKILLWTEYFNIHMDKGTSVNSHINKLIDILNKLEGMSVKIEEEVKAMRLLTSLPDSWEMMKTAMSNSLGENSLK
FSVICDAALFEEARRKLGKMYASTSEVKNGVESALVVQNKGKTKMNYNGKRQQRYNRSSESSNGEVKCYYCHKKGHVKRFCMKFKEDFEKGNNIANVATGEERIE
EFLACKLNRESKVVATGHKRTSVYVSEFEVPRGSERHRMHRVATDGSDET
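
Consistency check (illustrative)
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, not part of the database record. It assumes the standard code applies to this nuclear gene, and the names CODON_TABLE, translate and CDS_FIRST_LINE are hypothetical helpers introduced only for this example. Only the first line of the CDS is used here; the remaining lines can be appended in the same way.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering (an assumption:
# the record does not state which translation table was used).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the CDS sequence listed above (105 nt, 35 codons).
CDS_FIRST_LINE = (
    "ATGAATATTTGTAGTTTGGTGGCGAAAGAGACTACAACGAAGGAATTGTTGAAGGCC"
    "TTGCAAGACATGTATGAAAAACCTTCTGCCAATACAAAAATACTTTTA"
)

print(translate(CDS_FIRST_LINE))
```

The printed peptide can be compared with the start of the protein sequence listed above; a mismatch would point to a copy error or a different translation table.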