; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g11280 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g11280
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr7:8728283..8732336
RNA-Seq ExpressionMoc07g11280
SyntenyMoc07g11280
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA3483188.1 reverse transcriptase [Gossypium australe]1.1e-1724.82Show/hide
Query:  LSKILQVYEKASRQCVNLEKSMVCFSPNIISDRRKYLQSILKVRCVESLRHIWGYLQWCIGR---------IPTISASLK----RMFGRRCKAIQDSKGS
        L  ILQ YE  S QCVN  KS V FSPN   + +  +  +L VR   +     G L   IGR         +  IS  ++    RM  +  +  Q+ + S
Subjt:  LSKILQVYEKASRQCVNLEKSMVCFSPNIISDRRKYLQSILKVRCVESLRHIWGYLQWCIGR---------IPTISASLK----RMFGRRCKAIQDSKGS

Query:  FFWRSLLCGRELLKEGLRKRMGDGCSIKVFEDKWIPRESTFRTMSPSISNRGALVADFIT-RSGEWDAAKLGSNMCLEDVEVIMKIPLSKRGMDVVPL--
        + WRS+   R +L++GL  ++G+G  I +  D W+P     R +S    +    VA+ I  ++ EW+   +      ++ + I++IPL+K   D +    
Subjt:  FFWRSLLCGRELLKEGLRKRMGDGCSIKVFEDKWIPRESTFRTMSPSISNRGALVADFIT-RSGEWDAAKLGSNMCLEDVEVIMKIPLSKRGMDVVPL--

Query:  ---CPVYSKKAETKVHALLDYS------RGHQIWQACVASLDRKIRADTDLLSRWENN-----IINGDKCLMCKRRLSGFAITLRDLGMRSNQKMRSSGD
              +S K+  K+   LD S           ++   A L  K+   +  LSRW++       IN D     K  LS   + +RD             +
Subjt:  ---CPVYSKKAETKVHALLDYS------RGHQIWQACVASLDRKIRADTDLLSRWENN-----IINGDKCLMCKRRLSGFAITLRDLGMRSNQKMRSSGD

Query:  SSISGSSIFIAAMTHCFETEYPRLYSELLAIREGLKLAKRLNFCDMMVESDCPEAIHLLQGRMRYRSEVEVWNLEISYLMRDFRRLRFSHVNRKFNGVVD
         S+      I + +   E       +E LA R+ ++L   +   ++++E D    I   +  +  RS++  +  +I     DF+ +RF  ++R  N +  
Subjt:  SSISGSSIFIAAMTHCFETEYPRLYSELLAIREGLKLAKRLNFCDMMVESDCPEAIHLLQGRMRYRSEVEVWNLEISYLMRDFRRLRFSHVNRKFNGVVD

Query:  GLAKEASSSRLSVVDELVSGVAL
         +A +   ++  +   LV GV L
Subjt:  GLAKEASSSRLSVVDELVSGVAL

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]2.0e-3023.92Show/hide
Query:  ECFFLSKILQVYEKASRQCVNLEKSMVCFSPNIISDRRKYLQSILKVRCVESL-----------------------------------------------
        EC  L ++L  Y +AS QC+N  KS + FSPN+  +R++YLQ IL V+ V                                                  
Subjt:  ECFFLSKILQVYEKASRQCVNLEKSMVCFSPNIISDRRKYLQSILKVRCVESL-----------------------------------------------

Query:  ----RHIWGYLQWCIGRIPT--ISASLKRMFGRRCKAIQ---DSKGSFFWRSLLCGRELLKEGLRKRMGDGCSIKVFEDKWIPRESTFRTMSPSISNRGA
            +H+W +LQ      P   +S  LK  + +    +Q   +SK S+FW+  L GR+LL +GLR R+G+G +IK F D W+PR +TF+   P   N GA
Subjt:  ----RHIWGYLQWCIGRIPT--ISASLKRMFGRRCKAIQ---DSKGSFFWRSLLCGRELLKEGLRKRMGDGCSIKVFEDKWIPRESTFRTMSPSISNRGA

Query:  L---VADFITRSGEWDAAKLGSNMCLEDVEVIMKIPLSK-------------------------------------------------------------
        L   VA FIT  G WD   +  + C ED ++I+ +P+S                                                              
Subjt:  L---VADFITRSGEWDAAKLGSNMCLEDVEVIMKIPLSK-------------------------------------------------------------

Query:  ----------------RGMDVVPLCPVYSKKAETKVHALLDYSRGHQIWQ------ACV------------ASLDRKIR-ADTDL--LSRW-----ENNI
                        RG+  +P C +   + E+ +HA     R  QIW+       C+            +SL  ++   D +L  ++ W      N++
Subjt:  ----------------RGMDVVPLCPVYSKKAETKVHALLDYSRGHQIWQ------ACV------------ASLDRKIR-ADTDL--LSRW-----ENNI

Query:  INGDKC----LMCK-------------------RRLSGFAITLRDLGMRSNQKMRSSGDSSISGSSI------------FIAAMTHCFETEYPRLYSELL
        I+G +       C+                   R  S     ++     S+  ++ + D++  G+S              +AA +         L +E+ 
Subjt:  INGDKC----LMCK-------------------RRLSGFAITLRDLGMRSNQKMRSSGDSSISGSSI------------FIAAMTHCFETEYPRLYSELL

Query:  AIREGLKLAKRLNFCDMMVESDCPEAIHLLQGRMRYRSEVEVWNLEISYLMRDFRRLRFSHVNRKFNGVVDGLAKEASSSRLSVVDELVSGVALCFNGWL
         I EGLK A   NF  + VESD   AI L++  +  R + + W +EI  L   F  + FSH +R+ N    GLAK   +S  +    L +     F  WL
Subjt:  AIREGLKLAKRLNFCDMMVESDCPEAIHLLQGRMRYRSEVEVWNLEISYLMRDFRRLRFSHVNRKFNGVVDGLAKEASSSRLSVVDELVSGVALCFNGWL

Query:  FD
         D
Subjt:  FD

XP_030497722.1 uncharacterized protein LOC115713379 [Cannabis sativa]1.3e-1834.69Show/hide
Query:  ADPVECFFLSKILQVYEKASRQCVNLEKSMVCFSPNIISDRRKYLQSILKVR-----CVESLRH----IWGYLQWCIGRIPT--ISASLKRMFGRRCKAI
        A+   C  L ++  VY KAS Q +N  KS + FSPN +SD R  L +   +      C  SL H    +     W I + P+  ++  L   +  R   +
Subjt:  ADPVECFFLSKILQVYEKASRQCVNLEKSMVCFSPNIISDRRKYLQSILKVR-----CVESLRH----IWGYLQWCIGRIPT--ISASLKRMFGRRCKAI

Query:  QDSKG---SFFWRSLLCGRELLKEGLRKRMGDGCSIKVFEDKWIPRESTFRTMSPSISNRGALVADFITRSGEWDAAKLGSNMCLEDVEVIMKIPL
        Q   G   SF WRS+L GRELL  GL  ++GDG  I+  ED WIP  + F+  S ++      ++ FIT SG+WD AKL +      V+ I+ +P+
Subjt:  QDSKG---SFFWRSLLCGRELLKEGLRKRMGDGCSIKVFEDKWIPRESTFRTMSPSISNRGALVADFITRSGEWDAAKLGSNMCLEDVEVIMKIPL

XP_042974642.1 uncharacterized protein LOC122306274 [Carya illinoinensis]4.7e-1624.62Show/hide
Query:  LSKILQVYEKASRQCVNLEKSMVCFSPNIISDRRKYLQSI-----------------------------------LKVRCVESL------RHIWGYLQWC
        + ++L  YEKAS Q +N EK+ V F+ N  ++ +K +  +                                   L  R +ES       +  W +LQ  
Subjt:  LSKILQVYEKASRQCVNLEKSMVCFSPNIISDRRKYLQSI-----------------------------------LKVRCVESL------RHIWGYLQWC

Query:  IGRIPTISASLKRMFGRRCKAIQDSKGSFFWRSLLCGRELLKEGLRKRMGDGCSIKVFEDKWIPRESTFRTMSP-SISNRGALVADFITRSGEWDAAKLG
           +  I        G   +     + S+ WR++    +LLKEGLR R+G+G SIK++E KW+P  S+    +P S+ N  A V++ I+ +GEWD   + 
Subjt:  IGRIPTISASLKRMFGRRCKAIQDSKGSFFWRSLLCGRELLKEGLRKRMGDGCSIKVFEDKWIPRESTFRTMSP-SISNRGALVADFITRSGEWDAAKLG

Query:  SNMCLEDVEVIMKIPLSKRGMD--VVPLCPVYSKKAETKVHALLDYSRGHQI-------------WQAC-VASLDRKI-----RADTDLLSRWEN----N
        +    E+VE I  IP+SK   +  ++  C   S  +    +  LD SR   +             WQ+    ++  K+     RA  DLL+   N     
Subjt:  SNMCLEDVEVIMKIPLSKRGMD--VVPLCPVYSKKAETKVHALLDYSRGHQI-------------WQAC-VASLDRKI-----RADTDLLSRWEN----N

Query:  IINGDKCLMCKRRLSGFAITLRDL---------GMRSNQKMRSSGDSSISGSSIFIAA---MTHCFETEYPRLYSELLAIREGLKLAKRLNFCDMMVESD
        II+  KC  CK      + TL             ++  QK +S          I +AA   M +  +        E  A+R+ +++   LNF ++  E D
Subjt:  IINGDKCLMCKRRLSGFAITLRDL---------GMRSNQKMRSSGDSSISGSSIFIAA---MTHCFETEYPRLYSELLAIREGLKLAKRLNFCDMMVESD

Query:  CPEAIHLLQGR----MRYRSEVEVWNLEISYLMRDFRRLRFSHVNRKFNGVVDGLAKEA
             + +        RY S +E    ++  +++   +   S V R+ N     LAKEA
Subjt:  CPEAIHLLQGR----MRYRSEVEVWNLEISYLMRDFRRLRFSHVNRKFNGVVDGLAKEA

XP_042987456.1 uncharacterized protein LOC122315549 [Carya illinoinensis]2.1e-1623.45Show/hide
Query:  RSSTRGLMRAAERAPDGGR-----WADPVECFFLSK---------ILQVYEKASRQCVNLEKSMVCFSPNIISDRRKYLQSI---------------LKV
        +S  RG  R    A +G R     +AD    F  +K         +L  YEK S Q +N EK+ V  + N   + +K +  +               L  
Subjt:  RSSTRGLMRAAERAPDGGR-----WADPVECFFLSK---------ILQVYEKASRQCVNLEKSMVCFSPNIISDRRKYLQSI---------------LKV

Query:  RCVESL------RHIWGYLQWCIGRIPTISASLKRMFGRRCKAIQDSKG---SFFWRSLLCGRELLKEGLRKRMGDGCSIKVFEDKWIPRESTFRTMSP-
        + +ES       +  W +LQ     +P I    K  + R C  ++   G   S+ WRS+    ELL+EGLR R+G+G SI+++  KW+P  S+F   +P 
Subjt:  RCVESL------RHIWGYLQWCIGRIPTISASLKRMFGRRCKAIQDSKG---SFFWRSLLCGRELLKEGLRKRMGDGCSIKVFEDKWIPRESTFRTMSP-

Query:  SISNRGALVADFITRSGEWDAAKLGSNMCLEDVEVIMKIPLSKRGMD-------------VV--------------------------------PLCPVY
        S  +  A V++ I+ +GEWD   + S    E+VE I  IP+SK   D             +V                                  CP+ 
Subjt:  SISNRGALVADFITRSGEWDAAKLGSNMCLEDVEVIMKIPLSKRGMD-------------VV--------------------------------PLCPVY

Query:  SKKAETKVHALLDYSRGHQIWQACVASLDRKIRADTDLLSRWENNIING-----DKCLMCKRRLSGFAITLR--DLGMRSN------QKMRSSGDS---S
            ET  HAL        +W   +    +    +T+LL  WE  +        ++ +M  R +       R  +L +++N      +K R  G     S
Subjt:  SKKAETKVHALLDYSRGHQIWQACVASLDRKIRADTDLLSRWENNIING-----DKCLMCKRRLSGFAITLR--DLGMRSN------QKMRSSGDS---S

Query:  ISGSSIFIAAMTHCFETEYPRLYSELLAIREGLKLAKRLNFCDMMVESDCPEAIHLLQGR----MRYRSEVEVWNLEISYLMRDFRRLRFSHVNRKFNGV
             I +AA  H        + +E  A+R+ +++ +  NF +++ E D    ++ +        RY S +E    ++  L++   +     + R+ N  
Subjt:  ISGSSIFIAAMTHCFETEYPRLYSELLAIREGLKLAKRLNFCDMMVESDCPEAIHLLQGR----MRYRSEVEVWNLEISYLMRDFRRLRFSHVNRKFNGV

Query:  VDGLAKEASSSRLSVV
           LA+EA   +  +V
Subjt:  VDGLAKEASSSRLSVV

TrEMBL top hitse value%identityAlignment
A0A2N9E147 Reverse transcriptase domain-containing protein3.0e-1624.36Show/hide
Query:  ECFFLSKILQVYEKASRQCVNLEKSMVCFSPNIISDRRKYLQSILKVRCVESLRHIWGY--------------LQWCIGR--------------------
        EC  L  +L++YE AS Q +N+EK+ + FS N     ++ +QS L  R    L    G               ++  IGR                    
Subjt:  ECFFLSKILQVYEKASRQCVNLEKSMVCFSPNIISDRRKYLQSILKVRCVESLRHIWGY--------------LQWCIGR--------------------

Query:  -----IPTISASLKRMFGRRCKAIQD-------SKGSFFWRSLLCGRELLKEGLRKRMGDGCSIKVFEDKWIPRESTFRTMSP-SISNRGALVADFITRS
             IPT + S+ R+ G  C  I            SF WRS+L  RE++  G R R+G+G +I++++D+WIP  STF+  SP SI      V   I + 
Subjt:  -----IPTISASLKRMFGRRCKAIQD-------SKGSFFWRSLLCGRELLKEGLRKRMGDGCSIKVFEDKWIPRESTFRTMSP-SISNRGALVADFITRS

Query:  GE-WDAAKLGSNMCLEDVEVIMKIPLSKRGMDVVPLCPVYSK---KAETKVHALLDYSRGHQ-------------------IWQACVASLDR----KIRA
           W+A  + +   + + E I  IPLS+R      +     K      +  H L+   +G +                    W+  VA  DR    +  A
Subjt:  GE-WDAAKLGSNMCLEDVEVIMKIPLSKRGMDVVPLCPVYSK---KAETKVHALLDYSRGHQ-------------------IWQACVASLDR----KIRA

Query:  DTDLLSR----------WENNIINGDKCLMCKRRLSGFAITLRDLGMRSNQKMRSSGDSSI--SGSSIFIAAMTHCFETEYPRLYSELLAIREGLKLAKR
          D++ +           E    +    ++CK      ++   ++ ++      S G   +    S   IAAM          L     AI++ L LAK 
Subjt:  DTDLLSR----------WENNIINGDKCLMCKRRLSGFAITLRDLGMRSNQKMRSSGDSSI--SGSSIFIAAMTHCFETEYPRLYSELLAIREGLKLAKR

Query:  LNFCDMMVESDCPEAIHLLQGRMRYRSEVEVWNLEISYLMRDFRRLRFSHVNRKFNGVVDGLAKEASS
        ++   ++VE DC   +  L+      +       EI  L   F    F+ + R  N V   LAKE+ S
Subjt:  LNFCDMMVESDCPEAIHLLQGRMRYRSEVEVWNLEISYLMRDFRRLRFSHVNRKFNGVVDGLAKEASS

A0A2N9GFR8 RNase H domain-containing protein2.2e-1924.6Show/hide
Query:  ECFFLSKILQVYEKASRQCVNLEKSMVCFSPNIISDRRKYLQSILKVRCVESLRHIWGYLQWCIGRIPTISASLKRMFGRRCKAIQDSKGSFFWRSLLCG
        EC  + +IL +YEKAS Q +N  K+ + FS N    +++ ++ IL V  ++      G L   IG+      S  +  G   +A + ++GSF WRS+L  
Subjt:  ECFFLSKILQVYEKASRQCVNLEKSMVCFSPNIISDRRKYLQSILKVRCVESLRHIWGYLQWCIGRIPTISASLKRMFGRRCKAIQDSKGSFFWRSLLCG

Query:  RELLKEGLRKRMGDGCSIKVFEDKWIPRESTFRTMSPSIS-NRGALVADFITRSGE-WDAAKLGSNMCLEDVEVIMKIPLSKRGMD--------VVPLCP
        ++L++ G+  R+G+G  I +    W+  E   R +SP I     A V + I  S   W+  K+ S     D E I+KIPLS+R  +        ++ +  
Subjt:  RELLKEGLRKRMGDGCSIKVFEDKWIPRESTFRTMSPSIS-NRGALVADFITRSGE-WDAAKLGSNMCLEDVEVIMKIPLSKRGMD--------VVPLCP

Query:  VYSKKAETKVHALLDYSRGHQIWQ-ACVASLDRK-------------IRADTDLL---------------------------------SRWENNI-----
          S++ E   HAL      +Q+W  A   S+ R              ++AD+DLL                                 +RW   +     
Subjt:  VYSKKAETKVHALLDYSRGHQIWQ-ACVASLDRK-------------IRADTDLL---------------------------------SRWENNI-----

Query:  INGDKCLMCKRRLSGFAITLRDLGMRSNQKMRSSGDSSISGSSIFIAAMTHCFETEYPRLYSELLAIREGLKLAKRLNFCDMMVESDCPEAIHLLQGRMR
        +N D  +  +    G  + +RD                   + + IA ++      +     E LA R  +  AK +   D+ VE D    I  L     
Subjt:  INGDKCLMCKRRLSGFAITLRDLGMRSNQKMRSSGDSSISGSSIFIAAMTHCFETEYPRLYSELLAIREGLKLAKRLNFCDMMVESDCPEAIHLLQGRMR

Query:  YRSEVEVWNLEISYLMRDFRRLRFSHVNRKFNGVVDGLAKEAS
          +   +   +   L++DF+R   SH  R  N V   LA+ AS
Subjt:  YRSEVEVWNLEISYLMRDFRRLRFSHVNRKFNGVVDGLAKEAS

A0A2N9HU49 CCHC-type domain-containing protein5.1e-1629.88Show/hide
Query:  ADPVECFFLSKILQVYEKASRQCVNLEKSMVCFSPNIISDRRKYLQSILKVRCVESLRHIWGYLQWCIGRIPTISAS-LKRMFGRRCKAIQD--------
        A+  EC  L  +L +Y  AS Q VN +K+ + FS N     R  + SI             G L   +GR    + + +K   GRR +  ++        
Subjt:  ADPVECFFLSKILQVYEKASRQCVNLEKSMVCFSPNIISDRRKYLQSILKVRCVESLRHIWGYLQWCIGRIPTISAS-LKRMFGRRCKAIQD--------

Query:  --SKGSFFWRSLLCGRELLKEGLRKRMGDGCSIKVFEDKWIPRESTFRTMSPSISNRGALVADFIT--RSGEWDAAKLGSNMCLEDVEVIMKIPLSKRGM
             S+ WRS+   +E+L  GLR R+G G  IK+++D+W+P  ST++ +SP          D +    S  W+   L S     DVE I  IPLSKR  
Subjt:  --SKGSFFWRSLLCGRELLKEGLRKRMGDGCSIKVFEDKWIPRESTFRTMSPSISNRGALVADFIT--RSGEWDAAKLGSNMCLEDVEVIMKIPLSKRGM

Query:  DVVPLCPVYSKKAETKVHALLDYSRGHQIWQACVASLDRKI
              P  + +A +       +     IW A V S  RKI
Subjt:  DVVPLCPVYSKKAETKVHALLDYSRGHQIWQACVASLDRKI

A0A6J1DX30 uncharacterized protein LOC1110248749.6e-3123.92Show/hide
Query:  ECFFLSKILQVYEKASRQCVNLEKSMVCFSPNIISDRRKYLQSILKVRCVESL-----------------------------------------------
        EC  L ++L  Y +AS QC+N  KS + FSPN+  +R++YLQ IL V+ V                                                  
Subjt:  ECFFLSKILQVYEKASRQCVNLEKSMVCFSPNIISDRRKYLQSILKVRCVESL-----------------------------------------------

Query:  ----RHIWGYLQWCIGRIPT--ISASLKRMFGRRCKAIQ---DSKGSFFWRSLLCGRELLKEGLRKRMGDGCSIKVFEDKWIPRESTFRTMSPSISNRGA
            +H+W +LQ      P   +S  LK  + +    +Q   +SK S+FW+  L GR+LL +GLR R+G+G +IK F D W+PR +TF+   P   N GA
Subjt:  ----RHIWGYLQWCIGRIPT--ISASLKRMFGRRCKAIQ---DSKGSFFWRSLLCGRELLKEGLRKRMGDGCSIKVFEDKWIPRESTFRTMSPSISNRGA

Query:  L---VADFITRSGEWDAAKLGSNMCLEDVEVIMKIPLSK-------------------------------------------------------------
        L   VA FIT  G WD   +  + C ED ++I+ +P+S                                                              
Subjt:  L---VADFITRSGEWDAAKLGSNMCLEDVEVIMKIPLSK-------------------------------------------------------------

Query:  ----------------RGMDVVPLCPVYSKKAETKVHALLDYSRGHQIWQ------ACV------------ASLDRKIR-ADTDL--LSRW-----ENNI
                        RG+  +P C +   + E+ +HA     R  QIW+       C+            +SL  ++   D +L  ++ W      N++
Subjt:  ----------------RGMDVVPLCPVYSKKAETKVHALLDYSRGHQIWQ------ACV------------ASLDRKIR-ADTDL--LSRW-----ENNI

Query:  INGDKC----LMCK-------------------RRLSGFAITLRDLGMRSNQKMRSSGDSSISGSSI------------FIAAMTHCFETEYPRLYSELL
        I+G +       C+                   R  S     ++     S+  ++ + D++  G+S              +AA +         L +E+ 
Subjt:  INGDKC----LMCK-------------------RRLSGFAITLRDLGMRSNQKMRSSGDSSISGSSI------------FIAAMTHCFETEYPRLYSELL

Query:  AIREGLKLAKRLNFCDMMVESDCPEAIHLLQGRMRYRSEVEVWNLEISYLMRDFRRLRFSHVNRKFNGVVDGLAKEASSSRLSVVDELVSGVALCFNGWL
         I EGLK A   NF  + VESD   AI L++  +  R + + W +EI  L   F  + FSH +R+ N    GLAK   +S  +    L +     F  WL
Subjt:  AIREGLKLAKRLNFCDMMVESDCPEAIHLLQGRMRYRSEVEVWNLEISYLMRDFRRLRFSHVNRKFNGVVDGLAKEASSSRLSVVDELVSGVALCFNGWL

Query:  FD
         D
Subjt:  FD

A0A6P3Z8V7 uncharacterized protein LOC1074106101.9e-1528.06Show/hide
Query:  WCIGRIPT--ISASLKRMFGRRCKAIQDSKG---SFFWRSLLCGRELLKEGLRKRMGDGCSIKVFEDKWIPRESTFRTMSPSISNRGALVADFITRSGEW
        W I R P+  ++  +K  +  RC   +   G   S  WRSL+ GR +L+ G   R+GDG SI++++D W+P   + R +SP I    A+VA  +T SG W
Subjt:  WCIGRIPT--ISASLKRMFGRRCKAIQDSKG---SFFWRSLLCGRELLKEGLRKRMGDGCSIKVFEDKWIPRESTFRTMSPSISNRGALVADFITRSGEW

Query:  DAAKLGSNMCLEDVEVIMKIPLSKRGMDVVPLCPVYSKKAETKVHALLDYSRGHQIWQACVASLDRKIRADTDLL------SRWENNIINGDKCLMCKRR
        +   L ++   ++VEVI  IP    G D        + +   K      Y+     W+A    +  +I   +D          W+ +I N  KC   + +
Subjt:  DAAKLGSNMCLEDVEVIMKIPLSKRGMDVVPLCPVYSKKAETKVHALLDYSRGHQIWQACVASLDRKIRADTDLL------SRWENNIINGDKCLMCKRR

Query:  LSGFAITLRDLGMRSNQKMRSSGDSSI--SGSSIFIAAMTHCFETEYPRLYSELLAIREGLKLAKRLNFCDMMVESDC
        L  F+     L    N      G   +  + S   + AM   FE  +  +YSEL+AIR       +  F    + SDC
Subjt:  LSGFAITLRDLGMRSNQKMRSSGDSSI--SGSSIFIAAMTHCFETEYPRLYSELLAIREGLKLAKRLNFCDMMVESDC

SwissProt top hitse value%identityAlignment
P93295 Uncharacterized mitochondrial protein AtMg003105.7e-0446.51Show/hide
Query:  SKGSFFWRSLLCGRELLKEGLRKRMGDGCSIKVFEDKWIPRES
        ++ S+ WRS++ GRELL  GL + +GDG   KV+ D+WI  E+
Subjt:  SKGSFFWRSLLCGRELLKEGLRKRMGDGCSIKVFEDKWIPRES

Arabidopsis top hitse value%identityAlignment
ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein4.0e-0546.51Show/hide
Query:  SKGSFFWRSLLCGRELLKEGLRKRMGDGCSIKVFEDKWIPRES
        ++ S+ WRS++ GRELL  GL + +GDG   KV+ D+WI  E+
Subjt:  SKGSFFWRSLLCGRELLKEGLRKRMGDGCSIKVFEDKWIPRES


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTAGCTAAGTGGCGGATCGGAAATAGATCGGAGCGGCAACGAGTGCATCTGGTCTGGGGCGGCGGACGGATTGTATCGTATCGACATCGGGTAGATCTGGTCAGGCG
ACGGGACGATGGATCTGGTTTGGGGCGGGGTGGATCAGGCAGATCTGGTGAGCGGTGGAACGACCGATCTAAATGGGGCGGCAACAAGCGATCATCGACAAGAGGGCTAA
TGAGGGCGGCAGAAAGGGCACCAGACGGAGGAAGATGGGCTGATCCGGTGGAGTGTTTTTTCTTATCGAAGATCTTACAGGTGTATGAGAAGGCTTCTAGACAGTGTGTT
AACCTGGAGAAATCCATGGTTTGTTTTTCCCCAAACATAATCTCTGATCGTAGGAAGTACCTTCAGTCCATATTGAAGGTTAGGTGTGTGGAGTCGTTGAGACATATCTG
GGGTTACCTTCAGTGGTGTATAGGAAGAATTCCAACGATTTCCGCTTCATTAAAGAGAATGTTTGGAAGGCGCTGCAAGGCTATTCAGGATAGCAAGGGGTCATTCTTCT
GGAGAAGCCTTCTCTGTGGGCGTGAACTCTTAAAGGAAGGGTTGAGGAAGCGGATGGGTGATGGGTGCTCGATAAAAGTTTTTGAAGATAAGTGGATTCCTAGAGAATCT
ACTTTCAGAACCATGTCTCCTTCTATTTCTAATCGTGGTGCTCTTGTTGCTGATTTTATTACTAGAAGCGGTGAGTGGGATGCAGCAAAACTCGGATCTAATATGTGCCT
TGAAGATGTTGAGGTGATTATGAAAATTCCTCTCAGTAAGAGGGGCATGGATGTTGTACCTTTATGTCCAGTGTACTCAAAGAAGGCGGAGACGAAGGTTCATGCATTAC
TGGATTACAGTAGAGGGCATCAAATCTGGCAGGCGTGTGTGGCCTCGTTGGACAGGAAGATTAGGGCCGACACTGATTTGCTCAGCCGTTGGGAAAACAACATCATAAAC
GGAGACAAGTGTCTGATGTGCAAGCGAAGGCTGAGTGGATTTGCAATTACCTTGAGAGATTTAGGAATGCGCAGTAACCAAAAGATGCGATCATCTGGGGACTCCAGCAT
CTCGGGCTCTTCAATTTTTATTGCCGCGATGACTCATTGTTTTGAGACAGAATATCCCCGTTTGTATTCAGAGTTACTGGCGATTCGAGAAGGGTTAAAGCTGGCAAAGA
GACTCAATTTCTGTGATATGATGGTTGAATCGGACTGTCCGGAGGCAATTCATCTCCTTCAAGGAAGGATGAGATATAGATCTGAAGTTGAGGTTTGGAACTTAGAAATC
TCATATTTGATGAGAGATTTCAGGCGGCTGAGGTTCAGTCATGTGAATCGAAAATTCAATGGTGTTGTTGATGGTCTAGCAAAGGAGGCATCTTCTTCGAGACTATCTGT
TGTGGATGAGCTCGTTTCCGGCGTGGCTCTCTGTTTTAATGGCTGGTTATTTGACAGCTTTTGTTACCCCTGTGGCATGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTAGCTAAGTGGCGGATCGGAAATAGATCGGAGCGGCAACGAGTGCATCTGGTCTGGGGCGGCGGACGGATTGTATCGTATCGACATCGGGTAGATCTGGTCAGGCG
ACGGGACGATGGATCTGGTTTGGGGCGGGGTGGATCAGGCAGATCTGGTGAGCGGTGGAACGACCGATCTAAATGGGGCGGCAACAAGCGATCATCGACAAGAGGGCTAA
TGAGGGCGGCAGAAAGGGCACCAGACGGAGGAAGATGGGCTGATCCGGTGGAGTGTTTTTTCTTATCGAAGATCTTACAGGTGTATGAGAAGGCTTCTAGACAGTGTGTT
AACCTGGAGAAATCCATGGTTTGTTTTTCCCCAAACATAATCTCTGATCGTAGGAAGTACCTTCAGTCCATATTGAAGGTTAGGTGTGTGGAGTCGTTGAGACATATCTG
GGGTTACCTTCAGTGGTGTATAGGAAGAATTCCAACGATTTCCGCTTCATTAAAGAGAATGTTTGGAAGGCGCTGCAAGGCTATTCAGGATAGCAAGGGGTCATTCTTCT
GGAGAAGCCTTCTCTGTGGGCGTGAACTCTTAAAGGAAGGGTTGAGGAAGCGGATGGGTGATGGGTGCTCGATAAAAGTTTTTGAAGATAAGTGGATTCCTAGAGAATCT
ACTTTCAGAACCATGTCTCCTTCTATTTCTAATCGTGGTGCTCTTGTTGCTGATTTTATTACTAGAAGCGGTGAGTGGGATGCAGCAAAACTCGGATCTAATATGTGCCT
TGAAGATGTTGAGGTGATTATGAAAATTCCTCTCAGTAAGAGGGGCATGGATGTTGTACCTTTATGTCCAGTGTACTCAAAGAAGGCGGAGACGAAGGTTCATGCATTAC
TGGATTACAGTAGAGGGCATCAAATCTGGCAGGCGTGTGTGGCCTCGTTGGACAGGAAGATTAGGGCCGACACTGATTTGCTCAGCCGTTGGGAAAACAACATCATAAAC
GGAGACAAGTGTCTGATGTGCAAGCGAAGGCTGAGTGGATTTGCAATTACCTTGAGAGATTTAGGAATGCGCAGTAACCAAAAGATGCGATCATCTGGGGACTCCAGCAT
CTCGGGCTCTTCAATTTTTATTGCCGCGATGACTCATTGTTTTGAGACAGAATATCCCCGTTTGTATTCAGAGTTACTGGCGATTCGAGAAGGGTTAAAGCTGGCAAAGA
GACTCAATTTCTGTGATATGATGGTTGAATCGGACTGTCCGGAGGCAATTCATCTCCTTCAAGGAAGGATGAGATATAGATCTGAAGTTGAGGTTTGGAACTTAGAAATC
TCATATTTGATGAGAGATTTCAGGCGGCTGAGGTTCAGTCATGTGAATCGAAAATTCAATGGTGTTGTTGATGGTCTAGCAAAGGAGGCATCTTCTTCGAGACTATCTGT
TGTGGATGAGCTCGTTTCCGGCGTGGCTCTCTGTTTTAATGGCTGGTTATTTGACAGCTTTTGTTACCCCTGTGGCATGTGA
Protein sequenceShow/hide protein sequence
MVAKWRIGNRSERQRVHLVWGGGRIVSYRHRVDLVRRRDDGSGLGRGGSGRSGERWNDRSKWGGNKRSSTRGLMRAAERAPDGGRWADPVECFFLSKILQVYEKASRQCV
NLEKSMVCFSPNIISDRRKYLQSILKVRCVESLRHIWGYLQWCIGRIPTISASLKRMFGRRCKAIQDSKGSFFWRSLLCGRELLKEGLRKRMGDGCSIKVFEDKWIPRES
TFRTMSPSISNRGALVADFITRSGEWDAAKLGSNMCLEDVEVIMKIPLSKRGMDVVPLCPVYSKKAETKVHALLDYSRGHQIWQACVASLDRKIRADTDLLSRWENNIIN
GDKCLMCKRRLSGFAITLRDLGMRSNQKMRSSGDSSISGSSIFIAAMTHCFETEYPRLYSELLAIREGLKLAKRLNFCDMMVESDCPEAIHLLQGRMRYRSEVEVWNLEI
SYLMRDFRRLRFSHVNRKFNGVVDGLAKEASSSRLSVVDELVSGVALCFNGWLFDSFCYPCGM