; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g11200 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g11200
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr9:9517287..9520645
RNA-Seq ExpressionMoc09g11200
SyntenyMoc09g11200
Gene Ontology termsNA
InterPro domainsIPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
OMO70912.1 reverse transcriptase [Corchorus capsularis]1.0e-6433.85Show/hide
Query:  ESGDTEQRKKLMRTWTRQQRKG-KDKVEENEAVEDRKNKLIRRKHQIEVETIELENKRRMD----DTVKKEINFDCCINVPSNGNSGGLMLLWKNETDVH
        E  DT   K+     +R+  +G  + V  + AVED   +L   KH    ET   E K +      D ++  + FD C +VP  G  GGL LLW N   V 
Subjt:  ESGDTEQRKKLMRTWTRQQRKG-KDKVEENEAVEDRKNKLIRRKHQIEVETIELENKRRMD----DTVKKEINFDCCINVPSNGNSGGLMLLWKNETDVH

Query:  VMSYSQGHIDIYIKENNQRWRFTGVYGQPVTERRGETWELLKRLNSISDMPWILGGDFNEITMDSEKEGGAGRRLSQMEDFRNVIDHCGSLDAEVAGDKF
        ++SYS  HID+ I  ++  W FTG YG+P T  R E+W LL+ LN  SD+PW+  GDFNEI    EK+GG+ R +SQMEDF +VID CG  +  V G   
Subjt:  VMSYSQGHIDIYIKENNQRWRFTGVYGQPVTERRGETWELLKRLNSISDMPWILGGDFNEITMDSEKEGGAGRRLSQMEDFRNVIDHCGSLDAEVAGDKF

Query:  TWYKQFRGETIWERLDRFLLNVELDTLCSSFVVRHLDFLASDHRPILADWEWWNTVSQQ---GRRWKPSRFEEGWTKFEHCRELIRRNWQDIEGDSVVTF
        +W ++  GE ++ERLDR L+           V +HL   ASDH P+L        +S +        P RFE  W+       ++ ++WQ     +    
Subjt:  TWYKQFRGETIWERLDRFLLNVELDTLCSSFVVRHLDFLASDHRPILADWEWWNTVSQQ---GRRWKPSRFEEGWTKFEHCRELIRRNWQDIEGDSVVTF

Query:  NLKLGKCLHDLKRWNKERLEGSIKGAIARKEEEIQDIMNRRETNWRAAMMGAEKELEHLLEEDEMYWKQRAKEDWIIWGDRNTKWFHIKASQRRKQNIIK
        N+K+ +C  DL+ WN+ R+ G+I+  + RK+ E +++ ++ +          + EL+ L +++E+ W+QR+K  W+  GDRNT++FH  AS R+++  I 
Subjt:  NLKLGKCLHDLKRWNKERLEGSIKGAIARKEEEIQDIMNRRETNWRAAMMGAEKELEHLLEEDEMYWKQRAKEDWIIWGDRNTKWFHIKASQRRKQNIIK

Query:  GLDWRNGGWTDQDEEMGEIATEYFKTLFSTKNASAEDMGKILNSISIKIRND
        G+    G W  +   + ++ T Y++ +FS+   + E + K+L  +  ++  D
Subjt:  GLDWRNGGWTDQDEEMGEIATEYFKTLFSTKNASAEDMGKILNSISIKIRND

XP_022155286.1 uncharacterized protein LOC111022423 [Momordica charantia]1.9e-6349.42Show/hide
Query:  IKENNQRWRFTGVYGQPVTERRGETWELLKRLNSISDMPWILGGDFNEITMDSEKEGGAGRRLSQMEDFRNVIDHCGSLDAEVAGDKFTWYKQFR-GETI
        +KE+   WRFTG+YG  V + R ETWEL+ RL+SI D+PWILGGDFNEI  +SEK  G  RR S M++F++ +D CG LD    GD FTW    +  + I
Subjt:  IKENNQRWRFTGVYGQPVTERRGETWELLKRLNSISDMPWILGGDFNEITMDSEKEGGAGRRLSQMEDFRNVIDHCGSLDAEVAGDKFTWYKQFR-GETI

Query:  WERLDRFLLNVELDTLCSSFVVRHLDFLASDHRPILADWEWWNTVS---QQGRRWKPSRFEEGWTKFEHCRELIRRNWQDIEGDSVVT-FNLKLGKCLHD
        WERLDRFL+N  +  +  +  +RHL+FLASDHRPILA+W      +   ++GRR  P RFEE W  F+ C+E++RR W  ++GD   T F  K+  CL +
Subjt:  WERLDRFLLNVELDTLCSSFVVRHLDFLASDHRPILADWEWWNTVS---QQGRRWKPSRFEEGWTKFEHCRELIRRNWQDIEGDSVVT-FNLKLGKCLHD

Query:  LKRWNKERLEGSIKGAIARKEEEIQDIMNRRETNWRAAMMGAEKELEHLLEEDEMYWKQ
        L +WN  RL GS++GAI RKE EIQ ++    T WR  +  A+++LE LLEE+E YW+Q
Subjt:  LKRWNKERLEGSIKGAIARKEEEIQDIMNRRETNWRAAMMGAEKELEHLLEEDEMYWKQ

XP_022155286.1 uncharacterized protein LOC111022423 [Momordica charantia]5.8e-1247.44Show/hide
Query:  KILNSISIKIRNDPWIPREGNCTPVYVQDGMKHNSVACLMSERGKWNENLVRDNFCEEEAEMILKIPLPRQSQEDEII
        K+ N  SI + +DPW+PR+GN +PV+    +++ SVA LM  RG+W+E  VR++F   EA++IL+ PLP Q ++DEII
Subjt:  KILNSISIKIRNDPWIPREGNCTPVYVQDGMKHNSVACLMSERGKWNENLVRDNFCEEEAEMILKIPLPRQSQEDEII

XP_022155286.1 uncharacterized protein LOC111022423 [Momordica charantia]8.0e-6234.02Show/hide
Query:  VKKEINFDCCINVPSNGNSGGLMLLWKNETDVHVMSYSQGHIDIYIK-ENNQRWRFTGVYGQPVTERRGETWELLKRLNSISDMPWILGGDFNEITMDSE
        + K + +D C NV  NG  GGL LLW NE DV+++SYS  HID  I   + + WR +G+YG P T ++  TW LL+RL  +   PW+  GDFNEI   +E
Subjt:  VKKEINFDCCINVPSNGNSGGLMLLWKNETDVHVMSYSQGHIDIYIK-ENNQRWRFTGVYGQPVTERRGETWELLKRLNSISDMPWILGGDFNEITMDSE

Query:  KEGGAGRRLSQMEDFRNVIDHCGSLDAEVAGDKFTW-YKQFRGETIWERLDRFLLNVELDTLCSSFVVRHLDFLASDHRPILADWEWWNTVSQQGRRWKP
        K GG  R L  +  FR  ++ C  +D    G  FTW  ++F  + I ERLDRFL + + +    + +V +LD   SDH P++ + +  N  +   +   P
Subjt:  KEGGAGRRLSQMEDFRNVIDHCGSLDAEVAGDKFTW-YKQFRGETIWERLDRFLLNVELDTLCSSFVVRHLDFLASDHRPILADWEWWNTVSQQGRRWKP

Query:  SRF-EEGWTKFEHCRELIRRNWQD----IEGDSVVTFNLKLGKCLHDLKRWNKERLEGSIK--GAIARKEEEIQDIMNRRETNWRAAMMGAEKELEHLLE
          F E+ W+ +E C+ +++  W       +GD V  F      CL  L+ W++   +G  +    + +K  E++    +RE    + +   E+++E +L 
Subjt:  SRF-EEGWTKFEHCRELIRRNWQD----IEGDSVVTFNLKLGKCLHDLKRWNKERLEGSIK--GAIARKEEEIQDIMNRRETNWRAAMMGAEKELEHLLE

Query:  EDEMYWKQRAKEDWIIWGDRNTKWFHIKASQRRKQNIIKGLDWRNGGWTDQDEEMGEIATEYFKTLFSTKNASAEDMGKILNSISIKI
        ++E+YWKQR++ DW+  GD+NTK+FH KAS R+++N I G+  ++  W D  E + +   EYF +LF+T + S + +   L  +  ++
Subjt:  EDEMYWKQRAKEDWIIWGDRNTKWFHIKASQRRKQNIIKGLDWRNGGWTDQDEEMGEIATEYFKTLFSTKNASAEDMGKILNSISIKI

XP_030479239.1 uncharacterized protein LOC115696480 [Cannabis sativa]1.0e-6135.68Show/hide
Query:  INFDCCINVPSNGNSGGLMLLWKNETDVHVMSYSQGHIDIYIKENN-QRWRFTGVYGQPVTERRGETWELLKRLNSIS-DMPWILGGDFNEITMDSEKEG
        + F   + VP  G  GGL+LLWK+  DV + S+S  H D+++  +   R+ FT  YG P T +R +TW LL+     S D+PW++ GDFNE+  + +KEG
Subjt:  INFDCCINVPSNGNSGGLMLLWKNETDVHVMSYSQGHIDIYIKENN-QRWRFTGVYGQPVTERRGETWELLKRLNSIS-DMPWILGGDFNEITMDSEKEG

Query:  GAGRRLSQMEDFRNVIDHCGSLDAEVAGDKFTW-YKQFRGETIWERLDRFLLNVELDTLCSSFVVRHLDFLASDHRPILADWEWWNTVSQQGRRWKPSRF
        G  RR   MEDFR  ID CG    +  G+KFTW  K + G  + ERLDR  +N +     S   + HLD+ +SDHR I  +           R     RF
Subjt:  GAGRRLSQMEDFRNVIDHCGSLDAEVAGDKFTW-YKQFRGETIWERLDRFLLNVELDTLCSSFVVRHLDFLASDHRPILADWEWWNTVSQQGRRWKPSRF

Query:  EEGWTKFEHCRELIRRNWQD-IEGDSVVTFNLKLGKCLHDLKRWNKERLEGSIKGAIARKEEEIQDIMNRR--ETNWRAAMMGAEKELEHLLEEDEMYWK
        E+ W K E CR LI  NW+  +  D + +    +  C   L+ W+  +  G +K  I + +++  ++ N     T+    M+ AEK L+ LLE++E+YW+
Subjt:  EEGWTKFEHCRELIRRNWQD-IEGDSVVTFNLKLGKCLHDLKRWNKERLEGSIKGAIARKEEEIQDIMNRR--ETNWRAAMMGAEKELEHLLEEDEMYWK

Query:  QRAKEDWIIWGDRNTKWFHIKASQRRKQNIIKGLDWRNGGWTDQDEEMGEIATEYFKTLFSTKNASAEDMGKILNSISIKIRND
        QRA+ DW+  GD NTK+FH +A  R   N I+ L   +G     D+   + A+ +F  LF+T     E +  I+ +I   I  D
Subjt:  QRAKEDWIIWGDRNTKWFHIKASQRRKQNIIKGLDWRNGGWTDQDEEMGEIATEYFKTLFSTKNASAEDMGKILNSISIKIRND

XP_030940268.1 uncharacterized protein LOC115965235 [Quercus lobata]1.3e-6437.57Show/hide
Query:  DTVKKEINFDCCINVPSNGNSGGLMLLWKNETDVHVMSYSQGHIDIYIKEN-NQRWRFTGVYGQPVTERRGETWELLKRLNSISDMPWILGGDFNEITMD
        + ++  + FD    VP    SGGL L W N+ D+H+ ++S  HID  +    +  WRFTG YG P    R ++W LL+ L++  D+PW+  GDFNEIT  
Subjt:  DTVKKEINFDCCINVPSNGNSGGLMLLWKNETDVHVMSYSQGHIDIYIKEN-NQRWRFTGVYGQPVTERRGETWELLKRLNSISDMPWILGGDFNEITMD

Query:  SEKEGGAGRRLSQMEDFRNVIDHCGSLDAEVAGDKFTW-YKQFRGETIWERLDRFLLNVELDTLCSSFVVRHLDFLASDHRPILADWEWWNTVSQQ-GRR
         EK GGA R   QM+ FR+ +D CG  D    G  FTW   +F G  +W RLDR L + E      S  + HL   +SDH+PI   W   + V ++  R 
Subjt:  SEKEGGAGRRLSQMEDFRNVIDHCGSLDAEVAGDKFTW-YKQFRGETIWERLDRFLLNVELDTLCSSFVVRHLDFLASDHRPILADWEWWNTVSQQ-GRR

Query:  WKPSRFEEGWTKFEHCRELIRRNWQ-DIEGDSVVTFNLKLGKCLHDLKRWNKERLEGSIKGAIARKEEEIQDIMNRRETNWRAA-----MMGAEKELEHL
         KP RFEE W K E C  ++   W   +E + ++    K+  C   LK WNK  + G+I+G + +K +    ++ + E    A      +   ++E+  L
Subjt:  WKPSRFEEGWTKFEHCRELIRRNWQ-DIEGDSVVTFNLKLGKCLHDLKRWNKERLEGSIKGAIARKEEEIQDIMNRRETNWRAA-----MMGAEKELEHL

Query:  LEEDEMYWKQRAKEDWIIWGDRNTKWFHIKASQRRKQNIIKGLDWRNGGWTDQDEEMGEIATEYFKTLFSTKNASAED
        L+ +E  W QRAK DW+ +GDRN+K+FH +AS R K+N I GL+   G W D++E +GE+  EY+ +LFS+ N +  D
Subjt:  LEEDEMYWKQRAKEDWIIWGDRNTKWFHIKASQRRKQNIIKGLDWRNGGWTDQDEEMGEIATEYFKTLFSTKNASAED

TrEMBL top hitse value%identityAlignment
A0A1R3HKR2 Reverse transcriptase4.9e-6533.85Show/hide
Query:  ESGDTEQRKKLMRTWTRQQRKG-KDKVEENEAVEDRKNKLIRRKHQIEVETIELENKRRMD----DTVKKEINFDCCINVPSNGNSGGLMLLWKNETDVH
        E  DT   K+     +R+  +G  + V  + AVED   +L   KH    ET   E K +      D ++  + FD C +VP  G  GGL LLW N   V 
Subjt:  ESGDTEQRKKLMRTWTRQQRKG-KDKVEENEAVEDRKNKLIRRKHQIEVETIELENKRRMD----DTVKKEINFDCCINVPSNGNSGGLMLLWKNETDVH

Query:  VMSYSQGHIDIYIKENNQRWRFTGVYGQPVTERRGETWELLKRLNSISDMPWILGGDFNEITMDSEKEGGAGRRLSQMEDFRNVIDHCGSLDAEVAGDKF
        ++SYS  HID+ I  ++  W FTG YG+P T  R E+W LL+ LN  SD+PW+  GDFNEI    EK+GG+ R +SQMEDF +VID CG  +  V G   
Subjt:  VMSYSQGHIDIYIKENNQRWRFTGVYGQPVTERRGETWELLKRLNSISDMPWILGGDFNEITMDSEKEGGAGRRLSQMEDFRNVIDHCGSLDAEVAGDKF

Query:  TWYKQFRGETIWERLDRFLLNVELDTLCSSFVVRHLDFLASDHRPILADWEWWNTVSQQ---GRRWKPSRFEEGWTKFEHCRELIRRNWQDIEGDSVVTF
        +W ++  GE ++ERLDR L+           V +HL   ASDH P+L        +S +        P RFE  W+       ++ ++WQ     +    
Subjt:  TWYKQFRGETIWERLDRFLLNVELDTLCSSFVVRHLDFLASDHRPILADWEWWNTVSQQ---GRRWKPSRFEEGWTKFEHCRELIRRNWQDIEGDSVVTF

Query:  NLKLGKCLHDLKRWNKERLEGSIKGAIARKEEEIQDIMNRRETNWRAAMMGAEKELEHLLEEDEMYWKQRAKEDWIIWGDRNTKWFHIKASQRRKQNIIK
        N+K+ +C  DL+ WN+ R+ G+I+  + RK+ E +++ ++ +          + EL+ L +++E+ W+QR+K  W+  GDRNT++FH  AS R+++  I 
Subjt:  NLKLGKCLHDLKRWNKERLEGSIKGAIARKEEEIQDIMNRRETNWRAAMMGAEKELEHLLEEDEMYWKQRAKEDWIIWGDRNTKWFHIKASQRRKQNIIK

Query:  GLDWRNGGWTDQDEEMGEIATEYFKTLFSTKNASAEDMGKILNSISIKIRND
        G+    G W  +   + ++ T Y++ +FS+   + E + K+L  +  ++  D
Subjt:  GLDWRNGGWTDQDEEMGEIATEYFKTLFSTKNASAEDMGKILNSISIKIRND

A0A2N9HYE3 Reverse transcriptase domain-containing protein3.0e-6235.83Show/hide
Query:  DTVKKEINFDCCINVPSNGNSGGLMLLWKNETDVHVMSYSQGHIDIYIKENN-QRWRFTGVYGQPVTERRGETWELLKRLNSISDMPWILGGDFNEITMD
        + ++ ++ FD    V      GGL L WK +  + V S+S  HID  + +N    WRFTG YG P T +R E+W+LL+RLN+   +PW   GDFNE+   
Subjt:  DTVKKEINFDCCINVPSNGNSGGLMLLWKNETDVHVMSYSQGHIDIYIKENN-QRWRFTGVYGQPVTERRGETWELLKRLNSISDMPWILGGDFNEITMD

Query:  SEKEGGAGRRLSQMEDFRNVIDHCGSLDAEVAGDKFTWYKQFRGETIWERLDRFLLNVELDTLCSSFVVRHLDFLASDHRPILADWEWWNTVSQQGRRWK
         EK+G   R  SQM+ FR+V+D CG +D    G KFTW     G+  WERLDR +   +      S  V HL+   SDH+PI     W +T +    + K
Subjt:  SEKEGGAGRRLSQMEDFRNVIDHCGSLDAEVAGDKFTWYKQFRGETIWERLDRFLLNVELDTLCSSFVVRHLDFLASDHRPILADWEWWNTVSQQGRRWK

Query:  PSRFEEGWTKFEHCRELIRRNW-QDIEGDSVVTFNLKLGKCLHDLKRWNKERLEGSIKGAIARKEEEIQDIMNRRETNWRAAMMGAE--------KELEH
        P RFEE WT  + C  +I  +W QD+ G  + T   K+  C   L+ W++        G I  + +E++ ++   E N   +M G +        +EL  
Subjt:  PSRFEEGWTKFEHCRELIRRNW-QDIEGDSVVTFNLKLGKCLHDLKRWNKERLEGSIKGAIARKEEEIQDIMNRRETNWRAAMMGAE--------KELEH

Query:  LLEEDEMYWKQRAKEDWIIWGDRNTKWFHIKASQRRKQNIIKGLDWRNGGWTDQDEEMGEIATEYFKTLFSTKN
        LL ++E  W+QR++ +W+  GDRNT++FH +A+QR+++N +  L  ++G WT    ++  +  EY+K+LF T N
Subjt:  LLEEDEMYWKQRAKEDWIIWGDRNTKWFHIKASQRRKQNIIKGLDWRNGGWTDQDEEMGEIATEYFKTLFSTKN

A0A2N9IPS8 Reverse transcriptase domain-containing protein3.4e-6636.48Show/hide
Query:  TIELENKRRMD----DTVKKEINFDCCINVPSNGNSGGLMLLWKNETDVHVMSYSQGHIDIYI--KENNQRWRFTGVYGQPVTERRGETWELLKRLNSIS
        T+   ++ R+D    + ++  I FD    VP  G  GGL +LW  + DV + +YS+ HID  I  KE  + +R TG YG P T +R E+W LLK L+ +S
Subjt:  TIELENKRRMD----DTVKKEINFDCCINVPSNGNSGGLMLLWKNETDVHVMSYSQGHIDIYI--KENNQRWRFTGVYGQPVTERRGETWELLKRLNSIS

Query:  DMPWILGGDFNEITMDSEKEGGAGRRLSQMEDFRNVIDHCGSLDAEVAGDKFTWYKQFRGET-IWERLDRFLLNVELDTLCSSFVVRHLDFLASDHRPIL
          PW+  GDFNEI  ++E+ G   R   Q+ DFR  + HCG  D    G+ +TW ++  G   +  RLDR + +V   T     VV HL    SDH PIL
Subjt:  DMPWILGGDFNEITMDSEKEGGAGRRLSQMEDFRNVIDHCGSLDAEVAGDKFTWYKQFRGET-IWERLDRFLLNVELDTLCSSFVVRHLDFLASDHRPIL

Query:  ADWEWWNTVSQQGRRWKPSRFEEGWTKFEHCRELIRRNWQD--IEGDSVVTFNLKLGKCLHDLKRWNKERLEGSIKGAIARKEEEIQDIMNRRETNWRAA
         D      V    R+ K  RFE  W K E CRE+I   W D   EG  +     K+  C   L  W++ER  GS+  +I RK E++Q ++N   + +   
Subjt:  ADWEWWNTVSQQGRRWKPSRFEEGWTKFEHCRELIRRNWQD--IEGDSVVTFNLKLGKCLHDLKRWNKERLEGSIKGAIARKEEEIQDIMNRRETNWRAA

Query:  MMGAEKELEHLLEEDEMYWKQRAKEDWIIWGDRNTKWFHIKASQRRKQNIIKGLDWRNGGWTDQDEEMGEIATEYFKTLFSTKNASAEDMGKILNSISIK
        ++  + +L  LLE++E++W+QR++  W+  GD+NTK+FH + ++RR+ N I GL  R+G W  +  ++ EIA +YF+ +F++ N SAE +  +L  +   
Subjt:  MMGAEKELEHLLEEDEMYWKQRAKEDWIIWGDRNTKWFHIKASQRRKQNIIKGLDWRNGGWTDQDEEMGEIATEYFKTLFSTKNASAEDMGKILNSISIK

Query:  IRN
        + N
Subjt:  IRN

A0A6J1DRA0 uncharacterized protein LOC1110224239.2e-6449.42Show/hide
Query:  IKENNQRWRFTGVYGQPVTERRGETWELLKRLNSISDMPWILGGDFNEITMDSEKEGGAGRRLSQMEDFRNVIDHCGSLDAEVAGDKFTWYKQFR-GETI
        +KE+   WRFTG+YG  V + R ETWEL+ RL+SI D+PWILGGDFNEI  +SEK  G  RR S M++F++ +D CG LD    GD FTW    +  + I
Subjt:  IKENNQRWRFTGVYGQPVTERRGETWELLKRLNSISDMPWILGGDFNEITMDSEKEGGAGRRLSQMEDFRNVIDHCGSLDAEVAGDKFTWYKQFR-GETI

Query:  WERLDRFLLNVELDTLCSSFVVRHLDFLASDHRPILADWEWWNTVS---QQGRRWKPSRFEEGWTKFEHCRELIRRNWQDIEGDSVVT-FNLKLGKCLHD
        WERLDRFL+N  +  +  +  +RHL+FLASDHRPILA+W      +   ++GRR  P RFEE W  F+ C+E++RR W  ++GD   T F  K+  CL +
Subjt:  WERLDRFLLNVELDTLCSSFVVRHLDFLASDHRPILADWEWWNTVS---QQGRRWKPSRFEEGWTKFEHCRELIRRNWQDIEGDSVVT-FNLKLGKCLHD

Query:  LKRWNKERLEGSIKGAIARKEEEIQDIMNRRETNWRAAMMGAEKELEHLLEEDEMYWKQ
        L +WN  RL GS++GAI RKE EIQ ++    T WR  +  A+++LE LLEE+E YW+Q
Subjt:  LKRWNKERLEGSIKGAIARKEEEIQDIMNRRETNWRAAMMGAEKELEHLLEEDEMYWKQ

A0A6J1DRA0 uncharacterized protein LOC1110224232.8e-1247.44Show/hide
Query:  KILNSISIKIRNDPWIPREGNCTPVYVQDGMKHNSVACLMSERGKWNENLVRDNFCEEEAEMILKIPLPRQSQEDEII
        K+ N  SI + +DPW+PR+GN +PV+    +++ SVA LM  RG+W+E  VR++F   EA++IL+ PLP Q ++DEII
Subjt:  KILNSISIKIRNDPWIPREGNCTPVYVQDGMKHNSVACLMSERGKWNENLVRDNFCEEEAEMILKIPLPRQSQEDEII

A0A6J1DRA0 uncharacterized protein LOC1110224232.3e-6235.44Show/hide
Query:  KRRMDDTVKKEINFDCCINVPSNGNSGGLMLLWKNETDVHVMSYSQGHIDIYI-KENNQRWRFTGVYGQPVTERRGETWELLKRLNSISDMPWILGGDFN
        K+   + ++  + +D C  V ++G SGGL+LLW N  D ++MS+S  HID +I KE  Q WRFTG YG P   +R E+W+LL R+  +   PW++GGDFN
Subjt:  KRRMDDTVKKEINFDCCINVPSNGNSGGLMLLWKNETDVHVMSYSQGHIDIYI-KENNQRWRFTGVYGQPVTERRGETWELLKRLNSISDMPWILGGDFN

Query:  EITMDSEKEGGAGRRLSQMEDFRNVIDHCGSLDAEVAGDKFTWYKQFRGETIWERLDRFLLNVELDTLCSSFVVRHLDFLASDHRPILADWEWWNTVSQQ
        EI  + EK GG  +    + +FR  ++     + E  G ++TW    + E I+ERLDR   N E   L     V HLD ++SDH P+L      N  +++
Subjt:  EITMDSEKEGGAGRRLSQMEDFRNVIDHCGSLDAEVAGDKFTWYKQFRGETIWERLDRFLLNVELDTLCSSFVVRHLDFLASDHRPILADWEWWNTVSQQ

Query:  GRRWKPS-RFEEGWTKFEHCRELIRRNWQDIEG--DSVVTFNLKLGKCLHDLKRWNKERLEGSIKGAIARKEEEIQDIMNRRETN--WRAAMMGAEKELE
        G RW     FE  W   E C E+++ +W D  G  ++ +    KL  C   L++WNK R +  +K  +   EE+I  I++R   N  W+  +   E++  
Subjt:  GRRWKPS-RFEEGWTKFEHCRELIRRNWQDIEG--DSVVTFNLKLGKCLHDLKRWNKERLEGSIKGAIARKEEEIQDIMNRRETN--WRAAMMGAEKELE

Query:  HLLEEDEMYWKQRAKEDWIIWGDRNTKWFHIKASQRRKQNIIKGLDWRNGGWTDQDEEMGEIATEYFKTLFSTKNASAEDMGKILNSISIKIRND
         LL+++E +W+QR++  W+  GDRNTK+FH KA+ R+++N I GL   N  W   ++ +G++A  YF+ +F++ +AS  D+ +    +  KI  +
Subjt:  HLLEEDEMYWKQRAKEDWIIWGDRNTKWFHIKASQRRKQNIIKGLDWRNGGWTDQDEEMGEIATEYFKTLFSTKNASAEDMGKILNSISIKIRND

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein2.5e-0521.99Show/hide
Query:  SDMPWILGGDFNEITMDSEKEGGAGRRLSQ--MEDFRNVIDHCGSLDAEVAGDKFTWYKQFRGETIWERLDRFLLNVE-LDTLCSSFVVRHLDFLASDHR
        +D   IL GDF++I   S+        +    +E+F+N +     +D    G  +TW        I  +LDR + N +   +  S+  V  L  + SDH 
Subjt:  SDMPWILGGDFNEITMDSEKEGGAGRRLSQ--MEDFRNVIDHCGSLDAEVAGDKFTWYKQFRGETIWERLDRFLLNVE-LDTLCSSFVVRHLDFLASDHR

Query:  PILADWEWWNTVSQQGRRWKPSRFEEGWTKFEHCRELIRRNWQDIEGDSVVTFNLKLG-------KCLHDLKRWNKERLEGSIKGAIARKEE-EIQDIMN
        P +   E  N   +  + ++   F        H   L+       E   V +    LG       KC   L R     ++   K A+   E  + Q + N
Subjt:  PILADWEWWNTVSQQGRRWKPSRFEEGWTKFEHCRELIRRNWQDIEGDSVVTFNLKLG-------KCLHDLKRWNKERLEGSIKGAIARKEE-EIQDIMN

Query:  RRETNWRAAMMGAEKELEHLLEEDEMYWKQRAKEDWIIWGDRNTKWFHIKASQRRKQNIIKGLDWRNGGWTDQDEEMGEIATEYFKTLFST
          ++ +R   + A K+        E +++Q+++  W+  GD NT++FH      + +N+IK L   +    +   ++ E+   Y+  L  +
Subjt:  RRETNWRAAMMGAEKELEHLLEEDEMYWKQRAKEDWIIWGDRNTKWFHIKASQRRKQNIIKGLDWRNGGWTDQDEEMGEIATEYFKTLFST


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCAAATTCACTAGTGGTTGAGGACAATTCTTGTGGAAACGAAGTGGCTAATATAGAATTGATTGTTAAGAATGGAAATGAGAGTGGTGATACAGAACAAAGGAAAAA
ATTAATGAGGACATGGACGCGGCAACAAAGGAAAGGAAAAGACAAAGTGGAAGAAAATGAAGCAGTGGAGGACAGGAAAAATAAATTGATTAGAAGAAAACACCAAATTG
AAGTAGAAACAATTGAGTTGGAGAATAAGAGGAGGATGGATGATACAGTGAAGAAAGAGATAAATTTTGATTGTTGTATAAATGTGCCGAGTAATGGAAATAGTGGCGGT
TTGATGCTTCTTTGGAAAAACGAGACAGACGTCCACGTAATGTCTTATTCTCAGGGGCACATTGACATTTATATTAAAGAAAATAATCAGCGTTGGAGGTTTACTGGAGT
TTATGGCCAACCCGTGACGGAGAGGAGAGGAGAGACTTGGGAGCTTCTGAAGCGACTCAATAGCATTTCTGATATGCCATGGATTTTGGGTGGTGATTTCAATGAGATAA
CCATGGATTCTGAAAAAGAAGGAGGTGCTGGCAGAAGATTAAGTCAGATGGAGGACTTTAGAAATGTAATCGATCACTGTGGGTCGCTAGATGCGGAAGTGGCGGGAGAT
AAATTTACTTGGTATAAACAGTTTCGTGGTGAGACGATTTGGGAGCGTTTGGACCGGTTCTTGCTGAATGTGGAGTTGGACACTTTATGCTCTTCTTTTGTTGTGAGGCA
CCTGGATTTTCTCGCCTCAGATCATAGACCGATACTTGCCGATTGGGAATGGTGGAACACGGTCAGTCAACAAGGAAGAAGATGGAAGCCTAGTCGTTTTGAGGAGGGTT
GGACGAAGTTTGAGCACTGTAGAGAGCTTATTAGGAGAAATTGGCAAGATATAGAGGGGGACAGTGTTGTTACCTTCAATTTAAAGCTAGGGAAGTGTTTACATGATTTA
AAAAGGTGGAACAAGGAAAGATTAGAAGGGTCCATTAAAGGTGCAATTGCTAGGAAAGAGGAGGAAATTCAAGATATTATGAACAGAAGGGAGACAAACTGGAGAGCAGC
CATGATGGGTGCAGAGAAAGAGCTTGAGCATCTCCTCGAAGAAGATGAAATGTACTGGAAACAAAGAGCTAAAGAAGATTGGATTATATGGGGGGACAGAAATACAAAGT
GGTTCCATATTAAAGCTAGCCAGAGAAGGAAGCAAAACATAATTAAAGGGTTGGACTGGAGAAACGGGGGATGGACCGATCAAGATGAGGAAATGGGGGAAATTGCTACA
GAGTATTTTAAGACTCTTTTTTCTACTAAGAATGCCTCTGCTGAAGATATGGGGAAAATTCTCAACTCAATTTCTATTAAGATTAGAAATGATCCATGGATTCCTCGAGA
AGGCAACTGCACACCAGTTTATGTCCAAGATGGGATGAAGCACAATTCTGTGGCATGTCTAATGAGTGAAAGGGGAAAGTGGAATGAGAATCTTGTCCGTGATAACTTTT
GTGAGGAAGAAGCTGAAATGATTCTTAAAATTCCTTTACCGCGACAGAGTCAAGAAGATGAAATTATATAG
mRNA sequenceShow/hide mRNA sequence
ATGCCAAATTCACTAGTGGTTGAGGACAATTCTTGTGGAAACGAAGTGGCTAATATAGAATTGATTGTTAAGAATGGAAATGAGAGTGGTGATACAGAACAAAGGAAAAA
ATTAATGAGGACATGGACGCGGCAACAAAGGAAAGGAAAAGACAAAGTGGAAGAAAATGAAGCAGTGGAGGACAGGAAAAATAAATTGATTAGAAGAAAACACCAAATTG
AAGTAGAAACAATTGAGTTGGAGAATAAGAGGAGGATGGATGATACAGTGAAGAAAGAGATAAATTTTGATTGTTGTATAAATGTGCCGAGTAATGGAAATAGTGGCGGT
TTGATGCTTCTTTGGAAAAACGAGACAGACGTCCACGTAATGTCTTATTCTCAGGGGCACATTGACATTTATATTAAAGAAAATAATCAGCGTTGGAGGTTTACTGGAGT
TTATGGCCAACCCGTGACGGAGAGGAGAGGAGAGACTTGGGAGCTTCTGAAGCGACTCAATAGCATTTCTGATATGCCATGGATTTTGGGTGGTGATTTCAATGAGATAA
CCATGGATTCTGAAAAAGAAGGAGGTGCTGGCAGAAGATTAAGTCAGATGGAGGACTTTAGAAATGTAATCGATCACTGTGGGTCGCTAGATGCGGAAGTGGCGGGAGAT
AAATTTACTTGGTATAAACAGTTTCGTGGTGAGACGATTTGGGAGCGTTTGGACCGGTTCTTGCTGAATGTGGAGTTGGACACTTTATGCTCTTCTTTTGTTGTGAGGCA
CCTGGATTTTCTCGCCTCAGATCATAGACCGATACTTGCCGATTGGGAATGGTGGAACACGGTCAGTCAACAAGGAAGAAGATGGAAGCCTAGTCGTTTTGAGGAGGGTT
GGACGAAGTTTGAGCACTGTAGAGAGCTTATTAGGAGAAATTGGCAAGATATAGAGGGGGACAGTGTTGTTACCTTCAATTTAAAGCTAGGGAAGTGTTTACATGATTTA
AAAAGGTGGAACAAGGAAAGATTAGAAGGGTCCATTAAAGGTGCAATTGCTAGGAAAGAGGAGGAAATTCAAGATATTATGAACAGAAGGGAGACAAACTGGAGAGCAGC
CATGATGGGTGCAGAGAAAGAGCTTGAGCATCTCCTCGAAGAAGATGAAATGTACTGGAAACAAAGAGCTAAAGAAGATTGGATTATATGGGGGGACAGAAATACAAAGT
GGTTCCATATTAAAGCTAGCCAGAGAAGGAAGCAAAACATAATTAAAGGGTTGGACTGGAGAAACGGGGGATGGACCGATCAAGATGAGGAAATGGGGGAAATTGCTACA
GAGTATTTTAAGACTCTTTTTTCTACTAAGAATGCCTCTGCTGAAGATATGGGGAAAATTCTCAACTCAATTTCTATTAAGATTAGAAATGATCCATGGATTCCTCGAGA
AGGCAACTGCACACCAGTTTATGTCCAAGATGGGATGAAGCACAATTCTGTGGCATGTCTAATGAGTGAAAGGGGAAAGTGGAATGAGAATCTTGTCCGTGATAACTTTT
GTGAGGAAGAAGCTGAAATGATTCTTAAAATTCCTTTACCGCGACAGAGTCAAGAAGATGAAATTATATAG
Protein sequenceShow/hide protein sequence
MPNSLVVEDNSCGNEVANIELIVKNGNESGDTEQRKKLMRTWTRQQRKGKDKVEENEAVEDRKNKLIRRKHQIEVETIELENKRRMDDTVKKEINFDCCINVPSNGNSGG
LMLLWKNETDVHVMSYSQGHIDIYIKENNQRWRFTGVYGQPVTERRGETWELLKRLNSISDMPWILGGDFNEITMDSEKEGGAGRRLSQMEDFRNVIDHCGSLDAEVAGD
KFTWYKQFRGETIWERLDRFLLNVELDTLCSSFVVRHLDFLASDHRPILADWEWWNTVSQQGRRWKPSRFEEGWTKFEHCRELIRRNWQDIEGDSVVTFNLKLGKCLHDL
KRWNKERLEGSIKGAIARKEEEIQDIMNRRETNWRAAMMGAEKELEHLLEEDEMYWKQRAKEDWIIWGDRNTKWFHIKASQRRKQNIIKGLDWRNGGWTDQDEEMGEIAT
EYFKTLFSTKNASAEDMGKILNSISIKIRNDPWIPREGNCTPVYVQDGMKHNSVACLMSERGKWNENLVRDNFCEEEAEMILKIPLPRQSQEDEII