; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0023058 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0023058
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr7:43532485..43534161
RNA-Seq ExpressionLag0023058
SyntenyLag0023058
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CCA66036.1 hypothetical protein [Beta vulgaris subsp. vulgaris]4.8e-10437.16Show/hide
Query:  MVFLSETRVCGARMNSIKVQLGYQNCFSVDCTG----RSGGIALLWSTEVGFSLLSFSKNHIDGWVNWDAM-NWRFTGFYGNPQTELKHLSWGLLKNLRG
        +VFLSET++    M S+K +L +++  +VDC G    R GG+A+LW +E+   ++S S NHID  V  +A   WRFTG YG P+ E K  +  LL  L  
Subjt:  MVFLSETRVCGARMNSIKVQLGYQNCFSVDCTG----RSGGIALLWSTEVGFSLLSFSKNHIDGWVNWDAM-NWRFTGFYGNPQTELKHLSWGLLKNLRG

Query:  SQSSPWLVGGDFNAILFHHEKQGGRDKSESEMSAFRDALDDCGLIDIGYKGDCFTWSNGRPGLGNIKERLDRVCVTSAWNALFPDGMVEHLAYSRSDHRP
        +   PWL GGDFN +L   EK+GG   +  E   FR+A+++C  +D+G+ G  FTW+N R G  NI+ERLDR      W   FP   V HL   +SDH P
Subjt:  SQSSPWLVGGDFNAILFHHEKQGGRDKSESEMSAFRDALDDCGLIDIGYKGDCFTWSNGRPGLGNIKERLDRVCVTSAWNALFPDGMVEHLAYSRSDHRP

Query:  LLLTLVSKGNVNPSTLAERNK---------------------------------RCLHFLSKWGRQYVGGYKKRIQEATNRVQHEMANLHIKENRTDLDR
        ++ ++  KG  + +T  +++K                                 R  + L  W +Q  G   K I+   ++++  M +   ++N   +  
Subjt:  LLLTLVSKGNVNPSTLAERNK---------------------------------RCLHFLSKWGRQYVGGYKKRIQEATNRVQHEMANLHIKENRTDLDR

Query:  AEGLLEKLLLEEEMYWKQRSRENWLRWGDRNTGWFHSRASHRRRNNRISGLENSSGEWIDNKGQVTSMISDYFETLFTSSNPSPSDISLAIAGVQNSVSA
         +  +++L   EE+YW QRSR++W++ GD+NT +FH +ASHR + N +  + N +GEW +++  VT   + YFE LF S N    D  L I  V+  ++ 
Subjt:  AEGLLEKLLLEEEMYWKQRSRENWLRWGDRNTGWFHSRASHRRRNNRISGLENSSGEWIDNKGQVTSMISDYFETLFTSSNPSPSDISLAIAGVQNSVSA

Query:  EMNRKLMRPFSEIDITTALRQIHPSKAPGPDGLPGSFYRVHWPTVGPDVVHACLQVLNNDISPGPLNDTLVTLIPKTKAPRKLSEFRPISLCNISYKLIS
        E+  +L  PF   +++ AL Q+HP+KAPGPDG+   FY+  W T+G DV    L +LNN  + G +N T + LIPK K      +FRPISLCN+ YK+++
Subjt:  EMNRKLMRPFSEIDITTALRQIHPSKAPGPDGLPGSFYRVHWPTVGPDVVHACLQVLNNDISPGPLNDTLVTLIPKTKAPRKLSEFRPISLCNISYKLIS

Query:  KAIVNRFKSVLNSVIAPNQSAFIPGRCVVDNAIMGYESLHALKSRKLGRKGLGGSQI
        K + NR K VL  VI  +QS F+PGR + DN ++ YE  H L+ +K G+KG  G ++
Subjt:  KAIVNRFKSVLNSVIAPNQSAFIPGRCVVDNAIMGYESLHALKSRKLGRKGLGGSQI

OMO59710.1 reverse transcriptase [Corchorus capsularis]7.9e-10739.82Show/hide
Query:  MVFLSETRVCGARMNSIKVQLGYQNCFSVDCTGRSGGIALLWSTEVGFSLLSFSKNHIDGWV--NWDAMNWRFTGFYGNPQTELKHLSWGLLKNLRGSQS
        +VFL ET++   +++SI+ +     CF V  TGRSGG+A+ W   V   L+SFS++HID WV  N     WR TGFYG   T  +HLSW LL+       
Subjt:  MVFLSETRVCGARMNSIKVQLGYQNCFSVDCTGRSGGIALLWSTEVGFSLLSFSKNHIDGWV--NWDAMNWRFTGFYGNPQTELKHLSWGLLKNLRGSQS

Query:  SPWLVGGDFNAILFHHEKQGGRDKSESEMSAFRDALDDCGLIDIGYKGDCFTWSNGRPGLGNIKERLDRVCVTSAWNALFPDGMVEHLAYSRSDHRPLLL
          W   GDFN +L+  EK GGR++ E++M AFR+ALDDCGL DIGY+G+ FTW  G      I ERLDR   T  W + FP   + HL+ S SDH P+LL
Subjt:  SPWLVGGDFNAILFHHEKQGGRDKSESEMSAFRDALDDCGLIDIGYKGDCFTWSNGRPGLGNIKERLDRVCVTSAWNALFPDGMVEHLAYSRSDHRPLLL

Query:  T--LVSKGNVNPSTLAERN-----------------------------KRCLHFLSKWGRQY---VGGYKKRIQEATNRVQH-EMANLHIKENRTDLDRA
           +  +     S   ++N                              R +      G++Y       ++RI E + ++        H++ +     R 
Subjt:  T--LVSKGNVNPSTLAERN-----------------------------KRCLHFLSKWGRQY---VGGYKKRIQEATNRVQH-EMANLHIKENRTDLDRA

Query:  EGLLEKLLLEEEMYWKQRSRENWLRWGDRNTGWFHSRASHRRRNNRISGLENSSGEWIDNKGQVTSMISDYFETLFTSSNPSPSDISLAIAGVQNSVSAE
        E  + +LL EEE +W Q SR NWL  GDRNT +FH++AS RR+ N I  LE  +G   D+  ++  + S YF+ LF SS     D  L    V  S++ E
Subjt:  EGLLEKLLLEEEMYWKQRSRENWLRWGDRNTGWFHSRASHRRRNNRISGLENSSGEWIDNKGQVTSMISDYFETLFTSSNPSPSDISLAIAGVQNSVSAE

Query:  MNRKLMRPFSEIDITTALRQIHPSKAPGPDGLPGSFYRVHWPTVGPDVVHACLQVLNNDISPGPLNDTLVTLIPKTKAPRKLSEFRPISLCNISYKLISK
        MN  L+  F+  +I TAL+QIHP+KAPGPDG+P  F++  W  VG DV   CL   + ++     N T + LIPK   P+ +++FRPISLCN+ YK+ISK
Subjt:  MNRKLMRPFSEIDITTALRQIHPSKAPGPDGLPGSFYRVHWPTVGPDVVHACLQVLNNDISPGPLNDTLVTLIPKTKAPRKLSEFRPISLCNISYKLISK

Query:  AIVNRFKSVLNSVIAPNQSAFIPGRCVVDNAIMGYESLHALKSRKLGRKG
         +VNR KS+L   I+ +QSAF+PGR + DN ++ +E LH+LK RK G++G
Subjt:  AIVNRFKSVLNSVIAPNQSAFIPGRCVVDNAIMGYESLHALKSRKLGRKG

XP_012836341.1 PREDICTED: uncharacterized protein LOC105956976 [Erythranthe guttata]1.0e-10638.6Show/hide
Query:  MVFLSETRVCGARMNSIKVQLGYQNCFSVDCTGRSGGIALLWSTEVGFSLLSFSKNHIDGWV---NWDAMNWRFTGFYGNPQTELKHLSWGLLKNLRGSQ
        +VFLSET+     M  ++ +    N F VD  GRSGG+ L W  +V   L+S+S NHID  V   N ++  WR TGFYG P    +H SW LL++LR  +
Subjt:  MVFLSETRVCGARMNSIKVQLGYQNCFSVDCTGRSGGIALLWSTEVGFSLLSFSKNHIDGWV---NWDAMNWRFTGFYGNPQTELKHLSWGLLKNLRGSQ

Query:  SSPWLVGGDFNAILFHHEKQGGRDKSESEMSAFRDALDDCGLIDIGYKGDCFTWSNGRPGLGNIKERLDRVCVTSAWNALFPDGMVEHLAYSRSDHRPLL
        S PW+VGGDFN IL + EK+GG  K  + + AFR+ LD C L D+G++G  FTWSN +     ++ERLDRVC  + W   +P   V+HL Y  SDH P+ 
Subjt:  SSPWLVGGDFNAILFHHEKQGGRDKSESEMSAFRDALDDCGLIDIGYKGDCFTWSNGRPGLGNIKERLDRVCVTSAWNALFPDGMVEHLAYSRSDHRPLL

Query:  LTL-----------------------------VSKGNVNPSTLAE-------RNKRCLHFLSKWGRQYVGGYKKRIQEATNRVQHEMANLHIKENRTDLD
        L L                             +     +   +A+       +N+ C   L +W + +V   ++RI++   R+   M  L   + + +++
Subjt:  LTL-----------------------------VSKGNVNPSTLAE-------RNKRCLHFLSKWGRQYVGGYKKRIQEATNRVQHEMANLHIKENRTDLD

Query:  RAEGLLEKLLLEEEMYWKQRSRENWLRWGDRNTGWFHSRASHRRRNNRISGLENSSGEWIDNKGQVTSMISDYFETLFTSSNPSPSDISLAIAGVQNSVS
        + +  +EK   E +MYW+QRS+  W++ GDRNT +FH++A+ R R NR+  L++  G W +++  +  +IS+YFE LF+S+ PS  +I   +  V+N +S
Subjt:  RAEGLLEKLLLEEEMYWKQRSRENWLRWGDRNTGWFHSRASHRRRNNRISGLENSSGEWIDNKGQVTSMISDYFETLFTSSNPSPSDISLAIAGVQNSVS

Query:  AEMNRKLMRPFSEIDITTALRQIHPSKAPGPDGLPGSFYRVHWPTVGPDVVHACLQVLNNDISPGPLNDTLVTLIPKTKAPRKLSEFRPISLCNISYKLI
         E  + L  PF+  ++T A+ Q+ P K+PGPDGLP  FY  +W  +G DVV   L  LN+   P  LN T + LIPK K P K++++RPISLCN+ YK  
Subjt:  AEMNRKLMRPFSEIDITTALRQIHPSKAPGPDGLPGSFYRVHWPTVGPDVVHACLQVLNNDISPGPLNDTLVTLIPKTKAPRKLSEFRPISLCNISYKLI

Query:  SKAIVNRFKSVLNSVIAPNQSAFIPGRCVVDNAIMGYESLHALK
        +K + NR K VLN +I+P QSAF+P R + DN ++ YE  H +K
Subjt:  SKAIVNRFKSVLNSVIAPNQSAFIPGRCVVDNAIMGYESLHALK

XP_023914298.1 uncharacterized protein LOC112025844 [Quercus suber]2.5e-10539.78Show/hide
Query:  VFLSETRVCGARMNSIKVQLGYQNCFSVDCTGRSGGIALLWSTEVGFSLLSFSKNHIDGWV-NWDAMNWRFTGFYGNPQTELKHLSWGLLKNLRGSQSSP
        +FL+ET    +R+  +  +L + +C+     G+ G +AL W   V   ++S S NHID  V       WRF+G YG   T  K  +W L+++L    S P
Subjt:  VFLSETRVCGARMNSIKVQLGYQNCFSVDCTGRSGGIALLWSTEVGFSLLSFSKNHIDGWV-NWDAMNWRFTGFYGNPQTELKHLSWGLLKNLRGSQSSP

Query:  WLVGGDFNAILFHHEKQGGRDKSESEMSAFRDALDDCGLIDIGYKGDCFTWSNGRPGLGNIKERLDRVCVTSAWNALFPDGMVEHLAYSRSDHRPLLL--
        WL  GDFN IL+ HEK G   + ES M AFR+ LD+CGL+D+G+ GD FTW   R G G + ERLDR   ++AW ALFP   V HL    SDH+ +++  
Subjt:  WLVGGDFNAILFHHEKQGGRDKSESEMSAFRDALDDCGLIDIGYKGDCFTWSNGRPGLGNIKERLDRVCVTSAWNALFPDGMVEHLAYSRSDHRPLLL--

Query:  -------------------------TLVSKGNVN--PST---LAERNKRCLHFLSKWGRQYVGGYKKRIQ---EATNRVQHEMANLHIKENRTDLDRAEG
                                 T+VS  N +  P+T   +A + K+C   L+ W +   G  +K I+   +  ++ + ++A + +K+   ++ + + 
Subjt:  -------------------------TLVSKGNVN--PST---LAERNKRCLHFLSKWGRQYVGGYKKRIQ---EATNRVQHEMANLHIKENRTDLDRAEG

Query:  LLEKLLLEEEMYWKQRSRENWLRWGDRNTGWFHSRASHRRRNNRISGLENSSGEWIDNKGQVTSMISDYFETLFTSSNPSPSDISLAIAGVQNSVSAEMN
         L  LL +E + W+QR+R  +L+ GDRNT +FHS+ASHR R N+I GL NS+  W  ++ QV  + + YF +LF +S   PS++S+ +  V+ SV+ EMN
Subjt:  LLEKLLLEEEMYWKQRSRENWLRWGDRNTGWFHSRASHRRRNNRISGLENSSGEWIDNKGQVTSMISDYFETLFTSSNPSPSDISLAIAGVQNSVSAEMN

Query:  RKLMRPFSEIDITTALRQIHPSKAPGPDGLPGSFYRVHWPTVGPDVVHACLQVLNNDISPGPLNDTLVTLIPKTKAPRKLSEFRPISLCNISYKLISKAI
         +L+ PF + ++T AL Q+    APGPDG+P  FY   W  +G +V  A L  LNN   P  +N T +TLIPK K+P  +S++RPISLCN+ YKL+SK +
Subjt:  RKLMRPFSEIDITTALRQIHPSKAPGPDGLPGSFYRVHWPTVGPDVVHACLQVLNNDISPGPLNDTLVTLIPKTKAPRKLSEFRPISLCNISYKLISKAI

Query:  VNRFKSVLNSVIAPNQSAFIPGRCVVDNAIMGYESLHALKSRKLGRKG
         NRFKSVL SVI+ NQSAF  GR + DN +M YE+LH +K+ + G+ G
Subjt:  VNRFKSVLNSVIAPNQSAFIPGRCVVDNAIMGYESLHALKSRKLGRKG

XP_042962672.1 uncharacterized protein LOC122296942 [Carya illinoinensis]6.1e-10739.89Show/hide
Query:  MVFLSETRVCGARMNSIKVQLGYQNCFSVDCTGRSGGIALLWSTEVGFSLLSFSKNHIDGWV-NWDAMNWRFTGFYGNPQTELKHLSWGLLKNLRGSQSS
        ++FL ET++   R+   K++LG++ CF VD  GRSGG+ALLW  ++   ++++S +HI   + N D + W  TG YG+  +  +   W LLK L      
Subjt:  MVFLSETRVCGARMNSIKVQLGYQNCFSVDCTGRSGGIALLWSTEVGFSLLSFSKNHIDGWV-NWDAMNWRFTGFYGNPQTELKHLSWGLLKNLRGSQSS

Query:  PWLVGGDFNAILFHHEKQGGRDKSESEMSAFRDALDDCGLIDIGYKGDCFTWSNGRPGLGNIKERLDRVCVTSAWNALFPDGMVEHLAYSRSDHRPLLL-
        PW+V GDFN IL H EK GG  +S+ +M  FR+ L DC L D+GY G  FTWSN R     +KERLDR    S W  +FP+  V H   + SDH PL L 
Subjt:  PWLVGGDFNAILFHHEKQGGRDKSESEMSAFRDALDDCGLIDIGYKGDCFTWSNGRPGLGNIKERLDRVCVTSAWNALFPDGMVEHLAYSRSDHRPLLL-

Query:  ---TLVSK------------------------------GNVNPSTLAERNKRCLHFLSKWGRQYVGGYKKRIQEATNRVQHEMANLHIKENRTDLDRAEG
            LV +                              G ++   +  R   C   L +W +   G  +K +  A  R+Q    N   +    +  +A  
Subjt:  ---TLVSK------------------------------GNVNPSTLAERNKRCLHFLSKWGRQYVGGYKKRIQEATNRVQHEMANLHIKENRTDLDRAEG

Query:  LLEKLLLEEEMYWKQRSRENWLRWGDRNTGWFHSRASHRRRNNRISGLENSSGEWIDNKG-QVTSMISDYFETLFTSSNPSPSDISLAIAGVQNSVSAEM
         ++K L  +E+ WKQRSR  WLR GD N+ +FHS+AS RRR N I  L++ SG W   KG Q+  +I++YF+TLFT+++    D+   ++GV+  V+AEM
Subjt:  LLEKLLLEEEMYWKQRSRENWLRWGDRNTGWFHSRASHRRRNNRISGLENSSGEWIDNKG-QVTSMISDYFETLFTSSNPSPSDISLAIAGVQNSVSAEM

Query:  NRKLMRPFSEIDITTALRQIHPSKAPGPDGLPGSFYRVHWPTVGPDVVHACLQVLNNDISPGPLNDTLVTLIPKTKAPRKLSEFRPISLCNISYKLISKA
        N  L++P+   ++  AL+Q+HPSKAPGPDG+P  F++ +W  +G  +  A L  LN+ + P  LN T +TLIPK  +P K+++FRPISLCN+ YK++SK 
Subjt:  NRKLMRPFSEIDITTALRQIHPSKAPGPDGLPGSFYRVHWPTVGPDVVHACLQVLNNDISPGPLNDTLVTLIPKTKAPRKLSEFRPISLCNISYKLISKA

Query:  IVNRFKSVLNSVIAPNQSAFIPGRCVVDNAIMGYESLHALKSRKLGRKG
        I NR KSVL  +I+ +QSAF+PGR + DN ++ YE LH L++++ GRKG
Subjt:  IVNRFKSVLNSVIAPNQSAFIPGRCVVDNAIMGYESLHALKSRKLGRKG

TrEMBL top hitse value%identityAlignment
A0A2N9FP20 Reverse transcriptase domain-containing protein4.1e-10938.73Show/hide
Query:  MVFLSETRVCGARMNSIKVQLGYQNCFSVDCTGRSGGIALLWSTEVGFSLLSFSKNHIDGWVNWDA-MNWRFTGFYGNPQTELKHLSWGLLKNLRGSQSS
        ++FL ET++   RM  ++++L +++CF+V   GRSGG+ALLW+ EV  S+ +FS NHID  V +   + WRFTGFYGNP    +  SW LL+ LR     
Subjt:  MVFLSETRVCGARMNSIKVQLGYQNCFSVDCTGRSGGIALLWSTEVGFSLLSFSKNHIDGWVNWDA-MNWRFTGFYGNPQTELKHLSWGLLKNLRGSQSS

Query:  PWLVGGDFNAILFHHEKQGGRDKSESEMSAFRDALDDCGLIDIGYKGDCFTWSNGRPGLGNIKERLDRVCVTSAWNALFPDGMVEHLAYSRSDHRPLLL-
        PWL+ GDFN IL   E+ G    S++ M  F + L+ CGL+D+GY+G  FTW N R    N+++RLDR   + AW ++F    ++HL  S SDH P+LL 
Subjt:  PWLVGGDFNAILFHHEKQGGRDKSESEMSAFRDALDDCGLIDIGYKGDCFTWSNGRPGLGNIKERLDRVCVTSAWNALFPDGMVEHLAYSRSDHRPLLL-

Query:  ---------------TLVSKGNVNPS--------------------TLAERNKRCLHFLSKWGRQYVGGYKKRIQEATNRVQHEMANLHIKENRTDLDRA
                           K +++P                      + E+ K+C   L +W +     +  +IQE T  + + +A      N   +   
Subjt:  ---------------TLVSKGNVNPS--------------------TLAERNKRCLHFLSKWGRQYVGGYKKRIQEATNRVQHEMANLHIKENRTDLDRA

Query:  EGLLEKLLLEEEMYWKQRSRENWLRWGDRNTGWFHSRASHRRRNNRISGLENSSGEWIDNKGQVTSMISDYFETLFTSSNPSPSDISLAIAGVQNSVSAE
        +  + +LLL EE++W+QRSR  WL  GDRNT +FH  A+ R+R N I  L + +  W D + QV  +   YF+ +FT+S  SP+ I   +A V + VSAE
Subjt:  EGLLEKLLLEEEMYWKQRSRENWLRWGDRNTGWFHSRASHRRRNNRISGLENSSGEWIDNKGQVTSMISDYFETLFTSSNPSPSDISLAIAGVQNSVSAE

Query:  MNRKLMRPFSEIDITTALRQIHPSKAPGPDGLPGSFYRVHWPTVGPDVVHACLQVLNNDISPGPLNDTLVTLIPKTKAPRKLSEFRPISLCNISYKLISK
        +N++L+ P++  ++  AL Q+HPSKAPG DG+   F++ +W  VG  V  A L VLN+      +N T + LIPK ++P K+S++RPISLCN+ YK+ISK
Subjt:  MNRKLMRPFSEIDITTALRQIHPSKAPGPDGLPGSFYRVHWPTVGPDVVHACLQVLNNDISPGPLNDTLVTLIPKTKAPRKLSEFRPISLCNISYKLISK

Query:  AIVNRFKSVLNSVIAPNQSAFIPGRCVVDNAIMGYESLHALKSRKLGRKG
         I NR K+VL+ +I+ +QSAF+PGR + DN  + +E LH +K+++ G++G
Subjt:  AIVNRFKSVLNSVIAPNQSAFIPGRCVVDNAIMGYESLHALKSRKLGRKG

A0A2N9G933 Reverse transcriptase domain-containing protein4.1e-10938.73Show/hide
Query:  MVFLSETRVCGARMNSIKVQLGYQNCFSVDCTGRSGGIALLWSTEVGFSLLSFSKNHIDGWVNWDA-MNWRFTGFYGNPQTELKHLSWGLLKNLRGSQSS
        ++FL ET++   RM  ++++L +++CF+V   GRSGG+ALLW+ EV  S+ +FS NHID  V +   + WRFTGFYGNP    +  SW LL+ LR     
Subjt:  MVFLSETRVCGARMNSIKVQLGYQNCFSVDCTGRSGGIALLWSTEVGFSLLSFSKNHIDGWVNWDA-MNWRFTGFYGNPQTELKHLSWGLLKNLRGSQSS

Query:  PWLVGGDFNAILFHHEKQGGRDKSESEMSAFRDALDDCGLIDIGYKGDCFTWSNGRPGLGNIKERLDRVCVTSAWNALFPDGMVEHLAYSRSDHRPLLL-
        PWL+ GDFN IL   E+ G    S++ M  F + L+ CGL+D+GY+G  FTW N R    N+++RLDR   + AW ++F    ++HL  S SDH P+LL 
Subjt:  PWLVGGDFNAILFHHEKQGGRDKSESEMSAFRDALDDCGLIDIGYKGDCFTWSNGRPGLGNIKERLDRVCVTSAWNALFPDGMVEHLAYSRSDHRPLLL-

Query:  ---------------TLVSKGNVNPS--------------------TLAERNKRCLHFLSKWGRQYVGGYKKRIQEATNRVQHEMANLHIKENRTDLDRA
                           K +++P                      + E+ K+C   L +W +     +  +IQE T  + + +A      N   +   
Subjt:  ---------------TLVSKGNVNPS--------------------TLAERNKRCLHFLSKWGRQYVGGYKKRIQEATNRVQHEMANLHIKENRTDLDRA

Query:  EGLLEKLLLEEEMYWKQRSRENWLRWGDRNTGWFHSRASHRRRNNRISGLENSSGEWIDNKGQVTSMISDYFETLFTSSNPSPSDISLAIAGVQNSVSAE
        +  + +LLL EE++W+QRSR  WL  GDRNT +FH  A+ R+R N I  L + +  W D + QV  +   YF+ +FT+S  SP+ I   +A V + VSAE
Subjt:  EGLLEKLLLEEEMYWKQRSRENWLRWGDRNTGWFHSRASHRRRNNRISGLENSSGEWIDNKGQVTSMISDYFETLFTSSNPSPSDISLAIAGVQNSVSAE

Query:  MNRKLMRPFSEIDITTALRQIHPSKAPGPDGLPGSFYRVHWPTVGPDVVHACLQVLNNDISPGPLNDTLVTLIPKTKAPRKLSEFRPISLCNISYKLISK
        +N++L+ P++  ++  AL Q+HPSKAPG DG+   F++ +W  VG  V  A L VLN+      +N T + LIPK ++P K+S++RPISLCN+ YK+ISK
Subjt:  MNRKLMRPFSEIDITTALRQIHPSKAPGPDGLPGSFYRVHWPTVGPDVVHACLQVLNNDISPGPLNDTLVTLIPKTKAPRKLSEFRPISLCNISYKLISK

Query:  AIVNRFKSVLNSVIAPNQSAFIPGRCVVDNAIMGYESLHALKSRKLGRKG
         I NR K+VL+ +I+ +QSAF+PGR + DN  + +E LH +K+++ G++G
Subjt:  AIVNRFKSVLNSVIAPNQSAFIPGRCVVDNAIMGYESLHALKSRKLGRKG

A0A2N9IJF6 Uncharacterized protein1.2e-10840.18Show/hide
Query:  MVFLSETRVCGARMNSIKVQLGYQNCFSVDCTGRSGGIALLWSTEVGFSLLSFSKNHIDGWV-NWDAMNWRFTGFYGNPQTELKHLSWGLLKNLRGSQSS
        ++FL ET++   +M  I+++LG+QN F V   GRS G+ALLW  EV   + +F+ +HID  + + +   WR  GFYG P+ + K  SW LL+ L    S 
Subjt:  MVFLSETRVCGARMNSIKVQLGYQNCFSVDCTGRSGGIALLWSTEVGFSLLSFSKNHIDGWV-NWDAMNWRFTGFYGNPQTELKHLSWGLLKNLRGSQSS

Query:  PWLVGGDFNAILFHHEKQGGRDKSESEMSAFRDALDDCGLIDIGYKGDCFTWSNGRPGLGNIKERLDRVCVTSAWNALFPDGMVEHLAYSRSDHRPLLLT
        PWL  GDFN IL  +EK+G R +    M  FR+ ++ C  +D+GYKG  FTW+N R     +KERLDRV  T +W  LF    V HL  S+SDH P+L+ 
Subjt:  PWLVGGDFNAILFHHEKQGGRDKSESEMSAFRDALDDCGLIDIGYKGDCFTWSNGRPGLGNIKERLDRVCVTSAWNALFPDGMVEHLAYSRSDHRPLLLT

Query:  LVSK-----------------------------------GNVNPS-TLAERNKRCLHFLSKWGRQYVGGYKKRIQEATNRVQHEMANLHIKENRTDLDRA
          ++                                   G  +P   L E+ KRC   L++W +Q  GG + +I+ A       + N    +NR+ +   
Subjt:  LVSK-----------------------------------GNVNPS-TLAERNKRCLHFLSKWGRQYVGGYKKRIQEATNRVQHEMANLHIKENRTDLDRA

Query:  EGLLEKLLLEEEMYWKQRSRENWLRWGDRNTGWFHSRASHRRRNNRISGLENSSGEWIDNKGQVTSMISDYFETLFTSSNPSPSDISLAIAGVQNSVSAE
        +  +  LLL +E++WKQRSR  WL+ GD NT +FH+ A+ R+R N+I GL N  G+WI   GQ+ S+   YF+ +FTSS  +P  I  A+  V   V+ E
Subjt:  EGLLEKLLLEEEMYWKQRSRENWLRWGDRNTGWFHSRASHRRRNNRISGLENSSGEWIDNKGQVTSMISDYFETLFTSSNPSPSDISLAIAGVQNSVSAE

Query:  MNRKLMRPFSEIDITTALRQIHPSKAPGPDGLPGSFYRVHWPTVGPDVVHACLQVLNNDISPGPLNDTLVTLIPKTKAPRKLSEFRPISLCNISYKLISK
        MNR+L RPF+  ++  A+ Q+HPSK+PGPDG+   F++  W  VG +V+ A L VL+        N T + LIPK K P+++SEFRPISLCN+ +K+ISK
Subjt:  MNRKLMRPFSEIDITTALRQIHPSKAPGPDGLPGSFYRVHWPTVGPDVVHACLQVLNNDISPGPLNDTLVTLIPKTKAPRKLSEFRPISLCNISYKLISK

Query:  AIVNRFKSVLNSVIAPNQSAFIPGRCVVDNAIMGYESLHALKSRKLGRKG
         + NR K VL+SVI+  Q+AF+PGR + DN ++ YE +++LKS++ GR G
Subjt:  AIVNRFKSVLNSVIAPNQSAFIPGRCVVDNAIMGYESLHALKSRKLGRKG

A0A7N2LIH6 Uncharacterized protein2.8e-11039.45Show/hide
Query:  MVFLSETRVCGARMNSIKVQLGYQNCFSVDCTGRSGGIALLWSTEVGFSLLSFSKNHIDGWVNWDAMN--WRFTGFYGNPQTELKHLSWGLLKNLRGSQS
        +VFL ET+    +M   + +LG+     V   GRSGG+ALLW         S S +HID  V+       WR TGFYG+P T  ++ SW LL+ L     
Subjt:  MVFLSETRVCGARMNSIKVQLGYQNCFSVDCTGRSGGIALLWSTEVGFSLLSFSKNHIDGWVNWDAMN--WRFTGFYGNPQTELKHLSWGLLKNLRGSQS

Query:  SPWLVGGDFNAILFHHEKQGGRDKSESEMSAFRDALDDCGLIDIGYKGDCFTWSNGRPGLGNIKERLDRVCVTSAWNALFPDGMVEHLAYSRSDHRPLLL
         PWLV GDFN I+   EK G +D+  ++M AFR+ L  CGLID+G+ G  FTW NGR G      RLDR+    AW+ +FP+  V H++ S SDH  L L
Subjt:  SPWLVGGDFNAILFHHEKQGGRDKSESEMSAFRDALDDCGLIDIGYKGDCFTWSNGRPGLGNIKERLDRVCVTSAWNALFPDGMVEHLAYSRSDHRPLLL

Query:  TLVSKGNV---------------------------------NPSTLAERNKRCLHFLSKWGRQYVGGYKKRIQEATNRVQH-EMANLHIKENRTDLDRAE
         L    N                                  +   + ER +RC   L +W +   G   K I++  NR+Q  E  NL + E   ++   +
Subjt:  TLVSKGNV---------------------------------NPSTLAERNKRCLHFLSKWGRQYVGGYKKRIQEATNRVQH-EMANLHIKENRTDLDRAE

Query:  GLLEKLLLEEEMYWKQRSRENWLRWGDRNTGWFHSRASHRRRNNRISGLENSSGEWIDNKGQVTSMISDYFETLFTSSNPSPSDISLAIAGVQNSVSAEM
          + +L   EE+ WKQRSR +WL++GD+N+ +FH+ AS RR+ NRI GL +  G W +++     +I DYF+ +++S+ P+  D+SL    +   V+ EM
Subjt:  GLLEKLLLEEEMYWKQRSRENWLRWGDRNTGWFHSRASHRRRNNRISGLENSSGEWIDNKGQVTSMISDYFETLFTSSNPSPSDISLAIAGVQNSVSAEM

Query:  NRKLMRPFSEIDITTALRQIHPSKAPGPDGLPGSFYRVHWPTVGPDVVHACLQVLNNDISPGPLNDTLVTLIPKTKAPRKLSEFRPISLCNISYKLISKA
        N +L + F  +++  AL+Q+HP+KAPGPDG+   FY+ +W  VG  V +  LQ LN+ + P  +N T + LIPKTK P+K++EFRPISLCN+ YK+ISK 
Subjt:  NRKLMRPFSEIDITTALRQIHPSKAPGPDGLPGSFYRVHWPTVGPDVVHACLQVLNNDISPGPLNDTLVTLIPKTKAPRKLSEFRPISLCNISYKLISKA

Query:  IVNRFKSVLNSVIAPNQSAFIPGRCVVDNAIMGYESLHALKSRKLGRKGL
        + NR K VL+ VI   QSAF+PGR + DN I+ +ES+H++  R+ G++GL
Subjt:  IVNRFKSVLNSVIAPNQSAFIPGRCVVDNAIMGYESLHALKSRKLGRKGL

A0A803P4U9 Uncharacterized protein4.1e-10940.8Show/hide
Query:  MVFLSETRVCGARMNSIKVQLGYQNCFSVDCTGRSGGIALLWSTEVGFSLLSFSKNHIDGWVNWD-AMNWRFTGFYGNPQTELKHLSWGLLKNLRGSQSS
        M+FLSETR+  + M  I+VQLG++ CFSV   G+SGG+ALLW+  V   + SF+ +HID  V  D    WRFTGFYG+P    +  SW LLK L+     
Subjt:  MVFLSETRVCGARMNSIKVQLGYQNCFSVDCTGRSGGIALLWSTEVGFSLLSFSKNHIDGWVNWD-AMNWRFTGFYGNPQTELKHLSWGLLKNLRGSQSS

Query:  PWLVGGDFNAILFHHEKQGGRDKSESEMSAFRDALDDCGLIDIGYKGDCFTWSNGRPGLGNIKERLDRVCVTSAWNALFPDGMVEHLAYSRSDHRPLLLT
         W+ GGDFN I  + EK+GG  K    M  FR  + +C L ++  +G  FTW NGR     I E+LDR+   S W   F    V  L +  SDHRPL LT
Subjt:  PWLVGGDFNAILFHHEKQGGRDKSESEMSAFRDALDDCGLIDIGYKGDCFTWSNGRPGLGNIKERLDRVCVTSAWNALFPDGMVEHLAYSRSDHRPLLLT

Query:  L----------VSKGN----------------------------VNPSTLAERNKRCLHFLSKWGRQYVGGYKKRIQEATNRVQHEMANLHIKENRTDLD
                   +++G+                             N S L    + C   L KW ++      KRI+E  +++   ++  + + +   + 
Subjt:  L----------VSKGN----------------------------VNPSTLAERNKRCLHFLSKWGRQYVGGYKKRIQEATNRVQHEMANLHIKENRTDLD

Query:  RAEGLLEKLLLEEEMYWKQRSRENWLRWGDRNTGWFHSRASHRRRNNRISGLENSSGEWIDNKGQVTSMISDYFETLFTSSNPSPSDISLAIAGVQNSVS
        + E  L  +  + EMYWKQRSR  WL+ GDRNT +FH +AS R+R N I GL +    W  +  +++ +  +YF+TLF+ SN       + +  V N +S
Subjt:  RAEGLLEKLLLEEEMYWKQRSRENWLRWGDRNTGWFHSRASHRRRNNRISGLENSSGEWIDNKGQVTSMISDYFETLFTSSNPSPSDISLAIAGVQNSVS

Query:  AEMNRKLMRPFSEIDITTALRQIHPSKAPGPDGLPGSFYRVHWPTVGPDVVHACLQVLNNDISPGPLNDTLVTLIPKTKAPRKLSEFRPISLCNISYKLI
        +E NR L++PF E ++  A+ QIHP KAPG DGLPG F++ +W  VG +V  ACL VLNN      LN+TL+ LIPKTK P K+SEFRPISLCN+ YK++
Subjt:  AEMNRKLMRPFSEIDITTALRQIHPSKAPGPDGLPGSFYRVHWPTVGPDVVHACLQVLNNDISPGPLNDTLVTLIPKTKAPRKLSEFRPISLCNISYKLI

Query:  SKAIVNRFKSVLNSVIAPNQSAFIPGRCVVDNAIMGYESLHALKSRKLG
        SK + NR K  LN+ I+ NQSAFI GR + DNAI+G+ESLH ++  + G
Subjt:  SKAIVNRFKSVLNSVIAPNQSAFIPGRCVVDNAIMGYESLHALKSRKLG

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein3.4e-1222.52Show/hide
Query:  KENRTDLDRAEGLLEKLLLEEEMYWKQRSR---------------ENWLRWGDRNTGWFHSRAS-----------HRRRNNRISGLENSSGEWIDNKGQV
        K+ R+ +D     L++L  +E+ + K   R               +  L+  + +  WF  R +            +R  N+I  ++N  G+   +  ++
Subjt:  KENRTDLDRAEGLLEKLLLEEEMYWKQRSR---------------ENWLRWGDRNTGWFHSRAS-----------HRRRNNRISGLENSSGEWIDNKGQV

Query:  TSMISDYFETLFTSSNPSPSDISLAI-AGVQNSVSAEMNRKLMRPFSEIDITTALRQIHPSKAPGPDGLPGSFYRVHWPTVGPDVVHACLQVLNNDISPG
         + I +Y++ L+ +   +  ++   +       ++ E    L RP +  +I   +  +   K+PGPDG    FY+ +   + P ++     +    I P 
Subjt:  TSMISDYFETLFTSSNPSPSDISLAI-AGVQNSVSAEMNRKLMRPFSEIDITTALRQIHPSKAPGPDGLPGSFYRVHWPTVGPDVVHACLQVLNNDISPG

Query:  PLNDTLVTLIPKT-KAPRKLSEFRPISLCNISYKLISKAIVNRFKSVLNSVIAPNQSAFIPG
           +  + LIPK  +   K   FRPISL NI  K+++K + NR +  +  +I  +Q  FIPG
Subjt:  PLNDTLVTLIPKT-KAPRKLSEFRPISLCNISYKLISKAIVNRFKSVLNSVIAPNQSAFIPG

P08548 LINE-1 reverse transcriptase homolog2.5e-1526.34Show/hide
Query:  RRRNNRISGLENSSGEWIDNKGQVTSMISDYFETLFTSSNPSPSDISLAIAGVQ-NSVSAEMNRKLMRPFSEIDITTALRQIHPSKAPGPDGLPGSFYRV
        +R  + IS + N + E   +  ++  ++++Y++ L++    +  +I   +       +S +    L RP S  +I + ++ +   K+PGPDG    FY+ 
Subjt:  RRRNNRISGLENSSGEWIDNKGQVTSMISDYFETLFTSSNPSPSDISLAIAGVQ-NSVSAEMNRKLMRPFSEIDITTALRQIHPSKAPGPDGLPGSFYRV

Query:  HWPTVGPDVVHACLQVLNNDISPGPLNDTLVTLIPKT-KAPRKLSEFRPISLCNISYKLISKAIVNRFKSVLNSVIAPNQSAFIPG
            + P +++    +    I P    +  +TLIPK  K P +   +RPISL NI  K+++K + NR +  +  +I  +Q  FIPG
Subjt:  HWPTVGPDVVHACLQVLNNDISPGPLNDTLVTLIPKT-KAPRKLSEFRPISLCNISYKLISKAIVNRFKSVLNSVIAPNQSAFIPG

P11369 LINE-1 retrotransposable element ORF2 protein1.4e-1326.11Show/hide
Query:  ISGLENSSGEWIDNKGQVTSMISDYFETLFTSSNPSPSDISLAIAGVQ-NSVSAEMNRKLMRPFSEIDITTALRQIHPSKAPGPDGLPGSFYRVHWPTVG
        I+ + N  G+   +  ++ + I  +++ L+++   +  ++   +   Q   ++ +    L  P S  +I   +  +   K+PGPDG    FY+     + 
Subjt:  ISGLENSSGEWIDNKGQVTSMISDYFETLFTSSNPSPSDISLAIAGVQ-NSVSAEMNRKLMRPFSEIDITTALRQIHPSKAPGPDGLPGSFYRVHWPTVG

Query:  PDVVHACLQVLNNDISPGPLNDTLVTLIPK-TKAPRKLSEFRPISLCNISYKLISKAIVNRFKSVLNSVIAPNQSAFIPG
        P +     ++      P    +  +TLIPK  K P K+  FRPISL NI  K+++K + NR +  + ++I P+Q  FIPG
Subjt:  PDVVHACLQVLNNDISPGPLNDTLVTLIPK-TKAPRKLSEFRPISLCNISYKLISKAIVNRFKSVLNSVIAPNQSAFIPG

P14381 Transposon TX1 uncharacterized 149 kDa protein1.3e-1929.28Show/hide
Query:  RSRENWLRWGDRNTGWFHSRASHRRRNNRISGLENSSGEWIDNKGQVTSMISDYFETLFTSSNPSPSDISLAIAGVQNSVSAEMNRKLMRPFSEIDITTA
        RSR   L   DR + +F++    +    +I+ L    G  +++   +      +++ LF+    SP        G+   VS     +L  P +  +++ A
Subjt:  RSRENWLRWGDRNTGWFHSRASHRRRNNRISGLENSSGEWIDNKGQVTSMISDYFETLFTSSNPSPSDISLAIAGVQNSVSAEMNRKLMRPFSEIDITTA

Query:  LRQIHPSKAPGPDGLPGSFYRVHWPTVGPDVVHACLQVLNNDISPGPLNDTLVTLIPKTKAPRKLSEFRPISLCNISYKLISKAIVNRFKSVLNSVIAPN
        LR +  +K+PG DGL   F++  W T+GPD      +       P      +++L+PK    R +  +RP+SL +  YK+++KAI  R KSVL  VI P+
Subjt:  LRQIHPSKAPGPDGLPGSFYRVHWPTVGPDVVHACLQVLNNDISPGPLNDTLVTLIPKTKAPRKLSEFRPISLCNISYKLISKAIVNRFKSVLNSVIAPN

Query:  QSAFIPGRCVVDNAIMGYESLH
        QS  +PGR + DN  +  + LH
Subjt:  QSAFIPGRCVVDNAIMGYESLH

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein2.2e-2224.8Show/hide
Query:  MSAFRDALDDCGLIDIGYKGDCFTWSNGRPGLGNIKERLDRVCVTSAWNALFPDGMVEHLAYSRSDHRPLLLTLVSKGNVNPSTLAERNKRCLHFLS---
        +  F++ L D  L+DI  +G  +TWSN +     I  +LDR      W + FP  +        SDH P ++ L          L +R+K+C  + S   
Subjt:  MSAFRDALDDCGLIDIGYKGDCFTWSNGRPGLGNIKERLDRVCVTSAWNALFPDGMVEHLAYSRSDHRPLLLTLVSKGNVNPSTLAERNKRCLHFLS---

Query:  -----------KW---------------------------GRQYVGGYKKRIQEATNRVQHEMANLHIKENRTDLDRAEGLLEK----LLLEEEMYWKQR
                    W                            RQ  G  + + +EA + ++   + L +      L R E +  K         E +++Q+
Subjt:  -----------KW---------------------------GRQYVGGYKKRIQEATNRVQHEMANLHIKENRTDLDRAEGLLEK----LLLEEEMYWKQR

Query:  SRENWLRWGDRNTGWFHSRASHRRRNNRISGLENSSGEWIDNKGQVTSMISDYFETLFTSSNP--SPSDISLAIAGVQNSVSAEMNRKLMRPFSEIDITT
        SR  WL+ GD NT +FH      +  N I  L       ++N  QV  MI  Y+  L  S +   +P  +           +  +  +L    S+ +IT 
Subjt:  SRENWLRWGDRNTGWFHSRASHRRRNNRISGLENSSGEWIDNKGQVTSMISDYFETLFTSSNP--SPSDISLAIAGVQNSVSAEMNRKLMRPFSEIDITT

Query:  ALRQIHPSKAPGPDGLPGSFYRVHWPTVGPDVVHACLQVLNNDISPGPLNDTLVTLIPKTKAPRKLSEFRPISLCNISYKLIS
        A+  +  +KAPGPD     F+   W  V    + A  +           N T +TLIPK     +LS FRP+S C + YK+I+
Subjt:  ALRQIHPSKAPGPDGLPGSFYRVHWPTVGPDVVHACLQVLNNDISPGPLNDTLVTLIPKTKAPRKLSEFRPISLCNISYKLIS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGTTCCTTTCAGAAACACGGGTGTGTGGAGCTAGAATGAACAGCATCAAAGTTCAATTGGGTTACCAAAACTGTTTTAGTGTGGACTGCACTGGAAGAAGTGGAGG
GATTGCTCTTCTATGGTCAACAGAGGTCGGGTTCTCTCTCCTCTCTTTCTCCAAAAACCATATTGACGGCTGGGTTAATTGGGACGCAATGAACTGGCGTTTTACTGGAT
TTTATGGGAATCCTCAGACAGAATTGAAACATTTATCTTGGGGCCTTTTAAAAAACTTACGTGGTAGTCAATCATCCCCATGGCTAGTGGGGGGTGATTTTAATGCAATT
CTATTTCATCATGAAAAGCAGGGAGGCAGAGATAAATCTGAGTCTGAAATGTCTGCGTTTCGTGATGCTTTAGATGATTGTGGGCTGATAGATATTGGTTATAAAGGGGA
CTGTTTCACTTGGTCTAATGGTCGGCCTGGCCTGGGAAATATTAAGGAGCGGTTGGATCGTGTTTGTGTGACTTCTGCTTGGAATGCTCTCTTTCCAGACGGAATGGTGG
AGCATCTCGCCTATAGCCGCTCTGACCACCGGCCACTTCTGTTGACGTTGGTCTCTAAGGGTAATGTTAATCCTTCCACTCTTGCTGAAAGAAACAAACGTTGCCTTCAT
TTTCTTTCAAAGTGGGGCCGACAGTACGTAGGCGGCTATAAAAAGCGTATTCAGGAAGCTACAAATCGTGTGCAGCATGAGATGGCCAATTTGCATATAAAGGAAAATAG
AACTGACCTGGATAGAGCAGAGGGATTGTTGGAAAAACTCCTTCTGGAAGAGGAAATGTATTGGAAACAACGATCACGGGAAAACTGGCTTCGCTGGGGGGATAGAAACA
CGGGATGGTTCCACTCTCGCGCATCCCATCGTCGGCGAAATAACAGAATTTCAGGTTTGGAGAATTCATCAGGCGAATGGATTGATAACAAAGGACAGGTGACAAGTATG
ATTTCTGATTATTTTGAAACTTTATTCACATCATCTAATCCTTCACCATCTGATATTAGTTTGGCCATTGCAGGAGTCCAAAATTCAGTTTCTGCAGAAATGAACAGAAA
GCTTATGCGTCCATTTTCTGAGATTGATATTACTACTGCTCTTCGACAAATTCATCCATCCAAGGCCCCTGGGCCCGATGGTCTCCCTGGTTCCTTCTATCGTGTTCATT
GGCCAACAGTGGGTCCGGATGTTGTACATGCATGCCTCCAAGTTCTTAACAACGACATCTCACCTGGCCCACTAAATGATACTCTGGTTACTCTTATTCCAAAAACGAAG
GCGCCCAGGAAATTATCAGAGTTTAGGCCCATATCGCTATGTAATATTTCGTATAAATTGATCTCCAAGGCCATTGTCAATCGATTCAAAAGTGTTCTAAATTCAGTTAT
CGCACCAAATCAGAGTGCTTTTATTCCAGGTCGATGTGTTGTTGACAATGCAATTATGGGATATGAGAGTCTCCATGCGTTAAAGTCACGAAAGTTGGGTCGAAAAGGGT
TGGGCGGCTCTCAAATTAGATATGAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTGTTCCTTTCAGAAACACGGGTGTGTGGAGCTAGAATGAACAGCATCAAAGTTCAATTGGGTTACCAAAACTGTTTTAGTGTGGACTGCACTGGAAGAAGTGGAGG
GATTGCTCTTCTATGGTCAACAGAGGTCGGGTTCTCTCTCCTCTCTTTCTCCAAAAACCATATTGACGGCTGGGTTAATTGGGACGCAATGAACTGGCGTTTTACTGGAT
TTTATGGGAATCCTCAGACAGAATTGAAACATTTATCTTGGGGCCTTTTAAAAAACTTACGTGGTAGTCAATCATCCCCATGGCTAGTGGGGGGTGATTTTAATGCAATT
CTATTTCATCATGAAAAGCAGGGAGGCAGAGATAAATCTGAGTCTGAAATGTCTGCGTTTCGTGATGCTTTAGATGATTGTGGGCTGATAGATATTGGTTATAAAGGGGA
CTGTTTCACTTGGTCTAATGGTCGGCCTGGCCTGGGAAATATTAAGGAGCGGTTGGATCGTGTTTGTGTGACTTCTGCTTGGAATGCTCTCTTTCCAGACGGAATGGTGG
AGCATCTCGCCTATAGCCGCTCTGACCACCGGCCACTTCTGTTGACGTTGGTCTCTAAGGGTAATGTTAATCCTTCCACTCTTGCTGAAAGAAACAAACGTTGCCTTCAT
TTTCTTTCAAAGTGGGGCCGACAGTACGTAGGCGGCTATAAAAAGCGTATTCAGGAAGCTACAAATCGTGTGCAGCATGAGATGGCCAATTTGCATATAAAGGAAAATAG
AACTGACCTGGATAGAGCAGAGGGATTGTTGGAAAAACTCCTTCTGGAAGAGGAAATGTATTGGAAACAACGATCACGGGAAAACTGGCTTCGCTGGGGGGATAGAAACA
CGGGATGGTTCCACTCTCGCGCATCCCATCGTCGGCGAAATAACAGAATTTCAGGTTTGGAGAATTCATCAGGCGAATGGATTGATAACAAAGGACAGGTGACAAGTATG
ATTTCTGATTATTTTGAAACTTTATTCACATCATCTAATCCTTCACCATCTGATATTAGTTTGGCCATTGCAGGAGTCCAAAATTCAGTTTCTGCAGAAATGAACAGAAA
GCTTATGCGTCCATTTTCTGAGATTGATATTACTACTGCTCTTCGACAAATTCATCCATCCAAGGCCCCTGGGCCCGATGGTCTCCCTGGTTCCTTCTATCGTGTTCATT
GGCCAACAGTGGGTCCGGATGTTGTACATGCATGCCTCCAAGTTCTTAACAACGACATCTCACCTGGCCCACTAAATGATACTCTGGTTACTCTTATTCCAAAAACGAAG
GCGCCCAGGAAATTATCAGAGTTTAGGCCCATATCGCTATGTAATATTTCGTATAAATTGATCTCCAAGGCCATTGTCAATCGATTCAAAAGTGTTCTAAATTCAGTTAT
CGCACCAAATCAGAGTGCTTTTATTCCAGGTCGATGTGTTGTTGACAATGCAATTATGGGATATGAGAGTCTCCATGCGTTAAAGTCACGAAAGTTGGGTCGAAAAGGGT
TGGGCGGCTCTCAAATTAGATATGAGTAA
Protein sequenceShow/hide protein sequence
MVFLSETRVCGARMNSIKVQLGYQNCFSVDCTGRSGGIALLWSTEVGFSLLSFSKNHIDGWVNWDAMNWRFTGFYGNPQTELKHLSWGLLKNLRGSQSSPWLVGGDFNAI
LFHHEKQGGRDKSESEMSAFRDALDDCGLIDIGYKGDCFTWSNGRPGLGNIKERLDRVCVTSAWNALFPDGMVEHLAYSRSDHRPLLLTLVSKGNVNPSTLAERNKRCLH
FLSKWGRQYVGGYKKRIQEATNRVQHEMANLHIKENRTDLDRAEGLLEKLLLEEEMYWKQRSRENWLRWGDRNTGWFHSRASHRRRNNRISGLENSSGEWIDNKGQVTSM
ISDYFETLFTSSNPSPSDISLAIAGVQNSVSAEMNRKLMRPFSEIDITTALRQIHPSKAPGPDGLPGSFYRVHWPTVGPDVVHACLQVLNNDISPGPLNDTLVTLIPKTK
APRKLSEFRPISLCNISYKLISKAIVNRFKSVLNSVIAPNQSAFIPGRCVVDNAIMGYESLHALKSRKLGRKGLGGSQIRYE