; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lcy06g016600 (gene) of Sponge gourd (P93075) v1 genome

Gene IDLcy06g016600
OrganismLuffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationChr06:38057878..38059640
RNA-Seq ExpressionLcy06g016600
SyntenyLcy06g016600
Gene Ontology termsNA
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022132681.1 uncharacterized protein LOC111005481 [Momordica charantia]8.2e-4136.29Show/hide
Query:  MDPSALIDEWDRLSLTAEEEEVSVVADREAVDWSGLLLGFFLLGKLLCHRSLGAEVMRRSFKAAWKINLN-LQVDRLGHNMFIFRFANEADKVRVLRQGS
        M  S L++EW    LT+EE++++V  D  A++ +G  L   L+ KLL  RS+   V++ + K AWK++     VD +G N+F+F F   +D+ R+LR G 
Subjt:  MDPSALIDEWDRLSLTAEEEEVSVVADREAVDWSGLLLGFFLLGKLLCHRSLGAEVMRRSFKAAWKINLN-LQVDRLGHNMFIFRFANEADKVRVLRQGS

Query:  WCFEKFLLILEVPIRGLKPSDHPYSFAVFWIHVFDLPLDWFNRTMAERIGNAMGVFEDVDARNGFMFWGASLRIMVRIDLTRPLRRGIQI----------
        W F++ L+I++ P+   KP D  +     W+H FDL L   N+TMA R+GNA+G+FEDV++      WG+ LR+ VR D+ +PL RGI++          
Subjt:  WCFEKFLLILEVPIRGLKPSDHPYSFAVFWIHVFDLPLDWFNRTMAERIGNAMGVFEDVDARNGFMFWGASLRIMVRIDLTRPLRRGIQI----------

Query:  -------YPDVLLVACGCLS--VRDCVHMLQSDHGANPPLQYGDWMRFTGKGMALNVLA
                PD     CG L   ++DC          N  LQYG W+RF G   + N+L+
Subjt:  -------YPDVLLVACGCLS--VRDCVHMLQSDHGANPPLQYGDWMRFTGKGMALNVLA

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia]2.1e-4439.2Show/hide
Query:  MDPSALIDEWDRLSLTAEEEEVSVVADREAVDWSGLLLGFFLLGKLLCHRSLGAEVMRRSFKAAWKINLNLQVDRLGHNMFIFRFANEADKVRVLRQGSW
        MD   L+ +W +  LT+EE+E+++  D +AV  +   L + L+GKLL  R + A+V+ R    AWK+   L V+ +G N+F+F F  E D  RV++ G W
Subjt:  MDPSALIDEWDRLSLTAEEEEVSVVADREAVDWSGLLLGFFLLGKLLCHRSLGAEVMRRSFKAAWKINLNLQVDRLGHNMFIFRFANEADKVRVLRQGSW

Query:  CFEKFLLILEVPIRGLKPSDHPYSFAVFWIHVFDLPLDWFNRTMAERIGNAMGVFEDVDARNGFMFWGASLRIMVRIDLTRPLRRGIQI-----------
         F+K L++L+ P      S+  ++   FWIH+FDLP+ W N+TMA R+GNA+G F DVD       WGASLRI V ID+T+PLRRGI+I           
Subjt:  CFEKFLLILEVPIRGLKPSDHPYSFAVFWIHVFDLPLDWFNRTMAERIGNAMGVFEDVDARNGFMFWGASLRIMVRIDLTRPLRRGIQI-----------

Query:  ------YPDVLLVACGCL--SVRDC-VHMLQSDHGANPPLQYGDWMRFTG
               PD     CG +  S  DC    L +   +    +YG W+RF G
Subjt:  ------YPDVLLVACGCL--SVRDC-VHMLQSDHGANPPLQYGDWMRFTG

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]4.4e-3434.63Show/hide
Query:  LIDEWDRLSLTAEEEEVSVVADREAVDWSGLLLGFFLLGKLLCHRSLGAEVMRRSFKAAWKINLN-LQVDRLGHNMFIFRFANEADKVRVLRQGSWCFEK
        L++EW    LT+EEEE ++  D  A   +G  L   L+GKL   R +   VM+ + + AWK+  N  +V  LG+N+F+F FA   D+ ++ + G W F++
Subjt:  LIDEWDRLSLTAEEEEVSVVADREAVDWSGLLLGFFLLGKLLCHRSLGAEVMRRSFKAAWKINLN-LQVDRLGHNMFIFRFANEADKVRVLRQGSWCFEK

Query:  FLLILEVPIRGLKPSDHPYSFAVFWIHVFDLPLDWFNRTMAERIGNAMGVFEDVDARNGFMFWGASLRIMVRIDLTRPLRRGIQIYPDVLL----VACGC
         L+++  P+  + PS+  ++    W+  FDLPL    R MA R+GNA+G FE+ D  +    WG++LR+ V +D+++PLRRGI++  D  +    +    
Subjt:  FLLILEVPIRGLKPSDHPYSFAVFWIHVFDLPLDWFNRTMAERIGNAMGVFEDVDARNGFMFWGASLRIMVRIDLTRPLRRGIQIYPDVLL----VACGC

Query:  LSVRD-CVHMLQSDHGANPPLQYGDWMRFTG
          + D C H   S   +    QYG W+R+ G
Subjt:  LSVRD-CVHMLQSDHGANPPLQYGDWMRFTG

XP_028122006.1 uncharacterized protein LOC114319195 [Camellia sinensis]1.0e-3035.66Show/hide
Query:  ALIDEWDRLSLTAEEEEVSVVADREAVDWSGLLLG---FFLLGKLLCHRSLGAEVMRRSFKAAWKINLNLQVDRLGHNMFIFRFANEADKVRVLRQGSWC
        +L+D    LSLT+EE+ V  +        + L++G     L+GKLL  R    E M+ +  + W+    +QV  +G N+F+F F +  DK RVL  G W 
Subjt:  ALIDEWDRLSLTAEEEEVSVVADREAVDWSGLLLG---FFLLGKLLCHRSLGAEVMRRSFKAAWKINLNLQVDRLGHNMFIFRFANEADKVRVLRQGSWC

Query:  FEKFLLILEVPIRGLKPSDHPYSFAVFWIHVFDLPLDWFNRTMAERIGNAMGVFEDVDARNGFMFWGASLRIMVRIDLTRPLRRGIQ--------IYPD-
        F+K LL+L      ++PSD   +   FW+HV +LPL   N+ + E +GNA+G F D+D  +G + WG ++RI V +D+ +PLRRG++        I+ D 
Subjt:  FEKFLLILEVPIRGLKPSDHPYSFAVFWIHVFDLPLDWFNRTMAERIGNAMGVFEDVDARNGFMFWGASLRIMVRIDLTRPLRRGIQ--------IYPD-

Query:  ------VLLVACGCL--SVRDCVHMLQSDHGAN-PPLQYGDWMR
              +    CG L  S R+C   L S  G+    LQYG W+R
Subjt:  ------VLLVACGCL--SVRDCVHMLQSDHGAN-PPLQYGDWMR

XP_028124075.1 uncharacterized protein LOC114321128 [Camellia sinensis]3.8e-3036.07Show/hide
Query:  ALIDEWDRLSLTAEEEEVSVVADREAVDWSGLLLG---FFLLGKLLCHRSLGAEVMRRSFKAAWKINLNLQVDRLGHNMFIFRFANEADKVRVLRQGSWC
        +L+D    LSLT+EE+ V     R   D + L++G     L+GKLL  R    E M+ +  + W+    +QV  +G N+F+F F +  DK RVL  G W 
Subjt:  ALIDEWDRLSLTAEEEEVSVVADREAVDWSGLLLG---FFLLGKLLCHRSLGAEVMRRSFKAAWKINLNLQVDRLGHNMFIFRFANEADKVRVLRQGSWC

Query:  FEKFLLILEVPIRGLKPSDHPYSFAVFWIHVFDLPLDWFNRTMAERIGNAMGVFEDVDARNGFMFWGASLRIMVRIDLTRPLRRGIQ--------IYPD-
        F+K LL+L      ++PSD   +   FW+HV +LPL   N+ + + +GNA+G F D+D  +G + WG ++RI V ID+ +PLRRG++        I+ D 
Subjt:  FEKFLLILEVPIRGLKPSDHPYSFAVFWIHVFDLPLDWFNRTMAERIGNAMGVFEDVDARNGFMFWGASLRIMVRIDLTRPLRRGIQ--------IYPD-

Query:  ------VLLVACGCL--SVRDCVHMLQSDHGAN-PPLQYGDWMR
              +    CG L  S R+C   L    G     LQYG W+R
Subjt:  ------VLLVACGCL--SVRDCVHMLQSDHGAN-PPLQYGDWMR

TrEMBL top hitse value%identityAlignment
A0A1R3GTB5 Reverse transcriptase1.8e-2533.47Show/hide
Query:  LIDEWDRLSLTAEEEEVSVVADREAVDWSGLLLGFFLLGKLLCHRSLGAEVMRRSFKAAWKINLNLQVDRLGHNMFIFRFANEADKVRVLRQGSWCFEKF
        L D W+  +LT EEE + V  D   VD +       L+GKLL  R +  EVMR      WK+   LQV  +G N+FIF+F ++ +K RV +Q  W F K 
Subjt:  LIDEWDRLSLTAEEEEVSVVADREAVDWSGLLLGFFLLGKLLCHRSLGAEVMRRSFKAAWKINLNLQVDRLGHNMFIFRFANEADKVRVLRQGSWCFEKF

Query:  LLILEVPIRGLKPSDHPYSFAVFWIHVFDLPLDWFNRTMAERIGNAMGVFEDVDARNGFMFWGASLRIMVRIDLTRPLRRGI----------------QI
        LL+L+         D       FW    DLPL + N ++   IG + G  E++D     + WG  LR   R+++T+PLRRG+                + 
Subjt:  LLILEVPIRGLKPSDHPYSFAVFWIHVFDLPLDWFNRTMAERIGNAMGVFEDVDARNGFMFWGASLRIMVRIDLTRPLRRGI----------------QI

Query:  YPDVLLVACGCLS--VRDC--VHMLQSDHGANPPLQYGDWMR
         PD   V CGCL+    +C    +++ D G     +YG W+R
Subjt:  YPDVLLVACGCLS--VRDC--VHMLQSDHGANPPLQYGDWMR

A0A1R3JW24 Reverse transcriptase2.0e-2432.11Show/hide
Query:  LLGKLLCHRSLGAEVMRRSFKAAWKINLNLQVDRLGHNMFIFRFANEADKVRVLRQGSWCFEKFLLILEVPIRGLKPSDHPYSFAVFWIHVFDLPLDWFN
        ++G+LL  RS   + M  + K  WK+     +  L  N+F+F+FA+EAD  RVL    W F+K LL+       L P D+ ++ A FWI +++LPL   N
Subjt:  LLGKLLCHRSLGAEVMRRSFKAAWKINLNLQVDRLGHNMFIFRFANEADKVRVLRQGSWCFEKFLLILEVPIRGLKPSDHPYSFAVFWIHVFDLPLDWFN

Query:  RTMAERIGNAMGVFEDVDARNGFMFWGASLRIMVRIDLTRPLRRGIQI------------YPDVLLVACGCLSVRDCVHMLQSDHGAN--PPLQYGDWMR
          +AE++G  MG    VD       W   LR+ V ID+T+PLRR I +            Y  + +  C C  +  C    +   G N    + YG+W+R
Subjt:  RTMAERIGNAMGVFEDVDARNGFMFWGASLRIMVRIDLTRPLRRGIQI------------YPDVLLVACGCLSVRDCVHMLQSDHGAN--PPLQYGDWMR

Query:  FTGKGMALNVLAARVVDR
         +     L    +RV  R
Subjt:  FTGKGMALNVLAARVVDR

A0A6J1BSZ1 uncharacterized protein LOC1110054814.0e-4136.29Show/hide
Query:  MDPSALIDEWDRLSLTAEEEEVSVVADREAVDWSGLLLGFFLLGKLLCHRSLGAEVMRRSFKAAWKINLN-LQVDRLGHNMFIFRFANEADKVRVLRQGS
        M  S L++EW    LT+EE++++V  D  A++ +G  L   L+ KLL  RS+   V++ + K AWK++     VD +G N+F+F F   +D+ R+LR G 
Subjt:  MDPSALIDEWDRLSLTAEEEEVSVVADREAVDWSGLLLGFFLLGKLLCHRSLGAEVMRRSFKAAWKINLN-LQVDRLGHNMFIFRFANEADKVRVLRQGS

Query:  WCFEKFLLILEVPIRGLKPSDHPYSFAVFWIHVFDLPLDWFNRTMAERIGNAMGVFEDVDARNGFMFWGASLRIMVRIDLTRPLRRGIQI----------
        W F++ L+I++ P+   KP D  +     W+H FDL L   N+TMA R+GNA+G+FEDV++      WG+ LR+ VR D+ +PL RGI++          
Subjt:  WCFEKFLLILEVPIRGLKPSDHPYSFAVFWIHVFDLPLDWFNRTMAERIGNAMGVFEDVDARNGFMFWGASLRIMVRIDLTRPLRRGIQI----------

Query:  -------YPDVLLVACGCLS--VRDCVHMLQSDHGANPPLQYGDWMRFTGKGMALNVLA
                PD     CG L   ++DC          N  LQYG W+RF G   + N+L+
Subjt:  -------YPDVLLVACGCLS--VRDCVHMLQSDHGANPPLQYGDWMRFTGKGMALNVLA

A0A6J1DU55 uncharacterized protein LOC1110231351.0e-4439.2Show/hide
Query:  MDPSALIDEWDRLSLTAEEEEVSVVADREAVDWSGLLLGFFLLGKLLCHRSLGAEVMRRSFKAAWKINLNLQVDRLGHNMFIFRFANEADKVRVLRQGSW
        MD   L+ +W +  LT+EE+E+++  D +AV  +   L + L+GKLL  R + A+V+ R    AWK+   L V+ +G N+F+F F  E D  RV++ G W
Subjt:  MDPSALIDEWDRLSLTAEEEEVSVVADREAVDWSGLLLGFFLLGKLLCHRSLGAEVMRRSFKAAWKINLNLQVDRLGHNMFIFRFANEADKVRVLRQGSW

Query:  CFEKFLLILEVPIRGLKPSDHPYSFAVFWIHVFDLPLDWFNRTMAERIGNAMGVFEDVDARNGFMFWGASLRIMVRIDLTRPLRRGIQI-----------
         F+K L++L+ P      S+  ++   FWIH+FDLP+ W N+TMA R+GNA+G F DVD       WGASLRI V ID+T+PLRRGI+I           
Subjt:  CFEKFLLILEVPIRGLKPSDHPYSFAVFWIHVFDLPLDWFNRTMAERIGNAMGVFEDVDARNGFMFWGASLRIMVRIDLTRPLRRGIQI-----------

Query:  ------YPDVLLVACGCL--SVRDC-VHMLQSDHGANPPLQYGDWMRFTG
               PD     CG +  S  DC    L +   +    +YG W+RF G
Subjt:  ------YPDVLLVACGCL--SVRDC-VHMLQSDHGANPPLQYGDWMRFTG

A0A6J1DX30 uncharacterized protein LOC1110248742.1e-3434.63Show/hide
Query:  LIDEWDRLSLTAEEEEVSVVADREAVDWSGLLLGFFLLGKLLCHRSLGAEVMRRSFKAAWKINLN-LQVDRLGHNMFIFRFANEADKVRVLRQGSWCFEK
        L++EW    LT+EEEE ++  D  A   +G  L   L+GKL   R +   VM+ + + AWK+  N  +V  LG+N+F+F FA   D+ ++ + G W F++
Subjt:  LIDEWDRLSLTAEEEEVSVVADREAVDWSGLLLGFFLLGKLLCHRSLGAEVMRRSFKAAWKINLN-LQVDRLGHNMFIFRFANEADKVRVLRQGSWCFEK

Query:  FLLILEVPIRGLKPSDHPYSFAVFWIHVFDLPLDWFNRTMAERIGNAMGVFEDVDARNGFMFWGASLRIMVRIDLTRPLRRGIQIYPDVLL----VACGC
         L+++  P+  + PS+  ++    W+  FDLPL    R MA R+GNA+G FE+ D  +    WG++LR+ V +D+++PLRRGI++  D  +    +    
Subjt:  FLLILEVPIRGLKPSDHPYSFAVFWIHVFDLPLDWFNRTMAERIGNAMGVFEDVDARNGFMFWGASLRIMVRIDLTRPLRRGIQIYPDVLL----VACGC

Query:  LSVRD-CVHMLQSDHGANPPLQYGDWMRFTG
          + D C H   S   +    QYG W+R+ G
Subjt:  LSVRD-CVHMLQSDHGANPPLQYGDWMRFTG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G31430.1 unknown protein3.0e-0925.52Show/hide
Query:  FFLLGKLLCHRSLGAEVMRRSFKAAWKINLNLQVDRLGHNMFIFRFANEADKVRVLRQGSWCFEKFLLILEVPIRGLKPSDHPYSFAVFWIHVFDLPLDW
        F L G+ +  R      +  S    W  +  +    +    F F F  E     VLR+G W F  ++++L+      +P    + F  FW+ +  +P  +
Subjt:  FFLLGKLLCHRSLGAEVMRRSFKAAWKINLNLQVDRLGHNMFIFRFANEADKVRVLRQGSWCFEKFLLILEVPIRGLKPSDHPYSFAVFWIHVFDLPLDW

Query:  FNRTMAERIGNAMGVFEDVDARNGFMFWGASLRIMVRIDLTRPLR
         NR + E IG A+G   D D     +      R+++  D+T PLR
Subjt:  FNRTMAERIGNAMGVFEDVDARNGFMFWGASLRIMVRIDLTRPLR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCCGTCTGCATTGATTGACGAATGGGATCGTTTGAGTTTAACGGCAGAGGAAGAAGAGGTCTCGGTGGTTGCTGATCGTGAGGCGGTGGACTGGTCTGGG
CTTTTGCTCGGGTTTTTCCTACTGGGAAAGTTATTGTGTCATCGTTCTTTGGGGGCAGAGGTGATGAGGAGAAGCTTCAAGGCTGCATGGAAGATTAACCTGAAT
TTGCAGGTGGATCGGTTGGGACATAACATGTTTATCTTTCGTTTTGCTAACGAAGCGGACAAGGTTCGAGTTTTACGCCAGGGGTCGTGGTGTTTCGAGAAATTT
TTGCTGATTTTGGAAGTTCCTATTCGAGGTTTGAAGCCATCTGATCATCCCTACTCATTTGCGGTTTTTTGGATCCATGTTTTTGACTTGCCGCTTGACTGGTTT
AATCGGACAATGGCGGAGCGGATTGGCAATGCGATGGGAGTGTTCGAAGATGTTGATGCTCGAAATGGCTTCATGTTTTGGGGTGCGAGCTTGCGGATCATGGTG
CGGATTGATTTGACTCGTCCCCTTCGGCGTGGTATCCAGATTTATCCAGATGTCCTCTTAGTGGCTTGTGGGTGCCTTTCAGTTAGGGATTGTGTCCATATGCTT
CAATCTGATCACGGGGCCAATCCTCCTCTGCAATATGGGGATTGGATGAGATTCACTGGGAAGGGGATGGCTTTGAATGTCCTTGCAGCACGAGTTGTTGATCGT
CGAGCTGATCGGGAAATCCCCAGAAGCACGAGGTTGGCCATCGAGGTTGCGCCTTCAGTGGGGAATTTTCCTTCTCCTACGGTTTTTCCAGAGGTGGCCGACCTT
CGTCCAAGCCAAGGTGGAATCCGAATTTCAGAACCATCGGAGGAGGTTCTGAGGAGACTCCATGCTCCTGCATCGCCGGCGTATTATTCGTTGTCTGGAGCTCCG
AGTTCGCAGCTGAAAGGGACGGAAAAGGTTGCCGACACGATGACTAAGTACAAGGGAGGTTGTTCAGCCATCGCCAATCAATGGCGGCCAGAATGCTCAAGTCGG
AATTGCTTGCCAGAATCTGGGACGGACGGTGGGTGGAGTCCAATTCGGACAAACATTACAAATTGCGTGGGGCCCACTCGTGAACGCTTGGTCGAGTTTAATTTG
AATAATGGGCTGGGAAATTATGAGCCTTCATTGCAGGTGGATAATGGTCTTAATTTGGACCCTTTATTGCCCATGCTAAAAAATAATTCAACCGTTTCGTTGGGC
CAGACCCAGGGTAGTGGTAAGCTGCAGGAAAATTCGGATGATGTGGAGAGTGACCCTGAAATGGACGAGGATGCTGGGCTGTTGGGTTCAGATGGGCTGGTTTCT
TCTGAGGCCCATGAAGGCGATGGTCAACTGGTGGGTCTGGAGGTTGCTCCTAAGACGATTATTGATTTGGCTGATGACATGGTGACTATCGTGAATGAACGTGTG
GTTGCTGACTGTCCGTTTCAAGATCCAGATATTATAGGCGCAGGTAAGGGTAAAAAATGGAAGAAACGTGCTCACGCTGGCTTTGTGCCAAAGGGATTAAATGTG
GAAGCCTTGGAGGAATTTCAAAAGCGTAAGGATGAACCCTTTTTGTTTTCTCCGGTTAACATTAAGCGTCCAAAATTGGACAACTATGATTGTGAGACGGCGGGG
ACTGCTGAGCAGACCCGCCGTGATCCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGATCCGTCTGCATTGATTGACGAATGGGATCGTTTGAGTTTAACGGCAGAGGAAGAAGAGGTCTCGGTGGTTGCTGATCGTGAGGCGGTGGACTGGTCTGGG
CTTTTGCTCGGGTTTTTCCTACTGGGAAAGTTATTGTGTCATCGTTCTTTGGGGGCAGAGGTGATGAGGAGAAGCTTCAAGGCTGCATGGAAGATTAACCTGAAT
TTGCAGGTGGATCGGTTGGGACATAACATGTTTATCTTTCGTTTTGCTAACGAAGCGGACAAGGTTCGAGTTTTACGCCAGGGGTCGTGGTGTTTCGAGAAATTT
TTGCTGATTTTGGAAGTTCCTATTCGAGGTTTGAAGCCATCTGATCATCCCTACTCATTTGCGGTTTTTTGGATCCATGTTTTTGACTTGCCGCTTGACTGGTTT
AATCGGACAATGGCGGAGCGGATTGGCAATGCGATGGGAGTGTTCGAAGATGTTGATGCTCGAAATGGCTTCATGTTTTGGGGTGCGAGCTTGCGGATCATGGTG
CGGATTGATTTGACTCGTCCCCTTCGGCGTGGTATCCAGATTTATCCAGATGTCCTCTTAGTGGCTTGTGGGTGCCTTTCAGTTAGGGATTGTGTCCATATGCTT
CAATCTGATCACGGGGCCAATCCTCCTCTGCAATATGGGGATTGGATGAGATTCACTGGGAAGGGGATGGCTTTGAATGTCCTTGCAGCACGAGTTGTTGATCGT
CGAGCTGATCGGGAAATCCCCAGAAGCACGAGGTTGGCCATCGAGGTTGCGCCTTCAGTGGGGAATTTTCCTTCTCCTACGGTTTTTCCAGAGGTGGCCGACCTT
CGTCCAAGCCAAGGTGGAATCCGAATTTCAGAACCATCGGAGGAGGTTCTGAGGAGACTCCATGCTCCTGCATCGCCGGCGTATTATTCGTTGTCTGGAGCTCCG
AGTTCGCAGCTGAAAGGGACGGAAAAGGTTGCCGACACGATGACTAAGTACAAGGGAGGTTGTTCAGCCATCGCCAATCAATGGCGGCCAGAATGCTCAAGTCGG
AATTGCTTGCCAGAATCTGGGACGGACGGTGGGTGGAGTCCAATTCGGACAAACATTACAAATTGCGTGGGGCCCACTCGTGAACGCTTGGTCGAGTTTAATTTG
AATAATGGGCTGGGAAATTATGAGCCTTCATTGCAGGTGGATAATGGTCTTAATTTGGACCCTTTATTGCCCATGCTAAAAAATAATTCAACCGTTTCGTTGGGC
CAGACCCAGGGTAGTGGTAAGCTGCAGGAAAATTCGGATGATGTGGAGAGTGACCCTGAAATGGACGAGGATGCTGGGCTGTTGGGTTCAGATGGGCTGGTTTCT
TCTGAGGCCCATGAAGGCGATGGTCAACTGGTGGGTCTGGAGGTTGCTCCTAAGACGATTATTGATTTGGCTGATGACATGGTGACTATCGTGAATGAACGTGTG
GTTGCTGACTGTCCGTTTCAAGATCCAGATATTATAGGCGCAGGTAAGGGTAAAAAATGGAAGAAACGTGCTCACGCTGGCTTTGTGCCAAAGGGATTAAATGTG
GAAGCCTTGGAGGAATTTCAAAAGCGTAAGGATGAACCCTTTTTGTTTTCTCCGGTTAACATTAAGCGTCCAAAATTGGACAACTATGATTGTGAGACGGCGGGG
ACTGCTGAGCAGACCCGCCGTGATCCATGA
Protein sequenceShow/hide protein sequence
MDPSALIDEWDRLSLTAEEEEVSVVADREAVDWSGLLLGFFLLGKLLCHRSLGAEVMRRSFKAAWKINLNLQVDRLGHNMFIFRFANEADKVRVLRQGSWCFEKF
LLILEVPIRGLKPSDHPYSFAVFWIHVFDLPLDWFNRTMAERIGNAMGVFEDVDARNGFMFWGASLRIMVRIDLTRPLRRGIQIYPDVLLVACGCLSVRDCVHML
QSDHGANPPLQYGDWMRFTGKGMALNVLAARVVDRRADREIPRSTRLAIEVAPSVGNFPSPTVFPEVADLRPSQGGIRISEPSEEVLRRLHAPASPAYYSLSGAP
SSQLKGTEKVADTMTKYKGGCSAIANQWRPECSSRNCLPESGTDGGWSPIRTNITNCVGPTRERLVEFNLNNGLGNYEPSLQVDNGLNLDPLLPMLKNNSTVSLG
QTQGSGKLQENSDDVESDPEMDEDAGLLGSDGLVSSEAHEGDGQLVGLEVAPKTIIDLADDMVTIVNERVVADCPFQDPDIIGAGKGKKWKKRAHAGFVPKGLNV
EALEEFQKRKDEPFLFSPVNIKRPKLDNYDCETAGTAEQTRRDP