; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0010432 (gene) of Snake gourd v1 genome

Gene IDTan0010432
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRNase H domain-containing protein
Genome locationLG09:48701980..48702789
RNA-Seq ExpressionTan0010432
SyntenyTan0010432
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
BBH00303.1 Ribonuclease H-like superfamily protein [Prunus dulcis]1.8e-1429.69Show/hide
Query:  TKNLSADDLDLAIIIIWKVWSVRNFISSNNGVVSNADKEKAIRDISLAKKEFFLTEKHKFPLTRLET----------RESQGDMTP----PDPDYWKLNC
        T+    + L L   ++W++W  RN +      +   D    +           LT+  +F +  + T           +S G  +P    P P   KLNC
Subjt:  TKNLSADDLDLAIIIIWKVWSVRNFISSNNGVVSNADKEKAIRDISLAKKEFFLTEKHKFPLTRLET----------RESQGDMTP----PDPDYWKLNC

Query:  DASWMNKVNVGGIGWVIRDSNGSLICAGGKQIKRSWPIKVLEAKAILEGLEAFNKCYVVHHKLLIVESNSSEVIKCLN-EDVKDLSELYNFVKAINCLAR
        D +W+ +   GG+GWV+RD  G  I AGG    R     + EA+A+    EA   C  +    + VES+S  +IK LN EDV DL E+   +  ++CL+ 
Subjt:  DASWMNKVNVGGIGWVIRDSNGSLICAGGKQIKRSWPIKVLEAKAILEGLEAFNKCYVVHHKLLIVESNSSEVIKCLN-EDVKDLSELYNFVKAINCLAR

Query:  RFPVVNFVKCHRSKNLLAHNIARSVCNFG
        R   V F+   R  N  AHN+A      G
Subjt:  RFPVVNFVKCHRSKNLLAHNIARSVCNFG

XP_021808158.1 uncharacterized protein LOC110751913 [Prunus avium]1.5e-1330.19Show/hide
Query:  SADDLDLAIIIIWKVWSVRNFISSNNGVVSNADKEKAIRDISLAKKEFFLT--EKHKFPLTRLETRESQGDMTPPDPDYWKLNCDASWMNKVNVGGIGWV
        +A+ +   +  +W++W  RN     +   + AD  +A   +     E+     E    P   + +  +Q    PP P   K+NCD +W  ++  GG+GWV
Subjt:  SADDLDLAIIIIWKVWSVRNFISSNNGVVSNADKEKAIRDISLAKKEFFLT--EKHKFPLTRLETRESQGDMTPPDPDYWKLNCDASWMNKVNVGGIGWV

Query:  IRDSNGSLICAGGKQIKRSWPIKVLEAKAILEGLEAFNKCYVVHHKLLIVESNSSEVIKCLNEDVKDLSELYNFVKAINCLARRFPVVNFVKCHRSKNLL
        IRDS+G L+CAGG+   R     ++E  AI   L A     + H   ++VES++   I  LN      S+L   V  I  L   F  V+FV   +S N +
Subjt:  IRDSNGSLICAGGKQIKRSWPIKVLEAKAILEGLEAFNKCYVVHHKLLIVESNSSEVIKCLNEDVKDLSELYNFVKAINCLARRFPVVNFVKCHRSKNLL

Query:  AHNIARSVCNFG
        AH +A  V   G
Subjt:  AHNIARSVCNFG

XP_022143535.1 uncharacterized protein LOC111013412 [Momordica charantia]2.4e-1429.52Show/hide
Query:  AIIIIWKVWSVRNFISSNNGVVSNADKEKAIRDISLAKKEFFL-------------TEKHKFPLTRLETRESQGDMTPPDPDYWKLNCDASWMNKVNVGG
        ++II W++W +R     N  +      E   RDI LA   + +             T K    + R+E   +     PP  + WKLN +A+W    N GG
Subjt:  AIIIIWKVWSVRNFISSNNGVVSNADKEKAIRDISLAKKEFFL-------------TEKHKFPLTRLETRESQGDMTPPDPDYWKLNCDASWMNKVNVGG

Query:  IGWVIRDSNGSLICAGGKQIKRSWPIKVLEAKAILEGLEAFNKCYVVHHKLLIVESNSSEVIKCLNEDVKDLSELYNFVKAINCLARRFPVVNFVKCHRS
        IGW++RD  G +I A  + I+    I  LE  AI EGL A  +    H + + +ES+S E I  L+   +D +E+   ++ I  + +   +V+     R 
Subjt:  IGWVIRDSNGSLICAGGKQIKRSWPIKVLEAKAILEGLEAFNKCYVVHHKLLIVESNSSEVIKCLNEDVKDLSELYNFVKAINCLARRFPVVNFVKCHRS

Query:  KNLLAHNIAR
         N +AH +AR
Subjt:  KNLLAHNIAR

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia]1.9e-1630.99Show/hide
Query:  MTKNLSADDLDLAIIIIWKVWSVRNFISSNNGVVSNADKEKAIRDISLAKKEFFLTEKHKFPLTRL----ETRESQGDMTPPDPDYWKLNCDASWMNKVN
        M    S +DLD+ +I  W +W+ RN++            E +     + +   F+TE      T L    +T  ++    PP    W LN DASW +  +
Subjt:  MTKNLSADDLDLAIIIIWKVWSVRNFISSNNGVVSNADKEKAIRDISLAKKEFFLTEKHKFPLTRL----ETRESQGDMTPPDPDYWKLNCDASWMNKVN

Query:  VGGIGWVIRDSNGSLICAGGKQIKRSWPIKVLEAKAILEGLEAFNKCYVVHHKLLIVESNSSEVIKCLNEDVKDLSELYNFVKAINCLARRFPVVNFVKC
         GGIGW+IR  +G ++ AG + ++    +K+LEA AILEGL       V+  + L +E++S+EV   LN   +DL++    V+ I  L     ++ F K 
Subjt:  VGGIGWVIRDSNGSLICAGGKQIKRSWPIKVLEAKAILEGLEAFNKCYVVHHKLLIVESNSSEVIKCLNEDVKDLSELYNFVKAINCLARRFPVVNFVKC

Query:  HRSKNLLAHNIAR
         R  N  AH++A+
Subjt:  HRSKNLLAHNIAR

XP_022156777.1 uncharacterized protein LOC111023608 [Momordica charantia]4.1e-1431.78Show/hide
Query:  AIIIIWKVWSVRNFISSNNGVVSNADKEKAIRDISLAKKEFFLTE-------KHKFPLTRLETRESQGDMT-----PPDPDYWKLNCDASWMNKVNVGGI
        ++II W++W +RN  S   GV S        RDI L    + +         K K     L      GD T     PP  + WKLN DA+W    N GGI
Subjt:  AIIIIWKVWSVRNFISSNNGVVSNADKEKAIRDISLAKKEFFLTE-------KHKFPLTRLETRESQGDMT-----PPDPDYWKLNCDASWMNKVNVGGI

Query:  GWVIRDSNGSLICAGGKQIKRSWPIKVLEAKAILEGLEAFNK--CYVV---HHKLLIVESNSSEVIKCLNEDVKDLSELYNFVKAINCLARRFPVVNFVK
        GW++RD  G +I A  + I+    I  LE  AI EGL A  +  C  +   H + + +ES+S E I  L+   +D +E+   ++ I  +     +V+   
Subjt:  GWVIRDSNGSLICAGGKQIKRSWPIKVLEAKAILEGLEAFNK--CYVV---HHKLLIVESNSSEVIKCLNEDVKDLSELYNFVKAINCLARRFPVVNFVK

Query:  CHRSKNLLAHNIAR
          R  N +AH++AR
Subjt:  CHRSKNLLAHNIAR

TrEMBL top hitse value%identityAlignment
A0A4Y1R838 Ribonuclease H-like superfamily protein8.8e-1529.69Show/hide
Query:  TKNLSADDLDLAIIIIWKVWSVRNFISSNNGVVSNADKEKAIRDISLAKKEFFLTEKHKFPLTRLET----------RESQGDMTP----PDPDYWKLNC
        T+    + L L   ++W++W  RN +      +   D    +           LT+  +F +  + T           +S G  +P    P P   KLNC
Subjt:  TKNLSADDLDLAIIIIWKVWSVRNFISSNNGVVSNADKEKAIRDISLAKKEFFLTEKHKFPLTRLET----------RESQGDMTP----PDPDYWKLNC

Query:  DASWMNKVNVGGIGWVIRDSNGSLICAGGKQIKRSWPIKVLEAKAILEGLEAFNKCYVVHHKLLIVESNSSEVIKCLN-EDVKDLSELYNFVKAINCLAR
        D +W+ +   GG+GWV+RD  G  I AGG    R     + EA+A+    EA   C  +    + VES+S  +IK LN EDV DL E+   +  ++CL+ 
Subjt:  DASWMNKVNVGGIGWVIRDSNGSLICAGGKQIKRSWPIKVLEAKAILEGLEAFNKCYVVHHKLLIVESNSSEVIKCLN-EDVKDLSELYNFVKAINCLAR

Query:  RFPVVNFVKCHRSKNLLAHNIARSVCNFG
        R   V F+   R  N  AHN+A      G
Subjt:  RFPVVNFVKCHRSKNLLAHNIARSVCNFG

A0A6J1CP26 uncharacterized protein LOC1110134121.2e-1429.52Show/hide
Query:  AIIIIWKVWSVRNFISSNNGVVSNADKEKAIRDISLAKKEFFL-------------TEKHKFPLTRLETRESQGDMTPPDPDYWKLNCDASWMNKVNVGG
        ++II W++W +R     N  +      E   RDI LA   + +             T K    + R+E   +     PP  + WKLN +A+W    N GG
Subjt:  AIIIIWKVWSVRNFISSNNGVVSNADKEKAIRDISLAKKEFFL-------------TEKHKFPLTRLETRESQGDMTPPDPDYWKLNCDASWMNKVNVGG

Query:  IGWVIRDSNGSLICAGGKQIKRSWPIKVLEAKAILEGLEAFNKCYVVHHKLLIVESNSSEVIKCLNEDVKDLSELYNFVKAINCLARRFPVVNFVKCHRS
        IGW++RD  G +I A  + I+    I  LE  AI EGL A  +    H + + +ES+S E I  L+   +D +E+   ++ I  + +   +V+     R 
Subjt:  IGWVIRDSNGSLICAGGKQIKRSWPIKVLEAKAILEGLEAFNKCYVVHHKLLIVESNSSEVIKCLNEDVKDLSELYNFVKAINCLARRFPVVNFVKCHRS

Query:  KNLLAHNIAR
         N +AH +AR
Subjt:  KNLLAHNIAR

A0A6J1DNV9 uncharacterized protein LOC1110224039.4e-1730.99Show/hide
Query:  MTKNLSADDLDLAIIIIWKVWSVRNFISSNNGVVSNADKEKAIRDISLAKKEFFLTEKHKFPLTRL----ETRESQGDMTPPDPDYWKLNCDASWMNKVN
        M    S +DLD+ +I  W +W+ RN++            E +     + +   F+TE      T L    +T  ++    PP    W LN DASW +  +
Subjt:  MTKNLSADDLDLAIIIIWKVWSVRNFISSNNGVVSNADKEKAIRDISLAKKEFFLTEKHKFPLTRL----ETRESQGDMTPPDPDYWKLNCDASWMNKVN

Query:  VGGIGWVIRDSNGSLICAGGKQIKRSWPIKVLEAKAILEGLEAFNKCYVVHHKLLIVESNSSEVIKCLNEDVKDLSELYNFVKAINCLARRFPVVNFVKC
         GGIGW+IR  +G ++ AG + ++    +K+LEA AILEGL       V+  + L +E++S+EV   LN   +DL++    V+ I  L     ++ F K 
Subjt:  VGGIGWVIRDSNGSLICAGGKQIKRSWPIKVLEAKAILEGLEAFNKCYVVHHKLLIVESNSSEVIKCLNEDVKDLSELYNFVKAINCLARRFPVVNFVKC

Query:  HRSKNLLAHNIAR
         R  N  AH++A+
Subjt:  HRSKNLLAHNIAR

A0A6J1DSV1 uncharacterized protein LOC1110236082.0e-1431.78Show/hide
Query:  AIIIIWKVWSVRNFISSNNGVVSNADKEKAIRDISLAKKEFFLTE-------KHKFPLTRLETRESQGDMT-----PPDPDYWKLNCDASWMNKVNVGGI
        ++II W++W +RN  S   GV S        RDI L    + +         K K     L      GD T     PP  + WKLN DA+W    N GGI
Subjt:  AIIIIWKVWSVRNFISSNNGVVSNADKEKAIRDISLAKKEFFLTE-------KHKFPLTRLETRESQGDMT-----PPDPDYWKLNCDASWMNKVNVGGI

Query:  GWVIRDSNGSLICAGGKQIKRSWPIKVLEAKAILEGLEAFNK--CYVV---HHKLLIVESNSSEVIKCLNEDVKDLSELYNFVKAINCLARRFPVVNFVK
        GW++RD  G +I A  + I+    I  LE  AI EGL A  +  C  +   H + + +ES+S E I  L+   +D +E+   ++ I  +     +V+   
Subjt:  GWVIRDSNGSLICAGGKQIKRSWPIKVLEAKAILEGLEAFNK--CYVV---HHKLLIVESNSSEVIKCLNEDVKDLSELYNFVKAINCLARRFPVVNFVK

Query:  CHRSKNLLAHNIAR
          R  N +AH++AR
Subjt:  CHRSKNLLAHNIAR

A0A6P5S2T0 uncharacterized protein LOC1107519137.5e-1430.19Show/hide
Query:  SADDLDLAIIIIWKVWSVRNFISSNNGVVSNADKEKAIRDISLAKKEFFLT--EKHKFPLTRLETRESQGDMTPPDPDYWKLNCDASWMNKVNVGGIGWV
        +A+ +   +  +W++W  RN     +   + AD  +A   +     E+     E    P   + +  +Q    PP P   K+NCD +W  ++  GG+GWV
Subjt:  SADDLDLAIIIIWKVWSVRNFISSNNGVVSNADKEKAIRDISLAKKEFFLT--EKHKFPLTRLETRESQGDMTPPDPDYWKLNCDASWMNKVNVGGIGWV

Query:  IRDSNGSLICAGGKQIKRSWPIKVLEAKAILEGLEAFNKCYVVHHKLLIVESNSSEVIKCLNEDVKDLSELYNFVKAINCLARRFPVVNFVKCHRSKNLL
        IRDS+G L+CAGG+   R     ++E  AI   L A     + H   ++VES++   I  LN      S+L   V  I  L   F  V+FV   +S N +
Subjt:  IRDSNGSLICAGGKQIKRSWPIKVLEAKAILEGLEAFNKCYVVHHKLLIVESNSSEVIKCLNEDVKDLSELYNFVKAINCLARRFPVVNFVKCHRSKNLL

Query:  AHNIARSVCNFG
        AH +A  V   G
Subjt:  AHNIARSVCNFG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G27870.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein4.8e-0524.84Show/hide
Query:  PDPDYWKLNCDASWMNKVNVGGIGWVIRDSNGSLICAGGKQIKRSWPIKVLEAKAILEGLEAFNKCYVVHHKLLIVESNSSEVIKCLNEDVKDLSELYNF
        P+  + K N D S++N       GWV+RDSNGS + AG    ++       E +A++  ++    C+   +K +  E ++  +   +N   K    ++N+
Subjt:  PDPDYWKLNCDASWMNKVNVGGIGWVIRDSNGSLICAGGKQIKRSWPIKVLEAKAILEGLEAFNKCYVVHHKLLIVESNSSEVIKCLNEDVKDLSELYNF

Query:  VKAINCLARRFPVVNFVKCHRSKNLLAHNIARS-VCNFGDFDGAFCSPMALTF
        ++ I+   R+F  ++F    R  N  A  +A++ + N   F   F  P  +T+
Subjt:  VKAINCLARRFPVVNFVKCHRSKNLLAHNIARS-VCNFGDFDGAFCSPMALTF

AT1G52990.1 thioredoxin family protein1.3e-0525.47Show/hide
Query:  KLNCDASWMNKVNVGGIGWVIRDSNGSLI-CAGGKQIKRSWPIKVLEAKAILEGLEAFNKCYVVHHKLLIVESNSSEVIKCLNEDVKDLSELYNFVKAIN
        K N DAS      V G+GW+IR+S G+++ C  GK   R  P +  E  A++  ++A +      +  +I E ++S V + +N    D   L +++  I 
Subjt:  KLNCDASWMNKVNVGGIGWVIRDSNGSLI-CAGGKQIKRSWPIKVLEAKAILEGLEAFNKCYVVHHKLLIVESNSSEVIKCLNEDVKDLSELYNFVKAIN

Query:  CLARRFPVVNFVKCHRSKNLLAHNIARSVCNFGDFDGAF--CSPMALTFLGAVLASTHLGF
             F    F+  HR +N  A  + +           F  C       + ++L   HL F
Subjt:  CLARRFPVVNFVKCHRSKNLLAHNIARSVCNFGDFDGAF--CSPMALTFLGAVLASTHLGF

AT2G46460.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.5e-0625.93Show/hide
Query:  TPPDPDYWKLNCDASWMNKVNVGGIGWVIRDSNGSLICAGGKQIKRSWPIKVLEAKAILEGLEAFNKCYVVHHKLLIVESNSSEVIKCLNEDVKDLSELY
        T P   + K N D S+ N+      GW+IRD +G  + AG      +      E +A+L  +++   C+   H+ +  E ++ EV++ LN   K   +++
Subjt:  TPPDPDYWKLNCDASWMNKVNVGGIGWVIRDSNGSLICAGGKQIKRSWPIKVLEAKAILEGLEAFNKCYVVHHKLLIVESNSSEVIKCLNEDVKDLSELY

Query:  NFVKAINCLARRFPVVNFVKCHRSKNLLAHNIARS
        N+++ +    +RF    F   +R +N  A  +A+S
Subjt:  NFVKAINCLARRFPVVNFVKCHRSKNLLAHNIARS

AT4G29090.1 Ribonuclease H-like superfamily protein4.9e-1025.98Show/hide
Query:  IIWKVWSVRNFISSNNGVVSNADKEKAIRDISLAKKEFFL-TEKHKFPLTRLETRESQGDMTPPDPDYWKLNCDASWMNKVNVGGIGWVIRDSNGSLICA
        ++W++W  RN +    G   NA  ++ +R      +E+ + TE           R S G   PP   + K N DA+W       GIGWV+R+  G +   
Subjt:  IIWKVWSVRNFISSNNGVVSNADKEKAIRDISLAKKEFFL-TEKHKFPLTRLETRESQGDMTPPDPDYWKLNCDASWMNKVNVGGIGWVIRDSNGSLICA

Query:  GGKQIKRSWPIKVLEAKAILEGLEAFNKCYVVHHKLLIVESNSSEVIKCLNEDVKDLSELYNFVKAINCLARRFPVVNFVKCHRSKNLLAHNIARSVCNF
        G + + +   +   E +A+   + + ++     +  +I ES+S  +I+ LN D +    L   ++ +  L  +F  V FV   R  N LA  +AR   +F
Subjt:  GGKQIKRSWPIKVLEAKAILEGLEAFNKCYVVHHKLLIVESNSSEVIKCLNEDVKDLSELYNFVKAINCLARRFPVVNFVKCHRSKNLLAHNIARSVCNF

Query:  GDFD
         ++D
Subjt:  GDFD

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein3.5e-0823.41Show/hide
Query:  IIWKVWSVRNFISSNNGVVSNADKEKAIRDISLAKKEFFLTEKHKFPLTRLETRESQG-----------DMTPPDPDYWKLNCDASWMNKVNVGGIGWVI
        ++W++W       S N +V N  + K    + +A  +       K  L    T E Q              +PP  D  K N DAS   +  V G+GW++
Subjt:  IIWKVWSVRNFISSNNGVVSNADKEKAIRDISLAKKEFFLTEKHKFPLTRLETRESQG-----------DMTPPDPDYWKLNCDASWMNKVNVGGIGWVI

Query:  RDSNGSLICAGGKQIKRSWPIKVLEAKAILEGLEAFNKCYVVHHKLLIVESNSSEVIKCLNEDVKDLSELYNFVKAINCLARRFPVVNFVKCHRSKNLLA
        R+S G++I  G  + +     +  E   ++  ++A    Y   HK +I E ++  + + +N    +   L +F+  I      F  + F   HR +N  A
Subjt:  RDSNGSLICAGGKQIKRSWPIKVLEAKAILEGLEAFNKCYVVHHKLLIVESNSSEVIKCLNEDVKDLSELYNFVKAINCLARRFPVVNFVKCHRSKNLLA

Query:  HNIAR
          +A+
Subjt:  HNIAR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTAAAAATTTATCTGCGGATGACCTGGATCTGGCCATTATTATTATCTGGAAGGTTTGGAGTGTCAGAAACTTTATTTCTTCTAACAATGGTGTGGTTTCAAATGC
AGATAAGGAGAAGGCCATTAGAGATATCTCGCTCGCAAAGAAGGAGTTCTTCTTAACAGAGAAGCATAAGTTCCCTTTGACAAGATTGGAGACTCGCGAGAGTCAAGGAG
ATATGACCCCTCCGGACCCGGACTATTGGAAGCTAAATTGCGATGCTTCCTGGATGAATAAAGTCAATGTTGGTGGTATTGGTTGGGTTATCCGTGACTCTAATGGCTCT
CTGATTTGTGCAGGAGGGAAGCAAATTAAAAGAAGTTGGCCAATTAAAGTGCTGGAAGCGAAGGCGATTCTTGAGGGTCTTGAAGCGTTTAACAAGTGCTATGTCGTCCA
TCACAAGTTACTGATTGTGGAGTCGAACTCGAGCGAAGTGATCAAGTGCCTGAACGAGGATGTTAAAGACCTCTCTGAGTTATACAATTTTGTTAAAGCGATTAATTGTC
TTGCTAGGCGCTTTCCTGTTGTTAATTTTGTTAAGTGCCATAGGTCTAAAAACCTCTTAGCTCATAACATTGCTAGAAGTGTTTGTAATTTTGGTGATTTTGATGGAGCT
TTTTGCTCCCCCATGGCTCTAACGTTTCTAGGTGCGGTTTTGGCGAGTACCCATCTTGGGTTTCCGACTTGTTGCCTGCGGGCTGCACCCCTTGTGGGTCCCTTGTGGGC
TAGTGTGTTACTCTGTTCGTTGTTTCAAAAAAAAAAATAA
mRNA sequenceShow/hide mRNA sequence
ATGACTAAAAATTTATCTGCGGATGACCTGGATCTGGCCATTATTATTATCTGGAAGGTTTGGAGTGTCAGAAACTTTATTTCTTCTAACAATGGTGTGGTTTCAAATGC
AGATAAGGAGAAGGCCATTAGAGATATCTCGCTCGCAAAGAAGGAGTTCTTCTTAACAGAGAAGCATAAGTTCCCTTTGACAAGATTGGAGACTCGCGAGAGTCAAGGAG
ATATGACCCCTCCGGACCCGGACTATTGGAAGCTAAATTGCGATGCTTCCTGGATGAATAAAGTCAATGTTGGTGGTATTGGTTGGGTTATCCGTGACTCTAATGGCTCT
CTGATTTGTGCAGGAGGGAAGCAAATTAAAAGAAGTTGGCCAATTAAAGTGCTGGAAGCGAAGGCGATTCTTGAGGGTCTTGAAGCGTTTAACAAGTGCTATGTCGTCCA
TCACAAGTTACTGATTGTGGAGTCGAACTCGAGCGAAGTGATCAAGTGCCTGAACGAGGATGTTAAAGACCTCTCTGAGTTATACAATTTTGTTAAAGCGATTAATTGTC
TTGCTAGGCGCTTTCCTGTTGTTAATTTTGTTAAGTGCCATAGGTCTAAAAACCTCTTAGCTCATAACATTGCTAGAAGTGTTTGTAATTTTGGTGATTTTGATGGAGCT
TTTTGCTCCCCCATGGCTCTAACGTTTCTAGGTGCGGTTTTGGCGAGTACCCATCTTGGGTTTCCGACTTGTTGCCTGCGGGCTGCACCCCTTGTGGGTCCCTTGTGGGC
TAGTGTGTTACTCTGTTCGTTGTTTCAAAAAAAAAAATAA
Protein sequenceShow/hide protein sequence
MTKNLSADDLDLAIIIIWKVWSVRNFISSNNGVVSNADKEKAIRDISLAKKEFFLTEKHKFPLTRLETRESQGDMTPPDPDYWKLNCDASWMNKVNVGGIGWVIRDSNGS
LICAGGKQIKRSWPIKVLEAKAILEGLEAFNKCYVVHHKLLIVESNSSEVIKCLNEDVKDLSELYNFVKAINCLARRFPVVNFVKCHRSKNLLAHNIARSVCNFGDFDGA
FCSPMALTFLGAVLASTHLGFPTCCLRAAPLVGPLWASVLLCSLFQKKK