; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0020059 (gene) of Snake gourd v1 genome

Gene IDTan0020059
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionSWIM-type domain-containing protein
Genome locationLG06:76170410..76173045
RNA-Seq ExpressionTan0020059
SyntenyTan0020059
Gene Ontology termsGO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR006564 - Zinc finger, PMZ-type
IPR007527 - Zinc finger, SWIM-type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG8475439.1 hypothetical protein CXB51_032209 [Gossypium anomalum]4.6e-3436.07Show/hide
Query:  HWSK-HATCHPKVDFLLNNMVESFNLLILPSRNQHIIQLFETIRKIIMCRLTKKHVKYTNMKEEELGSIIMEKLATNISQSRKVILRWAGNNLFEVECGT
        HWS+ H +     D L+NN+ ESFN +IL +R + I+ + ETIR  IM  + KK  +    K   L   I +K+  NI  S + +  +AG + ++VECG 
Subjt:  HWSK-HATCHPKVDFLLNNMVESFNLLILPSRNQHIIQLFETIRKIIMCRLTKKHVKYTNMKEEELGSIIMEKLATNISQSRKVILRWAGNNLFEVECGT

Query:  -TQVFVDIERITCTCGLWQLSGIPCPHAAQCIYYVKKRPNEFVDDAYSKQTPVKTYSHFLQPLNDHTFW-PTTSFKPILPPSIRRPPGRSKKQRKRNVDE
         +Q  VD+   +C+C  W L+GIPC HA   I+  ++ P  +V   Y+KQT ++ YS+F+ P+     W P ++  PILPP +RRPPGR  K +++  DE
Subjt:  -TQVFVDIERITCTCGLWQLSGIPCPHAAQCIYYVKKRPNEFVDDAYSKQTPVKTYSHFLQPLNDHTFW-PTTSFKPILPPSIRRPPGRSKKQRKRNVDE

Query:  VGTSKSVSSRR-RLRCDKCGVSGHNSRTWKGKVNN----KRKQVNSSNSSFIGFNIPCEPSGMEGSSATLSATQPLSATP
          T++ +S R   +RC KC + GHN R+ KG+V      KR +V               P+  EG+S   ++   LS  P
Subjt:  VGTSKSVSSRR-RLRCDKCGVSGHNSRTWKGKVNN----KRKQVNSSNSSFIGFNIPCEPSGMEGSSATLSATQPLSATP

KAG8481389.1 hypothetical protein CXB51_026139 [Gossypium anomalum]1.0e-3333.23Show/hide
Query:  EEIESSYGSYSNIHTPYNFEDEREVKGPKFRHESGMDHIHGITAKHWSK-HATCHPKVDFLLNNMVESFNLLILPSRNQHIIQLFETIRKIIMCRLTKKH
        +E++      + + T   F+D  +      +H    D +      HWS+ H +     D L+NN+ ESFN +IL +R + I+ + ETIR  IM  + KK 
Subjt:  EEIESSYGSYSNIHTPYNFEDEREVKGPKFRHESGMDHIHGITAKHWSK-HATCHPKVDFLLNNMVESFNLLILPSRNQHIIQLFETIRKIIMCRLTKKH

Query:  VKYTNMKEEELGSIIMEKLATNISQSRKVILRWAGNNLFEVECGT-TQVFVDIERITCTCGLWQLSGIPCPHAAQCIYYVKKRPNEFVDDAYSKQTPVKT
         +    K   L   I +K+  NI  S + +   AG + ++VECG  +Q  VD+   +C+C  W L+GIPC HA   I+  ++ P  +V   Y+KQT ++ 
Subjt:  VKYTNMKEEELGSIIMEKLATNISQSRKVILRWAGNNLFEVECGT-TQVFVDIERITCTCGLWQLSGIPCPHAAQCIYYVKKRPNEFVDDAYSKQTPVKT

Query:  YSHFLQPLNDHTFW-PTTSFKPILPPSIRRPPGRSKKQRKRNVDEVGTSKSVSSRR-RLRCDKCGVSGHNSRTWKGKVNN----KRKQVNSSNSSFIGFN
        YS+F+ P+     W P ++  PILPP +RRPPGR  K R++  DE  T++ +S R   +RC KC + GHN R+ KG+V      KR +V           
Subjt:  YSHFLQPLNDHTFW-PTTSFKPILPPSIRRPPGRSKKQRKRNVDEVGTSKSVSSRR-RLRCDKCGVSGHNSRTWKGKVNN----KRKQVNSSNSSFIGFN

Query:  IPCEPSGMEGSSATLSATQPLSATP
            P+  EG+S   ++   LS  P
Subjt:  IPCEPSGMEGSSATLSATQPLSATP

XP_028779040.1 uncharacterized protein LOC114735508 [Prosopis alba]1.4e-3535.85Show/hide
Query:  DHIHGITAKHWSK-HATCHPKVDFLLNNMVESFNLLILPSRNQHIIQLFETIRKIIMCRLTKKHVKYTNMKEEELGSII---MEKLATNISQSRKVILRW
        DH+  I    WS+ H   + K D + NN+ ESFN  +L +R   II L + IR  +M R  +  VK    +++  G +     ++L   +++ RK   R+
Subjt:  DHIHGITAKHWSK-HATCHPKVDFLLNNMVESFNLLILPSRNQHIIQLFETIRKIIMCRLTKKHVKYTNMKEEELGSII---MEKLATNISQSRKVILRW

Query:  AGNNLFEVECGTTQVFVDIERITCTCGLWQLSGIPCPHAAQCIYYVKKRPNEFVDDAYSKQTPVKTYSHFLQPLNDHTFWPTTSFKPILPPSIRRPPGRS
        AGNN +EV   T +  VD+   TC+C  WQL+GIPC HAA CIY  ++ P+++VD   + Q+ V  YS  ++P+N    W     +PI PP   + PGR 
Subjt:  AGNNLFEVECGTTQVFVDIERITCTCGLWQLSGIPCPHAAQCIYYVKKRPNEFVDDAYSKQTPVKTYSHFLQPLNDHTFWPTTSFKPILPPSIRRPPGRS

Query:  KKQRKRNVDEVGTSKSVSSRRRLR---CDKCGVSGHNSRTWK-----GKVNNKRKQVNSSNSSFI
         K+RK+ +DE  ++++   +R+ R   C  CG  GHN RT K      K++ KRKQ +S+  S +
Subjt:  KKQRKRNVDEVGTSKSVSSRRRLR---CDKCGVSGHNSRTWK-----GKVNNKRKQVNSSNSSFI

XP_030922911.1 uncharacterized protein LOC115949770 [Quercus lobata]6.7e-3336.93Show/hide
Query:  HIHGITAKHWSKHA-TCHPKVDFLLNNMVESFNLLILPSRNQHIIQLFETIRKIIMCRLTKKHVKYTNMKEEELGSIIMEKLATNISQSRKVILRWAGNN
        H+ GI   HWS+HA +  PK D LLNN+ ESFN  I+ +R++ I+ + + IR+ +M R+ +        K   +   I +KL  N   +R     W+G +
Subjt:  HIHGITAKHWSKHA-TCHPKVDFLLNNMVESFNLLILPSRNQHIIQLFETIRKIIMCRLTKKHVKYTNMKEEELGSIIMEKLATNISQSRKVILRWAGNN

Query:  LFEVECGTTQVFVDIERITCTCGLWQLSGIPCPHAAQCIYYVKKRPNEFVDDAYSKQTPVKTYSHFLQPLNDHTFWPTTSFKPILPPSIRRPPGRSKK-Q
         FEV C   Q  VD+E+ TC C  W ++GIPC HA   I Y K+    +V   Y  +T ++TYSH +QP N   FWP      ILPP +   PGR K+ +
Subjt:  LFEVECGTTQVFVDIERITCTCGLWQLSGIPCPHAAQCIYYVKKRPNEFVDDAYSKQTPVKTYSHFLQPLNDHTFWPTTSFKPILPPSIRRPPGRSKK-Q

Query:  RKRNVDEV-----GTSKSVSSRRRLRCDKCGVSGHNSRTWK
        RK+  DE         KS ++++      CG  GHN R  K
Subjt:  RKRNVDEV-----GTSKSVSSRRRLRCDKCGVSGHNSRTWK

XP_030970679.1 uncharacterized protein LOC115991070 [Quercus lobata]2.5e-4035.91Show/hide
Query:  EREVKGPKFRHESGMDHIHGITAKHWSKHA-TCHPKVDFLLNNMVESFNLLILPSRNQHIIQLFETIRKIIMCRLTKKHVKYTNMK-EEELGSIIMEKLA
        +R +K  +       + + G +   W KHA + +PK D L NNM ESFN +IL SR + +I L E +R  +M +   +  K   MK E+EL  +  +KL 
Subjt:  EREVKGPKFRHESGMDHIHGITAKHWSKHA-TCHPKVDFLLNNMVESFNLLILPSRNQHIIQLFETIRKIIMCRLTKKHVKYTNMK-EEELGSIIMEKLA

Query:  TNISQSRKVILRWAGNNLFEVECGTTQVFVDIERITCTCGLWQLSGIPCPHAAQCIYYVKKRPNEFVDDAYSKQTPVKTYSHFLQPLNDHTFWPTTSFKP
         NI+ S   +  WAG+  FEV C   Q  VD+   +CTC  W L+G+PC HA  CI+Y K+    +V++    QT    Y   + P+N+   W  T F+P
Subjt:  TNISQSRKVILRWAGNNLFEVECGTTQVFVDIERITCTCGLWQLSGIPCPHAAQCIYYVKKRPNEFVDDAYSKQTPVKTYSHFLQPLNDHTFWPTTSFKP

Query:  ILPPSIRRPPGRSKKQRKRNVDEVGTSKSVSSRRR-LRCDKCGVSGHNSRTWKGKVNNKRKQVNSSNSSFIGFNIPCEPSGMEGSSATLSATQPLSATPP
        + PP  +RP GR KK+RKR+ DE     S   R   ++C  C   GHN RT KGKV   ++Q  S   S        +P+  EG+SA+ SA     A   
Subjt:  ILPPSIRRPPGRSKKQRKRNVDEVGTSKSVSSRRR-LRCDKCGVSGHNSRTWKGKVNNKRKQVNSSNSSFIGFNIPCEPSGMEGSSATLSATQPLSATPP

Query:  LSAPSSVTPSSGRGFTAPQIFTP
         SA +S   S+  G    Q   P
Subjt:  LSAPSSVTPSSGRGFTAPQIFTP

TrEMBL top hitse value%identityAlignment
A0A2N9FLE9 Uncharacterized protein2.0e-3838.04Show/hide
Query:  YNFEDEREVKGPKFRHESGMDHIHGITAKHWSKHA-TCHPKVDFLLNNMVESFNLLILPSRNQHIIQLFETIRKIIMCRL--TKKHVKYTNMKEEELGSI
        Y  + ER +K  K       + +     + W KH    H K D L+NNM ESFN +IL +R + II L E IR  +M R    ++ +K     E +L   
Subjt:  YNFEDEREVKGPKFRHESGMDHIHGITAKHWSKHA-TCHPKVDFLLNNMVESFNLLILPSRNQHIIQLFETIRKIIMCRL--TKKHVKYTNMKEEELGSI

Query:  IMEKLATNISQSRKVILRWAGNNLFEVECGTTQVFVDIERITCTCGLWQLSGIPCPHAAQCIYYVKKRPNEFVDDAYSKQTPVKTYSHFLQPLNDHTFWP
        I +KL+     S     + AGN  FEV CG  Q  V+++   CTC  W LSG+PC HA  CI+Y ++   ++VD  Y   T V  Y   + P+N    W 
Subjt:  IMEKLATNISQSRKVILRWAGNNLFEVECGTTQVFVDIERITCTCGLWQLSGIPCPHAAQCIYYVKKRPNEFVDDAYSKQTPVKTYSHFLQPLNDHTFWP

Query:  TTSFKPILPPSIRRPPGRSKKQRKRNVDEVGTSKSVSSRRRL--RCDKCGVSGHNSRTWKGKVNNKRKQVNSSNSS
         T   P+ PP IRRPPGR KK R+R  DE     +  S+R L  +C KCG +GHN RT KGKV   + Q N+   +
Subjt:  TTSFKPILPPSIRRPPGRSKKQRKRNVDEVGTSKSVSSRRRL--RCDKCGVSGHNSRTWKGKVNNKRKQVNSSNSS

A0A2N9FZE3 SWIM-type domain-containing protein1.4e-3635.07Show/hide
Query:  DHIHGITAKHWSKHATCH-PKVDFLLNNMVESFNLLILPSRNQHIIQLFETIRKIIMCRLTKKH---VKYTNMKEEELGSIIMEKLATNISQSRKVILRW
        D + G     W++HA  H PK D LLNN+ E+FN  I+ +R + II + ETIR+ +M R+ K     +KY    +  +   I EKL    ++S+    +W
Subjt:  DHIHGITAKHWSKHATCH-PKVDFLLNNMVESFNLLILPSRNQHIIQLFETIRKIIMCRLTKKH---VKYTNMKEEELGSIIMEKLATNISQSRKVILRW

Query:  AGNNLFEVECGTTQVFVDIERITCTCGLWQLSGIPCPHAAQCIYYVKKRPNEFVDDAYSKQTPVKTYSHFLQPLNDHTFWPTTSFKPILPPSIRRPPGRS
         G N +EV     +  VDI + +C C  W L+GIPC HA + I Y      ++VDD + K+T +K YSH +QP N   FWP  +  P+LPP  RR PGR 
Subjt:  AGNNLFEVECGTTQVFVDIERITCTCGLWQLSGIPCPHAAQCIYYVKKRPNEFVDDAYSKQTPVKTYSHFLQPLNDHTFWPTTSFKPILPPSIRRPPGRS

Query:  KK-QRKRNVDEVGTS-KSVSSRRRLRCDKCGVSGHNSRTWKGK------------------------VNNKRKQVNSSNSSFIGFNIP
        K+  R+++ DE   S K   ++  L+C +C   GHN R+ K K                          N +K+  S++ +F+GF IP
Subjt:  KK-QRKRNVDEVGTS-KSVSSRRRLRCDKCGVSGHNSRTWKGK------------------------VNNKRKQVNSSNSSFIGFNIP

A0A2N9HT52 SWIM-type domain-containing protein1.4e-3635.07Show/hide
Query:  DHIHGITAKHWSKHATCH-PKVDFLLNNMVESFNLLILPSRNQHIIQLFETIRKIIMCRLTKKH---VKYTNMKEEELGSIIMEKLATNISQSRKVILRW
        D + G     W++HA  H PK D LLNN+ E+FN  I+ +R + II + ETIR+ +M R+ K     +KY    +  +   I EKL    ++S+    +W
Subjt:  DHIHGITAKHWSKHATCH-PKVDFLLNNMVESFNLLILPSRNQHIIQLFETIRKIIMCRLTKKH---VKYTNMKEEELGSIIMEKLATNISQSRKVILRW

Query:  AGNNLFEVECGTTQVFVDIERITCTCGLWQLSGIPCPHAAQCIYYVKKRPNEFVDDAYSKQTPVKTYSHFLQPLNDHTFWPTTSFKPILPPSIRRPPGRS
         G N +EV     +  VDI + +C C  W L+GIPC HA + I Y      ++VDD + K+T +K YSH +QP N   FWP  +  P+LPP  RR PGR 
Subjt:  AGNNLFEVECGTTQVFVDIERITCTCGLWQLSGIPCPHAAQCIYYVKKRPNEFVDDAYSKQTPVKTYSHFLQPLNDHTFWPTTSFKPILPPSIRRPPGRS

Query:  KK-QRKRNVDEVGTS-KSVSSRRRLRCDKCGVSGHNSRTWKGK------------------------VNNKRKQVNSSNSSFIGFNIP
        K+  R+++ DE   S K   ++  L+C +C   GHN R+ K K                          N +K+  S++ +F+GF IP
Subjt:  KK-QRKRNVDEVGTS-KSVSSRRRLRCDKCGVSGHNSRTWKGK------------------------VNNKRKQVNSSNSSFIGFNIP

A0A2N9HUS0 Uncharacterized protein2.0e-3838.04Show/hide
Query:  YNFEDEREVKGPKFRHESGMDHIHGITAKHWSKHA-TCHPKVDFLLNNMVESFNLLILPSRNQHIIQLFETIRKIIMCRL--TKKHVKYTNMKEEELGSI
        Y  + ER +K  K       + +     + W KH    H K D L+NNM ESFN +IL +R + II L E IR  +M R    ++ +K     E +L   
Subjt:  YNFEDEREVKGPKFRHESGMDHIHGITAKHWSKHA-TCHPKVDFLLNNMVESFNLLILPSRNQHIIQLFETIRKIIMCRL--TKKHVKYTNMKEEELGSI

Query:  IMEKLATNISQSRKVILRWAGNNLFEVECGTTQVFVDIERITCTCGLWQLSGIPCPHAAQCIYYVKKRPNEFVDDAYSKQTPVKTYSHFLQPLNDHTFWP
        I +KL+     S     + AGN  FEV CG  Q  V+++   CTC  W LSG+PC HA  CI+Y ++   ++VD  Y   T V  Y   + P+N    W 
Subjt:  IMEKLATNISQSRKVILRWAGNNLFEVECGTTQVFVDIERITCTCGLWQLSGIPCPHAAQCIYYVKKRPNEFVDDAYSKQTPVKTYSHFLQPLNDHTFWP

Query:  TTSFKPILPPSIRRPPGRSKKQRKRNVDEVGTSKSVSSRRRL--RCDKCGVSGHNSRTWKGKVNNKRKQVNSSNSS
         T   P+ PP IRRPPGR KK R+R  DE     +  S+R L  +C KCG +GHN RT KGKV   + Q N+   +
Subjt:  TTSFKPILPPSIRRPPGRSKKQRKRNVDEVGTSKSVSSRRRL--RCDKCGVSGHNSRTWKGKVNNKRKQVNSSNSS

A0A2N9I0N2 SWIM-type domain-containing protein1.4e-3635.07Show/hide
Query:  DHIHGITAKHWSKHATCH-PKVDFLLNNMVESFNLLILPSRNQHIIQLFETIRKIIMCRLTKKH---VKYTNMKEEELGSIIMEKLATNISQSRKVILRW
        D + G     W++HA  H PK D LLNN+ E+FN  I+ +R + II + ETIR+ +M R+ K     +KY    +  +   I EKL    ++S+    +W
Subjt:  DHIHGITAKHWSKHATCH-PKVDFLLNNMVESFNLLILPSRNQHIIQLFETIRKIIMCRLTKKH---VKYTNMKEEELGSIIMEKLATNISQSRKVILRW

Query:  AGNNLFEVECGTTQVFVDIERITCTCGLWQLSGIPCPHAAQCIYYVKKRPNEFVDDAYSKQTPVKTYSHFLQPLNDHTFWPTTSFKPILPPSIRRPPGRS
         G N +EV     +  VDI + +C C  W L+GIPC HA + I Y      ++VDD + K+T +K YSH +QP N   FWP  +  P+LPP  RR PGR 
Subjt:  AGNNLFEVECGTTQVFVDIERITCTCGLWQLSGIPCPHAAQCIYYVKKRPNEFVDDAYSKQTPVKTYSHFLQPLNDHTFWPTTSFKPILPPSIRRPPGRS

Query:  KK-QRKRNVDEVGTS-KSVSSRRRLRCDKCGVSGHNSRTWKGK------------------------VNNKRKQVNSSNSSFIGFNIP
        K+  R+++ DE   S K   ++  L+C +C   GHN R+ K K                          N +K+  S++ +F+GF IP
Subjt:  KK-QRKRNVDEVGTS-KSVSSRRRLRCDKCGVSGHNSRTWKGK------------------------VNNKRKQVNSSNSSFIGFNIP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G49920.1 MuDR family transposase7.2e-0930Show/hide
Query:  MEKLATNISQSRKVILRWAGNNLFEVECGTTQVFVDIERITCTCGLWQLSGIPCPHAAQCIYYVKKRPNEFVDDAYSKQTPVKTYSHFLQPLNDHTFWPT
        +E+ A  +S + K   R  G +       +T   V +   TCTCG +Q +  PC HA      +K  P ++VDD Y+ +   KTYS    P+ + + WP 
Subjt:  MEKLATNISQSRKVILRWAGNNLFEVECGTTQVFVDIERITCTCGLWQLSGIPCPHAAQCIYYVKKRPNEFVDDAYSKQTPVKTYSHFLQPLNDHTFWPT

Query:  TSFKP-ILPPSIRRPPGR-SKKQRKRNVDE
            P ++PP I  PP + S K ++++ ++
Subjt:  TSFKP-ILPPSIRRPPGR-SKKQRKRNVDE

AT1G64255.1 MuDR family transposase1.8e-0726.56Show/hide
Query:  GSIIMEKLATNISQSRKVILRWA------GNNLFEVECGTT--QVFVDIERITCTCGLWQLSGIPCPHAAQCIYYVKKRPNEFVDDAYSKQTPVKTYSHF
        G +  E +   + + R   + ++       NN F+V       +  V +   +CTCG +Q    PC HA      +K  P ++VDD Y+ +   +TY+  
Subjt:  GSIIMEKLATNISQSRKVILRWA------GNNLFEVECGTT--QVFVDIERITCTCGLWQLSGIPCPHAAQCIYYVKKRPNEFVDDAYSKQTPVKTYSHF

Query:  LQPLNDHTFWPTTSFKP-ILPPSIRRPP
           + + + WP  S  P +LPP I   P
Subjt:  LQPLNDHTFWPTTSFKP-ILPPSIRRPP

AT1G64260.1 MuDR family transposase1.1e-0628.81Show/hide
Query:  MEKLATNISQSRKVILRWAGNNLFEVECGT--TQVFVDIERITCTCGLWQLSGIPCPHAAQCIYYVKKRPNEFVDDAYSKQTPVKTYSHFLQPLNDHTFW
        M+KL   ++ S   ++     + F+V   +   +  V +   TCTC  +Q    PC HA      +K  P ++VD+ Y+ +   KTY+    P+ D   W
Subjt:  MEKLATNISQSRKVILRWAGNNLFEVECGT--TQVFVDIERITCTCGLWQLSGIPCPHAAQCIYYVKKRPNEFVDDAYSKQTPVKTYSHFLQPLNDHTFW

Query:  PTTSFKPIL-PPSIRRPP
        P     P L PPS +  P
Subjt:  PTTSFKPIL-PPSIRRPP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGACTGAACTGGATGTATTTAACTGTGTCGAGTTTTATTTCCTCTTCGTTTTTAGGGTGTTGTTGACTGATGATGATATTTTAAAAATGGTTGAGCTAACACCTGA
AAGTAGAATTGTCCATATATATGTAGAACATTCACCTAATGTTGAAGTATTAGATCTTACCCAACCTCTGGAAGTTCAAATGCCTATTTTTCTAGAGTGGCACCCCAACC
ATGTTAAAGATGACATAGAGTCTAATGACATGGAGCCTAATGACTTAGTGGATGTAGATTTGTCTGAAGAGTCTTATTCTTTGCCTTCATTTGATGTTAATGTTGATATA
AATTGTGAGGAAATAGAATCATCTTATGGGTCATATTCAAACATCCATACACCTTACAATTTTGAGGATGAAAGAGAGGTTAAAGGACCAAAATTTAGACATGAAAGTGG
CATGGATCATATTCATGGTATTACTGCAAAACATTGGTCTAAACATGCAACGTGTCATCCTAAGGTTGACTTCTTGCTTAATAATATGGTTGAGTCCTTTAATTTACTAA
TACTACCATCAAGAAACCAACACATAATTCAACTTTTTGAAACCATCAGGAAGATAATTATGTGTAGATTGACTAAAAAGCATGTTAAGTATACCAACATGAAAGAAGAA
GAACTTGGTTCTATAATTATGGAAAAGTTGGCAACTAACATTTCACAATCTAGGAAAGTAATACTTCGTTGGGCTGGGAATAACTTGTTTGAGGTAGAGTGTGGGACTAC
TCAGGTTTTTGTGGATATTGAGAGAATAACTTGTACTTGTGGCCTGTGGCAATTAAGTGGGATTCCTTGTCCACATGCTGCTCAATGCATCTATTATGTGAAAAAACGTC
CTAATGAATTTGTGGATGATGCATACAGTAAGCAGACCCCAGTTAAAACATACTCACACTTTTTACAACCTTTGAATGACCATACTTTTTGGCCTACAACTTCATTCAAA
CCCATACTCCCACCATCAATTAGAAGACCACCAGGCCGATCAAAAAAACAAAGAAAACGCAATGTGGATGAGGTTGGGACTTCTAAGTCCGTGTCTTCGCGTAGAAGGTT
GCGATGTGACAAATGTGGGGTGTCTGGTCACAATTCCAGAACTTGGAAAGGTAAAGTGAACAACAAGAGGAAACAAGTGAACTCATCAAATTCTTCATTCATAGGCTTCA
ACATTCCATGTGAACCATCTGGAATGGAAGGATCTTCTGCAACACTGTCTGCTACCCAACCACTGTCTGCTACACCACCACTGTCTGCACCATCATCTGTTACACCATCT
TCTGGGAGAGGATTTACTGCACCCCAAATTTTTACACCACCAACTGCTGCAAGTAGTTCATAA
mRNA sequenceShow/hide mRNA sequence
ATGATGACTGAACTGGATGTATTTAACTGTGTCGAGTTTTATTTCCTCTTCGTTTTTAGGGTGTTGTTGACTGATGATGATATTTTAAAAATGGTTGAGCTAACACCTGA
AAGTAGAATTGTCCATATATATGTAGAACATTCACCTAATGTTGAAGTATTAGATCTTACCCAACCTCTGGAAGTTCAAATGCCTATTTTTCTAGAGTGGCACCCCAACC
ATGTTAAAGATGACATAGAGTCTAATGACATGGAGCCTAATGACTTAGTGGATGTAGATTTGTCTGAAGAGTCTTATTCTTTGCCTTCATTTGATGTTAATGTTGATATA
AATTGTGAGGAAATAGAATCATCTTATGGGTCATATTCAAACATCCATACACCTTACAATTTTGAGGATGAAAGAGAGGTTAAAGGACCAAAATTTAGACATGAAAGTGG
CATGGATCATATTCATGGTATTACTGCAAAACATTGGTCTAAACATGCAACGTGTCATCCTAAGGTTGACTTCTTGCTTAATAATATGGTTGAGTCCTTTAATTTACTAA
TACTACCATCAAGAAACCAACACATAATTCAACTTTTTGAAACCATCAGGAAGATAATTATGTGTAGATTGACTAAAAAGCATGTTAAGTATACCAACATGAAAGAAGAA
GAACTTGGTTCTATAATTATGGAAAAGTTGGCAACTAACATTTCACAATCTAGGAAAGTAATACTTCGTTGGGCTGGGAATAACTTGTTTGAGGTAGAGTGTGGGACTAC
TCAGGTTTTTGTGGATATTGAGAGAATAACTTGTACTTGTGGCCTGTGGCAATTAAGTGGGATTCCTTGTCCACATGCTGCTCAATGCATCTATTATGTGAAAAAACGTC
CTAATGAATTTGTGGATGATGCATACAGTAAGCAGACCCCAGTTAAAACATACTCACACTTTTTACAACCTTTGAATGACCATACTTTTTGGCCTACAACTTCATTCAAA
CCCATACTCCCACCATCAATTAGAAGACCACCAGGCCGATCAAAAAAACAAAGAAAACGCAATGTGGATGAGGTTGGGACTTCTAAGTCCGTGTCTTCGCGTAGAAGGTT
GCGATGTGACAAATGTGGGGTGTCTGGTCACAATTCCAGAACTTGGAAAGGTAAAGTGAACAACAAGAGGAAACAAGTGAACTCATCAAATTCTTCATTCATAGGCTTCA
ACATTCCATGTGAACCATCTGGAATGGAAGGATCTTCTGCAACACTGTCTGCTACCCAACCACTGTCTGCTACACCACCACTGTCTGCACCATCATCTGTTACACCATCT
TCTGGGAGAGGATTTACTGCACCCCAAATTTTTACACCACCAACTGCTGCAAGTAGTTCATAA
Protein sequenceShow/hide protein sequence
MMTELDVFNCVEFYFLFVFRVLLTDDDILKMVELTPESRIVHIYVEHSPNVEVLDLTQPLEVQMPIFLEWHPNHVKDDIESNDMEPNDLVDVDLSEESYSLPSFDVNVDI
NCEEIESSYGSYSNIHTPYNFEDEREVKGPKFRHESGMDHIHGITAKHWSKHATCHPKVDFLLNNMVESFNLLILPSRNQHIIQLFETIRKIIMCRLTKKHVKYTNMKEE
ELGSIIMEKLATNISQSRKVILRWAGNNLFEVECGTTQVFVDIERITCTCGLWQLSGIPCPHAAQCIYYVKKRPNEFVDDAYSKQTPVKTYSHFLQPLNDHTFWPTTSFK
PILPPSIRRPPGRSKKQRKRNVDEVGTSKSVSSRRRLRCDKCGVSGHNSRTWKGKVNNKRKQVNSSNSSFIGFNIPCEPSGMEGSSATLSATQPLSATPPLSAPSSVTPS
SGRGFTAPQIFTPPTAASSS