CuGenDBv2

Tan0004768 (gene) of Snake gourd v1 genome

Gene ID: Tan0004768
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Retrotransposable element Tf2
Genome location: LG02:77553288..77559563
RNA-Seq Expression: Tan0004768
Synteny: Tan0004768
Gene Ontology terms:
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
    IPR001878 - Zinc finger, CCHC-type
    IPR036875 - Zinc finger, CCHC-type superfamily
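
The genome location field packs a sequence name and a coordinate range into one string. A minimal sketch of splitting it and computing the span follows; the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into its three parts."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("LG02:77553288..77559563")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the gene spans 6276 bp on LG02. If the coordinates were 0-based and half-open instead, the `+ 1` would have to be dropped.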


Homology
GenBank top hits (e value, % identity, alignment)
KAA0059217.1 Enzymatic polyprotein [Cucumis melo var. makuwa] (e value: 1.5e-28; % identity: 28.91)
Query:  MNTNISPRALRSSLKGSTILIEANLDKSSITVPKSLSWEQITRNPTWKLTEAFTPPKRNSNLAQIIEHIDGAVEIQFSEETEVTKVKEFMSSRPSTSRTS
        M+TN+SP+AL  S KG T+L+E N++KSS+T+P++L W+++T+NP WKL     P  R+S  A I E  DG VE+QF+      +V E MSSRPSTS  S
Subjt:  MNTNISPRALRSSLKGSTILIEANLDKSSITVPKSLSWEQITRNPTWKLTEAFTPPKRNSNLAQIIEHIDGAVEIQFSEETEVTKVKEFMSSRPSTSRTS

Query:  AASEADSKFYFDRSNSLRVKSVNIEQNVANAHYEAQ--PQSPTQTDMDNRS---------------MFSSQLNVLIEGFSINKEALKKDFLSIKN-----
          SEA  +    RS S+R  SV+    + + HYE +    SPTQ+DM+ RS                F    +V I+ +       +K FL++ +     
Subjt:  AASEADSKFYFDRSNSLRVKSVNIEQNVANAHYEAQ--PQSPTQTDMDNRS---------------MFSSQLNVLIEGFSINKEALKKDFLSIKN-----

Query:  ----KTKRNAFFKKYD-ENQKSEIKSK--WYSFM-EEIEQNIP-----FFTWMK-----------QSVNEINIEGIRLCNESKVQTKL---NKTF-----
            + K  A  KK   + Q + IK    W +   +E+  N P     +F+              +++NE  ++ + +     +Q +L   NKT      
Subjt:  ----KTKRNAFFKKYD-ENQKSEIKSK--WYSFM-EEIEQNIP-----FFTWMK-----------QSVNEINIEGIRLCNESKVQTKL---NKTF-----

Query:  -------------------------------------------------------------GSKKA-------------------ELGIFCDQYGFGKVH
                                                                     GSK A                   ELG FC QYG  +  
Subjt:  -------------------------------------------------------------GSKKA-------------------ELGIFCDQYGFGKVH

Query:  PPSVVRKTAIKRY--RKHQERQNYRPYRSFQRKNYRKPYNSYTKKRPTYKGGHLKEKIQCYKCRSHGHYANKCPMKKKINELEIDDEFKHQLLNIIQSDS
         P   +K   KRY  +K   +   +   S QR+  ++ YN    K  + KG   K    C+KC   GHYAN+CP+K KIN + ID+E K  LL  I+SD 
Subjt:  PPSVVRKTAIKRY--RKHQERQNYRPYRSFQRKNYRKPYNSYTKKRPTYKGGHLKEKIQCYKCRSHGHYANKCPMKKKINELEIDDEFKHQLLNIIQSDS

Query:  E---ESIYSSDEGQILELQEETDSYSDSEYDEEEKEGKRGKSP
        +   ++  SS+E  I  LQEE  S  +  Y + +     G  P
Subjt:  E---ESIYSSDEGQILELQEETDSYSDSEYDEEEKEGKRGKSP

TYJ98361.1 hypothetical protein E5676_scaffold232G00950 [Cucumis melo var. makuwa] (e value: 1.1e-31; % identity: 37.68)
Query:  RGKSPTSSSN-IPAPMSADQYAMDLGFTQVNRPRTRSASIQIRDSMESLTLPPRPSASSYRPL-PVTPMRPSVSPTTPSSSQQGSSIPTTFVEAVNPPKR
        +G  P+SSSN  P+PMS DQYAMDLG+T V + R++S+ I IR  MES T PPRPS +   P   V  MR S S   PSS+++ S++PTT+ +AV P K+
Subjt:  RGKSPTSSSN-IPAPMSADQYAMDLGFTQVNRPRTRSASIQIRDSMESLTLPPRPSASSYRPL-PVTPMRPSVSPTTPSSSQQGSSIPTTFVEAVNPPKR

Query:  FIPKPEIKNYFEKPLQISEPIIEVEFDVGMHFCVFVYFSQPIHKNAFPLTMRFCLYFQTPWIFCWNFQINTYTYYKQIVKVLQIKWWDKYNFSHAGIKEV
        F+P+ EIK+YF+KP+ I +PIIE+E+                                                                          
Subjt:  FIPKPEIKNYFEKPLQISEPIIEVEFDVGMHFCVFVYFSQPIHKNAFPLTMRFCLYFQTPWIFCWNFQINTYTYYKQIVKVLQIKWWDKYNFSHAGIKEV

Query:  KQWFADNGYLQDLSKKKNAEFLNSKSKLLAALAQTTTEADLQKILNQVTSIASSSSPQNSEVEEDD-----EYDINDPFLDSQP
              NG LQD+ ++KNA+FLN K K LAAL Q T +AD Q++L+   + +SSS P  S ++ED+     EYD++DPFLDSQP
Subjt:  KQWFADNGYLQDLSKKKNAEFLNSKSKLLAALAQTTTEADLQKILNQVTSIASSSSPQNSEVEEDD-----EYDINDPFLDSQP

XP_022933039.1 uncharacterized protein LOC111439730 [Cucurbita moschata] (e value: 1.6e-27; % identity: 41.67)
Query:  IPFFTWMKQSVNEINI-EGIRLCNESKVQTKLNKTFGSKKAELGIFCDQYGFGKVHPPSVVRKTAIKRYRKHQERQNYRPYRSFQRK--NYRKPYNSYTK
        IP+ T    S+    I EG+RL NESK+Q KLN +  ++K ELG FCDQYG   +  PS   +  +K + K     +YRP  +++ K    +KP  S  K
Subjt:  IPFFTWMKQSVNEINI-EGIRLCNESKVQTKLNKTFGSKKAELGIFCDQYGFGKVHPPSVVRKTAIKRYRKHQERQNYRPYRSFQRK--NYRKPYNSYTK

Query:  KRPTYKGGHLKEKIQCYKCRSHGHYANKCPMKKKINELEIDDEFKHQLLNIIQSDSEESIYSSDEGQILELQEETDSYSDSEYDEEEKEGKRGKSPTSSS
          PT   G  K+K  C+KCR  GHYANKCP++ KINEL+ID E K+QLL +  +DSE+    S EG+ILELQEE+DSYS +EY E E+EGKR     +  
Subjt:  KRPTYKGGHLKEKIQCYKCRSHGHYANKCPMKKKINELEIDDEFKHQLLNIIQSDSEESIYSSDEGQILELQEETDSYSDSEYDEEEKEGKRGKSPTSSS

Query:  NIPAPMSADQYAMDLGFTQVNRPRTRSASIQ-IRDSMESLTLPPRPSASSYR
             ++ DQ  +     +V  P  +    Q +RD+M     P R   + YR
Subjt:  NIPAPMSADQYAMDLGFTQVNRPRTRSASIQ-IRDSMESLTLPPRPSASSYR

XP_023522280.1 uncharacterized protein LOC111786173, partial [Cucurbita pepo subsp. pepo] (e value: 2.8e-27; % identity: 42.27)
Query:  NAFFKKYDENQKSEIKSKWYSFMEEIEQNIPFFTWMKQSVNEINI-EGIRLCNESKVQTKLNKTFGSKKAELGIFCDQYGFGKVHPPSVVRKTAIKRYRK
        N   K +    K  +K+K+          IP+ T    S+    I EG+RLCNESK+Q KL+    S + ELG FCDQYG   +  PS  R+  +K + K
Subjt:  NAFFKKYDENQKSEIKSKWYSFMEEIEQNIPFFTWMKQSVNEINI-EGIRLCNESKVQTKLNKTFGSKKAELGIFCDQYGFGKVHPPSVVRKTAIKRYRK

Query:  HQERQNYRPYRSFQRKNYRKPYNSYTKKRPTYKGGHL-KEKIQCYKCRSHGHYANKCPMKKKINELEIDDEFKHQLLNIIQSDSEESIYSSDEGQILELQ
             +YRP   ++ K  +    +Y++++ T    H  K+K  C+KCR  GHYAN+CP++ KINEL+ID E K+QLL +  +DSE+    S EG+ILELQ
Subjt:  HQERQNYRPYRSFQRKNYRKPYNSYTKKRPTYKGGHL-KEKIQCYKCRSHGHYANKCPMKKKINELEIDDEFKHQLLNIIQSDSEESIYSSDEGQILELQ

Query:  EETDSYSDSEYDEEEKEGKR
        EE+DSYS +EY E  +EGKR
Subjt:  EETDSYSDSEYDEEEKEGKR

XP_023552915.1 uncharacterized protein LOC111810441 [Cucurbita pepo subsp. pepo] (e value: 6.6e-29; % identity: 42.24)
Query:  NAFFKKYDENQKSEIKSKWYSFMEEIEQNIPFFTWMKQSVNEINI-EGIRLCNESKVQTKLNKTFGSKKAELGIFCDQYGFGKVHPPSVVRKTAIKRYRK
        N   K +    K  +K+K+          IP+ T    S+    I EG+RLCNESK+Q KLN +  ++K ELG FCDQYG   +  PS  R+   K+++ 
Subjt:  NAFFKKYDENQKSEIKSKWYSFMEEIEQNIPFFTWMKQSVNEINI-EGIRLCNESKVQTKLNKTFGSKKAELGIFCDQYGFGKVHPPSVVRKTAIKRYRK

Query:  H-QERQNYRPYRSFQRKNYRKPYNSYTKKRPTYKGGHLKEKIQ-CYKCRSHGHYANKCPMKKKINELEIDDEFKHQLLNIIQSDSEESIYSSDEGQILEL
        H +   +YRP  +++ K  +    +Y++++ T    H  +K Q C+KCR  GHYANKCP++ KINELEID E K+QLL +  +DSE+    S +G+ILEL
Subjt:  H-QERQNYRPYRSFQRKNYRKPYNSYTKKRPTYKGGHLKEKIQ-CYKCRSHGHYANKCPMKKKINELEIDDEFKHQLLNIIQSDSEESIYSSDEGQILEL

Query:  QEETDSYSDSEYDEEEKEGKRGKSPTSSSNIP
        QEE+DSYS++EY E E+EGKR K   +    P
Subjt:  QEETDSYSDSEYDEEEKEGKRGKSPTSSSNIP

TrEMBL top hits (e value, % identity, alignment)
A0A5A7U3G5 Polyprotein (e value: 1.4e-24; % identity: 26.65)
Query:  EKEGKRGKSPTSSSNIPAPMSADQYAMDLGFTQVNRPRTRSASIQIRDSMESLTLPPRPSASSYRPL-PVTPMRPSVSPTTPSSSQQGSSIPTTFVEAVN
        E    +GK P++ S+ P+ MSA+ YAMDL F QV+R    SA   +    +S +LPP PS +  RP  P TP + + SPT  SS       P+++ + V 
Subjt:  EKEGKRGKSPTSSSNIPAPMSADQYAMDLGFTQVNRPRTRSASIQIRDSMESLTLPPRPSASSYRPL-PVTPMRPSVSPTTPSSSQQGSSIPTTFVEAVN

Query:  PPKRFIPKPEIKNYFEKPLQISEPIIEVEFD---------------------------------------------------------------------
         PK F P+P I  YF K   + +  IE EFD                                                                     
Subjt:  PPKRFIPKPEIKNYFEKPLQISEPIIEVEFD---------------------------------------------------------------------

Query:  ------VG-----------------------------------MHFCVFVY--------------------------------FSQPIHKNAFPLTMRFC
              VG                                   M FC   Y                                F Q I+ +    T RF 
Subjt:  ------VG-----------------------------------MHFCVFVY--------------------------------FSQPIHKNAFPLTMRFC

Query:  LYFQTPWIFCWNFQINTYTYYKQIVKVLQIKWWDKYNFSHAGIKEVKQWFADNGYLQDLSKKKNAEFLNSKSKLLAALAQTTTEADLQKILNQVT-----
        LYFQ PWIFCWNFQ+     +K + K L+IKWW+K+N+SH  + ++K WF  N +LQD++++++  FL +K+ +++ LA  +T+ +   ++N V      
Subjt:  LYFQTPWIFCWNFQINTYTYYKQIVKVLQIKWWDKYNFSHAGIKEVKQWFADNGYLQDLSKKKNAEFLNSKSKLLAALAQTTTEADLQKILNQVT-----

Query:  ---SIASSSSPQNSEVEEDDEY------DINDPFLDSQP
            +   +SP +    +D +Y      DINDPFLD+QP
Subjt:  ---SIASSSSPQNSEVEEDDEY------DINDPFLDSQP

A0A5A7UX67 Enzymatic polyprotein (e value: 7.1e-29; % identity: 28.91)
Query:  MNTNISPRALRSSLKGSTILIEANLDKSSITVPKSLSWEQITRNPTWKLTEAFTPPKRNSNLAQIIEHIDGAVEIQFSEETEVTKVKEFMSSRPSTSRTS
        M+TN+SP+AL  S KG T+L+E N++KSS+T+P++L W+++T+NP WKL     P  R+S  A I E  DG VE+QF+      +V E MSSRPSTS  S
Subjt:  MNTNISPRALRSSLKGSTILIEANLDKSSITVPKSLSWEQITRNPTWKLTEAFTPPKRNSNLAQIIEHIDGAVEIQFSEETEVTKVKEFMSSRPSTSRTS

Query:  AASEADSKFYFDRSNSLRVKSVNIEQNVANAHYEAQ--PQSPTQTDMDNRS---------------MFSSQLNVLIEGFSINKEALKKDFLSIKN-----
          SEA  +    RS S+R  SV+    + + HYE +    SPTQ+DM+ RS                F    +V I+ +       +K FL++ +     
Subjt:  AASEADSKFYFDRSNSLRVKSVNIEQNVANAHYEAQ--PQSPTQTDMDNRS---------------MFSSQLNVLIEGFSINKEALKKDFLSIKN-----

Query:  ----KTKRNAFFKKYD-ENQKSEIKSK--WYSFM-EEIEQNIP-----FFTWMK-----------QSVNEINIEGIRLCNESKVQTKL---NKTF-----
            + K  A  KK   + Q + IK    W +   +E+  N P     +F+              +++NE  ++ + +     +Q +L   NKT      
Subjt:  ----KTKRNAFFKKYD-ENQKSEIKSK--WYSFM-EEIEQNIP-----FFTWMK-----------QSVNEINIEGIRLCNESKVQTKL---NKTF-----

Query:  -------------------------------------------------------------GSKKA-------------------ELGIFCDQYGFGKVH
                                                                     GSK A                   ELG FC QYG  +  
Subjt:  -------------------------------------------------------------GSKKA-------------------ELGIFCDQYGFGKVH

Query:  PPSVVRKTAIKRY--RKHQERQNYRPYRSFQRKNYRKPYNSYTKKRPTYKGGHLKEKIQCYKCRSHGHYANKCPMKKKINELEIDDEFKHQLLNIIQSDS
         P   +K   KRY  +K   +   +   S QR+  ++ YN    K  + KG   K    C+KC   GHYAN+CP+K KIN + ID+E K  LL  I+SD 
Subjt:  PPSVVRKTAIKRY--RKHQERQNYRPYRSFQRKNYRKPYNSYTKKRPTYKGGHLKEKIQCYKCRSHGHYANKCPMKKKINELEIDDEFKHQLLNIIQSDS

Query:  E---ESIYSSDEGQILELQEETDSYSDSEYDEEEKEGKRGKSP
        +   ++  SS+E  I  LQEE  S  +  Y + +     G  P
Subjt:  E---ESIYSSDEGQILELQEETDSYSDSEYDEEEKEGKRGKSP

A0A5D3BI61 Uncharacterized protein (e value: 5.3e-32; % identity: 37.68)
Query:  RGKSPTSSSN-IPAPMSADQYAMDLGFTQVNRPRTRSASIQIRDSMESLTLPPRPSASSYRPL-PVTPMRPSVSPTTPSSSQQGSSIPTTFVEAVNPPKR
        +G  P+SSSN  P+PMS DQYAMDLG+T V + R++S+ I IR  MES T PPRPS +   P   V  MR S S   PSS+++ S++PTT+ +AV P K+
Subjt:  RGKSPTSSSN-IPAPMSADQYAMDLGFTQVNRPRTRSASIQIRDSMESLTLPPRPSASSYRPL-PVTPMRPSVSPTTPSSSQQGSSIPTTFVEAVNPPKR

Query:  FIPKPEIKNYFEKPLQISEPIIEVEFDVGMHFCVFVYFSQPIHKNAFPLTMRFCLYFQTPWIFCWNFQINTYTYYKQIVKVLQIKWWDKYNFSHAGIKEV
        F+P+ EIK+YF+KP+ I +PIIE+E+                                                                          
Subjt:  FIPKPEIKNYFEKPLQISEPIIEVEFDVGMHFCVFVYFSQPIHKNAFPLTMRFCLYFQTPWIFCWNFQINTYTYYKQIVKVLQIKWWDKYNFSHAGIKEV

Query:  KQWFADNGYLQDLSKKKNAEFLNSKSKLLAALAQTTTEADLQKILNQVTSIASSSSPQNSEVEEDD-----EYDINDPFLDSQP
              NG LQD+ ++KNA+FLN K K LAAL Q T +AD Q++L+   + +SSS P  S ++ED+     EYD++DPFLDSQP
Subjt:  KQWFADNGYLQDLSKKKNAEFLNSKSKLLAALAQTTTEADLQKILNQVTSIASSSSPQNSEVEEDD-----EYDINDPFLDSQP

A0A6J1EW44 uncharacterized protein LOC111436618 (e value: 1.5e-26; % identity: 42.79)
Query:  NAFFKKYDENQKSEIKSKWYSFMEEIEQNIPFFTWMKQSVNEINI-EGIRLCNESKVQTKLNKTFGSKKAELGIFCDQYGFGKVHPPSVVRKTAIKRYRK
        N   K +    K  +K+K+          IP+ T    S+  + I EG+RLCNESK+Q KLN +  ++K ELG FCDQYG   +  PS  R+  +K + K
Subjt:  NAFFKKYDENQKSEIKSKWYSFMEEIEQNIPFFTWMKQSVNEINI-EGIRLCNESKVQTKLNKTFGSKKAELGIFCDQYGFGKVHPPSVVRKTAIKRYRK

Query:  HQERQNYRPYRSFQRK--NYRKPYNSYTKKRPTYKGGHLKEKIQCYKCRSHGHYANKCPMKKKINELEIDDEFKHQLLNIIQSDSEESIYSSDEGQILEL
             +YRP  ++  K    +KP  S  K  PT K    K+K   +KCR  GHY NKCP++ KINEL+ID E K+QLL +  +DSE+    S EG+ILEL
Subjt:  HQERQNYRPYRSFQRK--NYRKPYNSYTKKRPTYKGGHLKEKIQCYKCRSHGHYANKCPMKKKINELEIDDEFKHQLLNIIQSDSEESIYSSDEGQILEL

Query:  QEETDSYSDSEYDEE
        Q+E+DSYS +EY+ E
Subjt:  QEETDSYSDSEYDEE

A0A6J1EYM2 uncharacterized protein LOC111439730 (e value: 7.8e-28; % identity: 41.67)
Query:  IPFFTWMKQSVNEINI-EGIRLCNESKVQTKLNKTFGSKKAELGIFCDQYGFGKVHPPSVVRKTAIKRYRKHQERQNYRPYRSFQRK--NYRKPYNSYTK
        IP+ T    S+    I EG+RL NESK+Q KLN +  ++K ELG FCDQYG   +  PS   +  +K + K     +YRP  +++ K    +KP  S  K
Subjt:  IPFFTWMKQSVNEINI-EGIRLCNESKVQTKLNKTFGSKKAELGIFCDQYGFGKVHPPSVVRKTAIKRYRKHQERQNYRPYRSFQRK--NYRKPYNSYTK

Query:  KRPTYKGGHLKEKIQCYKCRSHGHYANKCPMKKKINELEIDDEFKHQLLNIIQSDSEESIYSSDEGQILELQEETDSYSDSEYDEEEKEGKRGKSPTSSS
          PT   G  K+K  C+KCR  GHYANKCP++ KINEL+ID E K+QLL +  +DSE+    S EG+ILELQEE+DSYS +EY E E+EGKR     +  
Subjt:  KRPTYKGGHLKEKIQCYKCRSHGHYANKCPMKKKINELEIDDEFKHQLLNIIQSDSEESIYSSDEGQILELQEETDSYSDSEYDEEEKEGKRGKSPTSSS

Query:  NIPAPMSADQYAMDLGFTQVNRPRTRSASIQ-IRDSMESLTLPPRPSASSYR
             ++ DQ  +     +V  P  +    Q +RD+M     P R   + YR
Subjt:  NIPAPMSADQYAMDLGFTQVNRPRTRSASIQ-IRDSMESLTLPPRPSASSYR

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGAATACAAACATTTCTCCGAGAGCTTTAAGGTCTTCTCTAAAGGGATCAACAATCCTTATAGAAGCAAATCTAGATAAGTCCTCAATTACTGTCCCAAAAAGTTTATC
TTGGGAACAAATCACTAGAAATCCAACGTGGAAGCTTACGGAAGCTTTCACTCCACCAAAAAGAAATTCAAATTTAGCCCAAATTATCGAACATATAGATGGGGCCGTTG
AAATACAATTCTCAGAAGAAACAGAGGTTACTAAAGTCAAGGAATTCATGTCCTCAAGACCAAGTACCTCTCGAACCTCTGCAGCCTCAGAAGCCGACTCAAAATTCTAT
TTTGATCGATCTAACTCGTTGCGAGTAAAATCAGTCAATATAGAACAAAATGTAGCAAATGCTCATTATGAAGCACAACCACAATCTCCAACCCAAACAGACATGGATAA
TCGATCTATGTTCTCTAGTCAATTAAATGTTCTTATTGAAGGATTTTCAATTAATAAAGAAGCACTAAAAAAGGATTTCCTTTCAATAAAAAACAAAACAAAGAGGAATG
CCTTTTTTAAAAAATATGATGAAAACCAAAAATCTGAAATCAAGTCAAAATGGTATTCATTTATGGAAGAAATAGAGCAAAATATTCCATTTTTTACATGGATGAAACAA
TCTGTAAACGAAATCAATATTGAAGGTATCCGTCTATGCAATGAATCCAAAGTTCAAACAAAACTTAACAAAACCTTTGGTTCAAAAAAGGCAGAATTAGGAATTTTCTG
TGATCAATATGGATTTGGAAAAGTCCATCCACCCTCTGTAGTCCGAAAAACTGCTATAAAAAGATACAGAAAACATCAAGAAAGACAAAATTATCGTCCATATAGATCAT
TCCAAAGAAAGAATTATAGAAAGCCCTATAATTCTTATACTAAAAAACGACCCACTTATAAAGGTGGACATTTAAAAGAAAAGATTCAATGCTATAAATGTCGATCACAT
GGACACTATGCAAATAAGTGTCCTATGAAAAAGAAGATCAATGAATTAGAAATTGATGATGAATTTAAACATCAACTTCTAAACATCATCCAATCTGATTCTGAAGAATC
AATCTATAGTTCTGATGAAGGTCAAATACTTGAATTACAAGAAGAAACTGATTCATATTCTGATAGCGAATATGATGAAGAAGAAAAAGAAGGCAAAAGAGGCAAAAGCC
CAACCTCTTCTTCAAACATTCCAGCTCCCATGAGCGCCGATCAATACGCAATGGATCTGGGTTTCACTCAAGTGAATCGCCCAAGAACCAGAAGCGCATCGATTCAAATA
AGAGATTCAATGGAGTCATTAACTCTACCACCAAGGCCATCGGCCTCCTCATATCGACCTCTGCCAGTTACACCAATGAGACCTTCTGTCTCACCTACTACTCCGTCTTC
ATCTCAACAGGGATCTTCAATCCCTACAACCTTTGTTGAAGCCGTCAATCCTCCGAAAAGGTTTATCCCTAAACCTGAAATCAAAAATTATTTCGAAAAACCACTTCAAA
TTTCGGAGCCAATTATCGAAGTAGAATTCGATGTGGGAATGCATTTTTGTGTATTTGTGTATTTTTCACAACCAATACACAAAAATGCATTCCCACTCACAATGCGTTTT
TGTTTATACTTTCAAACCCCTTGGATATTTTGTTGGAATTTCCAAATCAATACATACACTTATTACAAACAAATCGTAAAAGTTTTACAAATCAAATGGTGGGACAAGTA
CAACTTTTCTCACGCAGGAATCAAAGAAGTGAAGCAATGGTTTGCCGATAATGGCTACCTCCAAGACCTGTCAAAAAAGAAAAATGCAGAGTTCCTCAATTCCAAATCAA
AGTTGCTAGCTGCCTTAGCACAAACAACAACAGAGGCAGATTTACAAAAAATTTTGAATCAAGTTACTTCAATAGCATCTTCGTCTTCTCCACAAAACTCTGAAGTGGAA
GAAGATGATGAATATGATATCAATGATCCGTTTTTAGATTCTCAACCAATGTAA
mRNA sequence
ATGAATACAAACATTTCTCCGAGAGCTTTAAGGTCTTCTCTAAAGGGATCAACAATCCTTATAGAAGCAAATCTAGATAAGTCCTCAATTACTGTCCCAAAAAGTTTATC
TTGGGAACAAATCACTAGAAATCCAACGTGGAAGCTTACGGAAGCTTTCACTCCACCAAAAAGAAATTCAAATTTAGCCCAAATTATCGAACATATAGATGGGGCCGTTG
AAATACAATTCTCAGAAGAAACAGAGGTTACTAAAGTCAAGGAATTCATGTCCTCAAGACCAAGTACCTCTCGAACCTCTGCAGCCTCAGAAGCCGACTCAAAATTCTAT
TTTGATCGATCTAACTCGTTGCGAGTAAAATCAGTCAATATAGAACAAAATGTAGCAAATGCTCATTATGAAGCACAACCACAATCTCCAACCCAAACAGACATGGATAA
TCGATCTATGTTCTCTAGTCAATTAAATGTTCTTATTGAAGGATTTTCAATTAATAAAGAAGCACTAAAAAAGGATTTCCTTTCAATAAAAAACAAAACAAAGAGGAATG
CCTTTTTTAAAAAATATGATGAAAACCAAAAATCTGAAATCAAGTCAAAATGGTATTCATTTATGGAAGAAATAGAGCAAAATATTCCATTTTTTACATGGATGAAACAA
TCTGTAAACGAAATCAATATTGAAGGTATCCGTCTATGCAATGAATCCAAAGTTCAAACAAAACTTAACAAAACCTTTGGTTCAAAAAAGGCAGAATTAGGAATTTTCTG
TGATCAATATGGATTTGGAAAAGTCCATCCACCCTCTGTAGTCCGAAAAACTGCTATAAAAAGATACAGAAAACATCAAGAAAGACAAAATTATCGTCCATATAGATCAT
TCCAAAGAAAGAATTATAGAAAGCCCTATAATTCTTATACTAAAAAACGACCCACTTATAAAGGTGGACATTTAAAAGAAAAGATTCAATGCTATAAATGTCGATCACAT
GGACACTATGCAAATAAGTGTCCTATGAAAAAGAAGATCAATGAATTAGAAATTGATGATGAATTTAAACATCAACTTCTAAACATCATCCAATCTGATTCTGAAGAATC
AATCTATAGTTCTGATGAAGGTCAAATACTTGAATTACAAGAAGAAACTGATTCATATTCTGATAGCGAATATGATGAAGAAGAAAAAGAAGGCAAAAGAGGCAAAAGCC
CAACCTCTTCTTCAAACATTCCAGCTCCCATGAGCGCCGATCAATACGCAATGGATCTGGGTTTCACTCAAGTGAATCGCCCAAGAACCAGAAGCGCATCGATTCAAATA
AGAGATTCAATGGAGTCATTAACTCTACCACCAAGGCCATCGGCCTCCTCATATCGACCTCTGCCAGTTACACCAATGAGACCTTCTGTCTCACCTACTACTCCGTCTTC
ATCTCAACAGGGATCTTCAATCCCTACAACCTTTGTTGAAGCCGTCAATCCTCCGAAAAGGTTTATCCCTAAACCTGAAATCAAAAATTATTTCGAAAAACCACTTCAAA
TTTCGGAGCCAATTATCGAAGTAGAATTCGATGTGGGAATGCATTTTTGTGTATTTGTGTATTTTTCACAACCAATACACAAAAATGCATTCCCACTCACAATGCGTTTT
TGTTTATACTTTCAAACCCCTTGGATATTTTGTTGGAATTTCCAAATCAATACATACACTTATTACAAACAAATCGTAAAAGTTTTACAAATCAAATGGTGGGACAAGTA
CAACTTTTCTCACGCAGGAATCAAAGAAGTGAAGCAATGGTTTGCCGATAATGGCTACCTCCAAGACCTGTCAAAAAAGAAAAATGCAGAGTTCCTCAATTCCAAATCAA
AGTTGCTAGCTGCCTTAGCACAAACAACAACAGAGGCAGATTTACAAAAAATTTTGAATCAAGTTACTTCAATAGCATCTTCGTCTTCTCCACAAAACTCTGAAGTGGAA
GAAGATGATGAATATGATATCAATGATCCGTTTTTAGATTCTCAACCAATGTAA
Protein sequence
MNTNISPRALRSSLKGSTILIEANLDKSSITVPKSLSWEQITRNPTWKLTEAFTPPKRNSNLAQIIEHIDGAVEIQFSEETEVTKVKEFMSSRPSTSRTSAASEADSKFY
FDRSNSLRVKSVNIEQNVANAHYEAQPQSPTQTDMDNRSMFSSQLNVLIEGFSINKEALKKDFLSIKNKTKRNAFFKKYDENQKSEIKSKWYSFMEEIEQNIPFFTWMKQ
SVNEINIEGIRLCNESKVQTKLNKTFGSKKAELGIFCDQYGFGKVHPPSVVRKTAIKRYRKHQERQNYRPYRSFQRKNYRKPYNSYTKKRPTYKGGHLKEKIQCYKCRSH
GHYANKCPMKKKINELEIDDEFKHQLLNIIQSDSEESIYSSDEGQILELQEETDSYSDSEYDEEEKEGKRGKSPTSSSNIPAPMSADQYAMDLGFTQVNRPRTRSASIQI
RDSMESLTLPPRPSASSYRPLPVTPMRPSVSPTTPSSSQQGSSIPTTFVEAVNPPKRFIPKPEIKNYFEKPLQISEPIIEVEFDVGMHFCVFVYFSQPIHKNAFPLTMRF
CLYFQTPWIFCWNFQINTYTYYKQIVKVLQIKWWDKYNFSHAGIKEVKQWFADNGYLQDLSKKKNAEFLNSKSKLLAALAQTTTEADLQKILNQVTSIASSSSPQNSEVE
EDDEYDINDPFLDSQPM
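
The protein sequence can be checked against the CDS by translating codons. The sketch below is illustrative only: it assumes the standard nuclear genetic code (this record does not name a translation table), `translate` is a hypothetical helper, and the two test strings are the first 30 and last 15 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
# ASSUMPTION: the standard table applies to this gene model.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS
print(translate("ATGAATACAAACATTTCTCCGAGAGCTTTA"))
# Last five codons of the CDS, including the TAA stop
print(translate("TCTCAACCAATGTAA"))
```

The two fragments translate to the first ten residues (`MNTNISPRAL`) and the last four residues (`SQPM`) of the protein sequence above, which is consistent with the CDS being read in frame from its first base.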