CuGenDBv2

PI0021633 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0021633
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: Protein FAR1-RELATED SEQUENCE 4-like
Genome location: chr03:20745325..20746301
RNA-Seq Expression: PI0021633
Synteny: PI0021633
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR004332 - Transposase, MuDR, plant
IPR018289 - MULE transposase domain
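
The genome location chr03:20745325..20746301 can be split into a chromosome name plus start and end coordinates. The following is a minimal Python sketch, not part of the database itself. It assumes the coordinates are 1-based and inclusive, which this page does not state, and the helper names parse_location and span_length are hypothetical.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive.
    return end - start + 1

chrom, start, end = parse_location("chr03:20745325..20746301")
print(chrom, span_length(start, end))
```

Under that assumption the locus spans 977 bp.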


Homology
GenBank top hits (e value, % identity, alignment)
KAA0031318.1 protein FAR1-RELATED SEQUENCE 4-like [Cucumis melo var. makuwa] (e value: 8.0e-79; identity: 58.49%)
Query:  SSSVSND-EVIRDI-----SYYGDLKGKKLFESKEVLSKCFCIIAVKNNFQFRTTVSNLRSLEIKCLQKGCKWYVRASRYKKSELWMLQKYISNHDCSLN
        SSS+ +D +++ DI     S   DLK K +F SKE+LSK F  IA+KNNF+F+T  SN +S+E KC Q  C WYVRASRYK  ELW L+KYI+NH+CS+N
Subjt:  SSSVSND-EVIRDI-----SYYGDLKGKKLFESKEVLSKCFCIIAVKNNFQFRTTVSNLRSLEIKCLQKGCKWYVRASRYKKSELWMLQKYISNHDCSLN

Query:  TTQSCHKQASSIIMST---------DRSILKAIFHKARTNLGVNISYQKAWRAKEHIVKILKSDVVESYSLIASFFDKLVESNPGTCAALEMDDNGHFKF
          Q+ HKQAS+ ++S          DRS    I    RT LGVN+SY KAWRAKE ++  L  +  ESY+LI +FF KL E NPG+  A E D  GHFK+
Subjt:  TTQSCHKQASSIIMST---------DRSILKAIFHKARTNLGVNISYQKAWRAKEHIVKILKSDVVESYSLIASFFDKLVESNPGTCAALEMDDNGHFKF

Query:  CFMAFGASIEGWKYCRPIISVDGTFFKCKFGGILLTVSSQDGNNQIFPLAFVIIDSENDASWTWF
        CFMA GA IEGWKYCRP ISVDGTF K K+GG LLT S+ DGNNQIFPLAF I+DSENDASW WF
Subjt:  CFMAFGASIEGWKYCRPIISVDGTFFKCKFGGILLTVSSQDGNNQIFPLAFVIIDSENDASWTWF

TYK09469.1 protein FAR1-RELATED SEQUENCE 4-like [Cucumis melo var. makuwa] (e value: 8.0e-79; identity: 58.49%)
Query:  SSSVSND-EVIRDI-----SYYGDLKGKKLFESKEVLSKCFCIIAVKNNFQFRTTVSNLRSLEIKCLQKGCKWYVRASRYKKSELWMLQKYISNHDCSLN
        SSS+ +D +++ DI     S   DLK K +F SKE+LSK F  IA+KNNF+F+T  SN +S+E KC Q  C WYVRASRYK  ELW L+KYI+NH+CS+N
Subjt:  SSSVSND-EVIRDI-----SYYGDLKGKKLFESKEVLSKCFCIIAVKNNFQFRTTVSNLRSLEIKCLQKGCKWYVRASRYKKSELWMLQKYISNHDCSLN

Query:  TTQSCHKQASSIIMST---------DRSILKAIFHKARTNLGVNISYQKAWRAKEHIVKILKSDVVESYSLIASFFDKLVESNPGTCAALEMDDNGHFKF
          Q+ HKQAS+ ++S          DRS    I    RT LGVN+SY KAWRAKE ++  L  +  ESY+LI +FF KL E NPG+  A E D  GHFK+
Subjt:  TTQSCHKQASSIIMST---------DRSILKAIFHKARTNLGVNISYQKAWRAKEHIVKILKSDVVESYSLIASFFDKLVESNPGTCAALEMDDNGHFKF

Query:  CFMAFGASIEGWKYCRPIISVDGTFFKCKFGGILLTVSSQDGNNQIFPLAFVIIDSENDASWTWF
        CFMA GA IEGWKYCRP ISVDGTF K K+GG LLT S+ DGNNQIFPLAF I+DSENDASW WF
Subjt:  CFMAFGASIEGWKYCRPIISVDGTFFKCKFGGILLTVSSQDGNNQIFPLAFVIIDSENDASWTWF

TYK23827.1 MuDR family transposase [Cucumis melo var. makuwa] (e value: 1.2e-79; identity: 58.3%)
Query:  LLNVGSSSSVSND-EVIRDI-----SYYGDLKGKKLFESKEVLSKCFCIIAVKNNFQFRTTVSNLRSLEIKCLQKGCKWYVRASRYKKSELWMLQKYISN
        L ++ SSSS+ +D +++ DI     S   DLK K +F SKE+LSK F  IA+KNNF+F+T  SN +S+E KC Q  C WYVRASRYK  ELW L+KYI+N
Subjt:  LLNVGSSSSVSND-EVIRDI-----SYYGDLKGKKLFESKEVLSKCFCIIAVKNNFQFRTTVSNLRSLEIKCLQKGCKWYVRASRYKKSELWMLQKYISN

Query:  HDCSLNTTQSCHKQASSIIMST---------DRSILKAIFHKARTNLGVNISYQKAWRAKEHIVKILKSDVVESYSLIASFFDKLVESNPGTCAALEMDD
        H+CS+N  Q+ HKQASS ++S          DRS    I    RT LGVN+SY KAWRAKE ++  L  +  ESY+LI +FF KL E NPG+  A E D 
Subjt:  HDCSLNTTQSCHKQASSIIMST---------DRSILKAIFHKARTNLGVNISYQKAWRAKEHIVKILKSDVVESYSLIASFFDKLVESNPGTCAALEMDD

Query:  NGHFKFCFMAFGASIEGWKYCRPIISVDGTFFKCKFGGILLTVSSQDGNNQIFPLAFVIIDSENDASWTWF
         GHFK+CFMA GA IEGWKYCRP ISVDGTF K K+GG LLT S+ DGNNQIFPLAF I+DSENDASW WF
Subjt:  NGHFKFCFMAFGASIEGWKYCRPIISVDGTFFKCKFGGILLTVSSQDGNNQIFPLAFVIIDSENDASWTWF

XP_038896605.1 uncharacterized protein LOC120084863 [Benincasa hispida] (e value: 6.8e-94; identity: 64.04%)
Query:  LLNVGSSSSVSNDEVIRDISYYGDLKGKKLFESKEVLSKCFCIIAVKNNFQFRTTVSNLRSLEIKCLQKGCKWYVRASRYKKSELWMLQKYISNHDCSLN
        LL +GSSS  SND+VI+++ + GDLK K +F SKE+L KCF +IAV  NFQFRTT SN +S E+KCLQ+GC+WYVRAS YKKSELWML+KYIS+H+C +N
Subjt:  LLNVGSSSSVSNDEVIRDISYYGDLKGKKLFESKEVLSKCFCIIAVKNNFQFRTTVSNLRSLEIKCLQKGCKWYVRASRYKKSELWMLQKYISNHDCSLN

Query:  TTQSCHKQASSII-----------MSTDRSILKAIFHKARTNLGVNISYQKAWRAKEHIVKILKSDVVESYSLIASFFDKLVESNPGTCAALEMDDNGHF
        TTQSCH+QASS +           +STD S+ + I +KART LGVNISYQKAWR KEHI++ L  D  +SYSLI  FF +L E NPGT  AL++D+NGHF
Subjt:  TTQSCHKQASSII-----------MSTDRSILKAIFHKARTNLGVNISYQKAWRAKEHIVKILKSDVVESYSLIASFFDKLVESNPGTCAALEMDDNGHF

Query:  KFCFMAFGASIEGWKYCRPIISVDGTFFKCKFGGILLTVSSQDGNNQIFPLAFVIIDSENDASWTWF
        K CFM  GASIEGW+Y  P ISVDGTF K KFGG LL+ S+ DGNN IFPLAF I+DSEND SW WF
Subjt:  KFCFMAFGASIEGWKYCRPIISVDGTFFKCKFGGILLTVSSQDGNNQIFPLAFVIIDSENDASWTWF

XP_038907134.1 uncharacterized protein LOC120092945 [Benincasa hispida] (e value: 1.3e-81; identity: 60.24%)
Query:  EVIRDISYYGDLKGKKLFESKEVLSKCFCIIAVKNNFQFRTTVSNLRSLEIKCLQKGCKWYVRASRYKKSELWMLQKYISNHDCSLNTTQSCHKQASSII
        +VI DI    DLK K +F SKE+LSKCF  IAVK NF+F+T  SN RS+E +C+Q GC+WYVRASRYK S+LWML+K+I  HDCS+N  Q+ H+QAS+ +
Subjt:  EVIRDISYYGDLKGKKLFESKEVLSKCFCIIAVKNNFQFRTTVSNLRSLEIKCLQKGCKWYVRASRYKKSELWMLQKYISNHDCSLNTTQSCHKQASSII

Query:  M-----------STDRSILKAIFHKARTNLGVNISYQKAWRAKEHIVKILKSDVVESYSLIASFFDKLVESNPGTCAALEMDDNGHFKFCFMAFGASIEG
        +           S+D    K I +K R  LGVNISY KAWRAKEHI+K LK D  ESY+LI  F  KL E NPGT  A E D +GHFK+C+MA G+SIEG
Subjt:  M-----------STDRSILKAIFHKARTNLGVNISYQKAWRAKEHIVKILKSDVVESYSLIASFFDKLVESNPGTCAALEMDDNGHFKFCFMAFGASIEG

Query:  WKYCRPIISVDGTFFKCKFGGILLTVSSQDGNNQIFPLAFVIIDSENDASWTWF
        WK+CRP I VDGTF KCK+ G LLT S+ DGNN+ FPLAF I+DSENDASW WF
Subjt:  WKYCRPIISVDGTFFKCKFGGILLTVSSQDGNNQIFPLAFVIIDSENDASWTWF

TrEMBL top hits (e value, % identity, alignment)
A0A5A7T3G5 Protein FAR1-RELATED SEQUENCE 4-like (e value: 3.9e-79; identity: 58.49%)
Query:  SSSVSND-EVIRDI-----SYYGDLKGKKLFESKEVLSKCFCIIAVKNNFQFRTTVSNLRSLEIKCLQKGCKWYVRASRYKKSELWMLQKYISNHDCSLN
        SSS+ +D +++ DI     S   DLK K +F SKE+LSK F  IA+KNNF+F+T  SN +S+E KC Q  C WYVRASRYK  ELW L+KYI+NH+CS+N
Subjt:  SSSVSND-EVIRDI-----SYYGDLKGKKLFESKEVLSKCFCIIAVKNNFQFRTTVSNLRSLEIKCLQKGCKWYVRASRYKKSELWMLQKYISNHDCSLN

Query:  TTQSCHKQASSIIMST---------DRSILKAIFHKARTNLGVNISYQKAWRAKEHIVKILKSDVVESYSLIASFFDKLVESNPGTCAALEMDDNGHFKF
          Q+ HKQAS+ ++S          DRS    I    RT LGVN+SY KAWRAKE ++  L  +  ESY+LI +FF KL E NPG+  A E D  GHFK+
Subjt:  TTQSCHKQASSIIMST---------DRSILKAIFHKARTNLGVNISYQKAWRAKEHIVKILKSDVVESYSLIASFFDKLVESNPGTCAALEMDDNGHFKF

Query:  CFMAFGASIEGWKYCRPIISVDGTFFKCKFGGILLTVSSQDGNNQIFPLAFVIIDSENDASWTWF
        CFMA GA IEGWKYCRP ISVDGTF K K+GG LLT S+ DGNNQIFPLAF I+DSENDASW WF
Subjt:  CFMAFGASIEGWKYCRPIISVDGTFFKCKFGGILLTVSSQDGNNQIFPLAFVIIDSENDASWTWF

A0A5A7UZ18 Protein FAR1-RELATED SEQUENCE 4-like (e value: 8.7e-79; identity: 58.11%)
Query:  SSSVSND-EVIRDI-----SYYGDLKGKKLFESKEVLSKCFCIIAVKNNFQFRTTVSNLRSLEIKCLQKGCKWYVRASRYKKSELWMLQKYISNHDCSLN
        SSS+ +D +++ DI     S   DLK K +F +KE+LSK F  IA+KNNF+F+T  SN +S+E KC Q  C WYVRASRYK  ELW L+KYI+NH+CS+N
Subjt:  SSSVSND-EVIRDI-----SYYGDLKGKKLFESKEVLSKCFCIIAVKNNFQFRTTVSNLRSLEIKCLQKGCKWYVRASRYKKSELWMLQKYISNHDCSLN

Query:  TTQSCHKQASSIIMST---------DRSILKAIFHKARTNLGVNISYQKAWRAKEHIVKILKSDVVESYSLIASFFDKLVESNPGTCAALEMDDNGHFKF
          Q+ HKQAS+ ++S          DRS    I    RT LGVN+SY KAWRAKE ++  L  +  ESY+LI +FF KL E NPG+  A E D  GHFK+
Subjt:  TTQSCHKQASSIIMST---------DRSILKAIFHKARTNLGVNISYQKAWRAKEHIVKILKSDVVESYSLIASFFDKLVESNPGTCAALEMDDNGHFKF

Query:  CFMAFGASIEGWKYCRPIISVDGTFFKCKFGGILLTVSSQDGNNQIFPLAFVIIDSENDASWTWF
        CFMA GA IEGWKYCRP ISVDGTF K K+GG LLT S+ DGNNQIFPLAF I+DSENDASW WF
Subjt:  CFMAFGASIEGWKYCRPIISVDGTFFKCKFGGILLTVSSQDGNNQIFPLAFVIIDSENDASWTWF

A0A5A7VG38 Protein FAR1-RELATED SEQUENCE 4-like (e value: 1.9e-78; identity: 58.11%)
Query:  SSSVSND-EVIRDI-----SYYGDLKGKKLFESKEVLSKCFCIIAVKNNFQFRTTVSNLRSLEIKCLQKGCKWYVRASRYKKSELWMLQKYISNHDCSLN
        SSS+ +D +++ DI     S   DLK   +F SKE+LSK F  IA+KNNF+F+T  SN +S+E KC Q  C WYVRASRYK  +LW L+KYI+NH+CS+N
Subjt:  SSSVSND-EVIRDI-----SYYGDLKGKKLFESKEVLSKCFCIIAVKNNFQFRTTVSNLRSLEIKCLQKGCKWYVRASRYKKSELWMLQKYISNHDCSLN

Query:  TTQSCHKQASSIIMST---------DRSILKAIFHKARTNLGVNISYQKAWRAKEHIVKILKSDVVESYSLIASFFDKLVESNPGTCAALEMDDNGHFKF
          Q+ HKQASS ++S          DRS    I    RT LGVN+SY KAWRAKE ++  L  +  ESY+LI +FF KL E NPG+  A E D  GHFK+
Subjt:  TTQSCHKQASSIIMST---------DRSILKAIFHKARTNLGVNISYQKAWRAKEHIVKILKSDVVESYSLIASFFDKLVESNPGTCAALEMDDNGHFKF

Query:  CFMAFGASIEGWKYCRPIISVDGTFFKCKFGGILLTVSSQDGNNQIFPLAFVIIDSENDASWTWF
        CFMA GA IEGWKYCRP ISVDGTF K K+GG LLT S+ DGNNQIFPLAF I+DSENDASW WF
Subjt:  CFMAFGASIEGWKYCRPIISVDGTFFKCKFGGILLTVSSQDGNNQIFPLAFVIIDSENDASWTWF

A0A5D3DAW8 Protein FAR1-RELATED SEQUENCE 4-like (e value: 3.9e-79; identity: 58.49%)
Query:  SSSVSND-EVIRDI-----SYYGDLKGKKLFESKEVLSKCFCIIAVKNNFQFRTTVSNLRSLEIKCLQKGCKWYVRASRYKKSELWMLQKYISNHDCSLN
        SSS+ +D +++ DI     S   DLK K +F SKE+LSK F  IA+KNNF+F+T  SN +S+E KC Q  C WYVRASRYK  ELW L+KYI+NH+CS+N
Subjt:  SSSVSND-EVIRDI-----SYYGDLKGKKLFESKEVLSKCFCIIAVKNNFQFRTTVSNLRSLEIKCLQKGCKWYVRASRYKKSELWMLQKYISNHDCSLN

Query:  TTQSCHKQASSIIMST---------DRSILKAIFHKARTNLGVNISYQKAWRAKEHIVKILKSDVVESYSLIASFFDKLVESNPGTCAALEMDDNGHFKF
          Q+ HKQAS+ ++S          DRS    I    RT LGVN+SY KAWRAKE ++  L  +  ESY+LI +FF KL E NPG+  A E D  GHFK+
Subjt:  TTQSCHKQASSIIMST---------DRSILKAIFHKARTNLGVNISYQKAWRAKEHIVKILKSDVVESYSLIASFFDKLVESNPGTCAALEMDDNGHFKF

Query:  CFMAFGASIEGWKYCRPIISVDGTFFKCKFGGILLTVSSQDGNNQIFPLAFVIIDSENDASWTWF
        CFMA GA IEGWKYCRP ISVDGTF K K+GG LLT S+ DGNNQIFPLAF I+DSENDASW WF
Subjt:  CFMAFGASIEGWKYCRPIISVDGTFFKCKFGGILLTVSSQDGNNQIFPLAFVIIDSENDASWTWF

A0A5D3DJR8 MuDR family transposase (e value: 6.0e-80; identity: 58.3%)
Query:  LLNVGSSSSVSND-EVIRDI-----SYYGDLKGKKLFESKEVLSKCFCIIAVKNNFQFRTTVSNLRSLEIKCLQKGCKWYVRASRYKKSELWMLQKYISN
        L ++ SSSS+ +D +++ DI     S   DLK K +F SKE+LSK F  IA+KNNF+F+T  SN +S+E KC Q  C WYVRASRYK  ELW L+KYI+N
Subjt:  LLNVGSSSSVSND-EVIRDI-----SYYGDLKGKKLFESKEVLSKCFCIIAVKNNFQFRTTVSNLRSLEIKCLQKGCKWYVRASRYKKSELWMLQKYISN

Query:  HDCSLNTTQSCHKQASSIIMST---------DRSILKAIFHKARTNLGVNISYQKAWRAKEHIVKILKSDVVESYSLIASFFDKLVESNPGTCAALEMDD
        H+CS+N  Q+ HKQASS ++S          DRS    I    RT LGVN+SY KAWRAKE ++  L  +  ESY+LI +FF KL E NPG+  A E D 
Subjt:  HDCSLNTTQSCHKQASSIIMST---------DRSILKAIFHKARTNLGVNISYQKAWRAKEHIVKILKSDVVESYSLIASFFDKLVESNPGTCAALEMDD

Query:  NGHFKFCFMAFGASIEGWKYCRPIISVDGTFFKCKFGGILLTVSSQDGNNQIFPLAFVIIDSENDASWTWF
         GHFK+CFMA GA IEGWKYCRP ISVDGTF K K+GG LLT S+ DGNNQIFPLAF I+DSENDASW WF
Subjt:  NGHFKFCFMAFGASIEGWKYCRPIISVDGTFFKCKFGGILLTVSSQDGNNQIFPLAFVIIDSENDASWTWF

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G49920.1 MuDR family transposase (e value: 4.8e-13; identity: 27.6%)
Query:  KKLFESKEVLSKCFCIIAVKNNFQFRTTVSNLRSLEIKCLQKGCKWYVRASRYKKSELWMLQKYISNHDC---SLNTTQS-CHKQASSIIMSTDRSILKA
        K L E K+ +   +C I  +     R T  ++  +E  C +  CKW + ASR ++  L+ + +    HDC    LN   + C       ++    ++  A
Subjt:  KKLFESKEVLSKCFCIIAVKNNFQFRTTVSNLRSLEIKCLQKGCKWYVRASRYKKSELWMLQKYISNHDC---SLNTTQS-CHKQASSIIMSTDRSILKA

Query:  IFHK--------ARTNLGVNISYQKAWRAKEHIVKILKSDVVESYSLIASFFDKLVESNPGTCAALEMDDNGH------FKFCFMAFGASIEGWKYCRPI
           K        A   +  + S      AK   +K    D  +S+ LI      L  SN G     + D   H      F+  F AF  SI+G+++CRP+
Subjt:  IFHK--------ARTNLGVNISYQKAWRAKEHIVKILKSDVVESYSLIASFFDKLVESNPGTCAALEMDDNGH------FKFCFMAFGASIEGWKYCRPI

Query:  ISVDGTFFKCKFGGILLTVSSQDGNNQIFPLAFVIIDSENDASWTWFVRR
        I VD      K+   L+  S+ D  NQ FPLAF +    +  SW WF+ R
Subjt:  ISVDGTFFKCKFGGILLTVSSQDGNNQIFPLAFVIIDSENDASWTWFVRR

AT1G64255.1 MuDR family transposase (e value: 2.5e-14; identity: 27.71%)
Query:  DLKGKKLFESKEVLSKC--FCIIAVKNNFQFRTTVSNLRSLEIKCLQKGCKWYVRASRYKKSELWMLQKYISNHDCSLNTTQSCH--------KQASSII
        DL+    F+  + L K   +C +  +     R T  +    E  C++  CKW + A+R KK  L  + KY   H C     +           ++A   +
Subjt:  DLKGKKLFESKEVLSKC--FCIIAVKNNFQFRTTVSNLRSLEIKCLQKGCKWYVRASRYKKSELWMLQKYISNHDCSLNTTQSCH--------KQASSII

Query:  MSTDRSILKAIFHKARTNLGVNISYQKAWRAKEHIVKILKSDVVESYSLIASFFDKLVESNPGTCAALEMD--DNGHF-KFC--FMAFGASIEGWKYCRP
         +   S LK  + K    +G  +       AKE  +K +  D  +S+         L  SN G     + D   N +F  FC  F AF  SIEG+++CRP
Subjt:  MSTDRSILKAIFHKARTNLGVNISYQKAWRAKEHIVKILKSDVVESYSLIASFFDKLVESNPGTCAALEMD--DNGHF-KFC--FMAFGASIEGWKYCRP

Query:  IISVDGTFFKCKFGGILLTVSSQDGNNQIFPLAFVIIDSENDASWTWFV
        +I VD     C++   L+  S  D  N+ FPLAF +    +   W WF+
Subjt:  IISVDGTFFKCKFGGILLTVSSQDGNNQIFPLAFVIIDSENDASWTWFV

AT1G64260.1 MuDR family transposase (e value: 1.5e-14; identity: 24.79%)
Query:  FESKEVLSKC--FCIIAVKNNFQFRTTVSNLRSLEIKCLQKGCKWYVRASRYKKSELWMLQKYISNHDCSLN-----TTQSCHKQASSIIMSTDRSILKA
        F+ ++ L K   +  I  + N   R T   + + E  C++  CKW +RA+R ++  L  + KY   H CS        ++    +   ++       +  
Subjt:  FESKEVLSKC--FCIIAVKNNFQFRTTVSNLRSLEIKCLQKGCKWYVRASRYKKSELWMLQKYISNHDCSLN-----TTQSCHKQASSIIMSTDRSILKA

Query:  IFHKARTNLGVNISYQKAWRAKEHIVKILKSDVVESYSLIASFFDKLVESNPGTCAALEMD-----DNGHFKFCFMAFGASIEGWKYCRPIISVDGTFFK
        +    +   G  +   K    K  ++K +  D  +S+ ++         SN G     + D     D   F+  F +F  SIEG+++CRP+I VD     
Subjt:  IFHKARTNLGVNISYQKAWRAKEHIVKILKSDVVESYSLIASFFDKLVESNPGTCAALEMD-----DNGHFKFCFMAFGASIEGWKYCRPIISVDGTFFK

Query:  CKFGGILLTVSSQDGNNQIFPLAFVIIDSENDASWTWF
         K+   L+  S  D  N+ FPLAF +    +  SW WF
Subjt:  CKFGGILLTVSSQDGNNQIFPLAFVIIDSENDASWTWF
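
In the alignments above, the unlabelled middle line follows the usual BLAST-style convention: a letter marks an identical residue, + marks a conservative substitution, and a blank marks a mismatch or gap. Percent identity is conventionally the number of identical columns divided by the alignment length. The Python sketch below illustrates that calculation on a toy midline. The function name percent_identity is hypothetical, and this is not necessarily how the values on this page were computed.

```python
def percent_identity(midline, alignment_length=None):
    """Percentage of alignment columns whose midline shows an identical residue.

    alignment_length can be passed explicitly because trailing blanks in a
    copied midline are often stripped, which would shorten len(midline).
    """
    if alignment_length is None:
        alignment_length = len(midline)
    if alignment_length == 0:
        raise ValueError("empty alignment")
    # Letters are identities; '+' and blanks are not counted.
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / alignment_length

# Toy midline (hypothetical data): 4 identities over 6 columns.
print(round(percent_identity("AC+ GT"), 2))
```

For a multi-block alignment, the midlines of all blocks would be concatenated before counting.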


Sequences
CDS sequence
ATGTTGTTAAATGTAGGTTCTTCTTCTTCTGTTTCAAATGATGAAGTTATAAGAGACATTTCTTATTATGGTGATTTGAAGGGAAAAAAATTATTTGAAAGTAAAGAAGT
TCTTTCAAAGTGTTTTTGCATAATAGCAGTGAAGAATAACTTTCAATTTAGAACTACAGTATCAAATTTGAGGTCTCTTGAAATTAAATGTTTGCAAAAAGGGTGCAAAT
GGTATGTTAGGGCATCTCGTTATAAAAAGAGTGAGTTGTGGATGCTACAAAAATACATTTCTAACCATGACTGTTCATTGAATACTACCCAAAGTTGTCACAAGCAAGCT
TCTTCAATAATCATGTCTACTGATCGTTCCATTCTAAAAGCAATTTTCCATAAGGCTCGTACAAATCTTGGAGTTAATATAAGTTATCAAAAAGCTTGGAGGGCAAAAGA
ACACATAGTAAAGATATTAAAAAGTGATGTAGTTGAATCGTACTCGTTGATTGCAAGCTTCTTTGATAAATTGGTTGAATCTAACCCAGGTACATGCGCTGCTTTAGAGA
TGGATGATAATGGTCATTTCAAGTTTTGCTTTATGGCTTTTGGTGCATCAATTGAGGGGTGGAAATATTGTAGACCTATCATTTCTGTTGATGGCACGTTTTTTAAATGT
AAGTTTGGTGGCATCCTATTAACAGTCTCATCACAAGATGGTAACAATCAAATCTTTCCTCTTGCTTTTGTTATTATAGATTCTGAAAATGATGCATCATGGACATGGTT
TGTAAGAAGATACGAAGTAGTTTTTCATAGCAAGATAATTTAG
mRNA sequence
ATGTTGTTAAATGTAGGTTCTTCTTCTTCTGTTTCAAATGATGAAGTTATAAGAGACATTTCTTATTATGGTGATTTGAAGGGAAAAAAATTATTTGAAAGTAAAGAAGT
TCTTTCAAAGTGTTTTTGCATAATAGCAGTGAAGAATAACTTTCAATTTAGAACTACAGTATCAAATTTGAGGTCTCTTGAAATTAAATGTTTGCAAAAAGGGTGCAAAT
GGTATGTTAGGGCATCTCGTTATAAAAAGAGTGAGTTGTGGATGCTACAAAAATACATTTCTAACCATGACTGTTCATTGAATACTACCCAAAGTTGTCACAAGCAAGCT
TCTTCAATAATCATGTCTACTGATCGTTCCATTCTAAAAGCAATTTTCCATAAGGCTCGTACAAATCTTGGAGTTAATATAAGTTATCAAAAAGCTTGGAGGGCAAAAGA
ACACATAGTAAAGATATTAAAAAGTGATGTAGTTGAATCGTACTCGTTGATTGCAAGCTTCTTTGATAAATTGGTTGAATCTAACCCAGGTACATGCGCTGCTTTAGAGA
TGGATGATAATGGTCATTTCAAGTTTTGCTTTATGGCTTTTGGTGCATCAATTGAGGGGTGGAAATATTGTAGACCTATCATTTCTGTTGATGGCACGTTTTTTAAATGT
AAGTTTGGTGGCATCCTATTAACAGTCTCATCACAAGATGGTAACAATCAAATCTTTCCTCTTGCTTTTGTTATTATAGATTCTGAAAATGATGCATCATGGACATGGTT
TGTAAGAAGATACGAAGTAGTTTTTCATAGCAAGATAATTTAG
Protein sequence
MLLNVGSSSSVSNDEVIRDISYYGDLKGKKLFESKEVLSKCFCIIAVKNNFQFRTTVSNLRSLEIKCLQKGCKWYVRASRYKKSELWMLQKYISNHDCSLNTTQSCHKQA
SSIIMSTDRSILKAIFHKARTNLGVNISYQKAWRAKEHIVKILKSDVVESYSLIASFFDKLVESNPGTCAALEMDDNGHFKFCFMAFGASIEGWKYCRPIISVDGTFFKC
KFGGILLTVSSQDGNNQIFPLAFVIIDSENDASWTWFVRRYEVVFHSKII
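
The protein sequence corresponds to a translation of the CDS. The following minimal Python sketch shows how such a translation could be checked, assuming the standard genetic code. The translate helper is hypothetical, and the sketch only handles unambiguous A/C/G/T codons.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS up to, but not including, the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # KeyError on ambiguous bases such as N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First nine codons of the CDS listed above.
print(translate("ATGTTGTTAAATGTAGGTTCTTCTTCT"))
```

The first nine codons give MLLNVGSSS, matching the start of the protein sequence listed above.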