; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g14680 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g14680
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionprotein FAR1-RELATED SEQUENCE 4-like
Genome locationchr1:9130516..9137094
RNA-Seq ExpressionMoc01g14680
SyntenyMoc01g14680
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR006564 - Zinc finger, PMZ-type
IPR007527 - Zinc finger, SWIM-type
IPR008998 - Agglutinin domain
IPR036242 - Agglutinin domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022131652.1 protein FAR1-RELATED SEQUENCE 4-like [Momordica charantia]9.3e-11949.28Show/hide
Query:  MGEGDDEGEYGNEYASDRLDVQHEHEKVTIDNTMSEYPVDAIHEMAGNRVTGQSEGDRLQAMVQSAGTDDVKEGDVFDSKKEL-----------------
        MGEGDDE EYGNEYASDRLDVQHEHEKVTI NTM+EYPVD +HEMA NRVTGQSEGDRLQAMVQSA TDDVKE DVFDSKKEL                 
Subjt:  MGEGDDEGEYGNEYASDRLDVQHEHEKVTIDNTMSEYPVDAIHEMAGNRVTGQSEGDRLQAMVQSAGTDDVKEGDVFDSKKEL-----------------

Query:  -------------------------------------------------------AKSWVVGHLVQSKFTDVSRTYRPKDIMQDIREEYGVNMSYDKAWR
                                                               AKSWVVGHLVQSKFTDVSRTYRPKDIMQDIREEYGVNMSYDKAWR
Subjt:  -------------------------------------------------------AKSWVVGHLVQSKFTDVSRTYRPKDIMQDIREEYGVNMSYDKAWR

Query:  SSEEALRLIRGDPASSYGLLPAYGEA--------------------------------------------------------------------------
        SSEEALRLIRGDPASSY LLPAYGEA                                                                          
Subjt:  SSEEALRLIRGDPASSYGLLPAYGEA--------------------------------------------------------------------------

Query:  -----------------------------------------------------------MNLLAKFKTPALEALFFKAAKAFHESYFNENWVQLCAHPG-
                                                                   MNLLAKFKT ALEALFFKAAKAF ESYFNENWVQLCAHPG 
Subjt:  -----------------------------------------------------------MNLLAKFKTPALEALFFKAAKAFHESYFNENWVQLCAHPG-

Query:  ---------------------------------------------------------RWFYERRTLASSRQSTLSDYTKEMIADGSDNARRHIVMNIDQF
                                                                 RWFYE RTLASSRQSTLSDY +EMIA+  DNARRHIVMNIDQF
Subjt:  ---------------------------------------------------------RWFYERRTLASSRQSTLSDYTKEMIADGSDNARRHIVMNIDQF

Query:  NFEVRDGNLNGDVDFQSQTCTCREFDYFKVSCSHAIAAASSRSINPYTLCDEAYTVNS
        NFEV DGNLNGDVD QSQTCTCREFDYFKV CSHAIAAASSRSINPYTLCDEAYTVNS
Subjt:  NFEVRDGNLNGDVDFQSQTCTCREFDYFKVSCSHAIAAASSRSINPYTLCDEAYTVNS

XP_022153146.1 uncharacterized protein LOC111020715 [Momordica charantia]4.5e-8938.14Show/hide
Query:  EGDDEGEYGNEYASDRLDVQHEHEKVTIDNTMSEYPVD--AIHEMAGNRVTGQSEGDRLQAMVQSAGTDDVKEGDVFDSKKEL-----------------
        EGD E E+ N+   D LD + E +   ++   +E   D  A+ +M  + +TGQ   + LQ +VQS+GT+DVKEG+VFD+KKEL                 
Subjt:  EGDDEGEYGNEYASDRLDVQHEHEKVTIDNTMSEYPVD--AIHEMAGNRVTGQSEGDRLQAMVQSAGTDDVKEGDVFDSKKEL-----------------

Query:  ------------------------------------------------------AKSWVVGHLVQSKFTDVSRTYRPKDIMQDIREEYGVNMSYDKAWRS
                                                              AKSWVVGHLVQ+KFTDVSRTYRPKDI+QD+R+EYGVN+SYDKAWRS
Subjt:  ------------------------------------------------------AKSWVVGHLVQSKFTDVSRTYRPKDIMQDIREEYGVNMSYDKAWRS

Query:  SEEALRLIRGDPASSYGLLPAYGEA---------------------------------------------------------------------------
        SEEALRLIRGDPASSYGLLP YGEA                                                                           
Subjt:  SEEALRLIRGDPASSYGLLPAYGEA---------------------------------------------------------------------------

Query:  ----------------------------------------------------------MNLLAKFK--TPALEALFFKAAKAFHESYFNENWVQLCAHPG
                                                                  MNLLAKFK    ALE LF KAAKA+ ESYFN  W QL A+PG
Subjt:  ----------------------------------------------------------MNLLAKFK--TPALEALFFKAAKAFHESYFNENWVQLCAHPG

Query:  ----------------------------------------------------------RWFYERRTLASSRQSTLSDYTKEMIADGSDNARRHIVMNIDQ
                                                                   WFY+RRTLASSR +TLS Y +  +A+ SDNARRH+V+NIDQ
Subjt:  ----------------------------------------------------------RWFYERRTLASSRQSTLSDYTKEMIADGSDNARRHIVMNIDQ

Query:  FNFEVRDGNLNGDVDFQSQTCTCREFDYFKVSCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRR
        F+ +VRDGNL+G VDF S+TC CREFDYFK+ CSHAIA A  R+INPYTLCDEAYT NSW++AYAEPIFP+G  STW SSP FV+  V+ P  V RVGRR
Subjt:  FNFEVRDGNLNGDVDFQSQTCTCREFDYFKVSCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRR

Query:  QTV
        +TV
Subjt:  QTV

XP_022154934.1 uncharacterized protein LOC111022081 [Momordica charantia]2.9e-128100Show/hide
Query:  MSLQNTFALQSKYNDKYLCYIDNEDPQLHGFLKFSEDEFSTHSKFHKEPASTADEFINIRSVYSNKYWVSRYENDEHWIVAEANEPNEDQSSSSCTLFQA
        MSLQNTFALQSKYNDKYLCYIDNEDPQLHGFLKFSEDEFSTHSKFHKEPASTADEFINIRSVYSNKYWVSRYENDEHWIVAEANEPNEDQSSSSCTLFQA
Subjt:  MSLQNTFALQSKYNDKYLCYIDNEDPQLHGFLKFSEDEFSTHSKFHKEPASTADEFINIRSVYSNKYWVSRYENDEHWIVAEANEPNEDQSSSSCTLFQA

Query:  IPVDDGSAIRLLHVHLNKYVCLSKDSSQFPLCIFAGSVDPDPDRIDVFVVKNIIPPKQSGPSRLAFKGDNEKYLKVEVIDGKPHLKFSAKDTDEAEVWFK
        IPVDDGSAIRLLHVHLNKYVCLSKDSSQFPLCIFAGSVDPDPDRIDVFVVKNIIPPKQSGPSRLAFKGDNEKYLKVEVIDGKPHLKFSAKDTDEAEVWFK
Subjt:  IPVDDGSAIRLLHVHLNKYVCLSKDSSQFPLCIFAGSVDPDPDRIDVFVVKNIIPPKQSGPSRLAFKGDNEKYLKVEVIDGKPHLKFSAKDTDEAEVWFK

Query:  CPNIAKKNDTSGHIQSNFNNEFWR
        CPNIAKKNDTSGHIQSNFNNEFWR
Subjt:  CPNIAKKNDTSGHIQSNFNNEFWR

XP_022156122.1 uncharacterized protein LOC111023087 [Momordica charantia]9.4e-8751.77Show/hide
Query:  MQDIREEYGVNMSYDKAWRSSEEALRLIRGDPASSYGLLPAYGEA-------------------------------------------------------
        MQDIREEYGVNMSYDKAWRSSEEALRLIR DPASSYGLLPAYGEA                                                       
Subjt:  MQDIREEYGVNMSYDKAWRSSEEALRLIRGDPASSYGLLPAYGEA-------------------------------------------------------

Query:  ------------------------------------------------------------------------------MNLLAKFKTPALEALFFKAAKA
                                                                                      M+LLAKFKTPALE LFFKAAKA
Subjt:  ------------------------------------------------------------------------------MNLLAKFKTPALEALFFKAAKA

Query:  FHESYFNENWVQLCAHPG-----------RWF--------YERRT--LASSRQ------------STLSDYTKEMIADGSDNARRHIVMNIDQFNFEVRD
        F ESYFNENWVQLCA+PG           RW         Y + T  +A S              + L    +EMIA  SDNARRHIVMNIDQFNFEVRD
Subjt:  FHESYFNENWVQLCAHPG-----------RWF--------YERRT--LASSRQ------------STLSDYTKEMIADGSDNARRHIVMNIDQFNFEVRD

Query:  GNLNGDVDFQSQTCTCREFDYFKVSCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRRQTV
        GNLNGDVD QSQTCTCREFDYFKVSCS AIAAASSRSINPYTLCDE YTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRRQTV
Subjt:  GNLNGDVDFQSQTCTCREFDYFKVSCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRRQTV

XP_022158655.1 uncharacterized protein LOC111025117 [Momordica charantia]1.1e-8775.22Show/hide
Query:  MNLLAKFKTPALEALFFKAAKAFHESYFNENWVQLCAHPG-----------------------------------------RWFYERRTLASSRQSTLSD
        MNLLAKFKTPALE LFFKAAKAFHE YFNENWVQLCAHPG                                         RWFYER+TLASSRQSTLSD
Subjt:  MNLLAKFKTPALEALFFKAAKAFHESYFNENWVQLCAHPG-----------------------------------------RWFYERRTLASSRQSTLSD

Query:  YTKEMIADGSDNARRHIVMNIDQFNFEVRDGNLNGDVDFQSQTCTCREFDYFKVSCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTW
        Y +EMIA+ +DN+RRHIVMNIDQFNFEVRDGNLNGDVD QSQTCTCREFDYFKV CSHAIAAA+SRSINPYTLCDEAYTVNSWMLA+AEPIF VGSS+TW
Subjt:  YTKEMIADGSDNARRHIVMNIDQFNFEVRDGNLNGDVDFQSQTCTCREFDYFKVSCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTW

Query:  KSSPGFVNIDVQPPKKVVRVGRRQTV
        KSSPGFVNIDVQPPKKVVRVGRRQTV
Subjt:  KSSPGFVNIDVQPPKKVVRVGRRQTV

TrEMBL top hitse value%identityAlignment
A0A6J1BRM2 protein FAR1-RELATED SEQUENCE 4-like4.5e-11949.28Show/hide
Query:  MGEGDDEGEYGNEYASDRLDVQHEHEKVTIDNTMSEYPVDAIHEMAGNRVTGQSEGDRLQAMVQSAGTDDVKEGDVFDSKKEL-----------------
        MGEGDDE EYGNEYASDRLDVQHEHEKVTI NTM+EYPVD +HEMA NRVTGQSEGDRLQAMVQSA TDDVKE DVFDSKKEL                 
Subjt:  MGEGDDEGEYGNEYASDRLDVQHEHEKVTIDNTMSEYPVDAIHEMAGNRVTGQSEGDRLQAMVQSAGTDDVKEGDVFDSKKEL-----------------

Query:  -------------------------------------------------------AKSWVVGHLVQSKFTDVSRTYRPKDIMQDIREEYGVNMSYDKAWR
                                                               AKSWVVGHLVQSKFTDVSRTYRPKDIMQDIREEYGVNMSYDKAWR
Subjt:  -------------------------------------------------------AKSWVVGHLVQSKFTDVSRTYRPKDIMQDIREEYGVNMSYDKAWR

Query:  SSEEALRLIRGDPASSYGLLPAYGEA--------------------------------------------------------------------------
        SSEEALRLIRGDPASSY LLPAYGEA                                                                          
Subjt:  SSEEALRLIRGDPASSYGLLPAYGEA--------------------------------------------------------------------------

Query:  -----------------------------------------------------------MNLLAKFKTPALEALFFKAAKAFHESYFNENWVQLCAHPG-
                                                                   MNLLAKFKT ALEALFFKAAKAF ESYFNENWVQLCAHPG 
Subjt:  -----------------------------------------------------------MNLLAKFKTPALEALFFKAAKAFHESYFNENWVQLCAHPG-

Query:  ---------------------------------------------------------RWFYERRTLASSRQSTLSDYTKEMIADGSDNARRHIVMNIDQF
                                                                 RWFYE RTLASSRQSTLSDY +EMIA+  DNARRHIVMNIDQF
Subjt:  ---------------------------------------------------------RWFYERRTLASSRQSTLSDYTKEMIADGSDNARRHIVMNIDQF

Query:  NFEVRDGNLNGDVDFQSQTCTCREFDYFKVSCSHAIAAASSRSINPYTLCDEAYTVNS
        NFEV DGNLNGDVD QSQTCTCREFDYFKV CSHAIAAASSRSINPYTLCDEAYTVNS
Subjt:  NFEVRDGNLNGDVDFQSQTCTCREFDYFKVSCSHAIAAASSRSINPYTLCDEAYTVNS

A0A6J1DJT1 uncharacterized protein LOC1110207152.2e-8938.14Show/hide
Query:  EGDDEGEYGNEYASDRLDVQHEHEKVTIDNTMSEYPVD--AIHEMAGNRVTGQSEGDRLQAMVQSAGTDDVKEGDVFDSKKEL-----------------
        EGD E E+ N+   D LD + E +   ++   +E   D  A+ +M  + +TGQ   + LQ +VQS+GT+DVKEG+VFD+KKEL                 
Subjt:  EGDDEGEYGNEYASDRLDVQHEHEKVTIDNTMSEYPVD--AIHEMAGNRVTGQSEGDRLQAMVQSAGTDDVKEGDVFDSKKEL-----------------

Query:  ------------------------------------------------------AKSWVVGHLVQSKFTDVSRTYRPKDIMQDIREEYGVNMSYDKAWRS
                                                              AKSWVVGHLVQ+KFTDVSRTYRPKDI+QD+R+EYGVN+SYDKAWRS
Subjt:  ------------------------------------------------------AKSWVVGHLVQSKFTDVSRTYRPKDIMQDIREEYGVNMSYDKAWRS

Query:  SEEALRLIRGDPASSYGLLPAYGEA---------------------------------------------------------------------------
        SEEALRLIRGDPASSYGLLP YGEA                                                                           
Subjt:  SEEALRLIRGDPASSYGLLPAYGEA---------------------------------------------------------------------------

Query:  ----------------------------------------------------------MNLLAKFK--TPALEALFFKAAKAFHESYFNENWVQLCAHPG
                                                                  MNLLAKFK    ALE LF KAAKA+ ESYFN  W QL A+PG
Subjt:  ----------------------------------------------------------MNLLAKFK--TPALEALFFKAAKAFHESYFNENWVQLCAHPG

Query:  ----------------------------------------------------------RWFYERRTLASSRQSTLSDYTKEMIADGSDNARRHIVMNIDQ
                                                                   WFY+RRTLASSR +TLS Y +  +A+ SDNARRH+V+NIDQ
Subjt:  ----------------------------------------------------------RWFYERRTLASSRQSTLSDYTKEMIADGSDNARRHIVMNIDQ

Query:  FNFEVRDGNLNGDVDFQSQTCTCREFDYFKVSCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRR
        F+ +VRDGNL+G VDF S+TC CREFDYFK+ CSHAIA A  R+INPYTLCDEAYT NSW++AYAEPIFP+G  STW SSP FV+  V+ P  V RVGRR
Subjt:  FNFEVRDGNLNGDVDFQSQTCTCREFDYFKVSCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRR

Query:  QTV
        +TV
Subjt:  QTV

A0A6J1DNR0 uncharacterized protein LOC1110220811.4e-128100Show/hide
Query:  MSLQNTFALQSKYNDKYLCYIDNEDPQLHGFLKFSEDEFSTHSKFHKEPASTADEFINIRSVYSNKYWVSRYENDEHWIVAEANEPNEDQSSSSCTLFQA
        MSLQNTFALQSKYNDKYLCYIDNEDPQLHGFLKFSEDEFSTHSKFHKEPASTADEFINIRSVYSNKYWVSRYENDEHWIVAEANEPNEDQSSSSCTLFQA
Subjt:  MSLQNTFALQSKYNDKYLCYIDNEDPQLHGFLKFSEDEFSTHSKFHKEPASTADEFINIRSVYSNKYWVSRYENDEHWIVAEANEPNEDQSSSSCTLFQA

Query:  IPVDDGSAIRLLHVHLNKYVCLSKDSSQFPLCIFAGSVDPDPDRIDVFVVKNIIPPKQSGPSRLAFKGDNEKYLKVEVIDGKPHLKFSAKDTDEAEVWFK
        IPVDDGSAIRLLHVHLNKYVCLSKDSSQFPLCIFAGSVDPDPDRIDVFVVKNIIPPKQSGPSRLAFKGDNEKYLKVEVIDGKPHLKFSAKDTDEAEVWFK
Subjt:  IPVDDGSAIRLLHVHLNKYVCLSKDSSQFPLCIFAGSVDPDPDRIDVFVVKNIIPPKQSGPSRLAFKGDNEKYLKVEVIDGKPHLKFSAKDTDEAEVWFK

Query:  CPNIAKKNDTSGHIQSNFNNEFWR
        CPNIAKKNDTSGHIQSNFNNEFWR
Subjt:  CPNIAKKNDTSGHIQSNFNNEFWR

A0A6J1DR67 uncharacterized protein LOC1110230874.6e-8751.77Show/hide
Query:  MQDIREEYGVNMSYDKAWRSSEEALRLIRGDPASSYGLLPAYGEA-------------------------------------------------------
        MQDIREEYGVNMSYDKAWRSSEEALRLIR DPASSYGLLPAYGEA                                                       
Subjt:  MQDIREEYGVNMSYDKAWRSSEEALRLIRGDPASSYGLLPAYGEA-------------------------------------------------------

Query:  ------------------------------------------------------------------------------MNLLAKFKTPALEALFFKAAKA
                                                                                      M+LLAKFKTPALE LFFKAAKA
Subjt:  ------------------------------------------------------------------------------MNLLAKFKTPALEALFFKAAKA

Query:  FHESYFNENWVQLCAHPG-----------RWF--------YERRT--LASSRQ------------STLSDYTKEMIADGSDNARRHIVMNIDQFNFEVRD
        F ESYFNENWVQLCA+PG           RW         Y + T  +A S              + L    +EMIA  SDNARRHIVMNIDQFNFEVRD
Subjt:  FHESYFNENWVQLCAHPG-----------RWF--------YERRT--LASSRQ------------STLSDYTKEMIADGSDNARRHIVMNIDQFNFEVRD

Query:  GNLNGDVDFQSQTCTCREFDYFKVSCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRRQTV
        GNLNGDVD QSQTCTCREFDYFKVSCS AIAAASSRSINPYTLCDE YTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRRQTV
Subjt:  GNLNGDVDFQSQTCTCREFDYFKVSCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRRQTV

A0A6J1DWF8 uncharacterized protein LOC1110251175.4e-8875.22Show/hide
Query:  MNLLAKFKTPALEALFFKAAKAFHESYFNENWVQLCAHPG-----------------------------------------RWFYERRTLASSRQSTLSD
        MNLLAKFKTPALE LFFKAAKAFHE YFNENWVQLCAHPG                                         RWFYER+TLASSRQSTLSD
Subjt:  MNLLAKFKTPALEALFFKAAKAFHESYFNENWVQLCAHPG-----------------------------------------RWFYERRTLASSRQSTLSD

Query:  YTKEMIADGSDNARRHIVMNIDQFNFEVRDGNLNGDVDFQSQTCTCREFDYFKVSCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTW
        Y +EMIA+ +DN+RRHIVMNIDQFNFEVRDGNLNGDVD QSQTCTCREFDYFKV CSHAIAAA+SRSINPYTLCDEAYTVNSWMLA+AEPIF VGSS+TW
Subjt:  YTKEMIADGSDNARRHIVMNIDQFNFEVRDGNLNGDVDFQSQTCTCREFDYFKVSCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTW

Query:  KSSPGFVNIDVQPPKKVVRVGRRQTV
        KSSPGFVNIDVQPPKKVVRVGRRQTV
Subjt:  KSSPGFVNIDVQPPKKVVRVGRRQTV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G64255.1 MuDR family transposase5.6e-0529.35Show/hide
Query:  HIVMNIDQFNFEVRDGNLNGD--VDFQSQTCTCREFDYFKVSCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPG
        +IV  +D   F+V      G+  V     +CTC +F  +K  C HA+A       NP    D+ YT+      YA     V   S W  + G
Subjt:  HIVMNIDQFNFEVRDGNLNGD--VDFQSQTCTCREFDYFKVSCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPG

AT1G64260.1 MuDR family transposase7.1e-0831.03Show/hide
Query:  HIVMNIDQFNFEVRDGNLNGD--VDFQSQTCTCREFDYFKVSCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTW
        +++  +++ +F+V + +   +  V     TCTCR+F  +K  C HA+A      INP    DE YTV  +   YA    PV   + W
Subjt:  HIVMNIDQFNFEVRDGNLNGD--VDFQSQTCTCREFDYFKVSCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCATTGCAAAATACTTTCGCCCTTCAGTCCAAGTACAACGACAAATACTTGTGCTACATCGACAATGAAGACCCACAACTCCATGGCTTCCTCAAATTCTCCGAAGA
TGAGTTCTCAACTCATTCAAAATTCCACAAAGAGCCAGCCAGCACAGCTGATGAGTTCATCAACATAAGATCGGTCTACAGCAACAAGTATTGGGTGAGCCGATACGAGA
ACGACGAGCACTGGATCGTGGCCGAAGCCAACGAGCCAAATGAAGACCAATCGAGCTCGTCGTGCACACTCTTCCAAGCCATCCCTGTGGACGATGGGAGCGCCATAAGA
CTTCTCCATGTGCACCTCAACAAGTACGTTTGTTTATCCAAGGATTCCTCCCAATTCCCTCTTTGCATCTTTGCAGGATCCGTCGATCCCGATCCCGATCGCATCGACGT
GTTCGTGGTCAAGAATATCATCCCCCCGAAACAATCAGGGCCATCGAGGCTTGCATTCAAGGGCGACAATGAAAAATACCTCAAGGTGGAAGTGATTGATGGAAAGCCAC
ATCTGAAGTTCTCAGCTAAAGACACTGATGAAGCAGAAGTGTGGTTCAAATGCCCCAACATCGCCAAGAAGAATGATACAAGTGGTCATATTCAATCAAATTTTAACAAT
GAATTTTGGAGGTGTGACCACAATTGGGTTAATGCAGAGGCAGAAAATGCTACTGCTGATCCCAAAACTTTATTTCATTTTCATTGGGAAGATTGTTCTAATAACATAGT
TACCCTTCGTGTAGGCAACATCTATTGCCAGCGATTTGTAGATCATGAAGGTAAGTCTAGTAAGATGGATTGCCTCATTGCAAATACTGTTGAGAATACCCCACAAGCAA
AGCTACACGTGGAGGTTGCAAAAGTTCCAATTGAAGCGGCCACAAATATCAAGGTAAGGATGCCTCGTGTTTTTATATCATTCAGTGGAGAATGGAAAGATATTGAAAAG
GATTACGTGGGTGGTCGTACAAGAGGATTGACTGTGGATAGAAAAATCACCTATGCTGAATTTCTAGGACATGTATGTAGGCTAAGTAGTATAAATCCATTACAAGAAGA
TATCATAATTAGACGTGTATATAATTTTAAGGCGAATGTTTGTGTAATGGAAATAACTGACGACGATGACCTGACTTTCTTCTTGACTGGTGAAAATGTCTCTGAATTGC
CGCTATACATATCTACCGTGCCAAAGAAGACCCTCGTTTTCAAGTCCGTCAGTTCCGTCCTCGTCGTCGAACCCCTCTTCTTCCCGCCCACCACCCCCTACTTTGGTCAT
ATTGGTCATGATATAGCATCTCTCACACCGTTAGGGTCAGATGTTGTTCCTTGTAATTTGGGAGATGATAGGGCATATGATTGGGATGTGACTGACTTGTGGAATGGAAG
TGAAAATGTGGATGAAGATAGTGATGAAGCATATCGTCCAATGACCGACATGGGAGAAGGAGACGACGAAGGGGAATATGGAAATGAGTACGCCAGTGATCGACTTGATG
TGCAACATGAGCATGAAAAGGTAACAATTGATAATACAATGTCTGAATATCCTGTAGACGCCATCCATGAAATGGCAGGGAATAGAGTCACCGGTCAGTCAGAAGGTGAT
AGATTGCAAGCCATGGTCCAATCGGCTGGGACCGATGATGTTAAGGAGGGTGACGTATTCGACTCGAAGAAGGAACTAGCGAAGAGTTGGGTGGTCGGTCATCTAGTACA
ATCAAAGTTTACTGATGTTTCTCGCACGTACAGGCCGAAGGACATCATGCAAGATATTCGTGAGGAGTATGGTGTAAATATGAGTTACGACAAGGCCTGGCGTTCGAGCG
AAGAAGCACTCCGACTTATCAGAGGGGATCCAGCTTCATCGTACGGGCTACTACCCGCTTATGGGGAAGCCATGAACTTGCTGGCCAAATTTAAAACGCCCGCGTTGGAG
GCATTATTTTTTAAGGCTGCGAAGGCATTTCACGAGTCATATTTCAATGAGAACTGGGTCCAACTGTGCGCACACCCAGGAAGGTGGTTCTACGAACGTCGGACGCTTGC
TTCTTCACGTCAGAGTACGTTGTCTGACTACACAAAGGAAATGATTGCCGATGGTTCGGATAATGCACGGAGACACATTGTTATGAACATCGACCAGTTTAATTTTGAGG
TACGCGACGGGAACCTCAATGGGGACGTTGACTTCCAATCGCAGACGTGTACTTGTCGAGAGTTCGATTATTTTAAAGTCTCGTGCTCCCATGCTATTGCTGCAGCCAGT
TCTCGTAGCATAAATCCGTACACACTATGCGATGAGGCGTACACGGTCAATAGCTGGATGTTGGCATATGCAGAACCAATATTTCCAGTGGGTTCATCCTCAACATGGAA
GAGTTCTCCGGGGTTTGTGAATATTGATGTTCAACCACCGAAGAAGGTCGTTAGGGTTGGACGGCGACAGACGGTAGCTTGTACCGCAATGACTACAGCTAAAAATAAGT
ATGCTAAAACAAATCATGATAGTTACATCGTACTTGAAAAATCTAACACAAAATATGCCGCAATCTGTGAACCCGGCTTGCTGAGGTACAGTACGTCTTTGGACCCTCCA
CGGCACCGTGGGCAGGTTAGGCCGAAGTGCGAGCATCCCGCTCCAATGAAGAATCGCCGGGATTATCGTGCACATTGGCTTGAGCGCCTTCTCGAGATCATCCAACGGGG
TGATCGACTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCATTGCAAAATACTTTCGCCCTTCAGTCCAAGTACAACGACAAATACTTGTGCTACATCGACAATGAAGACCCACAACTCCATGGCTTCCTCAAATTCTCCGAAGA
TGAGTTCTCAACTCATTCAAAATTCCACAAAGAGCCAGCCAGCACAGCTGATGAGTTCATCAACATAAGATCGGTCTACAGCAACAAGTATTGGGTGAGCCGATACGAGA
ACGACGAGCACTGGATCGTGGCCGAAGCCAACGAGCCAAATGAAGACCAATCGAGCTCGTCGTGCACACTCTTCCAAGCCATCCCTGTGGACGATGGGAGCGCCATAAGA
CTTCTCCATGTGCACCTCAACAAGTACGTTTGTTTATCCAAGGATTCCTCCCAATTCCCTCTTTGCATCTTTGCAGGATCCGTCGATCCCGATCCCGATCGCATCGACGT
GTTCGTGGTCAAGAATATCATCCCCCCGAAACAATCAGGGCCATCGAGGCTTGCATTCAAGGGCGACAATGAAAAATACCTCAAGGTGGAAGTGATTGATGGAAAGCCAC
ATCTGAAGTTCTCAGCTAAAGACACTGATGAAGCAGAAGTGTGGTTCAAATGCCCCAACATCGCCAAGAAGAATGATACAAGTGGTCATATTCAATCAAATTTTAACAAT
GAATTTTGGAGGTGTGACCACAATTGGGTTAATGCAGAGGCAGAAAATGCTACTGCTGATCCCAAAACTTTATTTCATTTTCATTGGGAAGATTGTTCTAATAACATAGT
TACCCTTCGTGTAGGCAACATCTATTGCCAGCGATTTGTAGATCATGAAGGTAAGTCTAGTAAGATGGATTGCCTCATTGCAAATACTGTTGAGAATACCCCACAAGCAA
AGCTACACGTGGAGGTTGCAAAAGTTCCAATTGAAGCGGCCACAAATATCAAGGTAAGGATGCCTCGTGTTTTTATATCATTCAGTGGAGAATGGAAAGATATTGAAAAG
GATTACGTGGGTGGTCGTACAAGAGGATTGACTGTGGATAGAAAAATCACCTATGCTGAATTTCTAGGACATGTATGTAGGCTAAGTAGTATAAATCCATTACAAGAAGA
TATCATAATTAGACGTGTATATAATTTTAAGGCGAATGTTTGTGTAATGGAAATAACTGACGACGATGACCTGACTTTCTTCTTGACTGGTGAAAATGTCTCTGAATTGC
CGCTATACATATCTACCGTGCCAAAGAAGACCCTCGTTTTCAAGTCCGTCAGTTCCGTCCTCGTCGTCGAACCCCTCTTCTTCCCGCCCACCACCCCCTACTTTGGTCAT
ATTGGTCATGATATAGCATCTCTCACACCGTTAGGGTCAGATGTTGTTCCTTGTAATTTGGGAGATGATAGGGCATATGATTGGGATGTGACTGACTTGTGGAATGGAAG
TGAAAATGTGGATGAAGATAGTGATGAAGCATATCGTCCAATGACCGACATGGGAGAAGGAGACGACGAAGGGGAATATGGAAATGAGTACGCCAGTGATCGACTTGATG
TGCAACATGAGCATGAAAAGGTAACAATTGATAATACAATGTCTGAATATCCTGTAGACGCCATCCATGAAATGGCAGGGAATAGAGTCACCGGTCAGTCAGAAGGTGAT
AGATTGCAAGCCATGGTCCAATCGGCTGGGACCGATGATGTTAAGGAGGGTGACGTATTCGACTCGAAGAAGGAACTAGCGAAGAGTTGGGTGGTCGGTCATCTAGTACA
ATCAAAGTTTACTGATGTTTCTCGCACGTACAGGCCGAAGGACATCATGCAAGATATTCGTGAGGAGTATGGTGTAAATATGAGTTACGACAAGGCCTGGCGTTCGAGCG
AAGAAGCACTCCGACTTATCAGAGGGGATCCAGCTTCATCGTACGGGCTACTACCCGCTTATGGGGAAGCCATGAACTTGCTGGCCAAATTTAAAACGCCCGCGTTGGAG
GCATTATTTTTTAAGGCTGCGAAGGCATTTCACGAGTCATATTTCAATGAGAACTGGGTCCAACTGTGCGCACACCCAGGAAGGTGGTTCTACGAACGTCGGACGCTTGC
TTCTTCACGTCAGAGTACGTTGTCTGACTACACAAAGGAAATGATTGCCGATGGTTCGGATAATGCACGGAGACACATTGTTATGAACATCGACCAGTTTAATTTTGAGG
TACGCGACGGGAACCTCAATGGGGACGTTGACTTCCAATCGCAGACGTGTACTTGTCGAGAGTTCGATTATTTTAAAGTCTCGTGCTCCCATGCTATTGCTGCAGCCAGT
TCTCGTAGCATAAATCCGTACACACTATGCGATGAGGCGTACACGGTCAATAGCTGGATGTTGGCATATGCAGAACCAATATTTCCAGTGGGTTCATCCTCAACATGGAA
GAGTTCTCCGGGGTTTGTGAATATTGATGTTCAACCACCGAAGAAGGTCGTTAGGGTTGGACGGCGACAGACGGTAGCTTGTACCGCAATGACTACAGCTAAAAATAAGT
ATGCTAAAACAAATCATGATAGTTACATCGTACTTGAAAAATCTAACACAAAATATGCCGCAATCTGTGAACCCGGCTTGCTGAGGTACAGTACGTCTTTGGACCCTCCA
CGGCACCGTGGGCAGGTTAGGCCGAAGTGCGAGCATCCCGCTCCAATGAAGAATCGCCGGGATTATCGTGCACATTGGCTTGAGCGCCTTCTCGAGATCATCCAACGGGG
TGATCGACTGTAG
Protein sequenceShow/hide protein sequence
MSLQNTFALQSKYNDKYLCYIDNEDPQLHGFLKFSEDEFSTHSKFHKEPASTADEFINIRSVYSNKYWVSRYENDEHWIVAEANEPNEDQSSSSCTLFQAIPVDDGSAIR
LLHVHLNKYVCLSKDSSQFPLCIFAGSVDPDPDRIDVFVVKNIIPPKQSGPSRLAFKGDNEKYLKVEVIDGKPHLKFSAKDTDEAEVWFKCPNIAKKNDTSGHIQSNFNN
EFWRCDHNWVNAEAENATADPKTLFHFHWEDCSNNIVTLRVGNIYCQRFVDHEGKSSKMDCLIANTVENTPQAKLHVEVAKVPIEAATNIKVRMPRVFISFSGEWKDIEK
DYVGGRTRGLTVDRKITYAEFLGHVCRLSSINPLQEDIIIRRVYNFKANVCVMEITDDDDLTFFLTGENVSELPLYISTVPKKTLVFKSVSSVLVVEPLFFPPTTPYFGH
IGHDIASLTPLGSDVVPCNLGDDRAYDWDVTDLWNGSENVDEDSDEAYRPMTDMGEGDDEGEYGNEYASDRLDVQHEHEKVTIDNTMSEYPVDAIHEMAGNRVTGQSEGD
RLQAMVQSAGTDDVKEGDVFDSKKELAKSWVVGHLVQSKFTDVSRTYRPKDIMQDIREEYGVNMSYDKAWRSSEEALRLIRGDPASSYGLLPAYGEAMNLLAKFKTPALE
ALFFKAAKAFHESYFNENWVQLCAHPGRWFYERRTLASSRQSTLSDYTKEMIADGSDNARRHIVMNIDQFNFEVRDGNLNGDVDFQSQTCTCREFDYFKVSCSHAIAAAS
SRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRRQTVACTAMTTAKNKYAKTNHDSYIVLEKSNTKYAAICEPGLLRYSTSLDPP
RHRGQVRPKCEHPAPMKNRRDYRAHWLERLLEIIQRGDRL