CuGenDBv2

CcUC02G042280 (gene) of Watermelon (PI 537277) v1 genome

Gene ID: CcUC02G042280
Organism: Citrullus colocynthis (Watermelon (PI 537277) v1)
Description: Unknown protein
Genome location: CicolChr02:37258646..37278396
RNA-Seq Expression: CcUC02G042280
Synteny: CcUC02G042280
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
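The genome location above uses a `sequence:start..end` layout. As a minimal sketch, the string can be split into its parts as follows. The function name `parse_location` is hypothetical, and the span length assumes 1-based inclusive coordinates, which this record does not state.

```python
import re

def parse_location(loc: str):
    """Split 'seqid:start..end' into (seqid, start, end, span).

    Assumption: coordinates are 1-based and inclusive, so the span is
    end - start + 1. The record itself does not confirm this convention.
    """
    m = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    seqid, start, end = m.group(1), int(m.group(2)), int(m.group(3))
    return seqid, start, end, end - start + 1

seqid, start, end, span = parse_location("CicolChr02:37258646..37278396")
print(seqid, start, end, span)
```

Under that assumption the gene region spans 19751 bp on CicolChr02.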


Homology
GenBank top hits | e value | % identity | Alignment
XP_004149633.1 uncharacterized protein LOC101215314 isoform X1 [Cucumis sativus] | 3.0e-122 | 95.9
Query:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDIFWFILFAYDTWNISS
        MLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLD+FWFILFAYDTWNISS
Subjt:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDIFWFILFAYDTWNISS

Query:  EQYGPLFTFSVKLTLAMQIIGFSVRLSSSLL------LGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSKC
        +QYGPLFTFSVKLTLAMQIIGFSVRLSSSLL      LGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSKC
Subjt:  EQYGPLFTFSVKLTLAMQIIGFSVRLSSSLL------LGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSKC

Query:  LSGISHFGNGDNGSTSGPDVSRSKLSRHFQVADDEHADGNQQTV
        LSGISHFGNGDNGSTSGPDVSRSKLSRHFQVADDE+ADG+QQTV
Subjt:  LSGISHFGNGDNGSTSGPDVSRSKLSRHFQVADDEHADGNQQTV

XP_008449916.1 PREDICTED: uncharacterized protein LOC103491645 isoform X4 [Cucumis melo] | 3.3e-121 | 95.9
Query:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDIFWFILFAYDTWNISS
        MLCNSLRDRLRP LRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLD+FWFILFAYDTWNISS
Subjt:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDIFWFILFAYDTWNISS

Query:  EQYGPLFTFSVKLTLAMQIIGFSVRLSSSLL------LGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSKC
        EQYGPLFTFSVKLTLAMQIIGFSVRLSSSLL      LGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSKC
Subjt:  EQYGPLFTFSVKLTLAMQIIGFSVRLSSSLL------LGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSKC

Query:  LSGISHFGNGDNGSTSGPDVSRSKLSRHFQVADDEHADGNQQTV
        LSGISHFGNGDNGSTSGPDVSRSKLSRHFQVADDE+ADG+QQTV
Subjt:  LSGISHFGNGDNGSTSGPDVSRSKLSRHFQVADDEHADGNQQTV

XP_022144718.1 uncharacterized protein LOC111014335 isoform X1 [Momordica charantia] | 9.9e-118 | 92.65
Query:  MMLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDIFWFILFAYDTWNIS
        MMLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLL SAIFLDIFWFILFAYDTWNIS
Subjt:  MMLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDIFWFILFAYDTWNIS

Query:  SEQYGPLFTFSVKLTLAMQIIGFSVRLSSSLL------LGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSK
        S+QYGPLF+FSVKLTLAMQIIGFS+RLSSSLL      LGISYMETSVPREADYDLRNSFLSP TPVV RQ SGSDDMIGGSIYDP YYSSLFEDGQDSK
Subjt:  SEQYGPLFTFSVKLTLAMQIIGFSVRLSSSLL------LGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSK

Query:  CLSGISHFGNGDNGSTSGPDVSRSKLSRHFQVADDEHADGNQQTV
        CLSGISHFGNGDNGSTSG DVSRSK+SRHFQV DDEHA GNQQTV
Subjt:  CLSGISHFGNGDNGSTSGPDVSRSKLSRHFQVADDEHADGNQQTV

XP_038901807.1 uncharacterized protein LOC120088513 isoform X1 [Benincasa hispida] | 3.4e-118 | 92.09
Query:  MMLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDIFWFILFAYDTWNIS
        MMLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLD+FWFILFAYDTWN S
Subjt:  MMLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDIFWFILFAYDTWNIS

Query:  SEQYGPLFTFSVKLTLAMQIIGFSVRLSSSLL------LGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSK
        SEQYG LFTFSVKLTLAMQIIGFSVRLSSSLL      LGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDP+YYSSLF+DGQDSK
Subjt:  SEQYGPLFTFSVKLTLAMQIIGFSVRLSSSLL------LGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSK

Query:  CLSG-------ISHFGNGDNGSTSGPDVSRSKLSRHFQVADDEHADGNQ-QTV
        CLSG       ISHFGNGDNGSTSGPDVSRSKLSRHFQVADDE+ADGNQ QTV
Subjt:  CLSG-------ISHFGNGDNGSTSGPDVSRSKLSRHFQVADDEHADGNQ-QTV

XP_038901808.1 uncharacterized protein LOC120088513 isoform X2 [Benincasa hispida] | 2.8e-120 | 94.72
Query:  MMLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDIFWFILFAYDTWNIS
        MMLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLD+FWFILFAYDTWN S
Subjt:  MMLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDIFWFILFAYDTWNIS

Query:  SEQYGPLFTFSVKLTLAMQIIGFSVRLSSSLL------LGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSK
        SEQYG LFTFSVKLTLAMQIIGFSVRLSSSLL      LGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDP+YYSSLF+DGQDSK
Subjt:  SEQYGPLFTFSVKLTLAMQIIGFSVRLSSSLL------LGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSK

Query:  CLSGISHFGNGDNGSTSGPDVSRSKLSRHFQVADDEHADGNQ-QTV
        CLSGISHFGNGDNGSTSGPDVSRSKLSRHFQVADDE+ADGNQ QTV
Subjt:  CLSGISHFGNGDNGSTSGPDVSRSKLSRHFQVADDEHADGNQ-QTV

TrEMBL top hits | e value | % identity | Alignment
A0A0A0KWY7 Uncharacterized protein | 1.4e-122 | 95.9
Query:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDIFWFILFAYDTWNISS
        MLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLD+FWFILFAYDTWNISS
Subjt:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDIFWFILFAYDTWNISS

Query:  EQYGPLFTFSVKLTLAMQIIGFSVRLSSSLL------LGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSKC
        +QYGPLFTFSVKLTLAMQIIGFSVRLSSSLL      LGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSKC
Subjt:  EQYGPLFTFSVKLTLAMQIIGFSVRLSSSLL------LGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSKC

Query:  LSGISHFGNGDNGSTSGPDVSRSKLSRHFQVADDEHADGNQQTV
        LSGISHFGNGDNGSTSGPDVSRSKLSRHFQVADDE+ADG+QQTV
Subjt:  LSGISHFGNGDNGSTSGPDVSRSKLSRHFQVADDEHADGNQQTV

A0A1S3BMI4 uncharacterized protein LOC103491645 isoform X4 | 1.6e-121 | 95.9
Query:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDIFWFILFAYDTWNISS
        MLCNSLRDRLRP LRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLD+FWFILFAYDTWNISS
Subjt:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDIFWFILFAYDTWNISS

Query:  EQYGPLFTFSVKLTLAMQIIGFSVRLSSSLL------LGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSKC
        EQYGPLFTFSVKLTLAMQIIGFSVRLSSSLL      LGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSKC
Subjt:  EQYGPLFTFSVKLTLAMQIIGFSVRLSSSLL------LGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSKC

Query:  LSGISHFGNGDNGSTSGPDVSRSKLSRHFQVADDEHADGNQQTV
        LSGISHFGNGDNGSTSGPDVSRSKLSRHFQVADDE+ADG+QQTV
Subjt:  LSGISHFGNGDNGSTSGPDVSRSKLSRHFQVADDEHADGNQQTV

A0A1S3BP27 uncharacterized protein LOC103491645 isoform X2 | 2.6e-116 | 93.42
Query:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDIFWFILFAYDTWNISS
        MLCNSLRDRLRP LRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLD+FWFILFAYDTWNISS
Subjt:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDIFWFILFAYDTWNISS

Query:  EQYGPLFTFSVKLTLAMQIIGFSVRLSSSLL------LGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSKC
        EQYGPLFTFSVKLTLAMQIIGFSVRLSSSLL      LGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSKC
Subjt:  EQYGPLFTFSVKLTLAMQIIGFSVRLSSSLL------LGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSKC

Query:  LSGISHFGNGDNGSTSGPDVSRSKLSRHFQVADDEHADGNQQT
        LSGISHFGNGDNGSTSGPDVSRSKLSRHFQVAD +   G   T
Subjt:  LSGISHFGNGDNGSTSGPDVSRSKLSRHFQVADDEHADGNQQT

A0A1S4DXR3 uncharacterized protein LOC103491645 isoform X1 | 3.4e-116 | 96.57
Query:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDIFWFILFAYDTWNISS
        MLCNSLRDRLRP LRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLD+FWFILFAYDTWNISS
Subjt:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDIFWFILFAYDTWNISS

Query:  EQYGPLFTFSVKLTLAMQIIGFSVRLSSSLL------LGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSKC
        EQYGPLFTFSVKLTLAMQIIGFSVRLSSSLL      LGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSKC
Subjt:  EQYGPLFTFSVKLTLAMQIIGFSVRLSSSLL------LGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSKC

Query:  LSGISHFGNGDNGSTSGPDVSRSKLSRHFQVAD
        LSGISHFGNGDNGSTSGPDVSRSKLSRHFQVAD
Subjt:  LSGISHFGNGDNGSTSGPDVSRSKLSRHFQVAD

A0A6J1CT50 uncharacterized protein LOC111014335 isoform X1 | 4.8e-118 | 92.65
Query:  MMLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDIFWFILFAYDTWNIS
        MMLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLL SAIFLDIFWFILFAYDTWNIS
Subjt:  MMLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDIFWFILFAYDTWNIS

Query:  SEQYGPLFTFSVKLTLAMQIIGFSVRLSSSLL------LGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSK
        S+QYGPLF+FSVKLTLAMQIIGFS+RLSSSLL      LGISYMETSVPREADYDLRNSFLSP TPVV RQ SGSDDMIGGSIYDP YYSSLFEDGQDSK
Subjt:  SEQYGPLFTFSVKLTLAMQIIGFSVRLSSSLL------LGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSK

Query:  CLSGISHFGNGDNGSTSGPDVSRSKLSRHFQVADDEHADGNQQTV
        CLSGISHFGNGDNGSTSG DVSRSK+SRHFQV DDEHA GNQQTV
Subjt:  CLSGISHFGNGDNGSTSGPDVSRSKLSRHFQVADDEHADGNQQTV

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT1G55535.1 unknown protein | 3.4e-76 | 63.49
Query:  MMLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDIFWFILFAYDTWNIS
        MMLC SLRDR+ PWLRDY +LQS AV LIY QIGCALIGSLGALYNGVLLINLAIALFALVAIES+SQSLGRTYAVLLF A+ LDI WFILF  + W+IS
Subjt:  MMLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDIFWFILFAYDTWNIS

Query:  SEQYGPLFTFSVKLTLAMQIIGFSVRLSSSLL------LGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQ---
        +E YG  F FSVKLT+AM++IGF VRLSSSLL      LG + ++TS+PRE D DLRNSFL+P TP + RQ SG+++++GGSIYDP YY+SLFE+ Q   
Subjt:  SEQYGPLFTFSVKLTLAMQIIGFSVRLSSSLL------LGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQ---

Query:  DSKCLSGISHFGNGDNGSTSGPDVS--RSKLSRHFQVADDE
        +S   + ++H+  G+NGS S  + S  +S + R     D+E
Subjt:  DSKCLSGISHFGNGDNGSTSGPDVS--RSKLSRHFQVADDE

AT1G55535.2 unknown protein | 3.4e-76 | 63.49
Query:  MMLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDIFWFILFAYDTWNIS
        MMLC SLRDR+ PWLRDY +LQS AV LIY QIGCALIGSLGALYNGVLLINLAIALFALVAIES+SQSLGRTYAVLLF A+ LDI WFILF  + W+IS
Subjt:  MMLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDIFWFILFAYDTWNIS

Query:  SEQYGPLFTFSVKLTLAMQIIGFSVRLSSSLL------LGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQ---
        +E YG  F FSVKLT+AM++IGF VRLSSSLL      LG + ++TS+PRE D DLRNSFL+P TP + RQ SG+++++GGSIYDP YY+SLFE+ Q   
Subjt:  SEQYGPLFTFSVKLTLAMQIIGFSVRLSSSLL------LGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQ---

Query:  DSKCLSGISHFGNGDNGSTSGPDVS--RSKLSRHFQVADDE
        +S   + ++H+  G+NGS S  + S  +S + R     D+E
Subjt:  DSKCLSGISHFGNGDNGSTSGPDVS--RSKLSRHFQVADDE

AT3G13420.1 unknown protein | 6.1e-57 | 58.69
Query:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDIFWFILFAYDTWNISS
        MLC SLR+R+  WLRDY RLQS  +ILIY QIGCALIGSLGALYNGV+LINLAIALF LVAIES+SQSLGRTYAVLLF AI LD+ WFILF+ + WNISS
Subjt:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDIFWFILFAYDTWNISS

Query:  EQYGPLFTFSVKLTLAMQIIGFSVRLSSSLL------LGISYMETSVPREADYDLRNSFLSP------------------ATPVVVRQPSGSDDMIGGSI
        + Y   + FSVKLTLAM+I GF VRLSSSLL      LG S +++  PR++D DLRNSFL P                    P + +Q S SD+++  SI
Subjt:  EQYGPLFTFSVKLTLAMQIIGFSVRLSSSLL------LGISYMETSVPREADYDLRNSFLSP------------------ATPVVVRQPSGSDDMIGGSI

Query:  YDPTYYSSLFEDG
         +P  Y+ L + G
Subjt:  YDPTYYSSLFEDG


Sequences
CDS sequence
ATGAAGAGAACTGATTCTTCCATTAAATTACTCCTCTGTCCTCGCTGCATCGCGCTCCTTCTCCTCTCAGTCGCTGGCGAAAAGGAAGGGGAAAAGAAACAGAGACAGAA
AAAGAGTATAGAGAAATTTGATATTGAGTGGGGGAGTGCGGCGGACAGGGGCGGTTACGAGGGTGGATTTTTCAAATGGTATATGATGCTTTGCAATTCATTGAGAGATC
GACTTCGGCCATGGCTTCGTGATTATGATAGGCTTCAGTCTTTCGCAGTCATTCTCATTTATATTCAGATCGGGTGCGCATTGATTGGATCCCTAGGGGCGTTGTACAAC
GGTGTTTTGCTTATAAATTTGGCGATCGCATTGTTCGCTTTGGTAGCCATAGAGAGCAGCAGTCAGAGTCTTGGTCGTACATATGCTGTTCTCCTGTTTTCTGCGATTTT
CCTCGACATCTTCTGGTTTATTCTTTTCGCCTACGACACATGGAACATCTCATCTGAGCAATATGGACCCCTCTTTACCTTTTCAGTGAAGCTTACTCTGGCTATGCAGA
TTATTGGATTTTCTGTTAGGCTATCGTCTTCACTACTGTTGGGGATTTCATACATGGAAACTTCAGTTCCCCGAGAGGCAGATTACGATTTGAGAAATAGTTTTCTTAGC
CCGGCTACTCCTGTTGTAGTTAGACAACCATCAGGTTCTGATGATATGATAGGGGGCTCTATCTACGATCCAACTTATTATTCGTCCCTATTCGAAGATGGTCAAGATAG
TAAATGTCTGTCTGGGATCTCCCATTTTGGCAATGGTGATAATGGTTCTACCTCTGGGCCAGATGTATCTCGATCAAAGCTGTCCAGACATTTCCAAGTAGCAGATGATG
AGCATGCAGATGGAAATCAGCAGACGGTTTAG
mRNA sequence
ATGAAGAGAACTGATTCTTCCATTAAATTACTCCTCTGTCCTCGCTGCATCGCGCTCCTTCTCCTCTCAGTCGCTGGCGAAAAGGAAGGGGAAAAGAAACAGAGACAGAA
AAAGAGTATAGAGAAATTTGATATTGAGTGGGGGAGTGCGGCGGACAGGGGCGGTTACGAGGGTGGATTTTTCAAATGGTATATGATGCTTTGCAATTCATTGAGAGATC
GACTTCGGCCATGGCTTCGTGATTATGATAGGCTTCAGTCTTTCGCAGTCATTCTCATTTATATTCAGATCGGGTGCGCATTGATTGGATCCCTAGGGGCGTTGTACAAC
GGTGTTTTGCTTATAAATTTGGCGATCGCATTGTTCGCTTTGGTAGCCATAGAGAGCAGCAGTCAGAGTCTTGGTCGTACATATGCTGTTCTCCTGTTTTCTGCGATTTT
CCTCGACATCTTCTGGTTTATTCTTTTCGCCTACGACACATGGAACATCTCATCTGAGCAATATGGACCCCTCTTTACCTTTTCAGTGAAGCTTACTCTGGCTATGCAGA
TTATTGGATTTTCTGTTAGGCTATCGTCTTCACTACTGTTGGGGATTTCATACATGGAAACTTCAGTTCCCCGAGAGGCAGATTACGATTTGAGAAATAGTTTTCTTAGC
CCGGCTACTCCTGTTGTAGTTAGACAACCATCAGGTTCTGATGATATGATAGGGGGCTCTATCTACGATCCAACTTATTATTCGTCCCTATTCGAAGATGGTCAAGATAG
TAAATGTCTGTCTGGGATCTCCCATTTTGGCAATGGTGATAATGGTTCTACCTCTGGGCCAGATGTATCTCGATCAAAGCTGTCCAGACATTTCCAAGTAGCAGATGATG
AGCATGCAGATGGAAATCAGCAGACGGTTTAGAGCACTGAGTTGGTGAATTTGTTGACTATTTGCTCCTGTTATTTTGATGTCACAAATATTACGACTACTCAATACTCA
GATAAAGGGTCAGTTAGATCTTGGAGCGGAAGAAATCTCTGACACAAGGAAACGCATATTTTTTTCTTAAAAAATATTTATTATAACATTTTCAAATTTTGGGGATATTG
TGGAAATTAACCTACCTTATAGGGTAAACTACCATTTGGTTTTGTTATATTTTAATAATACAATCTGGTGGATCTCATATGAATGTGTCTC
Protein sequence
MKRTDSSIKLLLCPRCIALLLLSVAGEKEGEKKQRQKKSIEKFDIEWGSAADRGGYEGGFFKWYMMLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYN
GVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDIFWFILFAYDTWNISSEQYGPLFTFSVKLTLAMQIIGFSVRLSSSLLLGISYMETSVPREADYDLRNSFLS
PATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSKCLSGISHFGNGDNGSTSGPDVSRSKLSRHFQVADDEHADGNQQTV
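The protein sequence above should correspond to the CDS under the standard genetic code. The following is a minimal sketch of how that correspondence could be checked, assuming the standard nuclear code applies here (the record does not name a translation table). The names `CODON_TABLE` and `translate` are illustrative and not part of CuGenDB. Only the first 36 and last 12 bases of the CDS are used.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are removed first; codons that are not
    in the table (for example, those containing N) are emitted as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 36 bases and last 12 bases of the CDS listed in this record.
cds_start = "ATGAAGAGAACTGATTCTTCCATTAAATTACTCCTC"
cds_end = "CAGACGGTTTAG"

print(translate(cds_start))  # compare with the start of the protein sequence
print(translate(cds_end))    # compare with the end of the protein sequence
```

Passing the full CDS, with line breaks left in, to `translate` would allow the whole protein sequence to be compared in the same way.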