; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10018845 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10018845
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionNuclear transcription factor Y subunit C-4, putative
Genome locationChr04:9598898..9599785
RNA-Seq ExpressionHG10018845
SyntenyHG10018845
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031439.1 uncharacterized protein E6C27_scaffold139G001960 [Cucumis melo var. makuwa]1.4e-8981.57Show/hide
Query:  MVILSFPCIVSILGQESG-SEFFSVSDMFDSKQLDLFFRDLGHEGWAINGHKVLILSSAETKGLIQIRVLDGDEHKLNIVVDSDFDRSGLFSDDSFDFVL
        MVIL+FPCIVSILGQESG SEFFSVSDM DS +LDLFFRDLGHEG++ NGHKVLILSSAETKGLIQIRVLDGDEHKLNIVVDSDFDR+GLFSDDSFDFVL
Subjt:  MVILSFPCIVSILGQESG-SEFFSVSDMFDSKQLDLFFRDLGHEGWAINGHKVLILSSAETKGLIQIRVLDGDEHKLNIVVDSDFDRSGLFSDDSFDFVL

Query:  SWGLVDSHFIDRILKIGGIVAFPL-NNNDPSNHFQKKPNYRPVFLNRYSSIIVALEKTAMADQLVYASASRRRLFQFSLPTRNAALRDLEDVLLEPPIKD
        SW  +DS FIDRILK GGIVAFPL NNNDPSNHF+KKPNY+P+FLNRY+SIIVA+EKTA+AD LVYASASRRRL + SLPT NAALRDLE         D
Subjt:  SWGLVDSHFIDRILKIGGIVAFPL-NNNDPSNHFQKKPNYRPVFLNRYSSIIVALEKTAMADQLVYASASRRRLFQFSLPTRNAALRDLEDVLLEPPIKD

Query:  VAKPNQLGRKIKYLPDI
        V KPN+LGRKI YL D+
Subjt:  VAKPNQLGRKIKYLPDI

XP_008455527.1 PREDICTED: uncharacterized protein LOC103495679 [Cucumis melo]4.9e-10679.01Show/hide
Query:  MDFARFNRPNS------TWNSNTHLVIKFPNTRIFRMISYSLLFAMVILSFPCIVSILGQESG-SEFFSVSDMFDSKQLDLFFRDLGHEGWAINGHKVLI
        MD ARFNRPN+      +WNS THLVI FPNTRI R+ISYS  FAMVIL+FPCIVSILGQESG SEFFSVSDM DS +LDLFFRDLGHEG++ NGHKVLI
Subjt:  MDFARFNRPNS------TWNSNTHLVIKFPNTRIFRMISYSLLFAMVILSFPCIVSILGQESG-SEFFSVSDMFDSKQLDLFFRDLGHEGWAINGHKVLI

Query:  LSSAETKGLIQIRVLDGDEHKLNIVVDSDFDRSGLFSDDSFDFVLSWGLVDSHFIDRILKIGGIVAFPL-NNNDPSNHFQKKPNYRPVFLNRYSSIIVAL
        LSSAETKGLIQIRVLDGDEHKLNIVVDSDFDR+GLFSDDSFDFVLSW  +DS FIDRILK GGIVAFPL NNNDPSNHF+KKPNY+P+FLNRY+SIIVA+
Subjt:  LSSAETKGLIQIRVLDGDEHKLNIVVDSDFDRSGLFSDDSFDFVLSWGLVDSHFIDRILKIGGIVAFPL-NNNDPSNHFQKKPNYRPVFLNRYSSIIVAL

Query:  EKTAMADQLVYASASRRRLFQFSLPTRNAALRDLEDVLLEPPIKDVAKPNQLGRKIKYLPDI
        EKTA+AD LVYASASRRRL + SLPT NAALRDLE         DV KPN+LGRKI YL D+
Subjt:  EKTAMADQLVYASASRRRLFQFSLPTRNAALRDLEDVLLEPPIKDVAKPNQLGRKIKYLPDI

XP_011659719.1 uncharacterized protein LOC105436238 [Cucumis sativus]1.3e-10678.33Show/hide
Query:  MDFARFNRP------NSTWNSNTHLVIKFPNTRIFRMISYSLLFAMVILSFPCIVSILGQESG-SEFFSVSDMFDSKQLDLFFRDLGHEGWAINGHKVLI
        MD ARFNRP      N +WNS THLVI FP T+I R+ISYS  FAMVIL+FPCIVSILGQE+G SEFFSV DM DS++LDLFFRDLGHEG++ NGHKVLI
Subjt:  MDFARFNRP------NSTWNSNTHLVIKFPNTRIFRMISYSLLFAMVILSFPCIVSILGQESG-SEFFSVSDMFDSKQLDLFFRDLGHEGWAINGHKVLI

Query:  LSSAETKGLIQIRVLDGDEHKLNIVVDSDFDRSGLFSDDSFDFVLSWGLVDSHFIDRILKIGGIVAFPL-NNNDPSNHFQKKPNYRPVFLNRYSSIIVAL
        LSSAET GLIQIRVLDGDEHKLNIVVDSDFDR+GLFSDDSFDFVLSWG +DS FIDRILKIGGIVAFPL NNNDPS+HF+KKPNY+PVFLNRY+SIIVA+
Subjt:  LSSAETKGLIQIRVLDGDEHKLNIVVDSDFDRSGLFSDDSFDFVLSWGLVDSHFIDRILKIGGIVAFPL-NNNDPSNHFQKKPNYRPVFLNRYSSIIVAL

Query:  EKTAMADQLVYASASRRRLFQFSLPTRNAALRDLEDVLLEPPIKDVAKPNQLGRKIKYLPDIA
        EKT MAD+LVY SASRRRL + SLPTRNAALRDLE         DV KPN+LGRKIKYLPD++
Subjt:  EKTAMADQLVYASASRRRLFQFSLPTRNAALRDLEDVLLEPPIKDVAKPNQLGRKIKYLPDIA

XP_022141924.1 uncharacterized protein LOC111012177 [Momordica charantia]7.2e-8962.29Show/hide
Query:  MDFARFNRPNS-------------TWNSNTHLVIKFPNTRIFRMISYSLLFAMVILSFPCIVSILGQESGSEFFSVSDMFDSKQLDLFFRDLGHEGWAIN
        MDFARFNR  +              WNS+THLVIKFP+ RI  +IS SL  A+VIL+ PCIVSILG+ES SEF SVSD+ DS QLDL FRD G+EG  IN
Subjt:  MDFARFNRPNS-------------TWNSNTHLVIKFPNTRIFRMISYSLLFAMVILSFPCIVSILGQESGSEFFSVSDMFDSKQLDLFFRDLGHEGWAIN

Query:  GHKVLILSSAETKGLIQIRVLDGDEHKLNIVVDSDFDRSGLFSDDSFDFVLSWGLVDSHFIDRILKIGGIVAFPLNNNDPSNHFQKKPNYRPVFLNRYSS
        G K +ILSS  T GL Q+RV+D DE KL+IV+DSDFD+SGLFSDDSFDFV +WG VDS F+DRILK GGI+AFP  N+ PSNHFQKKPNYRPVFL+RYSS
Subjt:  GHKVLILSSAETKGLIQIRVLDGDEHKLNIVVDSDFDRSGLFSDDSFDFVLSWGLVDSHFIDRILKIGGIVAFPLNNNDPSNHFQKKPNYRPVFLNRYSS

Query:  IIVALEKTAMADQLVYASASRRRLFQFSLPTRNAALRDL-EDVLLEPPIKDVAKPNQLGRKIKYLPDIAHSFLEASRRRI---VTVGQREWSNTLIR
        IIVA+EKTAM D +VY+SASRR L QFS  T  AA+R L ED+L E P K VAKP+ L RKIKY+ D+    L+  R+ +   VTVG  E +  +I+
Subjt:  IIVALEKTAMADQLVYASASRRRLFQFSLPTRNAALRDL-EDVLLEPPIKDVAKPNQLGRKIKYLPDIAHSFLEASRRRI---VTVGQREWSNTLIR

XP_038889013.1 uncharacterized protein LOC120078778 [Benincasa hispida]7.6e-12388.46Show/hide
Query:  MDFARFNRPNS------TWNSNTHLVIKFPNTRIFRMISYSLLFAMVILSFPCIVSILGQESGSEFFSVSDMFDSKQLDLFFRDLGHEGWAINGHKVLIL
        MDF  FNRPNS      +WNS THLVIKFPNT+I R+ISYSL FAM IL+FP IVSILGQESGSEFFSVSDM DS+QLDLFFRDLGHEG  INGHK LIL
Subjt:  MDFARFNRPNS------TWNSNTHLVIKFPNTRIFRMISYSLLFAMVILSFPCIVSILGQESGSEFFSVSDMFDSKQLDLFFRDLGHEGWAINGHKVLIL

Query:  SSAETKGLIQIRVLDGDEHKLNIVVDSDFDRSGLFSDDSFDFVLSWGLVDSHFIDRILKIGGIVAFPLNNNDPSNHFQKKPNYRPVFLNRYSSIIVALEK
        SSAETKGLIQIRVLDGDEHKLNIVVDSDFDRSGLFSDDSFDFVLS GLVDS FIDRILKIGGIVAFPLNNNDPSNHFQKKPNYRPVFLNRYSSIIV +EK
Subjt:  SSAETKGLIQIRVLDGDEHKLNIVVDSDFDRSGLFSDDSFDFVLSWGLVDSHFIDRILKIGGIVAFPLNNNDPSNHFQKKPNYRPVFLNRYSSIIVALEK

Query:  TAMADQLVYASASRRRLFQFSLPTRNAALRDLEDVLLEPPIKDVAKPNQLGRKIKYLPDI
        TAMADQLVYAS+SRRRLFQFSLPTRNAALRDLEDVLLEPPIKDVAKPN+LGRK+KYLPD+
Subjt:  TAMADQLVYASASRRRLFQFSLPTRNAALRDLEDVLLEPPIKDVAKPNQLGRKIKYLPDI

TrEMBL top hitse value%identityAlignment
A0A0A0K451 Uncharacterized protein6.3e-10778.33Show/hide
Query:  MDFARFNRP------NSTWNSNTHLVIKFPNTRIFRMISYSLLFAMVILSFPCIVSILGQESG-SEFFSVSDMFDSKQLDLFFRDLGHEGWAINGHKVLI
        MD ARFNRP      N +WNS THLVI FP T+I R+ISYS  FAMVIL+FPCIVSILGQE+G SEFFSV DM DS++LDLFFRDLGHEG++ NGHKVLI
Subjt:  MDFARFNRP------NSTWNSNTHLVIKFPNTRIFRMISYSLLFAMVILSFPCIVSILGQESG-SEFFSVSDMFDSKQLDLFFRDLGHEGWAINGHKVLI

Query:  LSSAETKGLIQIRVLDGDEHKLNIVVDSDFDRSGLFSDDSFDFVLSWGLVDSHFIDRILKIGGIVAFPL-NNNDPSNHFQKKPNYRPVFLNRYSSIIVAL
        LSSAET GLIQIRVLDGDEHKLNIVVDSDFDR+GLFSDDSFDFVLSWG +DS FIDRILKIGGIVAFPL NNNDPS+HF+KKPNY+PVFLNRY+SIIVA+
Subjt:  LSSAETKGLIQIRVLDGDEHKLNIVVDSDFDRSGLFSDDSFDFVLSWGLVDSHFIDRILKIGGIVAFPL-NNNDPSNHFQKKPNYRPVFLNRYSSIIVAL

Query:  EKTAMADQLVYASASRRRLFQFSLPTRNAALRDLEDVLLEPPIKDVAKPNQLGRKIKYLPDIA
        EKT MAD+LVY SASRRRL + SLPTRNAALRDLE         DV KPN+LGRKIKYLPD++
Subjt:  EKTAMADQLVYASASRRRLFQFSLPTRNAALRDLEDVLLEPPIKDVAKPNQLGRKIKYLPDIA

A0A1S3C0P0 uncharacterized protein LOC1034956792.4e-10679.01Show/hide
Query:  MDFARFNRPNS------TWNSNTHLVIKFPNTRIFRMISYSLLFAMVILSFPCIVSILGQESG-SEFFSVSDMFDSKQLDLFFRDLGHEGWAINGHKVLI
        MD ARFNRPN+      +WNS THLVI FPNTRI R+ISYS  FAMVIL+FPCIVSILGQESG SEFFSVSDM DS +LDLFFRDLGHEG++ NGHKVLI
Subjt:  MDFARFNRPNS------TWNSNTHLVIKFPNTRIFRMISYSLLFAMVILSFPCIVSILGQESG-SEFFSVSDMFDSKQLDLFFRDLGHEGWAINGHKVLI

Query:  LSSAETKGLIQIRVLDGDEHKLNIVVDSDFDRSGLFSDDSFDFVLSWGLVDSHFIDRILKIGGIVAFPL-NNNDPSNHFQKKPNYRPVFLNRYSSIIVAL
        LSSAETKGLIQIRVLDGDEHKLNIVVDSDFDR+GLFSDDSFDFVLSW  +DS FIDRILK GGIVAFPL NNNDPSNHF+KKPNY+P+FLNRY+SIIVA+
Subjt:  LSSAETKGLIQIRVLDGDEHKLNIVVDSDFDRSGLFSDDSFDFVLSWGLVDSHFIDRILKIGGIVAFPL-NNNDPSNHFQKKPNYRPVFLNRYSSIIVAL

Query:  EKTAMADQLVYASASRRRLFQFSLPTRNAALRDLEDVLLEPPIKDVAKPNQLGRKIKYLPDI
        EKTA+AD LVYASASRRRL + SLPT NAALRDLE         DV KPN+LGRKI YL D+
Subjt:  EKTAMADQLVYASASRRRLFQFSLPTRNAALRDLEDVLLEPPIKDVAKPNQLGRKIKYLPDI

A0A2P6QZ04 Uncharacterized protein3.1e-4544.49Show/hide
Query:  NSNTHLVIKFPNTRIFRMISYSLLFAMVILSFPCIVSILGQESGSEF-FSVSDMFDSKQLDLFFRDLGHEGWAINGHKVLILSSAETKGLIQIRVLDGDE
        +S   LVIK P+ ++ R+I  S+  A+V+L+ PCI SI  + + SE   S S +F  +QL L F DL  EG    G K LI+S      +  IR L+ D+
Subjt:  NSNTHLVIKFPNTRIFRMISYSLLFAMVILSFPCIVSILGQESGSEF-FSVSDMFDSKQLDLFFRDLGHEGWAINGHKVLILSSAETKGLIQIRVLDGDE

Query:  HKLNIVVDSDFDRSGLFSDDSFDFVLSWGLVDSHFIDRILKIGGIVAFPLNNNDPSNHFQKKPNYRPVFLNRYSSIIVALEKTAMADQLVYASASRRRLF
        +   I +DSD +R     D+S DFV ++ L D+ F+DR+LK+GGIVA PL +NDPSN F KK NY+ V+L RY+SI VA+ KT +A +L   +  RRRL 
Subjt:  HKLNIVVDSDFDRSGLFSDDSFDFVLSWGLVDSHFIDRILKIGGIVAFPLNNNDPSNHFQKKPNYRPVFLNRYSSIIVALEKTAMADQLVYASASRRRLF

Query:  QFSLPTRNAALRDLEDVLLEPPIKDVAKPNQLGRKIKYLPDIAHSFLEASRRRI
        QF    +   L+ LEDVLLEPP + +AK +Q  +K+K+LPD+  + LE   RR+
Subjt:  QFSLPTRNAALRDLEDVLLEPPIKDVAKPNQLGRKIKYLPDIAHSFLEASRRRI

A0A5A7SQ50 Uncharacterized protein7.0e-9081.57Show/hide
Query:  MVILSFPCIVSILGQESG-SEFFSVSDMFDSKQLDLFFRDLGHEGWAINGHKVLILSSAETKGLIQIRVLDGDEHKLNIVVDSDFDRSGLFSDDSFDFVL
        MVIL+FPCIVSILGQESG SEFFSVSDM DS +LDLFFRDLGHEG++ NGHKVLILSSAETKGLIQIRVLDGDEHKLNIVVDSDFDR+GLFSDDSFDFVL
Subjt:  MVILSFPCIVSILGQESG-SEFFSVSDMFDSKQLDLFFRDLGHEGWAINGHKVLILSSAETKGLIQIRVLDGDEHKLNIVVDSDFDRSGLFSDDSFDFVL

Query:  SWGLVDSHFIDRILKIGGIVAFPL-NNNDPSNHFQKKPNYRPVFLNRYSSIIVALEKTAMADQLVYASASRRRLFQFSLPTRNAALRDLEDVLLEPPIKD
        SW  +DS FIDRILK GGIVAFPL NNNDPSNHF+KKPNY+P+FLNRY+SIIVA+EKTA+AD LVYASASRRRL + SLPT NAALRDLE         D
Subjt:  SWGLVDSHFIDRILKIGGIVAFPL-NNNDPSNHFQKKPNYRPVFLNRYSSIIVALEKTAMADQLVYASASRRRLFQFSLPTRNAALRDLEDVLLEPPIKD

Query:  VAKPNQLGRKIKYLPDI
        V KPN+LGRKI YL D+
Subjt:  VAKPNQLGRKIKYLPDI

A0A6J1CK51 uncharacterized protein LOC1110121773.5e-8962.29Show/hide
Query:  MDFARFNRPNS-------------TWNSNTHLVIKFPNTRIFRMISYSLLFAMVILSFPCIVSILGQESGSEFFSVSDMFDSKQLDLFFRDLGHEGWAIN
        MDFARFNR  +              WNS+THLVIKFP+ RI  +IS SL  A+VIL+ PCIVSILG+ES SEF SVSD+ DS QLDL FRD G+EG  IN
Subjt:  MDFARFNRPNS-------------TWNSNTHLVIKFPNTRIFRMISYSLLFAMVILSFPCIVSILGQESGSEFFSVSDMFDSKQLDLFFRDLGHEGWAIN

Query:  GHKVLILSSAETKGLIQIRVLDGDEHKLNIVVDSDFDRSGLFSDDSFDFVLSWGLVDSHFIDRILKIGGIVAFPLNNNDPSNHFQKKPNYRPVFLNRYSS
        G K +ILSS  T GL Q+RV+D DE KL+IV+DSDFD+SGLFSDDSFDFV +WG VDS F+DRILK GGI+AFP  N+ PSNHFQKKPNYRPVFL+RYSS
Subjt:  GHKVLILSSAETKGLIQIRVLDGDEHKLNIVVDSDFDRSGLFSDDSFDFVLSWGLVDSHFIDRILKIGGIVAFPLNNNDPSNHFQKKPNYRPVFLNRYSS

Query:  IIVALEKTAMADQLVYASASRRRLFQFSLPTRNAALRDL-EDVLLEPPIKDVAKPNQLGRKIKYLPDIAHSFLEASRRRI---VTVGQREWSNTLIR
        IIVA+EKTAM D +VY+SASRR L QFS  T  AA+R L ED+L E P K VAKP+ L RKIKY+ D+    L+  R+ +   VTVG  E +  +I+
Subjt:  IIVALEKTAMADQLVYASASRRRLFQFSLPTRNAALRDL-EDVLLEPPIKDVAKPNQLGRKIKYLPDIAHSFLEASRRRI---VTVGQREWSNTLIR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G58120.1 BEST Arabidopsis thaliana protein match is: methyltransferases (TAIR:AT5G01710.1)4.4e-2029.93Show/hide
Query:  NSTWNSNTHLVIKFPNTRIFRMISYSLLFAMVILSFPCIVSILGQESGSEFFSVSDMFDSKQ-LDLFFRDLGHEGWAINGHKVLILSSAETKGLIQIRVL
        +S+  S+    +K   + +  +   S L A++ LSF  + S+L   + +   S S   D  + L L   DL  +G    G K L LS  + +  +     
Subjt:  NSTWNSNTHLVIKFPNTRIFRMISYSLLFAMVILSFPCIVSILGQESGSEFFSVSDMFDSKQ-LDLFFRDLGHEGWAINGHKVLILSSAETKGLIQIRVL

Query:  DGDEHKLNIVVDSDFDRSGLFSDDSFDFVL--SWGLVDSHFIDRILKIGGIVAFPLNNNDPSNHFQKKPNYRPVFLNRYSSIIVALEKTAMADQLVYASA
           E  + +V  SD +   +  D++FDF    S  +  + FIDR LK+GGI    LN  D   +F K PNY  V++      ++ + KT   +Q     A
Subjt:  DGDEHKLNIVVDSDFDRSGLFSDDSFDFVL--SWGLVDSHFIDRILKIGGIVAFPLNNNDPSNHFQKKPNYRPVFLNRYSSIIVALEKTAMADQLVYASA

Query:  SRRRLFQFS-LPTRNAALRDLEDVLLEPPIKDVAKPNQLGRKIKYLPDIAHSFLEA---SRRRIVTVGQREWSN
        + R+L   +    R  ALR LEDVLLEPP     K     ++ +YLPD+    L+    SRR  + VG  + S+
Subjt:  SRRRLFQFS-LPTRNAALRDLEDVLLEPPIKDVAKPNQLGRKIKYLPDIAHSFLEA---SRRRIVTVGQREWSN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTTTGCTCGTTTCAATCGACCCAATAGCACCTGGAATTCCAATACCCATTTGGTTATTAAGTTTCCTAATACTCGAATTTTTCGTATGATTTCTTATTCGTTGTT
ATTTGCTATGGTTATTCTCTCGTTTCCCTGTATTGTCTCCATTCTTGGGCAAGAAAGTGGGTCTGAGTTTTTTTCTGTGTCAGATATGTTTGATTCTAAGCAATTGGATT
TGTTTTTTCGTGATTTGGGTCACGAAGGCTGGGCCATTAACGGCCATAAGGTTCTCATTTTGAGCTCTGCTGAAACTAAGGGCTTGATTCAGATTCGTGTGTTGGATGGT
GATGAACACAAACTTAATATTGTTGTGGACTCTGATTTTGATAGGAGTGGATTGTTTTCTGATGATTCTTTTGATTTTGTGTTATCTTGGGGCCTTGTGGACTCTCATTT
CATTGATAGAATTTTGAAAATCGGTGGCATTGTGGCTTTTCCACTCAATAACAATGACCCATCAAATCATTTTCAAAAGAAACCAAATTACAGGCCTGTGTTTCTCAATA
GATACAGCTCCATTATTGTGGCATTGGAGAAGACAGCCATGGCTGATCAGCTGGTTTATGCTTCAGCTTCAAGAAGACGTCTCTTTCAATTCTCATTGCCAACTAGAAAT
GCAGCTTTGAGAGACCTTGAGGATGTTCTACTTGAGCCACCAATTAAGGATGTGGCCAAACCAAACCAACTTGGGAGGAAAATCAAGTACCTTCCTGACATCGCACACAG
TTTTCTCGAAGCTTCTAGGCGAAGGATCGTCACGGTTGGCCAGCGTGAATGGTCCAATACTTTGATCAGAACTACCCAAGAAAGGATCAGGAGTTTGAGGTTCCCAAAAT
TGACTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATTTTGCTCGTTTCAATCGACCCAATAGCACCTGGAATTCCAATACCCATTTGGTTATTAAGTTTCCTAATACTCGAATTTTTCGTATGATTTCTTATTCGTTGTT
ATTTGCTATGGTTATTCTCTCGTTTCCCTGTATTGTCTCCATTCTTGGGCAAGAAAGTGGGTCTGAGTTTTTTTCTGTGTCAGATATGTTTGATTCTAAGCAATTGGATT
TGTTTTTTCGTGATTTGGGTCACGAAGGCTGGGCCATTAACGGCCATAAGGTTCTCATTTTGAGCTCTGCTGAAACTAAGGGCTTGATTCAGATTCGTGTGTTGGATGGT
GATGAACACAAACTTAATATTGTTGTGGACTCTGATTTTGATAGGAGTGGATTGTTTTCTGATGATTCTTTTGATTTTGTGTTATCTTGGGGCCTTGTGGACTCTCATTT
CATTGATAGAATTTTGAAAATCGGTGGCATTGTGGCTTTTCCACTCAATAACAATGACCCATCAAATCATTTTCAAAAGAAACCAAATTACAGGCCTGTGTTTCTCAATA
GATACAGCTCCATTATTGTGGCATTGGAGAAGACAGCCATGGCTGATCAGCTGGTTTATGCTTCAGCTTCAAGAAGACGTCTCTTTCAATTCTCATTGCCAACTAGAAAT
GCAGCTTTGAGAGACCTTGAGGATGTTCTACTTGAGCCACCAATTAAGGATGTGGCCAAACCAAACCAACTTGGGAGGAAAATCAAGTACCTTCCTGACATCGCACACAG
TTTTCTCGAAGCTTCTAGGCGAAGGATCGTCACGGTTGGCCAGCGTGAATGGTCCAATACTTTGATCAGAACTACCCAAGAAAGGATCAGGAGTTTGAGGTTCCCAAAAT
TGACTTGA
Protein sequenceShow/hide protein sequence
MDFARFNRPNSTWNSNTHLVIKFPNTRIFRMISYSLLFAMVILSFPCIVSILGQESGSEFFSVSDMFDSKQLDLFFRDLGHEGWAINGHKVLILSSAETKGLIQIRVLDG
DEHKLNIVVDSDFDRSGLFSDDSFDFVLSWGLVDSHFIDRILKIGGIVAFPLNNNDPSNHFQKKPNYRPVFLNRYSSIIVALEKTAMADQLVYASASRRRLFQFSLPTRN
AALRDLEDVLLEPPIKDVAKPNQLGRKIKYLPDIAHSFLEASRRRIVTVGQREWSNTLIRTTQERIRSLRFPKLT