; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg04078 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg04078
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionUPF0114 domain-containing protein
Genome locationCarg_Chr12:2958416..2960105
RNA-Seq ExpressionCarg04078
SyntenyCarg04078
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR005134 - Uncharacterised protein family UPF0114


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7020537.1 hypothetical protein SDJN02_17222, partial [Cucurbita argyrosperma subsp. argyrosperma]3.6e-132100Show/hide
Query:  MAATRLLRSVRPSSTVVSSSSSSSPSTAATVRCLGKAGLNSKNGERLVTSGDGERRQIVNLKAAAAPETVETETRELDLGSLLANLLVQLKNTAVKTKIR
        MAATRLLRSVRPSSTVVSSSSSSSPSTAATVRCLGKAGLNSKNGERLVTSGDGERRQIVNLKAAAAPETVETETRELDLGSLLANLLVQLKNTAVKTKIR
Subjt:  MAATRLLRSVRPSSTVVSSSSSSSPSTAATVRCLGKAGLNSKNGERLVTSGDGERRQIVNLKAAAAPETVETETRELDLGSLLANLLVQLKNTAVKTKIR

Query:  RRQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCFLEGSFIVAESYLQYFNGVSRRSDESHAVELLIESLDMFLVGTALVVFGVGLFAMFNIPTWVEMESV
        RRQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCFLEGSFIVAESYLQYFNGVSRRSDESHAVELLIESLDMFLVGTALVVFGVGLFAMFNIPTWVEMESV
Subjt:  RRQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCFLEGSFIVAESYLQYFNGVSRRSDESHAVELLIESLDMFLVGTALVVFGVGLFAMFNIPTWVEMESV

Query:  SEAKSKIGHAVMMILQVGVLEKFKSIPLSSATDLACFAAAILISSASIFFLSRLNIGGGGGGGYK
        SEAKSKIGHAVMMILQVGVLEKFKSIPLSSATDLACFAAAILISSASIFFLSRLNIGGGGGGGYK
Subjt:  SEAKSKIGHAVMMILQVGVLEKFKSIPLSSATDLACFAAAILISSASIFFLSRLNIGGGGGGGYK

XP_022951147.1 uncharacterized protein LOC111454079 [Cucurbita moschata]2.1e-12488.81Show/hide
Query:  MAATRLLRSVRPSSTVVSSSSSSSPSTAATVRCLGKAGLNSKNGERLVTSGDGERRQIVNLK----AAAAPETVETETRELDLGSLLANLLVQLKNTAVK
        MAATRLLRSVRPS+TVVSSSSSSSPSTAATVRCLGK GLNSKNGERLVTSGDGERRQIVNLK    AAAAPETVETETRELDLGSLLANLLVQLKNTAVK
Subjt:  MAATRLLRSVRPSSTVVSSSSSSSPSTAATVRCLGKAGLNSKNGERLVTSGDGERRQIVNLK----AAAAPETVETETRELDLGSLLANLLVQLKNTAVK

Query:  TKIRRRQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCFLEGSFIVAESYLQYFNGVSRRSDESHAVELLIESLDMFLVGTALVVFGVGLFAMF-------
        TKIRRRQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCFLEGSFIVAESYLQYFNGVSRRSDESHAVELLIESLDMFLVGTALVVFGVGLFAMF       
Subjt:  TKIRRRQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCFLEGSFIVAESYLQYFNGVSRRSDESHAVELLIESLDMFLVGTALVVFGVGLFAMF-------

Query:  -------------------NIPTWVEMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLSSATDLACFAAAILISSASIFFLSRLNIGGGGGGGYK
                           NIPTWVEMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLSSATDLACFAAAILISSASIFFLSRLNIGGGG GGYK
Subjt:  -------------------NIPTWVEMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLSSATDLACFAAAILISSASIFFLSRLNIGGGGGGGYK

XP_023002007.1 uncharacterized protein LOC111496020 [Cucurbita maxima]6.8e-11584.12Show/hide
Query:  MAATRLLRSVRPSSTVVSSSSSSSPSTAATVRCLGKAGLNSKNGERLVTSGDGERRQIVNLK-----AAAAPETVETETRELDLGSLLANLLVQLKNTAV
        MAATRLLRSVRPS+TVV SSSSSSPSTAA VRCL K GLNSKNGERL+TSGDGERRQIVNLK     AAAAPETVET+TRELDLGSLLANLLVQLKNTAV
Subjt:  MAATRLLRSVRPSSTVVSSSSSSSPSTAATVRCLGKAGLNSKNGERLVTSGDGERRQIVNLK-----AAAAPETVETETRELDLGSLLANLLVQLKNTAV

Query:  KTKIRRRQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCFLEGSFIVAESYLQYFNGVSRRSDESHAVELLIESLDMFLVGTALVVFGVGLFAMF------
        K KIRRRQIQKFIEKIII+CRFFTLFAVAGSLLGSILCFLEGSFIVAESYLQYFN VSRRSDESHAVELLIESLDMFLVGTALVVFGVGLFAMF      
Subjt:  KTKIRRRQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCFLEGSFIVAESYLQYFNGVSRRSDESHAVELLIESLDMFLVGTALVVFGVGLFAMF------

Query:  --------------------NIPTWVEMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLSSATDLACFAAAILISSASIFFLSRLNIGGGGGGGYK
                            NIPTWVEMESVSEAKSKIGHAVMMILQVGVLEK KSIPLSSA DLACFAAAILI SASIFFLSRLNIGGG   GYK
Subjt:  --------------------NIPTWVEMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLSSATDLACFAAAILISSASIFFLSRLNIGGGGGGGYK

XP_023538418.1 uncharacterized protein LOC111799204 [Cucurbita pepo subsp. pepo]4.6e-11985.42Show/hide
Query:  MAATRLLRSVRPSSTVVSSSSSSSPSTAATVRCLGKAGLNSKNGERLVTSGDGERRQIVNLK----AAAAPETVETETRELDLGSLLANLLVQLKNTAVK
        MAATRL RSVRPS+TVVSSSSSSSPSTAATVRCLGK GLNSKNGERL+TSGDGER+  VNLK    AAAAPETVET+TRELDLGSLLANLLVQLKNT VK
Subjt:  MAATRLLRSVRPSSTVVSSSSSSSPSTAATVRCLGKAGLNSKNGERLVTSGDGERRQIVNLK----AAAAPETVETETRELDLGSLLANLLVQLKNTAVK

Query:  TKIRRRQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCFLEGSFIVAESYLQYFNGVSRRSDESHAVELLIESLDMFLVGTALVVFGVGLFAMF-------
        TKIRRRQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCFLEGSFIVAESYLQYFNGVSRRSDESHAVELLIESLDMFLVGTALVVFGVGLFAMF       
Subjt:  TKIRRRQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCFLEGSFIVAESYLQYFNGVSRRSDESHAVELLIESLDMFLVGTALVVFGVGLFAMF-------

Query:  -------------------NIPTWVEMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLSSATDLACFAAAILISSASIFFLSRLNIGGGGGGGYK
                           NIPTWVEMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLSSA DLACFA AILISSASIFFLSRLNIGGG  GGYK
Subjt:  -------------------NIPTWVEMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLSSATDLACFAAAILISSASIFFLSRLNIGGGGGGGYK

XP_038885641.1 uncharacterized protein LOC120075956 [Benincasa hispida]1.1e-10473.88Show/hide
Query:  MAATRLLRSVRPSSTVVSSSSSSSPSTAATVRCLGKAGLNSKNGERLVTSGDGERRQIVNLKAAAAPETVETETRELDLGSLLANLLVQLKNTAVKTKIR
        M ATR ++ +RP+S V SSSSSSSPS+A TVRCLGK GLN  NGERL+TSGDGER+QIV +KAAAAP+TVET T EL+LGSLLANLLVQLK T  KTKI+
Subjt:  MAATRLLRSVRPSSTVVSSSSSSSPSTAATVRCLGKAGLNSKNGERLVTSGDGERRQIVNLKAAAAPETVETETRELDLGSLLANLLVQLKNTAVKTKIR

Query:  RRQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCFLEGSFIVAESYLQYFNGVSRRSDESHAVELLIESLDMFLVGTALVVFGVGLFAMF-----------
        RRQIQKFIEKIIIDCRFFTL AVAGSLLGSILC++EGSFIVAESYLQYF+G+S+ S+++H VELLIE+LDMFLVGTALVVFGVGLFAMF           
Subjt:  RRQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCFLEGSFIVAESYLQYFNGVSRRSDESHAVELLIESLDMFLVGTALVVFGVGLFAMF-----------

Query:  ---------------NIPTWVEMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLSSATDLACFAAAILISSASIFFLSRLNIGGGGGGGYK
                        IPTWVEMES+S+AKSKIGHAVMMILQVGVLEKFK+IPLSSA DLACFAAA+++SSASIFFLS+LN+GGGG GG+K
Subjt:  ---------------NIPTWVEMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLSSATDLACFAAAILISSASIFFLSRLNIGGGGGGGYK

TrEMBL top hitse value%identityAlignment
A0A0A0LLC9 Uncharacterized protein4.8e-9869.39Show/hide
Query:  MAATRLLRSVRPSSTVVSSSSSSSPSTAATVRCLGKAGLNSKNGERLVTSGDGERRQIVNLKAA---AAPETVETETRELDLGSLLANLLVQLKNTAVKT
        MAATR ++ VRP++ V +SSSSSSPS+   VR LGK GLN  NGERL+TSG  ERRQ+V +KAA   AAP+TVET+T ELDLGSL+ANLL+QLKNT  KT
Subjt:  MAATRLLRSVRPSSTVVSSSSSSSPSTAATVRCLGKAGLNSKNGERLVTSGDGERRQIVNLKAA---AAPETVETETRELDLGSLLANLLVQLKNTAVKT

Query:  KIRRRQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCFLEGSFIVAESYLQYFNGVSRRSDESHAVELLIESLDMFLVGTALVVFGVGLFAMF--------
        KI++ +IQKFIEKIIIDCRFFTL AV+GSL+GSILC++EGSFIV ESYLQYF+G+S+R+D++H VELLIE+LDMFLVGTAL+VFG+GLFAMF        
Subjt:  KIRRRQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCFLEGSFIVAESYLQYFNGVSRRSDESHAVELLIESLDMFLVGTALVVFGVGLFAMF--------

Query:  ------------------NIPTWVEMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLSSATDLACFAAAILISSASIFFLSRLNIGGGGGGGYK
                           IPTWVEMES+S AKSKIGHAVMMILQVGVLEKFK+IPLSSA DLACFAAA+LISSASIFFLS+LN+GGGG  G+K
Subjt:  ------------------NIPTWVEMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLSSATDLACFAAAILISSASIFFLSRLNIGGGGGGGYK

A0A1S3BAX1 uncharacterized protein LOC1034879362.6e-9669.18Show/hide
Query:  MAATRLLRSVRPSSTVVSSSSSSSPSTAATVRCLGKAGLNSKNGERLVTSGDGERRQIVNLKAA-AAPETVETETRELDLGSLLANLLVQLKNTAVKTKI
        MAATR ++ VRP++ V SSSSSSSPS+   VR LGK GLN  NGERL+TSG GE RQ++ +KAA  AP+TVET+T ELDLGSL+++LLVQLK T  KTKI
Subjt:  MAATRLLRSVRPSSTVVSSSSSSSPSTAATVRCLGKAGLNSKNGERLVTSGDGERRQIVNLKAA-AAPETVETETRELDLGSLLANLLVQLKNTAVKTKI

Query:  RRRQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCFLEGSFIVAESYLQYFNGVSRRSDESHAVELLIESLDMFLVGTALVVFGVGLFAMF----------
        ++R+IQKFIEKIIIDCRFFTL AV+GSL+GSILC++EGSFIVAESYLQYF+ +S+R++++H VELLIE+LDMFLVGTALVVFG+GLFAMF          
Subjt:  RRRQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCFLEGSFIVAESYLQYFNGVSRRSDESHAVELLIESLDMFLVGTALVVFGVGLFAMF----------

Query:  ----------------NIPTWVEMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLSSATDLACFAAAILISSASIFFLSRLNIGGGGGGGYK
                         IPTWVEMES+S AKSKIGHAVMMILQVGVLEKFK+IPLSSA DLACFAAA+LISSASIFFLS+LN+G GG GG+K
Subjt:  ----------------NIPTWVEMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLSSATDLACFAAAILISSASIFFLSRLNIGGGGGGGYK

A0A5A7VG09 UPF0114 domain-containing protein2.0e-9669.52Show/hide
Query:  MAATRLLRSVRPSSTVVSSSSSSSPSTAATVRCLGKAGLNSKNGERLVTSGDGERRQIVNLKAA-AAPETVETETRELDLGSLLANLLVQLKNTAVKTKI
        MAATR ++ VRP++ V SSSSSSSPS+   VR LGK GLN  NGERL+TSG GE RQ+V +KAA  AP+TVET+T ELDLGSL+++LLVQLK T  KTKI
Subjt:  MAATRLLRSVRPSSTVVSSSSSSSPSTAATVRCLGKAGLNSKNGERLVTSGDGERRQIVNLKAA-AAPETVETETRELDLGSLLANLLVQLKNTAVKTKI

Query:  RRRQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCFLEGSFIVAESYLQYFNGVSRRSDESHAVELLIESLDMFLVGTALVVFGVGLFAMF----------
        ++R+IQKFIEKIIIDCRFFTL AV+GSL+GSILC++EGSFIVAESYLQYF+ +S+R++++H VELLIE+LDMFLVGTALVVFG+GLFAMF          
Subjt:  RRRQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCFLEGSFIVAESYLQYFNGVSRRSDESHAVELLIESLDMFLVGTALVVFGVGLFAMF----------

Query:  ----------------NIPTWVEMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLSSATDLACFAAAILISSASIFFLSRLNIGGGGGGGYK
                         IPTWVEMES+S AKSKIGHAVMMILQVGVLEKFK+IPLSSA DLACFAAA+LISSASIFFLS+LN+G GG GG+K
Subjt:  ----------------NIPTWVEMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLSSATDLACFAAAILISSASIFFLSRLNIGGGGGGGYK

A0A6J1GGV4 uncharacterized protein LOC1114540791.0e-12488.81Show/hide
Query:  MAATRLLRSVRPSSTVVSSSSSSSPSTAATVRCLGKAGLNSKNGERLVTSGDGERRQIVNLK----AAAAPETVETETRELDLGSLLANLLVQLKNTAVK
        MAATRLLRSVRPS+TVVSSSSSSSPSTAATVRCLGK GLNSKNGERLVTSGDGERRQIVNLK    AAAAPETVETETRELDLGSLLANLLVQLKNTAVK
Subjt:  MAATRLLRSVRPSSTVVSSSSSSSPSTAATVRCLGKAGLNSKNGERLVTSGDGERRQIVNLK----AAAAPETVETETRELDLGSLLANLLVQLKNTAVK

Query:  TKIRRRQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCFLEGSFIVAESYLQYFNGVSRRSDESHAVELLIESLDMFLVGTALVVFGVGLFAMF-------
        TKIRRRQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCFLEGSFIVAESYLQYFNGVSRRSDESHAVELLIESLDMFLVGTALVVFGVGLFAMF       
Subjt:  TKIRRRQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCFLEGSFIVAESYLQYFNGVSRRSDESHAVELLIESLDMFLVGTALVVFGVGLFAMF-------

Query:  -------------------NIPTWVEMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLSSATDLACFAAAILISSASIFFLSRLNIGGGGGGGYK
                           NIPTWVEMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLSSATDLACFAAAILISSASIFFLSRLNIGGGG GGYK
Subjt:  -------------------NIPTWVEMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLSSATDLACFAAAILISSASIFFLSRLNIGGGGGGGYK

A0A6J1KI88 uncharacterized protein LOC1114960203.3e-11584.12Show/hide
Query:  MAATRLLRSVRPSSTVVSSSSSSSPSTAATVRCLGKAGLNSKNGERLVTSGDGERRQIVNLK-----AAAAPETVETETRELDLGSLLANLLVQLKNTAV
        MAATRLLRSVRPS+TVV SSSSSSPSTAA VRCL K GLNSKNGERL+TSGDGERRQIVNLK     AAAAPETVET+TRELDLGSLLANLLVQLKNTAV
Subjt:  MAATRLLRSVRPSSTVVSSSSSSSPSTAATVRCLGKAGLNSKNGERLVTSGDGERRQIVNLK-----AAAAPETVETETRELDLGSLLANLLVQLKNTAV

Query:  KTKIRRRQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCFLEGSFIVAESYLQYFNGVSRRSDESHAVELLIESLDMFLVGTALVVFGVGLFAMF------
        K KIRRRQIQKFIEKIII+CRFFTLFAVAGSLLGSILCFLEGSFIVAESYLQYFN VSRRSDESHAVELLIESLDMFLVGTALVVFGVGLFAMF      
Subjt:  KTKIRRRQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCFLEGSFIVAESYLQYFNGVSRRSDESHAVELLIESLDMFLVGTALVVFGVGLFAMF------

Query:  --------------------NIPTWVEMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLSSATDLACFAAAILISSASIFFLSRLNIGGGGGGGYK
                            NIPTWVEMESVSEAKSKIGHAVMMILQVGVLEK KSIPLSSA DLACFAAAILI SASIFFLSRLNIGGG   GYK
Subjt:  --------------------NIPTWVEMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLSSATDLACFAAAILISSASIFFLSRLNIGGGGGGGYK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G19390.1 Uncharacterised protein family (UPF0114)1.6e-2939.13Show/hide
Query:  RRRQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCFLEGSFIVAESYLQYFNGVSRRSDESHAVELLIESLDMFLVGTALVVFGVG---------------
        R   +++ IEK+I  CRF T     GSLLGS+LCF++G   V +S+LQY   V+R       + LL+E++D++L+GT ++VFG+G               
Subjt:  RRRQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCFLEGSFIVAESYLQYFNGVSRRSDESHAVELLIESLDMFLVGTALVVFGVG---------------

Query:  ----------LFAMFNI---PTWVEMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLSSATDLACFAAAILISSASIFFLSRLN
                  LF MF +   P W+E++SVSE K+K+GH ++M+L +G+ +K K + ++S TDL C + +I  SSA +F LSRLN
Subjt:  ----------LFAMFNI---PTWVEMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLSSATDLACFAAAILISSASIFFLSRLN

AT5G13720.1 Uncharacterised protein family (UPF0114)7.7e-2433.15Show/hide
Query:  RQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCFLEGSFIVAESYLQYFNGVSRRSDESHAVELLIESLDMFLVGTALVVFGVGL----------------
        R  +  +E+II D RF  L AV GSL GS+LCFL G   + E+Y  Y+   S+       V  L+E++D++L GT +++F +GL                
Subjt:  RQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCFLEGSFIVAESYLQYFNGVSRRSDESHAVELLIESLDMFLVGTALVVFGVGL----------------

Query:  ------------FAMFNIPTWVEMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLSSATDLACFAAAILISSASIFFLSRLNIG
                    FAM   P W+++ S+ E K+K+GH ++MIL V + E+ K + +++  DL  ++  I +SSAS++ L  L+ G
Subjt:  ------------FAMFNIPTWVEMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLSSATDLACFAAAILISSASIFFLSRLNIG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGCCACTCGATTGCTCCGGTCGGTTCGGCCTTCTTCCACCGTGGTGTCTTCTTCTTCTTCATCATCGCCATCCACGGCGGCGACTGTGAGGTGCTTGGGCAAAGC
AGGGCTAAATTCAAAAAATGGCGAACGGCTAGTAACCTCCGGCGACGGTGAGAGAAGACAGATAGTCAACCTTAAGGCGGCGGCGGCTCCAGAGACTGTGGAGACAGAAA
CCAGAGAACTCGATTTGGGTTCTTTGCTGGCGAATCTACTCGTTCAATTGAAGAACACGGCGGTGAAGACAAAGATTCGGAGGCGACAGATTCAGAAATTCATCGAAAAG
ATCATAATCGACTGCCGATTCTTTACATTATTCGCCGTCGCCGGATCTTTACTGGGTTCGATTCTATGTTTCCTTGAGGGGAGCTTTATTGTTGCAGAGTCGTATCTGCA
GTACTTTAATGGCGTTTCACGGAGGTCCGACGAGAGTCATGCGGTGGAGCTTCTAATTGAATCATTAGACATGTTCTTGGTCGGAACTGCTCTGGTTGTTTTTGGTGTCG
GATTGTTCGCGATGTTCAACATTCCGACATGGGTGGAAATGGAATCGGTGTCGGAGGCAAAGTCGAAGATCGGACACGCGGTGATGATGATACTGCAAGTGGGTGTGTTG
GAGAAGTTCAAGAGTATACCTTTGAGCTCCGCCACCGATCTCGCCTGTTTCGCCGCCGCCATTCTGATTTCCTCCGCCTCTATCTTCTTCCTCTCCAGACTCAACATAGG
CGGCGGCGGGGGCGGCGGTTACAAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCAGCCACTCGATTGCTCCGGTCGGTTCGGCCTTCTTCCACCGTGGTGTCTTCTTCTTCTTCATCATCGCCATCCACGGCGGCGACTGTGAGGTGCTTGGGCAAAGC
AGGGCTAAATTCAAAAAATGGCGAACGGCTAGTAACCTCCGGCGACGGTGAGAGAAGACAGATAGTCAACCTTAAGGCGGCGGCGGCTCCAGAGACTGTGGAGACAGAAA
CCAGAGAACTCGATTTGGGTTCTTTGCTGGCGAATCTACTCGTTCAATTGAAGAACACGGCGGTGAAGACAAAGATTCGGAGGCGACAGATTCAGAAATTCATCGAAAAG
ATCATAATCGACTGCCGATTCTTTACATTATTCGCCGTCGCCGGATCTTTACTGGGTTCGATTCTATGTTTCCTTGAGGGGAGCTTTATTGTTGCAGAGTCGTATCTGCA
GTACTTTAATGGCGTTTCACGGAGGTCCGACGAGAGTCATGCGGTGGAGCTTCTAATTGAATCATTAGACATGTTCTTGGTCGGAACTGCTCTGGTTGTTTTTGGTGTCG
GATTGTTCGCGATGTTCAACATTCCGACATGGGTGGAAATGGAATCGGTGTCGGAGGCAAAGTCGAAGATCGGACACGCGGTGATGATGATACTGCAAGTGGGTGTGTTG
GAGAAGTTCAAGAGTATACCTTTGAGCTCCGCCACCGATCTCGCCTGTTTCGCCGCCGCCATTCTGATTTCCTCCGCCTCTATCTTCTTCCTCTCCAGACTCAACATAGG
CGGCGGCGGGGGCGGCGGTTACAAGTAA
Protein sequenceShow/hide protein sequence
MAATRLLRSVRPSSTVVSSSSSSSPSTAATVRCLGKAGLNSKNGERLVTSGDGERRQIVNLKAAAAPETVETETRELDLGSLLANLLVQLKNTAVKTKIRRRQIQKFIEK
IIIDCRFFTLFAVAGSLLGSILCFLEGSFIVAESYLQYFNGVSRRSDESHAVELLIESLDMFLVGTALVVFGVGLFAMFNIPTWVEMESVSEAKSKIGHAVMMILQVGVL
EKFKSIPLSSATDLACFAAAILISSASIFFLSRLNIGGGGGGGYK