; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0021349 (gene) of Snake gourd v1 genome

Gene IDTan0021349
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionUnknown protein
Genome locationLG04:81445087..81447964
RNA-Seq ExpressionTan0021349
SyntenyTan0021349
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6606925.1 hypothetical protein SDJN03_00267, partial [Cucurbita argyrosperma subsp. sororia]6.5e-10280.69Show/hide
Query:  MSGLDCMATKDREFEIDLEGGGNTSEDDLSSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASSSNSTKLVKLGADENVELLMDKGSDGEKRRELG
        MSGLDCMAT+DREFEIDLEGG  TSEDDLSSE+DS SKPH RKA SRLRSGFLC DGSINRGCSFASSSNSTKL KLG DENVELL+DK  DGEKRRELG
Subjt:  MSGLDCMATKDREFEIDLEGGGNTSEDDLSSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASSSNSTKLVKLGADENVELLMDKGSDGEKRRELG

Query:  TLAE-KKNVKEKIKKNGKVHKPPRPPRGPSLDAADRIFVKEITELAVKKRATVERIKALKKMKAEKTSSFNSSLPALFITLLFFVVIIFQGMSAKGNAMV
         LAE KKN+KE IK NGK+HKPPRPPRGPSLDAADRIFVKE+T+LAVKKRA VERIKALKKMKAEKTSSFNS+LPALFITLLFFVVIIFQGMSA G+A++
Subjt:  TLAE-KKNVKEKIKKNGKVHKPPRPPRGPSLDAADRIFVKEITELAVKKRATVERIKALKKMKAEKTSSFNSSLPALFITLLFFVVIIFQGMSAKGNAMV

Query:  SESPVPSIGRSAGLIPLQHSSKPSPEVNEPQSRVFNFAGKQTSDPATVREASLVEDLKN
         ESPVPS  RSAGLI LQHSSK  P VN PQS + +FAG          EAS VEDLKN
Subjt:  SESPVPSIGRSAGLIPLQHSSKPSPEVNEPQSRVFNFAGKQTSDPATVREASLVEDLKN

KAG7036629.1 hypothetical protein SDJN02_00248 [Cucurbita argyrosperma subsp. argyrosperma]2.2e-10280.69Show/hide
Query:  MSGLDCMATKDREFEIDLEGGGNTSEDDLSSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASSSNSTKLVKLGADENVELLMDKGSDGEKRRELG
        MSGLDCMAT+DREFEIDLEGG  TSEDDLSSE+DS SKPH RKA SRLRSGFLC DGSIN+GCSFASSSNSTKL KLG DENVELL+DK  DGEKRRELG
Subjt:  MSGLDCMATKDREFEIDLEGGGNTSEDDLSSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASSSNSTKLVKLGADENVELLMDKGSDGEKRRELG

Query:  TLAE-KKNVKEKIKKNGKVHKPPRPPRGPSLDAADRIFVKEITELAVKKRATVERIKALKKMKAEKTSSFNSSLPALFITLLFFVVIIFQGMSAKGNAMV
         LAE KKN+KE IK NGK+HKPPRPPRGPSLDAADRIFVKE+T+LAVKKRA VERIKALKKMKAEKTSSFNS+LPALFITLLFFVVIIFQGMSA G+A++
Subjt:  TLAE-KKNVKEKIKKNGKVHKPPRPPRGPSLDAADRIFVKEITELAVKKRATVERIKALKKMKAEKTSSFNSSLPALFITLLFFVVIIFQGMSAKGNAMV

Query:  SESPVPSIGRSAGLIPLQHSSKPSPEVNEPQSRVFNFAGKQTSDPATVREASLVEDLKN
         ESPVPS  RSAGLIPLQHSSK  P VN PQS + +FAG          EAS VEDLKN
Subjt:  SESPVPSIGRSAGLIPLQHSSKPSPEVNEPQSRVFNFAGKQTSDPATVREASLVEDLKN

XP_004147994.1 uncharacterized protein LOC101214824 [Cucumis sativus]4.1e-10482.2Show/hide
Query:  MSGLDCMATKDREFEIDLEGGGNTSEDDLSSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASSSNSTKLVKLGADENVELLMDKGSDGEKRRELG
        M  +DCMATKDREFEIDLEGGGNTSEDDLSSETDS SKPHARK F RLRSGFL +D S++R   FASSSNSTKLVKLG DENVELLM+  SDGEKRRE G
Subjt:  MSGLDCMATKDREFEIDLEGGGNTSEDDLSSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASSSNSTKLVKLGADENVELLMDKGSDGEKRRELG

Query:  TLAEKKNVKEKIKKNGKVHKPPRPPRGPSLDAADRIFVKEITELAVKKRATVERIKALKKMKAEKTSSFNSSLPALFITLLFFVVIIFQGMSAKGNAM--
          AEK NVK KIKKNGKVHKPPRPPRGPSLDAADRIFV+EI ELAVKKRATVERIKALKKMKAEKTSSFNSSLPALFITLLFFVVIIFQGMSAKG+ M  
Subjt:  TLAEKKNVKEKIKKNGKVHKPPRPPRGPSLDAADRIFVKEITELAVKKRATVERIKALKKMKAEKTSSFNSSLPALFITLLFFVVIIFQGMSAKGNAM--

Query:  VSESPVPSIGRSAGLIPLQHS--SKPSPEVNEPQSRVFNFAGKQTSDPAT-VREASLVEDLKNH
        VS+SP PS+G SAGLI +QHS   + SP VNEP+S + NFAGKQTSDPAT VREASLVE+LKNH
Subjt:  VSESPVPSIGRSAGLIPLQHS--SKPSPEVNEPQSRVFNFAGKQTSDPAT-VREASLVEDLKNH

XP_022948547.1 uncharacterized protein LOC111452194 [Cucurbita moschata]4.2e-10180.31Show/hide
Query:  MSGLDCMATKDREFEIDLEGGGNTSEDDLSSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASSSNSTKLVKLGADENVELLMDKGSDGEKRRELG
        MSGLDCMAT+DREFEIDLEGG  TSEDDLSSE+DS SKPH RKA SRLRSGFLC DGSINRGCSFASSSNSTKL KLG DENVELL+DK  DGEKRRELG
Subjt:  MSGLDCMATKDREFEIDLEGGGNTSEDDLSSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASSSNSTKLVKLGADENVELLMDKGSDGEKRRELG

Query:  TLAE-KKNVKEKIKKNGKVHKPPRPPRGPSLDAADRIFVKEITELAVKKRATVERIKALKKMKAEKTSSFNSSLPALFITLLFFVVIIFQGMSAKGNAMV
         LAE KKN+KE IK NGK+HKPPRPPR PSLDAADRIFVKE+T+LAVKKRA VERIKALKK KAEKTSSFNS+LPALFITLLFFVVIIFQGMSA G+A++
Subjt:  TLAE-KKNVKEKIKKNGKVHKPPRPPRGPSLDAADRIFVKEITELAVKKRATVERIKALKKMKAEKTSSFNSSLPALFITLLFFVVIIFQGMSAKGNAMV

Query:  SESPVPSIGRSAGLIPLQHSSKPSPEVNEPQSRVFNFAGKQTSDPATVREASLVEDLKN
         ESPVPS  RSAGLIPLQHSSK  P VN PQS + +FAG          EAS VEDLKN
Subjt:  SESPVPSIGRSAGLIPLQHSSKPSPEVNEPQSRVFNFAGKQTSDPATVREASLVEDLKN

XP_038903260.1 uncharacterized protein LOC120089897 [Benincasa hispida]1.5e-10382.38Show/hide
Query:  MSGLDCMATKDREFEIDLEGGGNTSEDDLSSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASSSNSTKLVKLGADENVELLMDKGSDGEKRRELG
        M GLDCMATKDREFEIDLEGGGNTSEDDLSSETDS SK HARK F+RLRSGFL +DGSI+R  SFASSSNSTKLVKLG DENVEL MD  SDGEKRRE G
Subjt:  MSGLDCMATKDREFEIDLEGGGNTSEDDLSSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASSSNSTKLVKLGADENVELLMDKGSDGEKRRELG

Query:  TLAEKKNVKEKIKKNGKVHKPPRPPRGPSLDAADRIFVKEITELAVKKRATVERIKALKKMKAEKTSSFNSSLPALFITLLFFVVIIFQGMSAKGNAMV-
          AEKKNVK KI   GKVHKPPRPPRGPSLDAADRIFVKE+TELAVKKRATVERIKALKKMKAEK SSFNSSLPALFITLLFFVVIIFQGMSAKG+ MV 
Subjt:  TLAEKKNVKEKIKKNGKVHKPPRPPRGPSLDAADRIFVKEITELAVKKRATVERIKALKKMKAEKTSSFNSSLPALFITLLFFVVIIFQGMSAKGNAMV-

Query:  -SESPVPSIGRSAGLIPLQHSSKPSPEVNEPQSRVFNFAGKQTSDPATVREASLVEDLKNH
         S+SP PS+G SAGLI +QHS + SP+VNEPQS + NFAGKQTSDP  +REA  VE+LKNH
Subjt:  -SESPVPSIGRSAGLIPLQHSSKPSPEVNEPQSRVFNFAGKQTSDPATVREASLVEDLKNH

TrEMBL top hitse value%identityAlignment
A0A0A0LB07 Uncharacterized protein2.0e-10482.2Show/hide
Query:  MSGLDCMATKDREFEIDLEGGGNTSEDDLSSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASSSNSTKLVKLGADENVELLMDKGSDGEKRRELG
        M  +DCMATKDREFEIDLEGGGNTSEDDLSSETDS SKPHARK F RLRSGFL +D S++R   FASSSNSTKLVKLG DENVELLM+  SDGEKRRE G
Subjt:  MSGLDCMATKDREFEIDLEGGGNTSEDDLSSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASSSNSTKLVKLGADENVELLMDKGSDGEKRRELG

Query:  TLAEKKNVKEKIKKNGKVHKPPRPPRGPSLDAADRIFVKEITELAVKKRATVERIKALKKMKAEKTSSFNSSLPALFITLLFFVVIIFQGMSAKGNAM--
          AEK NVK KIKKNGKVHKPPRPPRGPSLDAADRIFV+EI ELAVKKRATVERIKALKKMKAEKTSSFNSSLPALFITLLFFVVIIFQGMSAKG+ M  
Subjt:  TLAEKKNVKEKIKKNGKVHKPPRPPRGPSLDAADRIFVKEITELAVKKRATVERIKALKKMKAEKTSSFNSSLPALFITLLFFVVIIFQGMSAKGNAM--

Query:  VSESPVPSIGRSAGLIPLQHS--SKPSPEVNEPQSRVFNFAGKQTSDPAT-VREASLVEDLKNH
        VS+SP PS+G SAGLI +QHS   + SP VNEP+S + NFAGKQTSDPAT VREASLVE+LKNH
Subjt:  VSESPVPSIGRSAGLIPLQHS--SKPSPEVNEPQSRVFNFAGKQTSDPAT-VREASLVEDLKNH

A0A1S3BHI9 uncharacterized protein LOC1034899092.3e-10079.55Show/hide
Query:  MSGLDCMATKDREFEIDLEGGGNTSEDDLSSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASSSNSTKLVKLGADENVELLMDKGSDGEKRRELG
        M  LDCMATKDREFEIDLEGGGNTSEDDLSSETDS SK HARK F RLRSGFL +D S++R  +FASSSNSTKLVKLG D+NVELLM+  SDGEKRRE G
Subjt:  MSGLDCMATKDREFEIDLEGGGNTSEDDLSSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASSSNSTKLVKLGADENVELLMDKGSDGEKRRELG

Query:  TLAEKKNVKEKIKKNGKVHKPPRPPRGPSLDAADRIFVKEITELAVKKRATVERIKALKKMKAEKTSSFNSSLPALFITLLFFVVIIFQGMSAKGN--AM
          AEK NVK KIKKNGKVHKPPRPPRGPSLDAADRIFV+EI ELAVKKRATVERIKALKKMKAEK SSFNSSLPALFITLLFFVVIIFQGMSAKG+   M
Subjt:  TLAEKKNVKEKIKKNGKVHKPPRPPRGPSLDAADRIFVKEITELAVKKRATVERIKALKKMKAEKTSSFNSSLPALFITLLFFVVIIFQGMSAKGN--AM

Query:  VSESPVPSIGRSAGLIPLQHS--SKPSPEVNEPQSRVFNFAGKQTSDP-ATVREASLVEDLKNH
        VS+SP PS+G  AGLI +QHS   + SP VNEP+S + NFAGKQTSDP A V+E S VE+LKNH
Subjt:  VSESPVPSIGRSAGLIPLQHS--SKPSPEVNEPQSRVFNFAGKQTSDP-ATVREASLVEDLKNH

A0A5D3DAG0 Putative transmembrane protein2.3e-10079.55Show/hide
Query:  MSGLDCMATKDREFEIDLEGGGNTSEDDLSSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASSSNSTKLVKLGADENVELLMDKGSDGEKRRELG
        M  LDCMATKDREFEIDLEGGGNTSEDDLSSETDS SK HARK F RLRSGFL +D S++R  +FASSSNSTKLVKLG D+NVELLM+  SDGEKRRE G
Subjt:  MSGLDCMATKDREFEIDLEGGGNTSEDDLSSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASSSNSTKLVKLGADENVELLMDKGSDGEKRRELG

Query:  TLAEKKNVKEKIKKNGKVHKPPRPPRGPSLDAADRIFVKEITELAVKKRATVERIKALKKMKAEKTSSFNSSLPALFITLLFFVVIIFQGMSAKGN--AM
          AEK NVK KIKKNGKVHKPPRPPRGPSLDAADRIFV+EI ELAVKKRATVERIKALKKMKAEK SSFNSSLPALFITLLFFVVIIFQGMSAKG+   M
Subjt:  TLAEKKNVKEKIKKNGKVHKPPRPPRGPSLDAADRIFVKEITELAVKKRATVERIKALKKMKAEKTSSFNSSLPALFITLLFFVVIIFQGMSAKGN--AM

Query:  VSESPVPSIGRSAGLIPLQHS--SKPSPEVNEPQSRVFNFAGKQTSDP-ATVREASLVEDLKNH
        VS+SP PS+G  AGLI +QHS   + SP VNEP+S + NFAGKQTSDP A V+E S VE+LKNH
Subjt:  VSESPVPSIGRSAGLIPLQHS--SKPSPEVNEPQSRVFNFAGKQTSDP-ATVREASLVEDLKNH

A0A6J1G9J4 uncharacterized protein LOC1114521942.0e-10180.31Show/hide
Query:  MSGLDCMATKDREFEIDLEGGGNTSEDDLSSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASSSNSTKLVKLGADENVELLMDKGSDGEKRRELG
        MSGLDCMAT+DREFEIDLEGG  TSEDDLSSE+DS SKPH RKA SRLRSGFLC DGSINRGCSFASSSNSTKL KLG DENVELL+DK  DGEKRRELG
Subjt:  MSGLDCMATKDREFEIDLEGGGNTSEDDLSSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASSSNSTKLVKLGADENVELLMDKGSDGEKRRELG

Query:  TLAE-KKNVKEKIKKNGKVHKPPRPPRGPSLDAADRIFVKEITELAVKKRATVERIKALKKMKAEKTSSFNSSLPALFITLLFFVVIIFQGMSAKGNAMV
         LAE KKN+KE IK NGK+HKPPRPPR PSLDAADRIFVKE+T+LAVKKRA VERIKALKK KAEKTSSFNS+LPALFITLLFFVVIIFQGMSA G+A++
Subjt:  TLAE-KKNVKEKIKKNGKVHKPPRPPRGPSLDAADRIFVKEITELAVKKRATVERIKALKKMKAEKTSSFNSSLPALFITLLFFVVIIFQGMSAKGNAMV

Query:  SESPVPSIGRSAGLIPLQHSSKPSPEVNEPQSRVFNFAGKQTSDPATVREASLVEDLKN
         ESPVPS  RSAGLIPLQHSSK  P VN PQS + +FAG          EAS VEDLKN
Subjt:  SESPVPSIGRSAGLIPLQHSSKPSPEVNEPQSRVFNFAGKQTSDPATVREASLVEDLKN

A0A6J1KGV9 uncharacterized protein LOC1114930998.6e-10079.62Show/hide
Query:  MSGLDCMATKDREFEIDLEGGGNTSEDDLSSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASSSNSTKLVKLGADENVELLMDKGSDGEKRRELG
        MSGLDCMAT+DREFEIDLEGG  TSE DLSSE+DS SKPH RKA SRLRSGFLC +GSINRGCSF SSSNSTKL KLG DENVELL+DK  DGEKRRELG
Subjt:  MSGLDCMATKDREFEIDLEGGGNTSEDDLSSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASSSNSTKLVKLGADENVELLMDKGSDGEKRRELG

Query:  TLAE-KKNVKEKIKKNGKVHKPPRPPRGPSLDAADRIFVKEITELAVKKRATVERIKALKKMKAEKTSSFNSSLPALFITLLFFVVIIFQGMSAKGNAMV
         LAE KKNVKE IK NGK+HKPPRPPRGPSLDAADRIFVKE+T+LAVKKRA VERIKALKKMKAEKTSSFNS+LPALFITLLFFVVII QGMSA G+A++
Subjt:  TLAE-KKNVKEKIKKNGKVHKPPRPPRGPSLDAADRIFVKEITELAVKKRATVERIKALKKMKAEKTSSFNSSLPALFITLLFFVVIIFQGMSAKGNAMV

Query:  SESPVPSIGRSAGLIPLQHSSKPSPEVNEPQSRVFNFAGKQTSDPATVREASLVEDLKNH
         ESPVPS  RSAGLIPLQHSSK  P VN  QS + +FAG          EAS VEDLKNH
Subjt:  SESPVPSIGRSAGLIPLQHSSKPSPEVNEPQSRVFNFAGKQTSDPATVREASLVEDLKNH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G02380.1 unknown protein5.2e-1731.11Show/hide
Query:  LDCMATKDREFEIDLEGGGNTSEDDLSSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASSSNSTKLVKLGADENVELLMDKGSDGEKRRELGTLA
        +D M + +++ E+D+E G +    + +S+T S                    +G  +   +F  S       K+  D +  L+ D+       + L  L+
Subjt:  LDCMATKDREFEIDLEGGGNTSEDDLSSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASSSNSTKLVKLGADENVELLMDKGSDGEKRRELGTLA

Query:  EKKNVKEKIKKNGKVHKPPRPPRGPSLDAADRIFVKEITELAVKKRATVERI-KALKKMKAEKTSSFNSSLP--ALFITLLFFVVIIFQGMSAKGNAMVS
        EKK    K KK+ K  KPPRPP+GPSL   DR  +++I ELA++KRA +ER+ K+LK++KA KTS  +  +   ++ IT +FF  ++FQG S   ++M S
Subjt:  EKKNVKEKIKKNGKVHKPPRPPRGPSLDAADRIFVKEITELAVKKRATVERI-KALKKMKAEKTSSFNSSLP--ALFITLLFFVVIIFQGMSAKGNAMVS

Query:  E-SPVPSIGRSAGLIPLQHSSKPSP
        + SP P++  +  +I +Q  +  +P
Subjt:  E-SPVPSIGRSAGLIPLQHSSKPSP

AT3G17120.1 unknown protein6.2e-1846.22Show/hide
Query:  KEKIKKNGKVHKPPRPPRGPSLDAADRIFVKEITELAVKKRATVERIKALKKMKAEKTSSFNSSLP---ALFITLLFFVVIIFQGMSAKGNAMVSESPVP
        KEK KK+    KPPRPPRGPSLDAAD+  ++EI ELA+ KRA +ER++ALKK +A K +S  SSL    A   T +FF V++FQG+S +      +S + 
Subjt:  KEKIKKNGKVHKPPRPPRGPSLDAADRIFVKEITELAVKKRATVERIKALKKMKAEKTSSFNSSLP---ALFITLLFFVVIIFQGMSAKGNAMVSESPVP

Query:  SIGR-SAGLIPLQHSSKPS
          G+ + G + +Q++  PS
Subjt:  SIGR-SAGLIPLQHSSKPS

AT3G17120.2 unknown protein6.2e-1846.22Show/hide
Query:  KEKIKKNGKVHKPPRPPRGPSLDAADRIFVKEITELAVKKRATVERIKALKKMKAEKTSSFNSSLP---ALFITLLFFVVIIFQGMSAKGNAMVSESPVP
        KEK KK+    KPPRPPRGPSLDAAD+  ++EI ELA+ KRA +ER++ALKK +A K +S  SSL    A   T +FF V++FQG+S +      +S + 
Subjt:  KEKIKKNGKVHKPPRPPRGPSLDAADRIFVKEITELAVKKRATVERIKALKKMKAEKTSSFNSSLP---ALFITLLFFVVIIFQGMSAKGNAMVSESPVP

Query:  SIGR-SAGLIPLQHSSKPS
          G+ + G + +Q++  PS
Subjt:  SIGR-SAGLIPLQHSSKPS

AT4G01960.1 unknown protein2.1e-2131.2Show/hide
Query:  KDREFEIDLEGGGNTSEDDLSSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASSSNSTKLVKLGADENVELLMDKGSDGEKRRELGTLAEKKNVK
        ++ + ++D+E G      ++++   S ++  +    + + SG L  DGS                 +  AD+ V+ LM +G   E+  +   L++ K   
Subjt:  KDREFEIDLEGGGNTSEDDLSSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASSSNSTKLVKLGADENVELLMDKGSDGEKRRELGTLAEKKNVK

Query:  EKIKKNGKVHKPPRPPRGPSLDAADRIFVKEITELAVKKRATVERIKALKKMKAEKTSSFNSSLPALFITLLFFVVIIFQGMSAKGNAMVSE-SPVPSIG
        +K KK  K  KPPRPP+GP L A D+  ++EITELA++KRA +ER+K L+++KA K+SS  SS+ A+ +T++FFV +IFQG      ++ S+ SP P+  
Subjt:  EKIKKNGKVHKPPRPPRGPSLDAADRIFVKEITELAVKKRATVERIKALKKMKAEKTSSFNSSLPALFITLLFFVVIIFQGMSAKGNAMVSE-SPVPSIG

Query:  RSAGLIPLQHSSKPSPEVNEPQSRVFNFAGKQTS
         +  ++ +Q  ++ +P      S   +F  K+ S
Subjt:  RSAGLIPLQHSSKPSPEVNEPQSRVFNFAGKQTS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTGGTTTAGATTGTATGGCTACAAAAGATAGAGAGTTTGAAATAGATCTTGAAGGTGGTGGTAATACTAGCGAAGATGATTTGAGCAGTGAAACTGACTCAATATC
AAAACCACATGCTAGAAAAGCCTTCAGCAGGCTTCGAAGTGGGTTTCTATGCGCTGATGGATCTATAAATCGAGGTTGTAGCTTTGCCTCCAGTAGTAATTCTACCAAGC
TCGTTAAGCTTGGTGCTGATGAGAATGTGGAGTTGTTGATGGACAAGGGTTCAGATGGAGAAAAGAGAAGAGAACTTGGGACTCTTGCAGAGAAGAAGAATGTTAAAGAG
AAGATTAAGAAGAATGGAAAGGTTCACAAGCCACCACGACCTCCAAGGGGTCCATCGTTGGATGCTGCTGACAGAATCTTTGTTAAAGAGATCACGGAGTTGGCAGTGAA
AAAGCGTGCCACGGTCGAGCGAATAAAAGCCTTGAAGAAGATGAAAGCAGAAAAAACATCTTCATTCAACAGCAGCTTACCTGCCTTATTTATAACATTGCTTTTCTTTG
TAGTCATTATCTTTCAAGGAATGAGTGCTAAAGGTAATGCAATGGTGTCCGAGTCTCCTGTGCCTTCCATTGGCAGAAGCGCAGGCTTGATACCTCTTCAGCATTCGTCT
AAACCTTCACCTGAAGTTAACGAACCTCAATCCCGCGTTTTCAATTTTGCAGGAAAGCAAACTTCTGACCCTGCTACCGTACGAGAAGCGAGCTTGGTGGAAGATTTGAA
GAACCATTGA
mRNA sequenceShow/hide mRNA sequence
TTTTAGAATATACAAATGCTCTTTAAGCGCCGCACCTCACGATTTCAGATCTGAAATTTCTCTTCATTTCGATTTTGTCCCAAATTTCTTTTCGCATCCCACGATCTGGG
TTCTCTCTCATCGTTTTCGTTCTCCAAATCTTTCATATCCTCAATTTTTTTTTTATTTTTATTTATTCCCCAGATCTCTTTCTGTTGTTCGTGTGTCAGTTGGATCTCTC
CATGGACTCTACCTTGAAAATGGCGGAAATGGCTTCTACTTATTGATTCACTGTTTGTCGGTTCACCATAGTGTTAAGAAGATGAGAAGAAAAGAAGGCCCCTTCGATTC
GGGATCCTGGTAGAATGAATGCCCTGCCGAAATGGGAAGTAAAACCTCTTTAGATGAGTGGTTTAGATTGTATGGCTACAAAAGATAGAGAGTTTGAAATAGATCTTGAA
GGTGGTGGTAATACTAGCGAAGATGATTTGAGCAGTGAAACTGACTCAATATCAAAACCACATGCTAGAAAAGCCTTCAGCAGGCTTCGAAGTGGGTTTCTATGCGCTGA
TGGATCTATAAATCGAGGTTGTAGCTTTGCCTCCAGTAGTAATTCTACCAAGCTCGTTAAGCTTGGTGCTGATGAGAATGTGGAGTTGTTGATGGACAAGGGTTCAGATG
GAGAAAAGAGAAGAGAACTTGGGACTCTTGCAGAGAAGAAGAATGTTAAAGAGAAGATTAAGAAGAATGGAAAGGTTCACAAGCCACCACGACCTCCAAGGGGTCCATCG
TTGGATGCTGCTGACAGAATCTTTGTTAAAGAGATCACGGAGTTGGCAGTGAAAAAGCGTGCCACGGTCGAGCGAATAAAAGCCTTGAAGAAGATGAAAGCAGAAAAAAC
ATCTTCATTCAACAGCAGCTTACCTGCCTTATTTATAACATTGCTTTTCTTTGTAGTCATTATCTTTCAAGGAATGAGTGCTAAAGGTAATGCAATGGTGTCCGAGTCTC
CTGTGCCTTCCATTGGCAGAAGCGCAGGCTTGATACCTCTTCAGCATTCGTCTAAACCTTCACCTGAAGTTAACGAACCTCAATCCCGCGTTTTCAATTTTGCAGGAAAG
CAAACTTCTGACCCTGCTACCGTACGAGAAGCGAGCTTGGTGGAAGATTTGAAGAACCATTGAAGTTTTGTGTCAATAATCTGGTAAAACATCGCCAGCTCCTTTAGACT
GAAATTTTCATATCTTATAACTTGATTGAAGGCTTCAGATCATGTTGGTATCTGTAAATATGCAAAGGCCTCAAACCTTCCAATGGTCTTTTGCTAGGCTTTTCTTGTAC
AGAGAGAGTTCACATTTGATTGATCCATATTAGTTTTGTTATAATCAAATTGGCATGGTCTCACCATAACCCAGAACTTTGAATTTAGTCAAATGTACTTCAAAAAATTG
GTGGTACTTACAAGATTGATATAAATATATTATGAAGGGGCAATTTCATCTTTTTTTCTCAAGTGTAATGCCATCTGAAACC
Protein sequenceShow/hide protein sequence
MSGLDCMATKDREFEIDLEGGGNTSEDDLSSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASSSNSTKLVKLGADENVELLMDKGSDGEKRRELGTLAEKKNVKE
KIKKNGKVHKPPRPPRGPSLDAADRIFVKEITELAVKKRATVERIKALKKMKAEKTSSFNSSLPALFITLLFFVVIIFQGMSAKGNAMVSESPVPSIGRSAGLIPLQHSS
KPSPEVNEPQSRVFNFAGKQTSDPATVREASLVEDLKNH