; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0012788 (gene) of Snake gourd v1 genome

Gene IDTan0012788
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionNuclear transport factor 2 family protein
Genome locationLG06:62720904..62724403
RNA-Seq ExpressionTan0012788
SyntenyTan0012788
Gene Ontology termsNA
InterPro domainsIPR018790 - Protein of unknown function DUF2358
IPR032710 - NTF2-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575195.1 hypothetical protein SDJN03_25834, partial [Cucurbita argyrosperma subsp. sororia]1.2e-11587.71Show/hide
Query:  MATIFFLQSNTSLQTSLNALRSNRPLKTYRIRCQGDNPSTDSTKNQESKPENAVLKVAWYGSELLGIAASFLRPPSDAGAPVRAQEELTRDVFGAIRRPV
        MATIF  QS TSLQTSLN +R NRPLK   IRCQGDNP+TDS K +ESKPENAVLKVAWYGSELLGIAASFLRPP+D   PVRAQ ELTRDV GAIRRPV
Subjt:  MATIFFLQSNTSLQTSLNALRSNRPLKTYRIRCQGDNPSTDSTKNQESKPENAVLKVAWYGSELLGIAASFLRPPSDAGAPVRAQEELTRDVFGAIRRPV

Query:  IVETIKEDFGRSYFVTGNFTVDAYEEQCEFADPAGSFKGLHRFKRNCTNFGSLVDKSNMKLTKWEDFEDKAIGHWKFSCILSFPWRPILSATGYTEYYFD
        IVETIK+DF RSYFVTGN TV+AYEEQCEFADPAGSFKGL RFKRNCTNFGSLVDK NMKLTKWEDFEDK IGHWKFSCILSFPWRPILSATGYTEYYFD
Subjt:  IVETIKEDFGRSYFVTGNFTVDAYEEQCEFADPAGSFKGLHRFKRNCTNFGSLVDKSNMKLTKWEDFEDKAIGHWKFSCILSFPWRPILSATGYTEYYFD

Query:  AGSGKVCRHVEHWNVPKMALLKQILRPTRDWLWFKK
        AGSGKV RHVEHWNVPKMALL QILRPTR WLWFKK
Subjt:  AGSGKVCRHVEHWNVPKMALLKQILRPTRDWLWFKK

XP_022959200.1 uncharacterized protein LOC111460260 [Cucurbita moschata]3.0e-11787.45Show/hide
Query:  MATIFFLQSNTSLQTSLNALRSNRPLKTYRIRCQGDNPSTDSTKNQESKPENAVLKVAWYGSELLGIAASFLRPPSDAGAPVRAQEELTRDVFGAIRRPV
        MATIF  QS TSLQTSLNA+R NRPLK  +IRCQG+NP+T S KN+ESKPENAVLKVAWYGSELLGIAASFLRPP+D   PVRAQ EL RDV GAIRRPV
Subjt:  MATIFFLQSNTSLQTSLNALRSNRPLKTYRIRCQGDNPSTDSTKNQESKPENAVLKVAWYGSELLGIAASFLRPPSDAGAPVRAQEELTRDVFGAIRRPV

Query:  IVETIKEDFGRSYFVTGNFTVDAYEEQCEFADPAGSFKGLHRFKRNCTNFGSLVDKSNMKLTKWEDFEDKAIGHWKFSCILSFPWRPILSATGYTEYYFD
        IVETIK+DF RSYFVTGN TV+AYEEQCEFADPAGSFKGL RFKRNCTNFGSLVDKSNMKLTKWEDFEDK IGHWKFSCILSFPWRPILSATGYTEYYFD
Subjt:  IVETIKEDFGRSYFVTGNFTVDAYEEQCEFADPAGSFKGLHRFKRNCTNFGSLVDKSNMKLTKWEDFEDKAIGHWKFSCILSFPWRPILSATGYTEYYFD

Query:  AGSGKVCRHVEHWNVPKMALLKQILRPTRDWLWFKKAGA
        AGSGKV RHVEHWNVPKMALL QILRPTR WLWFKK GA
Subjt:  AGSGKVCRHVEHWNVPKMALLKQILRPTRDWLWFKKAGA

XP_023006651.1 uncharacterized protein LOC111499312 [Cucurbita maxima]1.7e-11787.87Show/hide
Query:  MATIFFLQSNTSLQTSLNALRSNRPLKTYRIRCQGDNPSTDSTKNQESKPENAVLKVAWYGSELLGIAASFLRPPSDAGAPVRAQEELTRDVFGAIRRPV
        MATIF  QS TSL+TSLNA+R NRPLK   IRCQGDNP+TDS KN+ESKPENAVLKVAWYGSELLGIAASFLRPP+D   PVRAQ ELTRDV GAIRRPV
Subjt:  MATIFFLQSNTSLQTSLNALRSNRPLKTYRIRCQGDNPSTDSTKNQESKPENAVLKVAWYGSELLGIAASFLRPPSDAGAPVRAQEELTRDVFGAIRRPV

Query:  IVETIKEDFGRSYFVTGNFTVDAYEEQCEFADPAGSFKGLHRFKRNCTNFGSLVDKSNMKLTKWEDFEDKAIGHWKFSCILSFPWRPILSATGYTEYYFD
        IVETIK+DF RSYFVTGN TV+AYEEQCEFADPAGSFKGL RFKRNCTNFGSLVDKSNMKLTKW DFEDK IGHWKFSCILSFPWRPILSATGYTEYYFD
Subjt:  IVETIKEDFGRSYFVTGNFTVDAYEEQCEFADPAGSFKGLHRFKRNCTNFGSLVDKSNMKLTKWEDFEDKAIGHWKFSCILSFPWRPILSATGYTEYYFD

Query:  AGSGKVCRHVEHWNVPKMALLKQILRPTRDWLWFKKAGA
        AGSGKV RHVEHWNVPKMALL QILRPTR WLWFKK GA
Subjt:  AGSGKVCRHVEHWNVPKMALLKQILRPTRDWLWFKKAGA

XP_023547468.1 uncharacterized protein LOC111806404 [Cucurbita pepo subsp. pepo]1.2e-11888.7Show/hide
Query:  MATIFFLQSNTSLQTSLNALRSNRPLKTYRIRCQGDNPSTDSTKNQESKPENAVLKVAWYGSELLGIAASFLRPPSDAGAPVRAQEELTRDVFGAIRRPV
        MATIF  QS TSLQTSLNA+R NRPLK   IRCQGDNP+TDS KNQESKPENAVLKVAWYGSELLGIAASFLRPP+D   PVRAQ EL RDV GAIRRPV
Subjt:  MATIFFLQSNTSLQTSLNALRSNRPLKTYRIRCQGDNPSTDSTKNQESKPENAVLKVAWYGSELLGIAASFLRPPSDAGAPVRAQEELTRDVFGAIRRPV

Query:  IVETIKEDFGRSYFVTGNFTVDAYEEQCEFADPAGSFKGLHRFKRNCTNFGSLVDKSNMKLTKWEDFEDKAIGHWKFSCILSFPWRPILSATGYTEYYFD
        IVETIK+DF RSYFVTGN TV+AYEEQCEFADPAGSFKGL RFKRNCTNFGSLVDKSNMKLTKWEDFEDK IGHWKFSCILSFPWRPILSATGYTEYYFD
Subjt:  IVETIKEDFGRSYFVTGNFTVDAYEEQCEFADPAGSFKGLHRFKRNCTNFGSLVDKSNMKLTKWEDFEDKAIGHWKFSCILSFPWRPILSATGYTEYYFD

Query:  AGSGKVCRHVEHWNVPKMALLKQILRPTRDWLWFKKAGA
        AGSGKV RHVEHWNVPKMALL QILRPTR WLWFKK GA
Subjt:  AGSGKVCRHVEHWNVPKMALLKQILRPTRDWLWFKKAGA

XP_038906789.1 uncharacterized protein LOC120092709 isoform X1 [Benincasa hispida]7.8e-11886.55Show/hide
Query:  MATIFFLQSNTSLQTSLNALRSNRPLKTYRIRCQGDNPSTDSTKNQESKPENAVLKVAWYGSELLGIAASFLRPPSDAGAPVRAQEELTRDVFGAIRRPV
        MATIF +QS++SL+TSLN +R N PL+  RIRCQG+NP+TDS KNQESKPENAVLKVAWYGSELLGIAASFLRPPSD   P+RAQ EL RDV GAI RP+
Subjt:  MATIFFLQSNTSLQTSLNALRSNRPLKTYRIRCQGDNPSTDSTKNQESKPENAVLKVAWYGSELLGIAASFLRPPSDAGAPVRAQEELTRDVFGAIRRPV

Query:  IVETIKEDFGRSYFVTGNFTVDAYEEQCEFADPAGSFKGLHRFKRNCTNFGSLVDKSNMKLTKWEDFEDKAIGHWKFSCILSFPWRPILSATGYTEYYFD
        IVETIKEDFGRSYFVTGN T++AYEEQCEFADPAGSFKGL RFKRNCTNFGSLVDKSNMKLTKWEDFEDK IGHW+FSCILSFPWRPILSATGYTEYYFD
Subjt:  IVETIKEDFGRSYFVTGNFTVDAYEEQCEFADPAGSFKGLHRFKRNCTNFGSLVDKSNMKLTKWEDFEDKAIGHWKFSCILSFPWRPILSATGYTEYYFD

Query:  AGSGKVCRHVEHWNVPKMALLKQILRPTRDWLWFKKAG
        A SGKVCRHVEHWNVPKMALLKQILRPTR+WLWFKKAG
Subjt:  AGSGKVCRHVEHWNVPKMALLKQILRPTRDWLWFKKAG

TrEMBL top hitse value%identityAlignment
A0A1S3C8H3 uncharacterized protein LOC1034976903.7e-11384.45Show/hide
Query:  MATIFFLQSNTSLQTSLNALRSNRPLKTYRIRCQGDNPSTDSTKNQESKPENAVLKVAWYGSELLGIAASFLRPPSDAGAPVRAQEELTRDVFGAIRRPV
        MATI   Q ++SLQTSL ++R N  L+  RI C+G+NP+TDS  NQESKPENAVLKVAWYGSELLGIAASFLRPPSD   PVRAQ ELT DV GAI RP+
Subjt:  MATIFFLQSNTSLQTSLNALRSNRPLKTYRIRCQGDNPSTDSTKNQESKPENAVLKVAWYGSELLGIAASFLRPPSDAGAPVRAQEELTRDVFGAIRRPV

Query:  IVETIKEDFGRSYFVTGNFTVDAYEEQCEFADPAGSFKGLHRFKRNCTNFGSLVDKSNMKLTKWEDFEDKAIGHWKFSCILSFPWRPILSATGYTEYYFD
        IVETIKEDF RSYFVTGN T++AYEEQCEFADPAGSFKGL RFKRNCTNFGSLVDKSNMKLTKWEDFEDK IGHWKFSCILSFPWRPILSATGYT+YYFD
Subjt:  IVETIKEDFGRSYFVTGNFTVDAYEEQCEFADPAGSFKGLHRFKRNCTNFGSLVDKSNMKLTKWEDFEDKAIGHWKFSCILSFPWRPILSATGYTEYYFD

Query:  AGSGKVCRHVEHWNVPKMALLKQILRPTRDWLWFKKAG
        A SGKVCRHVEHWNVPKMALLKQILRPTRDWLWFKKAG
Subjt:  AGSGKVCRHVEHWNVPKMALLKQILRPTRDWLWFKKAG

A0A5D3BY85 Nuclear transport factor 2 family protein3.7e-11384.45Show/hide
Query:  MATIFFLQSNTSLQTSLNALRSNRPLKTYRIRCQGDNPSTDSTKNQESKPENAVLKVAWYGSELLGIAASFLRPPSDAGAPVRAQEELTRDVFGAIRRPV
        MATI   Q ++SLQTSL ++R N  L+  RI C+G+NP+TDS  NQESKPENAVLKVAWYGSELLGIAASFLRPPSD   PVRAQ ELT DV GAI RP+
Subjt:  MATIFFLQSNTSLQTSLNALRSNRPLKTYRIRCQGDNPSTDSTKNQESKPENAVLKVAWYGSELLGIAASFLRPPSDAGAPVRAQEELTRDVFGAIRRPV

Query:  IVETIKEDFGRSYFVTGNFTVDAYEEQCEFADPAGSFKGLHRFKRNCTNFGSLVDKSNMKLTKWEDFEDKAIGHWKFSCILSFPWRPILSATGYTEYYFD
        IVETIKEDF RSYFVTGN T++AYEEQCEFADPAGSFKGL RFKRNCTNFGSLVDKSNMKLTKWEDFEDK IGHWKFSCILSFPWRPILSATGYT+YYFD
Subjt:  IVETIKEDFGRSYFVTGNFTVDAYEEQCEFADPAGSFKGLHRFKRNCTNFGSLVDKSNMKLTKWEDFEDKAIGHWKFSCILSFPWRPILSATGYTEYYFD

Query:  AGSGKVCRHVEHWNVPKMALLKQILRPTRDWLWFKKAG
        A SGKVCRHVEHWNVPKMALLKQILRPTRDWLWFKKAG
Subjt:  AGSGKVCRHVEHWNVPKMALLKQILRPTRDWLWFKKAG

A0A6J1CZI8 uncharacterized protein LOC111016220 isoform X19.3e-10981.89Show/hide
Query:  MATIFFLQS--NTSLQTS-LNALRSNRPLKTYRIRCQGD--NPSTDSTKNQESKPENAVLKVAWYGSELLGIAASFLRPPSDAGAPVRAQEELTRDVFGA
        MA I   QS   T LQTS +N LR N PLK  RIRC+G+  NP+TDS KN+ES+PENA+LKVAWYGSELLGIAASFLR PSDA APVRA  +L  DV GA
Subjt:  MATIFFLQS--NTSLQTS-LNALRSNRPLKTYRIRCQGD--NPSTDSTKNQESKPENAVLKVAWYGSELLGIAASFLRPPSDAGAPVRAQEELTRDVFGA

Query:  IRRPVIVETIKEDFGRSYFVTGNFTVDAYEEQCEFADPAGSFKGLHRFKRNCTNFGSLVDKSNMKLTKWEDFEDKAIGHWKFSCILSFPWRPILSATGYT
        IRR +IVETIKEDFGRSYFVTGN T+DAYEE+CEFADPAGSFKGL RF+RNCTNFGSLV+ SNMKLTKWEDFEDK IGHWKFSC+LSFPWRPILSATGYT
Subjt:  IRRPVIVETIKEDFGRSYFVTGNFTVDAYEEQCEFADPAGSFKGLHRFKRNCTNFGSLVDKSNMKLTKWEDFEDKAIGHWKFSCILSFPWRPILSATGYT

Query:  EYYFDAGSGKVCRHVEHWNVPKMALLKQILRPTRDWLWFKKAG
        EYYFDAGSGKVCRHVEHWNVPKMALLKQILRPTR+WLWFKKAG
Subjt:  EYYFDAGSGKVCRHVEHWNVPKMALLKQILRPTRDWLWFKKAG

A0A6J1H5M4 uncharacterized protein LOC1114602601.4e-11787.45Show/hide
Query:  MATIFFLQSNTSLQTSLNALRSNRPLKTYRIRCQGDNPSTDSTKNQESKPENAVLKVAWYGSELLGIAASFLRPPSDAGAPVRAQEELTRDVFGAIRRPV
        MATIF  QS TSLQTSLNA+R NRPLK  +IRCQG+NP+T S KN+ESKPENAVLKVAWYGSELLGIAASFLRPP+D   PVRAQ EL RDV GAIRRPV
Subjt:  MATIFFLQSNTSLQTSLNALRSNRPLKTYRIRCQGDNPSTDSTKNQESKPENAVLKVAWYGSELLGIAASFLRPPSDAGAPVRAQEELTRDVFGAIRRPV

Query:  IVETIKEDFGRSYFVTGNFTVDAYEEQCEFADPAGSFKGLHRFKRNCTNFGSLVDKSNMKLTKWEDFEDKAIGHWKFSCILSFPWRPILSATGYTEYYFD
        IVETIK+DF RSYFVTGN TV+AYEEQCEFADPAGSFKGL RFKRNCTNFGSLVDKSNMKLTKWEDFEDK IGHWKFSCILSFPWRPILSATGYTEYYFD
Subjt:  IVETIKEDFGRSYFVTGNFTVDAYEEQCEFADPAGSFKGLHRFKRNCTNFGSLVDKSNMKLTKWEDFEDKAIGHWKFSCILSFPWRPILSATGYTEYYFD

Query:  AGSGKVCRHVEHWNVPKMALLKQILRPTRDWLWFKKAGA
        AGSGKV RHVEHWNVPKMALL QILRPTR WLWFKK GA
Subjt:  AGSGKVCRHVEHWNVPKMALLKQILRPTRDWLWFKKAGA

A0A6J1KYC4 uncharacterized protein LOC1114993128.4e-11887.87Show/hide
Query:  MATIFFLQSNTSLQTSLNALRSNRPLKTYRIRCQGDNPSTDSTKNQESKPENAVLKVAWYGSELLGIAASFLRPPSDAGAPVRAQEELTRDVFGAIRRPV
        MATIF  QS TSL+TSLNA+R NRPLK   IRCQGDNP+TDS KN+ESKPENAVLKVAWYGSELLGIAASFLRPP+D   PVRAQ ELTRDV GAIRRPV
Subjt:  MATIFFLQSNTSLQTSLNALRSNRPLKTYRIRCQGDNPSTDSTKNQESKPENAVLKVAWYGSELLGIAASFLRPPSDAGAPVRAQEELTRDVFGAIRRPV

Query:  IVETIKEDFGRSYFVTGNFTVDAYEEQCEFADPAGSFKGLHRFKRNCTNFGSLVDKSNMKLTKWEDFEDKAIGHWKFSCILSFPWRPILSATGYTEYYFD
        IVETIK+DF RSYFVTGN TV+AYEEQCEFADPAGSFKGL RFKRNCTNFGSLVDKSNMKLTKW DFEDK IGHWKFSCILSFPWRPILSATGYTEYYFD
Subjt:  IVETIKEDFGRSYFVTGNFTVDAYEEQCEFADPAGSFKGLHRFKRNCTNFGSLVDKSNMKLTKWEDFEDKAIGHWKFSCILSFPWRPILSATGYTEYYFD

Query:  AGSGKVCRHVEHWNVPKMALLKQILRPTRDWLWFKKAGA
        AGSGKV RHVEHWNVPKMALL QILRPTR WLWFKK GA
Subjt:  AGSGKVCRHVEHWNVPKMALLKQILRPTRDWLWFKKAGA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G46100.1 Nuclear transport factor 2 (NTF2) family protein4.8e-8164.11Show/hide
Query:  RSNRPLKTYRIRCQGDNPSTDSTKNQESKPENAVLKVAWYGSELLGIAASFLRPPSDAGAPVRAQEELTRDVFGAIRRPVIVETIKEDFGRSYFVTGNFT
        R+NR  +   + C+G NP+ +   ++  +P+N +LK+AWYGSELLGIAAS  R P    +P+    E+  D  G   R  +V++IK+DF RSYFVTGN T
Subjt:  RSNRPLKTYRIRCQGDNPSTDSTKNQESKPENAVLKVAWYGSELLGIAASFLRPPSDAGAPVRAQEELTRDVFGAIRRPVIVETIKEDFGRSYFVTGNFT

Query:  VDAYEEQCEFADPAGSFKGLHRFKRNCTNFGSLVDKSNMKLTKWEDFEDKAIGHWKFSCILSFPWRPILSATGYTEYYFDAGSGKVCRHVEHWNVPKMAL
         + YEE+CEFADPAGSFKGL RFKRNCTNFGSL++KSNMKL KWE+FEDK IGHWKFSC++SFPW+PILSATGYTEYYFD  SGK+CRHVEHWNVPK+AL
Subjt:  VDAYEEQCEFADPAGSFKGLHRFKRNCTNFGSLVDKSNMKLTKWEDFEDKAIGHWKFSCILSFPWRPILSATGYTEYYFDAGSGKVCRHVEHWNVPKMAL

Query:  LKQILRPTR
         KQ+LRP+R
Subjt:  LKQILRPTR

AT2G46100.2 Nuclear transport factor 2 (NTF2) family protein1.6e-4456.76Show/hide
Query:  RSNRPLKTYRIRCQGDNPSTDSTKNQESKPENAVLKVAWYGSELLGIAASFLRPPSDAGAPVRAQEELTRDVFGAIRRPVIVETIKEDFGRSYFVTGNFT
        R+NR  +   + C+G NP+ +   ++  +P+N +LK+AWYGSELLGIAAS  R P    +P+    E+  D  G   R  +V++IK+DF RSYFVTGN T
Subjt:  RSNRPLKTYRIRCQGDNPSTDSTKNQESKPENAVLKVAWYGSELLGIAASFLRPPSDAGAPVRAQEELTRDVFGAIRRPVIVETIKEDFGRSYFVTGNFT

Query:  VDAYEEQCEFADPAGSFKGLHRFKRNCTNFGSLVDKSNMKLTKWEDFE
         + YEE+CEFADPAGSFKGL RFKRNCTNFGSL++KSNMKL KWE+FE
Subjt:  VDAYEEQCEFADPAGSFKGLHRFKRNCTNFGSLVDKSNMKLTKWEDFE

AT3G04890.1 Uncharacterized conserved protein (DUF2358)1.7e-1732.43Show/hide
Query:  KPENAVLKVAWYG-SELLGIAASFLRPPSDAGAPVRAQEELTR----DVFGAIRRPVIVETIKEDFGRSYFVTGNFTVDAYEEQCEFADPAGSFKGLHRF
        K   AVLK A  G +E L + +      S A    R++ E+T     DV G +R          D+   YFVTG  T   Y + C F DP  SF+G   +
Subjt:  KPENAVLKVAWYG-SELLGIAASFLRPPSDAGAPVRAQEELTR----DVFGAIRRPVIVETIKEDFGRSYFVTGNFTVDAYEEQCEFADPAGSFKGLHRF

Query:  KRNCTNFGSLVDKSNMKLTKWEDFEDK----AIGHWKFSCILSFPWRPILSATGYTEYYFDAGSGKVCRHVEHWNVPKMALLKQI
        +RN       ++ ++++L   E  E       +  WK    L  PWRP++S  G T Y  D    K+ RHVE WNV  +  + QI
Subjt:  KRNCTNFGSLVDKSNMKLTKWEDFEDK----AIGHWKFSCILSFPWRPILSATGYTEYYFDAGSGKVCRHVEHWNVPKMALLKQI

AT3G04890.2 Uncharacterized conserved protein (DUF2358)2.1e-1231.06Show/hide
Query:  KPENAVLKVAWYG-SELLGIAASFLRPPSDAGAPVRAQEELTR----DVFGAIRRPVIVETIKEDFGRSYFVTGNFTVDAYEEQCEFADPAGSFKGLHRF
        K   AVLK A  G +E L + +      S A    R++ E+T     DV G +R          D+   YFVTG  T   Y + C F DP  SF+G   +
Subjt:  KPENAVLKVAWYG-SELLGIAASFLRPPSDAGAPVRAQEELTR----DVFGAIRRPVIVETIKEDFGRSYFVTGNFTVDAYEEQCEFADPAGSFKGLHRF

Query:  KRNCTNFGSLVDKSNMKLTKWEDFEDK----AIGHWKFSCILSFPWRPILSATGYTEYYFD
        +RN       ++ ++++L   E  E       +  WK    L  PWRP++S  G T Y  D
Subjt:  KRNCTNFGSLVDKSNMKLTKWEDFEDK----AIGHWKFSCILSFPWRPILSATGYTEYYFD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAACCATCTTCTTTTTACAATCAAATACATCTCTCCAAACCTCTCTCAACGCCTTGCGATCTAATCGCCCTCTCAAAACCTACAGAATCCGGTGCCAGGGGGACAA
TCCCTCCACCGATTCGACGAAGAATCAAGAATCCAAGCCCGAGAATGCGGTGCTCAAGGTCGCTTGGTATGGCTCCGAGCTTTTGGGAATCGCCGCTTCATTTCTCCGCC
CGCCGTCGGATGCCGGAGCTCCCGTTAGGGCTCAGGAGGAGCTTACGAGAGATGTGTTCGGTGCAATTCGTCGCCCTGTGATTGTGGAAACGATTAAGGAGGATTTTGGG
CGGTCGTATTTCGTCACAGGGAACTTTACTGTTGATGCTTATGAAGAGCAGTGTGAATTTGCTGATCCGGCAGGTTCTTTCAAAGGGCTTCACCGATTTAAAAGAAACTG
TACAAACTTTGGATCCCTTGTGGATAAGTCAAACATGAAACTTACCAAATGGGAGGATTTTGAGGACAAGGCCATTGGACACTGGAAGTTTAGTTGTATCTTGTCATTTC
CTTGGAGACCAATTCTGTCTGCAACTGGATATACAGAGTATTATTTTGATGCAGGATCTGGAAAAGTATGCAGGCATGTAGAGCACTGGAATGTTCCTAAAATGGCTTTA
CTGAAGCAAATTTTGAGGCCCACTCGAGACTGGTTGTGGTTTAAGAAAGCAGGTGCCAGGTAG
mRNA sequenceShow/hide mRNA sequence
CGCGGCAAAGCTCCAACACGTTCTGATTCAATCGACTACCACCAAATCACCTCTTTCTCTCTCTCTCTCTCTCTCTCTATCTCCATGTCTGTACATTGCAATGGCAACCA
TCTTCTTTTTACAATCAAATACATCTCTCCAAACCTCTCTCAACGCCTTGCGATCTAATCGCCCTCTCAAAACCTACAGAATCCGGTGCCAGGGGGACAATCCCTCCACC
GATTCGACGAAGAATCAAGAATCCAAGCCCGAGAATGCGGTGCTCAAGGTCGCTTGGTATGGCTCCGAGCTTTTGGGAATCGCCGCTTCATTTCTCCGCCCGCCGTCGGA
TGCCGGAGCTCCCGTTAGGGCTCAGGAGGAGCTTACGAGAGATGTGTTCGGTGCAATTCGTCGCCCTGTGATTGTGGAAACGATTAAGGAGGATTTTGGGCGGTCGTATT
TCGTCACAGGGAACTTTACTGTTGATGCTTATGAAGAGCAGTGTGAATTTGCTGATCCGGCAGGTTCTTTCAAAGGGCTTCACCGATTTAAAAGAAACTGTACAAACTTT
GGATCCCTTGTGGATAAGTCAAACATGAAACTTACCAAATGGGAGGATTTTGAGGACAAGGCCATTGGACACTGGAAGTTTAGTTGTATCTTGTCATTTCCTTGGAGACC
AATTCTGTCTGCAACTGGATATACAGAGTATTATTTTGATGCAGGATCTGGAAAAGTATGCAGGCATGTAGAGCACTGGAATGTTCCTAAAATGGCTTTACTGAAGCAAA
TTTTGAGGCCCACTCGAGACTGGTTGTGGTTTAAGAAAGCAGGTGCCAGGTAGTAGTGGTTGATAGCTGACTGATTTTTCATTCTGATAAGAACACAGCTTAGAAAAAAA
CCTTTTGAATATGGAAGGGAATCAGCTTCCTGTAAATTTTATAGTTTGATTTGGAAAATATTTTTCATGGAGTAATGTTAGGAAATTACTTGTAATAGAGAG
Protein sequenceShow/hide protein sequence
MATIFFLQSNTSLQTSLNALRSNRPLKTYRIRCQGDNPSTDSTKNQESKPENAVLKVAWYGSELLGIAASFLRPPSDAGAPVRAQEELTRDVFGAIRRPVIVETIKEDFG
RSYFVTGNFTVDAYEEQCEFADPAGSFKGLHRFKRNCTNFGSLVDKSNMKLTKWEDFEDKAIGHWKFSCILSFPWRPILSATGYTEYYFDAGSGKVCRHVEHWNVPKMAL
LKQILRPTRDWLWFKKAGAR