; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0020897 (gene) of Snake gourd v1 genome

Gene IDTan0020897
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionNAB domain-containing protein
Genome locationLG07:2610681..2611892
RNA-Seq ExpressionTan0020897
SyntenyTan0020897
Gene Ontology termsGO:0016310 - phosphorylation (biological process)
GO:0016301 - kinase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022148179.1 uncharacterized protein LOC111016915 [Momordica charantia]1.0e-10875.87Show/hide
Query:  MEKQSLLCLTDTSSSDSPLHSSTLHTTHSVSIVSDIDKKMKAIVTLLEEDGHYRNRILEIKYMLEEYNRSYQSLAEKYDCLKFVFVNTFYSGSSSSTDAE
        MEKQS LCL D+SSSDSPLHS TLHTT S   VSD+DKKMKAI+T LEEDG Y+NR LEIK MLEEYNRSYQSLAEKYD LKF+FVN  +SGSS+ +DAE
Subjt:  MEKQSLLCLTDTSSSDSPLHSSTLHTTHSVSIVSDIDKKMKAIVTLLEEDGHYRNRILEIKYMLEEYNRSYQSLAEKYDCLKFVFVNTFYSGSSSSTDAE

Query:  IFRCNIKSPDIKSAAADRSIKCNNDIFNKEFLIKLRDELVSSKICRQNSEKIVEVRDDDKTYGRITAHPNINNAEITMNEFELCRVKDVHPSVAIARWES
        IFRCNIK P +KS + D S+KCNNDI +KEFLIKLRDELVSSK C  NS++IVE R DDK YG I A   I++ E+ MN+FEL RVKDVHP VAIA+WES
Subjt:  IFRCNIKSPDIKSAAADRSIKCNNDIFNKEFLIKLRDELVSSKICRQNSEKIVEVRDDDKTYGRITAHPNINNAEITMNEFELCRVKDVHPSVAIARWES

Query:  RWNELNSQVTMLMEENLQHQEELTRRNNEKREAIKELQQQIQRLKSEKRALQSSLRFTKEKFKPYRSPISRLAETISNKFWGRGCS
         WNELNS+VT+LMEENLQHQEELTRRNNEKREAIKEL QQI+ LKSE R+LQSSLRF KE+ KPYRSPISRLAETISNK  G GCS
Subjt:  RWNELNSQVTMLMEENLQHQEELTRRNNEKREAIKELQQQIQRLKSEKRALQSSLRFTKEKFKPYRSPISRLAETISNKFWGRGCS

XP_022928569.1 uncharacterized protein LOC111435336 isoform X1 [Cucurbita moschata]5.9e-9670.98Show/hide
Query:  MEKQSLLCLTDTSSSDSPLHSSTLHTTHSVSIVSDIDKKMKAIVTLLEEDGHYRNRILEIKYMLEEYNRSYQSLAEKYDCLKFVFVNTFYSGSSSSTDAE
        MEKQ+LLCLTDTSSSDSP+H              DIDKKMKAIVTLL+ED HY+NR LEIKYMLEEYNRSY SLAEKYDCLKF+FVNT YS SSSS++AE
Subjt:  MEKQSLLCLTDTSSSDSPLHSSTLHTTHSVSIVSDIDKKMKAIVTLLEEDGHYRNRILEIKYMLEEYNRSYQSLAEKYDCLKFVFVNTFYSGSSSSTDAE

Query:  IFRCNIKSPDIKSAAADRSIKCNNDIFNKEFLIKLRDELVSSKICRQNSEKIVEVRDDDKTYGRITAHPNINNAEITMNEFELCRVKDVHPSVAIARWES
        IFRCNIK PDI+SA  D  +   NDIF+KEFLIK+R+E+                  ++KTYGRITA P I++AE  MN+FE  +V+D++PSVAIA+WES
Subjt:  IFRCNIKSPDIKSAAADRSIKCNNDIFNKEFLIKLRDELVSSKICRQNSEKIVEVRDDDKTYGRITAHPNINNAEITMNEFELCRVKDVHPSVAIARWES

Query:  RWNELNSQVTMLMEENLQHQEELTRRNNEKREAIKELQQQIQRLKSEKRALQSSLRFTKEKFKPYRSPISRLAETISNKFWGRGCS
        R NELNSQVTMLMEENLQ+QEEL RRNN+KR AIKELQQQI  LKSEKRALQSSLRFTKEKFKPYRSPISRLAETISNKF  RGCS
Subjt:  RWNELNSQVTMLMEENLQHQEELTRRNNEKREAIKELQQQIQRLKSEKRALQSSLRFTKEKFKPYRSPISRLAETISNKFWGRGCS

XP_031739167.1 uncharacterized protein LOC105434918 [Cucumis sativus]2.2e-10675.96Show/hide
Query:  MEKQSLLCLTDTSSSDSPLHSSTLHTTHSVSIVSDIDKKMKAIVTLLEEDGHYRNRILEIKYMLEEYNRSYQSLAEKYDCLKFVFVNTFYSGSSSSTDAE
        MEKQSLLCLTDTSSSDSPLHSSTLH TH     SDI+KKMKAIVTLLEEDGH+RNR LEIKYMLEEYNR+YQ+LAEKYDCLKF+ VNT YS  SS +DAE
Subjt:  MEKQSLLCLTDTSSSDSPLHSSTLHTTHSVSIVSDIDKKMKAIVTLLEEDGHYRNRILEIKYMLEEYNRSYQSLAEKYDCLKFVFVNTFYSGSSSSTDAE

Query:  IFRCNIKSPDIKSAAADRSIKCNNDIFNKEFLIKLRDEL-VSSKICRQNSEKIVEVRDDDKTYGRITAHPNINNAEITMNEFELCRVKDVHPSVAIARWE
        IF+CN +S D+KSAA D + K   DIF++EFLIKLRDEL VSSK+C QNSEKIV   D    YGRIT H  I+  E  MNE ++ +VKDVHPSV I RWE
Subjt:  IFRCNIKSPDIKSAAADRSIKCNNDIFNKEFLIKLRDEL-VSSKICRQNSEKIVEVRDDDKTYGRITAHPNINNAEITMNEFELCRVKDVHPSVAIARWE

Query:  SRWNELNSQVTMLMEENLQHQEELTRRNNEKREAIKELQQQIQRLKSEKRALQSSLRFTKEKFKPYRSPISRLAETISNKFWGRGCS
        SRW ELNSQVTMLMEENL HQEELTRRNNEKREAIKELQQQIQ LKSE RALQS+LRFTKEK KPY+SPISRLA  ISNK   RGCS
Subjt:  SRWNELNSQVTMLMEENLQHQEELTRRNNEKREAIKELQQQIQRLKSEKRALQSSLRFTKEKFKPYRSPISRLAETISNKFWGRGCS

XP_038878075.1 uncharacterized protein LOC120070255 isoform X1 [Benincasa hispida]2.8e-11480.28Show/hide
Query:  MEKQSLLCLTDTSSSDSPLHSSTLHTTHSVSIVSDIDKKMKAIVTLLEEDGHYRNRILEIKYMLEEYNRSYQSLAEKYDCLKFVFVNTFYSGSSSSTDAE
        MEKQ LLCLTD SSSDSPLHSST HTTHS   VSDIDKKMK+IVTLLEEDGHYRNR LEIKYMLEEYNRSYQSLAEKYDCLKF+FVNT YS SSSS+DAE
Subjt:  MEKQSLLCLTDTSSSDSPLHSSTLHTTHSVSIVSDIDKKMKAIVTLLEEDGHYRNRILEIKYMLEEYNRSYQSLAEKYDCLKFVFVNTFYSGSSSSTDAE

Query:  IFRCNIKSPDIKSAA--ADRSIKCNNDIFNKEFLIKLRDEL-VSSKICRQNSEKIVEVRDDDKTYGRITAHPNINNAEITMNEFELCRVKDVHPSVAIAR
        IFRCN KSPD KSAA     S+K N DIF+ EFLIKLRDEL VSSK+C QNS+KIV    D   YGRI  H  I+N EI MNE E+ +VK+VHPSVAI R
Subjt:  IFRCNIKSPDIKSAA--ADRSIKCNNDIFNKEFLIKLRDEL-VSSKICRQNSEKIVEVRDDDKTYGRITAHPNINNAEITMNEFELCRVKDVHPSVAIAR

Query:  WESRWNELNSQVTMLMEENLQHQEELTRRNNEKREAIKELQQQIQRLKSEKRALQSSLRFTKEKFKPYRSPISRLAETISNKFWGRGCS
        WESRWNELNSQVTMLMEENLQHQEELTRRNN+KREAIKELQQQI+ LKSEK ALQSSLRFTKEK KPYRSPISRLA TISNK   RGCS
Subjt:  WESRWNELNSQVTMLMEENLQHQEELTRRNNEKREAIKELQQQIQRLKSEKRALQSSLRFTKEKFKPYRSPISRLAETISNKFWGRGCS

XP_038878076.1 uncharacterized protein LOC120070255 isoform X2 [Benincasa hispida]2.6e-11279.58Show/hide
Query:  MEKQSLLCLTDTSSSDSPLHSSTLHTTHSVSIVSDIDKKMKAIVTLLEEDGHYRNRILEIKYMLEEYNRSYQSLAEKYDCLKFVFVNTFYSGSSSSTDAE
        MEKQ LLCLTD SSSDSPLHSST HTTH     SDIDKKMK+IVTLLEEDGHYRNR LEIKYMLEEYNRSYQSLAEKYDCLKF+FVNT YS SSSS+DAE
Subjt:  MEKQSLLCLTDTSSSDSPLHSSTLHTTHSVSIVSDIDKKMKAIVTLLEEDGHYRNRILEIKYMLEEYNRSYQSLAEKYDCLKFVFVNTFYSGSSSSTDAE

Query:  IFRCNIKSPDIKSAA--ADRSIKCNNDIFNKEFLIKLRDEL-VSSKICRQNSEKIVEVRDDDKTYGRITAHPNINNAEITMNEFELCRVKDVHPSVAIAR
        IFRCN KSPD KSAA     S+K N DIF+ EFLIKLRDEL VSSK+C QNS+KIV    D   YGRI  H  I+N EI MNE E+ +VK+VHPSVAI R
Subjt:  IFRCNIKSPDIKSAA--ADRSIKCNNDIFNKEFLIKLRDEL-VSSKICRQNSEKIVEVRDDDKTYGRITAHPNINNAEITMNEFELCRVKDVHPSVAIAR

Query:  WESRWNELNSQVTMLMEENLQHQEELTRRNNEKREAIKELQQQIQRLKSEKRALQSSLRFTKEKFKPYRSPISRLAETISNKFWGRGCS
        WESRWNELNSQVTMLMEENLQHQEELTRRNN+KREAIKELQQQI+ LKSEK ALQSSLRFTKEK KPYRSPISRLA TISNK   RGCS
Subjt:  WESRWNELNSQVTMLMEENLQHQEELTRRNNEKREAIKELQQQIQRLKSEKRALQSSLRFTKEKFKPYRSPISRLAETISNKFWGRGCS

TrEMBL top hitse value%identityAlignment
A0A5D3E2T3 Putative Kinase interacting family protein1.5e-8975.4Show/hide
Query:  MKAIVTLLEEDGHYRNRILEIKYMLEEYNRSYQSLAEKYDCLKFVFVNTFYSGSSSSTDAEIFRCNIKSPDIKSAAADRSIKCNNDIFNKEFLIKLRDEL
        MKAIVTLLEEDGHYRNR LEIKYMLEEYN++YQ+LAEKYD LKF+ VNT YS  SS +DAEIF+CN  SPD+KSAA D + K   DIF KEFL KLRDEL
Subjt:  MKAIVTLLEEDGHYRNRILEIKYMLEEYNRSYQSLAEKYDCLKFVFVNTFYSGSSSSTDAEIFRCNIKSPDIKSAAADRSIKCNNDIFNKEFLIKLRDEL

Query:  -VSSKICRQNSEKIVEVRDDDKTYGRITAHPNINNAEITMNEFELCRVKDVHPSVAIARWESRWNELNSQVTMLMEENLQHQEELTRRNNEKREAIKELQ
         VSSK+C QNSEKIVEV       GRI  H  I+  E  +NE ++ RVKDVHPSVAI RWESRWNELNSQVTMLMEENL HQEELTRRNNEKREAIKEL 
Subjt:  -VSSKICRQNSEKIVEVRDDDKTYGRITAHPNINNAEITMNEFELCRVKDVHPSVAIARWESRWNELNSQVTMLMEENLQHQEELTRRNNEKREAIKELQ

Query:  QQIQRLKSEKRALQSSLRFTKEKFKPYRSPISRLAETISNKFWGRGCS
        QQIQ L SE RALQSSLRFTKEK KPYRSPISRLA  ISNK   RGCS
Subjt:  QQIQRLKSEKRALQSSLRFTKEKFKPYRSPISRLAETISNKFWGRGCS

A0A6J1D380 uncharacterized protein LOC1110169155.0e-10975.87Show/hide
Query:  MEKQSLLCLTDTSSSDSPLHSSTLHTTHSVSIVSDIDKKMKAIVTLLEEDGHYRNRILEIKYMLEEYNRSYQSLAEKYDCLKFVFVNTFYSGSSSSTDAE
        MEKQS LCL D+SSSDSPLHS TLHTT S   VSD+DKKMKAI+T LEEDG Y+NR LEIK MLEEYNRSYQSLAEKYD LKF+FVN  +SGSS+ +DAE
Subjt:  MEKQSLLCLTDTSSSDSPLHSSTLHTTHSVSIVSDIDKKMKAIVTLLEEDGHYRNRILEIKYMLEEYNRSYQSLAEKYDCLKFVFVNTFYSGSSSSTDAE

Query:  IFRCNIKSPDIKSAAADRSIKCNNDIFNKEFLIKLRDELVSSKICRQNSEKIVEVRDDDKTYGRITAHPNINNAEITMNEFELCRVKDVHPSVAIARWES
        IFRCNIK P +KS + D S+KCNNDI +KEFLIKLRDELVSSK C  NS++IVE R DDK YG I A   I++ E+ MN+FEL RVKDVHP VAIA+WES
Subjt:  IFRCNIKSPDIKSAAADRSIKCNNDIFNKEFLIKLRDELVSSKICRQNSEKIVEVRDDDKTYGRITAHPNINNAEITMNEFELCRVKDVHPSVAIARWES

Query:  RWNELNSQVTMLMEENLQHQEELTRRNNEKREAIKELQQQIQRLKSEKRALQSSLRFTKEKFKPYRSPISRLAETISNKFWGRGCS
         WNELNS+VT+LMEENLQHQEELTRRNNEKREAIKEL QQI+ LKSE R+LQSSLRF KE+ KPYRSPISRLAETISNK  G GCS
Subjt:  RWNELNSQVTMLMEENLQHQEELTRRNNEKREAIKELQQQIQRLKSEKRALQSSLRFTKEKFKPYRSPISRLAETISNKFWGRGCS

A0A6J1EL78 uncharacterized protein LOC111435336 isoform X12.8e-9670.98Show/hide
Query:  MEKQSLLCLTDTSSSDSPLHSSTLHTTHSVSIVSDIDKKMKAIVTLLEEDGHYRNRILEIKYMLEEYNRSYQSLAEKYDCLKFVFVNTFYSGSSSSTDAE
        MEKQ+LLCLTDTSSSDSP+H              DIDKKMKAIVTLL+ED HY+NR LEIKYMLEEYNRSY SLAEKYDCLKF+FVNT YS SSSS++AE
Subjt:  MEKQSLLCLTDTSSSDSPLHSSTLHTTHSVSIVSDIDKKMKAIVTLLEEDGHYRNRILEIKYMLEEYNRSYQSLAEKYDCLKFVFVNTFYSGSSSSTDAE

Query:  IFRCNIKSPDIKSAAADRSIKCNNDIFNKEFLIKLRDELVSSKICRQNSEKIVEVRDDDKTYGRITAHPNINNAEITMNEFELCRVKDVHPSVAIARWES
        IFRCNIK PDI+SA  D  +   NDIF+KEFLIK+R+E+                  ++KTYGRITA P I++AE  MN+FE  +V+D++PSVAIA+WES
Subjt:  IFRCNIKSPDIKSAAADRSIKCNNDIFNKEFLIKLRDELVSSKICRQNSEKIVEVRDDDKTYGRITAHPNINNAEITMNEFELCRVKDVHPSVAIARWES

Query:  RWNELNSQVTMLMEENLQHQEELTRRNNEKREAIKELQQQIQRLKSEKRALQSSLRFTKEKFKPYRSPISRLAETISNKFWGRGCS
        R NELNSQVTMLMEENLQ+QEEL RRNN+KR AIKELQQQI  LKSEKRALQSSLRFTKEKFKPYRSPISRLAETISNKF  RGCS
Subjt:  RWNELNSQVTMLMEENLQHQEELTRRNNEKREAIKELQQQIQRLKSEKRALQSSLRFTKEKFKPYRSPISRLAETISNKFWGRGCS

A0A6J1ES10 uncharacterized protein LOC111435336 isoform X26.8e-9068.88Show/hide
Query:  MEKQSLLCLTDTSSSDSPLHSSTLHTTHSVSIVSDIDKKMKAIVTLLEEDGHYRNRILEIKYMLEEYNRSYQSLAEKYDCLKFVFVNTFYSGSSSSTDAE
        MEKQ+LLCLTDTSSSDSP+H              DIDKKMKAIVTLL+ED HY+NR LEIKYMLEEYNRSY SLAEKYDCLKF+FVNT YS SSSS++AE
Subjt:  MEKQSLLCLTDTSSSDSPLHSSTLHTTHSVSIVSDIDKKMKAIVTLLEEDGHYRNRILEIKYMLEEYNRSYQSLAEKYDCLKFVFVNTFYSGSSSSTDAE

Query:  IFRCNIKSPDIKSAAADRSIKCNNDIFNKEFLIKLRDELVSSKICRQNSEKIVEVRDDDKTYGRITAHPNINNAEITMNEFELCRVKDVHPSVAIARWES
        IFRCNIK PDI+SA  D  +   NDIF+KEFLIK+R+E+                  ++KTYGRITA P I++AE  MN+FE            IA+WES
Subjt:  IFRCNIKSPDIKSAAADRSIKCNNDIFNKEFLIKLRDELVSSKICRQNSEKIVEVRDDDKTYGRITAHPNINNAEITMNEFELCRVKDVHPSVAIARWES

Query:  RWNELNSQVTMLMEENLQHQEELTRRNNEKREAIKELQQQIQRLKSEKRALQSSLRFTKEKFKPYRSPISRLAETISNKFWGRGCS
        R NELNSQVTMLMEENLQ+QEEL RRNN+KR AIKELQQQI  LKSEKRALQSSLRFTKEKFKPYRSPISRLAETISNKF  RGCS
Subjt:  RWNELNSQVTMLMEENLQHQEELTRRNNEKREAIKELQQQIQRLKSEKRALQSSLRFTKEKFKPYRSPISRLAETISNKFWGRGCS

A0A6J1JG23 uncharacterized protein LOC1114865856.6e-9369.23Show/hide
Query:  MEKQSLLCLTDTSSSDSPLHSSTLHTTHSVSIVSDIDKKMKAIVTLLEEDGHYRNRILEIKYMLEEYNRSYQSLAEKYDCLKFVFVNTFYSGSSSSTDAE
        MEKQSLLCLTDTSSSDSP+H              DIDKKMKAIVTLL+ED HY+NR LEIKYMLEEYNRSY SLAEKYDCLKF+FVNT YS SSSS+D E
Subjt:  MEKQSLLCLTDTSSSDSPLHSSTLHTTHSVSIVSDIDKKMKAIVTLLEEDGHYRNRILEIKYMLEEYNRSYQSLAEKYDCLKFVFVNTFYSGSSSSTDAE

Query:  IFRCNIKSPDIKSAAADRSIKCNNDIFNKEFLIKLRDELVSSKICRQNSEKIVEVRDDDKTYGRITAHPNINNAEITMNEFELCRVKDVHPSVAIARWES
        IFRC I+ PDI+SA  D    C N IF+KEFLIK+R+E+                  D+KTYGRITA P I++AE  MN+FE  +++D++PSVAIA+WES
Subjt:  IFRCNIKSPDIKSAAADRSIKCNNDIFNKEFLIKLRDELVSSKICRQNSEKIVEVRDDDKTYGRITAHPNINNAEITMNEFELCRVKDVHPSVAIARWES

Query:  RWNELNSQVTMLMEENLQHQEELTRRNNEKREAIKELQQQIQRLKSEKRALQSSLRFTKEKFKPYRSPISRLAETISNKFWGRGCS
        R NELNSQVTMLM+ENLQHQEEL RRNN+K  AIKELQQQI  LKSEKRALQSSLRFTKEKFK YRSPISRLAE ISNK   RGCS
Subjt:  RWNELNSQVTMLMEENLQHQEELTRRNNEKREAIKELQQQIQRLKSEKRALQSSLRFTKEKFKPYRSPISRLAETISNKFWGRGCS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G03470.1 Kinase interacting (KIP1-like) family protein1.9e-0424.8Show/hide
Query:  HSSTLHTTHSVSIVSDIDKKMKAIVTLLEEDGH--------YRNRILEIKYMLEEYNRSYQSLAEKYDCLKFVFVNTFYSGSSSSTDAEIFRCNIKSPDI
        H++T  +    S +S++D+K K ++ +++ED          Y  +  E+  M+EE+ RS++SLAE+YD L+   V+   S S S                
Subjt:  HSSTLHTTHSVSIVSDIDKKMKAIVTLLEEDGH--------YRNRILEIKYMLEEYNRSYQSLAEKYDCLKFVFVNTFYSGSSSSTDAEIFRCNIKSPDI

Query:  KSAAADRSIKCNNDIFNKEFLIKLRDELVSSKICRQNSEKIVEVRDDDKTYGRITAHPNINNAEIT-MNEFELCR-VKDVHPS-VAIARWESRWNELNSQ
             ++S  C              DE   S+ C  + E                A   I+N E   ++E E+   V+++ PS V  +        +  +
Subjt:  KSAAADRSIKCNNDIFNKEFLIKLRDELVSSKICRQNSEKIVEVRDDDKTYGRITAHPNINNAEIT-MNEFELCR-VKDVHPS-VAIARWESRWNELNSQ

Query:  VTMLMEENLQHQEELTRRNNEKREAIKELQQQIQRLKSEKRALQSSLRFT
        +  L EEN  + E +  ++ EKREAI+++   IQ LK E   L+  +  T
Subjt:  VTMLMEENLQHQEELTRRNNEKREAIKELQQQIQRLKSEKRALQSSLRFT

AT1G03470.2 Kinase interacting (KIP1-like) family protein1.9e-0424.8Show/hide
Query:  HSSTLHTTHSVSIVSDIDKKMKAIVTLLEEDGH--------YRNRILEIKYMLEEYNRSYQSLAEKYDCLKFVFVNTFYSGSSSSTDAEIFRCNIKSPDI
        H++T  +    S +S++D+K K ++ +++ED          Y  +  E+  M+EE+ RS++SLAE+YD L+   V+   S S S                
Subjt:  HSSTLHTTHSVSIVSDIDKKMKAIVTLLEEDGH--------YRNRILEIKYMLEEYNRSYQSLAEKYDCLKFVFVNTFYSGSSSSTDAEIFRCNIKSPDI

Query:  KSAAADRSIKCNNDIFNKEFLIKLRDELVSSKICRQNSEKIVEVRDDDKTYGRITAHPNINNAEIT-MNEFELCR-VKDVHPS-VAIARWESRWNELNSQ
             ++S  C              DE   S+ C  + E                A   I+N E   ++E E+   V+++ PS V  +        +  +
Subjt:  KSAAADRSIKCNNDIFNKEFLIKLRDELVSSKICRQNSEKIVEVRDDDKTYGRITAHPNINNAEIT-MNEFELCR-VKDVHPS-VAIARWESRWNELNSQ

Query:  VTMLMEENLQHQEELTRRNNEKREAIKELQQQIQRLKSEKRALQSSLRFT
        +  L EEN  + E +  ++ EKREAI+++   IQ LK E   L+  +  T
Subjt:  VTMLMEENLQHQEELTRRNNEKREAIKELQQQIQRLKSEKRALQSSLRFT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAAACAAAGCCTCCTTTGTTTAACAGATACAAGCAGCTCCGATAGTCCTCTTCATTCCTCCACCCTTCACACAACTCATTCAGTTTCAATTGTTTCAGACATAGA
CAAGAAAATGAAGGCCATAGTGACCCTTTTGGAAGAAGATGGGCATTACCGAAATAGAATTCTGGAAATTAAATATATGCTTGAAGAGTACAACAGATCTTACCAATCCT
TGGCTGAAAAATATGACTGTTTAAAGTTCGTATTTGTCAACACTTTCTATTCTGGGTCGTCTTCGTCTACCGACGCCGAGATATTCCGATGCAATATCAAAAGTCCAGAT
ATCAAATCTGCTGCTGCCGATCGCAGCATCAAATGTAACAATGACATCTTCAACAAAGAGTTCCTGATCAAGCTAAGAGATGAATTGGTTTCAAGTAAGATATGCAGGCA
GAATTCTGAAAAAATAGTTGAAGTAAGAGACGATGATAAAACGTATGGCAGAATCACGGCTCATCCCAATATCAATAATGCTGAAATAACGATGAACGAGTTCGAATTAT
GTCGTGTTAAAGACGTACATCCATCAGTGGCTATTGCTAGGTGGGAGAGTAGATGGAATGAACTAAACTCACAAGTGACAATGCTCATGGAGGAAAACCTGCAGCATCAG
GAGGAGTTAACTAGGAGAAACAATGAAAAGAGAGAAGCCATTAAAGAACTTCAGCAACAGATACAACGCCTAAAGAGTGAGAAGAGGGCCCTCCAGAGTTCTCTCAGATT
CACCAAAGAAAAGTTTAAGCCATATCGCTCTCCAATATCACGATTGGCAGAAACGATTTCGAACAAATTTTGGGGGCGAGGTTGTTCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAAAACAAAGCCTCCTTTGTTTAACAGATACAAGCAGCTCCGATAGTCCTCTTCATTCCTCCACCCTTCACACAACTCATTCAGTTTCAATTGTTTCAGACATAGA
CAAGAAAATGAAGGCCATAGTGACCCTTTTGGAAGAAGATGGGCATTACCGAAATAGAATTCTGGAAATTAAATATATGCTTGAAGAGTACAACAGATCTTACCAATCCT
TGGCTGAAAAATATGACTGTTTAAAGTTCGTATTTGTCAACACTTTCTATTCTGGGTCGTCTTCGTCTACCGACGCCGAGATATTCCGATGCAATATCAAAAGTCCAGAT
ATCAAATCTGCTGCTGCCGATCGCAGCATCAAATGTAACAATGACATCTTCAACAAAGAGTTCCTGATCAAGCTAAGAGATGAATTGGTTTCAAGTAAGATATGCAGGCA
GAATTCTGAAAAAATAGTTGAAGTAAGAGACGATGATAAAACGTATGGCAGAATCACGGCTCATCCCAATATCAATAATGCTGAAATAACGATGAACGAGTTCGAATTAT
GTCGTGTTAAAGACGTACATCCATCAGTGGCTATTGCTAGGTGGGAGAGTAGATGGAATGAACTAAACTCACAAGTGACAATGCTCATGGAGGAAAACCTGCAGCATCAG
GAGGAGTTAACTAGGAGAAACAATGAAAAGAGAGAAGCCATTAAAGAACTTCAGCAACAGATACAACGCCTAAAGAGTGAGAAGAGGGCCCTCCAGAGTTCTCTCAGATT
CACCAAAGAAAAGTTTAAGCCATATCGCTCTCCAATATCACGATTGGCAGAAACGATTTCGAACAAATTTTGGGGGCGAGGTTGTTCTTGA
Protein sequenceShow/hide protein sequence
MEKQSLLCLTDTSSSDSPLHSSTLHTTHSVSIVSDIDKKMKAIVTLLEEDGHYRNRILEIKYMLEEYNRSYQSLAEKYDCLKFVFVNTFYSGSSSSTDAEIFRCNIKSPD
IKSAAADRSIKCNNDIFNKEFLIKLRDELVSSKICRQNSEKIVEVRDDDKTYGRITAHPNINNAEITMNEFELCRVKDVHPSVAIARWESRWNELNSQVTMLMEENLQHQ
EELTRRNNEKREAIKELQQQIQRLKSEKRALQSSLRFTKEKFKPYRSPISRLAETISNKFWGRGCS