CuGenDBv2

Lag0028853 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0028853
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Ulp1 peptidase-like
Genome location: chr8:31824368..31836396
RNA-Seq Expression: Lag0028853
Synteny: Lag0028853
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
  IPR002156 - Ribonuclease H domain
  IPR015410 - Domain of unknown function DUF1985
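The genome location above is given as a `chromosome:start..end` string. As a minimal sketch, the Python below splits such a string into its parts and computes the span of the locus. The function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end, span).

    Assumption: coordinates are 1-based and inclusive, so the span is
    end - start + 1. The page itself does not state the convention.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    if end < start:
        raise ValueError("end coordinate is smaller than start coordinate")
    return chrom, start, end, end - start + 1


if __name__ == "__main__":
    chrom, start, end, span = parse_location("chr8:31824368..31836396")
    print(chrom, start, end, span)
```

Under the inclusive-coordinate assumption, the span is the genomic extent of the gene model (exons plus introns), so it can be considerably longer than the CDS listed further down.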


Homology

GenBank top hits (hit; e value; % identity; alignment)

XP_022146372.1 uncharacterized protein LOC111015600 [Momordica charantia]; e value 4.1e-51; identity 42.5%
Query:  DHFPAALTCCSHLNKTIGNIKSTLSPTQLNLFRQTCFGPLIDVNVIFNGQLVHYILLREVEDNRADVMSFKLLGQKVSFGKREFDLITGLRHSFRPMRRV
        D FPA LT  +H++KT   IK+ L+PTQL++FRQTCFGP++D+ V+FNG L+H++LL EVE+ R DV+SF L  ++VSFGKREFDLITGL H    +   
Subjt:  DHFPAALTCCSHLNKTIGNIKSTLSPTQLNLFRQTCFGPLIDVNVIFNGQLVHYILLREVEDNRADVMSFKLLGQKVSFGKREFDLITGLRHSFRPMRRV

Query:  IEGPPNRLQRLYFRENVGMKVEELDKSFPTLQFENDEDAVKTASFYFFELALMGRERKQQVDASTLNLMDDWVAFCNEDWSTIVFDKTI---KIWAYETV
        I  P  RL+  YF+++V +K  EL+K F    F +DED VK    YF ELA+MG+ERKQ +D  T+ ++D W AFCN DWS+++FD+TI   K    + +
Subjt:  IEGPPNRLQRLYFRENVGMKVEELDKSFPTLQFENDEDAVKTASFYFFELALMGRERKQQVDASTLNLMDDWVAFCNEDWSTIVFDKTI---KIWAYETV

Query:  SSLTGRVANRISETTIPRICRWSCSHSSSYTIIADEVFGSRAARVRLSLVSSAEESQFMNRVMKPPRAPEPVPEPILEPE
        S+   +     +      +  +         ++A EVF +  ++V+  L+++  E Q M RV+ PP     +P+P   P+
Subjt:  SSLTGRVANRISETTIPRICRWSCSHSSSYTIIADEVFGSRAARVRLSLVSSAEESQFMNRVMKPPRAPEPVPEPILEPE

XP_022153201.1 uncharacterized protein LOC111020757 [Momordica charantia]; e value 7.0e-67; identity 36.42%
Query:  DHFPAALTCCSHLNKTIGNIKSTLSPTQLNLFRQTCFGPLIDVNVIFNGQLVHYILLREVEDNRADVMSFKLLGQKVSFGKREFDLITGLRHSFRPMRRV
        D FPA LT  +H++KT   IK+ L+PTQL++FRQTCFGP++D++V+FNG L+H++LLREVE+ R DV+SF L G++VSFGKREFDLITGL H    +   
Subjt:  DHFPAALTCCSHLNKTIGNIKSTLSPTQLNLFRQTCFGPLIDVNVIFNGQLVHYILLREVEDNRADVMSFKLLGQKVSFGKREFDLITGLRHSFRPMRRV

Query:  IEGPPNRLQRLYFRENVGMKVEELDKSFPTLQFENDEDAVKTASFYFFELALMGRERKQQVDASTLNLMDDWVAFCNEDWSTIVFDKTI-----------
        I  P  RL+  YF++ V +K  EL+K F    F +DED VK    YF ELA+MG+ERKQ +D + L ++D W  FCN DWS+++FD+TI           
Subjt:  IEGPPNRLQRLYFRENVGMKVEELDKSFPTLQFENDEDAVKTASFYFFELALMGRERKQQVDASTLNLMDDWVAFCNEDWSTIVFDKTI-----------

Query:  --------------------------KIWAYETVSSLTGRVANRISETTIPRICRWSCSHSSSYTIIADEVFGSRAARVRLSLVSSAEESQFMNRVMKPP
                                  ++WAYET+S+L        S+  IPR+ RWSC +S  + ++  EVF +  ++V+  L+++  + Q M RV+ PP
Subjt:  --------------------------KIWAYETVSSLTGRVANRISETTIPRICRWSCSHSSSYTIIADEVFGSRAARVRLSLVSSAEESQFMNRVMKPP

Query:  RA-----PEPVPEPILEPEPEPDQETERQPTVSEAILPVVEEDVVADTKMLDVTEASPEVSNKRGREQDDKNKGKEKENEVEKEDESTKEKKKKKRKKKQ
               P  VP+  + P+P    E    P     +     ED V D   +D  EA P  ++  G E                     K  KK K KK+ 
Subjt:  RA-----PEPVPEPILEPEPEPDQETERQPTVSEAILPVVEEDVVADTKMLDVTEASPEVSNKRGREQDDKNKGKEKENEVEKEDESTKEKKKKKRKKKQ

Query:  TCECSQWLQLMDSRMERMDARMSDMETCLKSITKFLRRLSKGKFVDPEKYFGPKDGSDDEGGPSKGPDDVGGPSKGLDDKGSPSKGPDDDGRPDDNKEGH
            S+ L+ +D+ +  ++ ++ D    LK I  +L++L+KGKF D  KYFG   G DD+G   + PD+   P  G        +  D+D R D++ E  
Subjt:  TCECSQWLQLMDSRMERMDARMSDMETCLKSITKFLRRLSKGKFVDPEKYFGPKDGSDDEGGPSKGPDDVGGPSKGLDDKGSPSKGPDDDGRPDDNKEGH

Query:  EEGGNDRG
        +E  +  G
Subjt:  EEGGNDRG

XP_022154995.1 uncharacterized protein LOC111022139 [Momordica charantia]; e value 1.0e-49; identity 51.5%
Query:  MEITEKIPFADHFPAALTCCSHLNKTIGNIKSTLSPTQLNLFRQTCFGPLIDVNVIFNGQLVHYILLREVEDNRADVMSFKLLGQKVSFGKREFDLITGL
        M++T KI   D FPAAL+  +H+ KT   +K+ L+P+QL++F QTCFG ++ +N +FN  L+H++LLREVE+ R D++SF L G +VSFGKREFDLITGL
Subjt:  MEITEKIPFADHFPAALTCCSHLNKTIGNIKSTLSPTQLNLFRQTCFGPLIDVNVIFNGQLVHYILLREVEDNRADVMSFKLLGQKVSFGKREFDLITGL

Query:  RHSFRPMRRVIEGPPN-RLQRLYFRENVGMKVEELDKSFPTLQFENDEDAVKTASFYFFELALMGRERKQQVDASTLNLMDDWVAFCNEDWSTIVFDKTI
        RH+   M RV +   N RL+ LYF++   +K  EL+K F    F+NDEDAVK A  YF ELA+MG+ERKQ++D S L ++D W  FCN DWS+++ + T+
Subjt:  RHSFRPMRRVIEGPPN-RLQRLYFRENVGMKVEELDKSFPTLQFENDEDAVKTASFYFFELALMGRERKQQVDASTLNLMDDWVAFCNEDWSTIVFDKTI

XP_022155158.1 uncharacterized protein LOC111022300 [Momordica charantia]; e value 9.2e-51; identity 52.38%
Query:  DHFPAALTCCSHLNKTIGNIKSTLSPTQLNLFRQTCFGPLIDVNVIFNGQLVHYILLREVEDNRADVMSFKLLGQKVSFGKREFDLITGLRHSFRPMRRV
        D FP  LT  +H +KT   +K  L+PTQ+++FRQTCFGP++D++V+FNG L+H++LLREVE+ R D++SF L G++VSFGKREFDLITGL  S+R +R  
Subjt:  DHFPAALTCCSHLNKTIGNIKSTLSPTQLNLFRQTCFGPLIDVNVIFNGQLVHYILLREVEDNRADVMSFKLLGQKVSFGKREFDLITGLRHSFRPMRRV

Query:  IEGPPNRLQRLYFRENVGMKVEELDKSFPTLQFENDEDAVKTASFYFFELALMGRERKQQVDASTLNLMDDWVAFCNEDWSTIVFDKTI
         + P  RL+  YF+++V +K  EL+K F    F +DEDAVK    YF ELA+MG+ERKQ +DA+ L ++D W  FCN DWS+++F++T+
Subjt:  IEGPPNRLQRLYFRENVGMKVEELDKSFPTLQFENDEDAVKTASFYFFELALMGRERKQQVDASTLNLMDDWVAFCNEDWSTIVFDKTI

XP_022157020.1 uncharacterized protein LOC111023847 [Momordica charantia]; e value 6.2e-63; identity 44%
Query:  MEITEKIPFADHFPAALTCCSHLNKTIGNIKSTLSPTQLNLFRQTCFGPLIDVNVIFNGQLVHYILLREVEDNRADVMSFKLLGQKVSFGKREFDLITGL
        M +T KI   D FPAAL+  +H+ KT   +K+ L+P+QL++F QTCFGP++ +NV+FNG L+H++LLREVE+ + D++SF L G +VSFGKREFDLITGL
Subjt:  MEITEKIPFADHFPAALTCCSHLNKTIGNIKSTLSPTQLNLFRQTCFGPLIDVNVIFNGQLVHYILLREVEDNRADVMSFKLLGQKVSFGKREFDLITGL

Query:  RHSFRPMRRVIEGPPN-RLQRLYFRENVGMKVEELDKSFPTLQFENDEDAVKTASFYFFELALMGRERKQQVDASTLNLMDDWVAFCNEDWSTIVFDKTI
        RH+   M RV E   N RL+ LYF++   +K  EL+K F    FENDEDAVK A  YF ELA+MG+ERK ++D S L ++D W  FCN DWS+++F++T+
Subjt:  RHSFRPMRRVIEGPPN-RLQRLYFRENVGMKVEELDKSFPTLQFENDEDAVKTASFYFFELALMGRERKQQVDASTLNLMDDWVAFCNEDWSTIVFDKTI

Query:  -------------------------------------KIWAYETVSSLTGRVANRISETTIPRICRWSCSHSSSYTIIADEVFGSRAARVRLSLVSSAEE
                                             ++WAYET+S+L+ RVA R+++  IPR+ RWSC++S ++ ++  EVF +  ++V + L ++  E
Subjt:  -------------------------------------KIWAYETVSSLTGRVANRISETTIPRICRWSCSHSSSYTIIADEVFGSRAARVRLSLVSSAEE

TrEMBL top hits (hit; e value; % identity; alignment)

A0A6J1CZE8 uncharacterized protein LOC111015600; e value 2.0e-51; identity 42.5%
Query:  DHFPAALTCCSHLNKTIGNIKSTLSPTQLNLFRQTCFGPLIDVNVIFNGQLVHYILLREVEDNRADVMSFKLLGQKVSFGKREFDLITGLRHSFRPMRRV
        D FPA LT  +H++KT   IK+ L+PTQL++FRQTCFGP++D+ V+FNG L+H++LL EVE+ R DV+SF L  ++VSFGKREFDLITGL H    +   
Subjt:  DHFPAALTCCSHLNKTIGNIKSTLSPTQLNLFRQTCFGPLIDVNVIFNGQLVHYILLREVEDNRADVMSFKLLGQKVSFGKREFDLITGLRHSFRPMRRV

Query:  IEGPPNRLQRLYFRENVGMKVEELDKSFPTLQFENDEDAVKTASFYFFELALMGRERKQQVDASTLNLMDDWVAFCNEDWSTIVFDKTI---KIWAYETV
        I  P  RL+  YF+++V +K  EL+K F    F +DED VK    YF ELA+MG+ERKQ +D  T+ ++D W AFCN DWS+++FD+TI   K    + +
Subjt:  IEGPPNRLQRLYFRENVGMKVEELDKSFPTLQFENDEDAVKTASFYFFELALMGRERKQQVDASTLNLMDDWVAFCNEDWSTIVFDKTI---KIWAYETV

Query:  SSLTGRVANRISETTIPRICRWSCSHSSSYTIIADEVFGSRAARVRLSLVSSAEESQFMNRVMKPPRAPEPVPEPILEPE
        S+   +     +      +  +         ++A EVF +  ++V+  L+++  E Q M RV+ PP     +P+P   P+
Subjt:  SSLTGRVANRISETTIPRICRWSCSHSSSYTIIADEVFGSRAARVRLSLVSSAEESQFMNRVMKPPRAPEPVPEPILEPE

A0A6J1DJX9 uncharacterized protein LOC111020757; e value 3.4e-67; identity 36.42%
Query:  DHFPAALTCCSHLNKTIGNIKSTLSPTQLNLFRQTCFGPLIDVNVIFNGQLVHYILLREVEDNRADVMSFKLLGQKVSFGKREFDLITGLRHSFRPMRRV
        D FPA LT  +H++KT   IK+ L+PTQL++FRQTCFGP++D++V+FNG L+H++LLREVE+ R DV+SF L G++VSFGKREFDLITGL H    +   
Subjt:  DHFPAALTCCSHLNKTIGNIKSTLSPTQLNLFRQTCFGPLIDVNVIFNGQLVHYILLREVEDNRADVMSFKLLGQKVSFGKREFDLITGLRHSFRPMRRV

Query:  IEGPPNRLQRLYFRENVGMKVEELDKSFPTLQFENDEDAVKTASFYFFELALMGRERKQQVDASTLNLMDDWVAFCNEDWSTIVFDKTI-----------
        I  P  RL+  YF++ V +K  EL+K F    F +DED VK    YF ELA+MG+ERKQ +D + L ++D W  FCN DWS+++FD+TI           
Subjt:  IEGPPNRLQRLYFRENVGMKVEELDKSFPTLQFENDEDAVKTASFYFFELALMGRERKQQVDASTLNLMDDWVAFCNEDWSTIVFDKTI-----------

Query:  --------------------------KIWAYETVSSLTGRVANRISETTIPRICRWSCSHSSSYTIIADEVFGSRAARVRLSLVSSAEESQFMNRVMKPP
                                  ++WAYET+S+L        S+  IPR+ RWSC +S  + ++  EVF +  ++V+  L+++  + Q M RV+ PP
Subjt:  --------------------------KIWAYETVSSLTGRVANRISETTIPRICRWSCSHSSSYTIIADEVFGSRAARVRLSLVSSAEESQFMNRVMKPP

Query:  RA-----PEPVPEPILEPEPEPDQETERQPTVSEAILPVVEEDVVADTKMLDVTEASPEVSNKRGREQDDKNKGKEKENEVEKEDESTKEKKKKKRKKKQ
               P  VP+  + P+P    E    P     +     ED V D   +D  EA P  ++  G E                     K  KK K KK+ 
Subjt:  RA-----PEPVPEPILEPEPEPDQETERQPTVSEAILPVVEEDVVADTKMLDVTEASPEVSNKRGREQDDKNKGKEKENEVEKEDESTKEKKKKKRKKKQ

Query:  TCECSQWLQLMDSRMERMDARMSDMETCLKSITKFLRRLSKGKFVDPEKYFGPKDGSDDEGGPSKGPDDVGGPSKGLDDKGSPSKGPDDDGRPDDNKEGH
            S+ L+ +D+ +  ++ ++ D    LK I  +L++L+KGKF D  KYFG   G DD+G   + PD+   P  G        +  D+D R D++ E  
Subjt:  TCECSQWLQLMDSRMERMDARMSDMETCLKSITKFLRRLSKGKFVDPEKYFGPKDGSDDEGGPSKGPDDVGGPSKGLDDKGSPSKGPDDDGRPDDNKEGH

Query:  EEGGNDRG
        +E  +  G
Subjt:  EEGGNDRG

A0A6J1DL69 uncharacterized protein LOC111022139; e value 4.9e-50; identity 51.5%
Query:  MEITEKIPFADHFPAALTCCSHLNKTIGNIKSTLSPTQLNLFRQTCFGPLIDVNVIFNGQLVHYILLREVEDNRADVMSFKLLGQKVSFGKREFDLITGL
        M++T KI   D FPAAL+  +H+ KT   +K+ L+P+QL++F QTCFG ++ +N +FN  L+H++LLREVE+ R D++SF L G +VSFGKREFDLITGL
Subjt:  MEITEKIPFADHFPAALTCCSHLNKTIGNIKSTLSPTQLNLFRQTCFGPLIDVNVIFNGQLVHYILLREVEDNRADVMSFKLLGQKVSFGKREFDLITGL

Query:  RHSFRPMRRVIEGPPN-RLQRLYFRENVGMKVEELDKSFPTLQFENDEDAVKTASFYFFELALMGRERKQQVDASTLNLMDDWVAFCNEDWSTIVFDKTI
        RH+   M RV +   N RL+ LYF++   +K  EL+K F    F+NDEDAVK A  YF ELA+MG+ERKQ++D S L ++D W  FCN DWS+++ + T+
Subjt:  RHSFRPMRRVIEGPPN-RLQRLYFRENVGMKVEELDKSFPTLQFENDEDAVKTASFYFFELALMGRERKQQVDASTLNLMDDWVAFCNEDWSTIVFDKTI

A0A6J1DM82 uncharacterized protein LOC111022300; e value 4.5e-51; identity 52.38%
Query:  DHFPAALTCCSHLNKTIGNIKSTLSPTQLNLFRQTCFGPLIDVNVIFNGQLVHYILLREVEDNRADVMSFKLLGQKVSFGKREFDLITGLRHSFRPMRRV
        D FP  LT  +H +KT   +K  L+PTQ+++FRQTCFGP++D++V+FNG L+H++LLREVE+ R D++SF L G++VSFGKREFDLITGL  S+R +R  
Subjt:  DHFPAALTCCSHLNKTIGNIKSTLSPTQLNLFRQTCFGPLIDVNVIFNGQLVHYILLREVEDNRADVMSFKLLGQKVSFGKREFDLITGLRHSFRPMRRV

Query:  IEGPPNRLQRLYFRENVGMKVEELDKSFPTLQFENDEDAVKTASFYFFELALMGRERKQQVDASTLNLMDDWVAFCNEDWSTIVFDKTI
         + P  RL+  YF+++V +K  EL+K F    F +DEDAVK    YF ELA+MG+ERKQ +DA+ L ++D W  FCN DWS+++F++T+
Subjt:  IEGPPNRLQRLYFRENVGMKVEELDKSFPTLQFENDEDAVKTASFYFFELALMGRERKQQVDASTLNLMDDWVAFCNEDWSTIVFDKTI

A0A6J1DRZ7 uncharacterized protein LOC111023847; e value 3.0e-63; identity 44%
Query:  MEITEKIPFADHFPAALTCCSHLNKTIGNIKSTLSPTQLNLFRQTCFGPLIDVNVIFNGQLVHYILLREVEDNRADVMSFKLLGQKVSFGKREFDLITGL
        M +T KI   D FPAAL+  +H+ KT   +K+ L+P+QL++F QTCFGP++ +NV+FNG L+H++LLREVE+ + D++SF L G +VSFGKREFDLITGL
Subjt:  MEITEKIPFADHFPAALTCCSHLNKTIGNIKSTLSPTQLNLFRQTCFGPLIDVNVIFNGQLVHYILLREVEDNRADVMSFKLLGQKVSFGKREFDLITGL

Query:  RHSFRPMRRVIEGPPN-RLQRLYFRENVGMKVEELDKSFPTLQFENDEDAVKTASFYFFELALMGRERKQQVDASTLNLMDDWVAFCNEDWSTIVFDKTI
        RH+   M RV E   N RL+ LYF++   +K  EL+K F    FENDEDAVK A  YF ELA+MG+ERK ++D S L ++D W  FCN DWS+++F++T+
Subjt:  RHSFRPMRRVIEGPPN-RLQRLYFRENVGMKVEELDKSFPTLQFENDEDAVKTASFYFFELALMGRERKQQVDASTLNLMDDWVAFCNEDWSTIVFDKTI

Query:  -------------------------------------KIWAYETVSSLTGRVANRISETTIPRICRWSCSHSSSYTIIADEVFGSRAARVRLSLVSSAEE
                                             ++WAYET+S+L+ RVA R+++  IPR+ RWSC++S ++ ++  EVF +  ++V + L ++  E
Subjt:  -------------------------------------KIWAYETVSSLTGRVANRISETTIPRICRWSCSHSSSYTIIADEVFGSRAARVRLSLVSSAEE

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences

CDS sequence
ATGTTCAGTGTTTCTAATAACAATAGTAAGTTACCTTTGTTTGTTGAGTCAGATGCAGCGGAGGTAATCAAGCTTCTCAATTTCGAATCGATTGACATCTCAGACACAGC
TCTGATCGTGAATGAGGTGGTCTTCCTAGCGAATCAATCTGGGGTGGTTTCGTTTGCTAAATGCCCTAGAAAGTGCAATCAAATCGTGCATCTTCTTGCACGGTCAGCGG
CTGGTTTTTCGCCGCAGTTCCTTCTGTCGTTCGATGTCGACGGTGGTGGAGATGGGACCAGGAAACGACCCAGAGGAAGACCAGACCAACGGGTCGGGCCAACGTGGCCC
GACCCGTATTCGCTTGCGCGAGCCGAGTCCGTTTGCCTCCGCTCGGTCCCTACCGCCTCTAGCCGCCCCGGTTCCACTTGGTTCACCCCAAAACGCCTCTGGATTCCTAA
AAACCCTAGGAGCACGAGCATGTATTTATACCCCTCTTCACCACTGAAGAAGGGTTCCGAAAACTCTATTCTCGACTTCTCTCCTTACTCTCTAGCTTCTCAGTTTTCTG
ACTTAGACATTGGAGGCGGTGTGGCCTACACCACACTGGTGTGCAGCGATTCTTGCAGGTCACGTCTTCCCCAGCTTCTAAAAATTCACTGTTGGTGTCACGTGAAGGCC
AGGAAACGACCCAGAGGAAGACCAGACCAACGGGTCGGCCAACGTGGCCCGACCCGTATGGTCGACCTCGGCCTGGCCGAGTCCGTTTGCCTCCGCTTGGTCCCTACCGC
CTCTAGCCGCCCCAGTTCCACTTGGTTCGCGCCAAAACACCTCAGGATTCCTAAAAACCCTAGGAGCACGAGCATGTATTTATACCCCTCTTCACCACTGAAGAAGGATT
CCGAAAACTCTATTCTCGACTTCTCTCTTTACTCTCTAGCTTCTCAGTTTTCTGACTTAGGCATCGGAGGCGCTGTGGCCTACACCACACTGGTGTGCAACGATTCTTAC
TGGTCTTGCAGGTCACGTCTTCCCCAGCTTCTACAAATTCACTATTGGTGTCACGTGAAGGCCATGTCTATTATGGAAATCACCGAGAAAATCCCCTTCGCTGACCATTT
TCCCGCTGCGTTGACATGTTGCTCACACTTGAACAAAACCATTGGCAATATTAAGTCAACCCTAAGTCCAACCCAATTAAACCTTTTTAGGCAAACATGTTTCGGGCCCT
TAATAGATGTAAATGTTATTTTTAACGGCCAATTAGTACATTACATCCTCCTTAGGGAAGTAGAGGATAATAGGGCAGATGTGATGAGTTTTAAATTATTGGGTCAGAAG
GTTTCGTTTGGTAAGAGAGAATTTGACCTCATAACTGGCCTTCGTCATTCCTTTAGACCAATGAGGAGAGTTATAGAGGGTCCTCCAAATAGGCTCCAAAGATTATATTT
TAGGGAGAACGTAGGTATGAAGGTGGAGGAGTTAGATAAGTCGTTTCCGACCCTTCAGTTTGAGAACGACGAAGATGCAGTTAAGACCGCATCGTTTTATTTTTTTGAGT
TGGCTTTGATGGGGAGGGAACGCAAACAACAAGTAGATGCCAGCACTCTAAACTTGATGGATGATTGGGTTGCATTCTGCAACGAGGACTGGAGTACCATCGTGTTTGAC
AAGACAATAAAAATATGGGCTTACGAGACTGTTTCATCTCTCACTGGACGTGTTGCCAATCGTATAAGTGAGACGACCATCCCACGCATTTGTCGATGGTCTTGCTCCCA
CTCCTCATCATACACAATCATTGCTGATGAGGTTTTTGGATCCCGAGCGGCAAGAGTTAGGTTGAGTCTTGTTTCTTCTGCAGAGGAGAGTCAATTCATGAATCGAGTGA
TGAAGCCTCCACGTGCACCAGAGCCAGTGCCTGAGCCAATACTAGAGCCAGAGCCAGAACCAGATCAAGAAACAGAGAGACAACCTACTGTATCTGAGGCTATACTCCCT
GTTGTAGAGGAGGATGTTGTAGCCGATACTAAGATGTTGGATGTCACTGAAGCTTCTCCAGAAGTTTCAAATAAAAGAGGAAGGGAACAAGATGACAAAAACAAAGGGAA
AGAGAAAGAGAATGAAGTAGAAAAAGAGGATGAGAGCACGAAGGAGAAGAAAAAGAAAAAAAGAAAAAAAAAGCAGACTTGTGAATGTAGCCAGTGGCTACAACTTATGG
ATAGCCGGATGGAGAGAATGGATGCTCGCATGTCTGATATGGAGACATGCCTAAAGTCTATTACCAAGTTCTTGCGTCGTCTATCTAAGGGTAAATTCGTGGACCCTGAG
AAGTATTTTGGACCCAAAGATGGTTCGGATGATGAGGGTGGTCCATCAAAAGGACCAGATGACGTCGGTGGTCCATCGAAAGGACTCGATGACAAGGGTAGTCCATCGAA
AGGACCGGATGACGATGGTAGACCAGATGACAATAAGGAAGGACATGAAGAAGGAGGAAACGACAGAGGAGAGGACGAGAAGGAGAATGACGTTGATGAAGCGCACGACA
TAGATCATATTATGGAGTTGGAGTCTCAACCAACCACTGACATATCGTCTCACTCCATTACTGACGTGGAGTATCAACCAAGTATAGACCCAATCGTGTTCATTGTACCT
AAGGCTGAGCCTGTTGAGTTAAGTGATGAAGATGTTGAAGAGGTTGAAGAGGTTAAAGTTATTGGACCGCATGAATTGGTAAAAAGACGGGGAAAGCGAACCCGACAAAT
TTCTTGGAAGCTTCGGTCTCCATGGGCTGACACCAGATCGGACGGCAAAAGGAGGAAAGTTAAGCGATACGATCCCATGCGTGCCATTCCTGAGGAGTACGAGACCAAGT
TCCAGAAATGGCTAGCGTCGAGACGCCAGCCCTTGAGCGTCTCGACGCTGGCATTCCATATCAGAATAGGTGCGAAATCGTCGCAACGTCTCGATGCTGCAACCTTAGCG
TCTCGACGCTGCATAATTTCCTTTTCAGAATATGCGTGTTTAGGGACAGCGTCTCGACGCTACAACGAAAAATCAGTATTTATAGTTCTTTTCGGCTAG

mRNA sequence
ATGTTCAGTGTTTCTAATAACAATAGTAAGTTACCTTTGTTTGTTGAGTCAGATGCAGCGGAGGTAATCAAGCTTCTCAATTTCGAATCGATTGACATCTCAGACACAGC
TCTGATCGTGAATGAGGTGGTCTTCCTAGCGAATCAATCTGGGGTGGTTTCGTTTGCTAAATGCCCTAGAAAGTGCAATCAAATCGTGCATCTTCTTGCACGGTCAGCGG
CTGGTTTTTCGCCGCAGTTCCTTCTGTCGTTCGATGTCGACGGTGGTGGAGATGGGACCAGGAAACGACCCAGAGGAAGACCAGACCAACGGGTCGGGCCAACGTGGCCC
GACCCGTATTCGCTTGCGCGAGCCGAGTCCGTTTGCCTCCGCTCGGTCCCTACCGCCTCTAGCCGCCCCGGTTCCACTTGGTTCACCCCAAAACGCCTCTGGATTCCTAA
AAACCCTAGGAGCACGAGCATGTATTTATACCCCTCTTCACCACTGAAGAAGGGTTCCGAAAACTCTATTCTCGACTTCTCTCCTTACTCTCTAGCTTCTCAGTTTTCTG
ACTTAGACATTGGAGGCGGTGTGGCCTACACCACACTGGTGTGCAGCGATTCTTGCAGGTCACGTCTTCCCCAGCTTCTAAAAATTCACTGTTGGTGTCACGTGAAGGCC
AGGAAACGACCCAGAGGAAGACCAGACCAACGGGTCGGCCAACGTGGCCCGACCCGTATGGTCGACCTCGGCCTGGCCGAGTCCGTTTGCCTCCGCTTGGTCCCTACCGC
CTCTAGCCGCCCCAGTTCCACTTGGTTCGCGCCAAAACACCTCAGGATTCCTAAAAACCCTAGGAGCACGAGCATGTATTTATACCCCTCTTCACCACTGAAGAAGGATT
CCGAAAACTCTATTCTCGACTTCTCTCTTTACTCTCTAGCTTCTCAGTTTTCTGACTTAGGCATCGGAGGCGCTGTGGCCTACACCACACTGGTGTGCAACGATTCTTAC
TGGTCTTGCAGGTCACGTCTTCCCCAGCTTCTACAAATTCACTATTGGTGTCACGTGAAGGCCATGTCTATTATGGAAATCACCGAGAAAATCCCCTTCGCTGACCATTT
TCCCGCTGCGTTGACATGTTGCTCACACTTGAACAAAACCATTGGCAATATTAAGTCAACCCTAAGTCCAACCCAATTAAACCTTTTTAGGCAAACATGTTTCGGGCCCT
TAATAGATGTAAATGTTATTTTTAACGGCCAATTAGTACATTACATCCTCCTTAGGGAAGTAGAGGATAATAGGGCAGATGTGATGAGTTTTAAATTATTGGGTCAGAAG
GTTTCGTTTGGTAAGAGAGAATTTGACCTCATAACTGGCCTTCGTCATTCCTTTAGACCAATGAGGAGAGTTATAGAGGGTCCTCCAAATAGGCTCCAAAGATTATATTT
TAGGGAGAACGTAGGTATGAAGGTGGAGGAGTTAGATAAGTCGTTTCCGACCCTTCAGTTTGAGAACGACGAAGATGCAGTTAAGACCGCATCGTTTTATTTTTTTGAGT
TGGCTTTGATGGGGAGGGAACGCAAACAACAAGTAGATGCCAGCACTCTAAACTTGATGGATGATTGGGTTGCATTCTGCAACGAGGACTGGAGTACCATCGTGTTTGAC
AAGACAATAAAAATATGGGCTTACGAGACTGTTTCATCTCTCACTGGACGTGTTGCCAATCGTATAAGTGAGACGACCATCCCACGCATTTGTCGATGGTCTTGCTCCCA
CTCCTCATCATACACAATCATTGCTGATGAGGTTTTTGGATCCCGAGCGGCAAGAGTTAGGTTGAGTCTTGTTTCTTCTGCAGAGGAGAGTCAATTCATGAATCGAGTGA
TGAAGCCTCCACGTGCACCAGAGCCAGTGCCTGAGCCAATACTAGAGCCAGAGCCAGAACCAGATCAAGAAACAGAGAGACAACCTACTGTATCTGAGGCTATACTCCCT
GTTGTAGAGGAGGATGTTGTAGCCGATACTAAGATGTTGGATGTCACTGAAGCTTCTCCAGAAGTTTCAAATAAAAGAGGAAGGGAACAAGATGACAAAAACAAAGGGAA
AGAGAAAGAGAATGAAGTAGAAAAAGAGGATGAGAGCACGAAGGAGAAGAAAAAGAAAAAAAGAAAAAAAAAGCAGACTTGTGAATGTAGCCAGTGGCTACAACTTATGG
ATAGCCGGATGGAGAGAATGGATGCTCGCATGTCTGATATGGAGACATGCCTAAAGTCTATTACCAAGTTCTTGCGTCGTCTATCTAAGGGTAAATTCGTGGACCCTGAG
AAGTATTTTGGACCCAAAGATGGTTCGGATGATGAGGGTGGTCCATCAAAAGGACCAGATGACGTCGGTGGTCCATCGAAAGGACTCGATGACAAGGGTAGTCCATCGAA
AGGACCGGATGACGATGGTAGACCAGATGACAATAAGGAAGGACATGAAGAAGGAGGAAACGACAGAGGAGAGGACGAGAAGGAGAATGACGTTGATGAAGCGCACGACA
TAGATCATATTATGGAGTTGGAGTCTCAACCAACCACTGACATATCGTCTCACTCCATTACTGACGTGGAGTATCAACCAAGTATAGACCCAATCGTGTTCATTGTACCT
AAGGCTGAGCCTGTTGAGTTAAGTGATGAAGATGTTGAAGAGGTTGAAGAGGTTAAAGTTATTGGACCGCATGAATTGGTAAAAAGACGGGGAAAGCGAACCCGACAAAT
TTCTTGGAAGCTTCGGTCTCCATGGGCTGACACCAGATCGGACGGCAAAAGGAGGAAAGTTAAGCGATACGATCCCATGCGTGCCATTCCTGAGGAGTACGAGACCAAGT
TCCAGAAATGGCTAGCGTCGAGACGCCAGCCCTTGAGCGTCTCGACGCTGGCATTCCATATCAGAATAGGTGCGAAATCGTCGCAACGTCTCGATGCTGCAACCTTAGCG
TCTCGACGCTGCATAATTTCCTTTTCAGAATATGCGTGTTTAGGGACAGCGTCTCGACGCTACAACGAAAAATCAGTATTTATAGTTCTTTTCGGCTAG

Protein sequence
MFSVSNNNSKLPLFVESDAAEVIKLLNFESIDISDTALIVNEVVFLANQSGVVSFAKCPRKCNQIVHLLARSAAGFSPQFLLSFDVDGGGDGTRKRPRGRPDQRVGPTWP
DPYSLARAESVCLRSVPTASSRPGSTWFTPKRLWIPKNPRSTSMYLYPSSPLKKGSENSILDFSPYSLASQFSDLDIGGGVAYTTLVCSDSCRSRLPQLLKIHCWCHVKA
RKRPRGRPDQRVGQRGPTRMVDLGLAESVCLRLVPTASSRPSSTWFAPKHLRIPKNPRSTSMYLYPSSPLKKDSENSILDFSLYSLASQFSDLGIGGAVAYTTLVCNDSY
WSCRSRLPQLLQIHYWCHVKAMSIMEITEKIPFADHFPAALTCCSHLNKTIGNIKSTLSPTQLNLFRQTCFGPLIDVNVIFNGQLVHYILLREVEDNRADVMSFKLLGQK
VSFGKREFDLITGLRHSFRPMRRVIEGPPNRLQRLYFRENVGMKVEELDKSFPTLQFENDEDAVKTASFYFFELALMGRERKQQVDASTLNLMDDWVAFCNEDWSTIVFD
KTIKIWAYETVSSLTGRVANRISETTIPRICRWSCSHSSSYTIIADEVFGSRAARVRLSLVSSAEESQFMNRVMKPPRAPEPVPEPILEPEPEPDQETERQPTVSEAILP
VVEEDVVADTKMLDVTEASPEVSNKRGREQDDKNKGKEKENEVEKEDESTKEKKKKKRKKKQTCECSQWLQLMDSRMERMDARMSDMETCLKSITKFLRRLSKGKFVDPE
KYFGPKDGSDDEGGPSKGPDDVGGPSKGLDDKGSPSKGPDDDGRPDDNKEGHEEGGNDRGEDEKENDVDEAHDIDHIMELESQPTTDISSHSITDVEYQPSIDPIVFIVP
KAEPVELSDEDVEEVEEVKVIGPHELVKRRGKRTRQISWKLRSPWADTRSDGKRRKVKRYDPMRAIPEEYETKFQKWLASRRQPLSVSTLAFHIRIGAKSSQRLDAATLA
SRRCIISFSEYACLGTASRRYNEKSVFIVLFG
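
The protein sequence above should correspond to the CDS sequence read codon by codon. As a rough sketch of how that relationship can be checked, the Python below translates a nucleotide string using the standard genetic code. The helper name `translate` is hypothetical, and the use of the standard code (rather than any organelle-specific table) is an assumption for this nuclear gene model. Only the first 30 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons ordered by first, second and third
# base over T, C, A, G. '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Trailing bases that do not form a complete codon are ignored, and
    codons containing characters other than A, C, G, T become 'X'.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 nucleotides of the CDS sequence listed above.
    print(translate("ATGTTCAGTGTTTCTAATAACAATAGTAAG"))
```

Running the same function over the full CDS and comparing the result with the protein sequence is a simple consistency check on a downloaded record.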