; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg037125 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg037125
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationscaffold1:50521165..50536283
RNA-Seq ExpressionSpg037125
SyntenySpg037125
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142767.1 uncharacterized protein LOC111012805 [Momordica charantia]1.4e-3843.65Show/hide
Query:  GLEDERLLNSIGKSQPRTYAELVSRAQKYMSAEELLKSKKTEREHKRSSSSNYDSKKDKRQRTDEEGRGRADHGRGRA---HPYGKFEKYTPTAVPQEQV
        G+ DE L  S GK  P T++E +SRAQ+YMSA E   SK+          SN      KR+R+ ++ +G     R R+    P  KFEKYT T VP EQV
Subjt:  GLEDERLLNSIGKSQPRTYAELVSRAQKYMSAEELLKSKKTEREHKRSSSSNYDSKKDKRQRTDEEGRGRADHGRGRA---HPYGKFEKYTPTAVPQEQV

Query:  LMEIRNTGLLKFLGRMKSSANRRDKSQYCLFHRDHGHSTRNCIQLKDEIEALIQNGYLKEFVGEPRAEADQGWLRPSLTKDGRDKEEPLREIRTIFGGLA
        LMEI+N  LLK+  RM +S+ +R K +YCLFH DHGH+T++C  LK+E+E LI  GYLKE+V EP+A           T++G   + P REIRTI GG  
Subjt:  LMEIRNTGLLKFLGRMKSSANRRDKSQYCLFHRDHGHSTRNCIQLKDEIEALIQNGYLKEFVGEPRAEADQGWLRPSLTKDGRDKEEPLREIRTIFGGLA

Query:  GGGSSRKRKAIVREARSEPEYRGEPGLPSLPRLIQIEIMRAWTTESTSPHPD
           S RKRK  VREAR+  E      + +  R + IE      T    PH D
Subjt:  GGGSSRKRKAIVREARSEPEYRGEPGLPSLPRLIQIEIMRAWTTESTSPHPD

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]8.0e-3432.69Show/hide
Query:  LQDERLLNSIGESQPRTYVEFMTRAQRYISAEELLKSKQEERESRGMSTPDWRREDKGKRHQVEGRGRSRPEHSSANGRGRPEAKELQGRAEPKSRYDRY
        L DE L   +GE  P T+VE + +A++ I  +ELL++K    E +     D ++  + KR + + + R +   SSA+   R E + L+        Y+RY
Subjt:  LQDERLLNSIGESQPRTYVEFMTRAQRYISAEELLKSKQEERESRGMSTPDWRREDKGKRHQVEGRGRSRPEHSSANGRGRPEAKELQGRAEPKSRYDRY

Query:  TPLTASLEQVLTAIQDT---NLLKRPEKLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGGDRNKRPLPADQGKGGANPPLE--
        T  T  + ++LT I+++    LLKRPEKLR D ++RN+ KYC FH DHGH T  C +L+ +IE LI++GY K+FVG  R+      ++ K    PP    
Subjt:  TPLTASLEQVLTAIQDT---NLLKRPEKLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGGDRNKRPLPADQGKGGANPPLE--

Query:  ----IRTILGGPSGGESGRKRKTAVREAQQE-----------------PDGQGMYSLHLD----------------------------------ENSPKL
            I TI GGP+GG+SG KRK   REA++E                  D +G++  H D                                  + +   
Subjt:  ----IRTILGGPSGGESGRKRKTAVREAQQE-----------------PDGQGMYSLHLD----------------------------------ENSPKL

Query:  EFTEKEAKG---------------AVASTYHQILKFPTEEGVGAVCGEQRMSRECYFMALR
        EF   + +                AV ST HQ+LK+ T   VG V GEQ+ SRECY  AL+
Subjt:  EFTEKEAKG---------------AVASTYHQILKFPTEEGVGAVCGEQRMSRECYFMALR

XP_022158830.1 uncharacterized protein LOC111025293 [Momordica charantia]8.9e-4143.75Show/hide
Query:  TGLEDERLLNSIGKSQPRTYAELVSRAQKYMSAEELLKS------KKTEREHKRSSSSNYDSKKDKRQRTDEEGRGRADHGRGRAHPYGKFEKYTPTAVP
        +G+ DE L  S GK  P T++E +SRAQ+YMSA E   S      K+T+ + +RS      S+ +KR R+ ++             P  KFEKYTPT VP
Subjt:  TGLEDERLLNSIGKSQPRTYAELVSRAQKYMSAEELLKS------KKTEREHKRSSSSNYDSKKDKRQRTDEEGRGRADHGRGRAHPYGKFEKYTPTAVP

Query:  QEQVLMEIRNTGLLKFLGRMKSSANRRDKSQYCLFHRDHGHSTRNCIQLKDEIEALIQNGYLKEFVGEPRAEADQGWLRPSLTKDGRDKEEPLREIRTIF
         EQVLMEI++  LLK+  RMK+S+ +R K +YCLFHRDHGH+T++C  LK+E+E LI+ GYLKE+V EP+A           T++G   + P REIRTI 
Subjt:  QEQVLMEIRNTGLLKFLGRMKSSANRRDKSQYCLFHRDHGHSTRNCIQLKDEIEALIQNGYLKEFVGEPRAEADQGWLRPSLTKDGRDKEEPLREIRTIF

Query:  GGLAGGGSSRKRKAIVREARSEPEYRGEPGLPSLPRLIQIEIMRAWTTESTSPHPD
        GG     S RKRKA VREAR+  E        +  R + IE      T    PH D
Subjt:  GGLAGGGSSRKRKAIVREARSEPEYRGEPGLPSLPRLIQIEIMRAWTTESTSPHPD

XP_022159192.1 uncharacterized protein LOC111025612 [Momordica charantia]2.0e-3747.09Show/hide
Query:  TGLEDERLLNSIGKSQPRTYAELVSRAQKYMSAEEL------LKSKKTEREHKRSSSSNYDSKKDKRQRTDEEGRGRADHGRGRAHPYGKFEKYTPTAVP
        +G++DE L  S GK  P T+AE +SRAQKYMSAEE       L+ K+ +++ +RS      SK +K+ R      G+ D  R       KFEKYTPT VP
Subjt:  TGLEDERLLNSIGKSQPRTYAELVSRAQKYMSAEEL------LKSKKTEREHKRSSSSNYDSKKDKRQRTDEEGRGRADHGRGRAHPYGKFEKYTPTAVP

Query:  QEQVLMEIRNTGLLKFLGRMKSSANRRDKSQYCLFHRDHGHSTRNCIQLKDEIEALIQNGYLKEFVGEPRAEADQGWLRPSLTKDGRDKEEPLREIRTIF
         EQVLMEI++  LLK+   MK+  N+R K +YCLFHRDHGH+T +C  LK+E+E LI+ GYLKE+V + +A  ++            D + P REIRTI 
Subjt:  QEQVLMEIRNTGLLKFLGRMKSSANRRDKSQYCLFHRDHGHSTRNCIQLKDEIEALIQNGYLKEFVGEPRAEADQGWLRPSLTKDGRDKEEPLREIRTIF

Query:  GGLAGGGSSRKRKAIVREARSEP
        GG     S RKRKA VREAR  P
Subjt:  GGLAGGGSSRKRKAIVREARSEP

XP_022159368.1 uncharacterized protein LOC111025785 [Momordica charantia]2.4e-3842.19Show/hide
Query:  TGLEDERLLNSIGKSQPRTYAELVSRAQKYMSAEELLKS------KKTEREHKRSSSSNYDSKKDKRQRTDEEGRGRADHGRGRAHPYGKFEKYTPTAVP
        +G+ DE L  S GK  P T++E +SRAQ+YMSA E   S      K+T+++ +RS      S+ +KR R+ ++             P  KFEKYTPT VP
Subjt:  TGLEDERLLNSIGKSQPRTYAELVSRAQKYMSAEELLKS------KKTEREHKRSSSSNYDSKKDKRQRTDEEGRGRADHGRGRAHPYGKFEKYTPTAVP

Query:  QEQVLMEIRNTGLLKFLGRMKSSANRRDKSQYCLFHRDHGHSTRNCIQLKDEIEALIQNGYLKEFVGEPRAEADQGWLRPSLTKDGRDKEEPLREIRTIF
         EQVLMEI++  LLK+  RMK  + +R K +YCLFHRDH H+T++   LK+E+E LI+ GYL+E+V EP+A           T++G  ++ P REIRTI 
Subjt:  QEQVLMEIRNTGLLKFLGRMKSSANRRDKSQYCLFHRDHGHSTRNCIQLKDEIEALIQNGYLKEFVGEPRAEADQGWLRPSLTKDGRDKEEPLREIRTIF

Query:  GGLAGGGSSRKRKAIVREARSEPEYRGEPGLPSLPRLIQIEIMRAWTTESTSPHPD
        GG     S+RKRKA VREAR   E        +  R + IE      T    PH D
Subjt:  GGLAGGGSSRKRKAIVREARSEPEYRGEPGLPSLPRLIQIEIMRAWTTESTSPHPD

TrEMBL top hitse value%identityAlignment
A0A6J1CNT2 uncharacterized protein LOC1110128056.9e-3943.65Show/hide
Query:  GLEDERLLNSIGKSQPRTYAELVSRAQKYMSAEELLKSKKTEREHKRSSSSNYDSKKDKRQRTDEEGRGRADHGRGRA---HPYGKFEKYTPTAVPQEQV
        G+ DE L  S GK  P T++E +SRAQ+YMSA E   SK+          SN      KR+R+ ++ +G     R R+    P  KFEKYT T VP EQV
Subjt:  GLEDERLLNSIGKSQPRTYAELVSRAQKYMSAEELLKSKKTEREHKRSSSSNYDSKKDKRQRTDEEGRGRADHGRGRA---HPYGKFEKYTPTAVPQEQV

Query:  LMEIRNTGLLKFLGRMKSSANRRDKSQYCLFHRDHGHSTRNCIQLKDEIEALIQNGYLKEFVGEPRAEADQGWLRPSLTKDGRDKEEPLREIRTIFGGLA
        LMEI+N  LLK+  RM +S+ +R K +YCLFH DHGH+T++C  LK+E+E LI  GYLKE+V EP+A           T++G   + P REIRTI GG  
Subjt:  LMEIRNTGLLKFLGRMKSSANRRDKSQYCLFHRDHGHSTRNCIQLKDEIEALIQNGYLKEFVGEPRAEADQGWLRPSLTKDGRDKEEPLREIRTIFGGLA

Query:  GGGSSRKRKAIVREARSEPEYRGEPGLPSLPRLIQIEIMRAWTTESTSPHPD
           S RKRK  VREAR+  E      + +  R + IE      T    PH D
Subjt:  GGGSSRKRKAIVREARSEPEYRGEPGLPSLPRLIQIEIMRAWTTESTSPHPD

A0A6J1DWY0 uncharacterized protein LOC1110252934.3e-4143.75Show/hide
Query:  TGLEDERLLNSIGKSQPRTYAELVSRAQKYMSAEELLKS------KKTEREHKRSSSSNYDSKKDKRQRTDEEGRGRADHGRGRAHPYGKFEKYTPTAVP
        +G+ DE L  S GK  P T++E +SRAQ+YMSA E   S      K+T+ + +RS      S+ +KR R+ ++             P  KFEKYTPT VP
Subjt:  TGLEDERLLNSIGKSQPRTYAELVSRAQKYMSAEELLKS------KKTEREHKRSSSSNYDSKKDKRQRTDEEGRGRADHGRGRAHPYGKFEKYTPTAVP

Query:  QEQVLMEIRNTGLLKFLGRMKSSANRRDKSQYCLFHRDHGHSTRNCIQLKDEIEALIQNGYLKEFVGEPRAEADQGWLRPSLTKDGRDKEEPLREIRTIF
         EQVLMEI++  LLK+  RMK+S+ +R K +YCLFHRDHGH+T++C  LK+E+E LI+ GYLKE+V EP+A           T++G   + P REIRTI 
Subjt:  QEQVLMEIRNTGLLKFLGRMKSSANRRDKSQYCLFHRDHGHSTRNCIQLKDEIEALIQNGYLKEFVGEPRAEADQGWLRPSLTKDGRDKEEPLREIRTIF

Query:  GGLAGGGSSRKRKAIVREARSEPEYRGEPGLPSLPRLIQIEIMRAWTTESTSPHPD
        GG     S RKRKA VREAR+  E        +  R + IE      T    PH D
Subjt:  GGLAGGGSSRKRKAIVREARSEPEYRGEPGLPSLPRLIQIEIMRAWTTESTSPHPD

A0A6J1DYL6 uncharacterized protein LOC1110257851.2e-3842.19Show/hide
Query:  TGLEDERLLNSIGKSQPRTYAELVSRAQKYMSAEELLKS------KKTEREHKRSSSSNYDSKKDKRQRTDEEGRGRADHGRGRAHPYGKFEKYTPTAVP
        +G+ DE L  S GK  P T++E +SRAQ+YMSA E   S      K+T+++ +RS      S+ +KR R+ ++             P  KFEKYTPT VP
Subjt:  TGLEDERLLNSIGKSQPRTYAELVSRAQKYMSAEELLKS------KKTEREHKRSSSSNYDSKKDKRQRTDEEGRGRADHGRGRAHPYGKFEKYTPTAVP

Query:  QEQVLMEIRNTGLLKFLGRMKSSANRRDKSQYCLFHRDHGHSTRNCIQLKDEIEALIQNGYLKEFVGEPRAEADQGWLRPSLTKDGRDKEEPLREIRTIF
         EQVLMEI++  LLK+  RMK  + +R K +YCLFHRDH H+T++   LK+E+E LI+ GYL+E+V EP+A           T++G  ++ P REIRTI 
Subjt:  QEQVLMEIRNTGLLKFLGRMKSSANRRDKSQYCLFHRDHGHSTRNCIQLKDEIEALIQNGYLKEFVGEPRAEADQGWLRPSLTKDGRDKEEPLREIRTIF

Query:  GGLAGGGSSRKRKAIVREARSEPEYRGEPGLPSLPRLIQIEIMRAWTTESTSPHPD
        GG     S+RKRKA VREAR   E        +  R + IE      T    PH D
Subjt:  GGLAGGGSSRKRKAIVREARSEPEYRGEPGLPSLPRLIQIEIMRAWTTESTSPHPD

A0A6J1DZ52 uncharacterized protein LOC1110256129.9e-3847.09Show/hide
Query:  TGLEDERLLNSIGKSQPRTYAELVSRAQKYMSAEEL------LKSKKTEREHKRSSSSNYDSKKDKRQRTDEEGRGRADHGRGRAHPYGKFEKYTPTAVP
        +G++DE L  S GK  P T+AE +SRAQKYMSAEE       L+ K+ +++ +RS      SK +K+ R      G+ D  R       KFEKYTPT VP
Subjt:  TGLEDERLLNSIGKSQPRTYAELVSRAQKYMSAEEL------LKSKKTEREHKRSSSSNYDSKKDKRQRTDEEGRGRADHGRGRAHPYGKFEKYTPTAVP

Query:  QEQVLMEIRNTGLLKFLGRMKSSANRRDKSQYCLFHRDHGHSTRNCIQLKDEIEALIQNGYLKEFVGEPRAEADQGWLRPSLTKDGRDKEEPLREIRTIF
         EQVLMEI++  LLK+   MK+  N+R K +YCLFHRDHGH+T +C  LK+E+E LI+ GYLKE+V + +A  ++            D + P REIRTI 
Subjt:  QEQVLMEIRNTGLLKFLGRMKSSANRRDKSQYCLFHRDHGHSTRNCIQLKDEIEALIQNGYLKEFVGEPRAEADQGWLRPSLTKDGRDKEEPLREIRTIF

Query:  GGLAGGGSSRKRKAIVREARSEP
        GG     S RKRKA VREAR  P
Subjt:  GGLAGGGSSRKRKAIVREARSEP

A0A6J1DZB9 uncharacterized protein LOC1110249043.9e-3432.69Show/hide
Query:  LQDERLLNSIGESQPRTYVEFMTRAQRYISAEELLKSKQEERESRGMSTPDWRREDKGKRHQVEGRGRSRPEHSSANGRGRPEAKELQGRAEPKSRYDRY
        L DE L   +GE  P T+VE + +A++ I  +ELL++K    E +     D ++  + KR + + + R +   SSA+   R E + L+        Y+RY
Subjt:  LQDERLLNSIGESQPRTYVEFMTRAQRYISAEELLKSKQEERESRGMSTPDWRREDKGKRHQVEGRGRSRPEHSSANGRGRPEAKELQGRAEPKSRYDRY

Query:  TPLTASLEQVLTAIQDT---NLLKRPEKLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGGDRNKRPLPADQGKGGANPPLE--
        T  T  + ++LT I+++    LLKRPEKLR D ++RN+ KYC FH DHGH T  C +L+ +IE LI++GY K+FVG  R+      ++ K    PP    
Subjt:  TPLTASLEQVLTAIQDT---NLLKRPEKLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGGDRNKRPLPADQGKGGANPPLE--

Query:  ----IRTILGGPSGGESGRKRKTAVREAQQE-----------------PDGQGMYSLHLD----------------------------------ENSPKL
            I TI GGP+GG+SG KRK   REA++E                  D +G++  H D                                  + +   
Subjt:  ----IRTILGGPSGGESGRKRKTAVREAQQE-----------------PDGQGMYSLHLD----------------------------------ENSPKL

Query:  EFTEKEAKG---------------AVASTYHQILKFPTEEGVGAVCGEQRMSRECYFMALR
        EF   + +                AV ST HQ+LK+ T   VG V GEQ+ SRECY  AL+
Subjt:  EFTEKEAKG---------------AVASTYHQILKFPTEEGVGAVCGEQRMSRECYFMALR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCACGCCACAGTCAGAAGAGCTCCGCTTCAGTCACGAATCTGGCGTAATCGAACTCCGACGGCGGTGCAGCCCTCCCCCTCGTGCGCGATTCTCCCTCTCCCCTTC
TCTTCGTGTTCGCGTCGCCGGCCAGCACCCCAGCCCGCGCCGCATCCTTCTTCCTTTCTCTCCGTGTTGTAGCACCGCCACAGCCGTCGTCGGAGCTCGCGCAGCCGCCG
CCCCTGCTCGAGCGCCGCCCTTCCTCTCCGTCGGCATCTCTCTCTCTCCGCGTGCGAATCCCTCCCTTCCCCTCTGTGCGTTTTCAGCCTTAAAAGCTCGTGGATCTCGT
GAGGTTGCACCTTGGAGCTCGTTCTCTCACGTTTTTACTTCTGTTCAACGGCGTTCTTGGGTGTTTCGGCGTCGCTTAGTGTTTTCGCGCCGTAAAAGTGTTCGATTGAG
TTCGAATCACTTAAAAATTGAATACCCATTGTCCAAGGAGTATTCTAGCGCGTTGTTCGAGGAGTTCATCTTGATCTTTGAGTTGTTTAAAGAGTTCTTAAGGAGCGCTC
GTAAGCTTTTGGTGCTAACGGATATTATGAGCCCGGTGGTGCTGACGAGGAGGCGCACGAGGCGTGACTCCACTGGCCTAGGCGTTTTTGCAATTTTGGACCACCCTGAT
GTACAAGGAGCTGACGAGGACAACCGGGGAGAAATCAGGCTGAGAGGTGGACCCAAGAGGCGAAACCGGCAAGTGGGACGGGCCAAGACCGAAGGGGTCGAGTTTTTGGC
CCGACCCCCTGTTCGGCCTCGGCCATGGGCCGAGGCCGAGCCCGTCCGACTCCGCTTGGTCCCTACCGCCTTTGGCCGCCCCGTTGGCGCCGTCTGTGGGAAAGAAAGCT
TGCCAAATCTGTGCACCGGTCATTCCATGAGTAAGGAGATGGAGAAGAAAAACCAGGACGTGAACGCAGAGCATTCGGATGGTGACCACCACCAACGGAGGTCACGGGAA
GAAGGCCGAGGCCGACCTCAGACCGAATCTCCTCGACCTCGGTCTCCACTGCCCTCATCCCGAGAGAAGCAAGCTGATTTAAAATTTGTTGCTCTCGAAAACAAAGTAAG
TGCGATGGATCATAATTTGTCCAGGATACTTCGTATCTTGGATAAACCTGGTCCTAGCACTAAAACCCCTGATGAGAGGTTGGTTAGGGATCCGAGGAAGGGGAAGGAGC
CCATGGAGCACACTGCAGAATCAAAGACGAGGTTGAAGGGAAAGAAGACTGACAACATGACCAGCAAGGTCAGGGGGCTCAAACCTACTGATCGTACGATTTTGAGGAGC
CCAGAGTCAAGCACACTGAAGGGACGTGACTGTACAGTTTCTACCCCAAGCTACGGTCATACTAATACAGACCTGAGAAATCTGATCGAGGAGAAGCGCAGAACCGGACT
GGAAGACGAAAGACTACTCAACTCAATAGGTAAGAGCCAGCCTCGAACCTATGCAGAGCTCGTCTCTCGGGCACAGAAATACATGAGCGCAGAGGAGTTGCTCAAGTCAA
AGAAGACGGAACGAGAGCACAAGAGGTCTTCTTCATCTAACTACGACAGTAAGAAGGACAAAAGGCAGCGGACTGACGAGGAAGGCCGAGGCCGAGCAGACCATGGTCGA
GGCCGAGCACATCCCTATGGTAAGTTCGAGAAATATACGCCAACAGCTGTTCCACAGGAGCAAGTACTGATGGAGATCCGAAATACGGGTCTCCTGAAATTCTTAGGGAG
GATGAAGTCAAGTGCCAATAGAAGAGACAAGAGCCAGTATTGCCTTTTCCACCGAGACCATGGACATTCAACCAGGAATTGTATTCAGTTGAAGGATGAGATCGAAGCAC
TGATCCAGAATGGGTATTTGAAAGAGTTCGTCGGTGAGCCTAGGGCTGAGGCCGACCAGGGATGGCTGAGGCCGAGCCTTACCAAAGATGGCCGAGACAAAGAAGAACCC
CTACGAGAGATCAGAACCATCTTTGGAGGACTAGCTGGAGGAGGTTCGAGTAGGAAGAGGAAAGCTATTGTCAGGGAAGCAAGGTCCGAACCAGAATATCGAGGCGAGCC
TGGACTACCGAGTCTACCTCGCCTCATCCAGATTGAGATTATGCGAGCCTGGACTACCGAGTCTACCTCGCCTCATCCAGACTTAAGATTGCATGAGCCTGGACCACCGA
GTCTACTTCACCTCGTCTGGCTCACCTGGCTGAGGCAGAAACCGAGCACCTCTTGCCGAGGCCGAGCACAAACTTTGAAGGATCGTGAGATCTTCTATTTTTTCTACATA
TTACTTGGTCCAGGGGCAGAGCGCCTGGTGGGGGTTCGGGGGCAACGCCCCCTATGCAAAATTACAAAATGGAATGCTCGGCCTCATCCCGAGGCCGAGGCCGAGGCTGT
GACATTTACACGAATGGCAATTTTGGACCACCCCGATGTACAAAGAGTTGACGAGGACAACCGGGGAGAAATCAGGCTGAGAGGTGGACCCAAGAGGCGAAACCGGCAAG
TGGGACGAGCCAAGACCAAAGGGGTCGGGTTTTTGGCCCGACCCCCTGCTCGGCCTCGGTCGAGCCCGTCCGACTCCGCTTGGTCCCTACCGCCTTTGGCGCATCGGAGG
CGGTGTGGCTTCACCACACCGGTGTGCAGGTTCTCTCTTTTGCAGGCCACGTCTTCCCCGCTCTCAAACAAATTCACTGTCGATTATCACGTGGAGCGAAGGATGGGCCC
AGTTGGCGCCGTCTGTGGGGAAGACGTTGACAAGCAAACGCGACCGGTAAGTACAATGGAGAACGAGAATCGACCGACCACGGGCGAGCCGAGTTCCCTAATTCGTCTCC
AGGCCCAAGAGACTGAGATCGCAGAGATTAAGGGGAGGATGGACGATATGGGACAGAACTTGACTAAAATCCTCACTCTGTTGAAAAAGCCCGAGCCTGCCAGGCACGAG
GAAGAGCATCCGCGCAGAGACCCAAAAAAGGGTAAGGGGATAGCCAACGAGGAGGTAGGAGATTCGAAAAGTGTGACCAGTCGGATGCCACCTCCCGGGGATGACCGGGT
CCAGAAGGAGGCCGGGCCGAGCCGCAAAAAGGCCCGCAGGAGTTCTCCGCTAACCCAGGCACCAGGTTTACAGGACGAAAGGCTACTCAATTCAATCGGAGAAAGCCAGC
CACGAACGTACGTAGAGTTCATGACTCGGGCTCAAAGATATATAAGCGCCGAGGAGCTGTTGAAATCCAAGCAGGAAGAAAGAGAGAGTCGGGGAATGTCAACACCGGAC
TGGCGCCGCGAGGACAAGGGAAAGAGGCACCAGGTCGAGGGAAGAGGTCGGAGCCGACCTGAGCACTCCTCGGCCAATGGCCGAGGCCGACCAGAAGCCAAGGAGCTGCA
AGGTCGAGCAGAGCCTAAGTCCAGGTACGACAGGTATACGCCACTGACAGCTTCGCTTGAACAGGTCTTGACCGCAATACAGGACACGAATCTGTTGAAACGCCCGGAAA
AGTTGAGATCGGACCCAGACAGGAGAAACCGAAACAAATATTGCATGTTCCACGGGGATCACGGCCATACAACCCGAGAGTGCATCCAGTTGCGGGATGAGATAGAAACT
CTGATCCGAGAAGGTTACCTCAAGGAGTTCGTGGGGGGCGATAGAAACAAGAGGCCGCTACCAGCAGACCAAGGCAAGGGTGGTGCCAACCCGCCGCTTGAGATTCGAAC
CATTTTAGGGGGACCCTCAGGAGGAGAGTCAGGAAGGAAGCGGAAGACTGCTGTTCGAGAGGCACAGCAAGAGCCCGACGGGCAAGGTATGTACTCACTCCATCTTGATG
AAAACTCACCAAAGTTAGAGTTTACAGAAAAGGAGGCCAAAGGAGCTGTGGCTTCAACCTACCACCAAATCCTGAAGTTCCCTACGGAAGAAGGTGTAGGAGCAGTGTGC
GGCGAACAGAGAATGTCAAGGGAATGCTACTTCATGGCACTCAGGAACATTGACAGAAAGATTCAGGCGACACCAGCCTCGGGATATGGCCGAGGCCGAGAAGTCGAAGG
AGCAAGCTTTCCCCTCCCAATGGAGTATTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCCACGCCACAGTCAGAAGAGCTCCGCTTCAGTCACGAATCTGGCGTAATCGAACTCCGACGGCGGTGCAGCCCTCCCCCTCGTGCGCGATTCTCCCTCTCCCCTTC
TCTTCGTGTTCGCGTCGCCGGCCAGCACCCCAGCCCGCGCCGCATCCTTCTTCCTTTCTCTCCGTGTTGTAGCACCGCCACAGCCGTCGTCGGAGCTCGCGCAGCCGCCG
CCCCTGCTCGAGCGCCGCCCTTCCTCTCCGTCGGCATCTCTCTCTCTCCGCGTGCGAATCCCTCCCTTCCCCTCTGTGCGTTTTCAGCCTTAAAAGCTCGTGGATCTCGT
GAGGTTGCACCTTGGAGCTCGTTCTCTCACGTTTTTACTTCTGTTCAACGGCGTTCTTGGGTGTTTCGGCGTCGCTTAGTGTTTTCGCGCCGTAAAAGTGTTCGATTGAG
TTCGAATCACTTAAAAATTGAATACCCATTGTCCAAGGAGTATTCTAGCGCGTTGTTCGAGGAGTTCATCTTGATCTTTGAGTTGTTTAAAGAGTTCTTAAGGAGCGCTC
GTAAGCTTTTGGTGCTAACGGATATTATGAGCCCGGTGGTGCTGACGAGGAGGCGCACGAGGCGTGACTCCACTGGCCTAGGCGTTTTTGCAATTTTGGACCACCCTGAT
GTACAAGGAGCTGACGAGGACAACCGGGGAGAAATCAGGCTGAGAGGTGGACCCAAGAGGCGAAACCGGCAAGTGGGACGGGCCAAGACCGAAGGGGTCGAGTTTTTGGC
CCGACCCCCTGTTCGGCCTCGGCCATGGGCCGAGGCCGAGCCCGTCCGACTCCGCTTGGTCCCTACCGCCTTTGGCCGCCCCGTTGGCGCCGTCTGTGGGAAAGAAAGCT
TGCCAAATCTGTGCACCGGTCATTCCATGAGTAAGGAGATGGAGAAGAAAAACCAGGACGTGAACGCAGAGCATTCGGATGGTGACCACCACCAACGGAGGTCACGGGAA
GAAGGCCGAGGCCGACCTCAGACCGAATCTCCTCGACCTCGGTCTCCACTGCCCTCATCCCGAGAGAAGCAAGCTGATTTAAAATTTGTTGCTCTCGAAAACAAAGTAAG
TGCGATGGATCATAATTTGTCCAGGATACTTCGTATCTTGGATAAACCTGGTCCTAGCACTAAAACCCCTGATGAGAGGTTGGTTAGGGATCCGAGGAAGGGGAAGGAGC
CCATGGAGCACACTGCAGAATCAAAGACGAGGTTGAAGGGAAAGAAGACTGACAACATGACCAGCAAGGTCAGGGGGCTCAAACCTACTGATCGTACGATTTTGAGGAGC
CCAGAGTCAAGCACACTGAAGGGACGTGACTGTACAGTTTCTACCCCAAGCTACGGTCATACTAATACAGACCTGAGAAATCTGATCGAGGAGAAGCGCAGAACCGGACT
GGAAGACGAAAGACTACTCAACTCAATAGGTAAGAGCCAGCCTCGAACCTATGCAGAGCTCGTCTCTCGGGCACAGAAATACATGAGCGCAGAGGAGTTGCTCAAGTCAA
AGAAGACGGAACGAGAGCACAAGAGGTCTTCTTCATCTAACTACGACAGTAAGAAGGACAAAAGGCAGCGGACTGACGAGGAAGGCCGAGGCCGAGCAGACCATGGTCGA
GGCCGAGCACATCCCTATGGTAAGTTCGAGAAATATACGCCAACAGCTGTTCCACAGGAGCAAGTACTGATGGAGATCCGAAATACGGGTCTCCTGAAATTCTTAGGGAG
GATGAAGTCAAGTGCCAATAGAAGAGACAAGAGCCAGTATTGCCTTTTCCACCGAGACCATGGACATTCAACCAGGAATTGTATTCAGTTGAAGGATGAGATCGAAGCAC
TGATCCAGAATGGGTATTTGAAAGAGTTCGTCGGTGAGCCTAGGGCTGAGGCCGACCAGGGATGGCTGAGGCCGAGCCTTACCAAAGATGGCCGAGACAAAGAAGAACCC
CTACGAGAGATCAGAACCATCTTTGGAGGACTAGCTGGAGGAGGTTCGAGTAGGAAGAGGAAAGCTATTGTCAGGGAAGCAAGGTCCGAACCAGAATATCGAGGCGAGCC
TGGACTACCGAGTCTACCTCGCCTCATCCAGATTGAGATTATGCGAGCCTGGACTACCGAGTCTACCTCGCCTCATCCAGACTTAAGATTGCATGAGCCTGGACCACCGA
GTCTACTTCACCTCGTCTGGCTCACCTGGCTGAGGCAGAAACCGAGCACCTCTTGCCGAGGCCGAGCACAAACTTTGAAGGATCGTGAGATCTTCTATTTTTTCTACATA
TTACTTGGTCCAGGGGCAGAGCGCCTGGTGGGGGTTCGGGGGCAACGCCCCCTATGCAAAATTACAAAATGGAATGCTCGGCCTCATCCCGAGGCCGAGGCCGAGGCTGT
GACATTTACACGAATGGCAATTTTGGACCACCCCGATGTACAAAGAGTTGACGAGGACAACCGGGGAGAAATCAGGCTGAGAGGTGGACCCAAGAGGCGAAACCGGCAAG
TGGGACGAGCCAAGACCAAAGGGGTCGGGTTTTTGGCCCGACCCCCTGCTCGGCCTCGGTCGAGCCCGTCCGACTCCGCTTGGTCCCTACCGCCTTTGGCGCATCGGAGG
CGGTGTGGCTTCACCACACCGGTGTGCAGGTTCTCTCTTTTGCAGGCCACGTCTTCCCCGCTCTCAAACAAATTCACTGTCGATTATCACGTGGAGCGAAGGATGGGCCC
AGTTGGCGCCGTCTGTGGGGAAGACGTTGACAAGCAAACGCGACCGGTAAGTACAATGGAGAACGAGAATCGACCGACCACGGGCGAGCCGAGTTCCCTAATTCGTCTCC
AGGCCCAAGAGACTGAGATCGCAGAGATTAAGGGGAGGATGGACGATATGGGACAGAACTTGACTAAAATCCTCACTCTGTTGAAAAAGCCCGAGCCTGCCAGGCACGAG
GAAGAGCATCCGCGCAGAGACCCAAAAAAGGGTAAGGGGATAGCCAACGAGGAGGTAGGAGATTCGAAAAGTGTGACCAGTCGGATGCCACCTCCCGGGGATGACCGGGT
CCAGAAGGAGGCCGGGCCGAGCCGCAAAAAGGCCCGCAGGAGTTCTCCGCTAACCCAGGCACCAGGTTTACAGGACGAAAGGCTACTCAATTCAATCGGAGAAAGCCAGC
CACGAACGTACGTAGAGTTCATGACTCGGGCTCAAAGATATATAAGCGCCGAGGAGCTGTTGAAATCCAAGCAGGAAGAAAGAGAGAGTCGGGGAATGTCAACACCGGAC
TGGCGCCGCGAGGACAAGGGAAAGAGGCACCAGGTCGAGGGAAGAGGTCGGAGCCGACCTGAGCACTCCTCGGCCAATGGCCGAGGCCGACCAGAAGCCAAGGAGCTGCA
AGGTCGAGCAGAGCCTAAGTCCAGGTACGACAGGTATACGCCACTGACAGCTTCGCTTGAACAGGTCTTGACCGCAATACAGGACACGAATCTGTTGAAACGCCCGGAAA
AGTTGAGATCGGACCCAGACAGGAGAAACCGAAACAAATATTGCATGTTCCACGGGGATCACGGCCATACAACCCGAGAGTGCATCCAGTTGCGGGATGAGATAGAAACT
CTGATCCGAGAAGGTTACCTCAAGGAGTTCGTGGGGGGCGATAGAAACAAGAGGCCGCTACCAGCAGACCAAGGCAAGGGTGGTGCCAACCCGCCGCTTGAGATTCGAAC
CATTTTAGGGGGACCCTCAGGAGGAGAGTCAGGAAGGAAGCGGAAGACTGCTGTTCGAGAGGCACAGCAAGAGCCCGACGGGCAAGGTATGTACTCACTCCATCTTGATG
AAAACTCACCAAAGTTAGAGTTTACAGAAAAGGAGGCCAAAGGAGCTGTGGCTTCAACCTACCACCAAATCCTGAAGTTCCCTACGGAAGAAGGTGTAGGAGCAGTGTGC
GGCGAACAGAGAATGTCAAGGGAATGCTACTTCATGGCACTCAGGAACATTGACAGAAAGATTCAGGCGACACCAGCCTCGGGATATGGCCGAGGCCGAGAAGTCGAAGG
AGCAAGCTTTCCCCTCCCAATGGAGTATTAG
Protein sequenceShow/hide protein sequence
MSTPQSEELRFSHESGVIELRRRCSPPPRARFSLSPSLRVRVAGQHPSPRRILLPFSPCCSTATAVVGARAAAAPARAPPFLSVGISLSPRANPSLPLCAFSALKARGSR
EVAPWSSFSHVFTSVQRRSWVFRRRLVFSRRKSVRLSSNHLKIEYPLSKEYSSALFEEFILIFELFKEFLRSARKLLVLTDIMSPVVLTRRRTRRDSTGLGVFAILDHPD
VQGADEDNRGEIRLRGGPKRRNRQVGRAKTEGVEFLARPPVRPRPWAEAEPVRLRLVPTAFGRPVGAVCGKESLPNLCTGHSMSKEMEKKNQDVNAEHSDGDHHQRRSRE
EGRGRPQTESPRPRSPLPSSREKQADLKFVALENKVSAMDHNLSRILRILDKPGPSTKTPDERLVRDPRKGKEPMEHTAESKTRLKGKKTDNMTSKVRGLKPTDRTILRS
PESSTLKGRDCTVSTPSYGHTNTDLRNLIEEKRRTGLEDERLLNSIGKSQPRTYAELVSRAQKYMSAEELLKSKKTEREHKRSSSSNYDSKKDKRQRTDEEGRGRADHGR
GRAHPYGKFEKYTPTAVPQEQVLMEIRNTGLLKFLGRMKSSANRRDKSQYCLFHRDHGHSTRNCIQLKDEIEALIQNGYLKEFVGEPRAEADQGWLRPSLTKDGRDKEEP
LREIRTIFGGLAGGGSSRKRKAIVREARSEPEYRGEPGLPSLPRLIQIEIMRAWTTESTSPHPDLRLHEPGPPSLLHLVWLTWLRQKPSTSCRGRAQTLKDREIFYFFYI
LLGPGAERLVGVRGQRPLCKITKWNARPHPEAEAEAVTFTRMAILDHPDVQRVDEDNRGEIRLRGGPKRRNRQVGRAKTKGVGFLARPPARPRSSPSDSAWSLPPLAHRR
RCGFTTPVCRFSLLQATSSPLSNKFTVDYHVERRMGPVGAVCGEDVDKQTRPVSTMENENRPTTGEPSSLIRLQAQETEIAEIKGRMDDMGQNLTKILTLLKKPEPARHE
EEHPRRDPKKGKGIANEEVGDSKSVTSRMPPPGDDRVQKEAGPSRKKARRSSPLTQAPGLQDERLLNSIGESQPRTYVEFMTRAQRYISAEELLKSKQEERESRGMSTPD
WRREDKGKRHQVEGRGRSRPEHSSANGRGRPEAKELQGRAEPKSRYDRYTPLTASLEQVLTAIQDTNLLKRPEKLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIET
LIREGYLKEFVGGDRNKRPLPADQGKGGANPPLEIRTILGGPSGGESGRKRKTAVREAQQEPDGQGMYSLHLDENSPKLEFTEKEAKGAVASTYHQILKFPTEEGVGAVC
GEQRMSRECYFMALRNIDRKIQATPASGYGRGREVEGASFPLPMEY