CuGenDBv2

Lag0019131 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0019131
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Ty3-gypsy retrotransposon protein
Genome location: chr5:38929920..38935234
RNA-Seq Expression: Lag0019131
Synteny: Lag0019131
Gene Ontology terms: NA
InterPro domains: NA
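
The genome location in the record above is written as `chromosome:start..end`. A minimal sketch of splitting that string and computing the span follows. The function name `parse_location` is hypothetical, and the length assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(loc):
    """Split a 'chrom:start..end' string into its parts.

    Returns (chrom, start, end, span). The span is computed as
    end - start + 1, which ASSUMES 1-based inclusive coordinates.
    """
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    chrom, start, end = m.group(1), int(m.group(2)), int(m.group(3))
    if end < start:
        raise ValueError("end coordinate is smaller than start")
    return chrom, start, end, end - start + 1

chrom, start, end, span = parse_location("chr5:38929920..38935234")
print(chrom, start, end, span)  # chr5 38929920 38935234 5315
```

Under that assumption the gene spans 5,315 bp on chr5.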


Homology
GenBank top hits (e value, % identity, alignment)
KAA0056121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa] (e value: 7.5e-82; % identity: 57.62)
Query:  MFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-
        M ELTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA+R  +D L+P  R +    ++T 
Subjt:  MFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-

Query:  ------IEESMVVNTTLPKSSSKGK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHL
              I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI H 
Subjt:  ------IEESMVVNTTLPKSSSKGK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHL

Query:  VEGCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKNFHKKEKKNLATSYCID
        VE CFVLK+LILKLA+E KIELD+DEVAQ+N A I+  S   + KD   LQ +R         RS     P++++ +    + +  + +  N  +S    
Subjt:  VEGCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKNFHKKEKKNLATSYCID

Query:  VEEVDNSKKSEQRTSVFDRIKPPTTRPS
         +EV+NS +  QRTSVFDRIKP TTR S
Subjt:  VEEVDNSKKSEQRTSVFDRIKPPTTRPS

KAA0058295.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa] (e value: 6.0e-79; % identity: 67.37)
Query:  MFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-
        M ELTNT+Q+KGELV+NYINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA+R  +D L+P  R +    ++T 
Subjt:  MFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-

Query:  ------IEESMVVNTTLPKSSSKGKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHL
              I+ESMVV+ T  KS SK K       H        TLKERQ+K+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+ EKVDDP YCKYHRVI H 
Subjt:  ------IEESMVVNTTLPKSSSKGKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHL

Query:  VEGCFVLKDLILKLAKEGKIELDLDEVAQSNLATIK
        VE CFVLK+LILKLA+E KIELD+DEVAQ+N   I+
Subjt:  VEGCFVLKDLILKLAKEGKIELDLDEVAQSNLATIK

XP_031739134.1 uncharacterized protein LOC116402863 [Cucumis sativus] (e value: 4.6e-79; % identity: 68.09)
Query:  MFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRN-----
        M ELTNTKQRKGE V++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIASR  +D L+P ++K+ +      
Subjt:  MFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRN-----

Query:  --DEETIEESMVVNTTLPKSSSKGK-----RQTNGA--HHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHL
           + T +ESMVVNTT P   SKGK     ++ +G+    LTLKERQ+K+YPFPD+DI DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI H 
Subjt:  --DEETIEESMVVNTTLPKSSSKGK-----RQTNGA--HHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHL

Query:  VEGCFVLKDLILKLAKEGKIELDLDEVAQSNLATI
        VE CFVLK+LIL+LA+E +IELDL+EVAQ+N A +
Subjt:  VEGCFVLKDLILKLAKEGKIELDLDEVAQSNLATI

XP_031742032.1 uncharacterized protein LOC116404025 [Cucumis sativus] (e value: 2.0e-79; % identity: 68.09)
Query:  MFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRN-----
        M ELTNTKQRKGE V++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIASR  +D L+P ++K+ +      
Subjt:  MFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRN-----

Query:  --DEETIEESMVVNTTLPKSSSKGK-----RQTNGA--HHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHL
           + T++ESMVVNTT P   SKGK     ++ +G+    LTLKERQ+K+YPFPD+DI DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI H 
Subjt:  --DEETIEESMVVNTTLPKSSSKGK-----RQTNGA--HHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHL

Query:  VEGCFVLKDLILKLAKEGKIELDLDEVAQSNLATI
        VE CFVLK+LIL+LA+E +IELDL+EVAQ+N A +
Subjt:  VEGCFVLKDLILKLAKEGKIELDLDEVAQSNLATI

XP_031742390.1 uncharacterized protein LOC116401672 [Cucumis sativus] (e value: 4.6e-79; % identity: 68.09)
Query:  MFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRN-----
        M ELTNTKQRKGE V++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIASR  +D L+P ++K+ +      
Subjt:  MFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRN-----

Query:  --DEETIEESMVVNTTLPKSSSKGK-----RQTNGA--HHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHL
           + T +ESMVVNTT P   SKGK     ++ +G+    LTLKERQ+K+YPFPD+DI DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI H 
Subjt:  --DEETIEESMVVNTTLPKSSSKGK-----RQTNGA--HHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHL

Query:  VEGCFVLKDLILKLAKEGKIELDLDEVAQSNLATI
        VE CFVLK+LIL+LA+E +IELDL+EVAQ+N A +
Subjt:  VEGCFVLKDLILKLAKEGKIELDLDEVAQSNLATI

TrEMBL top hits (e value, % identity, alignment)
A0A5A7TST6 Ty3-gypsy retrotransposon protein (e value: 1.9e-78; % identity: 66.53)
Query:  MFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-
        M ELTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSI +RE +D L+P  R +    ++T 
Subjt:  MFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-

Query:  ------IEESMVVNTTLPKSSSK-------GKRQTNGAHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHL
              I+ESMVV+ T  KS SK        K   N     TLKERQ+K+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI H 
Subjt:  ------IEESMVVNTTLPKSSSK-------GKRQTNGAHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHL

Query:  VEGCFVLKDLILKLAKEGKIELDLDEVAQSNLATIK
        VE CFVLK+LILKLA+E KIELD+DEVAQ+N   I+
Subjt:  VEGCFVLKDLILKLAKEGKIELDLDEVAQSNLATIK

A0A5A7TZU9 Ribonuclease H (e value: 9.3e-78; % identity: 66.95)
Query:  MFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-
        M ELT TKQRKGE V++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA+R N DLL+P +RKE +  + T 
Subjt:  MFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-

Query:  ------IEESMVVNTT----LPKSSSKGKRQTNG-AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHLVE
               +E+MVV+TT    + K     KRQ  G     TLKERQ+K+YPFPD+D+PDML+QLLE QLI+LP+CKRP EM +V+DP YCKYHRVI H VE
Subjt:  ------IEESMVVNTT----LPKSSSKGKRQTNG-AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHLVE

Query:  GCFVLKDLILKLAKEGKIELDLDEVAQSNLATI
         CFVLK+LILKLA + KIEL+LD+VAQ+N A +
Subjt:  GCFVLKDLILKLAKEGKIELDLDEVAQSNLATI

A0A5A7URH1 Ty3-gypsy retrotransposon protein (e value: 3.6e-82; % identity: 57.62)
Query:  MFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-
        M ELTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA+R  +D L+P  R +    ++T 
Subjt:  MFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-

Query:  ------IEESMVVNTTLPKSSSKGK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHL
              I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI H 
Subjt:  ------IEESMVVNTTLPKSSSKGK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHL

Query:  VEGCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKNFHKKEKKNLATSYCID
        VE CFVLK+LILKLA+E KIELD+DEVAQ+N A I+  S   + KD   LQ +R         RS     P++++ +    + +  + +  N  +S    
Subjt:  VEGCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKNFHKKEKKNLATSYCID

Query:  VEEVDNSKKSEQRTSVFDRIKPPTTRPS
         +EV+NS +  QRTSVFDRIKP TTR S
Subjt:  VEEVDNSKKSEQRTSVFDRIKPPTTRPS

A0A5A7UXF0 Ty3-gypsy retrotransposon protein (e value: 2.9e-79; % identity: 67.37)
Query:  MFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-
        M ELTNT+Q+KGELV+NYINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA+R  +D L+P  R +    ++T 
Subjt:  MFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-

Query:  ------IEESMVVNTTLPKSSSKGKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHL
              I+ESMVV+ T  KS SK K       H        TLKERQ+K+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+ EKVDDP YCKYHRVI H 
Subjt:  ------IEESMVVNTTLPKSSSKGKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHL

Query:  VEGCFVLKDLILKLAKEGKIELDLDEVAQSNLATIK
        VE CFVLK+LILKLA+E KIELD+DEVAQ+N   I+
Subjt:  VEGCFVLKDLILKLAKEGKIELDLDEVAQSNLATIK

A0A5D3D4X3 Ty3-gypsy retrotransposon protein (e value: 7.1e-78; % identity: 66.95)
Query:  MFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-
        M ELTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA+R  +D L+P  R +    ++T 
Subjt:  MFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-

Query:  ------IEESMVVNTTLPKSSSKGK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHL
              I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI H 
Subjt:  ------IEESMVVNTTLPKSSSKGK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHL

Query:  VEGCFVLKDLILKLAKEGKIELDLDEVAQSNLATIK
        VE CFVLK+LILKLA+E KIELD+DEVAQ+N   I+
Subjt:  VEGCFVLKDLILKLAKEGKIELDLDEVAQSNLATIK

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found
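
In the alignments listed above, the line between each Query and Subjt row is a BLAST-style midline: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below shows how a % identity figure can be derived from such a midline. The function names are hypothetical, and the example covers only the first 15 columns of the first GenBank alignment, so its result is not comparable to the whole-hit values reported in the tables.

```python
def midline_stats(midline):
    """Return (identities, positives, alignment_length) for a BLAST midline.

    The midline must be padded to the full alignment width; trailing
    spaces count as mismatch or gap columns.
    """
    length = len(midline)
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, length

def percent_identity(midline):
    """Identities divided by alignment length, as a percentage."""
    identities, _, length = midline_stats(midline)
    if length == 0:
        raise ValueError("empty midline")
    return round(100.0 * identities / length, 2)

# First 15 alignment columns of the KAA0056121.1 hit (illustration only).
fragment = "M ELTNT+Q+KGELV"
print(midline_stats(fragment))     # (12, 14, 15)
print(percent_identity(fragment))  # 80.0
```

To reproduce a whole-hit value, the midlines of all alignment blocks for that hit would need to be concatenated at their original widths before calling `percent_identity`.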

Sequences
CDS sequence
ATGTTCGAGCTCACAAACACTAAACAACGAAAAGGTGAACTCGTTGTTAACTATATAAATCGCTGGAGAGCTATGAGTCTAGATTGCAAAGATCGCCTCACTGAACTCTC
TTCCGTTGAGATGTGCATTCAAGGCATGCACTGGGAACTCCTCTACATCCTTAAAGGTATAAAGCCTCGCACCTTTGAGGAACTAGCAACTCGTGCCCACGATATGGAGC
TAAGTATTGCTAGTCGAGAAAACCAAGACCTTCTCCTCCCTAATATGAGAAAAGAAGGAAGGAACGACGAAGAGACTATAGAAGAATCTATGGTTGTCAACACAACCCTT
CCCAAGTCGTCTTCGAAAGGAAAGCGACAAACAAATGGAGCGCATCACTTAACTTTAAAGGAAAGACAGAAGAAAATCTATCCTTTCCCTGATGCCGACATCCCTGATAT
GTTGGAACAACTATTGGAAGCGCAACTGATAGAGCTTCCTAAGTGTAAACGACCAGAAGAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTATTG
GTCATCTAGTGGAAGGATGTTTCGTCCTAAAGGACTTAATTTTAAAGCTGGCTAAGGAAGGCAAAATTGAGCTTGACCTTGATGAAGTAGCCCAATCAAATCTTGCTACA
ATCAAAGGAAAGAGCAAACATCAAAGAAAGAAGGATCCTAAGAAACTTCAACCCAAGAGGAAGAGAAGTAAAAAGTTTTCTCAACCTCAACAACTGGTGATGTTGAATAA
ATCGTTCTCCAAAAATTTCCACAAAAAGGAAAAAAAGAACCTTGCAACTTCCTACTGCATCGACGTAGAAGAAGTTGACAATTCCAAGAAGAGTGAACAAAGGACTTCCG
TCTTTGATCGCATCAAGCCTCCAACTACTCGTCCTTCGTTCCTTCTCTCCAAGTTCGAGGGTCCTTACACTGTACGCTATTGCGTTGTTCCTTCTCCAAGTTCGAAGGTT
CTTCGTTGTATCCTGCTGCGTTGTTCCTTCTCCAAGTTCGAGGGTTCTCAGTTGTACAACTGCTACGTTGTTCCTCCTCCAAGTGCGAATGATCTTATGTGGTGCGTTGT
TGCATTGTTCCCTCTTCTCTCAAGTTCGATGGTTCTCACGCAGCTTTGCTGGAGTTTCTTCTCCCCAAGTTCGAAGGTTTTCACGCCCTCCGTTGTAGTTCCTTCTTTCC
AAGGTCGAAGGTTCTCACTCGCTGCGTTGCAGTTCTTTCTCCCCAAGTTCGAAGGTTCACACACTTCGCTCCAGTTCCTTCTCCCAAATTCGAAGGTTCTCATGCGCTTC
GCGCTGCAGTTCCTTCCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTGCGATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCTTTCCCCCA
AGTTCGAAGGTTCTCAGGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCACTTCGCTGCATTTCCTTCCCCCCAAATTCGAAGGTTCTCACGCGCTTC
GCTGCGGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGCACTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGTCGCTTC
GCTGCGCTCACGCGTTTCGCTGCAGTTCCTTCCTCCAATTTTGAAGGTTCTCACATCGCTTCGCTTCGCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCT
TCTCTCCGTTGCTACTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGTGCTTC
TCTCCACCCCTCTTTTTGAAGGTTCGCCACTGAGGTTCTCCTTCTCCAAGTTCGAAGGTTCACCGTTGCTCCTTTTCAAATGTTTGGCGGAGGTTGACGTCCTCGTTCCG
CTTCATCTTCAAATGTTGGTAGTTGACGGCGTCTGCTGCGCTTCATCTTCAAATGTTGGCAGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAGTCACT
GCAATTGAATCTGATGACGACCGTTGAAGGCGAGTCGGGTCTGGTGACCACCCCTGCAGGTTACTCAAATCACCCAATAAAATGGGGACTGGGTTTAGCAGGAGTGCATG
AGGCGAATCTGGTGACTACCCCTGCAGGTTACTCAGATCACCCAATAAAATGGGGACTGGGTCTAGCAGGAGTGCATGAAGGCGAATCTGGTGACTACCCCTGCAGGTTA
CTCAGATCACCCAATGAAATAGGGGACTGGTCTAGCAGGAGTGATATCACTGCAAGCGAATTTGATCATCAAGCCAACAGGCCGATCCAAGAGATCAACAAGCCAACCGA
CCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGGATCA
ACAAGCTAACAAGCCGATCCAACAGATCATCAAGCCAACAGGCTGATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCC
AAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAGGAAGATCAACAAGCCAACAAGCCGATCCAAGAGATCATCACGCCAAC
AGGCCGATCATCCAAGAAGATCAACAAGCCCAATAGGTCGATCCAGGAGATCATCAACCTAACAGGCCGATCATCCAAGAAGATCAACAAGTCCAATAGGTCGATCCAGG
AGATCATCAACCTAACAGACCGATCATCCAAGAAGATCAACAAGCCAATAAGCCGATCCAAGAGATCATCAAGCCAACAGGCCGATCCAAGAGATCATCAACCTAGCAAG
TTGATCATCTAA
mRNA sequence
ATGTTCGAGCTCACAAACACTAAACAACGAAAAGGTGAACTCGTTGTTAACTATATAAATCGCTGGAGAGCTATGAGTCTAGATTGCAAAGATCGCCTCACTGAACTCTC
TTCCGTTGAGATGTGCATTCAAGGCATGCACTGGGAACTCCTCTACATCCTTAAAGGTATAAAGCCTCGCACCTTTGAGGAACTAGCAACTCGTGCCCACGATATGGAGC
TAAGTATTGCTAGTCGAGAAAACCAAGACCTTCTCCTCCCTAATATGAGAAAAGAAGGAAGGAACGACGAAGAGACTATAGAAGAATCTATGGTTGTCAACACAACCCTT
CCCAAGTCGTCTTCGAAAGGAAAGCGACAAACAAATGGAGCGCATCACTTAACTTTAAAGGAAAGACAGAAGAAAATCTATCCTTTCCCTGATGCCGACATCCCTGATAT
GTTGGAACAACTATTGGAAGCGCAACTGATAGAGCTTCCTAAGTGTAAACGACCAGAAGAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTATTG
GTCATCTAGTGGAAGGATGTTTCGTCCTAAAGGACTTAATTTTAAAGCTGGCTAAGGAAGGCAAAATTGAGCTTGACCTTGATGAAGTAGCCCAATCAAATCTTGCTACA
ATCAAAGGAAAGAGCAAACATCAAAGAAAGAAGGATCCTAAGAAACTTCAACCCAAGAGGAAGAGAAGTAAAAAGTTTTCTCAACCTCAACAACTGGTGATGTTGAATAA
ATCGTTCTCCAAAAATTTCCACAAAAAGGAAAAAAAGAACCTTGCAACTTCCTACTGCATCGACGTAGAAGAAGTTGACAATTCCAAGAAGAGTGAACAAAGGACTTCCG
TCTTTGATCGCATCAAGCCTCCAACTACTCGTCCTTCGTTCCTTCTCTCCAAGTTCGAGGGTCCTTACACTGTACGCTATTGCGTTGTTCCTTCTCCAAGTTCGAAGGTT
CTTCGTTGTATCCTGCTGCGTTGTTCCTTCTCCAAGTTCGAGGGTTCTCAGTTGTACAACTGCTACGTTGTTCCTCCTCCAAGTGCGAATGATCTTATGTGGTGCGTTGT
TGCATTGTTCCCTCTTCTCTCAAGTTCGATGGTTCTCACGCAGCTTTGCTGGAGTTTCTTCTCCCCAAGTTCGAAGGTTTTCACGCCCTCCGTTGTAGTTCCTTCTTTCC
AAGGTCGAAGGTTCTCACTCGCTGCGTTGCAGTTCTTTCTCCCCAAGTTCGAAGGTTCACACACTTCGCTCCAGTTCCTTCTCCCAAATTCGAAGGTTCTCATGCGCTTC
GCGCTGCAGTTCCTTCCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTGCGATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCTTTCCCCCA
AGTTCGAAGGTTCTCAGGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCACTTCGCTGCATTTCCTTCCCCCCAAATTCGAAGGTTCTCACGCGCTTC
GCTGCGGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGCACTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGTCGCTTC
GCTGCGCTCACGCGTTTCGCTGCAGTTCCTTCCTCCAATTTTGAAGGTTCTCACATCGCTTCGCTTCGCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCT
TCTCTCCGTTGCTACTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGTGCTTC
TCTCCACCCCTCTTTTTGAAGGTTCGCCACTGAGGTTCTCCTTCTCCAAGTTCGAAGGTTCACCGTTGCTCCTTTTCAAATGTTTGGCGGAGGTTGACGTCCTCGTTCCG
CTTCATCTTCAAATGTTGGTAGTTGACGGCGTCTGCTGCGCTTCATCTTCAAATGTTGGCAGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAGTCACT
GCAATTGAATCTGATGACGACCGTTGAAGGCGAGTCGGGTCTGGTGACCACCCCTGCAGGTTACTCAAATCACCCAATAAAATGGGGACTGGGTTTAGCAGGAGTGCATG
AGGCGAATCTGGTGACTACCCCTGCAGGTTACTCAGATCACCCAATAAAATGGGGACTGGGTCTAGCAGGAGTGCATGAAGGCGAATCTGGTGACTACCCCTGCAGGTTA
CTCAGATCACCCAATGAAATAGGGGACTGGTCTAGCAGGAGTGATATCACTGCAAGCGAATTTGATCATCAAGCCAACAGGCCGATCCAAGAGATCAACAAGCCAACCGA
CCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGGATCA
ACAAGCTAACAAGCCGATCCAACAGATCATCAAGCCAACAGGCTGATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCC
AAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAGGAAGATCAACAAGCCAACAAGCCGATCCAAGAGATCATCACGCCAAC
AGGCCGATCATCCAAGAAGATCAACAAGCCCAATAGGTCGATCCAGGAGATCATCAACCTAACAGGCCGATCATCCAAGAAGATCAACAAGTCCAATAGGTCGATCCAGG
AGATCATCAACCTAACAGACCGATCATCCAAGAAGATCAACAAGCCAATAAGCCGATCCAAGAGATCATCAAGCCAACAGGCCGATCCAAGAGATCATCAACCTAGCAAG
TTGATCATCTAA
Protein sequence
MFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEETIEESMVVNTTL
PKSSSKGKRQTNGAHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHLVEGCFVLKDLILKLAKEGKIELDLDEVAQSNLAT
IKGKSKHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKNFHKKEKKNLATSYCIDVEEVDNSKKSEQRTSVFDRIKPPTTRPSFLLSKFEGPYTVRYCVVPSPSSKV
LRCILLRCSFSKFEGSQLYNCYVVPPPSANDLMWCVVALFPLLSSSMVLTQLCWSFFSPSSKVFTPSVVVPSFQGRRFSLAALQFFLPKFEGSHTSLQFLLPNSKVLMRF
ALQFLPSKFEGSHIASLRSFLQVRRFSRASLCNSFPQVRRFSGASCSSFLQIRRFSRTSLHFLPPKFEGSHALRCGSFPPSSKVLTHFAAVPSSKFEVPSPQVRRFSRRF
AALTRFAAVPSSNFEGSHIASLRSRASLAPSPSSKALLSVATSPSSKALLSTAPSPSSKALLSTAPSPSSKVLLSTPLFEGSPLRFSFSKFEGSPLLLFKCLAEVDVLVP
LHLQMLVVDGVCCASSSNVGRNYSHQSDWSRQVVKSLQLNLMTTVEGESGLVTTPAGYSNHPIKWGLGLAGVHEANLVTTPAGYSDHPIKWGLGLAGVHEGESGDYPCRL
LRSPNEIGDWSSRSDITASEFDHQANRPIQEINKPTDRSRRSTSQQADHPRDQQANRPIKKINKSAGRSSKRINKLTSRSNRSSSQQADPRDQQANRPIKKINKSAGRSS
KRSTSQPTDQEDQQVSRPIIQEDQQANKPIQEIITPTGRSSKKINKPNRSIQEIINLTGRSSKKINKSNRSIQEIINLTDRSSKKINKPISRSKRSSSQQADPRDHQPSK
LII
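
The protein sequence above can be checked against the CDS by translating it codon by codon. The following is a dependency-free sketch that assumes the standard genetic code (NCBI translation table 1) applies to this gene. The names `CODON_TABLE` and `translate` are hypothetical and not part of any CuGenDB tooling.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard genetic code).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS to protein, dropping one trailing stop codon.

    Whitespace and line breaks are removed first, so the wrapped
    sequence can be pasted as-is. Ambiguous bases such as N are not
    handled and raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = [CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3)]
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)

# First seven and last four codons of the CDS shown above.
print(translate("ATGTTCGAGCTCACAAACACT"))  # MFELTNT
print(translate("TTGATCATCTAA"))           # LII
```

Both fragments agree with the start (`MFELTNT`) and end (`LII`) of the listed protein sequence.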